Intermediate Site Reliability Engineer
Company
GitLab
Location
USA
Type
Full Time
Job Description
An overview of this role
The GitLab DevSecOps platform empowers 100000+ organizations to deliver software faster and more efficiently. We are one of the world’s largest all-remote companies with 2000+ team members and values that foster a culture where people embrace the belief that everyone can contribute. Learn more about Life at GitLab.
SREs with Gitaly work alongside Backend Engineers with a focus primarily on improving the availability and the reliability of the Gitaly fleet on GitLab.com. While the backend engineers approach their responsibilities from a software developer point of view the SREs approach the same problems from the operational perspective and collaborate closely on finding an optimal solution in addition to ensuring that new Gitaly features can run at scale and deployed to production safely.
Gitaly is the Git data storage tier of GitLab providing a reliable secure and fast distributed Git data store over gRPC. For more information about Gitaly see the team’s Direction page.
Gitaly’s high-availability storage requires developers who understand distributed storage systems their management observability and availability. Cluster team contributes features fixes bugs and improves performance of this software stack.
Currently we're building a new distributed cluster solution and improvements to our Disaster Recovery readiness.
What you’ll do
-
Work with peer SREs to maintain Gitaly’s environments within GitLab’s SaaS offerings including cost and performance optimization capacity planning migrations and debugging production issues.
-
Participate in architectural discussions and decisions surrounding Gitaly within the greater GitLab ecosystem.
-
Design RPC interfaces for the Gitaly service.
-
Scope estimate and describe tasks to reach the team’s goals.
-
Develop production automation and tooling for Gitaly for use both in SaaS and self-managed installations.
-
Help ensure that Gitaly development tooling releases and other processes serve the team and the product’s goals.
-
Develop Gitaly in accordance with the product’s goals and a focus on reliability and maintainability.
-
Instrument monitor and profile Gitaly in the production environment.
-
Build dashboards and alerts to monitor the health of your services.
-
Conduct acceptance testing of the features you’ve built.
-
Educate all team members on best practices relating to high availability.
-
Write performant maintainable and elegant code and peer review others’ code.
-
Be positive and solution-oriented.
-
Constantly improve the quality & security of the product.
-
Take initiative in improving the software in small or large ways to address pain points in your own experience as a developer.
-
Qualify developers for hiring.
-
Respond to user emergencies platform alerts and support requests including regular on-call duties.
What you’ll bring
-
Mandatory: experience running highly-available systems in production environments at scale.
-
Mandatory: hands-on experience with Cloud technologies including Kubernetes.
-
Mandatory: proven professional experience building debugging optimizing software in large-scale high-volume environments.
-
Mandatory: proven professional experience writing and testing high-quality code.
-
Mandatory: a good understanding of building instrumented observable software systems.
-
Highly desirable: Experience with Terraform infrastructure as code.
-
Highly desirable: proven professional experience writing and testing quality code in Go.
-
Highly desirable: a good understanding of git’s internal data structures or experience running git servers.
-
Highly desirable: experience with gRPC.
-
Highly desirable: willingness to learn Ruby.
About the team
The Gitaly team owns and runs services that handle all Git operations on GitLab.com one of the largest open source SaaS sites on the Internet. This means we are constantly faced with solving unique performance scalability and cost challenges that impact our users every day. Our future is about shipping improvements that can scale both GitLab.com from an infrastructure perspective as well as deploying new features that will scale with the growing size of repositories across the industry.
Date Posted
10/04/2024
Views
0
Similar Jobs
Developer II - Eventbrite, Inc.
Views in the last 30 days - 0
Eventbrite is seeking a Web Application Developer to join their highperforming GTM Gotomarket Business Systems team The role involves implementing and...
View DetailsMobile Engineering Manager - Mobile Retention - Dropbox
Views in the last 30 days - 0
Dropbox is seeking a Mobile Engineering Manager to lead a team of iOS and Android engineers working on the Dropbox apps The role involves managing cri...
View DetailsSolution Engineer - Ottimate
Views in the last 30 days - 0
Ottimate is a company that automates accounts payables for fastgrowing businesses They offer a cloudfirst approach for invoice management and payments...
View DetailsSr. Front End Engineer - ScienceLogic
Views in the last 30 days - 0
ScienceLogic is seeking a FrontEnd React with TypeScript developer to join their team The role involves building intuitive user interfaces for their A...
View DetailsExecutive Assistant - Renaissance
Views in the last 30 days - 0
Renaissance is a global leader in preK12 education technology offering solutions that help educators create personalized learning paths for students T...
View DetailsStaff Machine Learning Engineer - Twilio
Views in the last 30 days - 0
Twilio is seeking a Staff Machine Learning Engineer with a strong background in Data Science and Machine Learning to join their Efficiency Engineering...
View Details