Lead Site Reliability Engineer
Company
Factset
Location
Remote
Type
Full Time
Job Description
We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our growing team. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our software systems and infrastructure. The ideal candidate possesses a strong background in coding, automation, and system administration, combined with a passion for continuously improving system reliability.
Responsibilities:
- Collaborate with development, operations, and product teams to define, review, and implement reliability standards and best practices.
- Design, implement, and maintain highly available and scalable architectures for our applications and infrastructure.
- Develop and enhance automated tools and frameworks to optimize system monitoring, deployment, and recovery.
- Troubleshoot and resolve complex issues throughout the entire software stack, including networking, databases, and distributed systems.
- Conduct performance analysis and capacity planning to ensure system scalability and resource optimization.
- Take a proactive approach to continuously improving reliability.
- Participate in incident response, root cause analysis, and postmortem activities to identify and rectify system failures.
- Collaborate with cross-functional teams to implement and improve CI/CD pipelines, ensuring reliable and efficient software releases.
- Stay up-to-date with emerging technologies and industry trends, actively contributing to ongoing system improvements.
- Participate in on-call rotation.
Requirements:
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
- Proven experience deploying and managing large-scale distributed systems successfully.
- Understanding of SRE concepts (error budgets, SLIs/SLOs, blameless postmortems)
- Proficiency in programming languages such as Python, C++, or Go
- Familiarity with monitoring and observability tools.
- Excellent problem-solving skills and ability to troubleshoot complex issues efficiently.
- Strong organizational and communication skills, with the ability to collaborate effectively in a cross-functional team environment.
Desirable Qualifications:
- Familiarity with security best practices and experience implementing security measures in a production environment.
- Experience with modern infrastructure technologies and tools, including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), and orchestration (Ansible, Chef, Puppet).
- Solid understanding of networking protocols and technologies (TCP/IP, DNS, load balancing).
- Demonstrated experience with infrastructure as code (IaC) and automation tools (e.g., Terraform, GitHub Actions).
Join our team and contribute to creating and maintaining a highly reliable and performant infrastructure that supports our growing platform. Help shape the future of our systems architecture while working in a collaborative and innovative environment.
Date Posted
12/11/2024
Views
0
Similar Jobs
Director, Product, Customer, and Lifecycle Marketing - Garner Health
Views in the last 30 days - 0
Garner Health is seeking an experienced Product Marketing Leader to join their team The ideal candidate will lead the product marketing efforts focusi...
View DetailsLinux Support Engineer - Voltage Park
Views in the last 30 days - 0
Voltage Park is seeking a Linux Support Engineer for a fulltime remote position The ideal candidate will have command line level Linux sys administrat...
View DetailsDirector, Product (Remote) - Dscout
Views in the last 30 days - 0
Dscout is a leading company in experience research technology offering a platform for major companies to gain insights into user needs and behaviors T...
View DetailsTechnical Architect - CDW
Views in the last 30 days - 0
CDW offers a rewarding career opportunity for a Technical Architect with expertise in ServiceNow The role involves delighting customers by collaborati...
View DetailsSales Development Representative (Remote) - Dscout
Views in the last 30 days - 0
Dscout is a leading company in experience research technology offering a platform for businesses to gain insights into user needs and behaviors They a...
View DetailsFederal Security Solutions Engineer - Rapid7
Views in the last 30 days - 0
Rapid7 is seeking a Federal Solutions Engineer with 5 years of experience in cybersecurity solutions engineering or technical sales focusing on federa...
View Details