Senior Observability & Monitoring Engineer (remote)
Company
First American Title
Location
Orange County
Type
Full Time
Job Description
Who We AreJoin a team that puts its People First! Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and empowered to be innovative and reach their full potential. Our inclusive, people-first culture has earned our company numerous accolades, including being named to the Fortune 100 Best Companies to Work For® list for nine consecutive years. We have also earned awards as a best place to work for women, diversity and LGBTQ+ employees, and have been included on more than 50 regional best places to work lists. First American will always strive to be a great place to work, for all. For more information, please visit www.careers.firstam.com.
What We DoWe are looking for a Senior Observability & Monitoring Engineer to develop and manage enterprise observability infrastructure and monitoring tools such as Elasticsearch Observability, Terraform, Azure DevOps, and AWS Native tools and constructs. This candidate will also provide guidance on how to best use these tools. Must have strong communication and technical skills.Â
What You’ll Do:Â
- Build solutions to provide monitoring patterns for various in-house and off-the-shelf applications across the company.Â
- Measure and monitor all production systems with an eye toward availability, latency, and overall system health.Â
- Engage with application teams to improve and evolve systems by lobbying for changes that enhance reliability, resilience, and observability.Â
- Contribute to continuous improvement initiatives for the team and customers, with a goal of providing automation and enhancing client service, efficiency, and profitability.Â
- Fine-tune existing tools, or research, develop, and implement new tools, to deliver additional monitoring capabilities.Â
- Work on complex problems where analysis of situations or data requires an in-depth evaluation of multiple factors.Â
Â
What You’ll Bring:Â
- Proactive approach, designing telemetry strategies, implementing comprehensive monitoring systems, and leveraging advanced tools to gain real-time insights and identify potential issues before they escalate.Â
- Possess in-depth knowledge and expertise in telemetry data collection, analysis, and implementation, fully understanding the intricacies of and how to derive meaningful insights from different telemetry sources such as:Â Metrics, Events, Logs, Traces.Â
- Expertise in identifying patterns, detecting anomalies, and building a holistic understanding of system behavior beyond traditional monitoring approaches' current limitations.Â
- Experience in software engineering, software development, and/or system operations.Â
- Experience with APM and Observability using tools such as ELK Stack, AWS CloudWatch, Azure Monitor, New Relic, Splunk, Prometheus, Grafana, Sentry, etc.Â
- Extensive understanding of the complexities native to modern distributed systemsÂ
- Well-versed in the challenges posed by microservices architectures, cloud-native environments, and hybrid infrastructure setups.
- Proven ability to lead complex initiatives/projects from inception to completion.Â
- Ability to perform analysis on metrics & logs, using problem-solving techniques to provide guidance on monitoring, alerting, dashboarding and visualization.
- Ability to work with a high level of autonomy and with a globally distributed team.Â
- Excellent communication skills, both verbal and written; able to explain complex technical topics to both internal and external stakeholders with ease and in remote/distributed environments.Â
Â
Preferred Qualifications:Â
- Hands-on experience with Elasticsearch, including deployment and management of the Elastic Stack, Beats and/or Fleet Agents, APM, Dashboarding, and Reporting.Â
- Hands-on experience with DevOps practices, including using GIT & Developing CI/CD Pipelines.Â
- Hands-on experience with Infrastructure as Code (Terraform preferred)Â
- Hands-on experience with Monitoring & Log Aggregation technologiesÂ
- Hands-on experience with cloud infrastructure such as AWS, Azure, or Oracle Cloud Infrastructure.Â
- Opinions about dashboards, metrics, and SLO’sÂ
- Strong knowledge of cloud design patterns for observability monitoring, resiliency, etc.Â
- Ability to understand and write code to perform various tasks related to automation & monitoring.Â
Pay Range: $156,000 - $176,000 AnnuallyÂ
Â
This hiring range is a reasonable estimate of the base pay range for this position at the time of posting. Pay is based on a number of factors which may include job-related knowledge, skills, experience, business requirements and geographic location.Â
What We OfferBy choice, we don’t simply accept individuality – we embrace it, we support it, and we thrive on it! Our People First Culture celebrates diversity, equity and inclusion not simply because it’s the right thing to do, but also because it’s the key to our success. We are proud to foster an authentic and inclusive workplace For All. You are free and encouraged to bring your entire, unique self to work. First American is an equal opportunity employer in every sense of the term.Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan.
Date Posted
12/13/2024
Views
0
Similar Jobs
Quality Engineer, RM & Pre-Production - ARC'TERYX
Views in the last 30 days - 0
Arcteryx is seeking a Quality Engineer with 3 years of experience in manufacturing preferably in the apparel industry The role involves developing and...
View DetailsQuality Engineer (Internal Assignment / Project Hire) - The Walt Disney Company
Views in the last 30 days - 0
The job posting is for a Quality Engineer position in Worldwide Safety Assurance Disneyland Resort Quality Engineering team The role involves providin...
View DetailsMission Systems Engineer - Maxar Technologies
Views in the last 30 days - 0
Maxar Intelligence is currently hiring for a Mission Systems Engineer in Westminster CO The role involves collaborating with experts to explore remote...
View DetailsSpacecraft Systems Engineer - Maxar Technologies
Views in the last 30 days - 0
Maxar Intelligence is seeking a Spacecraft System Engineering Team member with a Bachelors degree in engineering physics or a related field and 510 ye...
View DetailsLead AIT Systems Engineer - Maxar Technologies
Views in the last 30 days - 0
Maxar Intelligence is currently hiring for a Lead AIT Systems Engineer in Westminster CO The role involves managing a team ensuring performance from c...
View DetailsSr RF Engineer - Universal Electronics
Views in the last 30 days - 0
Universal Electronics is hiring a Sr RF Engineer to lead the design and optimization of advanced RF solutions for IoT and smart home products The role...
View Details