Lead Site Reliability Engineer
Company
EPAM Systems
Location
Ogre, Latvia
Type
Full Time
Job Description
Join our dynamic team as a Lead Site Reliability Engineer! If you have a substantial background in software and systems engineering and a focus on reliability and scalability in cloud environments, your expertise is needed in managing and communicating with IoT devices via our platform. You will have a critical role in duties such as device registration and connection, bi-directional messaging between devices and the cloud, device state tracking and data storage, issuing alerts and notifications for device state changes, and integrating other cloud services like Device Registry and Firmware Upgrade.
This position offers a hybrid setup with the flexibility to work from any location in Latvia, whether it's your home or our office in Riga.
Responsibilities
- Design, implement, and maintain highly scalable and available systems across Azure cloud architectures
- Regularly test and implement disaster recovery (DR) plans
- Configure and enhance monitoring and alerting processes using Prometheus, Grafana, and OpsGenie
- Develop dashboards to visualize system performance and reliability metrics
- Use Terraform for infrastructure provisioning and management
- Support the development team in ongoing projects
- Communicate with the customer's DevOps team to discuss requirements and collaborate on implementations
- Enhance release management and CI/CD processes
- Improve system security based on security team recommendations
- Document system support processes, and design, write, and test runbooks for operational tasks and incident response
Requirements
- Minimum 5 years of experience as a DevOps engineer or SRE
- Proven experience with Azure cloud architectures
- Proficiency in Kubernetes and Docker/Linux services
- Familiarity with monitoring tools: Prometheus, Grafana, OpsGenie
- Experience with .NET Core and ASP.NET Core applications
- Strong knowledge of Cosmos DB (both Mongo API & SQL API) and MS SQL Server
- Expertise in Terraform
- Experience with CI/CD tools and Azure Networking concepts
- Excellent communication skills and the ability to manage tasks and projects independently
- Experience with Azure IoT Hub and EventHub is an added advantage
We Offer
- Engineering Heritage: Best-in-class experts sharing a culture of engineering excellence and tackling complex engineering challenges for over 30 years
- Advanced Tech Stack: Innovative projects where you can apply or enhance your expertise in Cloud, Data, AI, and other emerging technologies
- World-Class Clients: Work closely with 295+ of the Forbes Global 2000 on creating disruptive solutions that make a global impact
- Professional Growth: Exceptional support for career development with comprehensive resources for upskilling or reskilling in pioneering practices
- GenAI Community: Strong AI competencies with 600+ experts across 55+ locations driving GenAI-enabled transformation journeys
- Entrepreneurial Culture: If you're passionate and dedicated to improving business transformation, we provide the support you need to bring your ideas to life
- Hybrid Setup: The flexibility to work from any location in Latvia, whether it's your home or our office in Riga
- Other Benefits: Additional vacation and trust days, private health insurance, Employee Stock Purchase Plan and more
About EPAM
EPAM is a leading global provider of digital platform engineering and development services. For over 30 years, our team has helped leading brands navigate the waves of digital transformation, building solutions that help them stay competitive through constant market disruption.
With offices in 55+ countries, EPAM has grown in Latvia to over 150 talented innovators in 3 years. We foster creativity and unconventional ways of doing things, welcoming like-minded professionals to join us.
Date Posted
01/21/2025