Incident Lead - DevOps

NucleusTeq Other US Location

Company

NucleusTeq

Location

Other US Location

Type

Full Time

Job Description

An Incident Lead will be responsible for managing the day-to-day operations, ensuring platform reliability, and overseeing incident management and resolution processes. You will collaborate closely with engineering, product, and infrastructure teams to ensure the smooth functioning of systems and provide a high level of operational support to meet business goals.

Key Skills: Docker, Kubernetes, CI/CD, Azure DevOps or AWS Code Pipeline

Job responsibilities

  • Lead and mentor the L1/L2 operations teams, ensuring a high level of technical support and service quality.
  • Lead incident resolution processes for L1 and L2 operations, ensuring timely and effective troubleshooting of technical issues.
  • Define and implement procedures for handling escalations and high-priority incidents.
  • Ensure root cause analysis is conducted for major incidents and follow up on remediation actions.
  • Develop and enforce Service Level Agreements (SLAs) and Key Performance Indicators (KPIs) for platform performance and support operations.
  • Monitor adherence to SLAs and manage escalations to maintain customer satisfaction.
  • Oversee the platform's operational stability and performance, ensuring high availability and scalability.
  • Monitor and manage platform performance metrics, proactively addressing any potential issues.
  • Ensure comprehensive documentation of operational procedures, troubleshooting guides, and runbooks for the L1/L2 support teams.
  • Create detailed operational reports and dashboards for tracking system health and team performance.

Qualifications:

  • Bachelor's degree in computer science, Information Technology, or a related field.
  • 10+ years of experience in IT operations, with at least 3 years in a leadership role managing platform support and L1/L2 teams.
  • Strong understanding of IT infrastructure, cloud platforms, and operational best practices.
  • Strong experience on Docker, Kubernetes & Helm along with any programming language (Java preferred) experience to support platform KLO & monitoring.
  • ITIL certification will be highly preferred.
  • Extensive experience with implementing and managing CI/CD pipelines using tools like Jenkins, GitLab CI/CD, GitHub Actions, Azure DevOps or AWS Code Pipeline.
  • Tools: Any IDE, Git, Jenkins
  • Proven experience with incident management, service management, and driving process improvements.
  • Expertise in monitoring tools, automation frameworks, and platform performance optimization.
  • Excellent leadership, communication, and problem-solving skills.
Apply Now

Date Posted

11/28/2024

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Senior Engineering Manager, Micros Foundations - Atlassian

Views in the last 30 days - 0

Atlassian is seeking a Senior Engineering Manager to lead a team of Backend Software Engineers The role involves guiding technical decisions prioritiz...

View Details

Development Underwriter - Simply Business

Views in the last 30 days - 0

Simply Business is seeking a Development Underwriter with an Underwriting background to support their new MGA brand Nupro which aims to disrupt the sm...

View Details

E2E Solution Architect - Ahold Delhaize USA

Views in the last 30 days - 0

Ahold Delhaize USA is seeking a Solution Architect with extensive experience in IT architecture BigData Analytics and various software designs and dev...

View Details

E2E Solution Architect - Ahold Delhaize USA

Views in the last 30 days - 0

Ahold Delhaize USA is seeking a Solution Architect with extensive experience in IT architecture BigData Analytics and various software designs and dev...

View Details

E2E Solution Architect - Ahold Delhaize USA

Views in the last 30 days - 0

Ahold Delhaize USA a division of a global food retailer is seeking a Solution Architect for its US operations The role involves translating business r...

View Details

Senior Product Analyst - FinCrime Platform - WISE

Views in the last 30 days - 0

Wise is seeking a Senior Product Analyst for its FinCrime Platform The role involves driving analytics efforts in the Financial Crime Platform product...

View Details