Principal Site Reliability Engineer (Cloud Services - AI Infrastructure)

Palo Alto Networks Santa Clara, CA

Company

Palo Alto Networks

Location

Santa Clara, CA

Type

Full Time

Job Description

Your Career

We are looking for an exceptional Principal Site Reliability Engineer to enhance our ATP Infra team. This role will work on producing mission-critical platforms, tools, and processes that will ensure the highest levels of availability and reliability of all our applications. We need creative and innovative problem solvers who can partner with our developers and researchers to make the services more usable. The ideal candidate will possess a deep understanding of cloud infrastructure, particularly within the Google Cloud Platform (GCP), and have a proactive approach to exploring new tools/frameworks to elevate our infrastructure automation, stability and scalability.

Your Impact

Want more jobs like this?

Get jobs in Santa Clara, CA delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

  • Write automation code for provisioning and operating infrastructure at massive scale
  • Design, build and operate Cloud infrastructure to enable reliable and rapid deployment of microservices with effective monitoring and resilient operations
  • Work with development teams to make sure the applications are production ready, scalable and reliable from the grounds up
  • Identify and drive opportunities to improve automation for code deployment, management, and visibility of application services
  • Develop tools and framework to automate operational tasks, deployment of machines, services, applications
  • Establish end-to-end monitoring and alerting on all critical components of the application
  • Participate in the on-call rotation supporting the platform and or the production application
  • Directs root cause analysis of critical business and production issues
  • Develop and mentor other SREs on standard methodology from Infra orchestration and troubleshooting application service in production
  • Represent SRE in design reviews and work cross-functionally with Engineering teams on operational readiness

Your Experience

  • BS or MS in Computer Science, a related field, or equivalent professional experience or equivalent military experience required
  • Expertise in configuration management with a framework such as Terraform, Ansible, and Helm
  • Strong experience with Kubernetes
  • Strong Linux administration, internals, and network troubleshooting
  • Expertise in Google cloud computing (GCP) and resource management/operations on its related services
  • Proficiency with a programming language like Python and shell scripting to automate tasks
  • Strong experience with CI/CD pipeline, GitHub, Jenkins, Artifactory
  • Strong experience with metrics and monitoring tools such as Grafana and Prometheus
  • Ability to diagnose and troubleshoot complex distributed systems handling high volume transactions
  • Strong fundamentals in API gateway including Nginx or Envoy
  • Experience with cloud infrastructure and their performance & cost optimizations
  • Experience with AWS is a big plus
  • Excellent interpersonal skills and the ability to work well in a team
  • Passionate to learn, understand, and dissect new technology stack quickly on own
  • Have experience on building and managing large relational database cluster (MySQL/Percona etc.) will be a plus

The Team

Our engineering team is at the core of our products - connected directly to the mission of preventing cyberattacks. We are constantly innovating - challenging the way we, and the industry, think about cybersecurity. Our engineers don't shy away from building products to solve problems no one has pursued before.

We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.and downtime.

Compensation Disclosure

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $147000 - $237500/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

#LI-TD1

Our Commitment

We're problem solvers that take risks and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at [email protected].

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Apply Now

Date Posted

01/13/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Positive
Subjectivity Score: 0.9

Similar Jobs

Software Engineer, Data Platform (Lead) - Benchling

Views in the last 30 days - 0

Benchling a leading biotechnology company is seeking a Senior Software Engineer to design and implement scalable multitenant services and APIs The rol...

View Details

Senior Software Engineer, Optics - Red 6

Views in the last 30 days - 0

Red 6 is an innovative AR technology startup specializing in synthetic air combat training The company is seeking a core technology team member with a...

View Details

Senior Product Manager, Dev Solutions - Atlassian

Views in the last 30 days - 0

Atlassian offers a remote position for a Product Manager in the Dev Solutions team The role involves collaborating with crossfunctional teams to lead ...

View Details

Treasury Management Officer - Technology and Disruptive Commerce - JPMorganChase

Views in the last 30 days - 0

The job posting is for a Treasury Management Officer in Commercial Banking The role involves generating new treasury management business maintaining c...

View Details

Relationship Executive, Middle Market Banking - Executive Director - JPMorganChase

Views in the last 30 days - 0

The job description is for a Relationship Executive role in the Middle Market Banking team The role involves building and retaining profitable relatio...

View Details

Senior Account Sales Representative - Spectrum

Views in the last 30 days - 0

The job involves selling products and services to customers in assigned nonbulk multidwelling units through doortodoor solicitation lobby events and b...

View Details