GenAI Engineer - GPU Resource Management

Publicis Groupe • New York City, NY

Company

Publicis Groupe

Location

New York City, NY

Type

Full Time

Job Description

Company Description

Publicis Sapient is a digital transformation partner helping established organizations get to their future, digitally-enabled state, both in the way they work and the way they serve their customers. We help unlock value through a start-up mindset and modern methods, fusing strategy, consulting and customer experience with agile engineering and problem-solving creativity. United by our core values and our purpose of helping people thrive in the brave pursuit of next, our 20,000+ people in 53 offices around the world combine experience across technology, data sciences, consulting and customer obsession to accelerate our clients’ businesses through designing the products and services their customers truly value.

Job Description

The Platform Engineer – AI & GPU Services will be responsible for implementing and maintaining AI/ML platforms and GPU resource management across cloud (GCP) and on-premise infrastructure. This role combines expertise in cloud services, AI/ML technologies, and infrastructure automation to support both product engineering and platform engineering functions. The ideal candidate will have experience working with generative AI services, GPU management, and container orchestration platforms.

Responsibilities:

• Architect, build, and maintain AI/ML platforms using Google Cloud Platform (GCP) services like Compute, Storage, IAM, and VPC.
• Manage NVIDIA GPU resources across projects using Run.ai or similar tools.
• Develop and maintain MLOps pipelines on platforms like Vertex AI, supporting AI/ML model training and deployment.
• Write Python scripts for model development, automation, and infrastructure management.
• Use Terraform for Infrastructure as Code (IaC) to automate provisioning and deployment of cloud resources.
• Deploy and manage AI/ML models on container orchestration platforms such as OpenShift and GKE.
• Collaborate with AI teams to facilitate LLM deployment (e.g., Llama, Mistral) and GPU utilization.
• Automate and enhance CI/CD pipelines for seamless integration and deployment of services.
• Monitor performance and capacity with Prometheus, Grafana, and other observability tools to ensure system stability.
• Engage in DevOps practices, including containerization, orchestration, and infrastructure management.

Qualifications

• Strong experience with Google Cloud Platform (GCP) and its core services (Compute, Storage, IAM, VPC).
• Experience with GPU resource management tools (e.g., Run.ai).
• Proficiency with Python for AI/ML workflows and automation.
• Hands-on experience with MLOps platforms like Vertex AI.
• Experience with Terraform for managing cloud infrastructure using Infrastructure as Code (IaC) practices.
• Knowledge of Kubernetes and container orchestration platforms such as OpenShift and GKE.
• Familiarity with monitoring and logging tools like Prometheus, Grafana, and the ELK Stack.
• Proven track record of working with CI/CD pipelines and DevOps automation tools.

Additional Information

Pay Range: $75,000 - $146,000

The range shown represents a grouping of relevant ranges currently in use at Publicis Sapient. Actual range for this position may differ, depending on location and specific skillset required for the work itself.

Benefits of Working Here:

  • Flexible vacation policy; time is not limited, allocated, or accrued
  • 16 paid holidays throughout the year
  • Generous parental leave and new parent transition program
  • Tuition reimbursement 
  • Corporate gift matching program 

As part of our dedication to an inclusive and diverse workforce, Publicis Sapient is committed to Equal Employment Opportunity without regard for race, color, national origin, ethnicity, gender, protected veteran status, disability, sexual orientation, gender identity, or religion. We are also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, you may contact us at [email protected] or you may call us at +1-617-621-0200.

Apply Now

Date Posted

12/03/2024

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Enterprise Customer Success Manager - Rokt

Views in the last 30 days - 0

mParticle by Rokt a leading customer data platform is seeking an Enterprise Customer Success Manager The role involves serving as a trusted consultant...

View Details

AWS Alliance Driver, Director - PwC

Views in the last 30 days - 0

The text describes a role for an AWS Alliance Director at PwC The individual will lead the AWS Alliance across various sectors focusing on the overall...

View Details

Business Account Executive - Spectrum

Views in the last 30 days - 0

The Business Account Executive role involves selling primary and ancillary communications solutions to small and mediumsized businesses within a speci...

View Details

Senior Software Engineer, Devices Automation - Block

Views in the last 30 days - 0

Square a company that has evolved since its inception in 2009 is seeking a Software Engineer with extensive experience in embedded devices and test en...

View Details

Software Engineering Lead - Dotdash Meredith

Views in the last 30 days - 0

Dotdash Meredith is seeking a skilled Engineering Lead for a missioncritical role in designing and scaling their nextgeneration publishing platform Th...

View Details

Principal Product Marketing Manager - Rokt

Views in the last 30 days - 0

mParticle by Rokt a leading customer data platform is seeking a Principal Product Marketing Manager The role involves driving market leadership creati...

View Details