Post-training Researcher/Engineer

Company

xAI

Location

Peninsula

Type

Full Time

Job Description

About the Role

The post-training team at xAI transforms powerful pre-trained models into steerable, versatile systems capable of understanding and addressing real-world challenges.

As a post-training researcher/engineer, you will enhance the model's instruction-following capability and general usefulness to fulfill our mission – developing AI systems that can accurately understand the universe, create new knowledge, and improve themselves through interactions.

Focus

  • Creating and driving a research agenda to advance model quality.
  • Improving data mixtures by building data collection pipelines and developing data generation techniques.
  • Creating generalizable reward models and developing novel reinforcement learning algorithms.
  • Designing and implementing robust model evaluations.
  • Designing and implementing large-scale model training frameworks.
  • Collaborating with pre-training, reasoning, data, multimodal, applied, and product efforts to push the frontiers of model capability.

Ideal Experiences

  • Expertise in ML and fine-tuning large language models.
  • Track record in leading research that significantly impacts AI advancement.
  • Experience in data-driven large language model behavior improvements.
  • Experience in advanced reinforcement learning or inference-time search techniques.
  • Experience in developing benchmarks or large-scale distributed machine learning systems.
  • Experience in model optimizations under complex setups (e.g., multi-modality, multi-context, multi-agent, long-horizon tasks, diverse user preference/feedback).

Location

The role is based in the Bay Area [San Francisco and Palo Alto]. Candidates are expected to be located near the Bay Area or open to relocation.

Tech Stack

  • Python
  • JAX
  • Rust

Interview Process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

  1. Coding assessment in a language of your choice.
  2. Two post-training technical sessions: These sessions test your ability to formulate, design, and solve concrete problems in post-training. They can focus on research or engineering, depending on your background and experience.
  3. Meet the Team: Present your past exceptional work and your vision with xAI to a small audience.

Our goal is to finish the main process within one week. We don’t rely on recruiters for assessments. Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet.

Annual Salary Range

$180,000 - $440,000 USD

Date Posted

12/28/2024
