Post-training Researcher/Engineer
Company
xAI
Location
Peninsula
Type
Full Time
Job Description
About the Role
The post-training team at xAI transforms powerful pre-trained models into steerable, versatile systems capable of understanding and addressing real-world challenges.
As a post-training researcher/engineer, you will enhance the model's instruction-following capability and general usefulness to fulfill our mission – developing AI systems that can accurately understand the universe, create new knowledge, and improve themselves through interactions.
Focus
- Creating and driving a research agenda to advance model quality.
- Improving data mixtures by building data collection pipelines and developing data generation techniques.
- Creating generalizable reward models and developing novel reinforcement learning algorithms.
- Designing and implementing robust model evaluations.
- Designing and implementing large-scale model training frameworks.
- Collaborating with pre-training, reasoning, data, multimodal, applied, and product efforts to push the frontiers of model capability.
Ideal Experiences
- Expert in ML and fine-tuning large language models.
- Track record in leading research that significantly impacts AI advancement.
- Experience in data-driven large language model behavior improvements.
- Experience in advanced reinforcement learning or inference-time search techniques.
- Experience in developing benchmarks or large-scale distributed machine learning systems.
- Experience in model optimizations under complex setups (e.g., multi-modality, multi-context, multi-agent, long-horizon tasks, diverse user preference/feedback).
Location
The role is based in the Bay Area (San Francisco and Palo Alto). Candidates are expected to be located near the Bay Area or open to relocation.
Tech Stack
- Python
- JAX
- Rust
Interview Process
After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:
- Coding assessment in a language of your choice.
- 2 x post-training technical sessions: These sessions test your ability to formulate, design, and solve concrete problems in post-training. The focus can be research or engineering, depending on your background and experience.
- Meet the Team: Present your past exceptional work and your vision with xAI to a small audience.
Our goal is to finish the main process within one week. We don’t rely on recruiters for assessments. Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet.
Annual Salary Range
$180,000 - $440,000 USD
Date Posted
12/28/2024