ML Systems Engineer (Inference)
GW130
Posted: 04/11/2025
- $250,000-$350,000
- Palo Alto, CA
- Permanent
About the job
ML Systems Engineer (Inference)
We are seeking an inference focussed Machine Learning Systems Engineer to join a scale up team founded by ex-Tesla AI leadership, building advanced custom silicon.
The Founders are a world class group of Engineers and technical Leaders from the likes of Tesla, AMD, Cerebras and Apple and are pushing the boundaries of what ML hardware can achieve.
You'll focus on their inference and serving stack, building and optimizing the inference platform, with a focus on latency, batching and dynamic shape support.
We are seeking an ML Systems Engineer (Inference) with:
- Demonstrable expertise building and optimizing inference stacks for ML workloads
- Experience working with various inference related tools and frameworks which could include vLLM, TensorRT, SGLang and Pytorch
- Exposure to custom accelerators or wider ML hardware would be highly beneficial
Location: Palo Alto, primarily in office
Compensation: Competitive against most in the Bay with meaningful early stage equity
Tom Parker
Senior Software Systems & HPC Recruiter