Software Engineer - Distributed Systems

GW435 Posted: 29/04/2026

$250,000-$270,000
San Francisco, CA
Permanent

About the job

Software Engineer - Distributed Systems

We’re working with a well-funded Series A company building a new class of cloud infrastructure for AI. They’re tackling a fundamental problem: today’s AI systems are tightly coupled to specific hardware, creating limits in cost, scale, and efficiency.

Their approach decouples workloads from hardware — dynamically partitioning and scheduling them across heterogeneous compute (GPUs, accelerators, multi-gen systems). This is deep, production-grade distributed systems work operating at real scale.

What you’ll do

Own core distributed systems from design → build → deployment → operation
Design scheduling, routing, and resource management systems across thousands of nodes
Build production-grade control planes and APIs for workload orchestration
Make explicit tradeoffs around performance, reliability, and efficiency at scale
Debug complex distributed failures and continuously improve system behaviour

What makes this interesting

High ownership: you’re building foundational infrastructure, not abstracted layers
Real scale: systems designed for large, multi-cluster / datacenter environments
Hard problems: concurrency, scheduling, failure modes, and resource allocation
Heterogeneous compute: working beyond standard cloud abstractions
Early-stage: opportunity to shape architecture with real production constraints

We’re looking for

Engineers who have built or operated distributed systems in production
Strong fundamentals in concurrency, systems design, and failure handling
Evidence of ownership over meaningful systems (not just contributions)
Comfort reasoning about tradeoffs in large-scale environments
Ability to clearly explain design decisions and system behaviour

It's not necessary, but it's great if you have:

Experience with Kubernetes or similar systems beyond basic usage
Background in scheduling, queues, or resource management systems
Experience designing service-oriented architectures (RPC, async systems)
Systems-level programming experience (e.g. Go, C++, Python)

Anna Heneghan Senior ML Research & Engineering Recruiter

Apply for this role

First Name

Last Name

Telephone Number

Email Address

Resume, LinkedIn or Dropbox URL

Resume Upload

Choose File

LinkedIn / Dropbox URL

Message

By submitting this form you agree to our Terms & Conditions, Privacy Policy & Cookie Policy

Not yet registered? Create an account today

Already have an account? Sign in now

Still looking? What about...

Featured Jobs

View all jobs

Posted: 15/05/2026

Senior AI/ML Robotics Engineer

GW489

$200,000-$300,000
North Reading, MA
Permanent

About the job🚨 Senior AI/ML Engineer – Physical AI Solutions📍 North Reading, MA | Onsite (R...

View Job

Posted: 15/05/2026

Research Engineer - Interpretability Systems

GW488

$250,000-$350,000
San Francisco, CA
Permanent

About the job🚨 Research Engineer – Interpretability Systems📍 San Francisco, CA | Onsite🧠 E...

View Job

Posted: 15/05/2026

Inference Engineer

GW487

$200,000-$350,000
Santa Clara, CA
Permanent

About the jobSenior / Principal Machine Learning Engineer – Inference Serving FrameworksFull-time ...

View Job

Posted: 15/05/2026

Research Engineer – Experimental ML Systems

GW486

$250,000-$350,000
San Francisco, CA
Permanent

About the job🚨 Research Engineer – Experimental ML Systems📍 San Francisco, CA | Onsite🧠 Ea...

View Job

Posted: 15/05/2026

Performance Modeling Engineer

GW485

$200,000-$350,000
Santa Clara, CA
Permanent

About the jobSr/Principal Software Engineer – Simulator DeveloperLocation: Santa Clara, CA | Onsit...

View Job

Posted: 15/05/2026

Senior Engineer (Electron)

GW484

$220,000-$240,000
San Francisco, CA
Permanent

About the jobSoftware Engineer (Electron) We are seeking a Software Engineer to architect and engin...

View Job

Posted: 15/05/2026

Forward Deployed Engineer

GW483

$150,000-$200,000
United States
Permanent

About the jobHiring for a Remote Senior Forward Deployed position, across the US and Canada, f...

View Job

Posted: 08/05/2026

Senior AI Engineer

GW482

$150,000-$180,000
United States
Permanent

About the job📍 Remote across the U.S. (Eastern/Central time zones) 💰 $150-180k + strong benefitsT...

View Job

Posted: 08/05/2026

Senior Backend Engineer

GW481

$180,000-$250,000
Boston, MA
Permanent

About the jobSenior Software Engineer (Agentic AI) - Boston, MA A fast-growing deep tech startup is...

View Job

Posted: 08/05/2026

Lead AI Engineer

GW480

$160,000-$245,000
United States
Permanent

About the jobWe are looking for a Lead AI Engineer with 7+ years’ experience buildi...

View Job

Quick Resume Dropoff

Software Engineer - Distributed Systems

About the job

Apply for this role

Still looking? What about...

Featured Jobs

Senior AI/ML Robotics Engineer

Research Engineer - Interpretability Systems

Inference Engineer

Research Engineer – Experimental ML Systems

Performance Modeling Engineer

Senior Engineer (Electron)

Forward Deployed Engineer

Senior AI Engineer

Senior Backend Engineer

Lead AI Engineer

Contact Us

Find us on social

Useful Links

Legal