← Back to opportunities

Sr. Software Engineer, Inference

📍 Location
london
⏰ Job Type
Full-time
📅 Posted
May 23, 2026

About the Role

About the role:

Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet-wide orchestration across diverse AI accelerators.

The team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, while enabling breakthrough research by giving our scientists the high-performance inference infrastructure they need to develop next-generation models. We tackle complex, distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms.

Strong candidates may also have experience with:

  • Implementing and deploying machine learning systems at scale
  • Load bala...

Ready to Join Through a Referral?

Apply now and get connected directly with the hiring team

Apply for this Position