Software Development Engineer II, AI/ML Elastic Collectives - Annapurna Labs

Amazon • Cupertino, United States

📍 Location

Cupertino

⏰ Job Type

Full-time

📅 Posted

June 06, 2026

About the Role

Description
We are seeking an experienced engineer to work on distributed AI/ML systems. This role focuses on collective operations, the fundamental operations that enable AI to scale across multiple accelerators and servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or HPC interconnects is highly valued.
  
If you like solving hard problems, want to work with HPC and ML customers, and want to iterate fast and deliver meaningful solutions at scale, then come join us! This role is truly at the forefront of AI/ML: you’ll be working on features for the largest clusters, with the largest customers, for the largest AI models.
  
The org you would be joining is Annapurna Labs, an integral part of AWS that develops hardware and software components that are critical building blocks for EC2 infrastructure. Every inst...
