← Back to opportunities

Machine Learning Performance Engineer, Annapurna Labs

📍 Location
Tel Aviv-Yafo
⏰ Job Type
full-time
📅 Posted
April 10, 2026

About the Role

Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators — Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud.
We're building a new core group of engineers in TLV (Tel Aviv) to drive innovation in ML systems performance and software. As a Machine Learning Performance Engineer, you'll help shape the direction of the team from the ground up and work on:

Optimizing system performance across the entire ML software stack
Analyzing high-performance ML workloads running on Annapurna hardware
Developing high-performance kernels for critical ML operations
Enhancing the Neuron SDK to improve developer experience and system capabilities
Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performance

As part of the Performance Engineering Te...

Ready to Join Through a Referral?

Apply now and get connected directly with the hiring team

Apply for this Position