Senior Deep Learning Research Engineer, LLM Inference

NVIDIA • Tel Aviv, Israel

📍 Location

Tel Aviv

⏰ Job Type

Full-time

📅 Posted

June 06, 2026

About the Role

We are seeking a Deep Learning Research Engineer to join our team and help develop the next generation of Large Language Model (LLM) inference algorithms. You will work on technologies that directly enhance NVIDIA's software, making the latest LLMs more efficient and accessible to users worldwide. This role is designed for someone with strong research foundations who also wants to build software that runs at scale in production systems around the world.
  
By joining us, you will be part of a strategic effort to establish NVIDIA as the definitive platform for high-performance LLM inference. The work requires a combination of research taste, experimental rigor, and engineering ownership: you will explore new ideas, run rigorous evaluations, and help transform successful approaches into tools and implementations. 
  
What you'll be doing:
+ Develop and improve benchmarks, profiling workflows, and evaluation pipelines that make inference performance measurable and...
