AI Computing Development Engineer, TensorRT and TensorRT-LLM

NVIDIA • Shanghai, China

📍 Location

Shanghai

⏰ Job Type

Full-time

📅 Posted

June 03, 2026

About the Role

                NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like generative AI, computer vision, speech recognition, recommender systems, and large-scale language and multimodal models. Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines. The ability to work in a fast-paced, delivery-focused environment is required, and excellent interpersonal skills are a must.
  
  
What you'll be doing:
+ Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms
+ Perform performance analysis, optimization, and tuning of deep learning inference workloads
+ Track and integrate academic and industry advancements in AI and feature-update TensorRT/TensorRT-LLM accordingly
+ Provide feedback into archit...
            

Ready to Join Through a Referral?

Apply now and get connected directly with the hiring team

Apply for this Position