📍 Location
Shanghai
⏰ Job Type
Full-time
📅 Posted
June 03, 2026

About the Role

NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like generative AI, computer vision, speech recognition, recommender systems, and large-scale language and multimodal models. Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines. The ability to work in a fast-paced, delivery-focused environment is required, and excellent interpersonal skills are a must.


What you'll be doing:
+ Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms
+ Analyze, optimize, and tune the performance of deep learning inference workloads
+ Track academic and industry advancements in AI and integrate them into TensorRT/TensorRT-LLM as new or updated features
+ Provide feedback into archit...
