← Back to opportunities

Associate Director, Software Engineering (Model Hosting/Inference Optimisation)

📍 Location
Shenzhen
⏰ Job Type
Full-time
📅 Posted
May 22, 2026

About the Role

Some careers have more impact than others.

If you’re looking for a career where you can make a real impression, join HSBC and discover how valued you’ll be.

 

We are currently seeking an experienced professional to join our team in the role of Associate Director, Software Engineering (Model Hosting/Inference Optimisation).

 

Business: CTO Platforms (AI Platforms)

Location: Shenzhen / Guangzhou

Req ID: 44990

 

Principal responsibilities

  • Design, build, and operate scalable, reliable model hosting platforms for LLMs, embeddings, and STT/TTS across heterogeneous hardware. 
  • Drive inference optimisation for latency, throughput, and cost (quantisation, KV-cache optimisation, dynamic/continuous batching). 
  • Evaluate, integrate, and tailor inference frameworks (e.g., vLLM, TensorRT-LLM,...

Ready to Join Through a Referral?

Apply now and get connected directly with the hiring team

Apply for this Position