Design and implement the architecture for model training, fine-tuning, and serving. Build platform components that support heterogeneous compute environments (GPUs, NPUs, accelerators). Develop and optimize high-performance inference stacks using frameworks such as vLLM, SGLang, TensorRT-LLM, or Triton. Develop APIs, CLI tools, and backend services for model lifecycle management.
About the Role
Job Responsibilities:
We are hiring a Senior / Lead Software Engineer to design and build an AI/ML platform capable of high-throughput training and inference across local and cloud GPU environments. This role focuses on systems architecture, GPU acceleration, performance engineering, and reliable operation of AI workloads at scale. You will lead engineering initiatives, define platform architecture, and collaborate closely with ML and hardware teams.
Responsibilities
System Architecture & Core Engineering
Local & Cloud G...