Role Overview
We are looking for a Software Lead (8+ years’ experience) to own the runtime and neural network (NN) layer of a next-generation AI accelerator platform. The role focuses on designing, implementing, and optimizing NN operators, including new ops written against CUDA or custom runtime APIs, to deliver high-performance execution on custom AI hardware.
Key Responsibilities
- Design and optimize NN operators for performance-critical workloads
- Develop new NN ops using CUDA/custom runtime APIs
- Drive runtime-level optimizations across compute, memory, and scheduling
- Own runtime ↔ NN layer interfaces and execution model
- Implement and optimize operator fusion (e.g., matmul + bias + LayerNorm) for efficient hardware utilization