About the Role
We are strengthening a client-facing delivery team that operates Kubernetes and Linux compute stacks for advanced AI workloads, including GPU scheduling with Volcano. You will automate day-to-day operations with Python and UNIX Shell, manage namespaces, RBAC, and quotas, and partner with researchers to keep platforms fast and dependable.
Responsibilities
- Deliver and support GPU-enabled Kubernetes clusters and standalone Linux compute environments, tuned for reliable scheduling and high throughput
- Run Volcano scheduling operations, including queue setup, pod execution, GPU allocation, and enforcement of namespace quotas
- Own Kubernetes administration across namespaces, RBAC, resource quotas, and workload isolation strategies
- Create and evolve Python and Shell scripts that automate job submission, resource provisioning, and system reporting
- Partner with orchestration, optimization, and observability teams to improve scheduling ef...