← Back to opportunities
About the Role
Responsibilities
- Design, deploy, and maintain HPC platforms including compute clusters, GPU acceleration, and high-performance interconnects (InfiniBand, advanced Ethernet).
- Scheduling and orchestration using SLURM, PBS, or similar workload managers.
- Design and maintain storage systems: parallel file systems (Lustre, GPFS), object storage, and hybrid solutions.
- Implement containers and reproducible environments with Singularity/Apptainer, Docker, and Kubernetes for HPC workloads.
- Automate infrastructure as code using Ansible, Terraform, and CI/CD pipelines.
- Integrate quantum computing into HPC workflows via simulators, SDKs (Qiskit, Cirq, Braket), and cloud or hardware backends.
- Perform performance tuning and resource optimization.
- Provide advanced user support and incident resolution.
- Develop and maintain technical documentation and knowledge sharing.
- Collaborate with vendors, research...
Ready to Join Through a Referral?
Apply now and get connected directly with the hiring team
Apply for this Position