
Senior AI Inference Engineer 100% Remote

📍 Location
Manila
⏰ Job Type
Full-time
📅 Posted
June 03, 2026

About the Role

Responsibilities

  • Deploy machine learning models to edge devices using the llama.cpp, ggml, and ONNX frameworks.
  • Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.

Qualifications

  • Excellent programming skills in C++; experience in JavaScript is a bonus.
  • Strong experience with the llama.cpp and ggml inference engines, including deploying models to specific GPU architectures.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers, LLMs, and diffusion models.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track r...

Ready to Join Through a Referral?

Apply now and get connected directly with the hiring team
