📍 Location
Bengaluru
⏰ Job Type
Full-time
📅 Posted
June 04, 2026

About the Role

Scribie is an AI-powered, human-verified audio and video transcription service, trusted globally since 2008. We specialize in delivering accurate and reliable transcription by blending advanced AI technology with human expertise. Headquartered in the US, we run a hybrid model at our Bangalore office, combining the flexibility of remote work with the collaboration of in-person engagement. This approach offers our team both autonomy and growth opportunities in a dynamic and supportive environment.
We’re building production-grade audio foundation models for high-stakes legal and enterprise transcription — real customer data, messy audio, real consequences.
This is not a paper-only research role.
You’ll own the full ML lifecycle:
Fine-tuning large audio / multimodal models using SFT, LoRA, and RL-based preference optimization (DPO / PPO / ORPO)
Beating strong baselines such as Whisper-large, GPT-4o, Gemini, and Claude on domain-specific data
Designing WER, di...
