About the Role
Job description:
KEY RESPONSIBILITIES
· Design and own multi-stage ingestion pipelines — handling HTML, PDF, and image sources with layout parsing, metadata extraction, and vector storage
· Architect RAG systems with hybrid search (BM25 + semantic), document versioning, and cross-reference resolution
· Build production-grade FastAPI services with typed response envelopes, OpenAPI compliance, and Langfuse tracing integration
· Engineer prompt systems — structured prompts, prompt versioning, few-shot strategies, and judge-based evaluation
· Integrate and manage LLM routing via LiteLLM: model fallback, cost control, and per-route configuration
· Design agentic workflows using LangGraph: multi-step retrieval, tool use, and conditional branching
· Build and maintain knowledge graphs in NebulaGraph / Neo4j — entity extraction, relationship modelling, and domain ontology alignment
· Implement...
Ready to Join Through a Referral?
Apply now and get connected directly with the hiring team
Apply for this Position