V&V Engineer – AI-Driven Testing & Validation
Datum Software, Inc • Plano, Texas, United States
About the Role
Key Responsibilities:
- Lead end-to-end quality engineering for enterprise AI applications, including LLM-powered products, RAG pipelines, and agentic workflows
- Design and execute prompt validation strategies, evaluating LLM responses for accuracy, semantic relevance, hallucination risk, and safety compliance
- Build automated evaluation pipelines for AI model outputs using metrics such as BLEU, ROUGE, embedding-based similarity, precision, recall, and F1-score
- Validate agentic systems (tool use, multi-step reasoning, planner-executor workflows) for correctness, determinism, and failure mode handling
- Architect and maintain Python-based automation frameworks for AI/ML model evaluation, regression testing, and continuous model quality monitoring
- Integrate AI testing into CI/CD pipeline...