Codvo.ai • Pune, Maharashtra, India
Role & seniority: Principal QA – AI & Conversational Systems (senior/principal level)
Stack/tools: LLMs and conversational AI; AI evaluation metrics; AI safety and bias validation frameworks; synthetic call testing; prompt robustness and fallback testing; RAG/knowledge grounding validation
Key responsibilities:
- Lead evaluation of AI-driven voice bots, agent assist, and summarization systems; design LLM validation frameworks for accuracy, safety, and latency
- Define hallucination detection, response accuracy scoring, and bias/safety testing standards; ensure compliance with data privacy requirements
- Run synthetic call testing, AI load benchmarking, prompt robustness validation, and fallback handling
Requirements:
- Experience with LLM or conversational AI testing
- Hands-on exposure to AI evaluation metrics and AI safety/bias validation
- Ability to design and implement AI testing frameworks and validation processes
- Background in responsible AI, bias testing, and security/compliance testing
- Experience with RAG and knowledge grounding validation
- Familiarity with latency and performance benchmarking for AI systems
Location & work type: Not specified; role implies remote or on-site at Codvo, with potential global team collaboration.
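The responsibilities above mention hallucination detection, response accuracy scoring, and RAG/knowledge grounding validation. As a rough illustration of what a grounding check can look like, here is a toy heuristic (not Codvo's actual framework; all names are hypothetical) that flags answer sentences with low token overlap against the retrieved context. Production frameworks typically use NLI models or LLM judges instead of bag-of-words overlap.

```python
# Toy grounding/hallucination check: flag answer sentences whose
# token overlap with the retrieved context falls below a threshold.
# This is only an illustrative heuristic, not a production metric.

def token_overlap(sentence: str, context: str) -> float:
    """Fraction of the sentence's tokens that also appear in the context."""
    sent_tokens = set(sentence.lower().split())
    ctx_tokens = set(context.lower().split())
    if not sent_tokens:
        return 0.0
    return len(sent_tokens & ctx_tokens) / len(sent_tokens)

def flag_unsupported(answer: str, context: str, threshold: float = 0.5) -> list[str]:
    """Return answer sentences that look unsupported by the context."""
    sentences = [s.strip() for s in answer.split(".") if s.strip()]
    return [s for s in sentences if token_overlap(s, context) < threshold]
```

A claim absent from the context ("lifetime warranty" when the context only covers a 30-day refund window) would be flagged, while grounded sentences pass.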
Principal QA – AI & Conversational Systems
Core Responsibilities
- Hallucination detection and response accuracy scoring
- RAG and knowledge grounding validation
- Synthetic call testing and AI load benchmarking
- Prompt robustness and fallback validation
- Sensitive data leakage and AI compliance testing
Ideal Background
- Experience in LLM or conversational AI testing
- Hands-on exposure to AI evaluation metrics
- Strong understanding of AI safety and bias validation
- Experience designing AI testing frameworks
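Prompt robustness and fallback validation, listed above, can be smoke-tested by running paraphrased or noisy variants of an utterance through the bot and checking that each reply either answers or triggers a safe fallback. The sketch below is a hypothetical harness (the `bot` callable, keywords, and fallback marker are all stand-ins), not a description of Codvo's tooling.

```python
# Toy prompt-robustness harness: classify each reply as "answered"
# (contains a required keyword), "fallback" (contains the safe
# fallback marker), or "failed". All names here are illustrative.

def robustness_report(bot, variants, required_keywords, fallback_marker="I'm not sure"):
    """Run each prompt variant through `bot` and classify the reply."""
    report = {}
    for prompt in variants:
        reply = bot(prompt)
        if any(k.lower() in reply.lower() for k in required_keywords):
            report[prompt] = "answered"
        elif fallback_marker.lower() in reply.lower():
            report[prompt] = "fallback"
        else:
            report[prompt] = "failed"
    return report
```

In practice the variant list would come from paraphrase generation or synthetic call traffic, and any "failed" entry (neither an answer nor a graceful fallback) would be a test failure.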