Cookies & analytics consent
We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.
Read how we use data in our Privacy Policy and Terms of Service.
π€ 15+ AI Agents working for you. Find jobs, score and update resumes, cover letter, interview questions, missing keywords, and lots more.

LeewayHertz β’ Gurugram, Haryana, India
Role & seniority: QA Lead β AI & GenAI Systems (8β12+ years in software QA; 3β5+ years leading QA teams)
Stack/tools: API, backend, and integration test automation; CI/CD pipelines; GenAI/LLM-based testing; RAG architectures (vector databases, embeddings); LLMOps/MLOps tools; test management: Jira, Confluence, TestRail (preferred)
Define end-to-end QA strategy, test planning, and AI/non-AI quality metrics; own QA governance
Lead automation initiatives, design automated test frameworks, and integrate QA into CI/CD for rapid feedback
Validate GenAI/LLM systems, non-deterministic outputs, RAG pipelines, safety/bias guardrails; mentor QA teams; collaborate with engineering/ML/product
8β12+ years QA experience with 3β5+ years in leadership
Strong API, backend, and integration test automation; hands-on CI/CD
Proven GenAI/LLM testing in production; non-deterministic/probabilistic validation
Experience with RAG, embeddings, vector databases; familiarity with LLMOps/MLOps; multi-agent testing
SDLC/Agile knowledge; effective cross-functional collaboration
Startup/fast-paced product experience
AI evaluation frameworks, benchmarking, A/B testing
Prompt engineering, prompt regression/versioning
Experience with Jira/Confluence/TestRail
Location & work type: Remote position (India)
Note: Neutral, concise, no hype.
Job Description
This is a remote position.
Job Summary
We are seeking an experienced and hands-on QA Lead β AI & GenAI Systems to own and drive quality across next-generation AI-powered products. The ideal candidate will bring deep expertise in traditional QA practices along with strong experience testing GenAI, LLM-based, and agentic systems in high-scale production environments.
You will define the QA strategy, lead automation initiatives, establish AI-specific quality metrics, and work closely with engineering, ML, and product teams to ensure reliability, safety, and performance of complex AI workflows. This role is critical in shaping quality standards for non-deterministic systems, multi-agent architectures, and retrieval-augmented generation (RAG) pipelines in a fast-paced startup environment.
Responsibilities
Own and define the end-to-end QA strategy, test planning, and quality metrics for AI and non-AI systems. Lead and mentor QA engineers, ensuring best practices in automation, test design, and execution. Design and implement automated test frameworks for API, backend, integration, and regression testing. Integrate QA processes into CI/CD pipelines to enable continuous testing and rapid feedback loops. Collaborate closely with engineering, ML, and product teams to validate functional and non-functional requirements. Define and track AI-specific quality metrics including accuracy, relevance, hallucination rate, latency, and consistency. Test GenAI / LLM-based systems including hosted and open-source model integrations. Validate non-deterministic behaviors, probabilistic outputs, and prompt-based workflows. Lead testing for RAG systems, including vector search, embeddings, retrieval accuracy, and response grounding. Execute safety, bias, and guardrail testing to ensure responsible AI behavior. Support evaluation frameworks such as human-in-the-loop, offline benchmarking, and online experimentation. Validate data quality used for model training, fine-tuning, and inference pipelines. Test agent workflows involving multi-step reasoning, tool calling, memory/state handling, and orchestration logic. Collaborate on prompt engineering, prompt regression testing, and prompt versioning strategies.
Requirements
Job
8β12+ years of experience in software QA with 3β5+ years leading QA teams. Strong expertise in API, backend, and integration test automation. Hands-on experience with CI/CD pipelines and automated regression testing. Proven experience testing GenAI / LLM-based applications in production environments. Deep understanding of non-deterministic systems and probabilistic output validation. Experience with RAG architectures, embeddings, vector databases, and retrieval quality testing. Familiarity with LLMOps / MLOps tools, model monitoring, and evaluation pipelines. Experience testing multi-agent systems or agent orchestration frameworks. Strong understanding of SDLC, Agile methodologies, and quality governance.
Personal
Strong leadership and mentoring capabilities. Excellent analytical, problem-solving, and decision-making skills. Ability to collaborate effectively with cross-functional technical and non-technical teams. Strong communication skills with the ability to explain complex QA and AI concepts clearly.
Preferred Skills
Job
Experience working in startup or fast-paced product development environments. Exposure to AI evaluation frameworks, benchmarking techniques, and A/B testing. Knowledge of prompt engineering best practices, prompt regression, and version control. Experience testing high-scale, distributed, or cloud-native AI systems. Familiarity with tools such as Jira, Confluence, TestRail, or similar QA management platforms.
Personal
Proactive mindset with a strong sense of ownership and accountability. Ability to work under tight timelines and evolving requirements. Strong attention to detail while balancing speed and quality Passion for building reliable, responsible, and scalable AI products.
Other Relevant Information
Bachelorβs degree in Engineering (BE/B.Tech β CS/IT) or Masterβs degree in Computer Applications (MCA) or equivalent qualification. Ability to work independently and collaboratively in a global, fast-paced environment.
Benefits
This role offers the flexibility of working remotely in India.
LeewayHertz is an equal opportunity employer and does not discriminate based on race, colour, religion, sex, age, disability, national origin, sexual orientation, gender identity, or any other protected status. We encourage a diverse range of applicants.
check(event) ; career-website-detail-template-2 => apply(record.id,meta)" mousedown="lyte-button => check(event)" final-style="background-color:#6875E2;border-color:#6875E2;color:white;" final-class="lyte-button lyteBackgroundColorBtn lyteSuccess" lyte-rendered=""> Show more Show less