Cookies & analytics consent
We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.
Read how we use data in our Privacy Policy and Terms of Service.
🤖 15+ AI Agents working for you. Find jobs, score and update resumes, cover letter, interview questions, missing keywords, and lots more.

Kavak GCC • Punjab, Pakistan
Salary: AED 3,675 / month
Role (seniority)
Stack / tools
Conversational AI frameworks: LangChain, LangGraph, prompt-driven workflows
Voice automation: Twilio, Dialogflow, Amazon Lex, Synthflow
Testing tools: Postman, Playwright, pytest, JMeter
NLP testing concepts: intent/slot recognition, confusion matrix, false positives/negatives
Top 3 responsibilities
Design and execute test plans for voice and chat AI agents; build/maintain automated regression testing frameworks
Conduct load testing, latency measurement, barge-in handling, and edge-case simulations; report bugs and UX issues
Evaluate intent accuracy, NLU robustness, fallback handling; monitor analytics, logs, and prompt usage; collaborate with engineers/product leads to prioritize fixes
Must-have skills
3+ years QA/testing in conversational AI
Proficiency with LangChain/LangGraph or prompt-driven workflows
Experience with voice automation tools and testing stacks (Twilio, Dialogflow, Lex, Synthflow; Postman/Playwright/pytest/JMeter)
Strong NLP testing knowledge (intent/slot recognition, confusion matrix, FP/FN), debugging, documentation, and bug reporting
Nice-to-haves
Experience with real-time voice TTS/STT, barge-in, silence detection
Familiarity with LLM apps (OpenAI, Claude, Gemini)
QA automation experience (CI/CD tests, conversational analytics, prompt-eval tools like LangSmith, Promptfoo, TruLens)
Domain exposure (automotive, e-commerce, sup
Simulate edge cases: noisy environments, overlapping speech, disconnections, multilingual input. Report bugs, behavioral inconsistencies, hallucinations, and voice UX issues. Monitor and interpret analytics, call logs, response latencies, prompt token usage, etc. Collaborate closely with engineers and product leads to prioritize fixes.
Requirements
Experience with voice automation tools: Twilio, Dialogflow, Amazon Lex, or Synthflow. Strong knowledge of testing tools (Postman, Playwright, pytest, JMeter, etc.).
Understanding of NLP testing: intent/slot recognition, confusion matrix, false positives/negatives. Excellent debugging, test documentation, and bug reporting skills.
Nice to Have Experience testing systems with real-time voice TTS/STT, barge-in, and silence detection.
Familiarity with LLM-based applications: OpenAI, Claude, Gemini, etc.
Hands-on with QA automation: CI/CD test suites, conversational analytics, or prompt eval tools like LangSmith, Promptfoo, or TruLens. Knowledge of automotive, e-commerce, or support systems. Arabic language testing experience. Do you want to be part of this story? Apply now!