Cookies & analytics consent
We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.
Read how we use data in our Privacy Policy and Terms of Service.
🤖 15+ AI Agents working for you. Find jobs, score and update resumes, cover letter, interview questions, missing keywords, and lots more.

Scaled Cognition • Boston, Massachusetts, United States
Role & seniority: QA Manager at Scaled Cognition
Stack/tools: Python (intermediate), conversational AI/LLM systems, testing pipelines, evaluation benchmarks, production monitoring metrics; AI/LLM libraries and tooling
Develop and implement scalable QA plans for evaluating AI agents; define KPIs to track progress over time
Collaborate with product and engineering to document findings, test fixes, and recommend improvements to models and conversational flows
Lead and mentor QA engineers; establish testing best practices and processes for conversational AI
Intermediate Python
Experience building/testing conversational AI/LLM systems
Background in evaluation benchmarks and production monitoring metrics
Documentation precision for test plans, cases, and bug reports
Ability to work with AI tooling to enable rapid iteration
Experience building automated testing pipelines for scalable QA
Familiarity with AI-powered assistants/tooling and rapid prototyping
History of cross-functional collaboration in product/engineering
Location & work type: Location and work type not specified in the provided text
Scaled Cognition is the world’s only model lab dedicated exclusively to customer experience and pioneering agentic models purpose-built for reliable action-taking enterprise applications. Backed by Khosla Ventures, the company’s flagship Agentic Pretrained Transformer (APT) eliminates hallucinations, enforces enterprise policies and increases reliability in real-world CX workflows.
Founded by serial AI entrepreneurs, former Microsoft Corporate Vice President of Conversational AI Dan Roth, and UC Berkeley AI Professor Dan Klein, and built by a team of world-class PhD researchers and engineers, Scaled Cognition advances the science of agentic AI to deliver safe, policy-aligned automation that enterprises can trust.