Role & seniority

AI Tester (3–5 years of experience)

Stack/tools

Programming: Python or JavaScript/TypeScript

QA/Automation: Playwright, Selenium, API testing (Postman, Rest Assured, Python Requests)

CI/CD & DevOps: Jenkins; version control (Git); issue tracking (Jira)

Observability: CloudWatch, Datadog

AI/LLM focus: Prompt Engineering, LLM Evaluation techniques

Other: JSON/JSONL data handling; familiar with Docker (nice-to-have)

Top 3 responsibilities

Design and execute AI/LLM evaluation strategies (accuracy, hallucination, tone, relevance); perform prompt engineering and defect identification
Develop and maintain automated test scripts validating AI pipelines; UI testing for AI-integrated apps
Conduct robust API/integration testing; ensure token usage, latency, error handling; integrate tests into CI/CD pipelines (e.g., Jenkins)

Must-have skills

Practical understanding of Generative AI concepts, Prompt Engineering, LLM evaluation
Proficiency in Python (preferred) or JavaScript/TypeScript
API testing expertise (Postman collections, assertions) and RESTful validation
Core QA tools: Git, Jira, Jenkins
Data handling with JSON/JSONL; strong analytical/problem-solving and communication

Nice-to-haves

Experience with AI evaluation frameworks (e.g., DeepEval, RAG)
Cloud exposure (AWS: EC2, Lambda, S3)
Docker/containerized testing
Performance testing basics (JMeter, k6)

Location & work type

Sector-142, Noida, India
Hybrid working model (

Full Description

Who we are At 3CLogic, we are big believers that our people are the most important asset we have and that winning is a team sport. 3CLogic is a fast growing, venture-backed, SaaS “startup” with our headquarters in Rockville, Maryland. Some of our roles are local to the main office and others are remote, but we have talented individuals working from everywhere as we continue to build our safety-first hybrid remote and in-person culture, and we care more about what you might bring to our team and where you want to go in your career than where you are located. We realize you've very likely read tons of job descriptions that look a whole lot like this one. But what we can't put in words is why we would love to hear from you. You've heard the term "living in the gray area," right? Well, a great fit for 3CLogic is someone who wants to live in technicolor. There's never a gray moment here! We are all entrepreneurs at heart, who believe that when you bring your full self to work, the possibilities are infinite. If your interest is piqued, let's chat! We'd love to show you, rather than tell you, what makes us special, and find a place in our organization where you can thrive. What we do Ever call a company or organization for help and wait on hold forever only to get to a person who can’t help you? Well we are the ones that fix that! 3CLogic is a global provider of voice AI, Contact Center, and SMS solutions to enterprise and Global 2000 organizations worldwide – think 7-Eleven, Swiss Railways, Regeneron, Northeastern University, Hyatt Hotels, or LabCorp. Organizations leverage our technology and services every day to increase the quality of service to their customers/employees, improve the performance of the agents serving them, lower their operational costs, and optimize how easy it is to analyze and manage it all. We make calling for help a positive experience and efficient channel for everyone! A strategic ServiceNow and SAP partner, 3CLogic is paving the way for organizations to digitally transform customer and employee experiences, deliver conversational voice self-service offerings, enable remote work at scale, and leverage AI to drive better business outcomes.

General Job Details

Position Name: AI Tester

Experience: 3 to 5 Years

Job Type: Full-Time

Location: Sector-142, Noida (Hybrid Working Model)

Position Summary

We are looking for an innovative AI Quality Engineer to join our team. In this role, you will go beyond traditional software testing to ensure the accuracy, safety, and reliability of our Generative AI and LLM-based applications. You will combine your solid foundation in QA automation (Python/TS/Playwright) and API testing with emerging skills in Prompt Engineering and LLM Evaluation.
If you are a QA professional who is excited about how AI works, loves to "break" models, and wants to be at the forefront of AI testing, this role is for you.

Key Responsibilities

AI & LLM Evaluation:

Design and execute evaluation strategies for Large Language Models (LLMs), focusing on metrics like Accuracy, Hallucination, Tone, and Relevance.
Perform Prompt Engineering and testing to optimize model outputs and ensure alignment with business requirements..
Identify and document AI-specific defects.

Automation & Scripting:

Develop and maintain automated test scripts using Python or JavaScript/TypeScript to validate AI pipelines.
Utilize tools like Playwright or Selenium for UI testing of AI-integrated web applications.

API & Integration Testing:

Perform robust API testing using Postman or programmatic methods (Rest Assured/Python Requests) to validate AI model endpoints.
Ensure efficient token usage, latency standards, and proper error handling in API responses.

CI/CD & DevOps Integration:

Integrate automated AI evaluation suites into Jenkins pipelines for continuous testing.
Analyze logs and metrics using tools like CloudWatch or Datadog to identify performance bottlenecks

Skills

AI/LLM Exposure: Practical understanding of Generative AI concepts, Prompt Engineering, and LLM Evaluation techniques (RAG, Fine-tuning concepts).

Programming: Hands-on experience with Python (highly preferred for AI tasks) or JavaScript/TypeScript.

API Testing: Proficient in Postman (creating collections, assertions) and testing RESTful APIs.

Core QA Tools: Experience with Git for version control, Jira for defect tracking, and Jenkins for CI/CD.

Data Handling: Comfortable working with JSON/JSONL formats and handling data for test scenarios.

Soft Skills: Strong analytical and problem-solving abilities, effective communication, an ownership mindset, and a willingness to learn and adapt.

Qualifications

Bachelor’s degree in Computer Science, Information Technology, or a related field with 70% or equivalent.

Desired Skills

Familiarity with AI Evaluation frameworks like DeepEval, Ragas.

Experience with cloud platforms (AWS: EC2, Lambda, S3) for setting up test environments. Knowledge of Docker basics for containerized testing. Basic understanding of Performance Testing tools (JMeter/k6).

Benefits

Flexible Working Hours.
Hybrid Working Style.
Personal Accidental Insurance.
Health Insurance to Self, Spouse and two kids.
5 days working week.