Crossing Hurdles logo

Quality Assurance Specialist (Vietnamese) | $11/hr - Remote

Crossing Hurdles Vietnam

remote
Posted Feb 25, 2026

Role & seniority: LLM – AI Quality Analyst (Personalization), Vietnamese; Short-Term Contract, entry–mid level to contributor; immediate start.

Stack/tools: Remote work setup; strong emphasis on data annotation, AI quality evaluation, SxS (side-by-side) comparisons; use of primary personal Google account with enabled personal data sources; documentation of rationales and “Debug Info.”

Top 3 responsibilities

  • Design multi-turn prompts (1–5 turns) using personal context; conduct SxS evaluation and rank model responses.

  • Evaluate grounding, integration, usefulness, data usage accuracy; identify flaws, hallucinations, reasoning gaps; ensure natural personalization without over-narration.

  • Write clear, structured rationales tied to specific turns; extract/verify Debug Info; maintain strict data hygiene by deleting evaluation conversations.

Must-have skills

  • Vietnamese proficiency (reading/writing).

  • Experience in data annotation, AI quality evaluation, content moderation, or related roles.

  • Strong analytical ability for nuanced AI outputs; experience with creative prompt engineering and multi-turn conversations.

  • Understanding of personalization concepts and ability to document feedback clearly.

  • High attention to detail, excellent written communication, BS/BA or equivalent.

  • Self-motivated, able to work independently in a remote setup; reliable desktop/laptop and internet.

Nice-to-haves

  • Prior work with evaluative/ranking frameworks; exp

Full Description

Position: LLM – AI Quality Analyst (Personalization) – Vietnamese

Type: Short-Term Contract

Location: Remote

Commitment: 30–40 hours/week, 4-hour overlap with PST

Engagement Length: 2 months

Start Date: Immediate

Role Responsibilities Design multi-turn conversational prompts (1–5 turns) using personal context Evaluate personalized AI responses for grounding, integration, and helpfulness Assess correct usage of personal data and identify flawed inferences or hallucinations Review integration quality to ensure personalization feels natural and not over-narrated Conduct side-by-side (SxS) evaluation and ranking of model responses Identify personalization errors, reasoning gaps, and grounding issues Write clear, structured rationales referencing specific conversation turns Extract and verify “Debug Info” to confirm correct data source utilization Maintain strict data hygiene by deleting evaluation conversations

Requirements Vietnamese proficiency (reading and writing) Strong experience in data annotation, AI quality evaluation, content moderation, or related roles Strong analytical skills for evaluating nuanced and ambiguous AI outputs Experience with creative prompt engineering and multi-turn conversations Understanding of personalization concepts and AI response evaluation High attention to detail for SxS comparisons Excellent written communication and feedback documentation skills BS/BA degree or equivalent experience in a relevant field Willingness to use a primary personal Google account with enabled personal data sources Self-motivated and able to work independently in a remote setup Desktop/laptop with reliable internet connection

Application Process Fill out the application form Complete the ICF Complete the assessment

Cookies & analytics consent

We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.

Read how we use data in our Privacy Policy and Terms of Service.