
Quality Assurance Specialist (Vietnamese) | $11/hr Remote
Crossing Hurdles • Australia
Role & seniority: LLM – AI Quality Analyst (Personalization); short-term contract, start immediate; remote, full-time (30–40 hrs/week) for 2 months
Stack/tools: data annotation; AI quality evaluation; content moderation; multi-turn prompt design; side-by-side evaluation; documentation of rationales; use of a primary Google account with personal data sources
Top 3 responsibilities
-
Design multi-turn prompts (1–5 turns) using personal context; assess grounding and usefulness
-
Evaluate personalized AI responses for accuracy, integration, and helpfulness; identify errors and hallucinations
-
Conduct SxS comparisons, write structured rationales with references to conversation turns; ensure data hygiene and proper data source usage
Must-have skills
-
Vietnamese reading/writing proficiency
-
Experience in data annotation, AI quality evaluation, or content moderation
-
Strong analytical skills for nuanced outputs; prompt engineering and multi-turn conversation experience
-
Understanding of personalization concepts; high attention to detail; excellent written communication
-
BS/BA or equivalent; self-motivated, able to work independently in a remote setup; reliable internet
-
Comfortable using a primary Google account with enabled personal data sources
Nice-to-haves
-
Experience specifically with personalization evaluation
-
Familiarity with debugging data provenance and grounding checks
-
Prior remote-contract work or agile evaluation environ
Full Description
Position: LLM – AI Quality Analyst (Personalization) – Vietnamese
Type: Short-Term Contract
Location: Remote
Commitment: Full-time (30–40 hours/week, 4-hour overlap with PST)
Engagement Length: 2 months
Start Date: Immediate
Role Responsibilities
Design multi-turn conversational prompts (1–5 turns) using personal context Evaluate personalized AI responses for grounding, integration, and helpfulness Assess correct usage of personal data and identify flawed inferences or hallucinations Review integration quality to ensure personalization feels natural and not over-narrated Conduct side-by-side (SxS) evaluation and ranking of model responses Identify personalization errors, reasoning gaps, and grounding issues Write clear, structured rationales referencing specific conversation turns Extract and verify “Debug Info” to confirm correct data source utilization Maintain strict data hygiene by deleting evaluation conversations
Requirements
Vietnamese proficiency (reading and writing) Strong experience in data annotation, AI quality evaluation, content moderation, or related roles Strong analytical skills for evaluating nuanced and ambiguous AI outputs Experience with creative prompt engineering and multi-turn conversations Understanding of personalization concepts and AI response evaluation High attention to detail for SxS comparisons Excellent written communication and feedback documentation skills BS/BA degree or equivalent experience in a relevant field Willingness to use a primary personal Google account with enabled personal data sources Self-motivated and able to work independently in a remote setup Desktop/laptop with reliable internet connection
Application Process
Fill out the application form Complete the ICF Complete the assessment