Cookies & analytics consent
We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.
Read how we use data in our Privacy Policy and Terms of Service.
🤖 15+ AI Agents working for you. Find jobs, score and update resumes, cover letter, interview questions, missing keywords, and lots more.

Crossing Hurdles • Australia
Role & seniority: LLM – AI Quality Analyst (Personalization) – Thai; Short-Term Contract; remote; immediate start.
Stack/tools: Thai reading/writing; data annotation and AI quality evaluation; content moderation; multi-turn prompt design; side-by-side (SxS) evaluation; debugging/“Debug Info” extraction; strict data hygiene; independent remote work; compatible with personal Google account for data sources.
Design multi-turn prompts (1–5 turns) using personal context.
Evaluate personalized AI responses for grounding, integration, and helpfulness; identify errors and hallucinations.
Conduct SxS evaluations, ranking, and document clear rationales referencing specific turns; verify data source usage.
Fluent Thai (reading/writing) and strong communication/documentation skills.
Experience in data annotation, AI quality evaluation, content moderation, or related roles.
Strong analytical ability for nuanced outputs; creative prompt engineering; understanding of personalization concepts.
High attention to detail; ability to work independently in a remote setting; BS/BA or equivalent.
Reliable desktop/laptop and internet; willingness to use a primary personal Google account with enabled data sources.
Experience with multi-turn conversations and grounding/verification tasks; data hygiene practices.
Location & work type: Remote; Short-Term Contract (2 months); 30–40 hours/week; 4-hour
Position: LLM – AI Quality Analyst (Personalization) – Thai
Type: Short-Term Contract
Location: Remote
Commitment: 30–40 hours/week, 4-hour overlap with PST
Engagement Length: 2 months
Start Date: Immediate
Role Responsibilities
Design multi-turn conversational prompts (1–5 turns) using personal context Evaluate personalized AI responses for grounding, integration, and helpfulness Assess correct usage of personal data and identify flawed inferences or hallucinations Review integration quality to ensure personalization feels natural and not over-narrated Conduct side-by-side (SxS) evaluation and ranking of model responses Identify personalization errors, reasoning gaps, and grounding issues Write clear, structured rationales referencing specific conversation turns Extract and verify “Debug Info” to confirm correct data source utilization Maintain strict data hygiene by deleting evaluation conversations
Requirements
Thai proficiency (reading and writing) Strong experience in data annotation, AI quality evaluation, content moderation, or related roles Strong analytical skills for evaluating nuanced and ambiguous AI outputs Experience with creative prompt engineering and multi-turn conversations Understanding of personalization concepts and AI response evaluation High attention to detail for SxS comparisons Excellent written communication and feedback documentation skills BS/BA degree or equivalent experience in a relevant field Willingness to use a primary personal Google account with enabled personal data sources Self-motivated and able to work independently in a remote setup Desktop/laptop with reliable internet connection
Application Process
Fill out the application form Complete the ICF Complete the assessment