Crossing Hurdles • China
Role & seniority: LLM – AI Quality Analyst (Personalization) – Chinese; short-term contract (2 months); part-time (30–40 hrs/week)
Stack/tools: remote evaluation framework; data annotation/QA for LLMs; creative prompt engineering; use of a primary Google account and personal data sources for assessment; ability to extract/verify debug information and maintain data hygiene
Evaluate a personalization feature for Gemini and assess how effectively it uses past conversations and activity
Design and execute multi-turn prompts; perform side-by-side evaluations and rank model responses by helpfulness and naturalness
Write clear rationales referencing specific turns; verify data sources/summaries; delete evaluation conversations to maintain data hygiene
Experience in data annotation, AI quality evaluation, content moderation, or related roles
Strong Chinese reading/writing proficiency
Willingness to use a personal Google account and personal data sources
Strong analytical thinking, attention to detail
Experience with creative prompt engineering and personalization concepts
Ability to provide structured feedback and work independently remotely
Prior AI/LLM QA experience; familiarity with data privacy and handling personal data
Additional multilingual capabilities; experience in evaluating grounding and hallucinations
Location & work type: Remote, global; short-term contract (2 months), part-time (30–40 hrs/week)
Position: LLM – AI Quality Analyst (Personalization) – Chinese
Type: Short-Term Contract (2 months)
Compensation: $11 per hour
Location: Remote (Global)
Commitment: Part-time availability required (30–40 hrs/week)
Role Responsibilities:
Evaluate a personalization feature for Gemini
Design and execute multi-turn conversational prompts that require the AI to utilize personal information and experiences
Assess how effectively the model uses past conversations and activity to generate relevant and helpful responses
Evaluate model responses based on intent and appropriate personalization
Analyze responses for grounding issues, including flawed inferences or hallucinations
Assess integration quality to ensure personal data is incorporated naturally into responses
Perform side-by-side evaluations and stack-rank model responses based on helpfulness and naturalness
Write clear rationales referencing specific conversation turns
Extract and verify debug information to confirm correct use of summaries and data sources
Maintain data hygiene by deleting evaluation conversations after completion
Requirements:
Experience in data annotation, AI quality evaluation, content moderation, or related roles
Strong Chinese proficiency (reading and writing)
Willingness to use a primary personal Google account and enable personal data sources for assessment
Strong analytical thinking and attention to detail
Experience with creative prompt engineering and personalization concepts
Ability to provide structured feedback and clear written explanations
Ability to work independently in a remote environment
Application Process:
Fill out the application form
Complete the ICF
Complete the assessment