Role & seniority: LLM – AI Quality Analyst (Personalization), Japanese; short-term contract (2 months), part-time availability 30–40 hrs/week

Stack/tools: data annotation / AI quality evaluation / content moderation; creative prompt engineering and personalization concepts; strong Japanese reading/writing; use of a primary personal Google account and personal data sources; remote tools for evaluation, debugging data, and documentation

Top 3 responsibilities

Evaluate a personalization feature for Gemini and assess how past conversations and activity are used
Design and execute multi-turn prompts requiring personal data; perform side-by-side evaluations and rank model responses
Write clear rationales referencing turns, extract/verify debug information, and ensure data hygiene by deleting evaluation conversations

Must-have skills

Strong Japanese proficiency (reading/writing)
Experience in data annotation, AI quality evaluation, content moderation, or related roles
Analytical thinking, attention to detail, structured feedback, and independent remote work capability
Experience with creative prompt engineering and personalization concepts

Nice-to-haves

Familiarity with model grounding issues and debugging data sources
Prior contract/remote freelancing experience
Comfort using a personal Google account for data sources and assessment

Location & work type: Remote (Global); part-time, short-term contract (2 months)

Full Description

Position: LLM – AI Quality Analyst (Personalization) – Japanese

Type: Short-Term Contract (2 months)

Compensation: $11 per hour

Location: Remote (Global)

Commitment: Part-time availability required (30–40 hrs/week)

Role Responsibilities

Evaluate a personalization feature for Gemini
Design and execute multi-turn conversational prompts that require the AI to utilize personal information and experiences
Assess how effectively the model uses past conversations and activity to generate relevant and helpful responses
Evaluate model responses based on intent and appropriate personalization
Analyze responses for grounding issues, including flawed inferences or hallucinations
Assess integration quality to ensure personal data is incorporated naturally into responses
Perform side-by-side evaluations and stack-rank model responses based on helpfulness and naturalness
Write clear rationales referencing specific conversation turns
Extract and verify debug information to confirm correct use of summaries and data sources
Maintain data hygiene by deleting evaluation conversations after completion

Requirements

Experience in data annotation, AI quality evaluation, content moderation, or related roles
Strong Japanese proficiency (reading and writing)
Willingness to use a primary personal Google account and enable personal data sources for assessment
Strong analytical thinking and attention to detail
Experience with creative prompt engineering and personalization concepts
Ability to provide structured feedback and clear written explanations
Ability to work independently in a remote environment
Desktop or laptop with a stable internet connection

Application Process

Upload resume
Interview
Submit form
