
Quality Assurance Specialist (Korean) | $11/hr Remote
Crossing Hurdles • Canada
Role & seniority: LLM – AI Quality Analyst (Personalization) – Korean; short-term contract (2 months); part-time availability
Stack/tools: Remote/global; data annotation/AI quality evaluation; creative prompt engineering; personalization concepts; primary Google account enabled for data sources; evaluation rubric, side-by-side comparisons, rationale writing
Top 3 responsibilities
-
Evaluate a Gemini personalization feature and assess use of past conversations and activity
-
Design/execute multi-turn prompts; judge intent, personalization quality, grounding, and hallucination risk
-
Perform side-by-side evaluations, stack-rank responses, write rationales with references to specific turns; ensure data hygiene by deleting evaluation conversations
Must-have skills
-
Strong Korean reading/writing proficiency
-
Experience in data annotation, AI quality evaluation, content moderation, or related roles
-
Strong analytical thinking, attention to detail
-
Experience with creative prompt engineering and personalization concepts
-
Ability to provide structured feedback and clear written explanations
-
Ability to work independently in a remote environment
Nice-to-haves
-
Familiarity with AI model evaluation frameworks
-
Experience extracting/verifying debug information about data sources and summaries
-
Location & work type: Remote (global); contract-based, part-time commitment required; compensation $11/hour
Full Description
Position: LLM – AI Quality Analyst (Personalization) – Korean
Type: Short-Term Contract (2 months)
Compensation: $11 per hour
Location: Remote (Global)
Commitment: Part-time availability required
Role Responsibilities Evaluate a personalization feature for Gemini Design and execute multi-turn conversational prompts that require the AI to utilize personal information and experiences Assess how effectively the model uses past conversations and activity to generate relevant and helpful responses Evaluate model responses based on intent and appropriate personalization Analyze responses for grounding issues, including flawed inferences or hallucinations Assess integration quality to ensure personal data is incorporated naturally into responses Perform side-by-side evaluations and stack-rank model responses based on helpfulness and naturalness Write clear rationales referencing specific conversation turns Extract and verify debug information to confirm correct use of summaries and data sources Maintain data hygiene by deleting evaluation conversations after completion
Requirements Experience in data annotation, AI quality evaluation, content moderation, or related roles Strong Korean proficiency (reading and writing) Willingness to use a primary personal Google account and enable personal data sources for assessment Strong analytical thinking and attention to detail Experience with creative prompt engineering and personalization concepts Ability to provide structured feedback and clear written explanations Ability to work independently in a remote environment
Application Process Fill out the application form Complete the ICF Complete the assessment