Crossing Hurdles logo

Quality Assurance Specialist (Spanish) | $11/hr Remote

Crossing Hurdles Spain

remote
Posted Feb 25, 2026
  • Role & seniority

    • LLM – AI Quality Analyst (Personalization) – Spanish

    • Short-Term Contract (2 months); part-time commitment (30–40 hrs/week)

  • Stack/tools

    • Data annotation / AI quality evaluation

    • Creative prompt engineering and personalization concepts

    • Evaluation workflows, side-by-side comparisons, and rationale writing

    • Use of personal Google account and personal data sources for assessment

  • Top 3 responsibilities

    • Evaluate a Gemini personalization feature and model responses for relevance and usefulness

    • Design and execute multi-turn prompts that leverage personal information and past conversations

    • Provide structured feedback with clear rationales, rank responses by helpfulness/naturalness, and verify data sources and summaries

  • Must-have skills

    • Strong Spanish reading/writing proficiency

    • Experience in data annotation, AI quality evaluation, or content moderation

    • Analytical thinking with high attention to detail

    • Experience with prompt engineering and personalization concepts

    • Ability to work independently in a remote environment

    • Ability to produce clear written explanations and maintain data hygiene

  • Nice-to-haves

    • Experience with grounding/avoiding hallucinations in AI outputs

    • Familiarity with evaluating data integration of personal data into responses

    • Comfort with using personal data sources for assessment

  • Location & work type

    • Remote, global

    • Part-time, 30–40 hrs/week

    • Compensation: $11

Full Description

Position: LLM – AI Quality Analyst (Personalization) – Spanish

Type: Short-Term Contract (2 months)

Compensation: $11 per hour

Location: Remote (Global)

Commitment: Part-time availability required (30–40 hrs/week)

Role Responsibilities Evaluate a personalization feature for Gemini Design and execute multi-turn conversational prompts that require the AI to utilize personal information and experiences Assess how effectively the model uses past conversations and activity to generate relevant and helpful responses Evaluate model responses based on intent and appropriate personalization Analyze responses for grounding issues, including flawed inferences or hallucinations Assess integration quality to ensure personal data is incorporated naturally into responses Perform side-by-side evaluations and stack-rank model responses based on helpfulness and naturalness Write clear rationales referencing specific conversation turns Extract and verify debug information to confirm correct use of summaries and data sources Maintain data hygiene by deleting evaluation conversations after completion

Requirements Experience in data annotation, AI quality evaluation, content moderation, or related roles Strong Spanish proficiency (reading and writing) Willingness to use a primary personal Google account and enable personal data sources for assessment Strong analytical thinking and attention to detail Experience with creative prompt engineering and personalization concepts Ability to provide structured feedback and clear written explanations Ability to work independently in a remote environment

Application Process Fill out the application form Complete the ICF Complete the assessment

Cookies & analytics consent

We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.

Read how we use data in our Privacy Policy and Terms of Service.