Cookies & analytics consent
We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.
Read how we use data in our Privacy Policy and Terms of Service.
🤖 15+ AI Agents working for you. Find jobs, score and update resumes, cover letter, interview questions, missing keywords, and lots more.

YO HR Consultancy • Poland
Role & seniority: Experienced Software Engineer; freelance/independent contractor (part-time)
Stack/tools: Python, JavaScript, Java, or C++; debugging, testing, code validation; technical writing; content creation from real PRs; model/explanation quality for coding prompts
Craft realistic developer prompts across categories (code review, debugging, error diagnosis, configuration, etc.)
Source and adapt content from real PRs to create authentic scenarios
Write clear, technically accurate model responses with strong reasoning and explanation quality
2+ years in software engineering, technical research, or educational content development
Degree in Software Engineering, Computer Science, or related field (Bachelor’s minimum)
Proficiency in Python, JavaScript, Java, or C++
Experience with debugging, testing, and validating code
Comfort with technical writing and attention to detail
Advanced degree (preferred)
Demonstrated experience in educational or technical content development
Fully remote
Start: immediate
Duration: 1–2 months
Hours: part-time (15–25 hrs/week, flexible up to 40 hrs/week)
Independent contractor with potential project extensions or adjustments based on need/performance
hiring experienced Software Engineers to support high-impact research collaborations with leading AI labs. Freelancers will contribute to building evaluation datasets that assess AI reasoning, explanation quality, and technical judgment in coding-related interactions.
This is a unique opportunity to apply your engineering expertise toward shaping the next generation of intelligent systems.
About The Project
This evaluation dataset is code question and answer data. This data is designed to assess natural-language reasoning, explanation quality, and technical judgment in coding-related interactions, rather than executable correctness. Tasks are structured as chat-pasteable prompts that reflect realistic developer questions and include all necessary context inline (e.g., code snippets, error messages, logs, or requirements)
Key Responsibilities
Craft realistic developer prompts across multiple categories (code review, debugging, error diagnosis, configuration, and more) Source and adapt content from real PRs to create authentic scenarios Write clear, technically accurate model responses that demonstrate strong reasoning and explanation quality
Ideal Qualifications
2+ years of experience in software engineering, technical research, or educational content development Degree in Software Engineering, Computer Science, or a related field (Bachelor’s minimum; advanced degree preferred) Strong proficiency in languages like Python, JavaScript, Java, or C++ Experience with debugging, testing, and validating code Comfortable with technical writing and attention to detail
Project Timeline
Start Date: Immediate
Duration: 1-2 months
Commitment: Part-time (15–25 hours/week, with flexibility up to 40 hours/week)
Application & Onboarding Process
Upload your resume
AI interview: A short, 15-minute conversational session to understand your background, experience, and interest in the role Follow-up communication within a few days with next steps and onboarding details
Apply today and leverage your software engineering expertise to help build the future of AI-driven systems!
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Contract and Payment Terms
You will be engaged as an independent contractor. This is a fully remote role that can be completed on your own schedule. Projects can be extended, shortened, or concluded early depending on needs and performance.
Skills: debugging,software,qa,application,code