Jobs
My ads
My job alerts
Sign in
Find a job Career Tips Companies
Find

Senior agent evaluation engineer (freelance)

Newcastle
Independant
Mindrift
Posted: 10 March
Offer description

Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What This Opportunity Involves
* Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
* Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks
* Craft \"fair but hard\" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
* Analyze AI failures to understand what the model struggles with vs. what it masters
* Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria
What We Look For
* Degree in Computer Science, Software Engineering or related fields
* 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
* Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
* Experience writing tests (functional, integration - not just running them)
* Docker containers (running evaluations locally in containers)
* CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
* English proficiency - B2
How It Works

Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Effort estimate

Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment
* Paid contributions, with rates up to $45/hour*
* Fixed project rate or individual rates, depending on the project
* Some projects include incentive payments
* Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project
#J-18808-Ljbffr

Send an application
Create a job alert
Alert activated
Saved
Save
Similar jobs
jobs Newcastle
jobs New South Wales
Home > Jobs > Senior Agent Evaluation Engineer (Freelance)

About Jobstralia

  • Career Advice
  • Company Reviews

Search for jobs

  • Jobs by job title
  • Jobs by sector
  • Jobs by company
  • Jobs by location

Contact / Partnership

  • Contact
  • Publish your job offers on Jobijoba

Legal notice - Terms of Service - Privacy Policy - Manage my cookies - Accessibility: Not compliant

© 2026 Jobstralia - All Rights Reserved

Send an application
Create a job alert
Alert activated
Saved
Save