Mercor is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across specialized domains such as finance, law, medicine, and accounting. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk.
Key Responsibilities
* Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
* Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
* Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
* Score and rank multiple model responses using structured rubrics across dimensions
* Provide written justifications with specific evidence for each evaluation
Ideal Qualifications
* Master's degree or higher in a relevant professional field (e.g., Finance, Accounting, Law, Medicine, Healthcare, Engineering)
* Professional experience applying domain expertise in a practitioner or advisory capacity
* Familiarity with industry-specific standards, regulations, or clinical guidelines
* Strong written communication and critical reasoning skills
More About The Opportunity
* Expected commitment: ~20 hours/week
Application Process
* Submit your resume to begin
* Complete a Training Assessment
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Contract and Payment Terms
* You will be engaged as an independent contractor.
* This is a fully remote role that can be completed on your own schedule.
* Projects can be extended, shortened, or concluded early depending on needs and performance.
* Your work at Mercor will not involve access to confidential or proprietary information from any employer, client, or institution.
* Payments are weekly on Stripe or Wise based on services rendered.
* Please note: We are unable to support H1-B or STEM OPT candidates at this time.
* Pay for this job is USD $60 to $80 per hour, which is approx AUD $80 to $110. It will be subject to exchange rates.
About Mercor
Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will work on projects that focus on training and enhancing AI systems. You will be paid competitively, collaborate with leading researchers, and help shape the next generation of AI systems in your area of expertise.
Please consider applying even if you don't meet 100% of what's outlined
Key Strengths
* Familiarity with industry standards
Why Mercor is partnering with Hatch on this role. Hatch exists to level the playing field for people as they discover a career that's right for them. So when you apply you have the chance to show more than just your resume.
A Final Note
This is a role with Mercor not with Hatch.
#J-18808-Ljbffr