**About The Job**
**Mercor** connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, Mercor is backed by investors including **Benchmark**, **General Catalyst**, **Peter Thiel**, **Adam D'Angelo**, **Larry Summers**, and **Jack Dorsey**.
**Position:** AI Model Evaluation Specialist
**Type:** Contract
**Compensation:** $50–$75/hour
**Commitment:** ~20 hours/week
**Role Responsibilities**
• Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance.
• Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
• Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
• Score and rank multiple model responses using structured rubrics across dimensions.
• Provide written justifications with specific evidence for each evaluation.
**Qualifications**
**Must-Have**
• Master’s degree or higher in Law or a relevant professional field.
• Professional experience applying domain expertise in a practitioner or advisory capacity.
• Familiarity with industry-specific standards, regulations, or clinical guidelines.
• Strong written communication and critical reasoning skills.
**Application Process** (takes 20–30 minutes to complete)
• Submit your resume to begin.
• Complete the Model Response Evaluation assessment.
**Resources & Support**
• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
• For any help or support, reach out to: [email protected]
*PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*