About The Job
Mercor
connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
Benchmark
,
General Catalyst
,
Peter Thiel
,
Adam D'Angelo
,
Larry Summers
, and
Jack Dorsey
.
Position:
AI Model Evaluation Contractor
Type:
Contract
Compensation:
$23–$30/hour
Commitment:
20 hours/week
Role Responsibilities
-
Write realistic prompts reflecting professional and consumer domain-specific guidance.
-
Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness.
-
Identify fabricated claims, incorrect references, or misleading reasoning in model outputs.
-
Score and rank multiple model responses using structured rubrics across dimensions.
-
Provide written justifications with specific evidence for each evaluation.
Qualifications
Must-Have
-
Master’s degree in a relevant professional field.
-
Professional experience applying domain expertise in a practitioner or advisory capacity.
-
Familiarity with industry-specific standards, regulations, or clinical guidelines.
-
Strong written communication and critical reasoning skills.
Application Process (Takes 20–30 mins to complete)
-
Submit your resume to begin.
-
Complete the Model Response Evaluation assessment.
Resources & Support
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.