About The Job
Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: AI Safety Experts - English & Urdu
Type: Contract
Compensation: $20–$22/hour
Location: Remote
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document findings reproducibly, producing reports, datasets, and attack cases that customers can act on.
- Review AI outputs on sensitive topics such as bias and misinformation, with optional participation in higher-sensitivity projects.
Qualifications
Must-Have
- Native fluency in English and Urdu.
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to both technical and non-technical stakeholders.
Preferred
- Experience in adversarial ML, cybersecurity, or socio-technical risk analysis.
- A background in psychology, acting, or writing that supports creative probing and unconventional adversarial thinking.
Application Process (Takes 20–30 mins to complete)
- Upload resume
- AI interview based on your resume
- Submit form
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.