About The Role
AI is only as good as the experts who train it. We're looking for data scientists to help evaluate, refine, and improve next-generation AI systems — bringing your quantitative expertise directly to bear on how the world's most advanced models reason, analyze, and communicate.
This is a fully remote, flexible contract role. You set your hours and work at your own pace, contributing to projects that sit at the frontier of applied AI research.
-
Organization: Alignerr
-
Type: Hourly Contract
-
Location: Remote
-
Commitment: 10–40 hours/week
What You'll Do
-
Evaluate AI model outputs for statistical soundness, reasoning quality, and analytical accuracy
-
Design and apply data-driven evaluation criteria and scoring rubrics
-
Analyze patterns in AI-generated responses to surface systematic errors or biases
-
Create high-quality training data — including prompts, worked solutions, and expert annotations — across data science and ML domains
-
Review AI-generated code, visualizations, and statistical analyses for correctness and best practices
-
Provide structured, detailed feedback that directly improves model performance
-
Work independently and asynchronously on your own schedule
Who You Are
-
Degree in Data Science, Statistics, Computer Science, Mathematics, or a related quantitative field (MS or PhD preferred)
-
Strong foundation in statistics, probability, and machine learning concepts
-
Proficient in Python, R, SQL, or similar data analysis tools
-
Experienced with data wrangling, exploratory data analysis, and model evaluation
-
Sharp analytical thinker with excellent attention to detail
-
Clear written communicator — able to explain complex technical concepts concisely
-
Self-motivated and comfortable working independently in an async environment
Nice to Have
-
Experience with deep learning frameworks such as PyTorch or TensorFlow
-
Familiarity with NLP, large language models, or AI evaluation workflows
-
Published research or hands-on industry experience in applied machine learning
-
Background in A/B testing, causal inference, or experimental design
Why Join Us
-
Work on cutting-edge AI projects alongside top research labs and AI teams globally
-
Get rare, inside exposure to how state-of-the-art LLMs are trained and evaluated
-
Fully remote and async — work when and where it suits you
-
Complete autonomy over your schedule and workload (10–40 hrs/week)
-
Join a growing community of expert contributors who are actively shaping the future of AI
-
Potential for ongoing work and long-term contract extension