Data Scientist / ML Engineer – 3+ Years of Experience
About Us:
At The Nth Bit Lab, we are building a Global Food Health & Recommendation Engine to empower consumers to make healthier choices. Our platform leverages ingredient-level data from products worldwide, cross-referenced with regulatory and health authority standards (FSSAI, FDA, WHO, EFSA, and more), to calculate health scores and suggest healthier alternatives.
We are looking for a talented Data Scientist / ML Engineer with 3+ years of experience to lead data acquisition, cleaning, normalization, and scoring engine development. This is a high-impact role with opportunities to influence product architecture and help launch in multiple regions.
Key Responsibilities:
- Collect, scrape, and curate ingredient-level data for food and grocery products from public databases, manufacturer labels, and other sources.
- Collect, parse, and structure regulatory and health authority datasets (FSSAI, FDA, WHO, EFSA, Codex, etc.) including permissible limits, recommended intake, banned substances, and additives.
- Perform data cleaning, normalization, and mapping of ingredients across different datasets and authorities.
- Develop a rules-based / ML engine to calculate health scores for products based on ingredient composition and authority guidelines.
- Build recommendation logic to identify healthier alternatives across product categories.
- Design data pipelines and ETL processes to keep datasets up-to-date.
- Collaborate with the engineering team to expose data and scoring results via APIs or other interfaces.
- Document methodologies, assumptions, and scoring logic for transparency and compliance purposes.
Required Qualifications:
- Bachelor’s or Master’s degree in Data Science, Computer Science, Food Science, Nutrition, or a related field.
- At least 3 years of relevant experience.
- Strong experience in data collection, cleaning, normalization, and structuring.
- Familiarity with regulatory datasets (FSSAI, FDA, WHO, EFSA, Codex) is a plus.
- Proficiency in Python (Pandas, NumPy, scikit-learn), SQL, and data pipelines / ETL processes.
- Experience with NLP or text parsing to extract data from ingredient lists or regulatory PDFs.
- Experience in building scoring, ranking, or recommendation engines.
- Strong analytical and problem-solving skills, with attention to detail.
- Ability to document assumptions and methodologies clearly.
Location: New Delhi
Job Types: Full-time, Permanent
Pay: ₹600,000.00 - ₹840,000.00 per year
Ability to commute/relocate:
- New Delhi, Delhi: Must reliably commute or plan to relocate before starting work (Required)
Application Question(s):
- Are you familiar with nutrition science concepts like recommended daily intake, additive limits, or allergen labeling?
Experience:
- Data scientist: 3 years (Required)
Work Location: In person