AI Data Engineer- HealthTech AI
Up to $180,000 | U.S (Remote) | Full-Time
Deeprec.ai is partnering up with an AI focused HealthTech company centred around early-stage cancer detection.
This is a remote Data Engineering role focused on building and maintaining scalable pipelines that ingest, clean, and structure large, complex healthcare datasets.
What You’ll Do
-
Work with Data Scientists and ML Engineers to define data needs for LLM and ML models.
-
Build and maintain scalable data pipelines for large healthcare datasets.
-
Ensure data quality through cleaning, validation, and monitoring.
-
Design efficient data structures and schemas for model training and use.
-
Source new data while ensuring compliance with healthcare regulations (e.g., HIPAA)
Requirements
-
Bachelor’s degree in Computer Science, Engineering, or a related field.
-
Experience as a Data Engineer working with large-scale or big data systems such as Apache Spark
-
Strong programming skills in Python, Scala, or Java.
-
Experience with ETL pipelines, data warehousing, and data modelling.
-
Familiarity with cloud platforms (AWS, GCP, or Azure) and tools like Apache Spark.
-
Strong problem-solving skills
Nice to Have
-
Master’s degree in Computer Science, Engineering, Data Science, or a related field.
-
Experience working with healthcare data and standards such as FHIR or HL7.
-
Familiarity with machine learning concepts and LLM fine-tuning workflows.
-
Experience using data orchestration tools such as Apache Airflow.
Why Join?
-
Help shape the future of healthcare by building AI that improves early cancer detection and saves lives.
-
Work on high-impact, real-world AI used directly in clinical settings at scale.
-
Competitive salary, benefits, and flexible remote/hybrid working options.
-
Join a mission-driven, fast-growing team focused on innovation and health equity.
-
Continuous learning with exposure to cutting-edge AI, ML, and healthcare technologies.