Role Overview: Data Scientist
Location:
Remote/ Indore/ Mumbai/ Chennai/ Gurugram
Experience:
Min 5 Years
Work Mode:
Remote
Notice Period:
Max. 30 Days (45 for Notice Serving)
Interview Process:
2 Rounds
Interview Mode:
Virtual Face-to-Face
Interview Timeline:
1 Week
Industry:
Must be from a BPO/ KPO/ Shared Services or Healthcare Org.
Key Responsibilities:
-
AI/ML Development & Research
-
Design, develop, and deploy advanced machine learning and deep learning models to solve complex business problems.
-
Implement and optimize Large Language Models (LLMs) and Generative AI solutions for real-world applications.
-
Build agent-based AI systems with autonomous decision-making capabilities.
-
Conduct cutting-edge research on emerging AI technologies and explore their practical applications.
-
Perform model evaluation, validation, and continuous optimization to ensure high performance.
-
Cloud Infrastructure & Full-Stack Development:
-
Architect and implement scalable, cloud-native ML/AI solutions using AWS, Azure, or GCP.
-
Develop full-stack applications that seamlessly integrate AI models with modern web technologies.
-
Build and maintain robust ML pipelines using cloud services (e.g., SageMaker, ML Engine).
-
Implement CI/CD pipelines to streamline ML model deployment and monitoring processes.
-
Design and optimize cloud infrastructure to support high-performance computing workloads.
-
Data Engineering & Database Management
-
Design and implement data pipelines to enable large-scale data processing and real-time analytics.
-
Work with both SQL and NoSQL databases (e.g., PostgreSQL, MongoDB, Cassandra) to manage structured and unstructured data.
-
Optimize database performance to support machine learning workloads and real-time applications.
-
Implement robust data governance frameworks and ensure data quality assurance practices.
-
Manage and process streaming data to enable real-time decision-making.
-
Leadership & Collaboration
-
Mentor junior data scientists and assist in technical decision-making to drive innovation.
-
Collaborate with cross-functional teams, including product, engineering, and business stakeholders, to develop solutions that align with organizational goals.
-
Present findings and insights to both technical and non-technical audiences in a clear and actionable manner.
-
Lead proof-of-concept projects and innovation initiatives to push the boundaries of AI/ML applications.
Required Qualifications:
-
Education & Experience
-
Master’s or PhD in Computer Science, Data Science, Statistics, Mathematics, or a related field.
-
5+ years of hands-on experience in data science and machine learning, with a focus on real-world applications.
-
3+ years of experience working with deep learning frameworks and neural networks.
-
2+ years of experience with cloud platforms and full-stack development.
-
Technical Skills - Core AI/ML
-
Machine Learning: Proficient in Scikit-learn, XGBoost, LightGBM, and advanced ML algorithms.
-
Deep Learning: Expertise in TensorFlow, PyTorch, Keras, CNNs, RNNs, LSTMs, and Transformers.
-
Large Language Models: Experience with GPT, BERT, T5, fine-tuning, and prompt engineering.
-
Generative AI: Hands-on experience with Stable Diffusion, DALL-E, text-to-image, and text generation models.
-
Agentic AI: Knowledge of multi-agent systems, reinforcement learning, and autonomous agents.
-
Technical Skills - Development & Infrastructure
-
Programming: Expertise in Python, with proficiency in R, Java/Scala, JavaScript/TypeScript.
-
Cloud Platforms: Proficient with AWS (SageMaker, EC2, S3, Lambda), Azure ML, or Google Cloud AI.
-
Databases: Proficiency with SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, Cassandra, DynamoDB).
-
Full-Stack Development: Experience with React/Vue.js, Node.js, FastAPI, Flask, Docker, Kubernetes.
-
MLOps: Experience with MLflow, Kubeflow, model versioning, and A/B testing frameworks.
-
Big Data: Expertise in Spark, Hadoop, Kafka, and streaming data processing.
Non Negotiables:
-
Cloud Infrastructure - ML/AI solutions on AWS, Azure, or GCP
-
Build and maintain ML pipelines using cloud services (SageMaker, ML Engine, etc.)
-
Implement CI/CD pipelines for ML model deployment and monitoring
-
Work with both SQL and NoSQL databases (PostgreSQL, MongoDB, Cassandra, etc.)
-
Machine Learning: Scikit-learn
-
Deep Learning: TensorFlow
-
Programming: Python (expert), R, Java/Scala, JavaScript/TypeScript
-
Cloud Platforms: AWS (SageMaker, EC2, S3, Lambda)
-
Vector databases and embeddings (Pinecone, Weaviate, Chroma)
-
Knowledge of LangChain, LlamaIndex, or similar LLM frameworks.
-
Industry:
Must be a BPO or Healthcare Org.