Description : Summary :
Responsibilities
We are seeking a highly skilled Data Engineer / Data Scientist with strong expertise in Python, Pandas, PySpark, SQL, and NLP libraries. The ideal candidate will be responsible for developing scalable data solutions, performing advanced analytics, and deploying models on cloud platforms such as AWS, Azure, or Responsibilities :
-
Design, build, and optimize scalable data pipelines using Python, Pandas, and PySpark.
-
Develop and maintain SQL scripts and queries for data extraction, transformation, and
analysis.
-
Apply Natural Language Processing (NLP) techniques for text analytics and insights Collaborate with cross-functional teams to deploy and manage data solutions on cloud
platforms (AWS, Azure, or GCP).
-
Ensure data quality, consistency, and performance across data systems.
- Document technical processes and communicate findings effectively with technical and non-
technical stakeholders.
-
Participate in continuous improvement and innovation of data workflows and ML model Skills & Qualifications :
-
Proficiency in Python, Pandas, PySpark, and SQL.
-
Experience with NLP frameworks (e.g., spaCy, NLTK, Transformers, Hugging Face).
-
Hands-on experience in cloud environments (Azure, AWS, or GCP) and cloud-native
deployment practices (Docker, Kubernetes, CI/CD pipelines).
-
Strong problem-solving and analytical skills.
-
Excellent written and verbal communication Qualifications :
-
Familiarity with big data tools (Databricks, Hadoop, Spark Streaming).
-
Experience with ML model lifecycle management (MLOps).
(ref:
hirist.tech)