Big Data Developer

India

Big Data DeveLoper

Responsibilities

Design and implement data pipelines for migration from HDFS/Hive to cloud object storage (e.g., S3, Ceph).
Optimize Spark (and optionally Flink) jobs for performance and scalability in a Kubernetes environment.
Ensure data consistency, schema evolution, and governance with Apache Iceberg or equivalent table formats.
Support migration strategy definition by providing technical input and identifying risks.
Mentor junior developers and review their code / design decisions.
Collaborate with platform engineers, cloud architects, and product stakeholders to align technical implementation with project goals.
Troubleshoot complex distributed system issues in data pipelines or storage integration.

Requirements

Experience 7 to 15 Years
Scala and Python
Apache Spark (batch & streaming) – must!
Deep knowledge of HDFS internals and migration strategies.
Experience with Apache Iceberg (or similar table formats like Delta Lake / Apache Hudi) for schema evolution, ACID transactions, and time travel.
Running Spark and/or Flink jobs on Kubernetes (e.g., Spark-on-K8s operator, Flink-on-K8s).
Experience with distributed blob storages like Ceph or AWS S3 and similar
Building ingestion, transformation, and enrichment pipelines for large-scale datasets.
Infrastructure-as-Code (Terraform, Helm) for provisioning data infrastructure.
Ability to work independently while guiding juniors.

Nice to have

Experience with Apache Flink
Prior experience in migration projects or large-scale data platform modernization.
Apple experience preferred (to enable him/her to get up to speed on our tooling set quickly and more independently)

We offer

Opportunity to work on bleeding-edge projects
Work with a highly motivated and dedicated team
Competitive salary
Flexible schedule
Benefits package - medical insurance, sports
Corporate social events
Professional development opportunities
Well-equipped office

About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.

Similar jobs

Sr. Supervisor Data Engineer

Orange

Giza, Egypt

about 3 hours ago

Lead I - Big Data Engineer AWS

UST

Hyderabad, Pakistan

about 5 hours ago

Industry Experts – Data Science (Industry Connect Sessions) - Pune

School of Data Science and Business Intelligence

India

8 days ago

Data Engineer (Python + GCP)

Decillion Digital

India

8 days ago

Big Data Support Expert

TAWANTECH

Riyadh, Saudi Arabia

8 days ago

Big Data R&D Consultant

TAWANTECH

Riyadh, Saudi Arabia

8 days ago

Big Data Consultant & Architect

TAWANTECH

Riyadh, Saudi Arabia

8 days ago

Term of use Privacy policy