Job Title:
Principal Data Architect
Location:
India (Remote/Hybrid; Kochi, Bengaluru, Chennai, or Dehradun preferred)
Job Description:
We are seeking a highly experienced and hands-on Principal Data Architect with deep expertise in
cloud-native big data architectures. The ideal candidate will have a strong background in designing
and implementing enterprise-grade data solutions across Azure, AWS, and GCP environments. You
will be responsible for leading the design, architecture, and development of large-scale data
platforms, with a focus on data lakes, pipelines, data virtualization, analytics, and cloud migrations.
Key Responsibilities:
Lead architecture, design, and implementation of cloud-based data platforms using Azure,
AWS, and GCP.
Architect data lakes, cloud data warehouses, and ETL/ELT pipelines for batch and real-time
processing.
Guide teams in adopting big data and cloud best practices, ensuring data quality,
governance, and security.
Work closely with cross-functional teams and business stakeholders to translate
requirements into scalable data solutions.
Provide thought leadership on modern data architecture patterns such as Data Mesh, Data
Lakehouse, and Data Virtualization.
Act as a technical subject-matter expert (SME) for platforms such as Databricks, Snowflake,
Synapse, Airflow, and Dremio.
Define and enforce architecture standards and best practices across the data engineering
lifecycle.
Mentor junior data engineers and participate in code reviews, architecture reviews, and
design sessions.
Must-Have Skills:
Cloud Platforms:
o Azure (7+ yrs): ADF, Synapse, Databricks, ADLS, Logic Apps
o AWS (7+ yrs): S3, Glue, Redshift, EMR, Lambda
o GCP (1+ yr): Dataproc, GCS, BigQuery
Big Data Ecosystem:
o Spark (7+ yrs), Hadoop (HDFS, Hive), Kafka (3+ yrs), Airflow (3+ yrs), Dremio
Programming & Scripting:
o Python (10 yrs), SQL (10 yrs), Scala/Java (5 yrs), Bash, Unix Scripting
Data Engineering Tools & Frameworks:
o Databricks (5+ yrs), Snowflake (3+ yrs), Docker, Git, Jenkins, CI/CD
Data Modeling & Architecture:
o OLAP/OLTP modeling, Star/Snowflake schema, Data Vault, Virtualization strategies
Certifications (Preferred):
o AWS Certified Solutions Architect - Associate
o AWS Certified Data Analytics - Specialty
o Databricks Certified Data Engineer Associate
o SnowPro Core Certification
Nice to Have:
Experience with SAP HANA, PostgreSQL, Oracle SQL, dbt, and DevOps workflows
Exposure to data governance tools and metadata management
Strong presentation and stakeholder communication skills
Experience with presales, client engagement, and RFP responses
Education:
Bachelor of Technology in Electronics & Telecommunication, Mahatma Gandhi University
Experience Level:
10+ years
Employment Type:
Full-time