Big Data Python Engineer
Job Summary:
We are looking for a skilled Big Data Python Engineer who can develop, optimize, and manage large-scale data processing solutions. The ideal candidate should have strong experience in Python, Big Data technologies, and data pipelines.
Key Responsibilities:
- Design, develop, and maintain scalable data pipelines using Python and Big Data frameworks.
- Work with Hadoop, Hive, Spark, and distributed data systems to process large datasets.
- Optimize ETL workflows for performance, reliability, and scalability.
- Integrate data from multiple sources and ensure high data quality.
- Collaborate with data scientists, analysts, and engineering teams to meet business requirements.
- Troubleshoot data issues and perform data validation.
- Write clean, efficient, and well-documented Python code.
- Implement automation for repetitive data processing tasks.
- Monitor and improve system performance and data pipeline efficiency.
- Ensure data security, compliance, and best practices.
Required Skills:
- Strong proficiency in Python and object-oriented programming (OOPs).
- 1-3 Hands-on experience with Hadoop, Hive, Spark, HDFS.
- Experience with ETL pipelines and data warehousing concepts.
- Good understanding of SQL and NoSQL databases.
- Knowledge of APIs, REST, and data integration workflows.
- Ability to work with large, complex datasets.
Job Types: Full-time, Permanent
Education:
Experience:
- Python: 1 year (Preferred)
Location:
- Mohali, Punjab (Preferred)
Work Location: In person