Role Description
We’re seeking a detail-oriented Data QA Associate to ensure the accuracy, consistency, and reliability of data products built on the Databricks Lakehouse Platform. You’ll work closely with data engineers and analysts to validate data pipelines, enforce quality standards, and support scalable analytics.
Responsibilities
- Validate data transformations across Bronze, Silver, and Gold Delta Lake layers
- Write and execute Spark SQL and Python tests in Databricks notebooks
- Collaborate with engineers to test Databricks Workflows, ETL jobs, and streaming pipelines
- Implement and maintain data quality checks using tools like Great Expectations or dbt
- Monitor data freshness, schema drift, and anomalies using Databricks SQL dashboards
- Document QA processes, test cases, and data lineage using Unity Catalog and Confluence
- Participate in code reviews and contribute to QA automation best practices
Required Skills
- Proficiency in SQL (especially Spark SQL) and Python
- Familiarity with Databricks notebooks, Delta Lake, and Unity Catalog
- Experience with data testing frameworks (e.g., Great Expectations, dbt tests)
- Understanding of ETL/ELT pipelines, data modeling, and schema validation
- Strong attention to detail and ability to troubleshoot data issues
Nice to Have
- Experience with CI/CD pipelines for data (e.g., GitHub Actions, Azure DevOps)
- Exposure to data governance and access control in Databricks
- Familiarity with Apache Spark, MLflow, or Power BI
(For reference only)
Onboarding Checklist: Data QA Associate (Databricks)
Week 1: Orientation & Access
- [ ] Complete onboarding and security training
- [ ] Gain access to Databricks workspace, Unity Catalog, and Git repos
- [ ] Review Delta Lake architecture and QA documentation
- [ ] Set up personal workspace and test notebooks
Week 2: Environment & Tools
- [ ] Walk through existing ETL pipelines and data models
- [ ] Review QA test suites and validation notebooks
- [ ] Shadow a QA engineer on a pipeline release
- [ ] Learn how to use Databricks SQL and dashboards
Week 3–4: Hands-On QA
- [ ] Write and run validation tests for a Bronze-to-Silver pipeline
- [ ] Implement a Great Expectations suite for a Gold table
- [ ] Contribute to QA documentation and test case library
- [ ] Present findings in a QA sync or team stand-up
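As a rough illustration of the kind of validation test meant in the Week 3–4 items above, here is a minimal sketch in plain Python. It is not taken from any existing test suite: the function name `validate_bronze_to_silver`, the `order_id` and `amount` columns, and the sample rows are all hypothetical, and lists of dictionaries stand in for Delta tables so the example runs anywhere. In a Databricks notebook the same checks would more likely be written as Spark SQL or DataFrame queries.

```python
from collections import Counter
from typing import Any

Row = dict[str, Any]


def validate_bronze_to_silver(
    bronze: list[Row], silver: list[Row], key: str, required: list[str]
) -> list[str]:
    """Return a list of failure messages; an empty list means all checks passed."""
    failures: list[str] = []

    bronze_keys = {row[key] for row in bronze if row.get(key) is not None}
    silver_keys = [row[key] for row in silver if row.get(key) is not None]

    # Check 1: deduplication should leave each key at most once in Silver.
    duplicates = sorted(k for k, n in Counter(silver_keys).items() if n > 1)
    if duplicates:
        failures.append(f"duplicate keys in silver: {duplicates}")

    # Check 2: Silver must not contain keys that never appeared in Bronze.
    orphans = sorted(set(silver_keys) - bronze_keys)
    if orphans:
        failures.append(f"keys in silver but not in bronze: {orphans}")

    # Check 3: required columns must be populated in every Silver row.
    for column in required:
        null_count = sum(1 for row in silver if row.get(column) is None)
        if null_count:
            failures.append(f"{null_count} null value(s) in silver.{column}")

    return failures


# Hypothetical sample data: raw Bronze rows and two candidate Silver outputs.
bronze_rows = [
    {"order_id": 1, "amount": "10.50"},
    {"order_id": 1, "amount": "10.50"},  # duplicate ingest
    {"order_id": 2, "amount": None},     # incomplete record
]
good_silver = [{"order_id": 1, "amount": 10.5}]
bad_silver = [
    {"order_id": 1, "amount": 10.5},
    {"order_id": 1, "amount": 10.5},   # duplicate survived
    {"order_id": 3, "amount": None},   # unknown key, null amount
]

columns = ["order_id", "amount"]
print(validate_bronze_to_silver(bronze_rows, good_silver, "order_id", columns))
for message in validate_bronze_to_silver(bronze_rows, bad_silver, "order_id", columns):
    print(message)
```

Returning a list of messages instead of raising on the first failure lets one test run report every problem found in a table, which tends to be more useful when presenting findings to the team.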
Month 2+: Deep Integration
- [ ] Own QA for a specific data domain or pipeline
- [ ] Propose improvements to test coverage or automation
- [ ] Collaborate with data engineers on pipeline refactoring
- [ ] Participate in quarterly data quality reviews
About RCG Global Services
At Myridius, we transform the way businesses operate. Formerly known as RCG Global Services, our more than 50 years of expertise now drive a new vision—propelling organizations through the rapidly evolving landscapes of technology and business. We offer tailored solutions in AI, data analytics, digital engineering, and cloud innovation, addressing the unique challenges each industry faces. Our integration of cutting-edge technology with deep domain knowledge enables businesses to seize new opportunities, drive significant growth, and maintain a competitive edge in the global market. Our commitment is not just to meet expectations but to exceed them, ensuring measurable impact and fostering sustainable innovation. The success of Myridius is directly tied to the breakthroughs achieved by our clients. Together, we co-create solutions that not only solve today’s challenges but also anticipate future trends. At Myridius, we go beyond typical service delivery. We craft transformative outcomes that help businesses not just adapt, but thrive in a world of continuous change. Discover how Myridius can elevate your business to new heights of innovation. Visit us at
www.myridius.com and start leading the change.