Key Responsibilities
- Design, develop, and maintain robust ETL/ELT data pipelines using Python and AWS services
- Build and optimize data lakes, data warehouses, and data marts on AWS
- Ingest, transform, and process structured and unstructured data from multiple sources
- Ensure data quality, integrity, security, and compliance with federal standards
- Collaborate with data scientists, analysts, and application teams to support analytics and reporting
- Optimize performance, cost, and scalability of data workflows in AWS
- Implement data governance, metadata management, and logging/monitoring practices
- Support documentation, data models, and technical design artifacts
- Troubleshoot production issues and support ongoing enhancements
Required Qualifications
- 5+ years of hands-on experience as a Data Engineer or similar role
- Strong proficiency in Python for data processing and automation
- Extensive experience with AWS, including services such as:
- S3, EC2, Lambda
- Glue, Athena, Redshift
- EMR, CloudWatch (preferred)
- Solid understanding of SQL and relational/non-relational databases
- Experience building batch and/or streaming data pipelines
- Familiarity with CI/CD pipelines, version control (Git), and DevOps practices
- Experience working in federal or regulated environments
Preferred Qualifications
- US Census Clearance (active or previously held)
- Experience with big data technologies (Spark, PySpark, Hadoop)
- Knowledge of data security, encryption, and access controls in AWS
- Exposure to data orchestration tools (Airflow, Step Functions, etc.)
- Experience supporting analytics, BI, or AI/ML workloads
Nice to Have
- AWS Certification (Data Analytics, Solutions Architect, or Developer)
- Experience with Terraform or CloudFormation
- Familiarity with federal data standards and compliance frameworks
Job Types: Full-time, Contract
Pay: $95,000.00 - $110,000.00 per year
Benefits:
- 401(k)
- Dental insurance
- Health insurance
- Paid time off
- Relocation assistance
Education:
Experience:
- Python: 3 years (Required)
- R: 3 years (Required)
- SQL: 3 years (Required)
- ETL: 3 years (Required)
- AWS: 3 years (Required)
- Redshift: 3 years (Required)
Work Location: Hybrid remote in Vienna, VA 22182