Job Description:
TMC Technologies is in search of a Senior Data Engineer to support ongoing development within the FBI's Enterprise Cyber Security Section. The candidate must be a US citizen and possess a minimum of an active Top Secret clearance to start due to federal contract requirements.
The Senior Data Engineer will have the following responsibilities:
- Deploy and develop pipelines and platforms that organize and make disparate data meaningful
- Guide a multi-disciplinary team of analysts, data engineers, developers, and data consumers in an agile environment
- Apply experience in analytical exploration and data examination
- Manage the assessment, design, building, and maintenance of scalable platforms
- Carry out engineering tasks to deliver an advanced visual analytic application, including: developing data analytics for desktop and web-developed visual analytic software; facilitating the bulk analysis of relational information using advanced graphics, visualization, and high-performance computing, as well as the specified open source, COTS, and GOTS technologies; and establishing an application that produces manageable, actionable intelligence from streams of structured and semi-structured data
- Design strategies for enterprise database systems and set standards for operations, programming, and security
- Design and construct large relational databases
- Integrate new systems with existing warehouse structure and refine system performance and functionality
Required Qualifications:
- US citizenship and active Top Secret clearance required
- Bachelor’s degree and 8+ years of experience designing, developing, operationalizing, and maintaining data systems, or 12+ years of such experience in lieu of a degree
- Experience performing tuning on data lake architecture to optimize cloud storage and pipeline performance
- Experience with data management solutions that address aspects of data catalog, data usage, governance policies, and lifecycle management
- Experience designing data warehouses or data lakes, RESTful API models, event processing, or business intelligence for software systems
- Experience building scalable ETL or ELT data pipelines, automating large-scale batch and streaming data workflows, and parsing structured, unstructured, or event-streaming data
- Experience coding in programming languages and frameworks, including SQL, Python, Scala, Spark, or Java
Preferred Qualifications:
- SCI eligibility
- 3+ years of experience using and developing data systems with AWS and Microsoft Azure cloud data products such as AWS EMR, AWS Glue, Azure Dataverse, Azure Data Factory, and Power Apps
- Experience with distributed data or computing tools, including Apache Spark, Apache NiFi, Airflow, Databricks, Snowflake, Redshift, or BigQuery
- Experience creating and improving continuous automation across multiple technical stacks, and automating CI/CD pipelines with tools such as Jenkins or GitHub