Job Requirements
Washington, DC
Top Secret Polygraph Unspecified
Career Level not specified
$140,000 - $155,000
Job Description
Zachary Piper Solutions is seeking a Data Engineer to join a federal-focused technology firm specializing in data-driven solutions, cloud engineering, and digital transformation for government agencies. This is a Hybrid role, 2 days on-site in the Washington D.C. area, and requires either an active DHS clearance or active DoD Top Secret Clearance. The Data Engineer will work closely with the Department of Homeland Security (DHS) client, to build and manage data infrastructure that supports advanced data analytics, AI, machine learning, and automation initiatives aimed at enhancing operations, reducing costs, and improving efficiency.
Responsibilities of the Data Engineer:
Qualifications for the Data Engineer:
Compensation for the Data Engineer:
This job opens for applications on 04/02. Applications for this job will be accepted for at least 30 days from the posting date.
#LI-MS1
#LI-HYRBID
Keywords: Data Engineer, data engineering, data infrastructure, data architecture, data pipelines, ETL, ELT, data integration, data ingestion, data transformation, data processing, big data, distributed systems, Hadoop, Spark, Kafka, data modeling, database design, relational databases, Oracle, PostgreSQL, MySQL, Redshift, data quality, data integrity, data validation, data governance, data security, compliance, federal regulations, classified data, sensitive data handling, DHS, Department of Homeland Security, cloud computing, AWS, Amazon Web Services, S3, EC2, RDS, Lambda, cloud architecture, cloud data platforms, cloud modernization, scalable systems, system reliability, performance optimization, Python, R, SQL, Pandas, R Shiny, scripting, automation, workflow automation, Airflow, ETL tools, Informatica, Talend, data analytics, advanced analytics, artificial intelligence, AI, machine learning, ML, generative AI, predictive analytics, business intelligence, BI, Tableau, Power BI, dashboard development, data visualization, data storytelling, reporting, cross-functional collaboration, stakeholder engagement, data availability, data consistency, data lifecycle, analytics lifecycle, data acquisition, model deployment, model validation, performance monitoring, software development lifecycle, SDLC, version control, Git, GitHub, testing, agile methodologies, iterative development, prototype development, proof of concept, POC, automation tools, UiPath, Power Automate, data management, data optimization, problem solving, analytical thinking, technical communication, executive communication, client-facing presentations, big data analytics, cloud-based analytics, change management
Responsibilities of the Data Engineer:
- Design, develop, and optimize data pipelines and architectures that support data-driven decision-making across AI and ML initiatives
- Collaborate with data scientists, analysts, and other stakeholders to ensure data availability, integrity, and quality
- Implement ETL (Extract, Transform, Load) processes to integrate data from various sources into centralized systems
- Design, develop and maintain data models to support advanced analytics initiatives, AI/ML, Generative AI, and predictive analytics
- Ensure data security and compliance with federal regulations, including managing sensitive and classified data
- Monitor and maintain data pipelines to ensure continuous data flow and resolve any data-related issues
- Collaborate with cross-functional teams to identify and integrate new data sources and optimize data management practices
- Stay current with industry trends and emerging technologies in data engineering, cloud infrastructure, and big data processing
- Perform other related duties as required to support the mission
- Design and create interactive dashboards and reports using Tableau and Power BI
- Present findings in a clear, concise manner tailored to both technical and non-technical audiences
- Work with relational databases (e.g., Oracle, PostgreSQL, MySQL, Redshift) to support data integrity and consistency
- Automate routine data processes using a variety of scripts and tools. (e.g., Python, Airflow, SQL, AWS)
- Leverage AWS Cloud services (e.g., S3, EC2, RDS, Lambda) for data processing, storage, and retrieval
- Ability to guide and support cloud modernization of data processes
- Collaborate with cross-functional teams, including engineers, analysts, and business stakeholders
- Clearly articulate analytical concepts and results to a wide range of audiences
- Contribute to and present client-facing presentations and written reports
Qualifications for the Data Engineer:
- Bachelor's Degree in Mathematics, Computer Science, Information Systems, or a related field with 6+ years of progressive experience in data science, advanced analytics, data visualization, and reporting, with demonstrated ownership of analytical solutions from concept through delivery and operationalization.
- Proven ability to lead the design, development, and deployment of data-driven solutions, including AI/ML models, predictive analytics, and business intelligence products, in production environments.
- Advanced proficiency in Python for data manipulation, automation, and development of scalable analytical workflows.
- Strong expertise in SQL and relational databases (e.g., Oracle, PostgreSQL, MySQL), with the ability to design efficient data models and support complex data integration needs.
- Extensive experience developing automated data pipelines and analytics workflows using Python, R, SQL, and related tools, with a focus on scalability, reliability, and maintainability (e.g., Pandas, R Shiny)
- Hands-on experience leveraging cloud services (e.g., S3, EC2, RDS, Lambda) to design and implement data processing, storage, and analytics solutions.
- Demonstrated ability to design and deliver impactful data visualizations and dashboards using Tableau and/or Power BI, with a strong emphasis on data storytelling and executive-level communication.
- Experience leading or significantly contributing to the full analytics lifecycle, including problem framing, data acquisition, modeling, validation, deployment, and performance monitoring.
- Familiarity with modern software development practices, including version control (e.g., Git), testing, and iterative development methodologies.
- Experience developing prototypes and proof-of-concept solutions, with the ability to rapidly iterate and transition solutions into production-ready systems.
- Active DHS clearance or active DoD Top Secret Clearance
- Desired Skills
- Prior experience working with ETL tools like Informatica, Talend, or Airflow
- Experience with big data technologies (e.g., Hadoop, Spark, Kafka) and distributed systems
- Familiarity with automation tools (e.g., UiPath, Power Automate) and their integration into data pipelines
- Experience working in environments requiring DHS Suitability Clearance is highly preferred.
- Strong knowledge of change management processes and technologies (e.g., GitHub)
- Experience modeling and analyzing big data in cloud environments.
- Demonstrated experience developing prototypes and proof-of-concept applications or software from end-to-end.
- Knowledge of federal data governance standards, compliance requirements, frameworks, and best practices.
Compensation for the Data Engineer:
- Salary Range: $140,000 - $155,000 *depending on experience*
- Comprehensive Benefits: Comprehensive Medical, Dental, Vision, 401k Plan, PTO, Holidays, Sick Leave if required by law
This job opens for applications on 04/02. Applications for this job will be accepted for at least 30 days from the posting date.
#LI-MS1
#LI-HYRBID
Keywords: Data Engineer, data engineering, data infrastructure, data architecture, data pipelines, ETL, ELT, data integration, data ingestion, data transformation, data processing, big data, distributed systems, Hadoop, Spark, Kafka, data modeling, database design, relational databases, Oracle, PostgreSQL, MySQL, Redshift, data quality, data integrity, data validation, data governance, data security, compliance, federal regulations, classified data, sensitive data handling, DHS, Department of Homeland Security, cloud computing, AWS, Amazon Web Services, S3, EC2, RDS, Lambda, cloud architecture, cloud data platforms, cloud modernization, scalable systems, system reliability, performance optimization, Python, R, SQL, Pandas, R Shiny, scripting, automation, workflow automation, Airflow, ETL tools, Informatica, Talend, data analytics, advanced analytics, artificial intelligence, AI, machine learning, ML, generative AI, predictive analytics, business intelligence, BI, Tableau, Power BI, dashboard development, data visualization, data storytelling, reporting, cross-functional collaboration, stakeholder engagement, data availability, data consistency, data lifecycle, analytics lifecycle, data acquisition, model deployment, model validation, performance monitoring, software development lifecycle, SDLC, version control, Git, GitHub, testing, agile methodologies, iterative development, prototype development, proof of concept, POC, automation tools, UiPath, Power Automate, data management, data optimization, problem solving, analytical thinking, technical communication, executive communication, client-facing presentations, big data analytics, cloud-based analytics, change management
group id: 10430981