Job Requirements
Charlottesville, VA
Top Secret/SCI; polygraph not specified
Senior Level Career (10+ yrs experience)
Salary not specified
Job Description
About the Role
We are seeking an experienced Senior Data Engineer to lead the design, development, and maintenance of our mission-critical data infrastructure, with a focus on bioinformatics and the processing of large-scale genomic data. This role requires a hands-on engineer who can architect robust data solutions, establish best practices for data management, and ensure the scalability and reliability of systems supporting secure government operations. You will be responsible for mentoring team members and driving technical excellence across our data ecosystem.
Primary Responsibilities
Design, build, and maintain scalable, reliable, and performant data pipelines for bioinformatics and genomic data.
Architect and implement data models and database schemas to meet application and analytics requirements.
Lead technical decision-making for the evolution and modernization of our data technology stack.
Establish and enforce data engineering standards, best practices, and architectural patterns.
Mentor junior and mid-level engineers through code reviews, pair programming, and technical guidance.
Optimize and scale our data infrastructure, including microservices, databases, and real-time data streams.
Ensure security compliance and implement secure data handling practices for classified environments.
Troubleshoot and resolve complex production issues across distributed data systems.
Required Technical Skills
Core Languages: Expert-level proficiency in Python and/or JavaScript.
Scientific Data Expertise:
Proven experience in bioinformatics, scientific data management, or a related domain involving the processing and analysis of large-scale datasets.
Data Engineering & Processing:
Workflow Orchestration: Deep experience with tools like Apache Airflow, Prefect, or Dagster.
Big Data Frameworks: Hands-on experience with processing frameworks such as Apache Spark.
Stream & Message Brokers: Experience with real-time data streaming using Apache Kafka, RabbitMQ, AWS SQS/SNS, or Azure Service Bus.
Data Storage & Databases:
SQL Databases: Expertise in relational databases (PostgreSQL, MySQL, Oracle) including advanced query optimization and schema design.
Object Stores?
Search & Indexing: Familiarity with search engines like Elasticsearch or OpenSearch.
Cloud & Infrastructure:
Cloud Platforms: Strong experience with services from AWS, Azure, or GCP.
DevOps: Experience with CI/CD automation tools like Jenkins, GitLab CI/CD, or GitHub Actions, as well as infrastructure tools like Helm and Terraform.
Preferred Qualifications
Experience designing event-driven architectures and microservices.
Knowledge of data governance, data quality, and observability frameworks.
Background in migrating legacy data systems to modern, scalable architectures.
Deep understanding of database performance tuning: indexing, partitioning, and connection pooling.
Experience with government security compliance (STIG, ATO, FedRAMP).
Front-end experience with frameworks like React, Vue, or Angular is a plus but not required.
Soft Skills
Strong communication skills for technical documentation and cross-team collaboration.
Ability to balance technical debt with feature delivery in a pragmatic way.
Experience leading technical discussions and making architectural decisions.
Comfortable working in an Agile/Scrum environment.
Education & Experience
Bachelor's degree in Computer Science, Bioinformatics, Engineering, or a related field (or equivalent experience).
7+ years of professional software/data engineering experience.
3+ years in a senior or lead engineering role focused on data systems.
Active security clearance or ability to obtain one (required for DoD work).