Job Requirements
McLean, VA
Top Secret/SCI Polygraph Unspecified
Career Level not specified
Salary not specified
Job Description
What Impact You'll Have
GRVTY is a member of 100% of the winning teams for the largest technology program in the Intel Community. We've been supporting this customer on many different sub-projects of this program since our founding in 2013. We've grown on this effort by providing the customer with Engineers who have done exceptional work, and we've retained our staff by paying very strong salaries and working hard to ensure each Engineer is doing work that aligns with their career interests.
What You'll Be Owning
GRVTY is seeking a Data Scientist with a TS/SCI + Poly clearance (applicable to this customer) to join one of our top projects in McLean, VA. The Data Scientist will work in a fast-paced, dynamic, agile software development environment. The multidisciplinary project team works together on multiple projects that include automating the processing of large forensic images, extracting and enriching metadata, and displaying the resulting information in meaningful ways for analysts to conduct assessments. Team members use a mix of COTS and GOTS tools and technologies and build integrations with a variety of external partner applications. Most solutions are cloud-based. The Sponsor adheres to Agile Scrum development best practices and works in two-week sprint cycles.
What You Must Have
- Demonstrated experience building production data pipelines and ETL/ELT workflows at scale.
- Demonstrated experience with Apache Spark and PySpark for distributed data processing.
- Demonstrated advanced Python programming skills, including data manipulation libraries (Pandas, NumPy) and data engineering best practices.
- Demonstrated understanding of data security, privacy, governance, and compliance principles.
- Demonstrated experience with workflow orchestration tools such as Step Functions and Airflow.
- Demonstrated experience with containerization tools such as Docker or Podman, and with deploying data applications in cloud environments.
- Demonstrated experience with AWS services (in particular S3, Lambda, and Step Functions).
- Demonstrated experience with PostgreSQL and MySQL in production environments, including performance tuning and schema design.
- Demonstrated experience with SQL and query optimization for complex analytical workloads.
- Demonstrated experience with version control (Git) and CI/CD practices for data pipelines.
- Demonstrated experience working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight.
- Demonstrated strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks.