Today
Top Secret/SCI
Mid Level Career (5+ yrs experience)
IT - Data Science
Reston, VA (On/Off-Site)
Job Description
Red Gate Group is seeking a talented Data Scientist to join our team supporting the Defense Threat Reduction Agency (DTRA) in Reston, VA or Ft. Belvoir, VA. As a Data Scientist, you will have a strong background in machine learning, natural language processing (NLP), and large language model (LLM) implementation to drive impactful solutions in a high-security environment. The ideal candidate has 5+ years of hands-on experience in applied data science, including data exploration, cleaning, analysis, visualization, and mining. You’ll lead the design and deployment of ML models for document classification, extraction, summarization, and search across large-scale, real-time systems. Prior experience with production-level workflows, data lakes, and streaming technologies like Kafka is essential.
Qualifications
5+ years of experience in applied data science or ML roles, with strong Python skills and experience in NLP and LLM implementation
5+ years of experience with data exploration, data cleaning, data analysis, data visualization, or data mining
Exposure to production-level systems, data lake environments, and streaming data (e.g. Kafka)
Experience implementing end-to-end ML workflows, from data prep to deployment and evaluation
Ability to quickly learn infrastructure or systems concepts (e.g. how pipelines interface with data lakes)
Ability to design, implement, and iterate on ML models for document classification, extraction, summarization, and search
Ability to take ownership of data science workflows that interact with a production system streaming millions of documents per week
Active TS/SCI clearance
Desired Qualifications:
Experience collaborating with MLOps and infrastructure engineers to ensure robust model deployment, monitoring, and retraining pipelines
Experience supporting platform components such as distributed storage (e.g. Cloudera), documents indexing/search, and GPU workloads
Experience in the development of algorithms leveraging R, Python, or SQL/NoSQL
Experience with Distributed data/computing tools, including MapReduce, Hadoop, Hive, EMR, Spark, Gurobi, or MySQL
Experience with visualization packages, including Plotly, Seaborn, or ggplot2
Bachelor’s degree
Red Gate Group is seeking a talented Data Scientist to join our team supporting the Defense Threat Reduction Agency (DTRA) in Reston, VA or Ft. Belvoir, VA. As a Data Scientist, you will have a strong background in machine learning, natural language processing (NLP), and large language model (LLM) implementation to drive impactful solutions in a high-security environment. The ideal candidate has 5+ years of hands-on experience in applied data science, including data exploration, cleaning, analysis, visualization, and mining. You’ll lead the design and deployment of ML models for document classification, extraction, summarization, and search across large-scale, real-time systems. Prior experience with production-level workflows, data lakes, and streaming technologies like Kafka is essential.
Qualifications
5+ years of experience in applied data science or ML roles, with strong Python skills and experience in NLP and LLM implementation
5+ years of experience with data exploration, data cleaning, data analysis, data visualization, or data mining
Exposure to production-level systems, data lake environments, and streaming data (e.g. Kafka)
Experience implementing end-to-end ML workflows, from data prep to deployment and evaluation
Ability to quickly learn infrastructure or systems concepts (e.g. how pipelines interface with data lakes)
Ability to design, implement, and iterate on ML models for document classification, extraction, summarization, and search
Ability to take ownership of data science workflows that interact with a production system streaming millions of documents per week
Active TS/SCI clearance
Desired Qualifications:
Experience collaborating with MLOps and infrastructure engineers to ensure robust model deployment, monitoring, and retraining pipelines
Experience supporting platform components such as distributed storage (e.g. Cloudera), documents indexing/search, and GPU workloads
Experience in the development of algorithms leveraging R, Python, or SQL/NoSQL
Experience with Distributed data/computing tools, including MapReduce, Hadoop, Hive, EMR, Spark, Gurobi, or MySQL
Experience with visualization packages, including Plotly, Seaborn, or ggplot2
Bachelor’s degree
group id: 10349707