Data Scientist

SHR CONSULTING GROUP, LLC

Today
Top Secret
IT - Data Science
Alexandria, VA (On-Site/Office)

Job Title: Data Scientist

Job Category: IT

Location: Remote with occasional onsite work within the DMV area

Security Clearance: Active TS/SCI Clearance or eligibility

SHR is a premier technology integrator solving our nation's most complex modernization and readiness challenges across the defense, federal civilian, and intelligence markets. Our robust portfolio of offerings includes high-end solutions in systems engineering and integration; enterprise IT, including cloud services; cyber; software; advanced analytics and AI. With an intimate understanding of our customers' challenges and deep expertise in existing and emerging technologies, we integrate the best components from our own portfolio and our partner ecosystem to deliver innovative, effective, and efficient solutions.

The Data Scientist will have a background in diverse machine learning domains, including but not limited to natural language understanding, image analysis, and foundational concepts in statistical learning. The role calls for direct experience with NLP techniques and tools such as large language models (LLMs), vector-based text representations, semantic search, and generative AI workflows like retrieval-augmented generation (RAG). The candidate should be well-versed in preparing datasets for modeling, including data cleansing, feature construction, and rigorous model validation, and should have in-depth knowledge of a range of ML methods, including classification, clustering, deep learning, reinforcement learning, and neural network architectures.

The candidate should be proficient in using Git for collaborative development, version tracking, and maintaining source code integrity across projects, and should have a proven track record of developing, training, and refining machine learning models as a data scientist or machine learning engineer. The role requires comfort coding in multiple languages, particularly Python, R, Scala, Java, or C++, with attention to code clarity and reproducibility, and experience with distributed data processing platforms such as Apache Spark and Databricks for large-scale analytics and model development. The candidate should be adept at working with very large datasets: uncovering insights through exploratory analysis, crafting effective visualizations, writing performant SQL, and using GPU infrastructure for model training. Practical experience managing data in cloud environments is expected, especially relational databases such as RDS PostgreSQL extended with pgvector, along with skill in storing and retrieving high-dimensional embedding vectors using platforms such as Elasticsearch, OpenSearch, or PostgreSQL with vector search extensions.

Job Responsibilities:
  • Develops, customizes, and maintains data-driven applications and analytical tools tailored to diverse mission-focused needs across various technical domains.
  • Partners with interdisciplinary teams including software developers and ML experts to seamlessly integrate external AI capabilities from across CDAO or DoD entities into the Search Portfolio suite when beneficial.
  • Refines and enhances machine learning models for speed, reliability, and scalability by leveraging distributed processing technologies such as Databricks and Apache Spark, with the flexibility to deploy solutions within GPU-powered Kubernetes environments.
  • Continuously monitors advancements in the AI field, applying relevant innovations to elevate the performance and effectiveness of Search Portfolio offerings.
  • Oversees the full development pipeline for AI/ML features within the Search Portfolio, from initial experimentation through production deployment and ongoing refinement.
  • Uses quantitative methods to identify and troubleshoot data anomalies, formulate and implement solutions, and assess outcomes through performance metrics.
  • Prepares and communicates technical documentation, system designs, and evaluation summaries to both internal teams and stakeholders to support informed decision-making.
  • Establishes and evolves robust data modeling frameworks, ensuring statistical integrity and precision in identifying correlations and causal insights within large datasets.

Education:
  • Bachelor's degree in mathematics, statistics, computer science, data science, or a field directly related to the position

Experience:
  • Must have 7 years of experience

Why Join Us:

At SHR, you will join a team that fosters growth, supports innovation, and encourages continuous learning. You will have the opportunity to impact significant government initiatives and contribute to national security and public welfare. We offer competitive compensation, comprehensive benefits, and a flexible work environment. Join us and make a difference!

At SHR, we foster an environment that promotes growth, innovation, and continuous learning. As a valued member of our team, you will:
  • Contribute to impactful government initiatives that enhance national security and public welfare.
  • Work in a collaborative, flexible, and forward-thinking work environment.
  • Receive competitive compensation, comprehensive benefits, and career development opportunities.
