Posted today
Top Secret/SCI
Charlottesville, VA (On-Site/Office)
All hired employees are expected to have experience with Microsoft Copilot and/or an approved equivalent AI solution.
Description:
Data Systems Analysts, Inc. (DSA) is seeking a TS/SCI-cleared HPC Support Engineer to assist users executing computational workloads within secure High Performance Computing (HPC) environments. The HPC Support Engineer will work directly with engineers, analysts, and researchers to support job execution, troubleshoot workload failures, and improve the performance and efficiency of compute workloads running on HPC clusters.
The Engineer will assist users with scheduler job scripts, application execution, and workload performance troubleshooting while promoting HPC best practices for efficient cluster utilization. This role serves as the primary interface between mission users and HPC platform infrastructure teams.
This position requires strong Linux experience, scripting capability, and familiarity with distributed computing environments supporting scientific or engineering workloads.
This position is onsite in Charlottesville, VA.
Responsibilities:
- Provide user support for computational workloads running on HPC clusters in classified and unclassified environments.
- Assist users in developing, submitting, and troubleshooting scheduler job scripts for systems such as Slurm or PBS, including resource allocation for CPU, GPU, and distributed compute workloads.
- Troubleshoot slow, hanging, or failing HPC jobs, including MPI-based distributed workloads, GPU jobs, and large-scale parallel applications.
- Support users compiling and executing scientific, modeling, or data processing applications within Linux-based HPC environments.
- Provide guidance on HPC best practices for job scheduling, compute resource allocation, and workload performance.
- Monitor workload execution patterns and provide guidance to improve cluster throughput and resource utilization.
- Develop scripts or tools using Bash or Python to automate common operational tasks.
- Maintain documentation and knowledge base articles describing system capabilities, job execution procedures, and troubleshooting guidance.
- Support performance analysis of compute workloads to identify inefficiencies or configuration issues.
- Coordinate with HPC systems engineers when infrastructure or cluster configuration issues impact workload performance.
- Provide responsive on-site support for users executing HPC workloads in mission environments.
- Maintain source-controlled scripts and tools using Git or similar version control platforms.
- Assist users with environment modules and runtime environments required for executing HPC applications.
Required Education, Certifications and Security Clearance:
- BS degree in Engineering, Computer Science, or related STEM field
- Experience may be substituted for degree
- TS/SCI Clearance
- Ability to obtain DoD 8140 (8570) IAT Level II certification
Required Experience/Qualifications:
- Minimum 5 years of Linux experience, including command-line usage, scripting, and troubleshooting applications in multi-user server environments.
- Professional experience administering or supporting command-line Linux systems (RHEL derivatives preferred).
- Experience developing scripts using Bash, Python, or similar scripting languages.
- Experience troubleshooting software execution issues in distributed computing environments.
- Working knowledge of job scheduling systems such as Slurm, PBS, Torque, or similar platforms.
- Experience supporting users in technical computing or engineering environments.
- Strong troubleshooting and analytical skills.
- Ability to communicate technical concepts clearly to both technical and non-technical users.
- Active TS/SCI security clearance.
Preferred Experience/Qualifications:
- Experience as a user or administrator of HPC clusters.
- Experience supporting parallel computing frameworks such as MPI, OpenMP, or CUDA-based GPU workloads.
- Experience supporting scientific or engineering applications requiring large-scale compute resources.
- Experience using performance monitoring and optimization tools for compute workloads.
- Experience compiling applications using C, C++, Fortran, or Python-based environments.
- Experience working in classified computing environments.
- Experience supporting GPU-enabled workloads.
#DSA209
#LI-KE1
Many of DSA's positions require the ability to obtain a security clearance. Security clearances may only be granted to U.S. citizens. In addition, applicants who accept a conditional offer of employment may be subject to government security investigation(s) and must meet eligibility requirements for access to classified information. DSA is proud to be an Equal Opportunity Employer. DSA is committed to treating all employees and applicants for employment with respect and dignity and maintaining a workplace that is free from unlawful discrimination. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status. DSA requires background checks, where permitted by law. DSA is an E-Verify Employer.