Posted today
Top Secret
Unspecified
Unspecified
Engineering - Mechanical
Washington, DC (On-Site/Office)
As a Sr. Site Reliability Engineer (SRE) III, you'll work as part of a collaborative and high-performing team providing your expertise to deliver technical solutions within the highest levels of the federal government.
We believe great technology services start with great people, and we are committed to building high-performing teams that prioritize collaboration, growth, and excellence. Our focus is on empowering individuals to make meaningful contributions while delivering exceptional value to our customers.
If you see yourself contributing to this mission, check out the role below:
What you'll do:
What you'll need to succeed:
We believe great technology services start with great people, and we are committed to building high-performing teams that prioritize collaboration, growth, and excellence. Our focus is on empowering individuals to make meaningful contributions while delivering exceptional value to our customers.
If you see yourself contributing to this mission, check out the role below:
What you'll do:
- Design, deploy, and maintain mission-critical application workloads in virtualized or containerized environments (e.g., VMware or Kubernetes), ensuring scalability, availability, and compliance with government requirements
- Develop and maintain automated CI/CD pipelines, monitoring, and configuration management workflows to support reliable software delivery and operational observability across development, integration, staging, and production environments
- Provision, configure, and maintain developer environments and toolchains to enable rapid, secure, and efficient development workflows
- Identify friction points across the software development lifecycle and implement solutions that create a more developer-first environment
- Establish and maintain a high level of customer trust and confidence through deep technical expertise, while delivering innovative solutions aligned with mission needs
What you'll need to succeed:
- Active Top Secret with current or previously held SCI access
- Certification meeting DoD 8140 requirements (e.g., Security+ or higher)
- Bachelor's degree in Computer Science or related engineering field preferred (relevant experience may substitute)
- 7+ years of experience in software development, systems engineering, or operations roles focused on availability, performance, and reliability of production systems
- Demonstrated experience combining software engineering and systems administration to support highly available, scalable applications
- Experience designing and managing monitoring, alerting, and observability solutions tied to Service Level Objectives
- Experience leading or participating in incident response, root cause analysis, and continuous improvement efforts
- Experience with Ansible and Desired State Configuration
- Experience with GitLab CI/CD automation and Bash scripting
- Experience supporting container-native and object storage solutions (e.g., MinIO, S3-compatible services, Portworx)
- Experience with enterprise load-balancing solutions (e.g., F5 or similar platforms)
- Ability to contribute immediately with minimal ramp-up in a mission-critical operational environment
group id: 91082210
N