user avatar

MCS Site Reliability Engineer

SciTec, Inc.

Posted today
Public Trust
$146,000 - $175,000
Unspecified
Engineering - Mechanical
Aurora, CO (On-Site/Office)

Description

SciTec is a dynamic small business, with the mission to deliver advanced sensor data processing technologies and scientific instrumentation capabilities in support of National Security and Defense, and we are growing our creative team! We support customers throughout the Department of Defense and U.S. Government in building innovative new tools to deliver unique world-class data exploitation capabilities.

Important Notice: SciTec exclusively works on U.S. government contracts that require U.S. citizenship for all employees. Applicants that do not meet this requirement will not be considered.

SciTec has an immediate opportunity for a talented engineer to support our programs delivering Next-Generation Missile Warning software. This is a unique opportunity to join a small business delivering core capabilities for National defense. You will work within a fast-paced team delivering end-to-end software processing of Overhead Persistent InfraRed (OPIR) sensor data for Missile Warning, Missile Defense, Battlespace Awareness, and Technical Intelligence.

We are seeking an MCS Site Reliability Engineer (SRE) to support the reliability, performance, and operational excellence of mission-critical infrastructure services. This role sits within the Infrastructure as a Service (IaaS) team and focuses on availability, scalability, observability, and automation across compute, storage, networking, and platform services deployed at a customer site.

The ideal candidate is a strong systems engineer with an SRE mindset-someone who can troubleshoot complex infrastructure issues, improve system resilience, and reduce operational toil through automation.

Responsibilities
  • Support the availability, reliability, and performance of IaaS services supporting mission systems
  • Monitor infrastructure health using metrics, logs, and alerts; respond to and resolve incidents
  • Perform root-cause analysis for infrastructure and service outages; implement corrective and preventative actions
  • Improve system reliability through automation, standardization, and proactive engineering
  • Support capacity planning, performance analysis, and scaling of infrastructure services
  • Maintain and enhance monitoring, logging, and alerting solutions
  • Participate in incident response, on-call rotations (as required), and post-incident reviews
  • Collaborate with network, systems, platform, and application teams to resolve cross-stack issues
  • Support infrastructure lifecycle activities including upgrades, patches, and configuration changes
  • Apply security best practices and support compliance requirements in a regulated environment
  • Develop and maintain runbooks, procedures, and operational documentation
  • Contribute to CI/CD and Infrastructure-as-Code workflows supporting IaaS services
  • Participate in Agile ceremonies and operational planning activities
  • Perform other duties as assigned

Requirements
  • 5+ years of professional experience in systems engineering, SRE, DevOps, or infrastructure operations
  • Strong experience administering Linux systems
  • Experience supporting on-prem, cloud, or hybrid infrastructure environments
  • Hands-on experience with monitoring, logging, and alerting systems
  • Strong troubleshooting skills across compute, storage, networking, and OS layers
  • Experience scripting or automating tasks using Bash, Python, or similar languages
  • Familiarity with Infrastructure as Code concepts and tooling
  • Strong verbal and written communication skills
  • Detail-oriented, self-motivated, and able to own issues through resolution
  • Ability to obtain and maintain a DoD security clearance
  • Ability to work on-site at the customer location

Candidates who have any of the following skills will be preferred:
  • Experience working on an IaaS or platform operations team
  • Experience with virtualization platforms (e.g., VMware vSphere)
  • Experience supporting container platforms (Kubernetes, OpenShift)
  • Experience with cloud environments (AWS, Azure, or GovCloud)
  • Familiarity with SRE concepts such as SLIs, SLOs, error budgets, and toil reduction
  • Experience with configuration management or automation tools (Ansible, Terraform)
  • Experience with CI/CD pipelines (GitLab CI, Jenkins, or similar)
  • Experience operating systems in government or secure environments
  • Experience with incident management and operational readiness reviews

Benefits

SciTec offers a highly competitive salary and benefits package, including:
  • 4% Safe Harbor 401(k) match
  • 100% company paid HSA Medical insurance, with a choice of 2 buy-up options
  • 80% company paid Dental insurance
  • 100% company paid Vision insurance
  • 100% company paid Life insurance
  • 100% company paid Long-term Disability insurance
  • Short-term Disability insurance
  • Annual Profit-Sharing Plan
  • Discretionary Performance Bonus
  • Paid Parental Leave
  • Generous Paid Time Off, including Holiday, Vacation, and Sick Pay
  • Flexible work hours


The pay range for this position is $146,000 - $175,000 / year. SciTec considers several factors when extending an offer of employment, including but not limited to the role and associated responsibilities, a candidate's work experience, education/training, and key skills. This is not a guarantee of compensation.

SciTec is proud to be an Equal Opportunity employer. VET/Disabled.
group id: 10337270

Match Score

Powered by IntelliSearchâ„¢
image match score
Create an account or Login to see how closely you match to this job!