DevOps / Site Reliability Engineer

CATHEXIS

Posted today
Top Secret
Mid Level Career (5+ yrs experience)
$160,000 - $200,000
No Traveling
IT - Software
Joint Base Pearl Harbor-Hickam, HI (On-Site/Office)

Paradyme, a CATHEXIS company, has partnered with an industry leader in enterprise Artificial Intelligence software and is seeking a highly skilled Site Reliability Engineer (SRE) to join our team to manage, monitor, and optimize our clusters on Kubernetes. Together we’re accelerating our client’s digital transformation by building and deploying data-driven, scalable AI solutions. The ideal candidate will have an understanding of Kubernetes, Cloud Infrastructure, and Infrastructure as Code (IaC) practices. You will be responsible for supporting the reliability and scalability of our Kubernetes clusters and Cloud Infrastructure.

Active TOP SECRET clearance or higher is required for consideration.

Work location is at Joint Base Pearl Harbor in Hawaii. We offer a hybrid work schedule with 4 days on-site and 1 day remote per week.

Responsibilities:
- DevOps and Development Support: Manage and restart Kubernetes clusters in an environment where the clusters are already operational. Ensure the stability, health, and scalability of Kubernetes clusters, and deploy applications and services on Kubernetes.
- Monitoring & Incident Response: Set up monitoring solutions, define alerts, and manage the incident response process for any issues related to Jenkins or Kubernetes clusters.
- Automate Infrastructure Processes: Build automation tools for scaling, monitoring, and maintaining infrastructure using modern tools like Terraform, Ansible, or equivalent.
- Collaborate Across Teams: Work closely with development, services, and operations teams to ensure seamless integration between application development and infrastructure.
- Security & Compliance: Ensure all systems follow best practices in terms of security and compliance with relevant regulations. This includes role-based access, encryption, and automated vulnerability scanning.

Requirements:
- Active TOP SECRET clearance or higher is required for consideration.
- Bachelor’s degree (or equivalent) in computer science or related discipline
- A minimum of five (5) years of experience working with on-premises and off-premises cloud environments.
- Experience with AWS, Azure, and/or GCP
- Experience with Kubernetes and/or Red Hat OpenShift is highly desired
- Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
- Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, YARN)
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
- Agile/Scrum experience
group id: 10477716
About Us
CATHEXIS is a federal systems integrator built on focus, discipline, and integrity. We design and deliver tech-enabled solutions for defense, civilian, and intelligence agencies.