Posted today
Public Trust
IT - Hardware
Annapolis Junction, MD (On-Site/Office)
For the OPS Consulting team, 'the power to help' means helping our clients, helping serve the mission, helping our employees and their families, and helping the community. Headquartered in Hanover, MD, OPS Consulting has over two decades of experience specializing in the most mission-critical operations. We are thought leaders and innovators. The ingenuity of our developers, engineers, cyber experts, linguists, and analysts is dedicated to empowering our clients, fulfilling The Mission, and remaining trusted leaders and advisers in national security and technology solutions.
We are looking for a Cloud Systems Administrator 3 to join a growing team in Annapolis Junction, MD.
Provide installation, configuration, tuning, and management of large Hadoop (Apache Accumulo) clusters supporting data-intensive computing. Responsibilities include installation and maintenance of tools required for the cloud environment and monitoring and sustainment of the environment, including identification and resolution of issues, problems, and trouble tickets.
Responsibilities:
- Identify, deploy, and operate software tools to efficiently manage and monitor large compute clusters
- Diagnose and resolve hardware and software issues within Hadoop and Apache Accumulo environments
- Install, configure, and maintain tools supporting distributed data processing and storage platforms
- Support cluster performance, reliability, and scalability through proactive monitoring and tuning
- Collaborate with engineering and operations teams to support high-performance, mission-critical systems
Requirements:
- Within the past ten (10) years, at least seven (7) years of experience managing and monitoring large Hadoop clusters exceeding 200 nodes
- Within the past ten (10) years, at least five (5) years of experience writing automation scripts using Python, Perl, or Ruby
- Within the past ten (10) years, at least five (5) years of experience implementing and supporting multi-platform, multi-system networks, including environments built on UNIX/Linux and Cisco-based hardware, with hands-on troubleshooting and issue resolution
- Within the past ten (10) years, at least five (5) years of experience implementing network solutions for complex, high-performance systems using UNIX or Linux platforms
- At least two (2) years of experience with open-source NoSQL or distributed data platforms, such as HBase, Apache Accumulo, or BigTable, supporting massively parallel computation
- At least three (3) years of experience with Hadoop Distributed File System (HDFS)
- Within the past two (2) years, a minimum of six (6) months of system administration experience in one or more of the following environments:
  - Kubernetes
  - Amazon Web Services (AWS)
  - Microsoft Azure
  - Google Cloud Platform (GCP)
- US citizenship and an active security clearance required
Desired Experience:
- Strong Linux system administration background
- Experience supporting high-performance computing (HPC) or large-scale analytics platforms
- Familiarity with monitoring, logging, and cluster management tools in distributed environments