Posted today
Public Trust
Unspecified
Unspecified
IT - Hardware
GA (On-Site/Office)
Job Details
Seeking a Systems Engineer to support federal client with enterprise monitoring of distributed systems. The candidate will have 5 years of experience demonstrating comprehensive knowledge of key tasks in one or more system monitoring software packages to include DataDog, Dynatrace, IBM Omnibus, IBM NOI (Netcool Operations Insight), HP APM (App Performance Manager), Operations Center, Topaz (MicroFocus), or Operations Center MVS. The installation, configuration, and management of these tools to include working with the application teams to gather requirements and tune the systems for alert thresholds. Supporting after hours incidents as needed.
#cjpost
Job Requirements:
Minimum Qualifications
Required Skills and Qualifications
Seeking a Systems Engineer to support federal client with enterprise monitoring of distributed systems. The candidate will have 5 years of experience demonstrating comprehensive knowledge of key tasks in one or more system monitoring software packages to include DataDog, Dynatrace, IBM Omnibus, IBM NOI (Netcool Operations Insight), HP APM (App Performance Manager), Operations Center, Topaz (MicroFocus), or Operations Center MVS. The installation, configuration, and management of these tools to include working with the application teams to gather requirements and tune the systems for alert thresholds. Supporting after hours incidents as needed.
- Architecture and Deployment: Engineer and deploy monitoring agents, collectors, and integrations across Windows, Linux, and virtualized environments.
- Develop and maintain integrations between monitoring systems and IT Service Management (ITSM) tools for incident generation (ServiceNow), CMDB synchronization, and workflow automation.
- Write scripts and use APIs for automation, data processing, and integration between disparate systems.
- Provide root cause analysis and troubleshooting for monitoring and integration issues, working with support teams to resolve incidents.
- Develop operational procedures, runbooks, and 'as-built' documentation.
- Implement solutions with security and compliance requirements in mind, such as STIGs and FISMA.
#cjpost
Job Requirements:
Minimum Qualifications
- Bachelor's Degree required or equivalent relevant experience. Master's Degree preferred.
- 8-10 years of relevant experience to include: 5 years of experience in IT enterprise monitoring system design, implementation, and Level 3 support activities, 5 years of experience as an administrator of IBM AIX and or Red Hat Linux servers, 3 years of experience collecting requirements from application owners and implementing those requirements.
Required Skills and Qualifications
- 5+ years of hands-on experience in enterprise monitoring and IT infrastructure management.
- Proficiency with Windows Server and Linux (e.g., RHEL, CentOS).
- Strong skills in scripting languages such as GO, and Python, and experience with REST APIs.
- Solid understanding of networking concepts like TCP/IP, DNS, and protocols like SNMP, NetFlow, and syslog.
- Experience with virtualization platforms like VMware vSphere or Hyper-V.
- Experience integrating with ServiceNow.
- Experience with log aggregation and monitoring tools like Splunk.
- After hours support of incidents as needed.
- Ability to solve complex technical problems and work under pressure.
group id: 10238000