Job Description
Monitoring Lead - Application Hosting
Location: Washington, DC
Type: Contract
Compensation: $63.34/h
Work arrangement: Onsite
Security Clearance: Public Trust Required
Overview
You’ll lead monitoring for application hosting: improving alert quality, building dashboards, and integrating tools (especially OpenText OBM/Operations Bridge/SiteScope and SCOM) to support 24x7 operations. Requires 7+ years of monitoring/IT ops experience, strong scripting skills (PowerShell/VBScript), U.S. citizenship, and a Public Trust clearance.
Responsibilities
Assist in driving, standardizing, and managing a unified configuration management database for DOT HQ.
Collect, aggregate, and analyze data in detail to support decisions across ITIL processes, including configuration, event, capacity, availability, demand, incident, and problem management.
Assess and fine-tune monitoring capabilities to provide accurate and actionable alerts for 24x7 operations systems.
Create and maintain intuitive dashboards displaying current and past performance and service status.
Configure, maintain, and optimize monitoring dashboards to oversee health and performance across diverse IT infrastructure components.
Deploy, manage, and update Management Packs, connectors, and monitoring policies to support business application and service monitoring needs.
Perform event correlation and filtering to streamline incident triage, reduce noise, and ensure timely escalation.
Integrate data sources from third-party monitoring tools such as OpenText OBM, SiteScope, and Microsoft SCOM into the unified OBM event console.
Conduct proactive performance and availability monitoring, identify root causes, and implement preventive measures to improve service delivery.
Requirements
BS degree or higher in a relevant field.
Minimum of 7 years of related monitoring, IT operations, or configuration management experience.
Extensive knowledge of multi-vendor server operating systems.
Minimum of 2 years of experience managing the OpenText suite, including AI Operations Management, Operations Bridge, SiteScope, and Optic.
Expertise with management protocols such as SNMP and WMI.
Scripting experience with PowerShell, VBScript, or similar languages.
Experience managing monitoring systems with over 250 hosts and more than 3,000 sensors.
Proficiency with monitoring solutions like Zenoss, PRTG, Zabbix, or Nagios.
Strong background in monitoring servers, storage, databases, networking, and applications.
Proven track record of engineering monitoring solutions and fostering a collaborative work environment.
Experience supporting a 24x7 operations environment preferred.
Experience leading troubleshooting efforts during service outages and collaborating across multiple teams.
Systems administrator experience with Windows and Linux operating systems.
Advanced scripting and automation skills, including integrating monitoring tools with ServiceNow and automating alerts for service tickets.
Strong understanding of ITIL and ITSM processes, including monitoring, demand, availability, and capacity management.
ITIL certification(s) preferred.
Experience analyzing monitoring data to inform capacity and availability decisions.
Ability to create dashboards and visualizations based on performance metrics and availability data.
group id: COMPHLP