Search All Sites
Nagios Live WebinarsLet our experts show you how Nagios can help your organization.
- Nagios 2.x
- Nagios 3.x
- Nagios 4.x
- Release 3.7.12: A couple of fixes for physical disk reporting. Failed physical disks are now reported as critical.
- Release 3.7.11: A regression on Nagios systems with ePN enabled was fixed
- Release 3.7.10: This release fixes documentation URLs in the HTML output to reflect the new layout on Dell's website, adds a new option "--vdisk-critical" to make all alerts for virtual disks appear as critical, and fixes a bug with the option "--only servicetag"
About this plugin:
The main philosophy behind check_openmanage is that the plugin's output should be immediately useful to the system administrator. Great care has been taken to ensure that the output is accurate, compact and detailed enough to tell you exactly what you need to know in case of a hardware problem. It will identify blades and will not report "missing" power supplies, fans etc. on blade servers.
dell-server01 ~$ check_openmanage OK - System: 'PowerEdge R710', SN: 'XXXXXXX', 72 GB ram (18 dimms), 1 logical drives, 2 physical drives
check_openmanage differs from other Dell plugins in the level of detail in the output. It tries to be as precise as possible about whatever is wrong. In most cases, it shouldn't be necessary to investigate further when you get an alert from check_openmanage.
nagios-server ~$ check_openmanage -H dell-server02 Physical Disk 1:3 [Seagate ST3300007LC, 300GB] on ctrl 0 needs attention: Failure Predicted
In addition, many options have been added to the plugin to let you decide for yourself how it should behave in your environment. You can blacklist components that you'll never fix, and control which of the components that are checked in the first place. The default setting is that almost everything is checked.
nagios-server ~$ check_openmanage -H dell-server03 -e Memory module 4 [DIMM2_B, 1024 MB] needs attention: ECC single bit correction logging disabled ------ SYSTEM: PowerEdge 2850, SN: XXXXXXX
The plugin is designed to be used either locally with NRPE, or remotely via SNMP. In either case, it relies entirely on the Dell OpenManage System Administrator (OMSA) management software and cannot function if OMSA is not installed and running on the monitored system.Storage components checked:
- Physical drives
- Logical drives
- Cache batteries
- Connectors (channels)
- Enclosure fans
- Enclosure power supplies
- Enclosure temperature probes
- Enclosure management modules (EMMs)
- Memory modules
- Cooling fans
- Temperature probes
- Power supplies
- Voltage probes
- Power usage
- Chassis intrusion
- Removable flash media (SD cards)
- Sane and not empty service tag (serial number)
- ESM Log health
- ESM Log content (default disabled)
- Alert Log content (default disabled, not SNMP)
check_openmanage has been testet on a variety of Dell servers running RHEL3, RHEL4, RHEL5, RHEL6, VmWare ESX and various Windows releases, with recent OMSA versions. It may work with old versions of OMSA, and should work on other operating systems providing a fairly recent and functioning OMSA version.
So I naturally looked to see what was available and found check_openmanage which was well documented, easy to set up and immediately discovered some previously unknown problems -two machines had out of date firmware and one has a battery issue.
22 => 'iDRAC8', # iDRAC8
Just mention individual plugins in NRPE.CFG or Nagios Server. The example is as given below.
command[check_openmanage]=/usr/local/nagios/libexec/check_openmanage --only memory
command[check_openmanage1]=/usr/local/nagios/libexec/check_openmanage --only voltage
command[check_openmanage2]=/usr/local/nagios/libexec/check_openmanage --only fans
command[check_openmanage3]=/usr/local/nagios/libexec/check_openmanage --only cpu
command[check_openmanage4]=/usr/local/nagios/libexec/check_openmanage --only esmhealth
command[check_openmanage5]=/usr/local/nagios/libexec/check_openmanage --only temp
Sample output is:
[root@dell-nfs libexec]# ./check_openmanage --only memory
MEMORY OK - 4 memory modules, 4096 MB total memory
System: PowerEdge 6850 OMSA version: 5.5.0
ServiceTag: 268XZC1 Plugin version: 3.7.6
BIOS/date: A04 08/07/2006 Checking mode: local
STATE | ID | MESSAGE TEXT
OK | 0 | Memory module 0 [Mem Card A/DIMM_1A, 1024 MB] is Ok
OK | 1 | Memory module 1 [Mem Card A/DIMM_1B, 1024 MB] is Ok
OK | 2 | Memory module 2 [Mem Card B/DIMM_1A, 1024 MB] is Ok
OK | 3 | Memory module 3 [Mem Card B/DIMM_1B, 1024 MB] is Ok
We check all our Dell servers with it.