nanog mailing list archives

Re: Arista hardware health and environmental nagios plugin


From: Piotr Iwanejko <piotr.iwanejko () gmail com>
Date: Mon, 22 May 2017 09:10:36 +0200

Hello,

Wiadomość napisana przez bas <kilobit () gmail com> w dniu 19.05.2017, o godz. 21:34:

I had hoped not to have to re-invent the wheel.

Some custom scripts I use on 7050SX: https://github.com/piwanejko/Arista-monitoring-tools
Nagios checks:

CPU1 temperature                        check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006001'!'550'!'600'
CPU1 load                               check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.1'!'70'!'90'
CPU2 temperature                        check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006002'!'550'!'600'
CPU2 load                               check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.2'!'70'!'90'
CPU3 temperature                        check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006003'!'550'!'600'
CPU3 load                               check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.3'!'70'!'90'
CPU4 temperature                        check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006004'!'550'!'600'
CPU4 load                               check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.4'!'70'!'90'
Fan tray 1 status                       check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100601111'!''!'1'    
Fan tray 2 status                       check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100602111'!''!'1'
Fan tray 3 status                       check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100603111'!''!'1'
Fan tray 4 status                       check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100604111'!''!'1'
Lower board temperature check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006011'!'500'!'600'
PSU1 fan status                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100711211'!''!'1'
PSU1 in current status          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100711103'!''!'1'
PSU1 in voltage status          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100711105'!''!'1'
PSU2 fan status                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100721211'!''!'1'
PSU2 in current status          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100721103'!''!'1'
PSU2 in voltage status          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100721105'!''!'1'
SUP temperature                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006005'!'550'!'600'
Upper board temperature check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006009'!'500'!'600'
Uptime                                  check_snmp_sw!'2c'!'public'!'.1.3.6.1.2.1.1.3.0'!'@60000:70000'!'60000:'


check_snmp_sw   ->      check_snmp -H $HOSTADDRESS$ -P $ARG1$ -C $ARG2$ -o $ARG3$ -w $ARG4$ -c $ARG5$

I also made custom script to check discs and memory utilization, but it's too old and terribly written to be shared.

Best regards,

Current thread: