nanog mailing list archives
Re: Temperature monitoring
From: Peter Beckman <beckman () angryox com>
Date: Tue, 18 Jul 2017 22:33:16 -0400
Agreed -- there are already tons of temp sensors throughout old and new hardware. I've used SCSI drive queries via sdparm and more recently hddtemp to get the current temperature of the drives. No need for SNMP or ILO, though that can give you a more detailed picture where possible. You first monitor and record for 24 hours to get your baseline temp for a given rack or server, then set your threshold, then let your monitoring platform do the rest. Since I use hosted dedicated servers, I don't want to pay for yet another device. In monitoring only those disk temps I've caught two cooling issues before they became a crisis, one of which my hosting provider was not aware of. If you control the hardware, or at least have access to it, there should be enough sensors to let you know at least something is causing a problem. Beckman On Thu, 13 Jul 2017, Andrew Latham wrote:
On Thu, Jul 13, 2017 at 9:33 PM, Dovid Bender <dovid () telecurve com> wrote:All, We had an issue with a DC where temps were elevated. The one bit of hardware that wasn't watched much was the one that sent out the initial alert. Looking for recommendations on hardware that I can mount/hang in each cabinet that is easy to set up and will alert us if temps go beyond a certain point. TIA. DovidMost everything has temperature sensors from switches, servers and most modern PDUs. A dedicated solution is just creating the problem again in the future. Monitor the temps on everything and gain knowledge related to failure rates. Most companies with physical infrastructure could pay for another engineer to discover these unexpected expenses. Also note that modern air conditioning and refrigeration have SNMP or BACNET protocol support, just download the manual. -- - Andrew "lathama" Latham -
--------------------------------------------------------------------------- Peter Beckman Internet Guy beckman () angryox com http://www.angryox.com/ ---------------------------------------------------------------------------
Current thread:
- Temperature monitoring Dovid Bender (Jul 13)
- Re: Temperature monitoring Gary E. Miller (Jul 13)
- Re: Temperature monitoring Andrew Latham (Jul 13)
- Re: Temperature monitoring Peter Beckman (Jul 18)
- Re: Temperature monitoring Harlan Stenn (Jul 13)
- Re: Temperature monitoring Nick Hilliard (Jul 14)
- Re: Temperature monitoring Pete Baldwin (Jul 13)
- Re: Temperature monitoring Mel Beckman (Jul 13)
- Re: Temperature monitoring Richard Holbo (Jul 13)
- Re: Temperature monitoring Eric Kuhnke (Jul 13)
- Re: Temperature monitoring Eric Kuhnke (Jul 14)
- Re: Temperature monitoring Dan White (Jul 14)
- Re: Temperature monitoring David Charlebois (Jul 16)
- RE: Temperature monitoring Edwin Pers (Jul 18)
- Re: Temperature monitoring David Charlebois (Jul 16)