nanog mailing list archives

Re: Cloudflare is down


From: Saku Ytti <saku () ytti fi>
Date: Mon, 4 Mar 2013 20:40:58 +0200

On (2013-03-04 13:23 -0500), Jeff Wheeler wrote:
 
> We have lots of stupid people in our industry because so few
> understand "The Way Things Work."

We have a tendency to view the mistakes we make as unavoidable human errors
and the mistakes other people make as avoidable stupidity.

We should actively plan for mistakes/errors. If you actively plan for no
'stupid mistakes', you're gonna have a bad time.

From my point of view, outages are caused by:
1) operator
2) software defect
3) hardware defect

Most people design only against 3), often with a design which actually
increases the likelihood of 2) and 1), reducing the overall MTBF of a design
which, strictly theoretically, should increase it.
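
A rough back-of-the-envelope sketch of that trade-off, in Python. Every
number here is invented for illustration, and the model (independent causes
with constant failure rates, so the rates simply add up) is an assumption
that the post itself does not state. The helper name mtbf_hours is
hypothetical.

```python
HOURS_PER_YEAR = 8760.0


def mtbf_hours(failure_rates_per_hour):
    """MTBF when any one independent cause produces an outage.

    Assumes constant failure rates, so the per-cause rates add up and
    the MTBF is the reciprocal of the total rate.
    """
    return 1.0 / sum(failure_rates_per_hour)


# Simple design: hypothetical outage rates per cause, in outages per hour.
simple = {
    "operator": 1 / (2 * HOURS_PER_YEAR),  # one outage every 2 years
    "software": 1 / (2 * HOURS_PER_YEAR),  # one outage every 2 years
    "hardware": 1 / (5 * HOURS_PER_YEAR),  # one outage every 5 years
}

# "Redundant" design: hardware outages are assumed to become 10x rarer,
# while the added complexity is assumed to double the operator and
# software outage rates.
redundant = {
    "operator": 2 * simple["operator"],
    "software": 2 * simple["software"],
    "hardware": simple["hardware"] / 10,
}

simple_mtbf = mtbf_hours(simple.values())
redundant_mtbf = mtbf_hours(redundant.values())

print(f"simple design MTBF:    {simple_mtbf / HOURS_PER_YEAR:.2f} years")
print(f"redundant design MTBF: {redundant_mtbf / HOURS_PER_YEAR:.2f} years")
```

Under these made-up rates the simple design comes out at roughly 0.83 years
between outages and the redundant one at roughly 0.50 years: the hardware
term almost vanishes, but the operator and software terms dominate the sum.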

-- 
  ++ytti

