more on SF outage

Peter Kranz pkranz at unwiredltd.com
Wed Jul 25 04:06:50 UTC 2007


Once the final analysis of this event is provided, the cause will likely
turn out to be one of the redundant systems failing to handle the event as
designed because of a software or other low-level fault. It's a very complex
system designed to exceed anything in the region as far as redundancy goes,
but as a result it's got a lot of moving parts and, like the space shuttle,
can fail unexpectedly. You can bet engineering is scratching their heads and
calling in the vendors to figure out what went wrong. Last time this
occurred it took weeks to pinpoint the root cause.





More information about the NANOG mailing list