nanog mailing list archives

Re: Amazon diagnosis


From: Jay Ashworth <jra () baylink com>
Date: Sun, 1 May 2011 14:10:40 -0400 (EDT)

----- Original Message -----
From: "Mike" <mike-nanog () tiedyenetworks com>

On 04/29/2011 12:35 PM, Joly MacFie wrote:
*http://aws.amazon.com/message/65648/*<http://aws.amazon.com/message/65648/>

So, in a nut shell, Amazon had a single point of failure which touched
off this entire incident.

I am still waiting for proof that single points of failure can
realistically be completely eliminated from any moderately complicated
network environment / application. So far, I think murphy is still
winning on this one.

Well, in fairness to Amazon, let's ask this: did the failure occur *behind
a component interface they advertise as Reliable*?  Either way, was it possible
for a single customer to avoid that possible failure, and at what cost in
expansion of scope and money?

Cheers,
-- jra


Current thread: