That big Amazon S3 outage was caused by a typo, company admits

Originally published at: http://boingboing.net/2017/03/02/oops.html

5 Likes

Well, that explains the tinder Netflix, err, outage.

9 Likes

Well, sounds like they fixed it. We should, as a society, put more things on the Amazon servers!

4 Likes

“Wups.” ~ Darrel, the IT guy

9 Likes

RIP that tech’s job at Amazon. Unless the playbook was actually wrong, then RIP the guy that wrote it. And possibly RIP the developer that created one such command to take down so many servers at once, even though he probably created it correctly based on requirements. Maybe RIP the requirements writers or IT architect that allowed such a tool to exist. But all management overseers will be fine. Accountability lies with the little guy. We’ve all heard what it’s like to work there.

7 Likes

http://img.pandawhale.com/post-40444-Johnny-Airplane-Just-Kidding-g-T1Q3.gif

19 Likes

I got escorted out the door after a single digit mistake on a DNS update. Freaked me out at the time, I now wear it as a badge of honor (I may have uttered, “by the power of greyskull!!”)

24 Likes

With great power comes great responsibility.

1 Like

I’m actually impressed that they revealed so much detail to the world at large. I guess they needed to make a giant mea culpa to ensure that their clients using Amazon’s cloud knew Amazon was taking it seriously. Still at least it sounds like they’re doing the right thing (figuring out how their system allowed a single tech to create a massive DOS with a single typo, and fixing it so it no longer works that way) instead of doing the typical thing (figuring out how to find the right scapegoat to blame it on so they don’t have to admit that their system currently requires humans to be perfect operators in order to guarantee uptime).

7 Likes

Well, they’re telling us about the technical fix. Maybe not so much about the human resources fix.

3 Likes

"I brought down S3"
Achievement unlocked

27 Likes

Probably doing both.

9 Likes

That guy is going to have a failure mode named after him.

“Oh, cripes, I McKenzied the command line!”

9 Likes

‘Some people, when confronted with a problem, think “I know, I’ll use regular expressions.” Now they have two problems.’

12 Likes

Just one wrong bit and you’re in the shit

And this is why I always use a “?” Instead of typing out “PRINT.”

4 Likes

that’s a c64 reference, is it not?

1 Like

C64? NEVER!

4 Likes

$_ =~ s/./ohcrap/bg;

5 Likes