• On GameSpot: The All-Time Greatest Game Hero revealed
June 6, 2008 3:21 PM PDT

Amazon working again, but what went wrong?

by Stephen Shankland
  • Font size
  • Print
  • 17 comments

Update 4:36 p.m. PDT with outside comment about possible causes of the Amazon.com outage.

Amazon posted an apology placeholder page for broken links.

Amazon posted an apology placeholder page for broken links during a two-hour outage.

(Credit: Amazon.com)

A two-hour Amazon.com outage is over. Now on to the post-mortem: what triggered the problem?

Amazon declared itself clear of the problem this afternoon. "The Amazon retail site was down for approximately two hours earlier today beginning around 10:25 a.m. The site (is) back up," the company said in statement.

But as to the explanation, the company only hinted that its complicated computing infrastructure was, unsurprisingly, a culprit.

"Amazon's systems are very complex and on rare occasions, despite our best efforts, they may experience problems. We work to minimize any disruption and to get the site back as quickly as possible," the company said, declining to comment further.

Human error?
The most likely culprit was simple human error, in the estimation of Shawn White, director of operations for Keynote Systems, which monitors Web site availability.

"Some engineer might have made a particular change, not knowing it could cause a trickle-down effect" that eventually brought down the site.

For example, he said, somebody in charge of maintenance might have been directing Internet traffic to a particular group of servers, but selected the wrong group.

But at Amazon? "What I find still so surprising is it happened in the middle of the day. Typically you do that in off-peak hours," White said. "They rank on the top with performance and availability, consistently, time and time again."

Network attack?
Another possible explanation is an attack such as the distributed denial-of-service (DDOS) attack that struck Amazon and other high-profile sites in 2000. White thinks it unlikely, though, that a crushing load of network traffic brought Amazon down.

"These guys are experts at dealing with flash floods of users," including those that routinely arrive during peak shopping days. "Usually, when you see a site going under because of traffic issues or a denial-of-service attack, you see a gradual slowdown in performance and drop in availability. Here we saw at 10:16 a.m. it completely dropped off 100 percent."

Soups Ranjan, a senior member of the technical staff of network protection and management company Narus, hasn't yet found any attack evidence.

"It doesn't seem to be the result of a network-initiated attack, at least from my preliminary analysis from our probes," Ranjan said.

Human error may not sound as gripping a tale as a network attack, but there's plenty of drama for the people responsible. And it's the career-limiting variety of drama, said Illuminata analyst Gordon Haff, who hazarded a guess that Amazon's problem involved its front-end Web servers.

The security group of WebSense, a Web site and communications protection company, also saw no evidence Amazon's problem was security related.

CNET staff writer Robert Vamosi contributed to this report.

Stephen Shankland writes about a wide range of technology and products, but has a particular focus on browsers and digital photography. He joined CNET News in 1998 and since then also has covered Google, Yahoo, servers, supercomputing, Linux and open-source software, and science. E-mail Stephen, or follow him on Twitter at http://www.twitter.com/stshank.
advertisement
Click Here
Recent posts from News Blog
Nvidia puts NForce chipset development on hold
Opera 10 browser is here
Neil Young Archives Blu-ray: Rip off?
Acronis revises survey results about backup habits
Acronis miscalculates data on users' bad backup habits
Flickr co-founder presses beta button
Comcast, Sony open retail store
Cox to try coaxing the Internet into submission
Add a Comment (Log in or register) (17 Comments)
  • prev
  • 1
  • next
by J-Hawaii June 6, 2008 4:20 PM PDT
One word: Vista
Reply to this comment
by n3td3v June 6, 2008 6:07 PM PDT
It was U.S Air Force Cyber testing military botnet capabilities.
Reply to this comment
by esamos June 6, 2008 6:17 PM PDT
idiot
Reply to this comment
by jotadavida June 6, 2008 7:52 PM PDT
Akamai? The old gen CDN backbone days of the pioneering wild web west? Hmmm. Hardware does not beat software. Ever been to Akamai's NOC in Cambridge, MA? A sea of winking red lights?
Reply to this comment
by EcuadorHomesOnline June 7, 2008 7:14 AM PDT
They must be using Unix/Linux servers. For the last ten years, Cnet has ALWAYS reported what OS is being used ONLY if the affected site uses Windows servers. Cnet is hugely biased in that regard - Windows is the most reliable and scalable server system, but the biased Cnet reporting is such that they only mention the OS in a website outage if it's a Windows server - so if they don't mention what OS is being used on the affected site, then it's probably Linux/Unix that's at fault.
Reply to this comment
by EcuadorHomesOnline June 7, 2008 7:21 AM PDT
Amazon must be using Linux/Unix servers. For the past ten years, Cnet has had very biased reporting. If a website goes down and the site is using Windows servers, that fact is always mentioned in the article. If the website is not using Windows servers, then they don't mention what OS the site uses - so Amazon.com must be using Linux/Unix or some other operating system less reliable than Windows.
Reply to this comment
by EcuadorHomesOnline June 7, 2008 8:30 AM PDT
It said that the message failed to post, so I re-typed it.
Reply to this comment
by darylicked June 7, 2008 9:56 AM PDT
LOL less reliable than windows. thats a good one.

by the way, its down again as of 9:50 am saturday pacific time
Reply to this comment
by darylicked June 7, 2008 9:56 AM PDT
LOL less reliable than windows. thats a good one.

by the way, its down again as of 9:50 am saturday pacific time
Reply to this comment
by Demolition June 7, 2008 5:30 PM PDT
Actually, Amazon does use Linux. They have been since 2001. They switched away from Windows because it was too costly to maintain and the uptime was abysmal. Meanwhile, Linux costs them 25% less to operate and maintain, and the uptime/availability has been stellar

Now, before someone says "But, if Linux' uptime is so good, then what about this two hour outage?" Well, if Shawn White, Soups Ranjan, and Gordon Haff are correct, then it was due to human error, and not some problem with Linux.
Reply to this comment
by Wookiee-1138 June 7, 2008 7:33 PM PDT
It was probably due to the flood of MGS4 preorders.
Reply to this comment
by jimwhite467 June 8, 2008 8:29 AM PDT
The logic is very straightforward. When it runs Windows, it is the OS's fault. When it is Linux, it is human error. Duh.
Reply to this comment
by Dr_Zinj June 9, 2008 7:36 AM PDT
"God! What a rat's nest of Christmas-treed plugs and wires. Let's see, it's got to be one of these plugs here with a black cable ..."
Reply to this comment
by popservationsj June 9, 2008 11:47 AM PDT
Happened again to me about a half-hour ago. Same error message when going to the site, while trying to browse certain items further, etc.
Reply to this comment
by kwilsonjr June 9, 2008 3:22 PM PDT
There are still some problems. I got that page (in the example gif) just now while trying to download some mp3's.
Reply to this comment
by as901 June 11, 2008 3:55 AM PDT
It could not have been Linux. Almost all of the top 100 companies use Linux, and they were not down. Perhaps the trouble is that many folks are now using the latest Microsoft program, and Amazon was trying to write Java that would work with Microsoft's latest misadventure?

My advice would be to stick with Linux and standard Sun Java only!
Reply to this comment
by benjaminstraight July 29, 2008 3:56 PM PDT
It doesn't matter, it works now.
Reply to this comment
(17 Comments)
  • prev
  • 1
  • next
advertisement

S.F. hacker space: Heaven for the DIY set?

The Noisebridge hacker space offers sewing and Mandarin classes, soldering workshops, Internet-controlled front door access, and a server room with no door.
• Photos: Circuits, code, community

The browser battles go on and on

roundup From Firefox to IE and from Chrome to Opera and Safari, there's no sitting still for browser makers looking to keep their products fresh and competitive.

About News Blog

Recent posts on technology, trends, and more.

Add this feed to your online news reader

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right