Skype: Overloaded Servers Caused Ripple Effect

Skype says last week's massive outage was triggered when a cluster of servers became overloaded, causing congestion and capacity problems that were exacerbated by a bug in the latest version of Skype's Windows software client.

Data Center Knowledge

December 29, 2010

Skype has published a detailed explanation of the causes of last week's massive outage for the Internet telephony service. The root cause: a cluster of servers became overloaded, causing congestion and capacity problems that were exacerbated by a bug in the latest version of Skype's Windows software client.

As a result, the problems rippled through Skype's peer-to-peer infrastructure, taking out many of the supernodes that are key to the smooth operation of Skype's network. A supernode acts like a directory, helping to establish connections between Skype clients and creating local clusters that typically contain several hundred peer nodes.

"Although Skype staff responded quickly to disable the overloaded servers and to eliminate client requests to them, a significant number of supernodes had already failed," writes Skype's CIO Lars Rabbe. "Once a supernode has failed, even when restarted, it takes some time to become available as a resource to the P2P network again. As a result, the P2P network was left with 25–30% fewer supernodes than normal. This caused a disproportionate load on the remaining available supernodes."
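To get a feel for why losing 25–30% of supernodes puts a "disproportionate load" on the rest, here is a rough back-of-the-envelope sketch. It assumes, purely for illustration, that total client demand stays constant and is spread evenly across the surviving supernodes; Skype's post-mortem does not describe its load model, and the function name `load_multiplier` is hypothetical.

```python
def load_multiplier(fraction_lost: float) -> float:
    """Per-node load relative to normal after losing a fraction of nodes.

    Illustrative assumption (not from Skype): total load is unchanged and
    is shared evenly by the nodes that remain.
    """
    if not 0.0 <= fraction_lost < 1.0:
        raise ValueError("fraction_lost must be in [0, 1)")
    return 1.0 / (1.0 - fraction_lost)


# The range of supernode loss quoted in the article.
for lost in (0.25, 0.30):
    print(f"{lost:.0%} fewer supernodes -> "
          f"{load_multiplier(lost):.2f}x load per remaining node")
```

Under that simplified assumption, each remaining supernode would carry roughly 1.33 to 1.43 times its normal load, so the increase per node is larger than the fraction of nodes lost. Real traffic would likely be less evenly distributed, and clients retrying failed connections could add further load.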

Rabbe also outlined the steps Skype will take to prevent a recurrence, which focus primarily on better testing of software updates and improved bug detection processes. There may also be additional spending on core infrastructure. "We will keep under constant review the capacity of our core systems that support the Skype user base, and continue to invest in both capacity and resilience of these systems," Rabbe wrote. "An investment program we initiated a year ago has significantly increased our capacity already and more investment is planned for 2011."

Read Post Mortem on the Skype Outage for details.


