Detailed Overview of Content Delivery Networks (CDNs)

Today, I found a wonderful post from Royal Pingdom that covers in detail Content Delivery Networks, their services, challenges, outlook and perspective, future predictions, etc… If you have ever wanted to know more about CDNs or wondered what needs they service, this is the article for you. Clearly with online video and the file sizes that go along with it, CDNs are crucial for delivering online video with speed and reliability. Thanks Peter for allowing us to share this with ReelSEO readers ;-) Below is the article:

Content Delivery Networks help websites and services to deliver online content in an efficient manner. Since they act transparently behind the scenes, CDNs are often anonymous contributors to large online services.

To get a little more insight, we spoke to the CDN providers Voxel dot Net, Panther Express, EdgeCast Networks and IndyNET to get some information on how they work, their perspective on the CDN market and the challenges they face, and what the future looks like for CDNs.

Why use a CDN?

CDNs seem to be used everywhere these days, but why is that? Why don’t more companies set up their own infrastructure? Gad Stanislas, Vice President of Panther Express, explains why CDNs are so widely used.

“Increasingly, companies around the globe view digital content delivery as a utility service, much like electricity and telecom,” he says. “But few companies have the ability or the desire to create their own CDN infrastructure in order to implement an effective digital content strategy. As a result, they are outsourcing content delivery to CDN providers. For these companies, a CDN provider must offer fast, reliable delivery on terms that are highly flexible and affordable.”

This equation of cost, convenience and performance is echoed by the other CDNs as well. Anthony Citrano at EdgeCast sums it up:

“The ideal CDN provides end users with the highest possible performance and quality of content delivery, regardless of the user location or connection to the Internet, while at the same time reducing the overall cost of content delivery.”

Lots of servers, lots of locations

Much of the advantage that CDNs offer comes from serving content from multiple geographical locations, unlike traditional hosting providers who usually have just one location. Visitors to a site can therefore be routed to a relatively nearby and high-performance “cache” of content, which gives less latency and more optimal transfer rates.

While just the number of servers doesn’t tell the whole story about size or performance (where server specifications can play a huge role, as well as storage solutions and network setups), we thought it would be interesting to get a picture of what is needed to run a CDN. Below are some numbers from the companies we spoke to:

  • Panther Express
    800 servers in 22 different geographical locations.
  • IndyNET Limited
    80 servers in 11 different geographical locations.
  • Voxel dot Net
    2000+ servers in 14 different geographical locations.
  • EdgeCast Networks
    Wouldn’t disclose the number of servers but say they have 13 different geographical locations.

These locations have to have the same content replicated and synchronized between them (i.e. acting as caches of content). The system also needs to be able to quickly route users to the location closest to them.

Jasper Bryant-Greene, Director of IndyNET, explains more about how they handle their content distribution: “Static content is delivered to our network via a REST API. This content is propagated around the entire network within approximately 10 seconds. Clients are directed to the closest server via GeoDNS and IP Anycast.”

Linux and high-speed web servers

Because of the specific needs of CDNs they often use customized software. All of the CDNs we spoke to ran Linux, except EdgeCast who use a combination of Windows and Unix. IndyNET told us that Gentoo Linux is their primary choice but they also use Debian and Ubuntu.

Panther Express has their own custom-built web server software, Voxel uses a customized version of Apache, and IndyNET uses the high-performance Lighttpd web server. These networks serve huge amounts of data so it really makes sense for them to use software that is optimized for fast content delivery.

Monitoring is crucial

Monitoring is crucial for hosting companies and other online service providers, and it may be even truer for CDNs due to their strong focus on delivery performance. And it’s not all about performance, either. Availability is another important factor. If part of a CDN stops working, this needs to be noticed immediately. It is clear that monitoring is taken very seriously by CDNs.

IndyNET handle all monitoring themselves: “We utilize the fantastic open-source monitoring system Nagios both for monitoring our own network and as a platform for selling monitoring to customers,” says Jasper Bryant-Greene.

Panther Express and Voxel, on the other hand, have chosen to use a mix of internal and external monitoring tools and services.

“Our internal monitoring tools run frequent health checks that perform data pulls on test objects,” says Gad Stanislas. “In addition, pings and trace routes are run to measure latency and network health. All data centers are interconnected via a software mesh to determine real-time data center health in the area of latency, packet loss and node performance, which is fed back to our Network Operations Center. Our external monitoring tools include Nagios to determine infrastructure health, Smokeping to monitor network links globally, Cactii to monitor all routers and network providers for any oversubscribed ports, latency, and bandwidth utilization, Gomez and Keynote for third party performance metrics.”

Since reacting to problems is just as important as detecting them, CDNs benefit from having automatic procedures in place to handle at least the most common errors. For example, Panther Express claim they can update their server mappings within 20 seconds of any issue (so if a data center goes down or has performance issues, that location can be excluded from the CDN until it works again).

The biggest challenge of running a CDN

Regarding the challenges of running a CDN, there seems to be a consensus that scaling and properly adding capacity is something that CDNs constantly have to deal with.

“Staying far ahead of customer growth curves is our largest challenge,” says Zachary Smith, President of Voxel.

According to Gad Stanislas of Panther Express, the challenge is two-fold. “The biggest challenge in running a CDN is capacity planning. The second important consideration is gathering critical metrics on system performance.”

This is not surprising when you consider that CDNs are basically selling delivery capacity. The importance of gathering metrics to keep track of this cannot be understated.

The future of CDNs

The internet as a whole is growing with more and more video delivery and other demanding services. How will CDNs adapt and shape themselves to handle the increased demands, or in short, what will CDNs look like in the future?

Jasper Bryant-Greene thinks they will look much like they do today, “but with better technologies for locality, and nodes distributed closer to the customer.”

“Larger caches will be required to serve increasingly larger files such as HD video versus standard video files,” says Gad Stanislas. “We also anticipate increasing concerns regarding security and content protection.”

Zachary Smith believes in service integration with hosting solutions. “CDN services will be an extension of a client’s hosting infrastructure. It will be up to the hosting integrator to provide a solution that ‘just works’ to scale their clients.”

Pingdom would like to extend a big thank you to Jasper Bryant-Greene of IndyNET, Anthony Citrano of EdgeCast, Gad Stanislas of Panther Express and Zachary Smith of Voxel dot Net, who took time out of their busy days to speak to us.

Here are a few of the CDN’s that were interviewed for this post:

Voxel dot Net logo
Since 1999, Voxel has powered the high performance needs of a demanding client base by offering scalable delivery of high-bandwidth media, applications and content.
www.voxel.net
Panther Express logo
Panther Express is a global content delivery network provider established in 2005 by the former CEO of DoubleClick, Kevin Ryan, and the co-founder and former CTO of DoubleClick, Dwight Merriman.
www.pantherexpress.net
EdgeCast logo
EdgeCast was founded in 2006 by a group of successful technology entrepreneurs who have years of experience building network infrastructure, web hosting, and application delivery products and companies.
www.edgecast.com
IndyNET Limited logo
IndyNET is a worldwide content distribution and diverse monitoring network headquartered in Christchurch, New Zealand.
indy.net.nz

Thanks again Peter!

Topics:
About the Author - Mark R Robertson
Mark Robertson is the Founder and Creator of ReelSEO.com. Mark has had extensive experience in online marketing, particularly in Search Engine Marketing (SEM), Local Search, Video Marketing, and Search Engine Optimization (SEO). You can follow Mark on Twitter @markrrobertson

View more articles from Mark R Robertson
web video university
  • Hi Mark,
    I was wondering if you had any opinion or data on whether delivery of video via a CDN had a negative impact in terms of SEO.

    From what I've been told, there are potentially two issues:

    1. The content isn't from my server
    The page content is coming from myserver.com, but the actual video content source is on another server. So the "google juice" for that video content goes to the server holding the video rather than the site.

    2. The video content comes from different locations
    Using a CDN the content is often served from different locations. I'm in Sydney Australia. Sometimes the video I'm watching is getting served from Sydney, while other times it is coming from Hong Kong or San Francisco. This is a problem if geographic location is important for SEO efforts (which it is for us - we make sure that servers for a particular country are located in a datacentre in that country)

    However, from a development point of view, using a CDN has a huge benefit- particularly for video content. So, I was wondering if you'd run into this issue before and had any comment or thoughts on the matter

    thanks

    Ben
  • Hey Ben... Good question. Well, here is what I know.. Google doesn\\'t seem to apply link juice to anything but the HTML page that contains a video at this point. As for any impact, what I have seen has more to do with issues currently in getting thumbnails to appear in search results and getting video sitemaps to show up. So long as Google can \\"find\\" the video itself, in my case, FLV\\'s, they seem to have little problem. In fact, the easiest way is to actually point to the FLV directly. In tests I have done, I am successfully getting Google to index videos that are served through Amazon\\'s S3. It also has a lot to do with the Flash player publishing method - One of our authors - Glenn Gabe - wrote a great post about this with regard to serving dynamic data in a flash player and the search engine\\'s ability to index information contained within - <a href="http://www.searchenginejournal.com/flash-seo/9292...">" target="_blank">http://www.searchenginejournal.com/flash-seo/9292...

    I think that you do bring up a good point then when it comes to the future of Video indexing, specifically with regard to the international concern. However, with other content served via CDNs, usually it is masked to the domain using the service and so Ive never had problems with video, or images to this point. Does that help at all?
blog comments powered by Disqus