DL Workshop | Inside the Web in 2022

Tejas Narechania (pictured above) is an assistant professor of law at UC Berkeley, and Nick Merrill is the director of the Daylight Lab at the UC Berkeley Center for Long-Term Cybersecurity. Together they presented Inside the Internet, a book they are co-writing.

Setting the Stage

Tejas opens the talk by setting the stage for how the internet works. Under the old system of the early 2000s, if Jane wanted to communicate with Joe, her traffic would first go to her ISP, which would hand it off to a tier 2 provider, then to a tier 1 provider, then to a different tier 1 provider, then to another tier 2 provider, and finally to Joe's ISP, which delivered it to Joe. This is the old, decentralized story, and the structure has two problems. One is latency, which became more important with the rise of Web 2.0 and e-commerce. The other is security: DDoS (distributed denial-of-service) attacks became increasingly common, and in 2000 a 16-year-old was able to take down the biggest and best-known sites on the internet. CDNs (content delivery networks) emerged in response and address both problems. They reduce latency by distributing content across a network of geographically dispersed servers, and they improve security by using their wider view of internet traffic to identify what is malicious.

Current View

The dominant view is that the market for internet traffic exchange is highly competitive. On this view, many providers across several classes of service compete to carry traffic. The ISP market is not competitive, but because the US government has viewed the inside of the internet as competitive, it has not regulated the market for internet traffic exchange.

But …

The questions we must ask ourselves are: What have CDNs given us? What have they taken away? And what is the state of the market? Recently, there have been two widespread internet outages due to Fastly, a CDN, which suggests that the CDN market might not be as competitive as initially thought.

Nick Merrill

Nick continues the talk by answering three questions. One, how do we know what the market for CDNs looks like? Two, what does that market look like? Three, so what?

To answer the first question, Nick outlines an experiment that he and his team undertook. They made requests to the top one million websites and analyzed the responses to see whether each site uses a CDN and, if so, which one. The results: 11 providers control 99% of the CDN market, 5 firms control 96%, and 80.7% of websites that use a CDN use Cloudflare. The Herfindahl-Hirschman Index (HHI) of the CDN market is 6559, where 2500 is the threshold for a highly concentrated market. But out of all websites on the internet, only 22.6% use a CDN at all. This leads to the real question: what proportion of user-facing bits pass through a CDN? The answer is roughly 76%.
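To make the measurement concrete, here is a minimal Python sketch of the two steps: guessing a site's CDN from its response headers, and computing the HHI from the resulting counts. It is an illustration under stated assumptions, not the team's actual method. The header hints in CDN_HEADER_HINTS, the function names, and the sample responses are all hypothetical, and real CDN detection would likely also draw on DNS records and IP ranges. The HHI itself is the standard definition: the sum of squared market shares, with shares expressed in percent.

```python
from collections import Counter

# Illustrative header hints only (an assumption, not the talk's method).
# Real-world detection is messier and usually combines several signals.
CDN_HEADER_HINTS = {
    "cf-ray": "Cloudflare",
    "x-amz-cf-id": "Amazon CloudFront",
    "x-fastly-request-id": "Fastly",
}


def guess_cdn(headers):
    """Return a CDN name guessed from response header names, or None."""
    lowered = {name.lower() for name in headers}
    for hint, cdn in CDN_HEADER_HINTS.items():
        if hint in lowered:
            return cdn
    return None


def hhi(counts):
    """Herfindahl-Hirschman Index: sum of squared shares, in percent."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no observations to compute shares from")
    return sum((100 * n / total) ** 2 for n in counts.values())


# Made-up response headers standing in for the top-sites crawl.
sample_responses = [
    {"CF-RAY": "abc", "Server": "cloudflare"},
    {"cf-ray": "def"},
    {"X-Amz-Cf-Id": "xyz"},
    {"Server": "nginx"},  # no hint matched: treated as "no CDN detected"
]

# Count only the sites where a CDN was detected, then score concentration.
counts = Counter(
    cdn for cdn in map(guess_cdn, sample_responses) if cdn is not None
)
print(counts)
print(round(hhi(counts)))
```

As a sanity check on the reported figure, Cloudflare's 80.7% share alone contributes 80.7² ≈ 6512 to the index, so nearly all of the 6559 total comes from a single provider.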

So what?

This centralization leads to two potential problems: one is cyberattacks and the other is speech. A state-sponsored attacker could potentially take down a CDN. Russia and China are potential adversaries who don't have as much to lose as it might seem. Furthermore, the most realistic way of mediating speech on the internet is to go to the CDNs first. In fact, the Great Firewall of China is just a large state-run CDN that Chinese ISPs and service providers are forced to use.


