Sampling biases in IP topology measurements
- 1 March 2004
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 332-341
- https://doi.org/10.1109/infcom.2003.1208685
Abstract
Considerable attention has been focused on the properties of graphs derived from Internet measurements. Router-level topologies collected via traceroute studies have led some authors to conclude that the router graph of the Internet is a scale-free graph, or more generally a power-law random graph. In such a graph, the degree distribution of nodes follows a distribu- tion with a power-law tail. In this paper we argue that the evidence to date for this conclusion is at best insufficient. We show that graphs appearing to have power-law degree distributions can arise surprisingly easily, when sampling graphs whose true degree distribution is not at all like a power-law. For example, given a classical Erd ¨ os-Renyi sparse, random graph, the subgraph formed by a collection of shortest paths from a small set of random sources to a larger set of random destinations can easily appear to show a degree distribution remarkably like a power-law. We explore the reasons for how this effect arises, and show that in such a setting, edges are sampled in a highly biased manner. This insight allows us to distinguish measurements taken from the Erd¨ os-Renyi graphs from those taken from power-law random graphs. When we apply this distinction to a number of well-known datasets, we find that the evidence for sampling bias in these datasets is strong.Keywords
This publication has 16 references indexed in Scilit:
- Measuring ISP Topologies With RocketfuelIEEE/ACM Transactions on Networking, 2004
- Network topology generatorsPublished by Association for Computing Machinery (ACM) ,2002
- Issues with inferring Internet topological attributesPublished by SPIE-Intl Soc Optical Eng ,2002
- On the geographic location of internet resourcesPublished by Association for Computing Machinery (ACM) ,2002
- On the efficiency of multicastIEEE/ACM Transactions on Networking, 2001
- The Diameter of Sparse Random GraphsAdvances in Applied Mathematics, 2001
- On the marginal utility of network topology measurementsPublished by Association for Computing Machinery (ACM) ,2001
- A random graph model for massive graphsPublished by Association for Computing Machinery (ACM) ,2000
- On power-law relationships of the Internet topologyACM SIGCOMM Computer Communication Review, 1999
- On routes and multicast trees in the InternetACM SIGCOMM Computer Communication Review, 1998