SNAP, Small-world Network Analysis and Partitioning: An open-source parallel graph framework for the exploration of large-scale networks
- 1 April 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2008 IEEE International Symposium on Parallel and Distributed Processing
Abstract
We present SNAP (small-world network analysis and partitioning), an open-source graph framework for exploratory study and partitioning of large-scale networks. To illustrate the capability of SNAP, we discuss the design, implementation, and performance of three novel parallel community detection algorithms that optimize modularity, a popular measure for clustering quality in social network analysis. In order to achieve scalable parallel performance, we exploit typical network characteristics of small-world networks, such as the low graph diameter, sparse connectivity, and skewed degree distribution. We conduct an extensive experimental study on real-world graph instances and demonstrate that our parallel schemes, coupled with aggressive algorithm engineering for small-world networks, give significant running time improvements over existing modularity-based clustering heuristics, with little or no loss in clustering quality. For instance, our divisive clustering approach based on approximate edge betweenness centrality is more than two orders of magnitude faster than a competing greedy approach, for a variety of large graph instances on the Sun Fire T2000 multicore system. SNAP also contains parallel implementations of fundamental graph-theoretic kernels and topological analysis metrics (e.g., breadth-first search, connected components, vertex and edge centrality) that are optimized for small- world networks. The SNAP framework is extensible; the graph kernels are modular, portable across shared memory multicore and symmetric multiprocessor systems, and simplify the design of high-level domain-specific applications.Keywords
This publication has 30 references indexed in Scilit:
- Advanced Shortest Paths Algorithms on a Massively-Multithreaded ArchitecturePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Modularity and community structure in networksProceedings of the National Academy of Sciences, 2006
- Community detection in complex networks using extremal optimizationPhysical Review E, 2005
- Data Streams: Algorithms and ApplicationsFoundations and Trends® in Theoretical Computer Science, 2005
- Spectral Analysis of Random Graphs with Skewed Degree DistributionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Finding community structure in very large networksPhysical Review E, 2004
- Graph-based technologies for intelligence analysisCommunications of the ACM, 2004
- The Structure and Function of Complex NetworksSIAM Review, 2003
- On the Eigenvalue Power LawPublished by Springer Nature ,2002
- Randomized search treesAlgorithmica, 1996