Group and topic discovery from relations and text
- 21 August 2005
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
We present a probabilistic generative model of entity relationships and textual attributes that simultaneously discovers groups among the entities and topics among the corresponding text. Block-models of relationship data have been studied in social network analysis for some time. Here we cluster in several modalities at once, incorporating the words associated with certain relationships. Significantly, joint inference allows the discovery of groups to be guided by the emerging topics, and vice versa. We present experimental results on two large data sets: sixteen years of bills put before the U.S. Senate, comprising their corresponding text and voting records, and 43 years of similar data from the United Nations. We show that in comparison with traditional, separate latent-variable models for words or blockstructures for votes, the Group-Topic model's joint inference improves both the groups and topics discovered.
This publication has 12 references indexed in Scilit:
- How does Europe Make Its Mind Up? Connections, cliques, and compatibility between countries in the Eurovision Song Contest. Physica A: Statistical Mechanics and its Applications, 2006
- Power to the Parties: Cohesion and Competition in the European Parliament, 1979–2001. British Journal of Political Science, 2005
- Information-theoretic co-clustering. Published by Association for Computing Machinery (ACM), 2003
- On Measuring Partisanship in Roll-Call Voting: The U.S. House of Representatives, 1877-1999. American Journal of Political Science, 2002
- Estimation and Prediction for Stochastic Blockstructures. Journal of the American Statistical Association, 2001
- Agglomerative clustering of a search engine query log. Published by Association for Computing Machinery (ACM), 2000
- A comparison of artificial and human organizations. Journal of Economic Behavior & Organization, 1996
- The application of network analysis to criminal intelligence: An assessment of the prospects. Social Networks, 1991
- A Theory of Group Stability. American Sociological Review, 1991
- Aranda and Alyawara kinship: a quantitative argument for a double helix model. American Ethnologist, 1979