Pseudo-likelihood methods for community detection in large sparse networks

When:
2014-02-11 @ 15:30 – 16:30
2014-02-11T15:30:00-05:00
2014-02-11T16:30:00-05:00
Where:
Sidney Smith Hall, Room 2118
100 Saint George Street
University of Toronto - St. George Campus, Toronto, ON M5S 3G3
Canada

Tuesday February 11, 2014 at 3:30pm

Sidney Smith Hall, Room 2118

*Refreshments will be served at 3:15pm

Pseudo-likelihood methods for community detection in large sparse networks 

Dr. Arash Amini, University of Michigan

We consider the problem of community detection in a network, that is, partitioning the nodes into groups that, in some sense, reveal the structure of the network. Many algorithms have been proposed for fitting network models with communities, but most of them do not scale well to large networks, and often fail on sparse networks. We present a fast pseudo-likelihood method for fitting the stochastic block model, a well-known model for networks with communities, as well as a variant that allows for an arbitrary degree distribution by conditioning on degrees.

We provide empirical results showing that the algorithms perform well under a range of settings, including on very sparse networks, and illustrate on the example of a network of political blogs. We also present spectral clustering with perturbations, a method of independent interest, which works well on sparse networks where regular spectral clustering fails, and use it to provide an initial value for pseudo-likelihood. We discuss theoretical results showing that pseudo-likelihood provides consistent estimates of the communities under mild conditions on the starting value, for the case of a block model with two communities. Time permitting, we give some insights as to why perturbations help with spectral clustering on sparse networks.

 

 

http://www.utstat.toronto.edu/wordpress/wp-content/uploads/2014/01/ArashAminiFeb112014.pdf

 

Leave a Reply