Discovering social circles in ego networks
|
Tools and Resources
Share: |
||||||||||||||||||||||
ABSTRACTPeople's personal social networks are big and cluttered, and currently there is no good way to automatically organize them. Social networking sites allow users to manually categorize their friends into social circles (e.g., “circles” on Google+, and “lists” on Facebook and Twitter). However, circles are laborious to construct and must be manually updated whenever a user's network grows. In this article, we study the novel task of automatically identifying users' social circles. We pose this task as a multimembership node clustering problem on a user's ego network, a network of connections between her friends. We develop a model for detecting circles that combines network structure as well as user profile information. For each circle, we learn its members and the circle-specific user profile similarity metric. Modeling node membership to multiple circles allows us to detect overlapping as well as hierarchically nested circles. Experiments show that our model accurately identifies circles on a diverse set of data from Facebook, Google+, and Twitter, for all of which we obtain hand-labeled ground truth.
AUTHORS
|
|
||||||||||||||||||||||||||||||||||||||||
| View colleagues of Julian Mcauley | |||||||||||||||||||||||||||||||||||||||||
|
|
|||||||||||||||||||||||||||||||||||||||
| View colleagues of Jure Leskovec | ||||||||||||||||||||||||||||||||||||||||
REFERENCESNote: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
|
1
|
D. Agarwal, B.-C. Chen, P. Elango, N. Motgi, S.-T. Park, R. Ramakrishnan, S. Roy, and J. Zachariah. 2008. Online models for content optimization. In Neural Information Processing Systems.
|
|
|
2
|
Y.-Y. Ahn, J. Bagrow, and S. Lehmann. 2010. Link communities reveal multiscale complexity in networks. Nature.
|
|
|
3
|
||
| |
4
|
|
|
5
|
R. Balasubramanyan and W. Cohen. 2011. Block-LDA: Jointly modeling entity-annotated text and entity-entity links. In SIAM International Conference on Data Mining.
|
|
|
6
|
||
|
7
|
J. Chang and D. Blei. 2009. Relational topic models for document networks. In International Conference on Artificial Intelligence and Statistics.
|
|
| |
8
|
|
| |
9
|
|
|
10
|
Y. Chen and C. Lin. 2006. Combining SVMs with Various Feature Selection Strategies. Springer.
|
|
| |
11
|
|
|
12
|
Scott L. Feld. 1981. The focused organization of social ties. American Journal of Sociology.
|
|
|
13
|
||
|
14
|
Steve Gregory. 2010a. Finding overlapping communities in networks by label propagation. New Journal of Physics.
|
|
|
15
|
Steve Gregory. 2010b. Fuzzy overlapping communities in networks. CoRR abs/1010.1523.
|
|
|
16
|
P. Hammer, P. Hansen, and B. Simeone. 1984. Roof duality, complementation and persistency in quadratic 0-1 optimization. Mathematical Programming.
|
|
|
17
|
M. Handcock, A. Raftery, and J. Tantrum. 2007a. Model-based clustering for social networks. Journal of the Royal Statistical Society Series A.
|
|
|
18
|
Mark S. Handcock, Adrian E. Raftery, and Jeremy M. Tantrum. 2007b. Model-based clustering for social networks. Journal of the Royal Statistical Society.
|
|
|
19
|
M. B. Hastings. 2006. Community detection as an inference problem. Physical Review E.
|
|
|
20
|
David Haussler. 1999. Convolution Kernels on Discrete Structures. Technical Report. University of California at Santa Cruz.
|
|
|
21
|
Peter D. Hoff, Adrian E. Raftery, and Mark S. Handcock. 2002. Latent space approaches to social network analysis. Journal of the American Statistical Association.
|
|
|
22
|
S. Johnson. 1967. Hierarchical clustering schemes. Psychometrika.
|
|
|
23
|
D. Kim, Y. Jo, L.-C. Moon, and A. Oh. 2010. Analysis of Twitter lists as a potential source for discovering latent characteristics of users. In CHI.
|
|
|
24
|
||
|
25
|
||
|
26
|
P. Krivitsky, M. Handcock, A. Raftery, and P. Hoff. 2009. Representing degree distributions, clustering, and homophily in social networks with latent cluster random effects models. Social Networks.
|
|
|
27
|
Andrea Lancichinetti and Santo Fortunato. 2009a. Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Physical Review E.
|
|
|
28
|
A. Lancichinetti and S. Fortunato. 2009b. Community detection algorithms: A comparative analysis. arXiv:0908.1062.
|
|
|
29
|
Andrea Lancichinetti, Santo Fortunato, and Janos Kertesz. 2009. Detecting the overlapping and hierarchical community structure in complex networks. New Journal of Physics.
|
|
|
30
|
P. Lazarsfeld and R. Merton. 1954. Friendship as a social process: A substantive and methodological analysis. In Freedom and Control in Modern Society.
|
|
| |
31
|
|
| |
32
|
|
|
33
|
||
|
34
|
J. McAuley and J. Leskovec. 2012. Learning to discover social circles in ego networks. In Neural Information Processing Systems.
|
|
|
35
|
M. McPherson. 1983. An ecology of affiliation. American Sociological Review.
|
|
|
36
|
Miller McPherson, Lynn Smith-Lovin, and James M. Cook. 2001. Birds of a feather: Homophily in social networks. Annual Review of Sociology.
|
|
|
37
|
||
|
38
|
||
| |
39
|
Alan Mislove , Bimal Viswanath , Krishna P. Gummadi , Peter Druschel, You are who you know: inferring user profiles in online social networks, Proceedings of the third ACM international conference on Web search and data mining, February 04-06, 2010, New York, New York, USA [doi>10.1145/1718487.1718519]
|
|
40
|
||
|
41
|
M. Newman. 2006. Modularity and community structure in networks. In Proceedings of the National Academy of Sciences.
|
|
|
42
|
M. E. J. Newman. 2003. Fast algorithm for detecting community structure in networks. Physical Review E.
|
|
|
43
|
M. E. J. Newman. 2004. Detecting community structure in networks. European Physical Journal B.
|
|
|
44
|
M. E. J. Newman and G. T. Barkema. 1999. Monte Carlo Methods in Statistical Physics. Oxford University Press.
|
|
|
45
|
Jorge Nocedal. 1980. Updating quasi-Newton matrices with limited storage. Mathematics of Computation.
|
|
|
46
|
G. Palla, I. Derenyi, I. Farkas, and T. Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature.
|
|
|
47
|
M. A. Porter, J.-P. Onnela, and P. J. Mucha. 2009. Communities in networks.
|
|
|
48
|
E. Ravasz and A.-L. Barabási. 2003. Hierarchical organization in complex networks. Physical Review E.
|
|
|
49
|
C. Rother, V. Kolmogorov, V. Lempitsky, and M. Szummer. 2007. Optimizing binary MRFs via extended roof duality. In Computer Vision and Pattern Recognition.
|
|
|
50
|
||
|
51
|
Georg Simmel. 1964. Conflict and the Web of Group Affiliations. Simon and Schuster.
|
|
|
52
|
J. Ugander, B. Karrer, L. Backstrom, and C. Marlow. 2011. The Anatomy of the Facebook Social Graph. Preprint.
|
|
|
53
|
S. V. N. Vishwanathan and Alexander J. Smola. 2002. Fast kernels for string and tree matching. In Neural Information Processing Systems.
|
|
|
54
|
C. Volinsky and A. Raftery. 2000. Bayesian information criterion for censored survival models. Biometrics.
|
|
|
55
|
D. Vu, A. Asuncion, D. Hunter, and P. Smyth. 2011. Dynamic egocentric models for citation networks. In International Conference on Machine Learning.
|
|
| |
56
|
|
|
57
|
||
|
58
|
||
|
59
|
J. Zhao. 2011. Examining the evolution of networks based on lists in Twitter. In International Conference on Internet Multimedia System Architectures and Applications.
|
CITED BY20 Citations
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
INDEX TERMSThe ACM Computing Classification System (CCS rev.2012)
PUBLICATION| Title | ACM Transactions on Knowledge Discovery from Data (TKDD) - Casin special issue TKDD Homepage table of contents archive |
| Volume 8 Issue 1, February 2014 | |
| Article No. | 4 |
| Publication Date | 2014-02-01 (yyyy-mm-dd) |
| Publisher | ACM New York, NY, USA |
| ISSN: 1556-4681 EISSN: 1556-472X doi>10.1145/2556612 |
REVIEWS
A social circle in a user's ego network is a group of interconnected people that have common attributes between themselves and the user. In order to automatically detect such circles, an unsupervised/semi-supervised learning approach is designed in this paper to optimize the circle and user profile similarity parameters by using the network structure and user profile information. Researchers in information retrieval, social media, ego networks, and machine learning areas will want to study this work.
The proposed approach models circles to be latent graph variables that are inferred from the user's ego network. Within each graph and driven by edges that are “likely to form within circles and unlikely to form outside of them,” the model computes the edge forming probability in a circle using the profile similarity of the edge's two user nodes, “rewarding edges that appear within circles ... and penalizing edges that appear outside of circles” via a tradeoff constant. Then the proposed machine learning algorithms calculate the profile similarity parameter. If the learning algorithm only relies on the user node attributes and edge information, the algorithm is unsupervised. If the learning algorithm starts with user-labeled members of circles, the algorithm becomes semi-supervised.
The proposed algorithms have been evaluated on a social network “dataset of 1,143 ego networks and 5,541 ground-truth circles obtained from Facebook, Google+, and Twitter.” Compared to baseline methods considering network structure, profile information, or both, the proposed model has achieved decent performance and scalability in finding disjoint, overlapping, and hierarchically nested circles.
Online Computing Reviews Service
COMMENTSBe the first to comment To Post a comment please sign in or create a free Web account
Table of ContentsVolume 8 Issue 1, February 2014
| Introduction to special issue on computational aspects of social and information networks: Theory, methodologies, and applications (TKDD-CASIN) | |
| Wei Chen, Jie Tang | |
| Article No.: 1 | |
| doi>10.1145/2556608 | |
Full text: PDF
|
|
| Uncovering social network Sybils in the wild | |
| Zhi Yang, Christo Wilson, Xiao Wang, Tingting Gao, Ben Y. Zhao, Yafei Dai | |
| Article No.: 2 | |
| doi>10.1145/2556609 | |
Full text: PDF
|
|
|
Sybil accounts are fake identities created to unfairly increase the power or resources of a single malicious user. Researchers have long known about the existence of Sybil accounts in online communities such as file-sharing systems, but they have not ...
expand
|
|
| Scalable and axiomatic ranking of network role similarity | |
| Ruoming Jin, Victor E. Lee, Longjie Li | |
| Article No.: 3 | |
| doi>10.1145/2518176 | |
Full text: PDF
|
|
|
A key task in analyzing social networks and other complex networks is role analysis: describing and categorizing nodes according to how they interact with other nodes. Two nodes have the same role if they interact with equivalent sets of neighbors. ...
expand
|
|
| Discovering social circles in ego networks | |
| Julian Mcauley, Jure Leskovec | |
| Article No.: 4 | |
| doi>10.1145/2556612 | |
Full text: PDF
|
|
|
People's personal social networks are big and cluttered, and currently there is no good way to automatically organize them. Social networking sites allow users to manually categorize their friends into social circles (e.g., “circles” on Google+, ...
expand
|
|
| A separability framework for analyzing community structure | |
| Bruno Abrahao, Sucheta Soundarajan, John Hopcroft, Robert Kleinberg | |
| Article No.: 5 | |
| doi>10.1145/2527231 | |
Full text: PDF
|
|
|
Four major factors govern the intricacies of community extraction in networks: (1) the literature offers a multitude of disparate community detection algorithms whose output exhibits high structural variability across the collection, (2) communities ...
expand
|
|
| User behavior learning and transfer in composite social networks | |
| Erheng Zhong, Wei Fan, Qiang Yang | |
| Article No.: 6 | |
| doi>10.1145/2556613 | |
Full text: PDF
|
|
|
Accurate prediction of user behaviors is important for many social media applications, including social marketing, personalization, and recommendation. A major challenge lies in that although many previous works model user behavior from only historical ...
expand
|