Contact The DL Team Contact Us | Switch to tabbed view

top of pageABSTRACT

People's personal social networks are big and cluttered, and currently there is no good way to automatically organize them. Social networking sites allow users to manually categorize their friends into social circles (e.g., “circles” on Google+, and “lists” on Facebook and Twitter). However, circles are laborious to construct and must be manually updated whenever a user's network grows. In this article, we study the novel task of automatically identifying users' social circles. We pose this task as a multimembership node clustering problem on a user's ego network, a network of connections between her friends. We develop a model for detecting circles that combines network structure as well as user profile information. For each circle, we learn its members and the circle-specific user profile similarity metric. Modeling node membership to multiple circles allows us to detect overlapping as well as hierarchically nested circles. Experiments show that our model accurately identifies circles on a diverse set of data from Facebook, Google+, and Twitter, for all of which we obtain hand-labeled ground truth.

top of pageAUTHORS



Author image not provided  Julian Mcauley

No contact information provided yet.

Bibliometrics: publication history
Publication years2006-2016
Publication count28
Citation Count424
Available for download16
Downloads (6 Weeks)320
Downloads (12 Months)3,106
Downloads (cumulative)9,087
Average downloads per article567.94
Average citations per article15.14
View colleagues of Julian Mcauley


Jure Leskovec Jure Leskovec

Homepage
jureatcs.stanford.edu
Bibliometrics: publication history
Publication years2005-2016
Publication count85
Citation Count3,507
Available for download70
Downloads (6 Weeks)1,604
Downloads (12 Months)16,278
Downloads (cumulative)88,588
Average downloads per article1,265.54
Average citations per article41.26
View colleagues of Jure Leskovec

top of pageREFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Agarwal, B.-C. Chen, P. Elango, N. Motgi, S.-T. Park, R. Ramakrishnan, S. Roy, and J. Zachariah. 2008. Online models for content optimization. In Neural Information Processing Systems.
 
2
Y.-Y. Ahn, J. Bagrow, and S. Lehmann. 2010. Link communities reveal multiscale complexity in networks. Nature.
 
3
4
 
5
R. Balasubramanyan and W. Cohen. 2011. Block-LDA: Jointly modeling entity-annotated text and entity-entity links. In SIAM International Conference on Data Mining.
 
6
 
7
J. Chang and D. Blei. 2009. Relational topic models for document networks. In International Conference on Artificial Intelligence and Statistics.
8
9
 
10
Y. Chen and C. Lin. 2006. Combining SVMs with Various Feature Selection Strategies. Springer.
11
 
12
Scott L. Feld. 1981. The focused organization of social ties. American Journal of Sociology.
 
13
 
14
Steve Gregory. 2010a. Finding overlapping communities in networks by label propagation. New Journal of Physics.
 
15
Steve Gregory. 2010b. Fuzzy overlapping communities in networks. CoRR abs/1010.1523.
 
16
P. Hammer, P. Hansen, and B. Simeone. 1984. Roof duality, complementation and persistency in quadratic 0-1 optimization. Mathematical Programming.
 
17
M. Handcock, A. Raftery, and J. Tantrum. 2007a. Model-based clustering for social networks. Journal of the Royal Statistical Society Series A.
 
18
Mark S. Handcock, Adrian E. Raftery, and Jeremy M. Tantrum. 2007b. Model-based clustering for social networks. Journal of the Royal Statistical Society.
 
19
M. B. Hastings. 2006. Community detection as an inference problem. Physical Review E.
 
20
David Haussler. 1999. Convolution Kernels on Discrete Structures. Technical Report. University of California at Santa Cruz.
 
21
Peter D. Hoff, Adrian E. Raftery, and Mark S. Handcock. 2002. Latent space approaches to social network analysis. Journal of the American Statistical Association.
 
22
S. Johnson. 1967. Hierarchical clustering schemes. Psychometrika.
 
23
D. Kim, Y. Jo, L.-C. Moon, and A. Oh. 2010. Analysis of Twitter lists as a potential source for discovering latent characteristics of users. In CHI.
 
24
 
25
 
26
P. Krivitsky, M. Handcock, A. Raftery, and P. Hoff. 2009. Representing degree distributions, clustering, and homophily in social networks with latent cluster random effects models. Social Networks.
 
27
Andrea Lancichinetti and Santo Fortunato. 2009a. Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Physical Review E.
 
28
A. Lancichinetti and S. Fortunato. 2009b. Community detection algorithms: A comparative analysis. arXiv:0908.1062.
 
29
Andrea Lancichinetti, Santo Fortunato, and Janos Kertesz. 2009. Detecting the overlapping and hierarchical community structure in complex networks. New Journal of Physics.
 
30
P. Lazarsfeld and R. Merton. 1954. Friendship as a social process: A substantive and methodological analysis. In Freedom and Control in Modern Society.
31
32
 
33
 
34
J. McAuley and J. Leskovec. 2012. Learning to discover social circles in ego networks. In Neural Information Processing Systems.
 
35
M. McPherson. 1983. An ecology of affiliation. American Sociological Review.
 
36
Miller McPherson, Lynn Smith-Lovin, and James M. Cook. 2001. Birds of a feather: Homophily in social networks. Annual Review of Sociology.
 
37
 
38
39
 
40
 
41
M. Newman. 2006. Modularity and community structure in networks. In Proceedings of the National Academy of Sciences.
 
42
M. E. J. Newman. 2003. Fast algorithm for detecting community structure in networks. Physical Review E.
 
43
M. E. J. Newman. 2004. Detecting community structure in networks. European Physical Journal B.
 
44
M. E. J. Newman and G. T. Barkema. 1999. Monte Carlo Methods in Statistical Physics. Oxford University Press.
 
45
Jorge Nocedal. 1980. Updating quasi-Newton matrices with limited storage. Mathematics of Computation.
 
46
G. Palla, I. Derenyi, I. Farkas, and T. Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature.
 
47
M. A. Porter, J.-P. Onnela, and P. J. Mucha. 2009. Communities in networks.
 
48
E. Ravasz and A.-L. Barabási. 2003. Hierarchical organization in complex networks. Physical Review E.
 
49
C. Rother, V. Kolmogorov, V. Lempitsky, and M. Szummer. 2007. Optimizing binary MRFs via extended roof duality. In Computer Vision and Pattern Recognition.
 
50
 
51
Georg Simmel. 1964. Conflict and the Web of Group Affiliations. Simon and Schuster.
 
52
J. Ugander, B. Karrer, L. Backstrom, and C. Marlow. 2011. The Anatomy of the Facebook Social Graph. Preprint.
 
53
S. V. N. Vishwanathan and Alexander J. Smola. 2002. Fast kernels for string and tree matching. In Neural Information Processing Systems.
 
54
C. Volinsky and A. Raftery. 2000. Bayesian information criterion for censored survival models. Biometrics.
 
55
D. Vu, A. Asuncion, D. Hunter, and P. Smyth. 2011. Dynamic egocentric models for citation networks. In International Conference on Machine Learning.
56
 
57
 
58
 
59
J. Zhao. 2011. Examining the evolution of networks based on lists in Twitter. In International Conference on Internet Multimedia System Architectures and Applications.

top of pageCITED BY

20 Citations

 
 
 
 
 
 
 

top of pageINDEX TERMS

The ACM Computing Classification System (CCS rev.2012)

Note: Larger/Darker text within each node indicates a higher relevance of the materials to the taxonomic classification.

top of pagePUBLICATION

Title ACM Transactions on Knowledge Discovery from Data (TKDD) - Casin special issue TKDD Homepage table of contents archive
Volume 8 Issue 1, February 2014
Article No. 4
Publication Date2014-02-01 (yyyy-mm-dd)
PublisherACM New York, NY, USA
ISSN: 1556-4681 EISSN: 1556-472X doi>10.1145/2556612

top of pageREVIEWS


Reviewer: Yingjie Li

A social circle in a user's ego network is a group of interconnected people that have common attributes between themselves and the user. In order to automatically detect such circles, an unsupervised/semi-supervised learning approach is designed in this paper to optimize the circle and user profile similarity parameters by using the network structure and user profile information. Researchers in information retrieval, social media, ego networks, and machine learning areas will want to study this work.

The proposed approach models circles to be latent graph variables that are inferred from the user's ego network. Within each graph and driven by edges that are “likely to form within circles and unlikely to form outside of them,” the model computes the edge forming probability in a circle using the profile similarity of the edge's two user nodes, “rewarding edges that appear within circles ... and penalizing edges that appear outside of circles” via a tradeoff constant. Then the proposed machine learning algorithms calculate the profile similarity parameter. If the learning algorithm only relies on the user node attributes and edge information, the algorithm is unsupervised. If the learning algorithm starts with user-labeled members of circles, the algorithm becomes semi-supervised.

The proposed algorithms have been evaluated on a social network “dataset of 1,143 ego networks and 5,541 ground-truth circles obtained from Facebook, Google+, and Twitter.” Compared to baseline methods considering network structure, profile information, or both, the proposed model has achieved decent performance and scalability in finding disjoint, overlapping, and hierarchically nested circles.

  Online Computing Reviews Service

Computing Reviews logo

top of pageCOMMENTS

Be the first to comment To Post a comment please sign in or create a free Web account

top of pageTable of Contents

ACM Transactions on Knowledge Discovery from Data (TKDD) - Casin special issue

Volume 8 Issue 1, February 2014

Table of Contents
Introduction to special issue on computational aspects of social and information networks: Theory, methodologies, and applications (TKDD-CASIN)
Wei Chen, Jie Tang
Article No.: 1
doi>10.1145/2556608
Full text: PDFPDF
Uncovering social network Sybils in the wild
Zhi Yang, Christo Wilson, Xiao Wang, Tingting Gao, Ben Y. Zhao, Yafei Dai
Article No.: 2
doi>10.1145/2556609
Full text: PDFPDF

Sybil accounts are fake identities created to unfairly increase the power or resources of a single malicious user. Researchers have long known about the existence of Sybil accounts in online communities such as file-sharing systems, but they have not ...
expand
Scalable and axiomatic ranking of network role similarity
Ruoming Jin, Victor E. Lee, Longjie Li
Article No.: 3
doi>10.1145/2518176
Full text: PDFPDF

A key task in analyzing social networks and other complex networks is role analysis: describing and categorizing nodes according to how they interact with other nodes. Two nodes have the same role if they interact with equivalent sets of neighbors. ...
expand
Discovering social circles in ego networks
Julian Mcauley, Jure Leskovec
Article No.: 4
doi>10.1145/2556612
Full text: PDFPDF

People's personal social networks are big and cluttered, and currently there is no good way to automatically organize them. Social networking sites allow users to manually categorize their friends into social circles (e.g., “circles” on Google+, ...
expand
A separability framework for analyzing community structure
Bruno Abrahao, Sucheta Soundarajan, John Hopcroft, Robert Kleinberg
Article No.: 5
doi>10.1145/2527231
Full text: PDFPDF

Four major factors govern the intricacies of community extraction in networks: (1) the literature offers a multitude of disparate community detection algorithms whose output exhibits high structural variability across the collection, (2) communities ...
expand
User behavior learning and transfer in composite social networks
Erheng Zhong, Wei Fan, Qiang Yang
Article No.: 6
doi>10.1145/2556613
Full text: PDFPDF

Accurate prediction of user behaviors is important for many social media applications, including social marketing, personalization, and recommendation. A major challenge lies in that although many previous works model user behavior from only historical ...
expand

Powered by The ACM Guide to Computing Literature


The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2017 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us
Did you know the ACM DL App is now available?
Did you know your Organization can subscribe to the ACM Digital Library?
The ACM Guide to Computing Literature
All Tags
Export Formats
 
 
Save to Binder