Contact The DL Team Contact Us | Switch to tabbed view

top of pageABSTRACT

This article reviews the state-of-the-art in overlapping community detection algorithms, quality measures, and benchmarks. A thorough comparison of different algorithms (a total of fourteen) is provided. In addition to community-level evaluation, we propose a framework for evaluating algorithms' ability to detect overlapping nodes, which helps to assess overdetection and underdetection. After considering community-level detection performance measured by normalized mutual information, the Omega index, and node-level detection performance measured by F-score, we reached the following conclusions. For low overlapping density networks, SLPA, OSLOM, Game, and COPRA offer better performance than the other tested algorithms. For networks with high overlapping density and high overlapping diversity, both SLPA and Game provide relatively stable performance. However, test results also suggest that the detection in such networks is still not yet fully resolved. A common feature observed by various algorithms in real-world networks is the relatively small fraction of overlapping nodes (typically less than 30%), each of which belongs to only 2 or 3 communities.

top of pageAUTHORS



Author image not provided  Jierui Xie

No contact information provided yet.

Bibliometrics: publication history
Publication years2009-2014
Publication count8
Citation Count175
Available for download3
Downloads (6 Weeks)87
Downloads (12 Months)1,052
Downloads (cumulative)4,394
Average downloads per article1,464.67
Average citations per article21.88
View colleagues of Jierui Xie


Author image not provided  Stephen Kelley

No contact information provided yet.

Bibliometrics: publication history
Publication years2008-2013
Publication count6
Citation Count104
Available for download1
Downloads (6 Weeks)75
Downloads (12 Months)900
Downloads (cumulative)3,692
Average downloads per article3,692.00
Average citations per article17.33
View colleagues of Stephen Kelley


Author image not provided  Boleslaw K. Szymanski

No contact information provided yet.

Bibliometrics: publication history
Publication years2011-2016
Publication count3
Citation Count121
Available for download1
Downloads (6 Weeks)75
Downloads (12 Months)900
Downloads (cumulative)3,692
Average downloads per article3,692.00
Average citations per article40.33
View colleagues of Boleslaw K. Szymanski

top of pageREFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Ahn, Y.-Y., Bagrow, J. P., and Lehmann, S. 2010. Link communities reveal multiscale complexity in networks. Nature 466, 761--764.
2
 
3
Arenas, A., Diaz-Guilera, A., and Perez-Vicente, C. J. 2006. Synchronization reveals topological scales in complex networks. Phys. Rev. Lett. 96, 11.
 
4
Ball, B., Karrer, B., and Newman, M. E. J. 2011. Efficient and principled method for detecting communities in networks. Phys. Rev. E 84, 3.
 
5
Baumes, J., Goldberg, M., Krishnamoorthy, M., Magdon-Ismail, M., and Preston, N. 2005. Finding communities by clustering a graph into overlapping subgraphs. In Proceedings of the IADIS International Conference on Applied Computing. 97--104.
 
6
Bianconi, G., Pin, P., and Marsili, M. 2008. Assessing the relevance of node features for network structure. Proc. Natl. Acad. Sci. USA 106, 28, 7.
 
7
Blatt, M., Wiseman, S., and Domany, E. 1996. Superparamagnetic clustering of data. Phys. Rev. Lett. 76, 3251--3254.
 
8
Boguna, M., Pastor-Satorras, R., Diaz-Guilera, A., and Arenas, A. 2004. Models of social networks based on social distance attachment. Phys. Rev. E 70, 5.
 
9
 
10
 
11
 
12
 
13
Chen, D., Shang, M., Lv, Z., and Fu, Y. 2010a. Detecting overlapping communities of weighted networks via a local algorithm. Physica A 389, 19, 4177--4187.
 
14
 
15
 
16
Collins, L. M. and Dent, C. W. 1988. Omega: A general formulation of the rand index of cluster recovery suitable for non-disjoint solutions. Multivar. Behav. Res. 23, 2, 231--242.
 
17
 
18
Danon, L., Duch, J., Arenas, A., and Diaz-Guilera, A. 2005. Comparing community structure identification. J. Stat. Mech. Thoer. Exp. 2005, 9.
 
19
Davis, G. B. and Carley, K. 2008. Clearing the fog: Fuzzy, overlapping groups for social networks. Soc. Netw. 30, 3, 201--212.
 
20
Ding, F., Luo, Z., Shi, J., and Fang, X. 2010. Overlapping community detection by kernel-based fuzzy affinity propagation. In Proceedings of the International Workshop on Indoor Spatial Awareness (ISA'10). 1--4.
21
 
22
Evans, T. 2010. Clique graphs and overlapping communities. J. Stat. Mech.-Theor. Exp. 2010, 12.
 
23
Evans, T. and Lambiotte, R. 2010. Line graphs of weighted networks for overlapping communities. Euro. Phys. J. B 77, 265.
 
24
Evans, T. S. and Lambiotte, R. 2009. Line graphs, link partitions and overlapping communities. Phys. Rev. E 80, 1.
 
25
Farkas, I., Abel, D., Palla, G., and Vicsek, T. 2007. Weighted network modules. New J. Phys. 9, 6, 180.
 
26
Fisher, D. C. 1989. Lower bounds on the number of triangles in a graph. J. Graph Theor. 13, 4, 505--512.
 
27
Fortunato, S. 2010. Community detection in graphs. Phys. Rep. 486, 75--174.
 
28
Frey, B. J. and Dueck, D. 2007. Clustering by passing messages between data points. Sci. 315, 972--976.
 
29
 
30
 
31
Gfeller, D., Chappelier, J.-C., and de Los Rios, P. 2005. Finding instabilities in the community structure of complex networks. Phys. Rev. E 72, 5.
 
32
Girvan, M. and Newman, M. E. J. 2002. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA 99, 12, 7821--7826.
 
33
 
34
 
35
Gregory, S. 2009. Finding overlapping communities using disjoint community detection algorithms. CompleNet 207, 47--61.
 
36
Gregory, S. 2010. Finding overlapping communities in networks by label propagation. New J. Phys. 12, 10.
 
37
Gregory, S. 2011. Fuzzy overlapping communities in networks. J. Stat. Mech. 2011, 2.
 
38
Guimera, R., Sales-Pardo, M., and Amaral, L. A. N. 2004. Modularity from fluctuations in random graphs and complex networks. Phys. Rev. E 70, 2.
 
39
Havemann, F., Heinz, M., Struck, A., and Glaser, J. 2011. Identification of overlapping communities and their hierarchy by locally calculating community-changing resolution levels. J. Statist. Mech. 2011, 1.
 
40
Hubert, L. and Arabie, P. 1985. Comparing partitions. J. Classif. 2, 193--218.
 
41
Hullermeier, E. and Rifqi, M. 2009. A fuzzy variant of the rand index for comparing clustering structures. In Proceedings of the Joint International Fuzzy Systems Association World Congress and European Society of Fuzzy Logic and Technology Conference. 1294--1298.
 
42
Jin, D., Yang, B., Baquero, C., Liu, D., He, D., and Liu, J. 2011. A markov random walk under constraint for discovering overlapping communities in complex networks. J. Statist. Mech. 2011, 5.
 
43
Karrer, B., Levina, E., and Newman, M. E. J. 2008. Robustness of community structure in networks. Phys. Rev. E 77, 4.
 
44
Kelley, S. 2009. The existence and discovery of overlapping communities in large-scale networks. Ph.D. thesis, Rensselaer Polytechnic Institute, Troy, NY.
 
45
Kelley, S., Goldberg, M., Magdon-Ismail, M., Mertsalov, K., and Wallace, A. 2011. Defining and discovering communities in social networks. In Handbook of Optimization in Complex Networks, Springer, 139--168.
 
46
Kim, Y. and Jeong, H. 2011. The map equation for link community (unpublished). http://stat.kaist.ac.kr/∼hjeong/papers/2011_Map.pdf.
 
47
Kovacs, I. A., Palotai, R., Szalay, M., and Csermely, P. 2010. Community landscapes: An integrative approach to determine overlapping network module hierarchy, identify key nodes and predict network dynamics. PLoS ONE 5, 9.
 
48
Kumpula, J. M., Kivela, M., Kaski, K., and Saramaki, J. 2008. Sequential algorithm for fast clique percolation. Phys. Rev. E 78, 2.
 
49
Lancichinetti, A. and Fortunato, S. 2009. Community detection algorithms: A comparative analysis. Phys. Rev. E 80, 5.
 
50
Lancichinetti, A., Fortunato, S., and Kertesz, J. 2009. Detecting the overlapping and hierarchical community structure of complex networks. New J. Phys. 11, 3.
 
51
Lancichinetti, A., Fortunato, S., and Radicchi, F. 2008. Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78, 4.
 
52
Lancichinetti, A., Radicchi, F., Ramasco, J. J., and Fortunato, S. 2011. Finding statistically significant communities in networks. PLoS ONE 6, 4.
 
53
Langfelder, P. and Horvath, S. 2008. WGCNA: An r package for weighted correlation network analysis. BMC Bioinf. 1, 559.
 
54
Latouche, P., Birmele, E., and Ambroise, C. 2011. Overlapping stochastic block models with application to the french political blogosphere. Annals Appl. Statist. 5, 309--336.
 
55
Lee, C., Reid, F., Mcdaid, A., and Hurley, N. 2010. Detecting highly overlapping community structure by greedy clique expansion. In Proceedings of the 4<sup>th</sup> Workshop on Social Network Mining and Analysis held in Conjunction with the International Conference on Knowledge Discovery and Data Mining (SNA/KDD'10). 33--42.
56
57
 
58
Leskovec, J., Lang, K. J., Dasgupta, A., and Mahoney, M. W. 2009. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. Internet Math. 6, 29--123.
59
 
60
Li, D., Leyva, I., Almendral, J., Sendina-Nadal, I., Buldu, J., Havlin, S., and Boccaletti, S. 2008. Synchronization interfaces and overlapping communities in complex networks. Phys. Rev. Lett. 101, 16.
 
61
Lu, Q., Korniss, G., and Szymanski, B. K. 2009. The naming game in social networks: Community formation and consensus engineering. J. Econ. Interact. Coord. 4, 221--235.
 
62
Magdon-Ismail, M. and Purnell, J. 2011. Fast overlapping clustering of networks using sampled spectral distance embedding and gmms. Tech. rep., Rensselaer Polytechnic Institute, Troy, NY.
 
63
Massen, C. and Doye, J. 2005. Identifying communities within energy landscapes. Phys. Rev. E 71, 4.
 
64
Massen, C. and Doye, J. 2007. Thermodynamics of community structure. Preprint arXiv:con-mat/0610077v1.
 
65
 
66
 
67
Moon, J. and Moser, L. 1965. On cliques in graphs. Israel J. Math. 3, 23--28.
 
68
Nepusz, T., Petroczi, A., Negyessy, L., and Bazso, F. 2008. Fuzzy communities and the concept of bridgeness in complex networks. Phys. Rev. E 77, 1.
 
69
Newman, M. E. J. 2006. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74, 3.
 
70
Newman, M. E. J. and Leicht, E. A. 2007. Mixture models and exploratory analysis in networks. Proc. Natl. Acad. Sci. USA 104, 9564--9569.
 
71
Newman, M. E. J., Strogatz, S. H., and Watts, D. J. 2001. Random graphs with arbitrary degree distributions and their applications. Phys. Rev. E 64, 2.
 
72
Nicosia, V., Mangioni, G., Carchiolo, V., and Malgeri, M. 2009. Extending the definition of modularity to directed graphs with overlapping communities. J. Stat. Mech. 2009, 3.
 
73
Nowicki, K. and Snijders, T. A. B. 2001. Estimation and prediction for stochastic blockstructures. J. Amer. Statist. Assoc. 96, 455, 1077--1087.
 
74
Padrol-Sureda, A., Perarnau-Llobet, G., Pfeifle, J., and Munts-Mulero, V. 2010. Overlapping community search for social networks. In Proceedings of the 26<sup>th</sup> International Conference on Data Engineering (ICDE'10). 992--995.
 
75
Palla, G., Derenyi, I., Farkas, I., and Vicsek, T. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 814--818.
 
76
Psorakis, I., Roberts, S., Ebden, M., and Sheldon, B. 2011. Overlapping community detection using bayesian non-negative matrix factorization. Phys. Rev. E 83, 6.
 
77
Raghavan, U. N., Albert, R., and Kumara, S. 2007. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76, 3.
 
78
 
79
Reichardt, J. and Bornholdt, S. 2004. Detecting fuzzy community structures in complex networks with a potts model. Phys. Rev. Lett. 93, 2.
 
80
Reichardt., J. and Bornholdt, S. 2006a. Statistical mechanics of community detection. Phys. Rev. E 74, 1.
 
81
Reichardt, J. and Bornholdt, S. 2006b. When are networks truly modular? Physica D224, 20--26.
 
82
 
83
Ren, W., Yan, G., Liao, X., and Xiao, L. 2009. Simple probabilistic algorithm for detecting community structure. Phys. Rev. E 79, 3.
 
84
Richardson, M., Agrawal, R., and Domingos, P. 2003. Trust management for the semantic web. In Proceedings of the 2<sup>nd</sup> International Semantic Web Conference (ISWC'03). Lecture Notes in Computer Science, vol. 2870. Springer, 351--368.
 
85
 
86
Ronhovde, P. and Nussinov, Z. 2009. Multiresolution community detection for megascale networks by information-based replica correlations. Phys. Rev. E 80, 1.
 
87
Rosvall, M. and Bergstrom, C. T. 2008. Maps of random walks on complex networks reveal community structure. Proc. Natl. Acad. Sci. 105, 1118--1123.
 
88
Sawardecker, E., Sales-Pardo, M., and Amaral, L. 2009. Detection of node group membership in networks with group overlap. Euro. Phys. J. B67, 277.
 
89
Shen, H., Cheng, X., Cai, K., and Hu, M.-B. 2009a. Detect overlapping and hierarchical community structure. Physica A388, 1706.
 
90
Shen, H., Cheng, X., and Guo, J. 2009b. Quantifying and identifying the overlapping community structure in networks. J. Stat. Mech. 2009, 7, 9.
 
91
Wang, X., Jiao, L., and Wu, J. 2009. Adjusting from disjoint to overlapping community detection of complex networks. Physica A388, 5045--5056.
 
92
White, S. and Smyth, P. 2005. A spectral clustering approach to finding communities in graphs. In Proceedings of the SIAM International Conference on Data Mining. 76--84.
 
93
Wu, Z., Lin, Y., Wan, H., and Tian, S. 2010. A fast and reasonable method for community detection with adjustable extent of overlapping. In Proceedings of the Conference on Intelligent Systems and Knowledge Engineering (ISKE'10). 376--379.
 
94
 
95
 
96
 
97
Zarei, M., Izadi, D., and Samani, K. A. 2009. Detecting overlapping community structure of networks based on vertex-vertex correlations. J. Stat. Mech. 2009, 11.
 
98
Zhang, S., Wang, R.-S., and Zhang, X.-S. 2007a. Identification of overlapping community structure in complex networks using fuzzy c-means clustering. Physica A374, 483--490.
 
99
Zhang, S., Wang, R.-S., and Zhang, X.-S. 2007b. Uncovering fuzzy community structure in complex networks. Phys. Rev. E 76, 4.
100
 
101
Zhao, K., Zhang, S.-W., and Pan, Q. 2010. Fuzzy analysis for overlapping community structure of complex network. In Proceedings of the Chinese Control and Decision Conference (CCDC'10). 3976--3981.

top of pageCITED BY

91 Citations

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

top of pageINDEX TERMS

The ACM Computing Classification System (CCS rev.2012)

Note: Larger/Darker text within each node indicates a higher relevance of the materials to the taxonomic classification.

top of pagePUBLICATION

Title ACM Computing Surveys (CSUR) Surveys Homepage table of contents archive
Volume 45 Issue 4, August 2013
Article No. 43
Publication Date2013-08-01 (yyyy-mm-dd)
Funding Sources Office of Naval Research
U.S. Army Research Laboratory
PublisherACM New York, NY, USA
ISSN: 0360-0300 EISSN: 1557-7341 doi>10.1145/2501654.2501657

top of pageREVIEWS


Reviews are not available for this item
Computing Reviews logo

top of pageCOMMENTS

Be the first to comment To Post a comment please sign in or create a free Web account

top of pageTable of Contents

ACM Computing Surveys (CSUR)

Volume 45 Issue 4, August 2013

Table of Contents
Architectures of flexible symmetric key crypto engines—a survey: From hardware coprocessor to multi-crypto-processor system on chip
Lilian Bossuet, Michael Grand, Lubos Gaspar, Viktor Fischer, Guy Gogniat
Article No.: 41
doi>10.1145/2501654.2501655
Full text: PDFPDF

Throughput, flexibility, and security form the design trilogy of reconfigurable crypto engines; they must be carefully considered without reducing the major role of classical design constraints, such as surface, power consumption, dependability, and ...
expand
Semantic trajectories modeling and analysis
Christine Parent, Stefano Spaccapietra, Chiara Renso, Gennady Andrienko, Natalia Andrienko, Vania Bogorny, Maria Luisa Damiani, Aris Gkoulalas-Divanis, Jose Macedo, Nikos Pelekis, Yannis Theodoridis, Zhixian Yan
Article No.: 42
doi>10.1145/2501654.2501656
Full text: PDFPDF

Focus on movement data has increased as a consequence of the larger availability of such data due to current GPS, GSM, RFID, and sensors techniques. In parallel, interest in movement has shifted from raw movement data analysis to more application-oriented ...
expand
Overlapping community detection in networks: The state-of-the-art and comparative study
Jierui Xie, Stephen Kelley, Boleslaw K. Szymanski
Article No.: 43
doi>10.1145/2501654.2501657
Full text: PDFPDF

This article reviews the state-of-the-art in overlapping community detection algorithms, quality measures, and benchmarks. A thorough comparison of different algorithms (a total of fourteen) is provided. In addition to community-level evaluation, ...
expand
Near-duplicate video retrieval: Current research and future trends
Jiajun Liu, Zi Huang, Hongyun Cai, Heng Tao Shen, Chong Wah Ngo, Wei Wang
Article No.: 44
doi>10.1145/2501654.2501658
Full text: PDFPDF

The exponential growth of online videos, along with increasing user involvement in video-related activities, has been observed as a constant phenomenon during the last decade. User's time spent on video capturing, editing, uploading, searching, and viewing ...
expand
Survey and taxonomy of botnet research through life-cycle
Rafael A. Rodríguez-Gómez, Gabriel Maciá-Fernández, Pedro García-Teodoro
Article No.: 45
doi>10.1145/2501654.2501659
Full text: PDFPDF

Of all current threats to cybersecurity, botnets are at the top of the list. In consequence, interest in this problem is increasing rapidly among the research community and the number of publications on the question has grown exponentially in recent ...
expand
The state of peer-to-peer network simulators
Anirban Basu, Simon Fleming, James Stanier, Stephen Naicken, Ian Wakeman, Vijay K. Gurbani
Article No.: 46
doi>10.1145/2501654.2501660
Full text: PDFPDF

Networking research often relies on simulation in order to test and evaluate new ideas. An important requirement of this process is that results must be reproducible so that other researchers can replicate, validate, and extend existing work. We look ...
expand
A survey of trust in social networks
Wanita Sherchan, Surya Nepal, Cecile Paris
Article No.: 47
doi>10.1145/2501654.2501661
Full text: PDFPDF

Web-based social networks have become popular as a medium for disseminating information and connecting like-minded people. The public accessibility of such networks with the ability to share opinions, thoughts, information, and experience offers great ...
expand
A survey of checker architectures
Rajshekar Kalayappan, Smruti R. Sarangi
Article No.: 48
doi>10.1145/2501654.2501662
Full text: PDFPDF

Reliability is quickly becoming a primary design constraint for high-end processors because of the inherent limits of manufacturability, extreme miniaturization of transistors, and the growing complexity of large multicore chips. To achieve a high degree ...
expand
Analyzing and defending against web-based malware
Jian Chang, Krishna K. Venkatasubramanian, Andrew G. West, Insup Lee
Article No.: 49
doi>10.1145/2501654.2501663
Full text: PDFPDF

Web-based malware is a growing threat to today's Internet security. Attacks of this type are prevalent and lead to serious security consequences. Millions of malicious URLs are used as distribution channels to propagate malware all over the Web. After ...
expand
A survey of pipelined workflow scheduling: Models and algorithms
Anne Benoit, Ümit V. Çatalyürek, Yves Robert, Erik Saule
Article No.: 50
doi>10.1145/2501654.2501664
Full text: PDFPDF

A large class of applications need to execute the same workflow on different datasets of identical size. Efficient execution of such applications necessitates intelligent distribution of the application components and tasks on a parallel machine, and ...
expand
Separation of concerns in feature diagram languages: A systematic survey
Arnaud Hubaux, Thein Than Tun, Patrick Heymans
Article No.: 51
doi>10.1145/2501654.2501665
Full text: PDFPDF

The need for flexible customization of large feature-rich software systems, according to requirements of various stakeholders, has become an important problem in software development. Among the many software engineering approaches dealing with variability ...
expand
A survey on reactive programming
Engineer Bainomugisha, Andoni Lombide Carreton, Tom van Cutsem, Stijn Mostinckx, Wolfgang de Meuter
Article No.: 52
doi>10.1145/2501654.2501666
Full text: PDFPDF

Reactive programming has recently gained popularity as a paradigm that is well-suited for developing event-driven and interactive applications. It facilitates the development of such applications by providing abstractions to express time-varying values ...
expand
State-based model slicing: A survey
Kelly Androutsopoulos, David Clark, Mark Harman, Jens Krinke, Laurence Tratt
Article No.: 53
doi>10.1145/2501654.2501667
Full text: PDFPDF

Slicing is a technique, traditionally applied to programs, for extracting the parts of a program that affect the values computed at a statement of interest. In recent years authors have begun to consider slicing at model level. We present a detailed ...
expand
Decentralized resource discovery mechanisms for distributed computing in peer-to-peer environments
Daniel Lazaro, Joan Manuel Marques, Josep Jorba, Xavier Vilajosana
Article No.: 54
doi>10.1145/2501654.2501668
Full text: PDFPDF

Resource discovery is an important part of distributed computing and resource sharing systems, like grids and utility computing. Because of the increasing importance of decentralized and peer-to-peer environments, characterized by high dynamism and churn, ...
expand
Critical success factors in enterprise resource planning systems: Review of the last decade
Levi Shaul, Doron Tauber
Article No.: 55
doi>10.1145/2501654.2501669
Full text: PDFPDF

Organizations perceive ERP as a vital tool for organizational competition as it integrates dispersed organizational systems and enables flawless transactions and production. This review examines studies investigating Critical Success Factors (CSFs) in ...
expand

Powered by The ACM Guide to Computing Literature


The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2017 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us
Did you know the ACM DL App is now available?
Did you know your Organization can subscribe to the ACM Digital Library?
The ACM Guide to Computing Literature
All Tags
Export Formats
 
 
Save to Binder