There is a great deal of interest in analyzing data that is best represented as a graph. Examples include the WWW, social networks, biological networks, communication networks, and many others. The importance of being able to effectively mine and learn from such data is growing, as more and more structured and semi-structured data is becoming available.
Traditionally, a number of subareas have worked with mining and learning from graph structured data, including communities in graph mining, learning from structured data, statistical relational learning, inductive logic programming, and, moving beyond sub-disciplines in computer science, social network analysis, and, more broadly network science.
The objective of this workshop is to bring together researchers from a variety of these areas, and discuss commonality and differences in challenges faced, survey some of the different approaches, and provide a forum to present and learn about some of the most cutting edge research in this area. As an outcome, we expect participants to walk away with a better sense of the variety of different tools available for graph mining and learning, and an appreciation for some of the interesting emerging applications for mining and learning from graphs.
Proceeding Downloads
Time-based sampling of social network activity graphs
While most research in online social networks (OSNs) in the past has focused on static friendship networks, social network activity graphs are quite important as well. However, characterizing social network activity graphs is computationally intensive; ...
Structure, tie persistence and event detection in large phone and SMS networks
The effect of the network structure on the dynamics of social and communication networks has been of interest in recent years. It has been observed that network properties such as neighborhood overlap, clustering coefficient, etc. influence the tie ...
A compact representation of graph databases
Graph databases have emerged as an alternative data model with applications in many complex domains. Typically, the problems to be solved in such domains involve managing and mining huge graphs. The need for efficient processing in such applications has ...
Enhancing link-based similarity through the use of non-numerical labels and prior information
Several key applications like recommender systems require to compute similarities between the nodes (objects or entities) of a bipartite network. These similarities serve many important purposes, such as finding users sharing common interests or items ...
Network community discovery: solving modularity clustering via normalized cut
Modularity clustering is a recently introduced clustering objective function for graph clustering. It has been widely used in bioinformatics and social networks. Its relation to data mining field has yet to be explored. In this paper, we show that a ...
Analyzing graph databases by aggregate queries
An important step in data analysis is the exploration of data. For traditional relational databases one of the most powerful tools for performing such analysis is the relational database and the aggregates and rankings that they can compute: for ...
Multi-network fusion for collective inference
Although much of the recent work in statistical relational learning has focused on homogeneous networks, many relational domains naturally consist of multiple observed networks, where each network source records a different type of relationship between ...
Bayesian block modelling for weighted networks
This paper presents a Bayesian approach to block modelling weighted networks to identify role assignments. This data arises commonly in many forms of social networks where we have a count of the number of communications between users. By using ...
An efficient block model for clustering sparse graphs
Models for large, sparse graphs are found in many applications and are an active topic in machine learning research. We develop a new generative model that combines rich block structure and simple, efficient estimation by collapsed Gibbs sampling. Novel ...
Centrality metric for dynamic networks
Centrality is an important notion in network analysis and is used to measure the degree to which network structure contributes to the importance of a node in a network. While many different centrality measures exist, most of them apply to static ...
Design patterns for efficient graph algorithms in MapReduce
Graphs are analyzed in many important contexts, including ranking search results based on the hyperlink structure of the world wide web, module detection of proteinprotein interaction networks, and privacy analysis of social networks. Many graphs of ...
Document classification utilising ontologies and relations between documents
Two major types of relational information can be utilized in automatic document classification as background information: relations between terms, such as ontologies, and relations between documents, such as web links or citations in articles. We ...
Graph visualization with latent variable models
Graphs are central representations of information in many domains including biological and social networks. Graph visualization is needed for discovering underlying structures or patterns within the data, for example communities in a social network, or ...
Relational motif discovery via graph spectral ranking
Music summarization aims at finding the most representative parts of a music piece (motifs) that can be exploited for efficient music indexing. Here we present a novel approach for motif discovery in music pieces based on an graph spectral ranking. ...
Pruthak: mining and analyzing graph substructures
In many scientific and commercial domains, graph as a data structure has become increasingly important for modeling of sophisticated structures. In the past few years, there has been sharp increase in research on mining graph data. We had proposed a ...
Structural correlation pattern mining for large graphs
In this paper we define the Structural Correlation Pattern (SCP) mining problem, which consists of determining correlations among vertex attributes and dense components in an undirected graph. Vertex attributes play an important role in several real-...
Meaningful selection of temporal resolution for dynamic networks
The understanding of dynamics of data streams is greatly affected by the choice of temporal resolution at which the data are discretized, aggregated, and analyzed. Our paper focuses explicitly on data streams represented as dynamic networks. We propose ...
Community evolution detection in dynamic heterogeneous information networks
As the rapid development of all kinds of online databases, huge heterogeneous information networks thus derived are ubiquitous. Detecting evolutionary communities in these networks can help people better understand the structural evolution of the ...
Network quantification despite biased labels
The increasing availability of participatory web and social media presents enormous opportunities to study human relations and collective behaviors. Many applications involving decision making want to obtain certain generalized properties about the ...
Frequent subgraph discovery in dynamic networks
In many application domains, graphs are utilized to model entities and their relationships, and graph mining is important to detect patterns within these relationships. While the majority of recent data mining techniques deal with static graphs that do ...
Querying graphs with uncertain predicates
In many applications the available data give rise to an attributed graph, with the nodes corresponding to the entities of interest, edges to their relationships and attributes on both provide additional characteristics. To mine such data structures we ...
Frequent subgraph mining on a single large graph using sampling techniques
Frequent subgraph mining has always been an important issue in data mining. Several frequent graph mining methods have been developed for mining graph transactions. However, these methods become less usable when the dataset is a single large graph. Also,...
Index Terms
Proceedings of the Eighth Workshop on Mining and Learning with Graphs
Recommendations
Eighth workshop on mining and learning with graphs
The Eighth Workshop on Mining and Learning with Graphs (MLG)1was held at KDD 2010 in Washington DC. It brought together a variete of researchers interested in analyzing data that is best represented as a graph. Examples include the WWW, social networks, ...



