- Sponsor:
- sigmm
We are delighted to welcome you to the First ACM Workshop on Large-Scale Multimedia Retrieval and Mining (LS-MMRM 2009). Recent years have witnessed an explosive growth of multimedia content driven by the wide availability of massive storage devices, high-resolution video cameras and fast networks. Stimulated by recent progress in scalable machine learning, feature indexing and multi-modal analysis techniques, researchers are becoming increasingly interested in exploring challenges and new opportunities for developing much larger scale approaches for multimedia retrieval and mining. Many of these computationally-intensive ideas are now becoming practical because of the broader availability of high-speed clusters and the advent of cloud computing. This workshop aims to bring together researchers and industrial practitioners interested in large-scale multimedia retrieval and mining. The workshop will provide a venue for the participants to explore a variety of aspects and applications on how advanced multimedia analysis techniques can be leveraged to address the challenges in large-scale data collections.
In total we have received 31 submissions, each of which was reviewed by three members from the program committee. From these, we selected 7 papers for oral presentation and 10 posters, resulting in a full-day workshop. The program was supplemented with a keynote presentation on topics of particular interest, and a panel session on large-scale multimedia mining. We would like to thank the program committee and the additional reviewers for their hard work on reviewing submissions under such tight deadlines.
This year, we are excited to share all of our technical sessions with the Workshop on Web-Scale Multimedia Corpus (WSMC), another workshop also collocated with ACM Multimedia. We hope that the shared sessions will present attendees from both workshops the opportunity to interact on aspects of multimedia retrieval and mining that apply to specifically web-scale corpora..
Proceeding Downloads
Rethinking multimedia search in the "clients + cloud" era
The emergence of cloud computing is presenting us with unprecedented opportunities to redefine how people search for and interact with multimedia on client devices (such as PC, TV, and phone). In this talk, I will introduce some key trends in computing ...
Canonical contextual distance for large-scale image annotation and retrieval
To realize generic image recognition, the system needs to learn an enormous amount of targets in the world and their appearances. Therefore, visual knowledge acquisition using massive amounts of web images has been studied recently, and search-based ...
Image categorization combining neighborhood methods and boosting
We describe an efficient and scalable system for automatic image categorization. Our approach seeks to marry scalable "model-free" neighborhood-based annotation with accurate boosting-based per-tag modeling. For accelerated neighborhood-based ...
Semantics-preserving bag-of-words models for efficient image annotation
The Bag-of-Words (BoW) model is a promising image representation for annotation. One critical limitation of existing BoW models is the semantic loss during the codebook generation process, in which BoW simply clusters visual words in Euclidian space. ...
Leveraging large-scale weakly-tagged images to train inter-related classifiers for multi-label annotation
In this paper, we have developed a new multi-label multi-task learning framework to leverage large-scale weakly-tagged images for inter-related classifier training. A novel image and tag cleansing algorithm is developed for tackling the issues of spam, ...
Large-scale multimedia semantic concept modeling using robust subspace bagging and MapReduce
With the rapid growth of multimedia data, it becomes increasingly important to develop semantic concept modeling approaches that are consistently effective, highly efficient, and easily scalable. To this end, we first propose the robust subspace bagging ...
Indexing local configurations of features for scalable content-based video copy detection
Content-based video copy detection is relevant for structuring large video databases. The use of local features leads to good robustness to most types of photometric or geometric transformations. However, to achieve both good precision and good recall ...
From text question-answering to multimedia QA on web-scale media resources
With the proliferation of text and multimedia information, users are now able to find answers to almost any questions on the Web. Meanwhile, they are also bewildered by the huge amount of information routinely presented to them. Question-answering (QA) ...
On improving robustness of video fingerprints based on projections of features
In this paper, we study two methods to improve the robustness property of projection based hashing methods. For this class of hashing methods, a feature matrix is projected onto a set of projection matrices. Then, the projected values are compared to a ...
Image copy detection using a robust gabor texture descriptor
In this paper, we propose a novel scale and rotation invariant Gabor texture descriptor for content-based image copy detection. Firstly, the Gabor filters with different orientations and scales using a fixed-size window are constructed. Secondly, the ...
Near-duplicate detection for images and videos
In this paper, we describe a system for detecting duplicate images and videos in a large collection of multimedia data. Our system consists of three major elements: Local-Difference-Pattern (LDP) as the unified feature to describe both images and videos,...
Unsupervised image ranking
In the paper, we propose and test an unsupervised approach for image ranking. Prior solutions are based on image content and the similarity graph connecting images. We generalize this idea by directly estimating the likelihood of each photo in a feature ...
An efficient key point quantization algorithm for large scale image retrieval
We focus on the problem of large-scale near duplicate image retrieval. Recent studies have shown that local image features, often referred to as key points, are effective for near duplicate image retrieval. The most popular approach for key point based ...
Large scale content analysis engine
The evolution of IP video systems has resulted in unprecedented access to a wide range of video material for consumers via IPTV and Web delivery. Retrieval technologies help users find relevant content, but suffer from a paucity of reliable content ...
Tagging and retrieving images with co-occurrence models: from corel to flickr
This paper presents two models for content-based automatic image annotation and retrieval in web image repositories, based on the co-occurrence of tags and visual features in the images. In particular, we show how additional measures can be taken to ...
Topology selection for stream mining systems
Multi-concept identification in high volume multimedia streams is critical for a number of applications, including large-scale multimedia analysis, processing, and retrieval. Content of interest is filtered using a collection of binary classifiers that ...
Visual ContextRank for web image re-ranking
Previous web image re-ranking approaches usually construct similarity measure on image level. Considering the diversity of large scale web image database, these approaches ignore the difference of importance between target area and background area in ...
Large-scale news topic tracking and key-scene ranking with video near-duplicate constraints
To make full use of the overwhelming volume of news videos available today, it is necessary to track the development of news stories from different channels, mine their dependencies, and organize them in a semantic way. We propose a novel news topic ...




