10.1145/2461466.2461468acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedings
research-article

Retrieving geo-location of videos with a divide & conquer hierarchical multimodal approach

Published:16 April 2013

ABSTRACT

This paper presents a strategy to identify the geographic location of videos. First, it relies on a multi-modal cascade pipeline that exploits the available sources of information, namely the user's upload history, his social network and a visual-based matching technique. Second, we present a novel divide & conquer strategy to better exploit the tags associated with the input video. It pre-selects one or several geographic area of interest of higher expected relevance and performs a deeper analysis inside the selected area(s) to return the coordinates most likely to be related to the input tags. The experiments were conducted as part of the MediaEval 2012 Placing Task. Our approach, which differs significantly from the other submitted techniques, achieves the best results on this benchmark when considering the same amount of external information, i.e. when not using any gazetteers nor any other kind of external information.

References

  1. J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran. The 2012 ICSI/Berkeley Video Location Estimation System. In MediaEval, 2012.Google ScholarGoogle Scholar
  2. D. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the World's Photos. In WWW, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Hays and A. A. Efros. IM 2 GPS : estimating geographic information from a single image. In CVPR, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  4. H. Jégou and O. Chum. Negative evidences and co-occurrences in image retrieval: the benefit of PCA and whitening. In ECCV, Oct. 2012.Google ScholarGoogle ScholarCross RefCross Ref
  5. H. Jégou, M. Douze, and C. Schmid. Product quantization for nearest neighbor search. PAMI, 33(1), Jan. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, and C. Schmid. Aggregating local image descriptors into compact codes. PAMI, Sep. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Kelm, S. Schmiedeke, and T. Sikora. How Spatial Segmentation improves the Multimodal. In MediaEval, 2012.Google ScholarGoogle Scholar
  8. O. V. Laere, S. Schockaert, and J. Quinn. Ghent and Cardiff University at the 2012 Placing Task. In MediaEval, 2012.Google ScholarGoogle Scholar
  9. L. Li, J. Almeida, and D. Pedronette. A Multimodal Approach for Video Geocoding. In MediaEval, 2012.Google ScholarGoogle Scholar
  10. J. Luo, D. Joshi, J. Yu, and A. Gallagher. Geotagging in multimedia and computer vision--a survey. Multimedia Tools Appl., 51(1), Jan. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. N. O'Hare and V. Murdock. Modeling locations with social media. Information Retrieval, Apr. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. O. A. B. Penatti, L. T. Li, J. Almeida, and R. da S. Torres. A Visual Approach for Video Geocoding using Bag-of-Scenes. In ICMR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Popescu and N. Ballas. CEA LIST's Participation at MediaEval 2012 Placing Task. In MediaEval, 2012.Google ScholarGoogle Scholar
  15. A. Rae and P. Kelm. Working Notes for the Placing Task at MediaEval 2012. In MediaEval, 2012.Google ScholarGoogle Scholar
  16. P. Serdyukov, V. Murdock, and R. van Zwol. Placing flickr photos on a map. In SIGIR, May 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. M. Sergieh, G. Gianini, M. Döller, H. Kosch, E. Egyed-Zsigmond, and J.-M. Pinon. Geo-based Automatic Image Annotation. In ICMR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. B. Sigurbjörnsson and R. van Zwol. Flickr tag recommendation based on collective knowledge. In WWW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Whissell and C. Clarke. Improving document clustering using Okapi BM25 feature weighting. Information Retrieval, 14, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Retrieving geo-location of videos with a divide & conquer hierarchical multimodal approach

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      ACM Conferences cover image
      ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
      April 2013
      362 pages
      ISBN:9781450320337
      DOI:10.1145/2461466

      Copyright © 2013 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 April 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Qualifiers

      • research-article

      Acceptance Rates

      ICMR '13 Paper Acceptance Rate 38 of 96 submissions, 40%
      Overall Acceptance Rate 254 of 830 submissions, 31%

      Upcoming Conference

      MM '22

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!