skip to main content
research-article

Multimedia sensor fusion for retrieving identity in biometric access control systems

Published:26 November 2010Publication History
Skip Abstract Section

Abstract

In this article, we propose a novel multimedia sensor fusion approach based on heterogeneous sensors for biometric access control applications. The proposed fusion technique uses multiple acoustic and visual sensors for extracting dominant biometric cues, and combines them with nondominant cues. The performance evaluation of the proposed fusion protocol and a novel cascaded authentication approach using a 3D stereovision database shows a significant improvement in performance and robustness, with equal error rates of 42.9% (audio only), 32% (audio + 3D face + 2D lip features), 15% (audio + 3D face + 2D eye features), and 7.3% (audio-3D face + 2D lip + 2D eye-eyebrows) respectively.

Skip Supplemental Material Section

Supplemental Material

References

  1. Bowyer, K. W., Chang, K., and Flynn, P. 2006. A survey of approaches and challenges in 3D and multimodal 3D + 2D face recognition. Comput. Vis. Image Understand. 101, 1, 1--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Brunnelli, R. and Fala Vigna, D. 1995. Person identification using multiple cues. IEEE Trans. Patt. Anal. Mach. Intel. 17, pp. 955--966. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Callan, D., Jones J. A., Munhall, K. G., Kroos, C., Callan, A., and Vatikiotis-Bateson, E. 2003. Neural processes underlying perceptual enhancement by visual speech gestures. Neuroreport 14, 2213--2218.Google ScholarGoogle ScholarCross RefCross Ref
  4. Chelubishi, C. C., Deravi, F., and Mason, J. S. D. 2002. A review of speech-based bimodal recognition. IEEE Trans. Multimedia 4, 23--35. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chetty, G. and Wagner, M. 2004. Automated lip feature extraction for liveness verification in audio-video authentication. In Proceedings of Image and Vision Computing New Zealand Conferences, 17--22.Google ScholarGoogle Scholar
  6. Chetty, G. and Wagner, M. 2007. Audio-visual speaker identity verification using lip motion features. In Proceedings of the International Conference on Spoken Language Processing (INTERSPEECH '07).Google ScholarGoogle Scholar
  7. Dasarathy, B. V. 1997. Sensor fusion potential exploitation-innovative architectures and illustrative applications. Proc. IEEE 85, 24--38.Google ScholarGoogle ScholarCross RefCross Ref
  8. Dutagaci, H., Sankur, B., and Yemez, Y. 2006. 3D face recognition by projection-based features. In Proceedings of the SPIE Conference on Electronic Imaging: Security, Steganography, and Watermarking of Multimedia.Google ScholarGoogle Scholar
  9. Goecke, R. and Millar, J. B. 2004. The audio-video Australian English speech data corpus AVOZES. In Proceedings of the 8th International Conference on Spoken Language Processing (INTERSPEECH '04). 2525--2528.Google ScholarGoogle Scholar
  10. Gokberk, B., Irfanoglu, M. O., and Akarun, L. 2006. 3D shape-based face representation and facial feature extraction for face recognition. Image Vision Comput. To appear.Google ScholarGoogle Scholar
  11. Halld, L. and Linas, J. 1997. An introduction to multisensor data fusion. Proc IEEE 85, 6--23.Google ScholarGoogle ScholarCross RefCross Ref
  12. Hani, C. Y., Kuratate, T., and Vatikiotis-Bateson, E. 2002. Linking facial animation, head motion, and speech acoustics. J. Phonetics 30, 3, 555--568.Google ScholarGoogle ScholarCross RefCross Ref
  13. Hyvarinen A. and Oja, E., 2000. Independent component analysis: Algorithms and applications. Neural Netw. 13, 4--5, 411--430. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kahraman, F. and Stegmann, M. B. 2006, Towards illumination-invariant localization of faces using active appearance models. Proceedings of the IEEE Nordic Signal Processing Symposium.Google ScholarGoogle Scholar
  15. Kroos, C., Kuratate, T., and Vatikiotis-Bateson, E. 2002 Video-based face motion measurement. J. Phonetics 30, 3, 569--590.Google ScholarGoogle ScholarCross RefCross Ref
  16. Ortega-Garcia J. 2003, MCYT baseline corpus: A bimodal biometric database. In IEE Proceedings on Vision, Image and Signal Processing.Google ScholarGoogle ScholarCross RefCross Ref
  17. Pigeon, S. and Vandendorpe, L. 1998. Image-based multimodal face authentication. Signal Process. 69, 59--79. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Potamianos, G. G., Net, C., Gravier, G., Garg, A., and Senior, A. W., 2003, Recent advances in the automatic recognition of audiovisual speech. Proc. IEEE 91, 1306--1324.Google ScholarGoogle Scholar
  19. Quatieri, T. F. 2002. Discrete Time Speech Signal Processing. Signal Processing Series. Prentice Hall. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Santi, A., Servos, P., Vatikiotis-Bateson, E., Kuratate, T. and Munhall, K. 2003. Perceiving biological motion: Dissociating talking from walking. J. Cogn. Neurosci. 15, 800--809. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Multimedia sensor fusion for retrieving identity in biometric access control systems

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Multimedia Computing, Communications, and Applications
        ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 6, Issue 4
        November 2010
        159 pages
        ISSN:1551-6857
        EISSN:1551-6865
        DOI:10.1145/1865106
        Issue’s Table of Contents

        Copyright © 2010 ACM

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 26 November 2010
        • Accepted: 1 August 2010
        • Revised: 1 July 2010
        • Received: 1 January 2010
        Published in tomm Volume 6, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!