skip to main content
10.5555/1712759.1712796guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

Text dependent speaker verification system using discriminative weighting method and Artificial Neural Networks

Authors Info & Claims
Published:06 February 2008Publication History

ABSTRACT

Speaker recognition is a process to recognize someone by their voice. The goal of speaker recognition is to extract, characterize and recognize the information about speaker identity. In this paper, we discuss both conventional and Artificial Neural Network (ANN) approach to speaker recognition system. The proposed system comprises three main modules, a feature extraction module to extract necessary features from speech waves, a Vector Quantization (VQ) module to generate the speaker codebook and an ANN module to classify the speakers based on their high discriminative power. The proposed intelligent learning system has been applied to a case study of text-dependent speaker recognition system and the performance is evaluated by applying two types of feature extraction techniques: Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictive Cepstral Coefficients (LPCC). Experiment shows that the new proposed system provides significantly higher performance compare to conventional method.

References

  1. F. Nolan, The phonetic bases of speaker recognition (Cambridge CUP, Cambridge, 1983).Google ScholarGoogle Scholar
  2. L. Rabiner, B. Juang, Fundamentals of speech recognition (NJ: Prentice Hall, 1993). Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Furui, Recent advances in speaker recognition, Pattern Recognition Letters, 18: 859--872, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  4. S. Furui, Cepstral analysis technique for automatic speaker verification, IEEE Transactions on Acoustics, Speech and SignalProcessing, 29(2): 254--272, 1981.Google ScholarGoogle ScholarCross RefCross Ref
  5. B. Zhen, X. Wu, Z. Liu, H. Chi, On the use of bandpass liftering in speaker recognition, Proc. 6th Int. Conf. of Spoken Lang. Processing(ICSLP), Beijing, China, 2000.Google ScholarGoogle Scholar
  6. Jr. J. R. Deller, J. H. L. Hansen, J. G. Proakis, Discrete-time Processing of Speech Signals (Macmillan Publishing Company, New York, 2000). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Vergin and D. O'Shaughnessy, Pre-Emphasis and Speech Recognition, Canadian Conference on Electrical and Computer Engineering, 2: 1062--1065, 1995.Google ScholarGoogle Scholar
  8. X. Huang, A. Acero and H.-W. Hon, Spoken language processing: a guide to theory, algorithm, and system development (Englewood Cliffs, NJ: Prentice-Hall, 2001). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Y. Linde, A. Buzo and R. M. Gray, R. M., An algorithm for vector quantizer design. IEEE Trans. COM-28. 84--95, 1980.Google ScholarGoogle ScholarCross RefCross Ref
  10. D. E. Rumelhart, G. E. Hinton and R. J. Williams, Learning internal representations by error propagation, Nature, 323:533--536, 1986.Google ScholarGoogle ScholarCross RefCross Ref
  11. J. He, L. Liu, G. Palm, A discriminative training algorithm for VQ-based speaker identification, IEEE Transactions on Speech and Audio Processing, 7(3): 353--356, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  12. T. Kinnunen, P. Fränti, Speaker discriminative weighting method for VQ-based speaker identification, Proc. 3rd International Conference on Audio- and Video-Based Biometric Person Authentication (AVBPA): 150--156, Halmstad, Sweden, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. F. K. Soong, A. E. Rosenberg, B-H. Juang and L. R. Rabiner, A vector quantization approach to speaker recognition, AT&T Technical Journal, 66: 14--26, 1987.Google ScholarGoogle ScholarCross RefCross Ref
  14. T. Kohonen, Self-Organizing Maps (Springer-Verlag, Heidelberg, 1995). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. A. Finan, A. T. Sapeluk and R. I. Damper, Impostor cohort selection for score normalization in speaker verification, Pattern Recognition Letters, 18: 881--888, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  16. R. Battiti, First and second-order methods for learning: between steepest descent and newton's method. Neural Computation. 4(2): 141--166.1992. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Text dependent speaker verification system using discriminative weighting method and Artificial Neural Networks
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0

          Other Metrics