skip to main content
research-article

A Preliminary Analysis on the Correlates of Stress and Tones in Mizo

Published:27 December 2022Publication History
Skip Abstract Section

Abstract

Stress is the property of a language to exhibit prominence or distinction in one or more syllables in a given domain. The existence of word stress has not been suitably explored in previous acoustic studies of the Mizo language, which is a tonal language of the Kuki-Chin sub-category in Tibeto-Burman language families. In this study, we attempt to analyze word stress on disyllabic target words, specifically in three lexical categories—adjectives, nouns, and verbs. Utterances of the target words are recorded in isolated setting (out of focus) and in sentence frames (in focus). First, averages of features, namely—duration, intensity, F0, formants, and spectral tilt, are extracted and investigated for identification of stressed and unstressed syllables on a total of 2,880 samples. Next, the interaction of word stress on the four tones of Mizo is investigated. While it is found that H-tone is generally stressed, inferences are made that stressed syllables are not unique to a specific tone. Third, significance of the selected features are validated using a two-tailed paired sample t-test. Our analysis indicates that the mean differences in duration, intensity, and F0 of the stressed and unstressed syllables are significant across the lexical categories at p < 0.05. Next, validations on the significance of the mean differences are carried out using Cohen’s d effect size and Pearson’s Correlation Coefficient (r). Finally, three machine learning models—Support Vector Machines (SVM), Naive Baye’s, and Ensemble learning methods (AdaBoost and Boosted Aggregation), are used to identify stressed and unstressed syllables associated with tones in Mizo. Discriminating differences, especially in disyllabic verbs, are observed between stressed vs. unstressed syllables. Conclusions are drawn that duration is a strong and robust cue for acoustic correlates of stress, while intensity is a medium cue for stress and F0 a weak cue for stress.

REFERENCES

  1. [1] Bentham Jereemi, Pakray Partha, Majumder Goutam, Lalbiaknia Sunday, and Gelbukh Alexander. 2016. Identification of rules for recognition of named entity classes in Mizo language. In Proceedings of the 15th Mexican International Conference on Artificial Intelligence. IEEE, 813. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Boersma Paul. 2014. The Use of Praat in Corpus Research. Oxford University Press, Oxford, UK, 342–360. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Campbell Nick and Beckman Mary. 1997. Stress, prominence, and spectral tilt. In Proceedings of ESCA Workshop on Intonation: Theory, Models and Applications. ISCA, 6770.Google ScholarGoogle Scholar
  4. [4] Caplow Nancy. 2017. Stress patterns and acoustic correlates of stress in Balti Tibetan. Himal. Ling. 15, 2 (2017), 149. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Chávez-Peón Mario E.. 2008. Phonetic cues to stress in a Tonal language: Prosodic prominence in San Lucas Quiaviní Zapotec. In Proceedings of the Annual Conference of the Canadian Linguistic Association.Google ScholarGoogle Scholar
  6. [6] Chen Matthew Y.. 2000. Tone Sandhi: Patterns across Chinese Dialects. Vol. 92. Cambridge University Press, Cambridge, UK.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Chhangte Lalnunthangi. 1986. A Preliminary Grammar of the Mizo Language (Tibeto-Burman). Ph. D. Dissertation. The University of Texas at Arlington.Google ScholarGoogle Scholar
  8. [8] Childers Donald G.. 1978. Modern Spectrum Analysis. IEEE Computer Society Press, New York. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Coe Robert. 2002. It’s the effect size, stupid. In Proceedings of the British Educational Research Association Annual Conference, Vol. 12. BERA, Exeter, 118. Retrieved from http://www.leeds.ac.uk/educol/documents/00002182.htm.Google ScholarGoogle Scholar
  10. [10] Cohen Jacob. 1960. A coefficient of agreement for nominal scales. Educ. Psychol. Measur. 20, 1 (1960), 3746. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Cohen Jacob. 1992. Statistical power analysis. Curr. Direct. Psychol. Sci. 1, 3 (1992), 98101. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Dey Abhishek, Sarma Biswajit Dev, Lalhminghlui Wendy, Ngente Lalnunsiami, Gogoi Parismita, Sarmah Priyankoo, Prasanna S. R. M., Sinha Rohit, and Nirmala S. R.. 2018. Robust Mizo continuous speech recognition. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH’18). ISCA, 10361040. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Dicanio Christian T.. 2009. The phonetics of register in Takhian Thong Chong. J. Int. Phonet. Assoc. 39, 2 (2009), 162188. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Dong Xibin, Yu Zhiwen, Cao Wenming, Shi Yifan, and Ma Qianli. 2020. A survey on ensemble learning. Front. Comput. Sci. 14, 2 (2020), 241258. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Erickson Donna. 2002. Articulation of extreme formant patterns for emphasized vowels. J. Phonetica 59, 2-3 (2002), 134149. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Fanai Lalrindiki T.. 1992. Some Aspects of the Lexical Phonology of Mizo and English: An Autosegmental Approach. Ph. D. Dissertation. CIEFL, Hyderabad. Retrieved from http://hdl.handle.net/10603/130415.Google ScholarGoogle Scholar
  17. [17] Farid Dewan Md, Rahman Mohammad Masudur, and Al-Mamuny M. A.. 2014. Efficient and scalable multi-class classification using naïve Bayes tree. In Proceedings of the International Conference on Informatics, Electronics & Vision (ICIEV’14). IEEE Computer Society Press, 14. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Farid Dewan Md, Zhang Li, Rahman Chowdhury Mofizur, Hossain M. Alamgir, and Strachan Rebecca. 2014. Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks. Exp. Syst. Applic. 41, 4 (2014), 19371946. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Fery Caroline. 2017. Tone and stress at the word level. In Intonation and Prosodic Structure, Key Topics in Phonology. Cambridge University Press, Cambridge, UK, 178224. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Fokes Joann and Bond Z. S.. 1989. The vowels of stressed and unstressed syllables in nonnative English. Lang. Learn. 39, 3 (1989), 341373. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Gogoi Parismita, Dey Abhishek, Lalhminghlui Wendy, Sarmah Priyankoo, and Prasanna S. R. M.. 2020. Lexical tone recognition in Mizo using acoustic-prosodic features. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC’20). European Language Resources Association, 64586461. Retrieved from https://www.aclweb.org/anthology/2020.lrec-1.795.Google ScholarGoogle Scholar
  22. [22] Gogoi Parismita, Kalita Sishir, Lalhminghlui Wendy, Sarmah Priyankoo, and Prasanna S. R. M.. 2021. Learning Mizo tones from F0 contours using 1D-CNN. In Proceedings of the International Conference on Speech and Computer (LNAI, 12997). Springer, 214225. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Gordon Matthew and Roettger Timo. 2017. Acoustic correlates of word stress: A cross-linguistic survey. Ling. Van. 3, 1 (2017), 111. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Govind D., Sarmah Priyankoo, and Prasanna S. R. Mahadev. 2012. Role of pitch slope and duration in synthesized Mizo tones. In Proceedings of the 6th International Conference on Speech Prosody. ISCA, 3942.Google ScholarGoogle Scholar
  25. [25] Halle M. and Vergnaud J.. 1987. Stress and the cycle. Ling. Inq. 18, 1 (1987), 4584. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Haykin Simon and Veen Barry Van. 2007. Signals and Systems. John Wiley & Sons, New Delhi.Google ScholarGoogle Scholar
  27. [27] Kakouros Sofoklis, Räsänen Okko, Alku Paavo, et al. 2017. Evaluation of spectral tilt measures for sentence prominence under different noise conditions. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH’17). ISCA, 32113215. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Kaland C. C. L.. 2018. Spectral tilt as a correlate of Papuan Malay word stress. In Proceedings of the 9th International Conference on Speech Prosody. ISCA, 339343. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Kim Tae Kyun. 2015. T test as a parametric statistic. Kor. J. Anesthesiol. 68, 6 (2015), 540. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Lalhminghlui Wendy and Sarmah Priyankoo. 2018. Production and perception of rising tone sandhi in Mizo. In Proceedings of the 6th International Symposium on Tonal Aspects of Languages (TAL’18). ISCA, 114118. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Lalhminghlui Wendy and Sarmah Priyankoo. 2020. Interaction of tone and voicing in Mizo. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH’20). ISCA, 19031907. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Liberman Mark and Prince Alan. 1977. On stress and linguistic rhythm. Ling. Inq. 8, 2 (1977), 249336.Google ScholarGoogle Scholar
  33. [33] Liu Min, Shi Shuju, and Zhang Jinsong. 2014. A preliminary study on acoustic correlates of Tone2+Tone2 disyllabic word stress in Mandarin. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH’14). ISCA, 179183.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] MATLAB. 2021. version 9.10.0 (R2021a). Retrieved from https://in.mathworks.com/products/matlab.html.Google ScholarGoogle Scholar
  35. [35] McHugh Mary L.. 2012. Interrater reliability: The kappa statistic. Biochem. Medic. 22, 3 (2012), 276282.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Michael Lev. 2009. The Interaction of Tone and Stress in the Prosodic System of Iquito (Zaparoan). Technical Report. University of California, Berkeley. 123. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Nair Rami et al. 2001. Acoustic correlates of lexical stress in Hindi. In Linguistic Structure and Language Dynamics in South Asia-Papers from the Proceedings of SALA XVIII Roundtable. Motilal Banarsidass Publishers Pvt. Ltd., Delhi, India, 123143.Google ScholarGoogle Scholar
  38. [38] Narasimhan S. V., Jose Litha Rose, Hasain B. Shahid, and Chand Rohini. 2010. Acoustic correlates of stress in Mizo, a tonal language. Lang. India 10, 4 (2010).Google ScholarGoogle Scholar
  39. [39] Narasimhan S. V., Thingom M. D., and Kiran R.. 2013. Influence of stress pattern of native language on non-native language: A comparative study between tone and non-tone language speakers. Lang India 13, 8(Aug.2013), 239250.Google ScholarGoogle Scholar
  40. [40] Obilor Esezi Isaac and Amadi Eric Chikweru. 2018. Test for significance of pearson’s correlation coefficient. Int. J. Innov. Math., Statist. Energ. Polic. 6, 1 (2018), 1123.Google ScholarGoogle Scholar
  41. [41] Potisuk Siripong, Gandour Jack, and Harper Mary P.. 1996. Acoustic correlates of stress in Thai. J. Phonet. 53, 4 (1996), 200220. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Potisuk Siripong, Gandour Jackson T., and Harper Mary. 1994. F0 correlates of stress in Thai. Ling. Tibeto-Burm. Area 17, 2 (1994), 127. Retrieved from http://sealang.net/sala/archives/pdf4/potisuk1994f0.pdf.Google ScholarGoogle Scholar
  43. [43] Rao Archana and Aishwarya R.. 2019. Acoustic correlates of stress in tone language: A comparison between Indian and Chinese. Lang. India 19, 1 (2019), 3238.Google ScholarGoogle Scholar
  44. [44] Sarma Biswajit Dev, Dey Abhishek, Lalhminghlui Wendy, Gogoi Parismita, Sarmah Priyankoo, and Prasanna S. R. Mahadev. 2018. Robust Mizo digit recognition using data augmentation and tonal information. In Proceedings of the 9th International Conference on Speech Prosody. ISCA, 621625. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Sarma Biswajit Dev, Sarmah Priyankoo, Lalhminghlui Wendy, and Prasanna S. R. Mahadev. 2015. Detection of Mizo tones. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH’15). ISCA, 934937.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Sarmah Priyankoo and Wiltshire Caroline R.. 2010. A preliminary acoustic study of Mizo vowels and tones. J. Acoust. Societ. India 37, 3 (2010), 121129.Google ScholarGoogle Scholar
  47. [47] Sawilowsky Shlomo S.. 2009. New effect size rules of thumb. J. Mod. Appl. Statist. Meth. 8, 2 (2009), 26. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Schuller B., Rigoll G., and Lang M.. 2004. Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP’04), Vol. 1. IEEE Computer Society Press, New York, I577–I580. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Sedgwick Philip. 2012. Pearson’s correlation coefficient. Brit. Med. J. 345 (2012), 12. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Shaheen Muhammad, Zafar Tanveer, and Khan Sajid Ali. 2020. Decision tree classification: Ranking journals using IGIDI. J. Inf. Sci. 46, 3 (June2020), 325339. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Shen Xiaonan Susan. 1992. On tone sandhi and tonal coarticulation. In Acta Linguistica Hafniensia, Vol. 25. Taylor and Francis Online, 8394. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Sluijter Agaath M. C. and Heuven Vincent J. van. 1996. Spectral balance as an acoustic correlate of linguistic stress. J. Acoust. Soc. Amer. 100, 4 (1996), 24712485. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Takeda Shoichi, Ueno Yuudai, Nakasako Noboru, and Nakagawa Hideo. 2010. Spectral-tilt features of emotional speech -research on emotional-speech synthesis based on voice-quality conversion. In Proceedings of the International Conference on Kansei Engineering and Emotion Research.Google ScholarGoogle Scholar
  54. [54] Tallman Adam J. R. and Elías-Ulloa José. 2020. The acoustic correlates of stress and tone in Chácobo (Pano): A production study. J. Acoust. Societ. Amer. 147, 4 (2020), 30283042. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  55. [55] Team The Audacity. 2021. Audacity ® – Audio Editor and Recorder. Muse Group. Retrieved from http://audacityteam.org/download/.Google ScholarGoogle Scholar
  56. [56] Weidert Alfons. 1975. Componential Analysis of Lushai Phonology. Vol. 2. John Benjamins Publishing. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  57. [57] Zhang Jie. 2014. Tones, tonal phonology, and tone sandhi. Handb. Chin. Ling. (2014), 443464. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  58. [58] Zhou Zhi-Hua. 2009. Ensemble learning. Mach. Learn. 1 (2009), 181210. DOI:Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Preliminary Analysis on the Correlates of Stress and Tones in Mizo

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 2
      February 2023
      624 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3572719
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 December 2022
      • Online AM: 6 July 2022
      • Accepted: 24 June 2022
      • Revised: 24 April 2022
      • Received: 31 December 2021
      Published in tallip Volume 22, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed
    • Article Metrics

      • Downloads (Last 12 months)101
      • Downloads (Last 6 weeks)5

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!