Speech annotation system for the comparative analysis of pronunciation in different dialects

Authors

DOI:

https://doi.org/10.17308/sait.2020.1/2631

Keywords:

speech recognition, dialect, speech, phoneme, speech corpus, voice dependence

Abstract

The article describes the development of an annotation system for the comparative analysis of pronunciation in different dialects of the Russian language. It emphasizes that voice input is nowadays widely used. The main challenge still facing voice assistants is the need to recognise different dialects. The most popular systems for the recognition of accents and dialects of Romance and Germanic languages were analysed. The key differences between speakers were determined. The article shows that phonetic features are informative enough to be used for distinguishing dialects. It also demonstrates that a large number of dialects in the Russian language requires the creation of an acoustic model for each of them. The article focuses on the analysis and modelling of the prosodic structure of the Russian dialects. The prosodic approach is based on the stress theory, which considers syllables from the point of view of emphasis and duration. It is noted that the dialects of the Russian language differ significantly in terms of their prosodic structure, i.e. their rhythmic structure, the tempo of speech, and the length of vowels. Software was developed for speech annotation which allows the user to interact with the speech database. The interface of the developed system is demonstrated. Audio recordings are visualized by oscillograms. The primary and secondary entities used in the database are highlighted. Each entity stores certain information regarding the phonemes and the speaker. The developed system for the recognition of dialects of the Russian language can be used to create a speech corpus, which will allow obtaining information about the pronunciation of phonemes using specific parameters.

Author Biographies

  • Iuliia I. Butenko, Bauman Moscow State Technical University

    PhD in Technical Sciences, Associate Professor, Department of Romance and Germanic Languages, Bauman Moscow State Technical University

  • Iurii V. Stroganov, Bauman Moscow State Technical University

    senior lecturer, department of Computer software and information technology, Bauman Moscow State Technical University

  • Valeriia I. Shevchenko

    Senior lecturer, Department of Computer Software and Information Technologies, Bauman Moscow State Technical University

  • Nikolai V. Slavnov, Bauman Moscow State Technical University

    MSc student, Faculty of Linguistics, Bauman Moscow State Technical University

  • Alexander V. Kvasnikov, Bauman Moscow State Technical University

    MSc student, Faculty of Computer Science and Control Systems, Bauman Moscow State Technical University

References

Downloads

Published

2020-03-24

Issue

Section

Computer Linguistics and Natural Language Processing

How to Cite

Speech annotation system for the comparative analysis of pronunciation in different dialects. (2020). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 1, 168-176. https://doi.org/10.17308/sait.2020.1/2631