Studying the feature space for the description of segments of speech signals in speech recognition problems

Authors

DOI:

https://doi.org/10.17308/sait.2020.4/3208

Keywords:

speech signal, features, sub-band method, mel scale, frequency axis, fraction of energy, entropy

Abstract

The article considers a set of features for the description of segments of speech signals with regard to their informative value. Feature sets are used in recognition problems. Effectively solving recognition problems largely depends on the information content of the feature set. The information content of a set of features is estimated using information entropy. In this article, we compare information entropy of the feature sets obtained using the Fourier method and the sub-band method. The article also considers modifications to these methods, namely the transition to the mel scale, used to analyse speech signals. In addition to the mel scale, other frequency axis distortions are considered for the sub-band method. The article presents the results of calculating the information entropy performed using different methods of obtaining a set of features describing segments of speech signals. The study demonstrated that the information content of the feature set increases when using the sub-band method with a nonlinear division of the frequency axis into analysed bands.

Author Biographies

  • Evgeniy G. Zhilyakov, Belgorod State National Research University

    DSc in Technical Sciences, Professor, Head of the Department of Information and Telecommunication Systems and Technologies, Belgorod State National Research University

  • Alexey V. Boldyshev, Belgorod branch of PAO “Rostelekom”

    PhD in Technical Sciences, Head of network switching system unit No.1 of the Belgorod branch of PAO “Rostelekom”

  • Sergei V. Umanets, Belgorod branch of PAO “Rostelekom”

    lead engineer at network switching system unit No.1 of the Belgorod branch of PAO “Rostelekom”

References

Downloads

Published

2021-02-02

Issue

Section

Computer Linguistics and Natural Language Processing

How to Cite

Studying the feature space for the description of segments of speech signals in speech recognition problems. (2021). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 4, 111-120. https://doi.org/10.17308/sait.2020.4/3208

Most read articles by the same author(s)