Abstract:
The article presents a method of detecting prosodically prominent words, i.e. words that carry most of the information in the utterance. The method relies on lexical, grammatical and syntactic markers of prominence, and can be used in Text-to-Speech synthesis systems to make synthesized speech sound more natural.
Three different classification methods were used: Naive Bayes, Maximum Entropy and Conditional Random Fields models. The results of the experiments show that discriminative models provide more balanced values of the performance metrics, while the generative model is potentially more useful for detecting prominent words in speech signal.
The results of the study are comparable with the performances of similar systems developed for other languages, and in some cases surpass them.
Keywords:
prosodic prominence; emphasis; prosody; lexical analysis; syntax analysis; Naive Bayes classifier; Maximum Entropy classifier; Conditional Random Fields; Russian language.
\Bibitem{KocMen17}
\by D.~A.~Kocharov, A.~P.~Menshikova
\paper Detection of prominent words in Russian texts using linguistic features
\jour Tr. SPIIRAN
\yr 2017
\vol 55
\pages 216--236
\mathnet{http://mi.mathnet.ru/trspy983}
\crossref{https://doi.org/10.15622/sp.55.9}
\elib{https://elibrary.ru/item.asp?id=30685508}
Linking options:
https://www.mathnet.ru/eng/trspy983
https://www.mathnet.ru/eng/trspy/v55/p216
This publication is cited in the following 1 articles:
Dmitriy Levonevskiy, Dmitrii Malov, Irina Vatamaniuk, Lecture Notes in Computer Science, 11658, Speech and Computer, 2019, 270