Training Personal Voice Model of a Speaker with Unified Phonetic Space of Features Using Artificial Neural Network
E. Azarov, A. A. Petrovsky
Belarussian State University of Computer Science and Radioelectronic Engineering
Abstract:
The paper investigates the possibility of creating a personal voice model from transcribed speech samples of a specified speaker. It presents a practical way of building such a speech model, along with experimental results of applying the model to voice conversion. The model uses an artificial neural network organized as an autoencoder that establishes a correspondence between the space of speech parameters and the space of possible phonetic states, which is unified across all voices.
Keywords:
Voice Conversion; Speech Synthesis; Artificial Neural Network.
Citation:
E. Azarov, A. A. Petrovsky, “Training Personal Voice Model of a Speaker with Unified Phonetic Space of Features Using Artificial Neural Network”, Tr. SPIIRAN, 36 (2014), 128–150
Linking options:
https://www.mathnet.ru/eng/trspy753 https://www.mathnet.ru/eng/trspy/v36/p128