Trudy SPIIRAN
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Informatics and Automation:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Trudy SPIIRAN, 2014, Issue 36, Pages 128–150
DOI: https://doi.org/10.15622/sp.36.8
(Mi trspy753)
 

Training Personal Voice Model of a Speaker with Unified Phonetic Space of Features Using Artificial Neural Network

E. Azarov, A. A. Petrovsky

Belarussian State University of Computer Science and Radioelectronic Engineering
Abstract: The paper investigates possibility of creating a personal voice model using transcribed speech samples of a specified speaker. The paper presents a practical way of building such speech model and some experimental results of applying the model to voice conversion. The model uses an artificial neural network organized as autoencoder that establishes correspondence between space of speech parameters and space of possible phonetic states, unified for any voice.
Keywords: Voice Conversion; Speech Synthesis; Artificial Neural Network.
Document Type: Article
UDC: 004.934
Language: Russian


Citation: E. Azarov, A. A. Petrovsky, “Training Personal Voice Model of a Speaker with Unified Phonetic Space of Features Using Artificial Neural Network”, Tr. SPIIRAN, 36 (2014), 128–150
Linking options:
  • https://www.mathnet.ru/eng/trspy753
  • https://www.mathnet.ru/eng/trspy/v36/p128
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Informatics and Automation
    Statistics & downloads:
    Abstract page:181
    Full-text PDF :90
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024