|
Trudy SPIIRAN, 2013, Issue 26, Pages 332–348
(Mi trspy626)
|
|
|
|
Usage of Speech Signal Segmentation for the Construction of Complex Model in the Speaker Identification System.
T. V. Yermolenkoab, M. S. Klymenkoa a Institute of Artificial Intelligence
b Donetsk National Technical University
Abstract:
The article is devoted to development of a complex speaker model for using at the text-independent speaker identification. The complex speaker model is based on gaussian mixture method. The model is formed by preliminary segmented speech signal, where each segment matches to certain broad phonetic class. Method of speaker models structuring is proposed. Speaker models are structured as a tree, which allows to identify speaker without running a full search on the set of models. Researches have shown the division of the acoustic space of speaker's voice on the set of classes that represent some phonetic events, increases the efficiency of voice identification and the proposed structuring method of models accelerates the search operation.
Keywords:
clustering, gaussian mixture, speaker models, broad phonetic classes, mel-frequency cepstral coefficients.
Received: 04.04.2013
Citation:
T. V. Yermolenko, M. S. Klymenko, “Usage of Speech Signal Segmentation for the Construction of Complex Model in the Speaker Identification System.”, Tr. SPIIRAN, 26 (2013), 332–348
Linking options:
https://www.mathnet.ru/eng/trspy626 https://www.mathnet.ru/eng/trspy/v26/p332
|
Statistics & downloads: |
Abstract page: | 170 | Full-text PDF : | 101 | References: | 36 | First page: | 1 |
|