|
Training a speaker verification system on unlabelled data
A. V. Ermilov, I. M. Gostev National Research University "Higher School of Economics"
Abstract:
In the article we consider a method of labeling speaker data using clusterization techniques. Labelling problems arise when one needs to use a speaker database from new channels, for example, mobile devices. Newly labelled database might then be used to construct a speaker verification system. In the article described a speaker verification task along with some methods to solve it which are based on GMM-UBM, also some channel normalization techniques are described, which might enhance the quality of recognition. Methods based on supervectors and PLDA are also presented. We also study the quality of labeling obtained through clusterization with different metrics. Resulting labelled database is then used to train several PLDA models. Then these models fused and used to solve a speaker verification task on i-vectors from NIST are i-vector Machine Learning Challenge 2014.
Keywords:
patern recognition, automatic speaker verification, clusterization, PLDA.
Received: 30.03.2015
Citation:
A. V. Ermilov, I. M. Gostev, “Training a speaker verification system on unlabelled data”, Matem. Mod., 27:7 (2015), 51–57
Linking options:
https://www.mathnet.ru/eng/mm3622 https://www.mathnet.ru/eng/mm/v27/i7/p51
|
Statistics & downloads: |
Abstract page: | 259 | Full-text PDF : | 147 | References: | 34 | First page: | 9 |
|