UPM Institutional Repository

Voice verification using i-vectors and neural networks with limited training data


Citation

Mamyrbayev, Orken Zh. and Othman, Mohamed and Akhmediyarova, A. T. and Kydyrbekova, Aizada S. and Mekebayev, Nurbapa O. (2019) Voice verification using i-vectors and neural networks with limited training data. Bulletin of the National Academy of Sciences of the Republic of Kazakhstan, 3 (379). pp. 36-43. ISSN 1991-3494; ESSN: 2518-1467

Abstract

This study proposes an approach to voice identification based on neural networks (DNN) for i-Vector. Modern voice identification systems based on DNN use large amounts of labeled training data. Using the LRE i-Vector Machine Learning Challenge restricts access to ready-to-use i-Vector for learning and testing the voice identification system. This poses unique challenges in developing DNN-based voice identification systems, since optimized external interfaces and network architectures can no longer be used. We propose to use the training i-Vectors to train the initial DNN to identify the voice. Next, we present a novel strategy for using this initial DNN to strip the language labels of the inappropriate set from the development data. The final DNN for voice identification is trained using the original training data and the estimated out-of-set language data. We show that augmenting the training set with out-of- set labels leads to a significant improvement in voice identification performance. In this paper, we studied the possibility of using neural networks for speech identification. In particular, standard approaches to speech recognition were considered, the concept of an artificial neuron as an object used in speech identification was defined. A speech recognition option using a neural network was investigated, and steps were presented to perform this task. Accuracy using neural networks with limited learning data and a higher i-vector dimension is superior to others with a score of 92.1%. From this study, we can conclude that the size of the UBM and the dimension of the i-vector affect the accuracy of voice identification based on the i-vector.


Download File

[img] Text
Voice verification .pdf

Download (10kB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.32014/2019.2518-1467.66
Publisher: National Academy of Sciences of the Republic of Kazakhstan
Keywords: Voice identification; i-Vector; Deep neural network.
Depositing User: Mr. Sazali Mohamad
Date Deposited: 04 Jun 2021 23:43
Last Modified: 05 Jun 2021 00:12
Altmetrics: http://www.altmetrics.com/details.php?domain=pasir.upm.edu.my&doi=10.32014/2019.2518-1467.66
URI: http://psasir.upm.edu.my/id/eprint/82733
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item