Ghahabi, O. and Hernando, J. (2017) Deep Learning Backend for Single and Multisession I-Vector Speaker Recognition. IEEE ACM Transactions on Audio Speech and Language Processing, 25, 807-817. - References

Journals by Subject

Publish with us

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Article citationsMore>>

Ghahabi, O. and Hernando, J. (2017) Deep Learning Backend for Single and Multisession I-Vector Speaker Recognition. IEEE ACM Transactions on Audio Speech and Language Processing, 25, 807-817.
https://doi.org/10.1109/TASLP.2017.2661705

has been cited by the following article:

TITLE: Adaptive Threshold Estimation of Open Set Voiceprint Recognition Based on OTSU and Deep Learning

AUTHORS: Xudong Li, Xinjia Yang, Linhua Zhou

KEYWORDS: Voiceprint Recognition, Deep Neural Network (DNN), OTSU, Adaptive Threshold

JOURNAL NAME: Journal of Applied Mathematics and Physics, Vol.8 No.11, November 30, 2020

ABSTRACT: Aiming at the problem of open set voiceprint recognition, this paper proposes an adaptive threshold algorithm based on OTSU and deep learning. The bottleneck technology of open set voiceprint recognition lies in the calculation of similarity values and thresholds of speakers inside and outside the set. This paper combines deep learning and machine learning methods, and uses a Deep Belief Network stacked with three layers of Restricted Boltzmann Machines to extract deep voice features from basic acoustic features. And by training the Gaussian Mixture Model, this paper calculates the similarity value of the feature, and further determines the threshold of the similarity value of the feature through OTSU. After experimental testing, the algorithm in this paper has a false rejection rate of 3.00% for specific speakers, a false acceptance rate of 0.35% for internal speakers, and a false acceptance rate of 0 for external speakers. This improves the accuracy of traditional methods in open set voiceprint recognition. This proves that the method is feasible and good recognition effect.

Open Access

Articles

Deep Learning Recognition for Arabic Alphabet Sign Language RGB Dataset

Rabie El Kharoua, Xiaoming Jiang

Journal of Computer and Communications Vol.12 No.3, March 11, 2024

DOI: 10.4236/jcc.2024.123003
Open Access

Articles

The Effect of Speech Fragmentation and Audio Encodings on Automatic Parkinson’s Disease Recognition

Dávid Sztahó, Attila Zoltán Jenei, István Valálik, Klára Vicsi

Journal of Biomedical Science and Engineering Vol.15 No.1, January 10, 2022

DOI: 10.4236/jbise.2022.151002
Open Access

Articles

Improving Speech Recognition during Phone Calls in Noisy Environment through the Use of Wireless Audio Streaming in Hearing Aids

Chiyuen Tan, Lei Tu, Yonghua Wang, Dongdong Jin, Yuan Wang, Wendi Shi

Open Access Library Journal Vol.11 No.3, March 27, 2024

DOI: 10.4236/oalib.1111343
Open Access

Articles

Phoneme Sequence Modeling in the Context of Speech Signal Recognition in Language “Baoule”

Hyacinthe Konan, Etienne Soro, Olivier Asseu, Bi Tra Goore, Raymond Gbegbe

Engineering Vol.8 No.9, September 14, 2016

DOI: 10.4236/eng.2016.89055
Open Access

Articles

Applying Deep Learning Models to Mouse Behavior Recognition

Ngoc Giang Nguyen, Dau Phan, Favorisen Rosyking Lumbanraja, Mohammad Reza Faisal, Bahriddin Abapihi, Bedy Purnama, Mera Kartika Delimayanti, Kunti Robiatul Mahmudah, Mamoru Kubo, Kenji Satou

Journal of Biomedical Science and Engineering Vol.12 No.2, February 28, 2019

DOI: 10.4236/jbise.2019.122012

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals by Subject

Publish with us

Article citationsMore>>

Home

About SCIRP

Service

Policies