Feature Optimization of Speech Emotion Recognition - Journal of Biomedical Science and Engineering

JBiSE > Vol.9 No.10, September 2016

PP. 37-43

DOI: 10.4236/jbise.2016.910B005

Author(s)

Chunxia Yu, Ling Xie, Weiping Hu^*

Affiliation(s)

GuangXi Key Lab of Multi-Source Information Mining and Security, GuangXi Normal University, Guilin, China.

ABSTRACT

In this paper, speech emotion is divided into four categories: Fear, Happy, Neutral and Surprise. Traditional features and their statistics are generally used to recognize speech emotion. To quantify each feature's contribution to emotion recognition, a method based on the Back Propagation (BP) neural network is adopted, from which the optimal subset of the features is obtained. In addition, two new speech emotion features are added to the selected features: MFCC features extracted from the fundamental frequency curve (MFCCF0) and amplitude perturbation parameters extracted from the short-time average magnitude curve (APSAM). With the Gaussian Mixture Model (GMM), the highest average recognition rate over the four emotions is 82.25%, and the recognition rate for Neutral reaches 90%.
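As a rough illustration of the APSAM idea, the sketch below computes a short-time average magnitude curve and then a single perturbation value from it. The abstract does not define the perturbation parameters, so the shimmer-style ratio used here (mean absolute frame-to-frame change divided by the mean level) is an assumption, and the function names, frame length and hop size are hypothetical choices made only for this example.

```python
def short_time_average_magnitude(signal, frame_len, hop):
    """Return the mean absolute amplitude of each frame of the signal."""
    curve = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        curve.append(sum(abs(s) for s in frame) / frame_len)
    return curve


def relative_perturbation(curve):
    """Shimmer-style measure (assumed definition, not taken from the paper):
    mean absolute change between neighbouring frames divided by the mean
    level of the curve."""
    if len(curve) < 2:
        raise ValueError("need at least two frames")
    mean_level = sum(curve) / len(curve)
    if mean_level == 0:
        return 0.0
    steps = [abs(b - a) for a, b in zip(curve, curve[1:])]
    return (sum(steps) / len(steps)) / mean_level


# Toy signal: four non-overlapping frames with magnitudes 1, 3, 1, 3.
toy = [1, -1, 3, -3, 1, -1, 3, -3]
curve = short_time_average_magnitude(toy, frame_len=2, hop=2)
apsam_like = relative_perturbation(curve)
```

On real speech the frame length and hop would be tens of milliseconds of samples rather than two samples, and the resulting value would be one entry in the feature vector alongside the selected traditional features.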

KEYWORDS

Speech Emotion Recognition, Feature Selection, Feature Extraction, BP Neural Network, GMM

Share and Cite:

Yu, C. , Xie, L. and Hu, W. (2016) Feature Optimization of Speech Emotion Recognition. Journal of Biomedical Science and Engineering, 9, 37-43. doi: 10.4236/jbise.2016.910B005.
