Emotional Speech Synthesis Based on Prosodic Feature Modification - Engineering

ENG > Vol.5 No.10, October 2013

Emotional Speech Synthesis Based on Prosodic Feature Modification ()

HTML

Download as PDF (Size: 170KB) PP. 73-77

DOI: 10.4236/eng.2013.510B015 3,258 Downloads 5,368 Views Citations

Author(s)

Ling He, Hua Huang, Margaret Lech

Affiliation(s)

School of Electrical and Computer Engineering, RMIT University, Melbourne, Australia.
School of Electrical Engineering and Information, Sichuan University, Chengdu, China.

ABSTRACT

The synthesis of emotional speech has wide applications in the field of human-computer interaction, medicine, industry and so on. In this work, an emotional speech synthesis system is proposed based on prosodic features modification and Time Domain Pitch Synchronous OverLap Add (TD-PSOLA) waveform concatenative algorithm. The system produces synthesized speech with four types of emotion: angry, happy, sad and bored. The experiment results show that the proposed emotional speech synthesis system achieves a good performance. The produced utterances present clear emotional expression. The subjective test reaches high classification accuracy for different types of synthesized emotional speech utterances.

KEYWORDS

Emotional Speech Synthesis; Prosodic Features; Time Domain Pitch Synchronous Overlap Add

Share and Cite:

He, L. , Huang, H. and Lech, M. (2013) Emotional Speech Synthesis Based on Prosodic Feature Modification. Engineering, 5, 73-77. doi: 10.4236/eng.2013.510B015.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies