ITU (2001) ITU-T Recommendation P.862: Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for End-to-End Speech Quality Assessment of Narrow-Band Telephone Networks and Speech Codecs. Technical Report, ITU.


ITU (2001) ITU-T Recommendation P.862: Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for End-to-End Speech Quality Assessment of Narrow-Band Telephone Networks and Speech Codecs. Technical Report, ITU.

has been cited by the following article:

TITLE: An Intonation Speech Synthesis Model for Indonesian Using Pitch Pattern and Phrase Identification

AUTHORS: Yohanes Suyanto, Subanar, Agus Harjoko, Sri Hartati

KEYWORDS: Speech Synthesis, PESQ, Intonation, Indonesian

JOURNAL NAME: Journal of Signal and Information Processing, Vol.5 No.3, July 30, 2014

ABSTRACT: In speech synthesis (text-to-speech) systems, prosody determines the tone, duration, and loudness of the speech sound. Intonation is the part of prosody that determines the tone of speech. In Indonesian, intonation is determined by the structure and type of the sentence and by the position of each word within it. This study proposes a speech synthesis model that focuses on intonation. The intonation is determined by sentence structure, the intonation patterns of example sentences, and the general rules of Indonesian pronunciation. The model receives text and intonation patterns as inputs. An initial prosody file is generated from the general rules of Indonesian pronunciation. The sentence structure is then determined from the input text, which in turn gives the intervals between the parts (phrases) of the sentence. These intervals are used to correct the durations in the initial prosody file, and the frequencies in the prosody file are then corrected using the intonation patterns. The final result is a prosody file that can be pronounced by a speech engine application. Experiments using the original voice of a radio news announcer and the synthesized speech show that the peaks of F0 are determined by the general rules or by the intonation patterns, whichever is dominant. A similarity test using the PESQ method gives the synthesized speech a score of 1.18 on the MOS-LQO scale.

Open Access

Articles

PCRR Based Bandpass Filter for C and L+U Bands of ITU-T G.694.2 CWDM Systems

S. Robinson, R. Nakkeeran

Optics and Photonics Journal Vol.1 No.3, September 30, 2011

DOI: 10.4236/opj.2011.13024

The Misconceived Search for the Meaning of “Speech” in Freedom of Speech

Larry Alexander

Open Journal of Philosophy Vol.5 No.1, January 23, 2015

DOI: 10.4236/ojpp.2015.51005

Wireless Bioradar Sensor Networks for Speech Detection and Communication

Ying Tian, Sheng Li, Jianqi Wang

Engineering Vol.5 No.5B, July 26, 2013

DOI: 10.4236/eng.2013.55B008

Artificial Intelligence for Speech Recognition Based on Neural Networks

Takialddin Al Smadi, Huthaifa A. Al Issa, Esam Trad, Khalid A. Al Smadi

Journal of Signal and Information Processing Vol.6 No.2, March 31, 2015

DOI: 10.4236/jsip.2015.62006

Automatic evaluation of speech impairment caused by wearing a dental appliance

Mariko Hattori, Yuka I. Sumita, Hisashi Taniguchi

Open Journal of Stomatology Vol.3 No.7, October 24, 2013

DOI: 10.4236/ojst.2013.37062
