Restricted Domain Malay Speech Synthesizer Using Syntax-Prosody Representation

Sabrina Tiun; Rosni Abdullah; Tang Enya Kong

doi:10.3844/jcssp.2012.1961.1969

Research Article Open Access

Sabrina Tiun¹, Rosni Abdullah² and Tang Enya Kong²

¹ University Kebangsaan Malaysia, Malaysia
² University Sains Malaysia, Malaysia

Abstract

A restricted domain speech application requires a synthesizer whose output quality is as high as that of the ‘slot-filler’ approach but that retains at least the minimal flexibility of a ‘genuine’ speech synthesizer. In this study, we therefore propose an alternative approach to building a speech synthesizer for restricted domain speech applications. Our approach uses the word as the primary synthesis unit, and the speech corpus is represented by syntax-prosody tree structures. Speech synthesis is performed by constructing a syntax-prosody tree for the target input sentence. The tree is constructed by adapting an example-based syntactic parsing approach, and the synthesized utterance is the concatenation of the synthesis units taken from the nodes of the constructed tree. For evaluation, we conducted a subjective MOS (mean opinion score) evaluation comparing our speech synthesizer with natural speech and two other Malay TTS systems. Based on ANOVA and t-test analyses, the overall MOS scores were as follows: our speech synthesizer output, sound B (mean = 3.34, sd = 1.10); the two other Malay TTS systems, C (mean = 1.95, sd = 0.72) and D (mean = 1.80, sd = 1.04); and natural speech, A (mean = 4.71, sd = 0.21). We conclude that our Malay speech synthesizer sounded more natural, easier to listen to, more pleasant and more fluent than the other two Malay TTS systems. As expected, the recorded speech was perceived as more natural than the output of our Malay speech synthesizer.
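The final step described in the abstract, concatenating the synthesis units found at the nodes of a constructed tree, can be illustrated with a minimal sketch. It is not the authors' implementation: the `Node` class, the labels, the example sentence and the unit identifiers such as `saya_017` are all hypothetical. The example-based parsing that builds the tree is not shown, and strings stand in for recorded word waveforms.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One node of a (hypothetical) syntax-prosody tree."""
    label: str                    # assumed syntactic/prosodic label
    unit: Optional[str] = None    # assumed ID of a recorded word unit (leaves only)
    children: List["Node"] = field(default_factory=list)


def collect_units(node: Node) -> List[str]:
    """Return word-unit IDs in left-to-right order via depth-first traversal."""
    if not node.children:
        # A leaf contributes its word unit, if it has one.
        return [node.unit] if node.unit is not None else []
    units: List[str] = []
    for child in node.children:
        units.extend(collect_units(child))
    return units


# Hypothetical tree for the illustrative sentence "saya mahu tiket".
tree = Node("S", children=[
    Node("NP", children=[Node("N", unit="saya_017")]),
    Node("VP", children=[
        Node("V", unit="mahu_003"),
        Node("N", unit="tiket_021"),
    ]),
])

# The ordered unit IDs are what a real system would join as audio.
sequence = collect_units(tree)
print(sequence)
```

In a real system each identifier would presumably refer to a recorded word segment chosen for its syntactic and prosodic context, and the segments would be joined as audio rather than listed as strings.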

Journal of Computer Science

Volume 8 No. 12, 2012, 1961-1969

DOI: https://doi.org/10.3844/jcssp.2012.1961.1969

Submitted On: 29 August 2012
Published On: 14 November 2012

How to Cite: Tiun, S., Abdullah, R. & Kong, T. E. (2012). Restricted Domain Malay Speech Synthesizer Using Syntax-Prosody Representation. Journal of Computer Science, 8(12), 1961-1969. https://doi.org/10.3844/jcssp.2012.1961.1969

Copyright: © 2012 Sabrina Tiun, Rosni Abdullah and Tang Enya Kong. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Malay Speech Synthesis
Restricted Domain Speech Synthesis
Syntax-Prosody Representation