Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis

Suphattharachai Chomphan

doi:10.3844/jcssp.2009.905.914

Research Article Open Access

Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis

Suphattharachai Chomphan

Abstract

Problem statement: Tone distortion in Thai languages can deteriorate not only the intelligibility of speech but also its naturalness. Therefore, the correctness of tone must be carefully taken into account in continuous speech synthesis. The preliminary work confronted this problem when applying HMM-based speech synthesis to Thai. Approach: This study presented a study on speaker-dependent and speaker-independent Hidden Markov Model (HMM)-based Thai speech synthesis. In the speaker-dependent system, we developed a simple tone-separated tree structure in the tree-based context clustering process of the training stage to treat the tone distortion problem. In the speaker-independent system or averaged-voice-model system, a number of tonal features are extracted and applied with the Speaker Adaptive Training (SAT) and Shared Decision Tree (STC) techniques to release the tone distortion problem. Results: Our objective evaluation revealed that the proposed features could make the F0 contour closer to the target speaker’s real contour. The results from our subjective test also revealed that the proposed tonal features could improve the tone intelligibility of all speech-model scenarios of male and female. Conclusion: By applying our approach, the problem of tone distortion can be relieved effectively. The better tone correctness can improve the intelligibility and the naturalness of speech significantly.

Journal of Computer Science

Volume 5 No. 12, 2009, 905-914

DOI: https://doi.org/10.3844/jcssp.2009.905.914

Submitted On: 20 September 2009 Published On: 31 December 2009

How to Cite: Chomphan, S. (2009). Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis. Journal of Computer Science, 5(12), 905-914. https://doi.org/10.3844/jcssp.2009.905.914

Copyright: © 2009 Suphattharachai Chomphan. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

6,208 Views
4,008 Downloads
1 Citations

Download

Keywords

Tone correctness
speaker-dependent
speaker-independent
hidden Markov models
speech synthesis