Research Article Open Access

Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis

Suphattharachai Chomphan
Journal of Computer Science
Volume 5 No. 12, 2009, 905-914

DOI: https://doi.org/10.3844/jcssp.2009.905.914

Submitted On: 20 September 2009; Published On: 31 December 2009

How to Cite: Chomphan, S. (2009). Towards the Development of Speaker-Dependent and Speaker-Independent Hidden Markov Model-Based Thai Speech Synthesis. Journal of Computer Science, 5(12), 905-914. https://doi.org/10.3844/jcssp.2009.905.914

Abstract

Problem statement: Tone distortion in Thai degrades both the intelligibility and the naturalness of speech, so tone correctness must be carefully taken into account in continuous speech synthesis. Preliminary work encountered this problem when applying Hidden Markov Model (HMM)-based speech synthesis to Thai. Approach: This study investigated speaker-dependent and speaker-independent HMM-based Thai speech synthesis. In the speaker-dependent system, we developed a simple tone-separated tree structure for the tree-based context clustering process of the training stage to address the tone distortion problem. In the speaker-independent, or average-voice-model, system, a number of tonal features were extracted and applied together with the Speaker Adaptive Training (SAT) and shared-decision-tree-based context clustering (STC) techniques to alleviate the tone distortion problem. Results: Our objective evaluation showed that the proposed features brought the F0 contour closer to the target speaker’s real contour. Our subjective test also showed that the proposed tonal features improved tone intelligibility in all speech-model scenarios for both male and female speech. Conclusion: Our approach effectively relieves the problem of tone distortion, and the improved tone correctness significantly improves the intelligibility and naturalness of speech.
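The tone-separated tree idea mentioned in the abstract can be pictured as partitioning the context-dependent models by tone before any clustering takes place, so that each tone is clustered in its own tree. The following is only a minimal sketch under stated assumptions: the abstract does not describe the actual procedure, the function name `split_by_tone` and the context labels are hypothetical, and the clustering step itself is not shown.

```python
from collections import defaultdict

# The five lexical tones of Thai.
THAI_TONES = ("mid", "low", "falling", "high", "rising")


def split_by_tone(contexts):
    """Group (label, tone) pairs into one pool per tone.

    Assumption: each pool would then be clustered by its own decision
    tree, so models of different tones never share a leaf. This is an
    illustration, not the procedure used in the paper.
    """
    pools = defaultdict(list)
    for label, tone in contexts:
        if tone not in THAI_TONES:
            raise ValueError(f"unknown tone: {tone!r}")
        pools[tone].append(label)
    return dict(pools)


# Hypothetical context labels, invented for illustration only.
contexts = [("k-aa+n", "mid"), ("m-aa+j", "rising"), ("p-ii+t", "mid")]
pools = split_by_tone(contexts)
```

Keeping the pools disjoint is one way to read "tone-separated": because no leaf node can mix models that carry different tones, the clustering step cannot average F0 patterns across tones.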

Keywords

  • Tone correctness
  • speaker-dependent
  • speaker-independent
  • hidden Markov models
  • speech synthesis