Thai Expressive Speech Processing Technology: A Review

Suphattharachai Chomphan

doi:10.3844/ajassp.2012.293.299

Research Article Open Access

Suphattharachai Chomphan¹

¹ Department of Electrical Engineering, Faculty of Engineering at Si Racha, Kasetsart University, 199 M.6, Tungsukhla, Si Racha, Chonburi, 20230, Thailand

Abstract

Problem statement: Thai expressive (emotional) speech has been studied for years, and most of these studies aim to analyze its characteristics. However, no comprehensive review of this work has been available to support further research on speech technology or applications for Thai expressive speech. Approach: Research on Thai expressive speech was reviewed from three aspects: an analysis of fundamental frequency (F₀) contours using Fujisaki’s model, an analysis of F₀ contours using the structural model and speech compression in noisy environments. Four speaking styles were considered: enjoyable, sad, angry and reading. Results: Two successful F₀ models were compared. One is based on Fujisaki’s model, which has been applied to many tonal and non-tonal languages. The other is based on the structural model, which has been applied primarily to Mandarin Chinese. In addition, a study of speech compression for noise-corrupted Thai expressive speech using two coding methods, CS-ACELP and MP-CELP, was summarized. Conclusion: Both mathematical models have been successfully applied to model the fundamental frequency contour of Thai expressive speech. For speech compression, the coding method, the type of noise, the noise level and the speaker's gender all influence the quality of the coded speech.

American Journal of Applied Sciences

Volume 9 No. 3, 2012, 293-299

Submitted On: 29 September 2011

Published On: 12 January 2012

How to Cite: Chomphan, S. (2012). Thai Expressive Speech Processing Technology: A Review. American Journal of Applied Sciences, 9(3), 293-299. https://doi.org/10.3844/ajassp.2012.293.299

Copyright: © 2012 Suphattharachai Chomphan. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Linear Prediction (LP)
fundamental frequency contours
speech gender influence
coding speech quality
expressive speech
fundamental frequency
rhythmic structures
Conjugate Structure Algebraic Code Excited Linear Predictive (CS-ACELP)