Improved Statistical Speech Segmentation Using Connectionist Approach

M. S. Salam; Dzulkifli Mohamad; S. H. Salleh

doi:10.3844/jcssp.2009.275.282

Research Article Open Access

Abstract

Problem statement: Speech segmentation is an important part of speech recognition, synthesis and coding. The statistical approach detects segmentation points by computing the spectral distortion of the signal, without prior knowledge of the acoustic information. It has been shown to give good matches and few omissions but many insertions, and these insertion points lower segmentation accuracy. Approach: This study proposed a fusion of statistical and connectionist approaches, namely the divergence algorithm and a Multi Layer Perceptron (MLP) with adaptive learning, for segmentation of Malay connected digits, with the aim of improving the statistical approach by detecting insertion points. The neural network was optimized by trial and error to find suitable parameters and speech time normalization methods. The best neural network classifier was then fused with the divergence algorithm to perform segmentation. Results: The experiments showed that the best neural network classifier used a learning rate of 1.0 and a momentum rate of 0.9, with data normalization based on zero-padding. Segmentation using the fusion of the statistical and connectionist approaches reduced insertion points by up to 10.4% while keeping match points above 99% and omission points below 0.7% within a time tolerance of 0.09 seconds. Conclusion: The segmentation results of the proposed fusion method indicate the potential of the connectionist approach for improving continuous segmentation by the statistical approach.
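The match, omission and insertion figures quoted in the abstract depend on how detected boundaries are compared with reference boundaries inside the time tolerance. The abstract does not spell out the scoring procedure, so the following is only a minimal sketch under assumed definitions: a reference boundary counts as a match if an unused detected boundary lies within the tolerance, an unmatched reference boundary is an omission, and an unmatched detected boundary is an insertion. The function name `score_boundaries` and the greedy nearest-neighbour pairing are illustrative choices, not the authors' method.

```python
def score_boundaries(reference, detected, tolerance=0.09):
    """Count matches, omissions and insertions (assumed definitions).

    reference, detected: boundary times in seconds.
    tolerance: maximum allowed distance in seconds for a match.
    """
    reference = sorted(reference)
    detected = sorted(detected)
    used = [False] * len(detected)  # each detected point may match only once
    matches = 0

    for ref in reference:
        best = None
        for i, det in enumerate(detected):
            if used[i]:
                continue
            dist = abs(det - ref)
            # Keep the closest unused detected point within the tolerance.
            if dist <= tolerance and (best is None or dist < abs(detected[best] - ref)):
                best = i
        if best is not None:
            used[best] = True
            matches += 1

    omissions = len(reference) - matches   # reference points with no detection
    insertions = len(detected) - matches   # detections with no reference point
    return matches, omissions, insertions


if __name__ == "__main__":
    # Hypothetical boundary times, for illustration only.
    ref = [0.50, 1.20, 2.00]
    det = [0.52, 0.90, 1.25, 2.30]
    print(score_boundaries(ref, det))
```

Under these assumptions, a classifier that rejects spurious detections lowers the insertion count, but it risks raising omissions if it also rejects a true boundary. This is why the abstract reports all three figures together.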

Journal of Computer Science

Volume 5 No. 4, 2009, 275-282

DOI: https://doi.org/10.3844/jcssp.2009.275.282

Submitted On: 2 April 2009
Published On: 30 April 2009

How to Cite: Salam, M. S., Mohamad, D. & Salleh, S. H. (2009). Improved Statistical Speech Segmentation Using Connectionist Approach. Journal of Computer Science, 5(4), 275-282. https://doi.org/10.3844/jcssp.2009.275.282

Copyright: © 2009 M. S. Salam, Dzulkifli Mohamad and S. H. Salleh. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Speech segmentation
Speech recognition
Divergence algorithm
Neural network