An Arabic Text-To-Speech System Based on Artificial Neural Networks

Ghadeer Al-Said; Moussa Abdallah

doi:10.3844/jcssp.2009.207.213

Research Article Open Access

An Arabic Text-To-Speech System Based on Artificial Neural Networks

Ghadeer Al-Said and Moussa Abdallah

Abstract

Problem statement: With the rapid advancement in information technology and communications, computer systems increasingly offer the users the opportunity to interact with information through speech. The interest in speech synthesis and in building voices is increasing. Worldwide, speech synthesizers have been developed for many popular languages English, Spanish and French and many researches and developments have been applied to those languages. Arabic on the other hand, has been given little attention compared to other languages of similar importance and the research in Arabic is still in its infancy. Based on these ideas, we introduced a system to transform Arabic text that was retrieved from a search engine into spoken words. Approach: We designed a text-to-speech system in which we used concatenative speech synthesis approach to synthesize Arabic text. The synthesizer was based on artificial neural networks, specifically the unsupervised learning paradigm. Different sizes of speech units had been used to produce spoken utterances, which are words, diphones and triphones. We also built a dictionary of 500 common words of Arabic. The smaller speech units (diphones and triphones) used for synthesis were chosen to achieve unlimited vocabulary of speech, while the word units were used for synthesizing limited set of sentences. Results: The system showed very high accuracy in synthesizing the Arabic text and the output speech was highly intelligible. For the word and diphone unit experiments, we could reach an accuracy of 99% while for the triphone units we reached an accuracy of 86.5%. Conclusion: An Arabic text-to-speech synthesizer was built with the ability to produce unlimited number of words with high quality voice.

Journal of Computer Science

Volume 5 No. 3, 2009, 207-213

DOI: https://doi.org/10.3844/jcssp.2009.207.213

Submitted On: 20 March 2009 Published On: 31 March 2009

How to Cite: Al-Said, G. & Abdallah, M. (2009). An Arabic Text-To-Speech System Based on Artificial Neural Networks. Journal of Computer Science, 5(3), 207-213. https://doi.org/10.3844/jcssp.2009.207.213

Copyright: © 2009 Ghadeer Al-Said and Moussa Abdallah. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

4,569 Views
3,850 Downloads
12 Citations

Download

Keywords

Artificial neural networks
text-to-speech synthesis
concatenative synthesis
signal processing