Research Article Open Access

An Arabic Text-To-Speech System Based on Artificial Neural Networks

Ghadeer Al-Said and Moussa Abdallah

Abstract

Problem statement: With the rapid advancement in information technology and communications, computer systems increasingly offer the users the opportunity to interact with information through speech. The interest in speech synthesis and in building voices is increasing. Worldwide, speech synthesizers have been developed for many popular languages English, Spanish and French and many researches and developments have been applied to those languages. Arabic on the other hand, has been given little attention compared to other languages of similar importance and the research in Arabic is still in its infancy. Based on these ideas, we introduced a system to transform Arabic text that was retrieved from a search engine into spoken words. Approach: We designed a text-to-speech system in which we used concatenative speech synthesis approach to synthesize Arabic text. The synthesizer was based on artificial neural networks, specifically the unsupervised learning paradigm. Different sizes of speech units had been used to produce spoken utterances, which are words, diphones and triphones. We also built a dictionary of 500 common words of Arabic. The smaller speech units (diphones and triphones) used for synthesis were chosen to achieve unlimited vocabulary of speech, while the word units were used for synthesizing limited set of sentences. Results: The system showed very high accuracy in synthesizing the Arabic text and the output speech was highly intelligible. For the word and diphone unit experiments, we could reach an accuracy of 99% while for the triphone units we reached an accuracy of 86.5%. Conclusion: An Arabic text-to-speech synthesizer was built with the ability to produce unlimited number of words with high quality voice.

Journal of Computer Science
Volume 5 No. 3, 2009, 207-213

DOI: https://doi.org/10.3844/jcssp.2009.207.213

Submitted On: 20 March 2009 Published On: 31 March 2009

How to Cite: Al-Said, G. & Abdallah, M. (2009). An Arabic Text-To-Speech System Based on Artificial Neural Networks. Journal of Computer Science, 5(3), 207-213. https://doi.org/10.3844/jcssp.2009.207.213

  • 2,937 Views
  • 2,922 Downloads
  • 11 Citations

Download

Keywords

  • Artificial neural networks
  • text-to-speech synthesis
  • concatenative synthesis
  • signal processing