Enhancing Indian Language Speech Recognition Systems with Language-Independent Phonetic Script: An Experimental Exploration
- 1 Department of Computer Science and Engineering, Noorul Islam Centre for Higher Education, Kumarakoil, Kanyakumari District, Tamilnadu, India
Abstract
India has a distinct linguistic profile that makes it extremely challenging to train Automatic Speech Recognition (ASR) systems correctly because most Indian languages have limited training material and are generally low-resource in nature. However, due to their similarity in phonemes, these languages offer an opportunity to create a single speech recognition technology. This study proposes an optimized phonetic script which can be generalized for all major Indian languages. To prove the efficiency of the phonetic script, speech recognition models using phonetic and language scripts in Hindi and Malayalam language were created using a wave2Vec2-based Deep Neural Network (DNN) model via transfer learning. Furthermore, a model based on Long Short-Term Memory (LSTM) is created to translate phonetic script text back into its original languages. The findings show that the phonetic script ASR model performed noticeably better than the language-specific model, reducing WER roughly to 2%, especially for the Hindi language, which is further reduced up to 1% for the model trained with mixed language. This demonstrates the model's ability to improve performance by using cross-lingual phonetic similarities. This study establishes the foundation for cross-linguistic, scalable ASR systems that use phonetic similarities to enhance ASR performance in low-resource language contexts across India.
DOI: https://doi.org/10.3844/jcssp.2025.1176.1186
Copyright: © 2025 Jose Stephan and Muthayyan Kamalam Jayakumar. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 54 Views
- 12 Downloads
- 0 Citations
Download
Keywords
- Deep Neural Network
- End-to-End Automatic Speech Recognition
- Language Independent Pronunciation Script
- Speech Recognition for Indian Languages
- Transformer-based Wave2Vec2