Research Article Open Access

Enhancing Indian Language Speech Recognition Systems with Language-Independent Phonetic Script: An Experimental Exploration

Jose Stephan1 and Muthayyan Kamalam Jayakumar1
  • 1 Department of Computer Science and Engineering, Noorul Islam Centre for Higher Education, Kumarakoil, Kanyakumari District, Tamilnadu, India

Abstract

India has a distinct linguistic profile that makes it extremely challenging to train Automatic Speech Recognition (ASR) systems correctly because most Indian languages have limited training material and are generally low-resource in nature. However, due to their similarity in phonemes, these languages offer an opportunity to create a single speech recognition technology. This study proposes an optimized phonetic script which can be generalized for all major Indian languages. To prove the efficiency of the phonetic script, speech recognition models using phonetic and language scripts in Hindi and Malayalam language were created using a wave2Vec2-based Deep Neural Network (DNN) model via transfer learning. Furthermore, a model based on Long Short-Term Memory (LSTM) is created to translate phonetic script text back into its original languages. The findings show that the phonetic script ASR model performed noticeably better than the language-specific model, reducing WER roughly to 2%, especially for the Hindi language, which is further reduced up to 1% for the model trained with mixed language. This demonstrates the model's ability to improve performance by using cross-lingual phonetic similarities. This study establishes the foundation for cross-linguistic, scalable ASR systems that use phonetic similarities to enhance ASR performance in low-resource language contexts across India.

Journal of Computer Science
Volume 21 No. 5, 2025, 1176-1186

DOI: https://doi.org/10.3844/jcssp.2025.1176.1186

Submitted On: 3 August 2024 Published On: 9 May 2025

How to Cite: Stephan, J. & Jayakumar, M. K. (2025). Enhancing Indian Language Speech Recognition Systems with Language-Independent Phonetic Script: An Experimental Exploration. Journal of Computer Science, 21(5), 1176-1186. https://doi.org/10.3844/jcssp.2025.1176.1186

  • 54 Views
  • 12 Downloads
  • 0 Citations

Download

Keywords

  • Deep Neural Network
  • End-to-End Automatic Speech Recognition
  • Language Independent Pronunciation Script
  • Speech Recognition for Indian Languages
  • Transformer-based Wave2Vec2