Research Article Open Access

A Deep Learning Approach for Telugu Domain Identification with Multichannel LSTM-CNN

Buddha Hari Kumar1, Chitra Perumal1, Inakoti Ramesh Raja2, Chukka Ramesh Babu3, Srinivas Rao Gorre4 and Santosh Tripurana5
  • 1 Electronics and Communication Engineering Department, Sathyabama Institute of Science and Technology, Chennai, India
  • 2 Department of Electronics and Communication Engineering, Aditya University, Surampalem, India
  • 3 Department of Electronics and Communication Engineering, Vignan's Institute Of Information Technology, Visakhapatnam, India
  • 4 Department of Electronics and Communication Engineering, Vasavi College of Engineering, Hyderabad, Telangana, India
  • 5 Department of Electronics and Communication Engineering , Vignan's Institute of Engineering for Women, Visakhapatnam, India

Abstract

The vast growth of textual data has ushered into the limelight, a plethora of applications in information retrieval and natural language processing (NLP). Proper extraction of information from text is heavily dependent on recognizing the thematic content, which becomes crucial in the tasks of document summarization, information extraction, question answering, machine translation, and sentiment analysis. The great complexity of this challenge arises for regional languages such as Telugu, where unique linguistic features demand specialized approaches. In this work, we propose a Telugu Technical Domain Identification model based on a Multichannel Long Short-Term Memory Convolutional Neural Network (LSTM-CNN) architecture. This methodology benefits from the sequential data treatment capabilities of LSTM combined with the local feature extractive powers of CNN, which enable effective domain identification in Telugu texts. The model was assessed at the ICON Shared Challenge "TechDOfication 2020," scoring an F1 score of 90.01% on the validation set and 69.90% on the test set. The results indicate a great improvement over conventional models and show the tremendous efficacy of multichannel deep learning techniques for domain identification in Telugu. The proposed model will serve as a milestone toward enhancing NLP applications for regional languages while providing a scalable solution to the heightened demands for accurate thematic classification of techno-domain risks.

Journal of Computer Science
Volume 21 No. 9, 2025, 2181-2190

DOI: https://doi.org/10.3844/jcssp.2025.2181.2190

Submitted On: 30 January 2025 Published On: 26 November 2025

How to Cite: Kumar, B. H., Perumal, C., Raja, I. R., Babu, C. R., Gorre, S. R. & Tripurana, S. (2025). A Deep Learning Approach for Telugu Domain Identification with Multichannel LSTM-CNN. Journal of Computer Science, 21(9), 2181-2190. https://doi.org/10.3844/jcssp.2025.2181.2190

  • 94 Views
  • 17 Downloads
  • 0 Citations

Download

Keywords

  • Natural Language Processing (NLP)
  • Multichannel LSTMCNN
  • Long Short-Term Memory (LSTM)
  • Text Summarization
  • Multilingual Text Processing
  • F1 Score