Research Article Open Access

A Generic Syntax Analyzer for the Computerization of African Languages

Yahaya Morou Ganda1, Harouna Naroua1 and Bachir Moussa Idi1
  • 1 Département de Mathématiques et Informatique, Faculté des Sciences et Techniques, Université Abdou Moumouni, Niamey, Niger

Abstract

This article enters within the framework of a contribution to the computerization of African Languages. It presents a generic parsing tool designed for West African languages despite their individual specificities. The genericity of the tool resides in the fact that it is adaptable to any West African language while using a minimum of resources. It only needs an etiquette dictionary and context-free grammar. From these two resources, the tool is designed to generate an LR (1) parser. The choice of this technique is based on its supremacy over other techniques despite its disadvantage of generating an exponentially large parsing table. However, to reduce the size of the automaton while maintaining the power of the technique, the system brings together dictionary entries depending on their type, gender, and number in the form of terminals. The results obtained show that it is possible to generate a relatively small and efficient analysis table instead of a large one. An application of the proposed tool to the Hausa nominal phrases has succeeded in reducing the number of terminals from more than 10,000 to only 88. Instead of having an automaton with thousands of states, we end up with an automaton with only 198 states.

Journal of Computer Science
Volume 21 No. 4, 2025, 810-816

DOI: https://doi.org/10.3844/jcssp.2025.810.816

Submitted On: 13 August 2024 Published On: 4 March 2025

How to Cite: Ganda, Y. M., Naroua, H. & Idi, B. M. (2025). A Generic Syntax Analyzer for the Computerization of African Languages. Journal of Computer Science, 21(4), 810-816. https://doi.org/10.3844/jcssp.2025.810.816

  • 67 Views
  • 46 Downloads
  • 0 Citations

Download

Keywords

  • Natural Language Processing
  • Syntax Analysis
  • Automaton
  • Generic Tool
  • African Language