Research Article Open Access

A Hybrid Approach to Pronominal Anaphora Resolution in Arabic

Abdullatif Abolohom1 and Nazlia Omar1
  • 1 University Kebangsaan Malaysia, Malaysia

Abstract

One of the challenges in natural language processing is to determine which pronouns to be referred to their intended referents in the discourse. Performing anaphora resolution is considered as an important task for a number of natural language processing applications such as information extraction, question answering and text summarization. Most of the earlier works of anaphora resolution have been applied to English and other languages. However, the work done in Arabic is not sufficiently studied. In this study, a hybrid approach that combines different architectures for resolving pronominal anaphora in Arabic language is presented. The hybrid model adopted the strategy based on the combination of a rule-based and machine learning approach. The collection of anaphora and respective possible antecedents was identified in a rule-based manner with morphological information taken into account. In addition, the selection of the most probable candidate as the antecedent of the anaphor was done by machine learning based on a k-Nearest Neighbor (k-NN) approach. In this study, the appropriate features to be used in this task were determined and their effect on the performance of anaphora resolution was investigated. Experiments of the proposed method were performed using the corpus of the Quran annotated with pronominal anaphora. The experimental results indicate that the proposed hybrid approach is completely reasonable and feasible for Arabic pronominal anaphora resolution.

Journal of Computer Science
Volume 11 No. 5, 2015, 764-771

DOI: https://doi.org/10.3844/jcssp.2015.764.771

Submitted On: 7 March 2015 Published On: 17 August 2015

How to Cite: Abolohom, A. & Omar, N. (2015). A Hybrid Approach to Pronominal Anaphora Resolution in Arabic. Journal of Computer Science, 11(5), 764-771. https://doi.org/10.3844/jcssp.2015.764.771

  • 2,536 Views
  • 2,182 Downloads
  • 6 Citations

Download

Keywords

  • Natural Language Processing
  • Anaphora Resolution
  • Machine Learning Approach
  • Rule-Based Approach