Arabic Poetry Authorship Attribution using Machine Learning Techniques
- 1 IBB University, Yemen
- 2 FSTM Université Hassan II, Morocco
- 3 Institut National des Postes et Télécommunications, Morocco
Published On: 27 July 2019
Copyright: © 2020 Al-Falahi Ahmed, Ramdani Mohamed and Bellafkih Mostafa. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
In this study, authorship attribution in Arabic poetry will be conducted to determine the authorship of a specified text after documents with recognized authorships have been allocated. This work also measures the impact performance of Naïve Bayes, Support Vector Machine and Linear discriminant analysis for Arabic poetry authorship attribution using text mining classification. Several features such as lexical features, character features, structural features, poetry features, syntactic features, semantic features and specific word features are utilized as the input data for text mining, using classification algorithms Linear discriminant analysis, Support Vector Machine and Naïve Bayes by Arabic Poetry Authorship Attribution Model (APAAM). The dataset of Arabic poetry is divided into two sets: known poetic in training dataset texts and anonymous poetic texts in a test dataset part. In the experiment, a set of 114 random poets from entirely different eras are used. The highest performance accuracy value is 99, 12%; the performance rate at the attribute level is 98.246%; the level of techniques is 92.836%.
- Authorship Attribution
- Arabic Poetry
- Text Classification
- Machine Learning Techniques