Research Article Open Access

Categorizer Agent for Electronic Computer Science Academic Papers

Khalifa Chekima1 and Patricia Anthony1
  • 1 University Malaysia Sabah, Malaysia

Abstract

Problem statement: With the rapid development of World Wide Web (WWW), a huge amount of information is now accessible to the web users. This phenomenon has attracted academic users to publish their research papers online, at the same time downloading and sharing academic papers among them through WWW. Categorizing a document manually can take up considerable amount of user’s time whereby user will have to read each of the documents to decide which category it is suitable. Approach: Our research study proposes the use of set of terms stored in a database to categorize computer science papers. The categorizer agent focuses on categorizing the text document into predetermined categories based on the extracted keyword. Results: We have evaluated our document categorizer agent on a number of computer science papers. The categorization process is done by parsing the document, calculating the frequency of each term and matching the terms found in the database. Conclusion: The Categorizer Agent proposed in this research paper is evaluated as a good approach to categorize electronic papers. Moreover, the results indicated that the use of this term database is a sustainable way to categorize computer science electronic documents.

American Journal of Economics and Business Administration
Volume 3 No. 1, 2011, 213-218

DOI: https://doi.org/10.3844/ajebasp.2011.213.218

Submitted On: 19 August 2009 Published On: 31 January 2011

How to Cite: Chekima, K. & Anthony, P. (2011). Categorizer Agent for Electronic Computer Science Academic Papers. American Journal of Economics and Business Administration, 3(1), 213-218. https://doi.org/10.3844/ajebasp.2011.213.218

  • 4,475 Views
  • 3,009 Downloads
  • 0 Citations

Download

Keywords

  • Artificial intelligence
  • information retrieval
  • document categorization
  • data mining
  • Support Vector Machine (SVM)
  • Artificial Neural Networks (ANN)
  • Generalized Instance Set (GIS)
  • Self Organizing Map (SOM)