Journal of Computer Science

An Automatic Topic Identification Algorithm

Hossein Shahsavand Baghdadi and Bali Ranaivo-Malan

DOI : 10.3844/jcssp.2011.1363.1367

Journal of Computer Science

Volume 7, Issue 9

Pages 1363-1367

Abstract

Problem statement: Topic is a stream of words which stands for the content of a text. Knowing the topic of a document can help people to be aware from its content and facilitate their searching process. Approach: This paper proposes an automatic algorithm to identify the topic for a textual document based on the chunks corresponding to each sentences in the document. Results and conclusion: We achieved 86% matching for both total and partial matching in our experimental data sample.

Copyright

© 2011 Hossein Shahsavand Baghdadi and Bali Ranaivo-Malan. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.