An Automatic Topic Identification Algorithm
Hossein Shahsavand Baghdadi and Bali Ranaivo-Malan
DOI : 10.3844/jcssp.2011.1363.1367
Journal of Computer Science
Volume 7, Issue 9
Problem statement: Topic is a stream of words which stands for the content of a text. Knowing the topic of a document can help people to be aware from its content and facilitate their searching process. Approach: This paper proposes an automatic algorithm to identify the topic for a textual document based on the chunks corresponding to each sentences in the document. Results and conclusion: We achieved 86% matching for both total and partial matching in our experimental data sample.
© 2011 Hossein Shahsavand Baghdadi and Bali Ranaivo-Malan. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.