Research Article Open Access

An Ontological Crawling Approach for Improving Information Aggregation over eGovernment Websites

Heru Agus Santoso¹, Junta Zeniarja¹, Ardytha Luthfiarta¹ and Bima Jati Wijaya¹
  • ¹ Dian Nuswantoro University, Indonesia

Abstract

E-Government applications in developing countries still lag behind those in advanced countries. For example, the use of information integration for Web portal content is still very limited. This paper proposes an automated approach for information aggregation over e-Government portals using an ontological approach. The study uses data obtained from 10 local government Websites in Central Java province, Indonesia. The data, in the form of HTML Web document text, meta-data, hyperlinks and other rich content, are effectively crawled. This paper focuses on the development of a crawler, which consists of two main modules, i.e., a multi-thread downloader and a scheduler. The use of ontology in the focused crawler produces a more effective result than the Breadth First Search (BFS) approach, as it reaches 37% effectiveness in terms of the number of relevant documents downloaded.

Journal of Computer Science
Volume 12 No. 9, 2016, 455-463

DOI: https://doi.org/10.3844/jcssp.2016.455.463

Submitted On: 3 January 2016
Published On: 25 November 2016

How to Cite: Santoso, H. A., Zeniarja, J., Luthfiarta, A. & Wijaya, B. J. (2016). An Ontological Crawling Approach for Improving Information Aggregation over eGovernment Websites. Journal of Computer Science, 12(9), 455-463. https://doi.org/10.3844/jcssp.2016.455.463

Keywords

  • E-Government
  • Information Aggregation
  • Focused Crawler
  • Semantic Web
  • Ontology