Research Article Open Access

Similarity Based Clustering with Indexing for Semi-Structured Document

S. Palanisamy and K. Baskaran

Abstract

Problem statement: To improve the performance of data retrieval in a homogeneous large XML document. Approach: Clustering of XML elements based on the content with indexing. The element which is used for clustering has been identified from the document and/or XML schema. This element is used as a parameter for clustering. The suitable index is created after clustering. Results: The clustering combined with indexing strategy support the efficient retrieval of XML element from the document. Conclusion: The proposed method is used to improve the efficiency of XML data manipulation and comparatively give the better performance rather than clustering or indexing alone.

Journal of Computer Science
Volume 8 No. 4, 2012, 545-550

DOI: https://doi.org/10.3844/jcssp.2012.545.550

Submitted On: 18 November 2011 Published On: 7 February 2012

How to Cite: Palanisamy, S. & Baskaran, K. (2012). Similarity Based Clustering with Indexing for Semi-Structured Document. Journal of Computer Science, 8(4), 545-550. https://doi.org/10.3844/jcssp.2012.545.550

  • 2,631 Views
  • 2,495 Downloads
  • 0 Citations

Download

Keywords

  • Clustering
  • indexing
  • XML
  • query