This paper is published in Volume-7, Issue-4, 2021
Area
Computer Science
Author
Kaushik I.
Org/Univ
National Institute of Technology Karnataka (NITK), Mangalore, Karnataka, India
Pub. Date
22 July, 2021
Paper ID
V7I4-1473
Publisher
Keywords
JSON, Cluster, Schema, Attributes, Similarity, Decision Tree, X-Path

Citationsacebook

IEEE
Kaushik I.. Clustering of JSON document using Derived X-Path Method, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Kaushik I. (2021). Clustering of JSON document using Derived X-Path Method. International Journal of Advance Research, Ideas and Innovations in Technology, 7(4) www.IJARIIT.com.

MLA
Kaushik I.. "Clustering of JSON document using Derived X-Path Method." International Journal of Advance Research, Ideas and Innovations in Technology 7.4 (2021). www.IJARIIT.com.

Abstract

In this modern era where data creation is in abundance, handling that data is the biggest question researchers have been trying to solve for the past couple of decades. The data in any form needs to be stored in such a way that the retrieval and manipulation of the data are easier and simpler. In this paper, we look at how huge datasets in JSON format are being analyzed and made into clusters for an easier understanding of the documents that we are handling. This process is done by the first pattern analyzing the documents followed by clustering those documents one by one based on similarity of the attributes (arrays and sub-documents included). The clusters are then evaluated on an algorithm designed specifically for this purpose. To conclude, this paper lays out the model on how to cluster huge JSON documents with varying levels and types.