UPM Institutional Repository

Analyses of indexing techniques on uncertain data with high dimensionality


Citation

Mohammed Lawal, Ma’aruf and Ibrahim, Hamidah and Mohd Sani, Nor Fazlida and Yaakob, Razali (2020) Analyses of indexing techniques on uncertain data with high dimensionality. IEEE Access, 8. 74101 - 74117. ISSN 2169-3536

Abstract

Deploying a solution for handling critical decision-based problem efficiently requires the processing of high-dimensional data. Over the years, due to modern technological advancement, unprecedented volume of uncertain data is been captured and this has necessitated the need to organize such data for better data access performance. To this effect, the use of indexing technique for supporting, organizing, and storing of uncertain data with high dimensionality has become pertinent. However, the choice of an indexing technique to improve search performance is highly influenced by the properties of the underlying data set, data construction methods employed by the indexing structure, and the query types it supports. This paper is motivated to conduct an extensive performance analysis among existing indexing techniques, namely: R-tree, R*-tree and X-tree, in order to realize the most efficient indexing structure for organizing, storing and ultimately improving search performance over uncertain data with high dimensionality. The results of the analyses with regard to CPU processing time and number of nodes visited clearly show the superiority of X-tree over R-tree and R*-tree, as its superiority holds for different data set sizes, data distributions, number of dimensions and even with varying selectivity ratio.


Download File

[img] Text (Abstract)
ABSTRACT.pdf

Download (5kB)
Official URL or Download Paper: https://ieeexplore.ieee.org/document/9069901

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
Publisher: Institute of Electrical and Electronics Engineers
Keywords: Data partitioning; Indexing techniques; MBR; Uncertain data; High-dimensional data
Depositing User: Ms. Nuraida Ibrahim
Date Deposited: 14 Jun 2022 08:35
Last Modified: 14 Jun 2022 08:35
URI: http://psasir.upm.edu.my/id/eprint/87851
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item