UPM Institutional Repository

Improving data reliability assessment in ETL processes through quality scoring technique in data analytics


Citation

Atika Razali, Nor Famiera and Baharom, Salmi and Abdullah, Salfarina and Admodisastro, Novia Indriaty (2024) Improving data reliability assessment in ETL processes through quality scoring technique in data analytics. International Journal on Informatics Visualization, 8 (4). pp. 2195-2202. ISSN 2549-9904

Abstract

Reliable data is the foundation of relevant and accurate data analysis. Techniques and measurements are essential to evaluate current data quality with respect to reliability and to establish a baseline for ongoing improvement initiatives. Without tools or visualizations, data engineers may find it challenging to monitor and maintain the reliability of the massive volumes of data coming from the extraction, transformation, and loading (ETL) process. Data reliability assessment is a helpful technique for analyzing data reliability and for reporting the present state of the data before any analytics begins. The proposed technique hinges on two components: the metrics and measurements that define data reliability, and a dashboard platform where the user sets the weight of the data and where the final output, the data reliability score, is displayed. The score obtained indicates whether the data needs improvement or whether an organization can proceed with data analytics. The technique takes into account the ETL procedures used to gather the datasets. Data significance, or weight, was determined according to the analytics needs and preferences, indicating an acceptable score for generating insights. Ultimately, the data reliability assessment metrics technique gives an overall picture of the data’s reliability in a single view based on the intended analysis. This new approach boosts confidence among data practitioners and stakeholders, especially those relying on findings generated from data analysis. Furthermore, the overview assists in enhancing the current state of the data, as the derived score helps identify possible areas of improvement in the ETL process. An assessment of the proposed technique's accuracy and efficiency in measuring data reliability also received positive feedback.
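
The abstract does not give the scoring formula, so the following is only a minimal sketch of how a user-weighted reliability score might be computed. It assumes a weighted average of per-metric scores in the range 0 to 1; the metric names, the weights, the threshold, and the functions `reliability_score` and `is_acceptable` are all hypothetical and are not taken from the paper.

```python
from typing import Dict


def reliability_score(metric_scores: Dict[str, float],
                      weights: Dict[str, float]) -> float:
    """Return the weighted average of per-metric scores.

    Assumption: each metric score lies in [0, 1] and each weight is a
    non-negative number chosen by the user. This is an illustrative
    formula, not the one defined in the paper.
    """
    if set(metric_scores) != set(weights):
        raise ValueError("metric_scores and weights must have the same keys")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("at least one weight must be positive")
    weighted_sum = sum(metric_scores[name] * weights[name]
                       for name in metric_scores)
    return weighted_sum / total_weight


def is_acceptable(score: float, threshold: float) -> bool:
    """Decide whether analytics can proceed for a user-chosen threshold."""
    return score >= threshold


# Hypothetical metrics and weights, for illustration only.
scores = {"completeness": 0.9, "consistency": 0.8}
weights = {"completeness": 3.0, "consistency": 1.0}
overall = reliability_score(scores, weights)
print(overall, is_acceptable(overall, threshold=0.85))
```

Under these assumptions, raising the weight of a metric makes the overall score track that metric more closely, which mirrors the abstract's idea of setting data weight according to the intended analysis.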


Download File

117927.pdf - Published Version
Available under License Creative Commons Attribution Share Alike.

Download (3MB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.62527/joiv.8.4.3632
Publisher: Politeknik Negeri Padang
Keywords: Data reliability; Extraction; Transformation; Data reliability metrics; Data weight
Depositing User: Ms. Zaimah Saiful Yazan
Date Deposited: 17 Jun 2025 02:53
Last Modified: 17 Jun 2025 02:53
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.62527/joiv.8.4.3632
URI: http://psasir.upm.edu.my/id/eprint/117927