UPM Institutional Repository

Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks


Citation

Weiyuan, Yu and Osman, Mohd Hafeez and Atan, Rodziah and Wan Ab Rahman, Wan Nurhayati (2024) Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks. International Journal on Advanced Science, Engineering and Information Technology, 14 (6). pp. 2059-2067. ISSN 2088-5334; eISSN: 2460-6952

Abstract

Conventional approaches, such as static load testing and synthetic monitoring, typically evaluate system performance under controlled conditions but do not fully capture the unpredictable scenarios encountered in real-world operations. For instance, static load testing involves applying a predetermined load to the system to measure performance metrics like response time and throughput, which may not reflect the variability and chaos of actual usage. Similarly, synthetic monitoring uses scripted transactions to check system availability and performance, but these scripts often lack the complexity and variability of real-world interactions. This research aims to overcome these limitations by utilizing advanced chaos engineering techniques to simulate a range of faults, including network latency, service crashes, resource exhaustion, message loss, and security attacks. The proposed tool integrates components for data generation, fault injection, storage, monitoring, and visualization, allowing for a thorough evaluation of system robustness. The methodology involves conducting controlled experiments within an AWS-based cloud-native IoT environment to assess the tool's effectiveness. These experiments demonstrate that the tool effectively identifies weaknesses in system resilience and improves overall robustness. By replicating real-world disruptions and analyzing system responses, the tool provides critical insights into the behavior of IoT devices under stress. The study concludes that this chaos engineering tool significantly enhances the ability to detect and address vulnerabilities, supporting creating more resilient IoT systems. Future work will expand the range of simulated faults, validate the tool across various cloud platforms, and incorporate additional real-time analysis features.


Download File

[img] Text
117862.pdf - Published Version
Available under License Creative Commons Attribution Share Alike.

Download (3MB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.18517/ijaseit.14.6.15956
Publisher: Insight Society
Keywords: Amazon Web Services (AWS); Chaos engineering; Cloud-native environments; Fault injection; IoT resilience
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 13 Jun 2025 08:06
Last Modified: 13 Jun 2025 08:06
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.18517/ijaseit.14.6.15956
URI: http://psasir.upm.edu.my/id/eprint/117862
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item