UPM Institutional Repository

Explainable deep learning framework for binary corrosion image classification using Grad-CAM


Citation

Aminudin, Muhammad Amir Imran and Abdullah, Mohd Na’im and Mustapha, Faizal and Eng, Kee Kok and Mustapha, Mazli and Mustapha, Aliyu (2025) Explainable deep learning framework for binary corrosion image classification using Grad-CAM. Sensors, 25 (22). art. no. 7070. pp. 1-27. ISSN 1424-8220

Abstract

Corrosion in metallic materials is a critical challenge in maintenance and safety, and traditional visual inspection methods are often time-consuming, labor-intensive, and dependent on human expertise, highlighting the need for more efficient and reliable solutions. Deep learning, particularly convolutional neural networks (CNNs), provides a promising approach by enabling automated and accurate image-based classification. This study investigates binary image classification of corrosion using four pre-trained CNN architectures, namely ResNet50, MobileNetV2, NASNetMobile, and EfficientNetV2B0, and integrates explainable artificial intelligence (XAI) techniques to provide interpretability and insight into each model’s decision-making process. A curated dataset of 4012 images, divided between corroded and non-corroded surfaces, was pre-processed, and augmented images resulted in a total of 9636 images used to train and evaluate the models. Performance was assessed through accuracy, confusion matrices, computational timing, receiver operating characteristic curves, precision–recall curves, and Cohen’s Kappa. In this paper, Gradient-weighted Class Activation Mapping (Grad-CAM) visualizations are incorporated as an XAI technique to provide interpretable insight into the model’s reasoning process, enabling clear identification of corrosion regions and offering justification for each prediction produced by the system. A key contribution of this work is the integration of Grad-CAM to enhance explainability. The results showed that EfficientNetV2B0 demonstrates stable training with minimal sign overfitting compared to other models. MobileNetV2 achieved the lowest time to train with the datasets given, and ResNet50 achieved the highest classification performance in terms of confusion matrix, with an accuracy of 96.58%. Through Grad-CAM reasoning, EfficientNetV2B0 shows a specific high activation towards corroded regions compared to the other three models that were evaluated.


Download File

[img] Text
126168.pdf - Published Version
Available under License Creative Commons Attribution.

Download (3MB)
Official URL or Download Paper: https://www.mdpi.com/1424-8220/25/22/7070

Additional Metadata

Item Type: Article
Subject: Analytical Chemistry
Subject: Information Systems
Subject: Atomic and Molecular Physics, and Optics
Divisions: Faculty of Engineering
DOI Number: https://doi.org/10.3390/s25227070
Publisher: Multidisciplinary Digital Publishing Institute (MDPI)
Keywords: Binary classification; Convolutional neural networks (CNN); Corrosion detection; Deep learning; Non-destructive testing
Sustainable Development Goals (SDGs): SDG 9: Industry, Innovation and Infrastructure, SDG 11: Sustainable Cities and Communities, SDG 12: Responsible Consumption and Production
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 18 Jun 2026 08:42
Last Modified: 18 Jun 2026 08:42
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.3390/s25227070
URI: http://psasir.upm.edu.my/id/eprint/126168
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item