UPM Institutional Repository

Effect of datasets size on the machine learning performance of the bagworm, Metisa plana (Walker) infestation using UAV remote sensing


Citation

Mohd Johari, Siti Nurul Afiah and Khairunniza-Bejo, Siti and Mohamed Shariff, Abdul Rashid and Husin, Nur Azuan and Mohd Masri, Mohamed Mazmira and Kamarudin, Noorhazwani (2024) Effect of datasets size on the machine learning performance of the bagworm, Metisa plana (Walker) infestation using UAV remote sensing. Journal of Plant Diseases and Protection, 132 (1). art. no. 52. pp. 1-17. ISSN 1861-3829; eISSN: 1861-3837

Abstract

A leaf-eating pest, Metisa plana (Lepidoptera: Psychidae), could cause 10–13% leaf defoliation and up to 40% crop losses, which would have a significant detrimental economic influence on Malaysian oil palm on yield production. A manual census was carried out to measure the current level of infestation; however, it became time-consuming when covering a large area. Unmanned aerial vehicles (UAVs) were chosen as the solution due to their rapid assess of the severity of the bagworm infestation. Nevertheless, there is a greater chance of unbalanced data when employing UAV imagery, which may be a problem when determining the degree of infestation. Therefore, this study evaluated the impact of both balanced and imbalanced infestation level data on machine learning classification performance via three combinations of vegetation indices: NDVI-NDRE, NDVI-GNDVI and NDRE-GNDVI. Resampling method was carried out using random oversampling (ROS), synthetic minority oversampling techniques (SMOTE), random undersampling (RUS), 3-interval undersampling and 5-interval undersampling. Results showed that the best performance with 86.84% successful classification of 100% F1-score using imbalanced data of 3-interval undersampling. Fine KNN was constantly well performed in classifying all infestation levels in NDVI-NDRE combination across all datasets. The results unequivocally show that the 66.67% reduction in the sample size increases the chances of successful classification, even in situations where the data are unbalanced.


Download File

[img] Text
117901.pdf - Published Version
Restricted to Repository staff only

Download (3MB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Engineering
Institute of Plantation Studies
Smart Farming Technology Research Centre
DOI Number: https://doi.org/10.1007/s41348-024-01020-x
Publisher: Springer Science and Business Media LLC
Keywords: Bagworm; Machine learning; Oversampling; SMOTE; UAV remote sensing; Undersampling
Depositing User: Ms. Nuraida Ibrahim
Date Deposited: 16 Jun 2025 07:50
Last Modified: 16 Jun 2025 07:50
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1007/s41348-024-01020-x
URI: http://psasir.upm.edu.my/id/eprint/117901
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item