UPM Institutional Repository

A review of CNN-based typical urban land cover segmentation techniques in multispectral remote sensing imagery


Citation

Zhao, Haimeng and Raihani, Mohamed and Ng, Seng Beng and Mohd, Ismail (2026) A review of CNN-based typical urban land cover segmentation techniques in multispectral remote sensing imagery. Sains Malaysiana, 55 (2). pp. 209-219. ISSN 0126-6039

Abstract

Compared with visible-light remote sensing, multispectral remote sensing provides multi-band land surface information and enhances spectral separability through data fusion, thereby enabling more accurate surface representation. However, spectral redundancy, resolution discrepancies, and highly complex urban environments impose greater challenges on existing methods. Deep learning approaches based on convolutional neural network (CNN) offer superior capabilities in extracting and integrating multispectral features, enabling more accurate urban land cover segmentation. This review focuses on pixel-level urban land cover segmentation and systematically summarizes recent advances in deep learning for multispectral remote sensing. First, we emphasize that the rich spectral information and spatial complementarity of multispectral data effectively enhance segmentation performance and alleviate ambiguities caused by the ‘same spectrum-different objects’ and ‘same object-different spectra’. Second, we review 19 publicly available multispectral datasets, highlighting differences in spectral bands, spatial resolution, and application scenarios, and summarize a standardized preprocessing pipeline including radiometric calibration, geometric correction, band normalization, and spectral dimensionality reduction to support reproducibility. Third, we discuss representative spectral-spatial feature extraction and cross-scale context modeling strategies, covering dilated convolution, 3D-2D hybrid structures, dual-branch architectures, and multi-scale enhancement modules. Extensive comparative experiments on ISPRS Potsdam and GID datasets further demonstrate the applicability and performance differences of representative models. Finally, future research trends and directions are discussed, encompassing multi-temporal and multi-scale temporal learning, cross-modal fusion, and the lightweight design of complex models.


Download File

[img] Text
124131.pdf - Published Version
Restricted to Repository staff only

Download (1MB)

Additional Metadata

Item Type: Article
Subject: Multidisciplinary
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.17576/jsm-2026-5502-03
Publisher: Penerbit Universiti Kebangsaan Malaysia
Keywords: Convolutional neural network (cnn); Multispectral features; Remote sensing data; Semantic segmentation; Surface feature extraction
Depositing User: Ms. Siti Radziah Mohamed@mahmod
Date Deposited: 13 Apr 2026 08:00
Last Modified: 13 Apr 2026 08:00
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.17576/jsm-2026-5502-03
URI: http://psasir.upm.edu.my/id/eprint/124131
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item