UPM Institutional Repository

A stacking ensemble framework integrating radiomics and deep learning for prognostic prediction in head and neck cancer


Citation

Wang, Bingzhen and Liu, Jinghua and Zhang, Xiaolei and Lin, Jianpeng and Li, Shuyan and Wang, Zhongxiao and Cao, Zhendong and Wen, Dong and Liu, Tiange and Harun Ramli, Hafiz Rashidi and Harith, Hazreen Haizi and Wan Hasan, Wan Zuha and Dong, Xianling (2025) A stacking ensemble framework integrating radiomics and deep learning for prognostic prediction in head and neck cancer. Radiation Oncology, 20 (1). art. no. 127. pp. 1-14. ISSN 1748-717X

Abstract

Background: Radiomics models frequently face challenges related to reproducibility and robustness. To address these issues, we propose a multimodal, multi-model fusion framework utilizing stacking ensemble learning for prognostic prediction in head and neck cancer (HNC). This approach seeks to improve the accuracy and reliability of survival predictions. Methods: A total of 806 cases from nine centers were collected; 143 cases from two centers were assigned as the external validation cohort, while the remaining 663 were stratified and randomly split into training (n = 530) and internal validation (n = 133) sets. Radiomics features were extracted according to IBSI standards, and deep learning features were obtained using a 3D DenseNet-121 model. Following feature selection, the selected features were input into Cox, SVM, RSF, DeepCox, and DeepSurv models. A stacking fusion strategy was employed to develop the prognostic model. Model performance was evaluated using Kaplan-Meier survival curves and time-dependent ROC curves. Results: On the external validation set, the model using combined PET and CT radiomics features achieved superior performance compared to single-modality models, with the RSF model obtaining the highest concordance index (C-index) of 0.7302. When using deep features extracted by 3D DenseNet-121, the PET + CT-based models demonstrated significantly improved prognostic accuracy, with Deepsurv and DeepCox achieving C-indices of 0.9217 and 0.9208, respectively. In stacking models, the PET + CT model using only radiomics features reached a C-index of 0.7324, while the deep feature-based stacking model achieved 0.9319. The best performance was obtained by the multi-feature fusion model, which integrated both radiomics and deep learning features from PET and CT, yielding a C-index of 0.9345. Kaplan–Meier survival analysis further confirmed the fusion model’s ability to distinguish between high-risk and low-risk groups. Conclusion: The stacking-based ensemble model demonstrates superior performance compared to individual machine learning models, markedly improving the robustness of prognostic predictions.


Download File

[img] Text
120354.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (3MB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Engineering
Faculty of Medicine and Health Science
DOI Number: https://doi.org/10.1186/s13014-025-02695-8
Publisher: BioMed Central
Keywords: DenseNet; Ensemble; Head and neck Cancer; Prognostic; Radiomics; Stacking
Depositing User: Ms. Zaimah Saiful Yazan
Date Deposited: 01 Oct 2025 00:18
Last Modified: 01 Oct 2025 00:18
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1186/s13014-025-02695-8
URI: http://psasir.upm.edu.my/id/eprint/120354
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item