UPM Institutional Repository

Steganalysis of adaptive multi-rate speech with unknown embedding rates using multi-scale transformer and multi-task learning mechanism


Citation

Sun, Congcong and Abdullah, Azizol and Samian, Normalia and Roslan, Nuur Alifah (2025) Steganalysis of adaptive multi-rate speech with unknown embedding rates using multi-scale transformer and multi-task learning mechanism. Journal of Cybersecurity and Privacy, 5 (2). art. no. 29. pp. 1-20. ISSN 2624-800X

Abstract

As adaptive multi-rate (AMR) speech applications become increasingly widespread, AMR-based steganography presents growing security risks. Conventional steganalysis methods often assume known embedding rates, limiting their practicality in real-world scenarios where embedding rates are unknown. To overcome this limitation, we introduce a novel framework that integrates a multi-scale transformer architecture with multi-task learning for joint classification and regression. The classification task effectively distinguishes between cover and stego samples, while the regression task enhances feature representation by predicting continuous embedding values, providing deeper insights into embedding behaviors. This joint optimization strategy improves model adaptability to diverse embedding conditions and captures the underlying relationships between discrete embedding classes and their continuous distributions. The experimental results demonstrate that our approach achieves higher accuracy and robustness than existing steganalysis methods across varying embedding rates.


Download File

[img] Text
123114.pdf - Published Version
Available under License Creative Commons Attribution.

Download (694kB)
Official URL or Download Paper: https://www.mdpi.com/2624-800X/5/2/29

Additional Metadata

Item Type: Article
Subject: Computer Science (miscellaneous)
Subject: Artificial Intelligence
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.3390/jcp5020029
Publisher: Multidisciplinary Digital Publishing Institute (MDPI)
Keywords: Multi-scale transformer; Multi-task learning; Steganalysis; Steganography; Embedding rates
Sustainable Development Goals (SDGs): SDG 9: Industry, Innovation and Infrastructure, SDG 16: Peace, Justice and Strong Institutions, SDG 11: Sustainable Cities and Communities
Depositing User: Ms. Siti Radziah Mohamed@mahmod
Date Deposited: 29 Apr 2026 03:16
Last Modified: 29 Apr 2026 03:16
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.3390/jcp5020029
URI: http://psasir.upm.edu.my/id/eprint/123114
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item