UPM Institutional Repository

A multi-layer CNN-GRUSKIP model based on transformer for spatial: temporal traffic flow prediction


Citation

Mohammad Ata, Karimeh Ibrahim and Hassan, Mohd Khair and Ismaeel, Ayad Ghany and Al-Haddad, Syed Abdul Rahman and Alquthami, Thamer‎ and Alani, Sameer (2024) A multi-layer CNN-GRUSKIP model based on transformer for spatial: temporal traffic flow prediction. Ain Shams Engineering Journal, 15 (12). art. no. 103045. pp. 1-17. ISSN 2090-4479

Abstract

Traffic flow prediction remains a cornerstone for intelligent transportation systems (ITS), influencing both route optimization and environmental efforts. While Recurrent Neural Networks (RNN) and traditional Convolutional Neural Networks (CNN) offer some insights into the spatial–temporal dynamics of traffic data, they're often limited when navigating sparse and extended spatial–temporal patterns. In response, the CNN-GRUSKIP model emerges as a pioneering approach. Notably, it integrates the GRU-SKIP mechanism, a hybrid model that leverages the Gate Recurrent Unit's (GRU) capabilities to process sequences with the ’SKIP’ feature's ability to bypass and connect longer temporal dependencies, making it especially potent for traffic flow predictions with erratic and extended patterns. Another distinctive aspect is its non-standard 6-layer CNN, meticulously designed for in-depth spatiotemporal correlation extraction. The model comprises (1) the specialized CNN feature extraction, (2) the GRU-SKIP enhanced long-temporal module adept at capturing extended patterns, (3) a transformer module employing encoder-decoder and multi-attention mechanisms to hone prediction accuracy and trim model complexity, and (4) a bespoke prediction module. When tested against real-world datasets from California's Caltrans Performance Measurement System (PeMS), specifically PeMS districts 4 and 8, the CNN-GRUSKIP consistently outperformed established models such as ARIMA, Graph Wave Net, HA, LSTM, STGCN, and APTN. With its potent predictive prowess and adaptive architecture, the CNN-GRUSKIP model stands to redefine ITS applications, especially where nuanced traffic dynamics are in play. © 2024 THE AUTHORS


Download File

[img] Text
114860.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (7MB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Engineering
DOI Number: https://doi.org/10.1016/j.asej.2024.103045
Publisher: Elsevier
Keywords: Convolutional neural network; GRU; Skip function; Spatial-temporal prediction; Traffic flow prediction; Transformer
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 05 Feb 2025 03:24
Last Modified: 05 Feb 2025 03:24
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1016/j.asej.2024.103045
URI: http://psasir.upm.edu.my/id/eprint/114860
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item