UPM Institutional Repository

Reinforcement learning in risk management for pharmaceutical construction projects: frontiers, challenges, and improvement strategies


Citation

Junjia, Yin and Jiawen, Liu and Alias, Aidi Hizami and Haron, Nuzul Azam and Abu Bakar, Nabilah (2025) Reinforcement learning in risk management for pharmaceutical construction projects: frontiers, challenges, and improvement strategies. Sustainable Futures, 10. art. no. 101534. pp. 1-18. ISSN 2666-1888

Abstract

The intelligent construction of pharmaceutical facilities faces dynamic and nonlinear risks, and traditional management methods struggle to meet the high demands for real-time response and compliance. However, the existing reinforcement learning (RL) research in this field still lacks systematic application architecture and industry governance considerations. Therefore, this paper reviews the practical applications of six algorithms—Deep Q-Network (DQN), Deep Deterministic Policy Gradient (DDPG), and Proximity Policy Optimization (PPO)—in construction safety, temperature control, resource scheduling, and automated equipment optimization, validating the potential of reinforcement learning to effectively manage dynamic risks through adaptive learning. Simultaneously, this paper accurately identifies key bottlenecks in current applications: the fidelity gap between the simulation environment and actual medical regulations, the lack of standardized reinforcement learning deployment procedures, and the ambiguity between algorithmic decision-making authority and human oversight responsibility. To address these issues, this paper pioneers a high-fidelity environment simulation scheme integrating multiple technologies, a standardized reinforcement learning application framework, and a clear rights and responsibilities governance system, providing crucial theoretical support and practical pathways for constructing a reliable and efficient paradigm for pharmaceutical facility construction risk management.


Download File

[img] Text
124575.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (5MB)

Additional Metadata

Item Type: Article
Subject: Sociology and Political Science
Subject: Management Science and Operations Research
Subject: Management of Technology and Innovation
Divisions: Faculty of Engineering
Faculty of Humanities, Management and Science
DOI Number: https://doi.org/10.1016/j.sftr.2025.101534
Publisher: Elsevier
Keywords: Deep deterministic policy gradient; Deep Q-network; Pharmaceutical construction projects; Proximal policy optimization; Reinforcement learning; Risk management
Sustainable Development Goals (SDGs): SDG 9: Industry, Innovation and Infrastructure, SDG 3: Good Health and Well-being, SDG 11: Sustainable Cities and Communities
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 20 Apr 2026 08:23
Last Modified: 20 Apr 2026 08:23
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1016/j.sftr.2025.101534
URI: http://psasir.upm.edu.my/id/eprint/124575
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item