Citation
Abstract
The intelligent construction of pharmaceutical facilities faces dynamic and nonlinear risks, and traditional management methods struggle to meet the high demands for real-time response and compliance. However, the existing reinforcement learning (RL) research in this field still lacks systematic application architecture and industry governance considerations. Therefore, this paper reviews the practical applications of six algorithms—Deep Q-Network (DQN), Deep Deterministic Policy Gradient (DDPG), and Proximity Policy Optimization (PPO)—in construction safety, temperature control, resource scheduling, and automated equipment optimization, validating the potential of reinforcement learning to effectively manage dynamic risks through adaptive learning. Simultaneously, this paper accurately identifies key bottlenecks in current applications: the fidelity gap between the simulation environment and actual medical regulations, the lack of standardized reinforcement learning deployment procedures, and the ambiguity between algorithmic decision-making authority and human oversight responsibility. To address these issues, this paper pioneers a high-fidelity environment simulation scheme integrating multiple technologies, a standardized reinforcement learning application framework, and a clear rights and responsibilities governance system, providing crucial theoretical support and practical pathways for constructing a reliable and efficient paradigm for pharmaceutical facility construction risk management.
Download File
Official URL or Download Paper: https://www.sciencedirect.com/science/article/pii/...
|
Additional Metadata
| Item Type: | Article |
|---|---|
| Subject: | Sociology and Political Science |
| Subject: | Management Science and Operations Research |
| Subject: | Management of Technology and Innovation |
| Divisions: | Faculty of Engineering Faculty of Humanities, Management and Science |
| DOI Number: | https://doi.org/10.1016/j.sftr.2025.101534 |
| Publisher: | Elsevier |
| Keywords: | Deep deterministic policy gradient; Deep Q-network; Pharmaceutical construction projects; Proximal policy optimization; Reinforcement learning; Risk management |
| Sustainable Development Goals (SDGs): | SDG 9: Industry, Innovation and Infrastructure, SDG 3: Good Health and Well-being, SDG 11: Sustainable Cities and Communities |
| Depositing User: | Ms. Nur Faseha Mohd Kadim |
| Date Deposited: | 20 Apr 2026 08:23 |
| Last Modified: | 20 Apr 2026 08:23 |
| Altmetrics: | http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1016/j.sftr.2025.101534 |
| URI: | http://psasir.upm.edu.my/id/eprint/124575 |
| Statistic Details: | View Download Statistic |
Actions (login required)
![]() |
View Item |
