UPM Institutional Repository

A multi-depot vehicle routing problem with stochastic road capacity and reduced two-stage stochastic integer linear programming models for rollout algorithm


Citation

Anuar, Wadi Khalid and Lee, Lai Soon and Seow, Hsin Vonn and Pickl, Stefan (2021) A multi-depot vehicle routing problem with stochastic road capacity and reduced two-stage stochastic integer linear programming models for rollout algorithm. Mathematics, 9 (13). art. no. 1572. pp. 1-44. ISSN 2227-7390

Abstract

A matheuristic approach based on a reduced two-stage Stochastic Integer Linear Programming (SILP) model is presented. The proposed approach is suitable for obtaining a policy constructed dynamically on the go during the rollout algorithm. The rollout algorithm is part of the Approximate Dynamic Programming (ADP) lookahead solution approach for a Markov Decision Processes (MDP) framed Multi-Depot Dynamic Vehicle Routing Problem with Stochastic Road Capacity (MDDVRPSRC). First, a Deterministic Multi-Depot VRP with Road Capacity (D-MDVRPRC) is presented. Then an extension, MDVRPSRC-2S, is presented as an offline two-stage SILP model of the MDDVRPSRC. These models are validated using small simulated instances with CPLEX. Next, two reduced versions of the MDVRPSRC-2S model (MDVRPSRC-2S1 and MDVRPSRC-2S2) are derived. They have a specific task in routing: replenishment and delivering supplies. These reduced models are to be utilised interchangeably depending on the capacity of the vehicle, and repeatedly during the execution of rollout in reinforcement learning. As a result, it is shown that a base policy consisting of an exact optimal decision at each decision epoch can be obtained constructively through these reduced two-stage stochastic integer linear programming models. The results obtained from the resulting rollout policy with CPLEX execution during rollout are also presented to validate the reduced model and the matheuristic algorithm. This approach is proposed as a simple implementation when performing rollout for the lookahead approach in ADP.


Download File

Full text not available from this repository.
Official URL or Download Paper: https://www.mdpi.com/2227-7390/9/13/1572

Additional Metadata

Item Type: Article
Divisions: Faculty of Science
Institute for Mathematical Research
DOI Number: https://doi.org/10.3390/math9131572
Publisher: Multidisciplinary Digital Publishing Institute
Keywords: Reinforcement learning; Approximate dynamic programming; Rollout algorithm; Matheuristic; Two-stage stochastic programming; Vehicle routing problem
Depositing User: Ms. Nuraida Ibrahim
Date Deposited: 05 Apr 2023 03:21
Last Modified: 05 Apr 2023 03:21
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.3390/math9131572
URI: http://psasir.upm.edu.my/id/eprint/95781
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item