A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties

Yuchang Zhang; Ruibin Bai; Rong Qu; Chaofan Tu; Jiahuan Jin

doi:10.1016/j.ejor.2021.10.032

School of Computer Science

Research output: Journal Publication › Article › peer-review

92 Citations (Scopus)

Abstract

In the past decade, considerable advances have been made in the field of computational intelligence and operations research. However, the majority of these optimisation approaches have been developed for deterministically formulated problems, the parameters of which are often assumed perfectly predictable prior to problem-solving. In practice, this strong assumption unfortunately contradicts the reality of many real-world problems which are subject to different levels of uncertainties. The solutions derived from these deterministic approaches can rapidly deteriorate during execution due to the over-optimisation without explicit consideration of the uncertainties. To address this research gap, a deep reinforcement learning based hyper-heuristic framework is proposed in this paper. The proposed approach enhances the existing hyper-heuristics with a powerful data-driven heuristic selection module in the form of deep reinforcement learning on parameter-controlled low-level heuristics, to substantially improve their handling of uncertainties while optimising across various problems. The performance and practicality of the proposed hyper-heuristic approach have been assessed on two combinatorial optimisation problems: a real-world container terminal truck routing problem with uncertain service times and the well-known online 2D strip packing problem. The experimental results demonstrate its superior performance compared to existing solution methods for these problems. Finally, the increased interpretability of the proposed deep reinforcement learning hyper-heuristic has been exhibited in comparison with the conventional deep reinforcement learning methods.

Original language	English
Pages (from-to)	418-427
Number of pages	10
Journal	European Journal of Operational Research
Volume	300
Issue number	2
DOIs	https://doi.org/10.1016/j.ejor.2021.10.032
Publication status	Published - 16 Jul 2022

Keywords

2D packing
Container truck routing
Deep reinforcement learning
Hyper-heuristics
Transportation

ASJC Scopus subject areas

General Computer Science
Modelling and Simulation
Management Science and Operations Research
Information Systems and Management

Cite this

@article{ca94c5979a794d2e9a9e3e01d8eb25a0,
  title = "A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties",
  abstract = "In the past decade, considerable advances have been made in the field of computational intelligence and operations research. However, the majority of these optimisation approaches have been developed for deterministically formulated problems, the parameters of which are often assumed perfectly predictable prior to problem-solving. In practice, this strong assumption unfortunately contradicts the reality of many real-world problems which are subject to different levels of uncertainties. The solutions derived from these deterministic approaches can rapidly deteriorate during execution due to the over-optimisation without explicit consideration of the uncertainties. To address this research gap, a deep reinforcement learning based hyper-heuristic framework is proposed in this paper. The proposed approach enhances the existing hyper-heuristics with a powerful data-driven heuristic selection module in the form of deep reinforcement learning on parameter-controlled low-level heuristics, to substantially improve their handling of uncertainties while optimising across various problems. The performance and practicality of the proposed hyper-heuristic approach have been assessed on two combinatorial optimisation problems: a real-world container terminal truck routing problem with uncertain service times and the well-known online 2D strip packing problem. The experimental results demonstrate its superior performance compared to existing solution methods for these problems. Finally, the increased interpretability of the proposed deep reinforcement learning hyper-heuristic has been exhibited in comparison with the conventional deep reinforcement learning methods.",
  keywords = "2D packing, Container truck routing, Deep reinforcement learning, Hyper-heuristics, Transportation",
  author = "Yuchang Zhang and Ruibin Bai and Rong Qu and Chaofan Tu and Jiahuan Jin",
  note = "Publisher Copyright: {\textcopyright} 2021 Elsevier B.V.",
  year = "2022",
  month = jul,
  day = "16",
  doi = "10.1016/j.ejor.2021.10.032",
  language = "English",
  volume = "300",
  pages = "418--427",
  journal = "European Journal of Operational Research",
  issn = "0377-2217",
  publisher = "Elsevier B.V.",
  number = "2",
}

TY - JOUR
T1 - A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties
AU - Zhang, Yuchang
AU - Bai, Ruibin
AU - Qu, Rong
AU - Tu, Chaofan
AU - Jin, Jiahuan
PY - 2022/7/16
Y1 - 2022/7/16
N2 - In the past decade, considerable advances have been made in the field of computational intelligence and operations research. However, the majority of these optimisation approaches have been developed for deterministically formulated problems, the parameters of which are often assumed perfectly predictable prior to problem-solving. In practice, this strong assumption unfortunately contradicts the reality of many real-world problems which are subject to different levels of uncertainties. The solutions derived from these deterministic approaches can rapidly deteriorate during execution due to the over-optimisation without explicit consideration of the uncertainties. To address this research gap, a deep reinforcement learning based hyper-heuristic framework is proposed in this paper. The proposed approach enhances the existing hyper-heuristics with a powerful data-driven heuristic selection module in the form of deep reinforcement learning on parameter-controlled low-level heuristics, to substantially improve their handling of uncertainties while optimising across various problems. The performance and practicality of the proposed hyper-heuristic approach have been assessed on two combinatorial optimisation problems: a real-world container terminal truck routing problem with uncertain service times and the well-known online 2D strip packing problem. The experimental results demonstrate its superior performance compared to existing solution methods for these problems. Finally, the increased interpretability of the proposed deep reinforcement learning hyper-heuristic has been exhibited in comparison with the conventional deep reinforcement learning methods.

KW - 2D packing
KW - Container truck routing
KW - Deep reinforcement learning
KW - Hyper-heuristics
KW - Transportation
UR - http://www.scopus.com/inward/record.url?scp=85118782370&partnerID=8YFLogxK
U2 - 10.1016/j.ejor.2021.10.032
DO - 10.1016/j.ejor.2021.10.032
M3 - Article
AN - SCOPUS:85118782370
SN - 0377-2217
VL - 300
SP - 418
EP - 427
JO - European Journal of Operational Research
JF - European Journal of Operational Research
IS - 2
ER -
