A deep reinforcement learning hyper-heuristic with feature fusion for online packing problems

Chaofan TU, Ruibin Bai, Uwe Aickelin, Yuchang ZHANG, Heshan Du

Research output: Journal Publication › Article › peer-review

20 Citations (Scopus)

Abstract

In recent years, deep reinforcement learning has shown great potential in solving computer games with sequential decision-making scenarios. A hyper-heuristic is a generic search framework capable of intelligently selecting or generating algorithms to solve a class of optimisation problems with stochastic or dynamic settings. This paper proposes a new general framework for solving online packing problems using deep reinforcement learning hyper-heuristics. Although analytical approaches can address most offline packing problems successfully, their online versions have proved much more challenging, and the performance of existing methods is often unsatisfactory. In this paper, we extend a recent deep reinforcement learning hyper-heuristic framework by fusing the visual information of real-time packing with distributional information about the random parameters of the problem. Computational experiments show that our method outperforms state-of-the-art online methods, with reductions in the optimality gap of 2%–19% for the knapsack problem and 0.7% for the online strip packing problem. In addition, a new visual analysis presentation is devised to better interpret the learned packing strategies, which can reveal more information than the widely used landscape analysis. As online packing problems are common in production environments, the proposed approach can serve as an important reference for solving other similar combinatorial optimisation problems for which visual layout inputs would aid learning.
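To make the terms "selection hyper-heuristic" and "optimality gap" concrete, the sketch below applies them to a toy online knapsack instance. It is not the authors' method: the deep reinforcement learning policy and the feature fusion described in the abstract are replaced here by a hand-written rule, and every name (`accept_if_fits`, `density_threshold`, `fill_based_selector`, `pack_online`, `offline_optimum`, `optimality_gap`), the threshold value, and the item data are assumptions made purely for illustration.

```python
from typing import Callable, List, Tuple

Item = Tuple[int, int]  # (weight, value); hypothetical toy representation


def accept_if_fits(item: Item, remaining: int) -> bool:
    """Low-level heuristic 1: take any item that still fits."""
    return item[0] <= remaining


def density_threshold(item: Item, remaining: int, threshold: float = 1.5) -> bool:
    """Low-level heuristic 2: take a fitting item only if value/weight >= threshold."""
    weight, value = item
    return weight <= remaining and value / weight >= threshold


def fill_based_selector(remaining: int, capacity: int) -> Callable[[Item, int], bool]:
    """Hand-written stand-in for a learned policy (assumption, not from the paper):
    be selective while more than half the capacity is free, then accept anything."""
    return density_threshold if remaining > capacity / 2 else accept_if_fits


def pack_online(items: List[Item], capacity: int) -> int:
    """Items arrive one at a time; each accept/reject decision is irrevocable."""
    remaining, total_value = capacity, 0
    for item in items:
        heuristic = fill_based_selector(remaining, capacity)  # choose a heuristic
        if heuristic(item, remaining):                        # apply it to the item
            remaining -= item[0]
            total_value += item[1]
    return total_value


def offline_optimum(items: List[Item], capacity: int) -> int:
    """Classic 0/1 knapsack dynamic programme with full knowledge of all items."""
    best = [0] * (capacity + 1)
    for weight, value in items:
        for c in range(capacity, weight - 1, -1):
            best[c] = max(best[c], best[c - weight] + value)
    return best[capacity]


def optimality_gap(online_value: int, optimal_value: int) -> float:
    """Relative shortfall of the online solution against the offline optimum."""
    return (optimal_value - online_value) / optimal_value


items = [(4, 4), (3, 6), (5, 5), (2, 5), (4, 3)]  # made-up arrival sequence
online = pack_online(items, capacity=10)
optimum = offline_optimum(items, capacity=10)
print(online, optimum, optimality_gap(online, optimum))
```

In a learned hyper-heuristic, the role played by `fill_based_selector` would be taken by a trained policy that maps the current packing state to a choice of low-level heuristic; the hand-written rule is used only so that the example stays small and deterministic.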

Original language	English
Pages (from-to)	120568
Journal	Expert Systems with Applications
Volume	230
Early online date	2 Jun 2023
Publication status	Published - 15 Nov 2023

Keywords

Hyper-heuristic
Deep reinforcement learning
Feature fusion
Knapsack problem
Strip packing problem

Access to Document

https://doi.org/10.1016/j.eswa.2023.120568

Licence: CC BY

Cite this

@article{c0c8ccec31104f8b8d21f40e1fc97123,
title = "A deep reinforcement learning hyper-heuristic with feature fusion for online packing problems",
abstract = "In recent years, deep reinforcement learning has shown great potential in solving computer games with sequential decision-making scenarios. Hyper-heuristic is a generic search framework, capable of intelligently selecting or generating algorithms to solve a class of optimisation problems with stochastic or dynamic settings. This paper proposes a new general framework for solving online packing problems using deep reinforcement learning hyper-heuristics. Although analytical approaches can address most offline packing problems successfully, their online versions have proved much more challenging and the performance of the existing methods is often not satisfactory. In this paper, we extend a recent deep reinforcement learning hyper-heuristic framework by fusing the visual information of real-time packing with distributional information of random parameters of the problem. Computational experiments show that our method outperforms the state of the art online methods with reductions in optimality gap between 2%–19% for knapsack problem and 0.7% for the online strip packing problem. In addition, a new visual analysis presentation is also devised to better interpret the learned packing strategies, which can reveal more information than the widely used landscape analysis. As online packing problems are widely available in production environments, the proposed approach can serve as an important reference to solve other similar combinatorial optimisation problems for which visual layout inputs would aid learning.",

keywords = "Hyper-heuristic, Deep reinforcement learning, Feature fusion, Knapsack problem, Strip packing problem",
author = "Chaofan TU and Ruibin Bai and Uwe Aickelin and Yuchang ZHANG and Heshan Du",
year = "2023",
month = nov,
day = "15",
language = "English",
volume = "230",
pages = "120568",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Ltd.",
}

TY - JOUR
T1 - A deep reinforcement learning hyper-heuristic with feature fusion for online packing problems
AU - TU, Chaofan
AU - Bai, Ruibin
AU - Aickelin, Uwe
AU - ZHANG, Yuchang
AU - Du, Heshan
PY - 2023/11/15
Y1 - 2023/11/15
N2 - In recent years, deep reinforcement learning has shown great potential in solving computer games with sequential decision-making scenarios. Hyper-heuristic is a generic search framework, capable of intelligently selecting or generating algorithms to solve a class of optimisation problems with stochastic or dynamic settings. This paper proposes a new general framework for solving online packing problems using deep reinforcement learning hyper-heuristics. Although analytical approaches can address most offline packing problems successfully, their online versions have proved much more challenging and the performance of the existing methods is often not satisfactory. In this paper, we extend a recent deep reinforcement learning hyper-heuristic framework by fusing the visual information of real-time packing with distributional information of random parameters of the problem. Computational experiments show that our method outperforms the state of the art online methods with reductions in optimality gap between 2%–19% for knapsack problem and 0.7% for the online strip packing problem. In addition, a new visual analysis presentation is also devised to better interpret the learned packing strategies, which can reveal more information than the widely used landscape analysis. As online packing problems are widely available in production environments, the proposed approach can serve as an important reference to solve other similar combinatorial optimisation problems for which visual layout inputs would aid learning.

KW - Hyper-heuristic
KW - Deep reinforcement learning
KW - Feature fusion
KW - Knapsack problem
KW - Strip packing problem
M3 - Article
SN - 0957-4174
VL - 230
SP - 120568
JO - Expert Systems with Applications
JF - Expert Systems with Applications
ER -
