Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Tianxiang Cui; Nanjiang Du; Xiaoying Yang; Shusheng Ding

doi:10.1016/j.techfore.2023.122944

Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Tianxiang Cui, Nanjiang Du, Xiaoying Yang, Shusheng Ding

Research output: Journal Publication › Article › peer-review

33 Citations (Scopus)

Abstract

Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors’ appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data-driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.

Original language	English
Article number	122944
Journal	Technological Forecasting and Social Change
Volume	198
DOIs	https://doi.org/10.1016/j.techfore.2023.122944
Publication status	Published - Jan 2024

Keywords

Decision making
Deep reinforcement learning
Hyper-heuristic
Portfolio optimization
Uncertainty

ASJC Scopus subject areas

Business and International Management
Applied Psychology
Management of Technology and Innovation

Access to Document

10.1016/j.techfore.2023.122944

Cite this

@article{033dc477815240afaa356be8adea511f,

title = "Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach",

abstract = "Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors{\textquoteright} appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data-driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.",

keywords = "Decision making, Deep reinforcement learning, Hyper-heuristic, Portfolio optimization, Uncertainty",

author = "Tianxiang Cui and Nanjiang Du and Xiaoying Yang and Shusheng Ding",

note = "Publisher Copyright: {\textcopyright} 2023 The Author(s)",

year = "2024",

month = jan,

doi = "10.1016/j.techfore.2023.122944",

language = "English",

volume = "198",

journal = "Technological Forecasting and Social Change",

issn = "0040-1625",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

AU - Cui, Tianxiang

AU - Du, Nanjiang

AU - Yang, Xiaoying

AU - Ding, Shusheng

PY - 2024/1

Y1 - 2024/1

N2 - Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors’ appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data-driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.

AB - Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors’ appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data-driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.

KW - Decision making

KW - Deep reinforcement learning

KW - Hyper-heuristic

KW - Portfolio optimization

KW - Uncertainty

UR - http://www.scopus.com/inward/record.url?scp=85175582739&partnerID=8YFLogxK

U2 - 10.1016/j.techfore.2023.122944

DO - 10.1016/j.techfore.2023.122944

M3 - Article

AN - SCOPUS:85175582739

SN - 0040-1625

VL - 198

JO - Technological Forecasting and Social Change

JF - Technological Forecasting and Social Change

M1 - 122944

ER -

Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this