TY - GEN
T1 - Toward Obstacle Avoidance for Mobile Robots Using Deep Reinforcement Learning Algorithm
AU - Gao, Xiaoshan
AU - Yan, Liang
AU - Wang, Gang
AU - Wang, Tiantian
AU - Du, Nannan
AU - Gerada, Chris
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/8/1
Y1 - 2021/8/1
N2 - The state-of-the-art deep reinforcement learning algorithm, i.e., the deep deterministic policy gradient (DDPG), has achieved good performance in continuous control problems in robotics. However, the conventional experience replay mechanism of the DDPG algorithm stores the experience explored by the mobile robot in the buffer pool and trains the neural network through random sampling, without considering whether a transition is valuable, which can degrade the network performance. To overcome this limitation, a DDPG framework with separated experience replay is developed for mobile robot collision-free navigation in this study, replaying the valuable and the failed experience transitions separately. Additionally, an environment state vector including the mobile robot and obstacles is designed, and the reward function and action space are also designed. The simulation results show that the proposed model possesses the collision-free navigation capacity to deal with multiple obstacles.
AB - The state-of-the-art deep reinforcement learning algorithm, i.e., the deep deterministic policy gradient (DDPG), has achieved good performance in continuous control problems in robotics. However, the conventional experience replay mechanism of the DDPG algorithm stores the experience explored by the mobile robot in the buffer pool and trains the neural network through random sampling, without considering whether a transition is valuable, which can degrade the network performance. To overcome this limitation, a DDPG framework with separated experience replay is developed for mobile robot collision-free navigation in this study, replaying the valuable and the failed experience transitions separately. Additionally, an environment state vector including the mobile robot and obstacles is designed, and the reward function and action space are also designed. The simulation results show that the proposed model possesses the collision-free navigation capacity to deal with multiple obstacles.
KW - deep deterministic policy gradient
KW - mobile robot
KW - obstacle avoidance
UR - http://www.scopus.com/inward/record.url?scp=85115445208&partnerID=8YFLogxK
U2 - 10.1109/ICIEA51954.2021.9516114
DO - 10.1109/ICIEA51954.2021.9516114
M3 - Conference contribution
AN - SCOPUS:85115445208
T3 - Proceedings of the 16th IEEE Conference on Industrial Electronics and Applications, ICIEA 2021
SP - 2136
EP - 2139
BT - Proceedings of the 16th IEEE Conference on Industrial Electronics and Applications, ICIEA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th IEEE Conference on Industrial Electronics and Applications, ICIEA 2021
Y2 - 1 August 2021 through 4 August 2021
ER -