Two-Step Deep Reinforcement Q-Learning based Relay Selection in Cooperative WPCNs

Gulnur Tolebi, Theodoros A. Tsiftsis, Galymzhan Nauryzbayev

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

3 Citations (Scopus)

Abstract

In this paper, we propose an intelligent relay selection scheme that employs deep reinforcement learning for a wireless powered cooperative network. We formulate the problem as a Markov decision process with unknown transition probabilities between states and therefore propose a model-free, off-policy relay selection model. The model is deployed using a deep Q-network with an updated relay selection process. Using channel characteristics, we identify inaccessible nodes, form a pool of relays available for transmission, and encourage the neural network to choose from that pool. In addition, we propose a novel reward policy for training the model that is based on the energy levels stored at the relays and encourages the system to expend energy. We numerically quantify the network performance in terms of outage probability and energy outage probability and compare the results with basic Q-learning.
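
The two-step idea described in the abstract (first restrict the choice to relays that are available for transmission, then pick among them with a Q-learning policy) can be illustrated with a minimal sketch. This is not the authors' deep Q-network: it is a simplified tabular stand-in, and the function names, the availability mask, the reward value, and the hyperparameters `alpha`, `gamma`, and `epsilon` are all hypothetical choices made for illustration.

```python
import random


def select_relay(q_values, available, epsilon, rng):
    """Epsilon-greedy choice restricted to the pool of available relays.

    `available` is a hypothetical boolean mask; in the paper the pool is
    derived from channel characteristics, which are not modelled here.
    """
    pool = [i for i, ok in enumerate(available) if ok]
    if not pool:
        return None  # no relay can transmit in this time slot
    if rng.random() < epsilon:
        return rng.choice(pool)  # explore within the pool only
    return max(pool, key=lambda i: q_values[i])  # exploit best pooled relay


def q_update(q_old, reward, max_next_q, alpha=0.1, gamma=0.9):
    """Standard one-step off-policy Q-learning update."""
    return q_old + alpha * (reward + gamma * max_next_q - q_old)


rng = random.Random(0)
q_values = [0.2, 0.9, 0.5]
available = [True, False, True]  # relay 1 is assumed inaccessible

# Greedy selection skips relay 1 even though its Q-value is the highest.
choice = select_relay(q_values, available, epsilon=0.0, rng=rng)

# Update the chosen relay's value with an assumed reward of 1.0.
updated = q_update(q_old=q_values[choice], reward=1.0, max_next_q=0.5)
```

In this sketch the mask removes relay 1 from consideration before the policy acts, so the agent never spends a selection on a node that cannot transmit. A deep Q-network would replace the `q_values` list with the output of a neural network evaluated on the current state.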

Original language: English
Title of host publication: 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350339109
Publication status: Published - 2023
Externally published: Yes
Event: 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023 - Istanbul, Turkey
Duration: 5 Jun 2023 → 8 Jun 2023

Publication series

Name: 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023

Conference

Conference: 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023
Country/Territory: Turkey
City: Istanbul
Period: 5/06/23 → 8/06/23

Keywords

  • outage probability (OP)
  • Q-learning
  • reinforcement learning (RL)
  • Relay selection
  • wireless powered communication network (WPCN)

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems
  • Safety, Risk, Reliability and Quality
  • Instrumentation
