TY - GEN
T1 - Two-Step Deep Reinforcement Q-Learning based Relay Selection in Cooperative WPCNs
AU - Tolebi, Gulnur
AU - Tsiftsis, Theodoros A.
AU - Nauryzbayev, Galymzhan
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In this paper, we propose an intelligent relay selection scheme employing deep reinforcement learning for a wireless powered cooperative network. We formulate the given problem as a Markov decision process with an unknown transitional probability between states. Therefore, a model-free off-policy relay selection model is proposed. The given model was deployed using a deep Q-network, with an updated relay selection process. Using channel characteristics, we find inaccessible nodes to form a pool of relays available for transmission and encourage the neural network to choose them. In addition, we propose a novel reward policy to train the model that is based on stored energy levels on the relays and promotes the system to expend energy. We numerically quantity the network performance in terms of outage probability and energy outage probability and compare them with the basic Q-learning.
AB - In this paper, we propose an intelligent relay selection scheme employing deep reinforcement learning for a wireless powered cooperative network. We formulate the given problem as a Markov decision process with an unknown transitional probability between states. Therefore, a model-free off-policy relay selection model is proposed. The given model was deployed using a deep Q-network, with an updated relay selection process. Using channel characteristics, we find inaccessible nodes to form a pool of relays available for transmission and encourage the neural network to choose them. In addition, we propose a novel reward policy to train the model that is based on stored energy levels on the relays and promotes the system to expend energy. We numerically quantity the network performance in terms of outage probability and energy outage probability and compare them with the basic Q-learning.
KW - outage probability (OP)
KW - Q-learning
KW - reinforcement learning (RL)
KW - Relay selection
KW - wireless powered communication network (WPCN)
UR - https://www.scopus.com/pages/publications/85165637945
U2 - 10.1109/BalkanCom58402.2023.10167871
DO - 10.1109/BalkanCom58402.2023.10167871
M3 - Conference contribution
AN - SCOPUS:85165637945
T3 - 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023
BT - 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023
Y2 - 5 June 2023 through 8 June 2023
ER -