Joint Beam Training and Data Transmission Control for mmWave Delay-Sensitive Communications: A Parallel Reinforcement Learning Approach

  • Wanlu Lei
  • , Deyou Zhang*
  • , Yu Ye
  • , Chenguang Lu
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Future communication networks call for new solutions to support their capacity and delay demands by leveraging potentials of the millimeter wave (mmWave) frequency band. However, the beam training procedure in mmWave systems incurs significant overhead as well as huge energy consumption. As such, deriving an adaptive control policy is beneficial to both delay-sensitive and energy-efficient data transmission over mmWave networks. To this end, we investigate the problem of joint beam training and data transmission control for mmWave delay-sensitive communications in this paper. Specifically, the considered problem is firstly formulated as a constrained Markov Decision Process (MDP), which aims to minimize the cumulative energy consumption over the whole considered period of time under delay constraint. By introducing a Lagrange multiplier, we transform the constrained MDP into an unconstrained one, which is then solved via a parallel-rollout-based reinforcement learning method in a data-driven manner. Our numerical results demonstrate that the optimized policy via parallel-rollout significantly outperforms other baseline policies in both energy consumption and delay performance.

Original languageEnglish
Pages (from-to)447-459
Number of pages13
JournalIEEE Journal on Selected Topics in Signal Processing
Volume16
Issue number3
DOIs
StatePublished - 1 Apr 2022
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Beam training
  • Markov decision process
  • data-driven
  • delay-sensitive
  • millimeter wave
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Joint Beam Training and Data Transmission Control for mmWave Delay-Sensitive Communications: A Parallel Reinforcement Learning Approach'. Together they form a unique fingerprint.

Cite this