Service function chaining in NFV-enabled edge networks with natural actor-critic deep reinforcement learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, by exploiting the natural policy gradient-based actor-critic framework, we study the service function chaining in network function virtualization (NFV)-enabled edge networks. First, a long-run function chaining problem is formulated to minimize the end-to-end service latency, involving not only the server and wired link resources, but also radio resource in wireless links; the Markov decision process (MDP) model is further leveraged to capture dynamics in both server and radio resources, whereby the transition probability over state space is explicitly derived. Second, a natural actor-critic framework is presented, which utilizes natural policy gradient to train the deep neural network (DNN), thereby avoiding trapped into the local optimum. In particular, to overcome the high-dimensionally issue in action space, we further resort to one integer linear programming (ILP) formulation, reducing the space size from cube to first power. Finally, simulations are conducted to demonstrate the effectiveness of proposed approach, revealing that the latency minimization could benefit from the learning in not only service function chain (SFC) routing across edge servers, but also radio resource allocation in wireless links.

Original languageEnglish
Title of host publication2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1095-1100
Number of pages6
ISBN (Electronic)9781665443852
DOIs
StatePublished - 28 Jul 2021
Externally publishedYes
Event2021 IEEE/CIC International Conference on Communications in China, ICCC 2021 - Xiamen, China
Duration: 28 Jul 202130 Jul 2021

Publication series

Name2021 IEEE/CIC International Conference on Communications in China, ICCC 2021

Conference

Conference2021 IEEE/CIC International Conference on Communications in China, ICCC 2021
Country/TerritoryChina
CityXiamen
Period28/07/2130/07/21

Keywords

  • Actor-critic
  • Edge networks
  • Natural policy gradient
  • Radio resource
  • Server resource

Fingerprint

Dive into the research topics of 'Service function chaining in NFV-enabled edge networks with natural actor-critic deep reinforcement learning'. Together they form a unique fingerprint.

Cite this