Skip to main navigation Skip to search Skip to main content

Split Learning Over Wireless Networks: Parallel Design and Resource Management

  • Wen Wu
  • , Mushu Li
  • , Kaige Qu*
  • , Conghao Zhou
  • , Xuemin Shen
  • , Weihua Zhuang
  • , Xu Li
  • , Weisen Shi
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Split learning (SL) is a collaborative learning framework, which can train an artificial intelligence (AI) model between a device and an edge server by splitting the AI model into a device-side model and a server-side model at a cut layer. The existing SL approach conducts the training process sequentially across devices, which incurs significant training latency especially when the number of devices is large. In this paper, we design a novel SL scheme to reduce the training latency, named Cluster-based Parallel SL (CPSL) which conducts model training in a 'first-parallel-then-sequential' manner. Specifically, the CPSL is to partition devices into several clusters, parallelly train device-side models in each cluster and aggregate them, and then sequentially train the whole AI model across clusters, thereby parallelizing the training process and reducing training latency. Furthermore, we propose a resource management algorithm to minimize the training latency of CPSL considering device heterogeneity and network dynamics in wireless networks. This is achieved by stochastically optimizing the cut layer selection, device clustering, and radio spectrum allocation. The proposed two-timescale algorithm can jointly make the cut layer selection decision in a large timescale and device clustering and radio spectrum allocation decisions in a small timescale. Extensive simulation results on non-independent and identically distributed data demonstrate that the proposed solution can greatly reduce the training latency as compared with the existing SL benchmarks, while adapting to network dynamics.

Original languageEnglish
Pages (from-to)1051-1066
Number of pages16
JournalIEEE Journal on Selected Areas in Communications
Volume41
Issue number4
DOIs
StatePublished - 1 Apr 2023
Externally publishedYes

Keywords

  • Split learning
  • device clustering
  • parallel model training
  • resource management

Fingerprint

Dive into the research topics of 'Split Learning Over Wireless Networks: Parallel Design and Resource Management'. Together they form a unique fingerprint.

Cite this