Multimodal Cloud-Edge Collaborative Optimization with Bidirectional Knowledge Flow

  • Yangyifei Luo
  • , Zhenying Tai*
  • , Shiqi Gao
  • , Feihong Lu
  • , Haoyi Zhou
  • , Qingyun Sun
  • , Qinghe Ye
  • , Yue Wang
  • , Chunpeng Wu
  • , Fei Zhou
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rapid evolution of smart grid technology, real-time monitoring and analysis of power system operation status face challenges in multimodal data processing in key scenarios such as transformer defect detection. Aiming at the dual technical bottlenecks of limited computing resources and cross-modal feature differences in existing cloud-edge collaborative analysis systems, this study proposes a heterogeneous cloud-edge collaborative multimodal analysis framework, which is innovative in dual-tower architecture design and bidirectional knowledge flow collaboration mechanism. In the cloud model, by extending the CLIP-ViT-L/14 visual encoder, integrating the dilated convolution module and the Transformer-GRU hybrid temporal modeling unit, fine-grained feature extraction and global temporal dependency capture of power equipment images are achieved; on the edge side, a lightweight MobileViT network and a temporal convolutional network (TCN) are used to adapt to the efficient reasoning requirements under the resource constraints of the edge side. Through the bidirectional knowledge flow interaction of the 'visual anchor cloud-to-edge optimization strategy' and the 'temporal consistency edge-to-cloud optimization mechanism', the co-evolution and performance complementarity of the cloud-edge model are achieved. Experiments show that the accuracy of this framework is 5.4% higher than that of traditional methods in multimodal defect detection tasks such as transformer oil chromatography and partial discharge, providing a high-precision, low-latency solution for multimodal analysis of smart grids.

Original languageEnglish
Title of host publication2025 IEEE 5th International Conference on Computer Communication and Artificial Intelligence, CCAI 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1053-1059
Number of pages7
ISBN (Electronic)9798331535674
DOIs
StatePublished - 2025
Event5th IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2025 - Hybrid, Haikou, China
Duration: 23 May 202525 May 2025

Publication series

Name2025 IEEE 5th International Conference on Computer Communication and Artificial Intelligence, CCAI 2025

Conference

Conference5th IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2025
Country/TerritoryChina
CityHybrid, Haikou
Period23/05/2525/05/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Bidirectional Knowledge Flow
  • Cloud-Edge Collaborative
  • Model Optimization
  • Multimodal

Fingerprint

Dive into the research topics of 'Multimodal Cloud-Edge Collaborative Optimization with Bidirectional Knowledge Flow'. Together they form a unique fingerprint.

Cite this