Abstract
With the rapid evolution of smart grid technology, real-time monitoring and analysis of power system operation status face challenges in multimodal data processing in key scenarios such as transformer defect detection. Aiming at the dual technical bottlenecks of limited computing resources and cross-modal feature differences in existing cloud-edge collaborative analysis systems, this study proposes a heterogeneous cloud-edge collaborative multimodal analysis framework, which is innovative in dual-tower architecture design and bidirectional knowledge flow collaboration mechanism. In the cloud model, by extending the CLIP-ViT-L/14 visual encoder, integrating the dilated convolution module and the Transformer-GRU hybrid temporal modeling unit, fine-grained feature extraction and global temporal dependency capture of power equipment images are achieved; on the edge side, a lightweight MobileViT network and a temporal convolutional network (TCN) are used to adapt to the efficient reasoning requirements under the resource constraints of the edge side. Through the bidirectional knowledge flow interaction of the 'visual anchor cloud-to-edge optimization strategy' and the 'temporal consistency edge-to-cloud optimization mechanism', the co-evolution and performance complementarity of the cloud-edge model are achieved. Experiments show that the accuracy of this framework is 5.4% higher than that of traditional methods in multimodal defect detection tasks such as transformer oil chromatography and partial discharge, providing a high-precision, low-latency solution for multimodal analysis of smart grids.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE 5th International Conference on Computer Communication and Artificial Intelligence, CCAI 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1053-1059 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798331535674 |
| DOIs | |
| State | Published - 2025 |
| Event | 5th IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2025 - Hybrid, Haikou, China Duration: 23 May 2025 → 25 May 2025 |
Publication series
| Name | 2025 IEEE 5th International Conference on Computer Communication and Artificial Intelligence, CCAI 2025 |
|---|
Conference
| Conference | 5th IEEE International Conference on Computer Communication and Artificial Intelligence, CCAI 2025 |
|---|---|
| Country/Territory | China |
| City | Hybrid, Haikou |
| Period | 23/05/25 → 25/05/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Bidirectional Knowledge Flow
- Cloud-Edge Collaborative
- Model Optimization
- Multimodal
Fingerprint
Dive into the research topics of 'Multimodal Cloud-Edge Collaborative Optimization with Bidirectional Knowledge Flow'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver