跳到主要导航 跳到搜索 跳到主要内容

Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach

  • Aowen Wang
  • , Zhiwang Zhang
  • , Dongang Wang
  • , Fanyi Wang
  • , Haotian Hu
  • , Jinyang Guo
  • , Yipeng Zhou
  • , Chaoyi Pang
  • , Shiting Wen*
  • *此作品的通讯作者
  • Zhejiang University
  • Zhejiang University Ningbo Institute of Technology
  • The University of Sydney
  • LTD.
  • Macquarie University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The scarcity of data in the medical field brings challenges to collaborative training in medical vision-language pre-training (VLP) across different clients Thus, collaborative training in medical VLP faces two significant challenges: First, the medical data requires privacy and therefore cannot be directly shared across different clients. Second, medical data distribution across institutes is typically heterogeneous, hindering local model alignment and representation capabilities. To simultaneously overcome these two challenges, we propose a framework called personalized model selector with fused multimodal information (PMS-FM). The contribution of PMS-FM is two-fold: 1) PMS-FM uses embeddings to represent information in different formats, allowing for the fusion of multimodal data. 2) PMS-FM adapts to personalized data distributions by training multiple models. A model selector then identifies and selects the best-performing model for each individual client. Extensive experiments with multiple real-world medical datasets demonstrate the superb performance of PMS-FM over existing federated learning methods on different zero-shot classification tasks.

源语言英语
主期刊名Special Track on AI Alignment
编辑Toby Walsh, Julie Shah, Zico Kolter
出版商Association for the Advancement of Artificial Intelligence
7500-7508
页数9
版本7
ISBN(电子版)157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978
DOI
出版状态已出版 - 11 4月 2025
活动39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 - Philadelphia, 美国
期限: 25 2月 20254 3月 2025

出版系列

姓名Proceedings of the AAAI Conference on Artificial Intelligence
编号7
39
ISSN(印刷版)2159-5399
ISSN(电子版)2374-3468

会议

会议39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
国家/地区美国
Philadelphia
时期25/02/254/03/25

指纹

探究 'Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach' 的科研主题。它们共同构成独一无二的指纹。

引用此