Skip to main navigation Skip to search Skip to main content

OEMLLM: Ophthalmology Expert MLLM for Various Fundus Disease Assisted Diagnosis

  • Junfeng Sun
  • , Yunchao Gu*
  • , Xinliang Wang
  • *Corresponding author for this work
  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

While Multimodal Large Language Models (MLLMs) have demonstrated remarkable performance in general-domain tasks, their application in specialized medical fields, such as the assisted diagnosis of fundus diseases, remains limited due to the lack of domain-specific knowledge. To bridge this gap, we introduce the FUNDUS-BENCH dataset, a multi-task benchmark tailored for fundus images. Based on the FUNDUSBENCH dataset, a multimodal medical auxiliary diagnosis system, Ophthalmology Expert MLLM (OEMLLM) is designed, which is an innovative system that leverages a hierarchical feature extraction method based on Vision Transformer to fully utilize both low-level lesion features and high-level semantic features from fundus images. OEMLLM further integrates with a Large Language Model (LLM) to perform multi-task learning for comprehensive fundus disease diagnosis. Extensive experiments show that OEMLLM outperforms state-of-the-art MLLMs with comparable parameter scales (approximately 2B parameters) and maintains competitive performance against larger-scale models. The dataset and code associated with this system will be open-sourced shortly, aiming to facilitate research and development of practical AI-assisted diagnostic tools in medical applications.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 37th International Conference on Tools with Artificial Intelligence, ICTAI 2025
PublisherIEEE Computer Society
Pages1229-1236
Number of pages8
ISBN (Electronic)9798331549190
DOIs
StatePublished - 2025
Event37th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2025 - Athens, Greece
Duration: 3 Nov 20255 Nov 2025

Publication series

NameProceedings - International Conference on Tools with Artificial Intelligence, ICTAI
ISSN (Print)1082-3409

Conference

Conference37th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2025
Country/TerritoryGreece
CityAthens
Period3/11/255/11/25

Keywords

  • Fundus Disease Assisted Diagnosis
  • Multi-Task Learning
  • Multimodal Large Language Model

Fingerprint

Dive into the research topics of 'OEMLLM: Ophthalmology Expert MLLM for Various Fundus Disease Assisted Diagnosis'. Together they form a unique fingerprint.

Cite this