TAMO:Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent With Multi-Modality Observation Data in Cloud-Native Systems

  • Xiao Zhang
  • , Qi Wang
  • , Mingyi Li
  • , Yuan Yuan
  • , Mengbai Xiao
  • , Fuzhen Zhuang
  • , Dongxiao Yu*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Implementing large language models (LLMs)-driven root cause analysis (RCA) in cloud-native systems has become a key topic of modern software operations and maintenance. However, existing LLM-based approaches face three key challenges: multi-modality input constraint, context window limitation, and dynamic dependence graph. To address these issues, we propose a tool-assisted LLM agent with multi-modality observation data for fine-grained RCA, namely TAMO, including multi-modality alignment tool, root cause localization tool, and fault types classification tool. In detail, TAMO unifies multi-modal observation data into time-aligned representations for cross-modal feature consistency. Based on the unified representations, TAMO then invokes its specialized root cause localization tool and fault types classification tool for further identifying root cause and fault type underlying system context. This approach overcomes the limitations of LLMs in processing real-time raw observational data and dynamic service dependencies, guiding the model to generate repair strategies that align with system context through structured prompt design. Experiments on two benchmark datasets demonstrate that TAMO outperforms state-of-the-art (SOTA) approaches with comparable performance.

Original languageEnglish
Pages (from-to)4221-4233
Number of pages13
JournalIEEE Transactions on Services Computing
Volume18
Issue number6
DOIs
StatePublished - 2025

Keywords

  • Root cause analysis
  • cloud-native systems
  • diffusion
  • multimodal data
  • tool-assisted LLM agent

Fingerprint

Dive into the research topics of 'TAMO:Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent With Multi-Modality Observation Data in Cloud-Native Systems'. Together they form a unique fingerprint.

Cite this