Skip to main navigation Skip to search Skip to main content

ECLIPSE: Efficient Cross-Lingual Log Intelligence Parser with Semantic Entropy-Enhanced LCS Algorithm

  • Wei Zhang
  • , Xianfu Cheng
  • , Xiang Li
  • , Jian Yang*
  • , Liying Zhang
  • , Xiangyuan Guan
  • , Zhoujun Li
  • *Corresponding author for this work
  • Beihang University
  • Information Engineering University
  • Shenzhen Intelligent Strong Technology Co.,Ltd.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Log parsing is essential in software engineering but is challenged by the immense complexity of log templates and diverse cross-platform and cross-lingual log semantics and structures in industrial logs. We propose ECLIPSE, an Efficient Cross-platform and Cross-lingual Log Intelligent Parsing framework with Semantic Entropy-Enhanced Longest Common Subsequence algorithm in industrial Environments. ECLIPSE leverages large language models to extract log keywords and maintains a dynamic dictionary mapping these keywords to log templates. When parsing, it retrieves candidate templates based on the keywords and log length. We design an algorithm named Semantic Entropy-Enhanced Longest Common Subsequence (Entropy-ELCS) for identifying the best template, improving token-level accuracy by incorporating information entropy and semantic elements into the longest common subsequence algorithm. The dictionary is updated with new keywords and templates for continuous improvement. Experiments on public benchmarks and our industrial log parsing benchmark ECLIPSE-BENCH demonstrate that ECLIPSE achieves strong performance and superior efficiency, especially when handling large template sets.

Original languageEnglish
Title of host publicationCIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery, Inc
Pages4191-4201
Number of pages11
ISBN (Electronic)9798400720406
DOIs
StatePublished - 10 Nov 2025
Event34th ACM International Conference on Information and Knowledge Management, CIKM 2025 - Seoul, Korea, Republic of
Duration: 10 Nov 202514 Nov 2025

Publication series

NameCIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management

Conference

Conference34th ACM International Conference on Information and Knowledge Management, CIKM 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period10/11/2514/11/25

Keywords

  • ai for it operations
  • cross-platform and cross-lingual
  • industrial log parsing system
  • information entropy
  • large language model

Fingerprint

Dive into the research topics of 'ECLIPSE: Efficient Cross-Lingual Log Intelligence Parser with Semantic Entropy-Enhanced LCS Algorithm'. Together they form a unique fingerprint.

Cite this