TY - GEN
T1 - Towards Multi-System Log Anomaly Detection
AU - Wang, Boyang
AU - Zang, Runqiang
AU - Guo, Hongcheng
AU - Zhang, Shun
AU - Cao, Shaosheng
AU - Di, Donglin
AU - Li, Zhoujun
N1 - Publisher Copyright:
©2025 Association for Computational Linguistics.
PY - 2025
Y1 - 2025
N2 - Despite advances in unsupervised log anomaly detection, current models require dataset-specific training, causing costly procedures, limited scalability, and performance bottlenecks. Furthermore, numerous models lack cognitive reasoning abilities, limiting their transferability to similar systems. Additionally, these models often encounter the "identical shortcut" predicament, erroneously predicting normal classes when confronted with rare anomaly logs due to reconstruction errors. To address these issues, we propose MLAD, a novel Multi-system Log Anomaly Detection model incorporating semantic relational reasoning. Specifically, we extract cross-system semantic patterns and encode them as high-dimensional learnable vectors. Subsequently, we revamp attention formulas to discern keyword significance and model the overall distribution through vector space diffusion. Lastly, we employ a Gaussian mixture model to highlight rare word uncertainty, optimizing the vector space with maximum expectation.
AB - Despite advances in unsupervised log anomaly detection, current models require dataset-specific training, causing costly procedures, limited scalability, and performance bottlenecks. Furthermore, numerous models lack cognitive reasoning abilities, limiting their transferability to similar systems. Additionally, these models often encounter the "identical shortcut" predicament, erroneously predicting normal classes when confronted with rare anomaly logs due to reconstruction errors. To address these issues, we propose MLAD, a novel Multi-system Log Anomaly Detection model incorporating semantic relational reasoning. Specifically, we extract cross-system semantic patterns and encode them as high-dimensional learnable vectors. Subsequently, we revamp attention formulas to discern keyword significance and model the overall distribution through vector space diffusion. Lastly, we employ a Gaussian mixture model to highlight rare word uncertainty, optimizing the vector space with maximum expectation.
UR - https://www.scopus.com/pages/publications/105020388138
U2 - 10.18653/v1/2025.acl-industry.8
DO - 10.18653/v1/2025.acl-industry.8
M3 - 会议稿件
AN - SCOPUS:105020388138
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 83
EP - 91
BT - Industry Track
A2 - Rehm, Georg
A2 - Li, Yunyao
PB - Association for Computational Linguistics (ACL)
T2 - 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025
Y2 - 27 July 2025 through 1 August 2025
ER -