TY - JOUR
T1 - LogSay
T2 - An Efficient Comprehension System for Log Numerical Reasoning
AU - Qi, Jiaxing
AU - Luan, Zhongzhi
AU - Huang, Shaohan
AU - Fung, Carol
AU - Yang, Hailong
N1 - Publisher Copyright:
© 1968-2012 IEEE.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - With the growth of smart systems and applications, high volume logs are generated that record important data for system maintenance. System developers are usually required to analyze logs to track the status of the system or applications. Therefore, it is essential to find the answers in large-scale logs when they have some questions. In this work, we design a multi-step 'Retriever-Reader' question-answering system, namely LogSay, which aims at predicting answers accurately and efficiently. Our system can not only answers simple questions, such as a segment log or span, but also can answer complex logical questions through numerical reasoning. LogSay has two key components: Log Retriever and Log Reasoner, and we designed five operators to implement them. Log Retriever aims at retrieving some relevant logs based on a question. Then, Log Reasoner performs numerical reasoning to infer the final answer. In addition, due to the lack of available question-answering datasets for system logs, we constructed question-answering datasets based on three public log datasets and will make them publicly available. Our evaluation results show that LogSay outperforms the state-of-the-art works in terms of accuracy and efficiency.
AB - With the growth of smart systems and applications, high volume logs are generated that record important data for system maintenance. System developers are usually required to analyze logs to track the status of the system or applications. Therefore, it is essential to find the answers in large-scale logs when they have some questions. In this work, we design a multi-step 'Retriever-Reader' question-answering system, namely LogSay, which aims at predicting answers accurately and efficiently. Our system can not only answers simple questions, such as a segment log or span, but also can answer complex logical questions through numerical reasoning. LogSay has two key components: Log Retriever and Log Reasoner, and we designed five operators to implement them. Log Retriever aims at retrieving some relevant logs based on a question. Then, Log Reasoner performs numerical reasoning to infer the final answer. In addition, due to the lack of available question-answering datasets for system logs, we constructed question-answering datasets based on three public log datasets and will make them publicly available. Our evaluation results show that LogSay outperforms the state-of-the-art works in terms of accuracy and efficiency.
KW - Log data analysis
KW - numerical reasoning
KW - question answering
KW - retriever-reader
UR - https://www.scopus.com/pages/publications/85190170652
U2 - 10.1109/TC.2024.3386068
DO - 10.1109/TC.2024.3386068
M3 - 文章
AN - SCOPUS:85190170652
SN - 0018-9340
VL - 73
SP - 1809
EP - 1821
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 7
ER -