Tools and Benchmarks for Automated Log Parsing

11/08/2018
by   Jieming Zhu, et al.
0

Logs are imperative in the development and maintenance process of many software systems. They record detailed runtime information during system operation that allows developers and support engineers to monitor their systems and dissect anomalous behaviors and errors. The increasing scale and complexity of modern software systems, however, make the volume of logs explodes, thus rendering the infeasibility of the traditional way of manual log inspection. Many recent studies, as well as industrial tools, resort to powerful text search and machine learning-based analytics solutions. Due to the unstructured nature of logs, a first crucial step is to parse log messages into structured data for subsequent analysis. In recent years, automated log parsing has been widely studied in both academia and industry, producing a series of log parsers by different techniques. To better understand the characteristics of these log parsers, in this paper, we present a comprehensive evaluation study on automated log parsing and further release the tools and benchmarks to researchers and practitioners. More specifically, we evaluate 13 log parsers on a total of 16 log datasets spanning distributed systems, supercomputers, operating systems, mobile systems, server applications, and standalone software. We report the benchmarking results in terms of accuracy, robustness, and efficiency, which are of practical importance when deploying automated log parsing in production. We also share the success stories and lessons learned in an industrial application at Huawei. We believe that our work could serve as the basis and provide valuable guidance to future research and technology transfer of automated log parsing.

READ FULL TEXT
research
01/07/2020

Logram: Efficient Log Parsing Using n-Gram Dictionaries

Software systems usually record important runtime information in their l...
research
06/02/2023

An Evaluation of Log Parsing with ChatGPT

Software logs play an essential role in ensuring the reliability and mai...
research
08/21/2023

A Large-scale Benchmark for Log Parsing

Log data is pivotal in activities like anomaly detection and failure dia...
research
08/17/2023

Log Parsing Evaluation in the Era of Modern Software Systems

Due to the complexity and size of modern software systems, the amount of...
research
08/14/2020

Loghub: A Large Collection of System Log Datasets towards Automated Log Analytics

Logs have been widely adopted in software system development and mainten...
research
08/18/2021

What Distributed Systems Say: A Study of Seven Spark Application Logs

Execution logs are a crucial medium as they record runtime information o...
research
06/12/2018

A Directed Acyclic Graph Approach to Online Log Parsing

Logs are widely used in modern software system management because they a...

Please sign up or login with your details

Forgot password? Click here to reset