Loghub

June 13, 2025 ยท View on GitHub

Loghub

Loghub maintains a collection of system logs, which are freely accessible for AI-driven log analytics research. Some of the logs are production data released from previous studies, while some others are collected from real systems in our lab environment. Wherever possible, the logs are NOT sanitized, anonymized or modified in any way. These log datasets are freely available for research or academic work.

๐Ÿค— We proudly announce that the loghub datasets have attained total by more than 450 organizations from both industry and academia.

Logs currently available

๐Ÿ”— Get raw logs via hyperlinks in the Download column.

DatasetDescriptionLabeledTime Span#LinesRaw SizeDownload
:open_file_folder: Distributed systems
HDFS_v1Hadoop distributed file system log:heavy_check_mark:38.7 hours11,175,6291.47GiB:link:
HDFS_v2Hadoop distributed file system logN.A.71,118,07316.06GiB:link:
HDFS_v3Instrumented HDFS trace log (TraceBench):heavy_check_mark:N.A.14,778,0792.96GiB:link:
HadoopHadoop mapreduce job log:heavy_check_mark: (Check #56)N.A.394,30848.61MiB:link:
SparkSpark job logN.A.33,236,6042.75GiB:link:
ZookeeperZooKeeper service log26.7 days74,3809.95MiB:link:
OpenStackOpenStack infrastructure log:heavy_check_mark:N.A.207,82058.61MiB:link:
:open_file_folder: Super computers
BGLBlue Gene/L supercomputer log:heavy_check_mark:214.7 days4,747,963708.76MiB:link:
HPCHigh performance cluster logN.A.433,48932.00MiB:link:
ThunderbirdThunderbird supercomputer log:heavy_check_mark:244 days211,212,19229.60GiB:link:
:open_file_folder: Operating systems
WindowsWindows event log226.7 days114,608,38826.09GiB:link:
LinuxLinux system log263.9 days25,5672.25MiB:link:
MacMac OS log7.0 days117,28316.09MiB:link:
:open_file_folder: Mobile systems
Android_v1Android framework logN.A.1,555,005183.37MiB:link:
Android_v2Android framework logN.A.30,348,0423.38GiB:link:
HealthAppHealth app log10.5 days253,39522.44MiB:link:
:open_file_folder: Server applications
ApacheApache web server error log263.9 days56,4814.90MiB:link:
OpenSSHOpenSSH server log28.4 days655,14670.02MiB:link:
:open_file_folder: Standalone software
ProxifierProxifier software logN.A.21,3292.42MiB:link:

๐Ÿ”ฅ Citation

Please cite the following two papers if you use the loghub datasets in your research.

๐ŸŒˆ License

The datasets are freely available for research or academic work. For any usage or distribution of the datasets, please refer to the loghub repository URL https://github.com/logpai/loghub and cite the loghub paper where applicable.

๐Ÿ™‹ Discussion

Welcome to open a discussion here for any question and discussion.