Multi-source Distributed System Data for AI-Powered Analytics - Service-Oriented and Cloud Computing
Conference Papers Year : 2020

Multi-source Distributed System Data for AI-Powered Analytics

Abstract

The emerging field of Artificial Intelligence for IT Operations (AIOps) utilizes monitoring data, big data platforms, and machine learning, to automate operations and maintenance (O&M) tasks in complex IT systems. The available research data usually contain only a single source of information, often logs or metrics. The inability of the single-source data to describe precise state of the distributed systems leads to methods that fail to make effective use of the joint information, thus, producing large number of false predictions. Therefore, current data limits the possibilities for greater advances in AIOps research. To overcome these constraints, we created a complex distributed system testbed, which generates multi-source data composed of distributed traces, application logs, and metrics. This paper provides detailed descriptions of the infrastructure, testbed, experiments, and statistics of the generated data. Furthermore, it identifies how such data can be utilized as a stepping stone for the development of novel methods for O&M tasks such as anomaly detection, root cause analysis, and remediation.The data from the testbed and its code is available at https://zenodo.org/record/3549604 .
Fichier principal
Vignette du fichier
493832_1_En_13_Chapter.pdf (277.27 Ko) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-03203288 , version 1 (20-04-2021)

Licence

Identifiers

Cite

Sasho Nedelkoski, Jasmin Bogatinovski, Ajay Kumar Mandapati, Soeren Becker, Jorge Cardoso, et al.. Multi-source Distributed System Data for AI-Powered Analytics. 8th European Conference on Service-Oriented and Cloud Computing (ESOCC), Sep 2020, Heraklion, Crete, Greece. pp.161-176, ⟨10.1007/978-3-030-44769-4_13⟩. ⟨hal-03203288⟩
154 View
296 Download

Altmetric

Share

More