-
Performance Diagnosis in Cloud Microservices using Deep Learning
Li Wu, Jasmin Bogatinovski, Sasho Nedelkoski, Johan Tordsson and Odej Kao
-
Anomaly Detection at Scale: The Case for Deep Distributional Time Series Models
Fadhel Ayed, Lorenzo Stella, Tim Januschowski and Jan Gasthaus
-
Localization of operational faults in cloud applications by mining causal dependencies in logs using Golden Signals
Pooja Aggarwal, Ajay Gupta, Prateeti Mohapatra, Seema Nagar, Atri Mandal, Qing Wang and Amitkumar M Paradkar
-
Using Language Models to Pre-train Features for Optimizing Information Technology Operations Management Tasks
Xiaotong Liu, Yingbei Tong, Anbang Xu and Rama Akkiraju
-
Towards Runtime Verification via Event Stream Processing in Cloud Computing Infrastructures
Domenico Cotroneo, Luigi De Simone, Pietro Liguori, Roberto Natella and Angela Scibelli
-
Decentralized Federated Learning Preserves Model and Data Privacy
Thorsten Wittkopp and Alexander Acker
-
Online Memory Leak Detection in the Cloud-based Infrastructures
Anshul Jindal, Paul Staab, Jorge Cardoso, Michael Gerndt and Vladimir Podolskiy
-
TELESTO: A Graph Neural Network Model for Anomaly Classification in Cloud Services
Dominik Scheinert and Alexander Acker
-
Discovering Alarm Correlation Rules for Network Fault Management
Philippe Fournier-Viger, Ganghuan He, Min Zhou, Mourad Nouioua and Jiahong Liu
-
SLMAD: Statistical Learning Based Metric Anomaly Detection
Arsalan Shahid, Gary White, Jaroslaw Diuwe, Alexandros Agapitos and Owen O'Brien
-
An Influence-based Approach for Root Cause Alarm Discovery in Telecom Networks
Keli Zhang, Marcus Kalander, Min Zhou, Xi Zhang and Junjian Ye
-
Multi-Source Anomaly Detection in Distributed IT Systems
Jasmin Bogatinovski, Sasho Nedelkoski
-
A Systematic Mapping Study in AIOps
Paolo Notaro, Jorge Cardoso and Michael Gerndt
-
Resource Sharing in Public Cloud System with Evolutionary Multi-agent Artificial Swarm Intelligence
Beiran Chen, Yi Zhang and George Iosifidis
-
Software Reliability Engineering for Resilient Cloud Operations
Michael R. Lyu and Yuxin Su