TY - GEN
T1 - Knowledge-based Intelligent System for IT Incident DevOps
AU - Ahmed, Salman
AU - Singh, Muskaan
AU - Doherty, Brendan
AU - Ramlan, Effirul
AU - Harkin, Kathryn
AU - Bucholc, Magda
AU - Coyle, Damien
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The automation of IT incident management (i.e., handling of any unusual events that hamper the quality of IT services) is a main focus in Artificial Intelligence for IT Operations (AIOPS). The success and reputation of large-scale firms depend on their customer service and helpdesk system. These systems tend to handle client requests and track customer service agent interactions. In this research, we present a complete knowledge-based system that automates two core components of IT incident service management (ITSM): (1) Ticket Assignment Group(TAG) and (2) Incident Resolution (IR). Our proposed system bypasses the 4 core steps of the traditional ITSM process, including data investigation, event correlation, situation room collaboration, and probable root cause. It provides immediate solutions that can save companies key performance indicator(KPIs) resources and reduce the mean time to resolution (MTTR). The experiment used an industrial, real-time ITSM dataset from a prominent IT organization comprising 500,000 real-time incident descriptions with encoded labels. Furthermore, our systems are then evaluated with an open-source dataset. Compared to the existing benchmark methodologies, there is a 5 % improvement in terms of Accuracy score. The study demonstrates AI automation capabilities in incident handling (TAG and IR) for large real- world IT systems.
AB - The automation of IT incident management (i.e., handling of any unusual events that hamper the quality of IT services) is a main focus in Artificial Intelligence for IT Operations (AIOPS). The success and reputation of large-scale firms depend on their customer service and helpdesk system. These systems tend to handle client requests and track customer service agent interactions. In this research, we present a complete knowledge-based system that automates two core components of IT incident service management (ITSM): (1) Ticket Assignment Group(TAG) and (2) Incident Resolution (IR). Our proposed system bypasses the 4 core steps of the traditional ITSM process, including data investigation, event correlation, situation room collaboration, and probable root cause. It provides immediate solutions that can save companies key performance indicator(KPIs) resources and reduce the mean time to resolution (MTTR). The experiment used an industrial, real-time ITSM dataset from a prominent IT organization comprising 500,000 real-time incident descriptions with encoded labels. Furthermore, our systems are then evaluated with an open-source dataset. Compared to the existing benchmark methodologies, there is a 5 % improvement in terms of Accuracy score. The study demonstrates AI automation capabilities in incident handling (TAG and IR) for large real- world IT systems.
KW - Artificial Intelligence for IT Operations (AIOPS)
KW - Assignment Group
KW - Dataset Imbalance
KW - Information Technology Infrastructure Library (ITIL)
KW - IT Incidents
KW - IT Service Management (ITSM)
KW - Risk prediction
KW - Text Resolution
UR - http://www.scopus.com/inward/record.url?scp=85170821436&partnerID=8YFLogxK
U2 - 10.1109/AIOps59134.2023.00005
DO - 10.1109/AIOps59134.2023.00005
M3 - Conference Publication
AN - SCOPUS:85170821436
T3 - Proceedings - 2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023
SP - 1
EP - 7
BT - Proceedings - 2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023
Y2 - 15 May 2023
ER -