论文标题
碳到钻石:现场可靠性工程师在混合云操作中的对话中的事件补救助理系统
Carbon to Diamond: An Incident Remediation Assistant System From Site Reliability Engineers' Conversations in Hybrid Cloud Operations
论文作者
论文摘要
会话渠道正在改变混合云服务管理的景观。这些渠道已成为现场可靠性工程师(SRES)%主题专家(SME)的重要途径,以协作共同努力解决事件或问题。通过使用信息检索机制,从类似事件中使用信息检索机制来确定分段的对话并从中提取关键见解或人工制品可以帮助工程师提高事件修复过程的效率。但是,从经验上观察到,由于这种对话的半正式行为(人类语言),它们本质上非常独特,还包含许多特定领域的术语。这使得很难直接使用标准的自然语言处理框架,该框架通常用于标准NLP任务。 %重要的是要确定对话聊天中存在的正确关键字和伪像,例如症状,问题等。在本文中,我们构建了一个框架,该框架可以利用对话渠道,并使用各种学习方法来理解和从采取的诊断步骤和解决方案措施等对话中理解和提取关键的人工制品,以及(b)提出一种方法来识别过去关于类似问题的对话的方法。数据集上的实验结果显示了我们提出的方法的功效。
Conversational channels are changing the landscape of hybrid cloud service management. These channels are becoming important avenues for Site Reliability Engineers (SREs) %Subject Matter Experts (SME) to collaboratively work together to resolve an incident or issue. Identifying segmented conversations and extracting key insights or artefacts from them can help engineers to improve the efficiency of the incident remediation process by using information retrieval mechanisms for similar incidents. However, it has been empirically observed that due to the semi-formal behavior of such conversations (human language) they are very unique in nature and also contain lot of domain-specific terms. This makes it difficult to use the standard natural language processing frameworks directly, which are popularly used in standard NLP tasks. %It is important to identify the correct keywords and artefacts like symptoms, issue etc., present in the conversation chats. In this paper, we build a framework that taps into the conversational channels and uses various learning methods to (a) understand and extract key artefacts from conversations like diagnostic steps and resolution actions taken, and (b) present an approach to identify past conversations about similar issues. Experimental results on our dataset show the efficacy of our proposed method.