Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Holmes project provides alarm correlation and analysis for Telecom cloud infrastructure and services, including hosts, vims, VNFs and NSs. Holmes aims to find the real reason which causes the failure or degradation of services by digging into the ocean of events collected from different levels of the Telecom cloud.
  • Holmes provides a docker based fault correlation analysis system with APIs which could be called by external systems.


Info
titleDifferences between Policy and Holmes
  • The business scope of Holmes is different from that of Policy

Both Holmes and Policy adopt Drools as the rules engine. The main difference between these two projects is that Holmes is mainly targeted at correlation analysis between different alarms while Policy is aimed to implement control loops by triggering a series of actions. Briefly speaking, Holmes is targeted at root cause analysis but policy is aimed for auto-healing/auto-scaling.

  • Holmes is necessary for reducing the pressure caused by the large alarm quantity for Policy

Policy does not need to face the original alarms directly with the help of Holmes. The root cause is picked out from all the original alarms by Holmes and then, the most suitable policy ID is selected and published accordingly. In this way, Policy is liberated from triggering similar or duplicated actions which are caused by the alarms with internal relations.

For example, if there are 3 events A, B and C which could lead to a power down fault, and B and C are caused by A. Without Holmes, all of these 3 events will be sent to Policy and 3 corresponding actions are going to be triggered. After we add Holmes to the close loop controller and make it the upstream system of Policy, only Event A will be sent to Policy and thus only one action will be triggered, which makes the close loop control more precise and efficient.

A Scenario without Holmes

A Scenario with Holmes


...

  • Holmes is an application that processes events published by managed resources or other applications that detect specific conditions. Based on defined root cause analysis rules about the network, an application of this kind would determine root cause for various conditions and notify other interested applications.

  • Holmes is designed in compliance with the VES standard. It can take event data from DMaaP, consume them and then send the correlation result back to DMaaP in the form of VES structure. Any other projects subscribe to the corresponding topic could fetch and use the result.

  • Holmes consists of a rule designer and a correlation engine.

  • Real-time Analytic application get the analysis result, which is the root cause can be used to drive the policy run-time for automation operation. 

  • Correlation Engine receives original alarms from monitor and outputs the result back to monitor after analysis based on rules and the resources relationships from A&AI.

  • The Rule Designer provides a user-friendly GUI to design the Correlation rules. The GUI can be either run in standalone mode or integrated into other projects if needed.

Stretch Goal

As as stretch goal, Holmes provides docker(s) to be integrated with DCAE as an analytic application. Actual deployment options are decided by the user/use cases.

Resources:


RoleNameGerrit IDCompany
Email
TimeZone
Primary ContactGuangrong Fu
ZTEBeijing, China. UTC +8
CommitersGuangrong Fu




Peng Tang
ZTEtang.peng45@zte.com.cnBeijing, China. UTC +8
ContributorsJiaqiang Du
ZTEdu.jiaqiang@zte.com.cnBeijing, China. UTC +8

Yi Li
ZTEli.yi101@zte.com.cnBeijing, China. UTC +8

Youbo Wu
ZTE

wu.youbo@zte.com.cn

Beijing, China. UTC +8

Liang Feng
ZTEfeng.liang1@zte.com.cnBeijing, China. UTC +8

Yuan Liu
China Mobileliuyuanyjy@chinamobile.comBeijing, China. UTC +8

Chengli Wang

China Moblile

wangchengli@chinamobile.comBeijing, China. UTC +8

Xin(Saw) Jin
Huaweisaw.jin@huawei.comBeijing, China. UTC +8

...