[go: up one dir, main page]

CN105991332A - Alarm processing method and device - Google Patents

Alarm processing method and device Download PDF

Info

Publication number
CN105991332A
CN105991332A CN201510088503.3A CN201510088503A CN105991332A CN 105991332 A CN105991332 A CN 105991332A CN 201510088503 A CN201510088503 A CN 201510088503A CN 105991332 A CN105991332 A CN 105991332A
Authority
CN
China
Prior art keywords
alarm
controller
resource
occupied
managed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510088503.3A
Other languages
Chinese (zh)
Inventor
陈俏钢
薄开涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Publication of CN105991332A publication Critical patent/CN105991332A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/40Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using virtualisation of network functions or resources, e.g. SDN or NFV entities

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

本发明公开了一种告警处理方法及装置,其中,该方法包括:控制器接收到来自网络设备的告警;该控制器对告警进行分析得出分析结果;该控制器依据分析结果对告警进行处理,通过本发明,解决了相关技术中,不仅存在告警处理不及时,而且在告警数量多的情况下,造成上层网络设备性能低下的问题,进而达到了高效及时处理告警的效果。

The invention discloses an alarm processing method and device, wherein the method comprises: a controller receives an alarm from a network device; the controller analyzes the alarm to obtain an analysis result; the controller processes the alarm according to the analysis result , through the present invention, the problem of not only untimely alarm processing in the related art, but also low performance of upper-layer network equipment caused by a large number of alarms is solved, and the effect of efficient and timely alarm processing is achieved.

Description

告警处理方法及装置Alarm processing method and device

技术领域technical field

本发明涉及通信领域,具体而言,涉及一种告警处理方法及装置。The present invention relates to the communication field, in particular to an alarm processing method and device.

背景技术Background technique

在通讯网络中,网络由许多通讯设备节点组成,这些网络节点叫做网元。网元之间通过通讯线路进行连接,包括光纤电缆等多种形式。网元则分散分布在各个地区,有的在城市里的通讯大楼实验室里,有的网元在偏远地区。然而这些网元的设备需要进行配置,维护和监控,不可能每处都派人值守,这样就需要一个中心网络管理系统,放在中心机房,通过远程通讯对网络上各个节点进行配置,维护和监控。In a communication network, the network is composed of many communication device nodes, and these network nodes are called network elements. Network elements are connected through communication lines, including various forms such as optical fiber cables. The network elements are scattered in various regions, some are in the communication building laboratories in the city, and some are in remote areas. However, the equipment of these network elements needs to be configured, maintained and monitored, and it is impossible to send people on duty everywhere. In this way, a central network management system is required, placed in the central computer room, and configured, maintained and monitored for each node on the network through remote communication. monitor.

通过控制器管理网络可以由一种新出现的网络管理控制系统来实现。在这个控制系统中,把传统网管中对业务资源的控制功能独立出来,仅仅关注业务资源。控制器可以按树型层次化组织,以便关联大规模网络。其中,直接管理网元的可以称为域控制器(Domain Controller,或者D-Controller,简称为DC);而上层的控制器(Super Controller,或者S-Controller,简称为SC)不直接管理网元,而是管理域控制器,然后再通过域控制器提供的虚拟网络管理,实现对实际网络的管理。图1是相关技术中使用控制器管控网络的组网示意图,如图1所示,在一个应用场景中,控制器形成树形管理体系,上层是SC,底层的DC划分管理域,管理通讯网络和网元。控制器除了南向和网络设备通过接口进行管理,还有北向接口可以让应用层的网络APP(Application)接入进行网络管理,还可以通过侧接口,同网管平面(包括:网元管理系统(Element Management System,简称为EMS),网络管理系统(Network Management System,简称为NMS),或者操作支撑系统(Operation Support System,简称为OSS))沟通管理信息。网络APP是网络的实际业务应用,使用控制器提供的资源,发出业务建立,删除,修改的请求。而控制器根据网络APP的请求建立,删除,修改业务,并对业务的告警,性能进行监控。图2是相关技术中软件定义网络(Software Defined Network,简称为SDN)网络中控制器与其它网络实体之间的关系示意图,如图2所示,DC直接管理通讯网络,而SC管理DC,同时还可以和传统网络管理系统进行交流,最终提供给APP提供资源和服务。Network management through controllers can be realized by a new emerging network management control system. In this control system, the control function of service resources in traditional network management is separated, and only focus on service resources. Controllers can be organized hierarchically in a tree-like fashion for associating large-scale networks. Among them, the one that directly manages network elements can be called a domain controller (Domain Controller, or D-Controller, referred to as DC); while the upper-layer controller (Super Controller, or S-Controller, referred to as SC) does not directly manage network elements. , but to manage the domain controller, and then realize the management of the actual network through the virtual network management provided by the domain controller. Figure 1 is a schematic diagram of a network that uses a controller to manage and control the network in related technologies. As shown in Figure 1, in an application scenario, the controller forms a tree management system, the upper layer is SC, and the lower layer DC divides the management domain to manage the communication network. and network elements. In addition to southbound and network device management through the interface, the controller also has a northbound interface that allows the network APP (Application) at the application layer to access the network for network management. It can also be connected to the network management plane (including: network element management system ( Element Management System (EMS for short), Network Management System (NMS for short), or Operation Support System (OSS for short)) communicate management information. The network APP is the actual business application of the network. It uses the resources provided by the controller to send requests for business establishment, deletion and modification. The controller establishes, deletes, and modifies services according to network APP requests, and monitors service alarms and performance. Figure 2 is a schematic diagram of the relationship between a controller and other network entities in a Software Defined Network (SDN) network in the related art. As shown in Figure 2, the DC directly manages the communication network, while the SC manages the DC, and at the same time It can also communicate with traditional network management systems, and finally provide resources and services to APP.

网络实际运行过程中,可能会遇到故障,或者收到干扰而通讯质量下降。一旦发送这种情况,网络设备就会报告警。由于网络设备直接的通讯是密切关联的,一个设备或者一部分资源有故障上报告警,就会引发网络大面积产生告警。产生非常多的告警量。对网络管理员来说,这么多告警要找出真正的故障原因是困难的。网络中一个故障的出现会引发一个直接反映该故障的告警,同时由于故障影响到其他设备或者业务,会引发一系列告警。例如,告警A引发了告警B,告警A是根源告警,告警B是衍生告警。During the actual operation of the network, failures may occur, or interference may be received and the communication quality may decline. Once this condition is sent, the network device will report an alarm. Since the direct communication of network devices is closely related, if a device or a part of resources fails and reports an alarm, it will cause a large area of the network to generate alarms. Generates a very high volume of alerts. For network administrators, it is difficult to find out the real fault cause with so many alarms. The occurrence of a fault in the network will trigger an alarm that directly reflects the fault, and at the same time, a series of alarms will be triggered because the fault affects other devices or services. For example, alarm A triggers alarm B, alarm A is the root alarm, and alarm B is a derived alarm.

在传统的网络中,告警都是在设备产生的,网络控制器接收到告警后直接将告警转发给处理告警的上层网络设备,因而在相关技术中,不仅存在告警处理不及时,而且在告警数量多的情况下,造成上层网络设备性能低下的问题。In a traditional network, alarms are generated by devices, and the network controller directly forwards the alarms to the upper-layer network equipment that handles the alarms after receiving the alarms. If there are too many, it will cause the problem of low performance of upper-layer network equipment.

发明内容Contents of the invention

本发明提供了一种告警处理方法及装置,以至少解决相关技术中,不仅存在告警处理不及时,而且在告警数量多的情况下,造成上层网络设备性能低下的问题。The present invention provides an alarm processing method and device to at least solve the problem in the related art that not only the alarm processing is not timely, but also the performance of the upper layer network equipment is low due to the large number of alarms.

根据本发明的一个方面,提供了一种告警处理方法,包括:控制器接收到来自网络设备的告警;所述控制器对所述告警进行分析得出分析结果;所述控制器依据所述分析结果对所述告警进行处理。According to one aspect of the present invention, an alarm processing method is provided, including: a controller receives an alarm from a network device; the controller analyzes the alarm to obtain an analysis result; the controller obtains an analysis result according to the analysis As a result, the alarm is processed.

优选地,所述控制器对所述告警进行分析得到所述分析结果包括以下至少之一:依据产生所述告警的告警原因,分析得出所述告警是否对控制器管理的业务产生影响的所述分析结果;依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系,分析得出所述告警是否对所述控制器管理的所述业务产生影响的所述分析结果。Preferably, the controller analyzes the alarm to obtain the analysis result including at least one of the following: According to the cause of the alarm that generates the alarm, analyze whether the alarm has an impact on the service managed by the controller. According to the above analysis results; according to the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller, analyze whether the alarm has an impact on the business managed by the controller. Analyze the results.

优选地,依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系,分析得出所述告警是否对所述控制器管理的所述业务产生影响的所述分析结果包括以下之一:在产生所述告警的根资源不被所述控制器管理的业务所占用,并且也没有所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务不产生影响;在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响;在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,确定所述告警对所述控制器管理的所述业务产生影响;在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响。Preferably, according to the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller, the analysis of whether the alarm has an impact on the business managed by the controller is obtained by analyzing The result includes one of the following: when the root resource generating the alarm is not occupied by the service managed by the controller, and no sub-resource corresponding to the root resource is occupied by the service managed by the controller , it is determined that the alarm has no impact on the business managed by the controller; the root resource that generates the alarm is not occupied by the business managed by the controller, but the sub-resource corresponding to the root resource is occupied by the If the service managed by the controller is occupied by the controller, determine that the alarm has an impact on the service managed by the controller; if the root resource generating the alarm is not occupied by the service managed by the controller, the The sub-resources corresponding to the root resource are not occupied by the services managed by the controller, but there is a mapping relationship between the root resource and/or the sub-resources corresponding to the root resource and the services managed by the controller , determine that the alarm has an impact on the service managed by the controller; if the root resource that generates the alarm is occupied by the service managed by the controller, determine that the alarm has an impact on the service managed by the controller impact on said business.

优选地,所述控制器依据所述分析结果对所述告警进行处理包括以下之一:在依据所述告警原因,分析得出所述告警对所述控制器管理的业务不产生影响的情况下,对所述告警进行抑制;在依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系确定所述告警对所述控制器管理的所述业务不产生影响的情况下,对所述告警进行抑制;在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,将所述告警衍生为子资源衍生告警;在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,依据所述映射关系,将所述告警衍生为关系衍生告警;在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,直接将所述告警上报给上层网络设备。Preferably, the processing of the alarm by the controller according to the analysis result includes one of the following: according to the cause of the alarm, it is analyzed that the alarm has no impact on the services managed by the controller , suppressing the alarm; determining that the alarm has no impact on the business managed by the controller according to the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller In this case, suppress the alarm; if the root resource that generates the alarm is not occupied by the business managed by the controller, but the sub-resource corresponding to the root resource is occupied by the business managed by the controller In this case, the alarm is derived as a sub-resource derived alarm; when the root resource generating the alarm is not occupied by the business managed by the controller, the sub-resource corresponding to the root resource is not managed by the controller In the case that the root resource and/or the sub-resource corresponding to the root resource has a mapping relationship with the service managed by the controller, according to the mapping relationship, the alarm is derived as a relationship derived Alarm: when the root resource generating the alarm is occupied by the service managed by the controller, directly report the alarm to the upper-layer network device.

优选地,在对所述告警进行衍生处理的情况下,还包括:记录所述告警与衍生后的衍生告警之间的关联关系,其中,所述衍生告警包括:子资源衍生告警和/或关系衍生告警;将所述衍生告警以及所述关联关系上报给上层网络设备,和/或提供所述关联关系供所述控制器管理系统中的网元设备查询。Preferably, in the case of deriving the alarm, it further includes: recording the association relationship between the alarm and the derived alarm after derivation, wherein the derived alarm includes: sub-resource derived alarm and/or relationship Derived alarms: reporting the derived alarms and the associated relationship to an upper-layer network device, and/or providing the associated relationship for query by network element devices in the controller management system.

优选地,在所述控制器依据所述分析结果对所述告警进行处理之后,还包括:依据所述告警所引起的网元设备的状态变化,或者所述告警的代表资源的对象的状态变化,生成所述网元设备或者所述代表资源的对象的状态通知;将生成的所述网元设备或者所述代表资源的对象的状态通知上报给上层网络设备。Preferably, after the controller processes the alarm according to the analysis result, it further includes: according to the state change of the network element equipment caused by the alarm, or the state change of the object representing the resource in the alarm , generating a status notification of the network element device or the resource-representing object; reporting the generated status notification of the network element device or the resource-representing object to an upper-layer network device.

根据本发明的另一方面,提供了一种告警处理装置,应用于控制器,包括:接收模块,用于接收到来自网络设备的告警;分析模块,用于对所述告警进行分析得出分析结果;处理模块,用于依据所述分析结果对所述告警进行处理。According to another aspect of the present invention, an alarm processing device is provided, which is applied to a controller, including: a receiving module, configured to receive an alarm from a network device; an analysis module, configured to analyze the alarm to obtain an analysis Result; a processing module, configured to process the alarm according to the analysis result.

优选地,所述分析模块包括以下至少之一:第一分析单元,用于依据产生所述告警的告警原因,分析得出所述告警是否对控制器管理的业务产生影响的所述分析结果;第二分析单元,用于依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系,分析得出所述告警是否对所述控制器管理的所述业务产生影响的所述分析结果。Preferably, the analysis module includes at least one of the following: a first analysis unit, configured to analyze and obtain the analysis result of whether the alarm has an impact on the service managed by the controller according to the cause of the alarm that generates the alarm; The second analysis unit is configured to analyze whether the alarm affects the service managed by the controller according to the relationship between the resource generating the alarm and the resource occupied by the service managed by the controller of the analysis results.

优选地,所述第二分析单元包括以下之一:第一确定子单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,并且也没有所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务不产生影响;第二确定子单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响;第三确定子单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,确定所述告警对所述控制器管理的所述业务产生影响;第四确定子单元,用于在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响。Preferably, the second analysis unit includes one of the following: a first determining subunit, configured to generate the alarm when the root resource is not occupied by the service managed by the controller and does not correspond to the root resource When the sub-resources of the controller are occupied by the business managed by the controller, it is determined that the alarm has no impact on the business managed by the controller; the second determination subunit is configured to generate the alarm at the root If the resource is not occupied by the service managed by the controller, but the sub-resource corresponding to the root resource is occupied by the service managed by the controller, it is determined that the alarm is for the service managed by the controller produce an impact; the third determining subunit is configured to generate the alarm root resource is not occupied by the business managed by the controller, and the sub-resource corresponding to the root resource is not occupied by the business managed by the controller Occupied, but in the case that the root resource and/or the sub-resource corresponding to the root resource has a mapping relationship with the service managed by the controller, determine that the alarm has an impact on the service managed by the controller; The fourth determining subunit is configured to determine that the alarm has an impact on the service managed by the controller when the root resource generating the alarm is occupied by the service managed by the controller.

优选地,所述处理模块包括以下之一:第一抑制单元,用于在依据所述告警原因,分析得出所述告警对所述控制器管理的业务不产生影响的情况下,对所述告警进行抑制;第二抑制单元,用于在依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系确定所述告警对所述控制器管理的所述业务不产生影响的情况下,对所述告警进行抑制;第一衍生单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,将所述告警衍生为子资源衍生告警;第二衍生单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,依据所述映射关系,将所述告警衍生为关系衍生告警;第一上报单元,用于在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,直接将所述告警上报给上层网络设备。Preferably, the processing module includes one of the following: a first suppressing unit, configured to suppress the Suppressing the alarm; the second suppression unit is configured to determine that the alarm does not affect the business managed by the controller according to the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller In the case of an impact, suppress the alarm; the first derivation unit is configured to generate the root resource that generates the alarm is not occupied by the service managed by the controller, but the sub-resource corresponding to the root resource is occupied by In the case that the service managed by the controller is occupied, the alarm is derived as a sub-resource derived alarm; the second derivation unit is configured to generate the root resource of the alarm is not occupied by the service managed by the controller , the sub-resources corresponding to the root resource are not occupied by the services managed by the controller, but there is a mapping relationship between the root resource and/or the sub-resources corresponding to the root resource and the services managed by the controller In some cases, according to the mapping relationship, the alarm is derived into a relationship-derived alarm; the first reporting unit is configured to directly generate the alarm when the root resource generating the alarm is occupied by the service managed by the controller. The alarm is reported to the upper layer network device.

优选地,该处理模块还包括:记录单元,用于在对所述告警进行衍生处理的情况下,记录所述告警与衍生后的衍生告警之间的关联关系,其中,所述衍生告警包括:子资源衍生告警和/或关系衍生告警;第二上报单元,用于将所述衍生告警以及所述关联关系上报给上层网络设备,和/或,提供单元,用于提供所述关联关系供所述控制器管理系统中的网元设备查询。Preferably, the processing module further includes: a recording unit, configured to record the correlation between the alarm and the derived derived alarm when derivation processing is performed on the alarm, wherein the derived alarm includes: Sub-resource-derived alarms and/or relationship-derived alarms; a second reporting unit, configured to report the derived alarms and the association relationship to an upper-layer network device, and/or a providing unit, configured to provide the association relationship for the The network element device query in the controller management system is described above.

优选地,该装置还包括:生成模块,用于依据所述告警所引起的网元设备的状态变化,或者所述告警的代表资源的对象的状态变化,生成所述网元设备或者所述代表资源的对象的状态通知;上报模块,用于将生成的所述网元设备或者所述代表资源的对象的状态通知上报给上层网络设备。Preferably, the device further includes: a generating module, configured to generate the network element device or the representative resource according to the state change of the network element device caused by the alarm, or the state change of the object representing the resource of the alarm. A status notification of a resource object; a reporting module, configured to report the generated status notification of the network element device or the object representing the resource to an upper-layer network device.

优选地,所述控制器包括以下至少之一:域控制器DC、超级控制器SC,其中,所述SC为所述DC的上层控制器。Preferably, the controller includes at least one of the following: a domain controller DC and a super controller SC, wherein the SC is an upper controller of the DC.

通过本发明,采用控制器接收到来自网络设备的告警;所述控制器对所述告警进行分析得出分析结果;所述控制器依据所述分析结果对所述告警进行处理,解决了相关技术中,不仅存在告警处理不及时,而且在告警数量多的情况下,造成上层网络设备性能低下的问题,进而达到了高效及时处理告警的效果。Through the present invention, a controller is used to receive an alarm from a network device; the controller analyzes the alarm to obtain an analysis result; the controller processes the alarm according to the analysis result, which solves the problem of related technologies In this method, not only the alarms are not processed in time, but also the performance of the upper-layer network equipment is low when there are a large number of alarms, and the effect of efficient and timely alarm processing is achieved.

附图说明Description of drawings

此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The accompanying drawings described here are used to provide a further understanding of the present invention and constitute a part of the application. The schematic embodiments of the present invention and their descriptions are used to explain the present invention and do not constitute improper limitations to the present invention. In the attached picture:

图1是相关技术中使用控制器管控网络的组网示意图;FIG. 1 is a schematic diagram of a network using a controller to manage and control a network in the related art;

图2是相关技术中SDN网络中控制器与其它网络实体之间的关系示意图;FIG. 2 is a schematic diagram of the relationship between the controller and other network entities in the SDN network in the related art;

图3是根据本发明实施例的告警处理方法的流程图;Fig. 3 is a flowchart of an alarm processing method according to an embodiment of the present invention;

图4是根据本发明实施例的告警处理装置的结构框图;Fig. 4 is a structural block diagram of an alarm processing device according to an embodiment of the present invention;

图5是根据本发明实施例的告警处理装置中分析模块44的优选结构框图;Fig. 5 is a preferred structural block diagram of the analysis module 44 in the alarm processing device according to an embodiment of the present invention;

图6是根据本发明实施例的告警处理装置中分析模块44中第二分析单元54的优选结构框图;6 is a preferred structural block diagram of the second analysis unit 54 in the analysis module 44 of the alarm processing device according to an embodiment of the present invention;

图7是根据本发明实施例的告警处理装置中处理模块46的优选结构框图一;Fig. 7 is a preferred structural block diagram 1 of the processing module 46 in the alarm processing device according to an embodiment of the present invention;

图8是根据本发明实施例的告警处理装置中处理模块46的优选结构框图二;FIG. 8 is a second preferred structural block diagram of the processing module 46 in the alarm processing device according to an embodiment of the present invention;

图9是根据本发明实施例的告警处理装置的优选结构框图;Fig. 9 is a preferred structural block diagram of an alarm processing device according to an embodiment of the present invention;

图10是根据本发明实施例的控制器告警分析模块的结构示意图;10 is a schematic structural diagram of a controller alarm analysis module according to an embodiment of the present invention;

图11是根据本发明优选实施例一的告警上报处理的示意图;Fig. 11 is a schematic diagram of alarm reporting processing according to the preferred embodiment 1 of the present invention;

图12是根据本发明优选实施例二的告警上报处理的示意图。Fig. 12 is a schematic diagram of alarm reporting processing according to the second preferred embodiment of the present invention.

具体实施方式detailed description

下文中将参考附图并结合实施例来详细说明本发明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。Hereinafter, the present invention will be described in detail with reference to the drawings and examples. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other.

在本实施例中提供了一种告警处理方法,图3是根据本发明实施例的告警处理方法的流程图,如图3所示,该流程包括如下步骤:In this embodiment, a method for processing an alarm is provided. FIG. 3 is a flowchart of a method for processing an alarm according to an embodiment of the present invention. As shown in FIG. 3 , the process includes the following steps:

步骤S302,控制器接收到来自网络设备的告警,需要说明的是,该网络设备可以是产生该告警的网元设备本身,也可以是控制器设备;In step S302, the controller receives an alarm from a network device. It should be noted that the network device may be the network element device itself that generates the alarm, or may be a controller device;

步骤S304,该控制器对告警进行分析得出分析结果;Step S304, the controller analyzes the alarm to obtain an analysis result;

步骤S306,该控制器依据分析结果对告警进行处理。Step S306, the controller processes the alarm according to the analysis result.

通过上述步骤,由控制器来对接收到的告警进行分析处理,解决了相关技术中,不仅存在告警处理不及时,而且在告警数量多的情况下,造成上层网络设备性能低下的问题,进而达到了高效及时处理告警的效果。Through the above steps, the controller analyzes and processes the received alarms, which solves the problem in the related technology that not only the alarms are not processed in time, but also the performance of the upper-layer network equipment is low when there are a large number of alarms, thereby achieving This ensures efficient and timely handling of alarms.

优选地,控制器对告警进行分析得到分析结果时,采用不同的分析依据,分析的结果也可以不同,例如,可以依据产生告警的告警原因,分析得出告警是否对控制器管理的业务产生影响的分析结果;也可以依据产生告警的资源与控制器管理的业务所占用的资源之间的关系,分析得出告警是否对控制器管理的业务产生影响的分析结果。Preferably, when the controller analyzes the alarm to obtain the analysis result, different analysis basis is used, and the analysis result can also be different. For example, according to the alarm cause of the alarm, it can be analyzed whether the alarm has an impact on the business managed by the controller The analysis result of the alarm can also be obtained based on the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller to obtain the analysis result of whether the alarm has an impact on the business managed by the controller.

在依据产生告警的资源与控制器管理的业务所占用的资源之间的关系,分析得出告警是否对控制器管理的业务产生影响的分析结果时,可以包括以下之一:在产生告警的根资源不被控制器管理的业务所占用,并且也没有根资源对应的子资源被控制器管理的业务所占用的情况下,确定告警对控制器管理的业务不产生影响;在产生告警的根资源不被控制器管理的业务所占用,但根资源对应的子资源被控制器管理的业务所占用的情况下,确定告警对控制器管理的业务产生影响;在产生告警的根资源不被控制器管理的业务所占用,根资源对应的子资源也不被控制器管理的业务所占用,但根资源和/或根资源对应的子资源与控制器管理的业务存在映射关系的情况下,确定告警对控制器管理的业务产生影响;在产生告警的根资源被控制器管理的业务所占用的情况下,确定告警对控制器管理的业务产生影响。According to the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller, when analyzing whether the alarm has an impact on the business managed by the controller, one of the following can be included: If the resource is not occupied by the business managed by the controller, and no sub-resource corresponding to the root resource is occupied by the business managed by the controller, it is determined that the alarm will not affect the business managed by the controller; If it is not occupied by the business managed by the controller, but the sub-resource corresponding to the root resource is occupied by the business managed by the controller, it is determined that the alarm has an impact on the business managed by the controller; The service managed by the controller is occupied, and the sub-resources corresponding to the root resource are not occupied by the services managed by the controller, but the root resource and/or the sub-resources corresponding to the root resource have a mapping relationship with the service managed by the controller, and the alarm is determined. It has an impact on the business managed by the controller; in the case that the root resource generating the alarm is occupied by the business managed by the controller, it is determined that the alarm has an impact on the business managed by the controller.

控制器依据分析结果对告警进行处理时,依据分析结果的不同,对该告警进行处理也不同,下面举例说明:可以在以下情况下,对告警进行抑制处理:例如,在依据告警原因,分析得出告警对控制器管理的业务不产生影响的情况下,对告警进行抑制;又例如,在依据产生告警的资源与控制器管理的业务所占用的资源之间的关系确定告警对控制器管理的业务不产生影响的情况下,对告警进行抑制;可以在以下情况下,对该告警进行衍生处理:例如,在产生告警的根资源不被控制器管理的业务所占用,但根资源对应的子资源被控制器管理的业务所占用的情况下,将告警衍生为子资源衍生告警;又例如,在产生告警的根资源不被控制器管理的业务所占用,根资源对应的子资源也不被控制器管理的业务所占用,但根资源和/或根资源对应的子资源与控制器管理的业务存在映射关系的情况下,依据映射关系,将告警衍生为关系衍生告警;另外,在产生告警的根资源被控制器管理的业务所占用的情况下,直接将告警上报给上层网络设备。需要说明的是,此处的上层网络设备是一个相对概念,例如,当控制器DC接收到该告警时,该上层网络设备即是该控制器DC的上层网络设备SC,再例如,当上层网络设备SC接收到该告警时,该上层网络设备即是该SC的上层网络设备NMS。When the controller processes the alarm according to the analysis results, the alarm processing is different according to the analysis results. The following example illustrates that the alarm can be suppressed in the following situations: for example, based on the alarm cause, the analysis results are When the alarm has no impact on the business managed by the controller, the alarm is suppressed; another example is to determine the impact of the alarm on the controller management based on the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller. Suppress the alarm when the business is not affected; the alarm can be derived in the following cases: For example, when the root resource that generates the alarm is not occupied by the business managed by the controller, but the sub When the resource is occupied by the business managed by the controller, the alarm is derived as a sub-resource-derived alarm; for example, if the root resource that generates the alarm is not occupied by the business managed by the controller, the sub-resource corresponding to the root resource is also not The business managed by the controller is occupied, but the root resource and/or the sub-resources corresponding to the root resource have a mapping relationship with the business managed by the controller, according to the mapping relationship, the alarm is derived as a relationship-derived alarm; in addition, when the alarm is generated When the root resource of the controller is occupied by the business managed by the controller, the alarm is directly reported to the upper-layer network device. It should be noted that the upper-layer network device here is a relative concept. For example, when the controller DC receives the alarm, the upper-layer network device is the upper-layer network device SC of the controller DC. For example, when the upper-layer network When the device SC receives the alarm, the upper-layer network device is the upper-layer network device NMS of the SC.

需要说明的是,在对告警进行衍生处理的情况下,还可以:记录告警与衍生后的衍生告警之间的关联关系,其中,衍生告警包括:子资源衍生告警和/或关系衍生告警;将衍生告警以及关联关系上报给上层网络设备,和/或提供关联关系供控制器管理系统中的网元设备查询。It should be noted that, in the case of deriving the alarm, it is also possible to: record the association relationship between the alarm and the derived derived alarm, wherein the derived alarm includes: a sub-resource derived alarm and/or a relationship derived alarm; Derived alarms and associated relationships are reported to upper-layer network devices, and/or associated relationships are provided for query by network element devices in the controller management system.

另外,在依据分析结果对告警进行处理之后,还可以包括以下处理:依据告警所引起的网元设备的状态变化,或者告警的代表资源的对象的状态变化,生成网元设备或者代表资源的对象的状态通知;将生成的网元设备或者代表资源的对象的状态通知上报给上层网络设备。In addition, after the alarm is processed according to the analysis result, the following processing may also be included: according to the state change of the network element device caused by the alarm, or the state change of the object representing the resource in the alarm, generate the network element device or the object representing the resource status notification; report the generated status notification of the network element device or the object representing the resource to the upper-layer network device.

在本实施例中还提供了一种告警处理装置,该装置用于实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。This embodiment also provides an alarm processing device, which is used to implement the above embodiments and preferred implementation manners, and what has been described will not be repeated. As used below, the term "module" may be a combination of software and/or hardware that realizes a predetermined function. Although the devices described in the following embodiments are preferably implemented in software, implementations in hardware, or a combination of software and hardware are also possible and contemplated.

图4是根据本发明实施例的告警处理装置的结构框图,如图4所示,该装置可以应用于控制器,包括接收模块42、分析模块44和处理模块46,下面对该装置进行说明。Fig. 4 is a structural block diagram of an alarm processing device according to an embodiment of the present invention. As shown in Fig. 4, the device can be applied to a controller, including a receiving module 42, an analyzing module 44 and a processing module 46, and the device will be described below .

接收模块42,用于接收到来自网络设备的告警;分析模块44,连接至上述接收模块42,用于对告警进行分析得出分析结果;处理模块46,连接至上述分析模块44,用于依据分析结果对告警进行处理。The receiving module 42 is used to receive an alarm from the network device; the analysis module 44 is connected to the above receiving module 42, and is used to analyze the alarm to obtain an analysis result; the processing module 46 is connected to the above analysis module 44, and is used to Analyze the results and handle the alarms.

图5是根据本发明实施例的告警处理装置中分析模块44的优选结构框图,如图5所示,该分析模块44包括以下至少之一:第一分析单元52、第二分析单元54,下面对该分析模块44进行说明。Fig. 5 is a preferred structural block diagram of an analysis module 44 in an alarm processing device according to an embodiment of the present invention. As shown in Fig. 5, the analysis module 44 includes at least one of the following: a first analysis unit 52, a second analysis unit 54, and The analysis module 44 will be described.

第一分析单元52,用于依据产生告警的告警原因,分析得出告警是否对控制器管理的业务产生影响的分析结果;第二分析单元54,用于依据产生告警的资源与控制器管理的业务所占用的资源之间的关系,分析得出告警是否对控制器管理的业务产生影响的分析结果。The first analysis unit 52 is used to analyze and obtain the analysis result of whether the alarm has an impact on the business managed by the controller according to the cause of the alarm; the second analysis unit 54 is used to analyze the resource and the resource managed by the controller according to the alarm. The relationship between the resources occupied by the business can be analyzed to obtain the analysis result of whether the alarm has an impact on the business managed by the controller.

图6是根据本发明实施例的告警处理装置中分析模块44中第二分析单元54的优选结构框图,如图6所示,该第二分析单元54包括以下之一:第一确定子单元62、第二确定子单元64、第三确定子单元66、第四确定子单元68,下面对该第二分析单元54进行说明。FIG. 6 is a preferred structural block diagram of the second analysis unit 54 in the analysis module 44 of the alarm processing device according to an embodiment of the present invention. As shown in FIG. 6 , the second analysis unit 54 includes one of the following: a first determination subunit 62 , the second determination subunit 64 , the third determination subunit 66 , and the fourth determination subunit 68 , the second analysis unit 54 will be described below.

第一确定子单元62,用于在产生告警的根资源不被控制器管理的业务所占用,并且也没有根资源对应的子资源被控制器管理的业务所占用的情况下,确定告警对控制器管理的业务不产生影响;第二确定子单元64,用于在产生告警的根资源不被控制器管理的业务所占用,但根资源对应的子资源被控制器管理的业务所占用的情况下,确定告警对控制器管理的业务产生影响;第三确定子单元66,用于在产生告警的根资源不被控制器管理的业务所占用,根资源对应的子资源也不被控制器管理的业务所占用,但根资源和/或根资源对应的子资源与控制器管理的业务存在映射关系的情况下,确定告警对控制器管理的业务产生影响;第四确定子单元68,用于在产生告警的根资源被控制器管理的业务所占用的情况下,确定告警对控制器管理的业务产生影响。The first determining subunit 62 is configured to determine the impact of the alarm on the control when the root resource that generates the alarm is not occupied by the business managed by the controller, and no sub-resource corresponding to the root resource is occupied by the business managed by the controller. The business managed by the controller does not have an impact; the second determination subunit 64 is used for the case where the root resource that generates the alarm is not occupied by the business managed by the controller, but the sub-resource corresponding to the root resource is occupied by the business managed by the controller Next, it is determined that the alarm has an impact on the business managed by the controller; the third determining subunit 66 is used to determine that the root resource that generates the alarm is not occupied by the business managed by the controller, and the sub-resources corresponding to the root resource are not managed by the controller is occupied by the business, but there is a mapping relationship between the root resource and/or the sub-resources corresponding to the root resource and the business managed by the controller, it is determined that the alarm has an impact on the business managed by the controller; the fourth determination subunit 68 is used to In the case that the root resource generating the alarm is occupied by the service managed by the controller, it is determined that the alarm has an impact on the service managed by the controller.

图7是根据本发明实施例的告警处理装置中处理模块46的优选结构框图一,如图7所示,该处理模块46包括以下之一:第一抑制单元70、第二抑制单元72、第一衍生单元74、第二衍生单元76、第一上报单元78,下面对该处理模块46进行说明。Fig. 7 is a preferred structural block diagram 1 of the processing module 46 in the alarm processing device according to an embodiment of the present invention. As shown in Fig. 7, the processing module 46 includes one of the following: a first suppressing unit 70, a second suppressing unit 72, a A derivation unit 74, a second derivation unit 76, and a first reporting unit 78. The processing module 46 will be described below.

第一抑制单元70,用于在依据告警原因,分析得出告警对控制器管理的业务不产生影响的情况下,对告警进行抑制;第二抑制单元72,用于在依据产生告警的资源与控制器管理的业务所占用的资源之间的关系确定告警对控制器管理的业务不产生影响的情况下,对告警进行抑制;第一衍生单元74,用于在产生告警的根资源不被控制器管理的业务所占用,但根资源对应的子资源被控制器管理的业务所占用的情况下,将告警衍生为子资源衍生告警;第二衍生单元76,用于在产生告警的根资源不被控制器管理的业务所占用,根资源对应的子资源也不被控制器管理的业务所占用,但根资源和/或根资源对应的子资源与控制器管理的业务存在映射关系的情况下,依据映射关系,将告警衍生为关系衍生告警;第一上报单元78,用于在产生告警的根资源被控制器管理的业务所占用的情况下,直接将告警上报给上层网络设备。The first suppression unit 70 is configured to suppress the alarm based on the analysis of the cause of the alarm and shows that the alarm has no impact on the business managed by the controller; the second suppression unit 72 is configured to suppress the alarm based on the resource and When the relationship between the resources occupied by the business managed by the controller determines that the alarm does not affect the business managed by the controller, the alarm is suppressed; the first derivation unit 74 is used to control the root resource that generates the alarm In the case that the service managed by the controller is occupied, but the sub-resource corresponding to the root resource is occupied by the service managed by the controller, the alarm is derived as a sub-resource derived alarm; the second derivation unit 76 is used to generate the alarm if the root resource is not It is occupied by the business managed by the controller, and the sub-resources corresponding to the root resource are not occupied by the business managed by the controller, but there is a mapping relationship between the root resource and/or the sub-resources corresponding to the root resource and the business managed by the controller , deriving the alarm into a relationship-derived alarm according to the mapping relationship; the first reporting unit 78 is configured to directly report the alarm to the upper-layer network device when the root resource generating the alarm is occupied by a service managed by the controller.

图8是根据本发明实施例的告警处理装置中处理模块46的优选结构框图二,如图8所示,该处理模块46除包括上述第一衍生模块74和/或第一衍生模块76外,还包括:记录单元82、第二上报单元84和/或提供单元86,下面对该处理模块46进行说明。FIG. 8 is a second preferred structural block diagram of the processing module 46 in the alarm processing device according to an embodiment of the present invention. As shown in FIG. 8 , the processing module 46 includes the first derivative module 74 and/or the first derivative module 76, It also includes: a recording unit 82 , a second reporting unit 84 and/or a providing unit 86 , and the processing module 46 will be described below.

记录单元82,用于在对告警进行衍生处理的情况下,记录告警与衍生后的衍生告警之间的关联关系,其中,衍生告警包括:子资源衍生告警和/或关系衍生告警;第二上报单元84,用于将衍生告警以及关联关系上报给上层网络设备,和/或,提供单元86,用于提供关联关系供控制器管理系统中的网元设备查询。The recording unit 82 is configured to record the association relationship between the alarm and the derived alarm when deriving the alarm, wherein the derived alarm includes: a sub-resource derived alarm and/or a relationship derived alarm; the second report The unit 84 is configured to report the derived alarm and the associated relationship to the upper-layer network device, and/or the providing unit 86 is configured to provide the associated relationship for query by the network element devices in the controller management system.

图9是根据本发明实施例的告警处理装置的优选结构框图,如图9所示,该装置除包括图4所示的所有模块外,还包括:生成模块92和上报模块94,下面对该装置进行说明。Fig. 9 is a preferred structural block diagram of an alarm processing device according to an embodiment of the present invention. As shown in Fig. 9, in addition to all the modules shown in Fig. 4, the device also includes: a generating module 92 and a reporting module 94, as follows The device is described.

生成模块92,连接至上述处理模块46,用于依据告警所引起的网元设备的状态变化,或者告警的代表资源的对象的状态变化,生成网元设备或者代表资源的对象的状态通知;上报模块94,连接至上述生成模块92,用于将生成的网元设备或者代表资源的对象的状态通知上报给上层网络设备。The generation module 92 is connected to the above-mentioned processing module 46, and is used to generate a status notification of the network element device or the object representing the resource according to the state change of the network element device caused by the alarm, or the state change of the object representing the resource in the alarm; report The module 94 is connected to the generating module 92, and is configured to report the generated status notification of the network element device or the object representing the resource to the upper layer network device.

优选地,该控制器可以包括以下至少之一:域控制器DC、超级控制器SC,其中,SC为DC的上层控制器。需要指出的是,SC可以有多个,从DC看,SC都是上层控制器,SC上面如果还有SC,上面的是更上层的控制器,这个是相对关系。Preferably, the controller may include at least one of the following: a domain controller DC and a super controller SC, wherein the SC is an upper controller of the DC. It should be pointed out that there can be multiple SCs. From the perspective of DC, SCs are upper-level controllers. If there are SCs above SCs, the upper ones are higher-level controllers. This is a relative relationship.

在SDN控制器管控的网络中,由于SDN控制器的控制特性,其只关心业务相关的资源,而且由于权限划分原因,原始告警可能是由与业务无关的资源产生的,或者是由控制器不管理的业务产生的,但是对控制器管理的业务会产生影响,造成业务不通,或者质量下降。由于控制器的主要核心是进行业务管理,在网络管理系统或者控制平面是不对设备资源和业务产生告警的,鉴于此,在本实施例中,提出了告警也要表现成业务的告警,要由控制器衍生出受影响的业务告警。另外,除了告警衍生外,控制器还需要对告警进行影响业务的分析,告警继续上报等处理。In the network controlled by the SDN controller, due to the control characteristics of the SDN controller, it only cares about service-related resources, and due to the division of rights, the original alarm may be generated by resources that are not related to the service, or the controller does not The management service is generated, but it will have an impact on the service managed by the controller, resulting in service failure or quality degradation. Since the main core of the controller is to perform business management, the network management system or the control plane does not generate alarms for equipment resources and services. The controller generates alarms about the affected services. In addition, in addition to alarm derivation, the controller also needs to analyze the impact of the alarm on the business, and continue to report the alarm.

下面对该方案进行简单说明。A brief description of the scheme is given below.

在网络控制器中增加一个告警分析模块,图10是根据本发明实施例的控制器告警分析模块的结构示意图,如图10所示,该告警分析模块执行以下步骤的分析处理:An alarm analysis module is added in the network controller. FIG. 10 is a schematic structural diagram of the controller alarm analysis module according to an embodiment of the present invention. As shown in FIG. 10 , the alarm analysis module performs the analysis and processing of the following steps:

步骤1:设备检测到故障后产生告警,同时上报给控制器和网管。Step 1: After the device detects a fault, it generates an alarm and reports it to the controller and network management at the same time.

步骤2:控制器收到设备告警后进行分析和处理,包括下面几个子步骤的一个或多个的灵活组合:Step 2: After the controller receives the device alarm, it analyzes and processes it, including a flexible combination of one or more of the following sub-steps:

步骤2.1:根据告警原因进行分类,部分告警原因对业务没有影响,如果收到的是这些告警原因之一,该告警可以不处理,或者加以抑制。这样的告警原因包括但不限于:温度告警,电压告警,烟雾告警,等等很多,不一一列出。控制器需要处理的告警,包括但不限于:LOS(Loss of Signal),DEG(Degraded Signal),LOC(Loss of Connectivity/Loss of Connection),AIS(Alarm Indication Signal)。Step 2.1: Classify according to the cause of the alarm. Some of the reasons for the alarm have no impact on the business. If one of the reasons for the alarm is received, the alarm can not be processed or suppressed. Such alarm reasons include but are not limited to: temperature alarms, voltage alarms, smoke alarms, etc., which are not listed one by one. The alarms that the controller needs to handle include but are not limited to: LOS (Loss of Signal), DEG (Degraded Signal), LOC (Loss of Connectivity/Loss of Connection), AIS (Alarm Indication Signal).

步骤2.2:分析告警,发现产生告警的资源,不是业务占用的资源,也不对业务所占用的资源产生影响,这样的告警也可以不处理,或者加以抑制。这样的资源包括但不限于:电源模块,风扇模块,背板总线等等不一一列出。控制器需要处理的产生告警资源,包括但不限于:物理端口,连接终端点,路径终端点,逻辑终端点,流点,流点池。Step 2.2: Analyze the alarm and find that the resource that generates the alarm is not the resource occupied by the business, nor will it affect the resource occupied by the business. Such an alarm can also be ignored or suppressed. Such resources include but are not limited to: power supply modules, fan modules, backplane buses, etc. are not listed one by one. The resources that the controller needs to handle to generate alarms include but are not limited to: physical ports, connection endpoints, path endpoints, logical endpoints, flow points, and flow point pools.

步骤2.3:分析告警,发现产生告警的资源,不是业务直接占用的资源,但是该资源下有子资源被业务占用,可以把告警衍生为业务所占用的子资源的告警,然后控制器把衍生告警上报给上层控制器和网管平面。原始告警可以根据实现需要继续上报或者不上报。Step 2.3: Analyze the alarm and find that the resource that generates the alarm is not directly occupied by the business, but there are sub-resources under the resource that are occupied by the business. The alarm can be derived as an alarm for the sub-resources occupied by the business, and then the controller sends the derived alarm Report to the upper controller and network management plane. The original alarm can continue to be reported or not reported according to the implementation needs.

步骤2.4:分析告警,发现产生告警的资源,是业务占用的资源,上报的告警直接就属于业务的告警,可以不做衍生,把该告警继续上报给上层控制器和网管平面。Step 2.4: Analyze the alarm and find that the resource that generates the alarm is the resource occupied by the service. The reported alarm directly belongs to the service alarm, and there is no need to make a derivation, and continue to report the alarm to the upper controller and network management plane.

步骤2.5:分析告警,发现产生告警的资源,没有和业务的资源对象有直接对应关系,而是存在间接的映射关系(例如,虚拟化virtualization),控制器根据这种间接的映射关系,衍生出业务资源的告警,然后控制器把衍生告警上报给上层控制和网管平面。原始告警可以根据实现需要继续上报或者不上报。Step 2.5: Analyze the alarm and find that the resource that generates the alarm does not have a direct correspondence with the resource object of the business, but has an indirect mapping relationship (for example, virtualization), and the controller derives the Alarms of service resources, and then the controller reports the derived alarms to the upper control and network management planes. The original alarm can continue to be reported or not reported according to the implementation needs.

步骤2.6:分析告警,发现该告警是网元告警,但是控制器对该告警不感兴趣,而是对告警所引起的网元状态变化感兴趣,控制器抑制或丢弃该告警,生成状态通知,修改本地网元状态,将状态通知上报上层控制器和网管平面。Step 2.6: Analyze the alarm and find that the alarm is a network element alarm, but the controller is not interested in the alarm, but is interested in the state change of the network element caused by the alarm, the controller suppresses or discards the alarm, generates a status notification, and modifies Local network element status, and report the status notification to the upper controller and network management plane.

步骤2.7:控制器衍生出的新告警,原始告警和衍生告警的对应关系记录下来,并在上报告警时提供,或者提供查询,以便网管平面进行根因分析,找出故障根源,进行修复和排除故障。Step 2.7: For new alarms derived from the controller, record the corresponding relationship between the original alarm and the derived alarm, and provide it when reporting the alarm, or provide a query, so that the network management plane can perform root cause analysis, find out the root cause of the fault, and repair and repair it. Troubleshoot.

步骤3:EMS从设备收到原始告警和从控制器收到衍生告警后,可以进行本地保存、显示、确认、注释等管理,EMS把这两个告警都上报给NMS/OSS。Step 3: After the EMS receives the original alarm from the device and the derived alarm from the controller, it can perform local storage, display, confirmation, comment, etc. management, and the EMS reports both alarms to the NMS/OSS.

步骤4:上层控制器收到下层控制器上报的衍生告警后,采用步骤2的分析处理方法,进行本地分析处理,结果上报APP和OSS/NMS。Step 4: After the upper-layer controller receives the derived alarm reported by the lower-layer controller, it uses the analysis and processing method in step 2 to perform local analysis and processing, and reports the result to APP and OSS/NMS.

步骤5:APP收到业务告警后,用于界面显示,以及业务质量评估等用途。Step 5: After the APP receives the service alarm, it is used for interface display and service quality evaluation.

步骤6:OSS/NMS收到EMS上报的原始告警,和衍生告警,以及从上层控制器上报的衍生告警,进行综合分析,包括分析告警关联关系,根因分析等,为网络运行、维护、排除故障、计费等提供根据。Step 6: OSS/NMS receives the original alarms reported by the EMS, derived alarms, and derived alarms reported from the upper controller, and conducts a comprehensive analysis, including analysis of alarm correlations, root cause analysis, etc., for network operation, maintenance, and troubleshooting. Provide basis for faults, billing, etc.

通过控制器对设备资源告警进行处理的方法,包括收到告警后的分析,处理,衍生,和继续向其他系统上报的方法。为控制器处理告警提供一种高效可行的,满足业务管理目的的告警处理方法。The method for processing device resource alarms through the controller includes analyzing, processing, deriving, and continuing to report to other systems after receiving the alarm. Provides an efficient and feasible alarm processing method for the controller to process alarms, which meets the purpose of business management.

下面以一个跨多控制器的业务中产生告警为例,说明告警分析处理方法。The following takes an alarm generated in a service across multiple controllers as an example to illustrate the alarm analysis and processing method.

优选实施例一Preferred embodiment one

图11是根据本发明优选实施例一的告警上报处理的示意图,如图11所示,域控制器DC1和DC2,划分区域管理通讯网络,以及超级控制器SC,作为上层控制器管理DC1和DC2。两个管理域中连接NE1,NE2到NE6组成通讯网络。NE1到NE6的连线表示一个APP关心的业务,业务A端点在NE1,Z端点在NE6,中间经过NE2,NE3,NE4,NE5,分别被DC1,DC2直接管理,图中虚线表示管理关系。DC1、DC2和SC分别有配对的网管系统,假设在网元1产生了一个告警Alarm1,其他几个网元也有告警产生。Fig. 11 is a schematic diagram of the alarm reporting process according to the first preferred embodiment of the present invention. As shown in Fig. 11, the domain controllers DC1 and DC2 divide the regional management communication network, and the super controller SC manages DC1 and DC2 as the upper controller . The two management domains connect NE1, NE2 to NE6 to form a communication network. The connection from NE1 to NE6 represents a service concerned by the APP. The service A endpoint is at NE1, Z endpoint is at NE6, and the service passes through NE2, NE3, NE4, and NE5 in the middle, and is directly managed by DC1 and DC2 respectively. The dotted line in the figure indicates the management relationship. DC1, DC2, and SC have paired network management systems respectively. Suppose an alarm Alarm1 is generated on NE 1, and alarms are also generated on several other NEs.

第1步:NE1检测到故障,烟雾探测器感觉到烟雾超标,烟雾探测模块产生告警alarm1,同时上报给DC1和EMS1。Step 1: NE1 detects a fault, the smoke detector senses that the smoke exceeds the limit, the smoke detection module generates an alarm alarm1, and reports to DC1 and EMS1 at the same time.

第2步:DC1收到Alarm1后分析,分析模块对告警原因进行分析,发现alarm1的告警原因是烟雾告警,属于和业务无关的告警原因,于是DC1忽略这个告警。Step 2: DC1 analyzes after receiving Alarm1, and the analysis module analyzes the cause of the alarm, and finds that the cause of the alarm of alarm1 is a smoke alarm, which is irrelevant to the service, so DC1 ignores the alarm.

第3步:NE2检测到故障,风扇模块发现风扇转速异常,产生告警alarm2,同时上报给DC1和EMS1。Step 3: NE2 detects a fault, and the fan module finds that the fan speed is abnormal, generates an alarm alarm2, and reports it to DC1 and EMS1 at the same time.

第4步:DC1收到alarm2后分析,分析模块对告警源进行分析,发现alarm2产生的资源是风扇模块,和业务无关,于是DC1忽略这个告警。Step 4: DC1 analyzes after receiving alarm2, and the analysis module analyzes the alarm source, and finds that the resource generated by alarm2 is a fan module, which has nothing to do with the business, so DC1 ignores the alarm.

第5步:NE1的OTN光端口有故障,OTN有信号劣化告警,产生告警alarm3,同时上报给DC1和EMS1。Step 5: The OTN optical port of NE1 is faulty, and the OTN has a signal degradation alarm, generates an alarm alarm3, and reports it to DC1 and EMS1 at the same time.

第6步:DC1收到alarm3后分析,分析模块分析OTN端口,该资源虽然没有被业务直接使用,但是OTN下的子通道ODU1-1被正在业务1使用,DC1衍生出了ODU1-1资源的劣化告警alarm4,并记录alarm3-alarm4的对应关系,然后DC1把alarm4报告给SC和EMS1,并附带alarm3-alarm4的关联关系。Step 6: DC1 analyzes after receiving alarm3, and the analysis module analyzes the OTN port. Although this resource is not directly used by the service, the sub-channel ODU1-1 under the OTN is used by the service 1, and DC1 derives the ODU1-1 resource. Deterioration alarm alarm4, and record the correspondence between alarm3-alarm4, and then DC1 reports alarm4 to SC and EMS1, with the association of alarm3-alarm4 attached.

第7步:EMS总共收到alarm1,alarm2,alarm3,alarm4。在进行传统的EMS告警管理的同时,把这几个告警都上报给NMS/OSS1。Step 7: EMS received alarm1, alarm2, alarm3, alarm4 in total. While performing traditional EMS alarm management, report these alarms to NMS/OSS1.

第8步:SC收到告警alarm4,分析模块发现,由于告警的资源ODU1以及所在的业务1在SC经过虚拟化(virtualization)和映射到业务2,并且ODU1虚拟化成LTP1。衍生模块SC从alarm4衍生出LTP1资源的alarm5,并记录alarm4-alarm5的对应关系。SC把alarm5报告给app和NMS/OSS1,并附带alarm4-alarm5的关联关系。Step 8: The SC receives the alarm alarm4, and the analysis module finds that the alarm resource ODU1 and the service 1 are virtualized and mapped to the service 2 in the SC, and ODU1 is virtualized into LTP1. The derivation module SC derives alarm5 of the LTP1 resource from alarm4, and records the corresponding relationship between alarm4 and alarm5. The SC reports alarm5 to the app and NMS/OSS1 with the alarm4-alarm5 association.

第9步:NMS/OSS1总共收到告警alarm1,alarm2,alarm3,alarm4,alarm5。由于已经有alarm3-alarm4-alarm5的衍生对应关系,NMS/OSS1很容易分析得出alarm3是根源告警,需要通知维护人员修复设备,排除故障。Step 9: NMS/OSS1 receives alarm1, alarm2, alarm3, alarm4, alarm5 in total. Since there is already a derivative corresponding relationship of alarm3-alarm4-alarm5, NMS/OSS1 can easily analyze that alarm3 is the root alarm, and the maintenance personnel need to be notified to repair the device and troubleshoot the fault.

第10步:APP收到alarm5后,得知自己的业务2,产生了告警5,业务2收到影响。App用户以此和网络运营商沟通协调。Step 10: After the APP receives alarm5, it knows that its business 2 generates alarm 5, and business 2 is affected. App users use this to communicate and coordinate with network operators.

类似的NE3,NE4产生的告警,在DC2,EMS2,NMS/OSS2进行了类似的处理流程。Alarms generated by NE3 and NE4 are similarly processed in DC2, EMS2, and NMS/OSS2.

优选实施例二Preferred embodiment two

图12是根据本发明优选实施例二的告警上报处理的示意图,如图12所示,网络中的一个网元NE1,被实施的域控制器DC1管理,同时被EMS1网管系统管理,NE1产生了一个告警a,结果处理流程如下Fig. 12 is a schematic diagram of the alarm reporting process according to the second preferred embodiment of the present invention. As shown in Fig. 12, a network element NE1 in the network is managed by the implemented domain controller DC1 and managed by the EMS1 network management system at the same time. NE1 generates An alarm a, the result processing flow is as follows

第1步:NE1检测到故障,上报了告警a。Step 1: NE1 detects a fault and reports alarm a.

第2步:DC1对告警a进行分析,根据告警原因和告警源进行分析,发现告警a的子资源被业务使用,按照分析结果把告警a丢弃,衍生了告警b,然后把告警b上报给上层控制器SC1,和网管系统EMS1.Step 2: DC1 analyzes alarm a, analyzes the cause and source of the alarm, finds that the sub-resource of alarm a is used by the service, discards alarm a according to the analysis result, derives alarm b, and then reports alarm b to the upper layer Controller SC1, and network management system EMS1.

第3步:EMS1收到告警a和告警b后,告警a和b都上报到上层网管OSS1。Step 3: After EMS1 receives alarm a and alarm b, both alarm a and b are reported to upper network management OSS1.

第4步:SC1收到DC1上报的告警b后,根据告警原因和告警源进行分析,发现告警b的资源经过虚拟化后,变成了逻辑对象obj1,他们属于虚拟化的映射关系,SC1丢弃告警b,衍生了告警c。SC1把告警c同时上报给app和OSS1。Step 4: After SC1 receives the alarm b reported by DC1, it analyzes according to the cause and source of the alarm, and finds that the resource of alarm b becomes the logical object obj1 after being virtualized. They belong to the virtualized mapping relationship, and SC1 discards them. Alarm b generates alarm c. SC1 reports alarm c to app and OSS1 at the same time.

第5步:app收到告警通知,得知自己的业务中的抽象对象obj1有告警c,出于排除故障的目的,app向OSS1发出查询所有告警以及关联关系的请求。Step 5: The app receives the alarm notification and learns that the abstract object obj1 in its own business has an alarm c. For the purpose of troubleshooting, the app sends a request to OSS1 to query all alarms and related relationships.

第6步:OSS1从SC1查询告警关联关系,得到告警b和告警c之间的关联关系。Step 6: OSS1 queries the alarm association relationship from SC1, and obtains the association relationship between alarm b and alarm c.

第7步:OSS1从EMS1查询告警关联关系。Step 7: OSS1 queries the alarm association relationship from EMS1.

第8步:EMS1从DC1那里查到告警a和告警b的关联关系。汇报给OSS1。Step 8: EMS1 finds the relationship between alarm a and alarm b from DC1. Report to OSS1.

第9步:OSS1汇总得到告警a,b,c的关联关系,报告给APP。Step 9: OSS1 summarizes and obtains the association relationship of alarms a, b, and c, and reports to APP.

第10步:app从查询应答得到所有告警a,b,c并知道a是根源告警。用于通知运维人员进行修复设备排除故障。Step 10: The app gets all alarms a, b, c from the query response and knows that a is the root alarm. It is used to notify the operation and maintenance personnel to repair the device and troubleshoot the fault.

显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that each module or each step of the above-mentioned present invention can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network formed by multiple computing devices Alternatively, they may be implemented in program code executable by a computing device so that they may be stored in a storage device to be executed by a computing device, and in some cases in an order different from that shown here The steps shown or described are carried out, or they are separately fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present invention is not limited to any specific combination of hardware and software.

以上仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention, and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims (13)

1.一种告警处理方法,其特征在于,包括:1. An alarm processing method, characterized in that, comprising: 控制器接收到来自网络设备的告警;The controller receives an alarm from the network device; 所述控制器对所述告警进行分析得出分析结果;The controller analyzes the alarm to obtain an analysis result; 所述控制器依据所述分析结果对所述告警进行处理。The controller processes the alarm according to the analysis result. 2.根据权利要求1所述的方法,其特征在于,所述控制器对所述告警进行分析得到所述分析结果包括以下至少之一:2. The method according to claim 1, wherein the controller analyzes the alarm to obtain the analysis result including at least one of the following: 依据产生所述告警的告警原因,分析得出所述告警是否对控制器管理的业务产生影响的所述分析结果;According to the cause of the alarm that generates the alarm, analyze and obtain the analysis result of whether the alarm has an impact on the business managed by the controller; 依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系,分析得出所述告警是否对所述控制器管理的所述业务产生影响的所述分析结果。The analysis result of whether the alarm affects the service managed by the controller is obtained by analyzing the relationship between the resource generating the alarm and the resource occupied by the service managed by the controller. 3.根据权利要求2所述的方法,其特征在于,依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系,分析得出所述告警是否对所述控制器管理的所述业务产生影响的所述分析结果包括以下之一:3. The method according to claim 2, characterized in that, according to the relationship between the resources that generate the alarm and the resources occupied by the business managed by the controller, it is analyzed whether the alarm has an impact on the control The analysis result that affects the business managed by the server includes one of the following: 在产生所述告警的根资源不被所述控制器管理的业务所占用,并且也没有所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务不产生影响;If the root resource that generates the alarm is not occupied by the service managed by the controller, and no sub-resource corresponding to the root resource is occupied by the service managed by the controller, determine that the alarm is The business managed by the controller will not be affected; 在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响;If the root resource that generates the alarm is not occupied by the service managed by the controller, but the sub-resource corresponding to the root resource is occupied by the service managed by the controller, determine that the alarm has a significant impact on the The said business managed by the controller has an impact; 在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,确定所述告警对所述控制器管理的所述业务产生影响;The root resource that generates the alarm is not occupied by the service managed by the controller, and the sub-resource corresponding to the root resource is not occupied by the service managed by the controller, but the root resource and/or the If there is a mapping relationship between the sub-resource corresponding to the root resource and the service managed by the controller, determine that the alarm has an impact on the service managed by the controller; 在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响。If the root resource that generates the alarm is occupied by the service managed by the controller, it is determined that the alarm affects the service managed by the controller. 4.根据权利要求3所述的方法,其特征在于,所述控制器依据所述分析结果对所述告警进行处理包括以下之一:4. The method according to claim 3, wherein the processing of the alarm by the controller according to the analysis result includes one of the following: 在依据所述告警原因,分析得出所述告警对所述控制器管理的业务不产生影响的情况下,对所述告警进行抑制;Suppressing the alarm when it is analyzed based on the cause of the alarm that the alarm has no impact on services managed by the controller; 在依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系确定所述告警对所述控制器管理的所述业务不产生影响的情况下,对所述告警进行抑制;When it is determined according to the relationship between the resource generating the alarm and the resource occupied by the service managed by the controller that the alarm has no impact on the service managed by the controller, performing an alarm on the alarm inhibition; 在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,将所述告警衍生为子资源衍生告警;In the case that the root resource that generates the alarm is not occupied by the service managed by the controller, but the sub-resource corresponding to the root resource is occupied by the service managed by the controller, derive the alarm as a sub-resource Resource-derived alerts; 在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,依据所述映射关系,将所述告警衍生为关系衍生告警;The root resource that generates the alarm is not occupied by the service managed by the controller, and the sub-resource corresponding to the root resource is not occupied by the service managed by the controller, but the root resource and/or the If there is a mapping relationship between the sub-resource corresponding to the root resource and the service managed by the controller, deriving the alarm into a relationship-derived alarm according to the mapping relationship; 在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,直接将所述告警上报给上层网络设备。When the root resource generating the alarm is occupied by the service managed by the controller, directly report the alarm to the upper-layer network device. 5.根据权利要求4所述的方法,其特征在于,在对所述告警进行衍生处理的情况下,还包括:5. The method according to claim 4, further comprising: 记录所述告警与衍生后的衍生告警之间的关联关系,其中,所述衍生告警包括:子资源衍生告警和/或关系衍生告警;Recording the association relationship between the alarm and the derived alarm after derivation, wherein the derived alarm includes: a sub-resource derived alarm and/or a relationship derived alarm; 将所述衍生告警以及所述关联关系上报给上层网络设备,和/或提供所述关联关系供所述控制器管理系统中的网元设备查询。Reporting the derived alarm and the association relationship to an upper-layer network device, and/or providing the association relationship for query by network element devices in the controller management system. 6.根据权利要求1至5中任一项所述的方法,其特征在于,在所述控制器依据所述分析结果对所述告警进行处理之后,还包括:6. The method according to any one of claims 1 to 5, further comprising: after the controller processes the alarm according to the analysis result: 依据所述告警所引起的网元设备的状态变化,或者所述告警的代表资源的对象的状态变化,生成所述网元设备或者所述代表资源的对象的状态通知;generating a status notification of the network element device or the object representing the resource according to the state change of the network element device caused by the alarm, or the state change of the object representing the resource in the alarm; 将生成的所述网元设备或者所述代表资源的对象的状态通知上报给上层网络设备。Reporting the generated status notification of the network element device or the object representing the resource to the upper layer network device. 7.一种告警处理装置,其特征在于,应用于控制器包括:7. An alarm processing device, characterized in that being applied to a controller includes: 接收模块,用于接收到来自网络设备的告警;A receiving module, configured to receive an alarm from a network device; 分析模块,用于对所述告警进行分析得出分析结果;An analysis module, configured to analyze the alarm to obtain an analysis result; 处理模块,用于依据所述分析结果对所述告警进行处理。A processing module, configured to process the alarm according to the analysis result. 8.根据权利要求7所述的装置,其特征在于,所述分析模块包括以下至少之一:8. The device according to claim 7, wherein the analysis module comprises at least one of the following: 第一分析单元,用于依据产生所述告警的告警原因,分析得出所述告警是否对控制器管理的业务产生影响的所述分析结果;The first analysis unit is configured to analyze and obtain the analysis result of whether the alarm has an impact on the business managed by the controller according to the cause of the alarm that generates the alarm; 第二分析单元,用于依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系,分析得出所述告警是否对所述控制器管理的所述业务产生影响的所述分析结果。The second analysis unit is configured to analyze whether the alarm has an impact on the service managed by the controller according to the relationship between the resource generating the alarm and the resource occupied by the service managed by the controller of the analysis results. 9.根据权利要求8所述的装置,其特征在于,所述第二分析单元包括以下之一:9. The device according to claim 8, wherein the second analysis unit comprises one of the following: 第一确定子单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,并且也没有所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务不产生影响;The first determining subunit is configured to: when the root resource generating the alarm is not occupied by the service managed by the controller, and no sub-resource corresponding to the root resource is occupied by the service managed by the controller case, determining that the alarm has no impact on the service managed by the controller; 第二确定子单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响;The second determining subunit is configured to: when the root resource generating the alarm is not occupied by the service managed by the controller, but the subresource corresponding to the root resource is occupied by the service managed by the controller , determining that the alarm has an impact on the service managed by the controller; 第三确定子单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,确定所述告警对所述控制器管理的所述业务产生影响;The third determining subunit is configured to: when the root resource generating the alarm is not occupied by the service managed by the controller, and the subresource corresponding to the root resource is not occupied by the service managed by the controller, but If there is a mapping relationship between the root resource and/or the sub-resource corresponding to the root resource and the service managed by the controller, determine that the alarm has an impact on the service managed by the controller; 第四确定子单元,用于在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,确定所述告警对所述控制器管理的所述业务产生影响。The fourth determining subunit is configured to determine that the alarm has an impact on the service managed by the controller when the root resource generating the alarm is occupied by the service managed by the controller. 10.根据权利要求9所述的装置,其特征在于,所述处理模块包括以下之一:10. The device according to claim 9, wherein the processing module comprises one of the following: 第一抑制单元,用于在依据所述告警原因,分析得出所述告警对所述控制器管理的业务不产生影响的情况下,对所述告警进行抑制;A first suppression unit, configured to suppress the alarm when it is analyzed according to the cause of the alarm that the alarm has no impact on the business managed by the controller; 第二抑制单元,用于在依据产生所述告警的资源与所述控制器管理的业务所占用的资源之间的关系确定所述告警对所述控制器管理的所述业务不产生影响的情况下,对所述告警进行抑制;A second suppressing unit, configured to determine that the alarm does not affect the service managed by the controller according to the relationship between the resources that generate the alarm and the resources occupied by the service managed by the controller Next, suppress the alarm; 第一衍生单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,但所述根资源对应的子资源被所述控制器管理的业务所占用的情况下,将所述告警衍生为子资源衍生告警;The first derivation unit is configured to: when the root resource generating the alarm is not occupied by the service managed by the controller, but the sub-resource corresponding to the root resource is occupied by the service managed by the controller, deriving the alert as a sub-resource derived alert; 第二衍生单元,用于在产生所述告警的根资源不被所述控制器管理的业务所占用,所述根资源对应的子资源也不被所述控制器管理的业务所占用,但所述根资源和/或所述根资源对应的子资源与所述控制器管理的业务存在映射关系的情况下,依据所述映射关系,将所述告警衍生为关系衍生告警;The second derivation unit is configured to generate the alarm. The root resource is not occupied by the service managed by the controller, and the sub-resource corresponding to the root resource is not occupied by the service managed by the controller. If there is a mapping relationship between the root resource and/or the sub-resource corresponding to the root resource and the service managed by the controller, deriving the alarm into a relationship-derived alarm according to the mapping relationship; 第一上报单元,用于在产生所述告警的根资源被所述控制器管理的业务所占用的情况下,直接将所述告警上报给上层网络设备。The first reporting unit is configured to directly report the alarm to an upper-layer network device when the root resource generating the alarm is occupied by a service managed by the controller. 11.根据权利要求10所述的装置,其特征在于,还包括:11. The device according to claim 10, further comprising: 记录单元,用于在对所述告警进行衍生处理的情况下,记录所述告警与衍生后的衍生告警之间的关联关系,其中,所述衍生告警包括:子资源衍生告警和/或关系衍生告警;The recording unit is configured to record the association relationship between the alarm and the derived alarm after derivative processing is performed on the alarm, wherein the derived alarm includes: a sub-resource derived alarm and/or a relationship derivation alarm; 第二上报单元,用于将所述衍生告警以及所述关联关系上报给上层网络设备,和/或,提供单元,用于提供所述关联关系供所述控制器管理系统中的网元设备查询。A second reporting unit, configured to report the derived alarm and the association relationship to an upper-layer network device, and/or a providing unit, configured to provide the association relationship for query by network element devices in the controller management system . 12.根据权利要求7至11中任一项所述的装置,其特征在于,还包括:12. The device according to any one of claims 7 to 11, further comprising: 生成模块,用于依据所述告警所引起的网元设备的状态变化,或者所述告警的代表资源的对象的状态变化,生成所述网元设备或者所述代表资源的对象的状态通知;A generating module, configured to generate a status notification of the network element device or the object representing the resource according to the state change of the network element device caused by the alarm, or the state change of the object representing the resource in the alarm; 上报模块,用于将生成的所述网元设备或者所述代表资源的对象的状态通知上报给上层网络设备。A reporting module, configured to report the generated status notification of the network element device or the object representing the resource to an upper-layer network device. 13.根据权利要求7至11中任一项所述的装置,其特征在于,所述控制器包括以下至少之一:域控制器DC、超级控制器SC,其中,所述SC为所述DC的上层控制器。13. The device according to any one of claims 7 to 11, wherein the controller comprises at least one of the following: a domain controller DC, a super controller SC, wherein the SC is the DC upper controller.
CN201510088503.3A 2015-01-27 2015-02-26 Alarm processing method and device Pending CN105991332A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510042309 2015-01-27
CN2015100423091 2015-01-27

Publications (1)

Publication Number Publication Date
CN105991332A true CN105991332A (en) 2016-10-05

Family

ID=56542308

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510088503.3A Pending CN105991332A (en) 2015-01-27 2015-02-26 Alarm processing method and device

Country Status (2)

Country Link
CN (1) CN105991332A (en)
WO (1) WO2016119436A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108183810A (en) * 2016-12-08 2018-06-19 中兴通讯股份有限公司 Multiple services parallel recovery method, apparatus and system under SDN frameworks
CN109309577A (en) * 2017-07-27 2019-02-05 杭州达乎科技有限公司 Alert processing method, apparatus and system for SDN network
WO2020093959A1 (en) * 2018-11-06 2020-05-14 华为技术有限公司 Method and apparatus for diagnosing root cause
CN111865785A (en) * 2020-06-28 2020-10-30 烽火通信科技股份有限公司 SR-TP tunnel signal transmission method, device, server and storage medium
CN115941442A (en) * 2022-12-01 2023-04-07 中国联合网络通信集团有限公司 Business fault analysis method and device, electronic equipment and medium

Families Citing this family (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9781004B2 (en) 2014-10-16 2017-10-03 Cisco Technology, Inc. Discovering and grouping application endpoints in a network environment
US10826788B2 (en) 2017-04-20 2020-11-03 Cisco Technology, Inc. Assurance of quality-of-service configurations in a network
US10623264B2 (en) 2017-04-20 2020-04-14 Cisco Technology, Inc. Policy assurance for service chaining
US10560328B2 (en) 2017-04-20 2020-02-11 Cisco Technology, Inc. Static network policy analysis for networks
US10623271B2 (en) 2017-05-31 2020-04-14 Cisco Technology, Inc. Intra-priority class ordering of rules corresponding to a model of network intents
US10505816B2 (en) 2017-05-31 2019-12-10 Cisco Technology, Inc. Semantic analysis to detect shadowing of rules in a model of network intents
US10554483B2 (en) 2017-05-31 2020-02-04 Cisco Technology, Inc. Network policy analysis for networks
US10812318B2 (en) 2017-05-31 2020-10-20 Cisco Technology, Inc. Associating network policy objects with specific faults corresponding to fault localizations in large-scale network deployment
US10693738B2 (en) 2017-05-31 2020-06-23 Cisco Technology, Inc. Generating device-level logical models for a network
US20180351788A1 (en) 2017-05-31 2018-12-06 Cisco Technology, Inc. Fault localization in large-scale network policy deployment
US10581694B2 (en) 2017-05-31 2020-03-03 Cisco Technology, Inc. Generation of counter examples for network intent formal equivalence failures
US10439875B2 (en) 2017-05-31 2019-10-08 Cisco Technology, Inc. Identification of conflict rules in a network intent formal equivalence failure
US11469986B2 (en) 2017-06-16 2022-10-11 Cisco Technology, Inc. Controlled micro fault injection on a distributed appliance
US10686669B2 (en) 2017-06-16 2020-06-16 Cisco Technology, Inc. Collecting network models and node information from a network
US10498608B2 (en) 2017-06-16 2019-12-03 Cisco Technology, Inc. Topology explorer
US10904101B2 (en) 2017-06-16 2021-01-26 Cisco Technology, Inc. Shim layer for extracting and prioritizing underlying rules for modeling network intents
US11150973B2 (en) 2017-06-16 2021-10-19 Cisco Technology, Inc. Self diagnosing distributed appliance
US10547715B2 (en) 2017-06-16 2020-01-28 Cisco Technology, Inc. Event generation in response to network intent formal equivalence failures
US10587621B2 (en) 2017-06-16 2020-03-10 Cisco Technology, Inc. System and method for migrating to and maintaining a white-list network security model
US10574513B2 (en) 2017-06-16 2020-02-25 Cisco Technology, Inc. Handling controller and node failure scenarios during data collection
US11645131B2 (en) 2017-06-16 2023-05-09 Cisco Technology, Inc. Distributed fault code aggregation across application centric dimensions
US10805160B2 (en) 2017-06-19 2020-10-13 Cisco Technology, Inc. Endpoint bridge domain subnet validation
US10652102B2 (en) 2017-06-19 2020-05-12 Cisco Technology, Inc. Network node memory utilization analysis
US10673702B2 (en) 2017-06-19 2020-06-02 Cisco Technology, Inc. Validation of layer 3 using virtual routing forwarding containers in a network
US10348564B2 (en) 2017-06-19 2019-07-09 Cisco Technology, Inc. Validation of routing information base-forwarding information base equivalence in a network
US10536337B2 (en) 2017-06-19 2020-01-14 Cisco Technology, Inc. Validation of layer 2 interface and VLAN in a networked environment
US10554493B2 (en) 2017-06-19 2020-02-04 Cisco Technology, Inc. Identifying mismatches between a logical model and node implementation
US10567229B2 (en) 2017-06-19 2020-02-18 Cisco Technology, Inc. Validating endpoint configurations between nodes
US11283680B2 (en) 2017-06-19 2022-03-22 Cisco Technology, Inc. Identifying components for removal in a network configuration
US10411996B2 (en) 2017-06-19 2019-09-10 Cisco Technology, Inc. Validation of routing information in a network fabric
US10623259B2 (en) 2017-06-19 2020-04-14 Cisco Technology, Inc. Validation of layer 1 interface in a network
US10437641B2 (en) 2017-06-19 2019-10-08 Cisco Technology, Inc. On-demand processing pipeline interleaved with temporal processing pipeline
US11343150B2 (en) 2017-06-19 2022-05-24 Cisco Technology, Inc. Validation of learned routes in a network
US10700933B2 (en) 2017-06-19 2020-06-30 Cisco Technology, Inc. Validating tunnel endpoint addresses in a network fabric
US10528444B2 (en) 2017-06-19 2020-01-07 Cisco Technology, Inc. Event generation in response to validation between logical level and hardware level
US10644946B2 (en) 2017-06-19 2020-05-05 Cisco Technology, Inc. Detection of overlapping subnets in a network
US10218572B2 (en) 2017-06-19 2019-02-26 Cisco Technology, Inc. Multiprotocol border gateway protocol routing validation
US10432467B2 (en) 2017-06-19 2019-10-01 Cisco Technology, Inc. Network validation between the logical level and the hardware level of a network
US10560355B2 (en) 2017-06-19 2020-02-11 Cisco Technology, Inc. Static endpoint validation
US10567228B2 (en) 2017-06-19 2020-02-18 Cisco Technology, Inc. Validation of cross logical groups in a network
US10547509B2 (en) 2017-06-19 2020-01-28 Cisco Technology, Inc. Validation of a virtual port channel (VPC) endpoint in the network fabric
US10333787B2 (en) 2017-06-19 2019-06-25 Cisco Technology, Inc. Validation of L3OUT configuration for communications outside a network
US10505817B2 (en) 2017-06-19 2019-12-10 Cisco Technology, Inc. Automatically determining an optimal amount of time for analyzing a distributed network environment
US10812336B2 (en) 2017-06-19 2020-10-20 Cisco Technology, Inc. Validation of bridge domain-L3out association for communication outside a network
US10341184B2 (en) 2017-06-19 2019-07-02 Cisco Technology, Inc. Validation of layer 3 bridge domain subnets in in a network
US10587484B2 (en) 2017-09-12 2020-03-10 Cisco Technology, Inc. Anomaly detection and reporting in a network assurance appliance
US10587456B2 (en) 2017-09-12 2020-03-10 Cisco Technology, Inc. Event clustering for a network assurance platform
US10554477B2 (en) 2017-09-13 2020-02-04 Cisco Technology, Inc. Network assurance event aggregator
US10333833B2 (en) 2017-09-25 2019-06-25 Cisco Technology, Inc. Endpoint path assurance
CN108156019B (en) * 2017-11-29 2022-10-25 全球能源互联网研究院有限公司 An SDN-based network-derived alarm filtering system and method
US11102053B2 (en) 2017-12-05 2021-08-24 Cisco Technology, Inc. Cross-domain assurance
US10873509B2 (en) 2018-01-17 2020-12-22 Cisco Technology, Inc. Check-pointing ACI network state and re-execution from a check-pointed state
US10572495B2 (en) 2018-02-06 2020-02-25 Cisco Technology Inc. Network assurance database version compatibility
US10812315B2 (en) 2018-06-07 2020-10-20 Cisco Technology, Inc. Cross-domain network assurance
US11218508B2 (en) 2018-06-27 2022-01-04 Cisco Technology, Inc. Assurance of security rules in a network
US11044273B2 (en) 2018-06-27 2021-06-22 Cisco Technology, Inc. Assurance of security rules in a network
US10911495B2 (en) 2018-06-27 2021-02-02 Cisco Technology, Inc. Assurance of security rules in a network
US10659298B1 (en) 2018-06-27 2020-05-19 Cisco Technology, Inc. Epoch comparison for network events
US11019027B2 (en) 2018-06-27 2021-05-25 Cisco Technology, Inc. Address translation for external network appliance
US10904070B2 (en) 2018-07-11 2021-01-26 Cisco Technology, Inc. Techniques and interfaces for troubleshooting datacenter networks
US10826770B2 (en) 2018-07-26 2020-11-03 Cisco Technology, Inc. Synthesis of models for networks using automated boolean learning
US10616072B1 (en) 2018-07-27 2020-04-07 Cisco Technology, Inc. Epoch data interface
CN112073208B (en) * 2019-05-25 2022-01-14 成都华为技术有限公司 Alarm analysis method, device, chip system and storage medium
CN113055213B (en) * 2019-12-27 2024-05-28 中兴通讯股份有限公司 Alarm information management method, alarm information management system and server
CN114024828B (en) * 2021-10-15 2023-05-23 烽火通信科技股份有限公司 Platform side alarm suppression method, device and storage medium
CN113965452B (en) * 2021-11-02 2023-11-03 烽火通信科技股份有限公司 Equipment switching state acquisition method and device
CN114285726A (en) * 2021-12-27 2022-04-05 中国联合网络通信集团有限公司 Fault positioning method and device and computer storage medium
CN114389960B (en) * 2022-01-04 2023-11-28 烽火通信科技股份有限公司 Method and system for collecting and reporting network service performance
CN115396287B (en) * 2022-08-29 2023-05-12 武汉烽火技术服务有限公司 Fault analysis method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101409644A (en) * 2007-10-12 2009-04-15 中兴通讯股份有限公司 Method and system for pre-warning service quality
CN101917288A (en) * 2010-08-04 2010-12-15 中兴通讯股份有限公司 Alarm processing method and network management system
CN102571407A (en) * 2010-12-30 2012-07-11 中国移动通信集团河北有限公司 Alarm correlation analysis method and device
CN103378980A (en) * 2012-04-16 2013-10-30 中兴通讯股份有限公司 Method for correlation analysis of layer network alarm with business and device thereof
US20150006459A1 (en) * 2013-06-29 2015-01-01 Huawei Technologies Co., Ltd. Alarm Correlation Analysis Method, Apparatus and System

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100375435C (en) * 2004-06-22 2008-03-12 中兴通讯股份有限公司 Alarm correlation analysis of light synchronous transmitting net
CN1992636B (en) * 2005-12-29 2010-04-21 华为技术有限公司 System and method for processing warning information
CN101183989B (en) * 2007-12-03 2010-09-01 中兴通讯股份有限公司 Incremental analysis method of optical synchronization transmission network alarm correlation
CN103596208B (en) * 2013-11-15 2017-02-15 大唐移动通信设备有限公司 Method and system for judging fault of network element

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101409644A (en) * 2007-10-12 2009-04-15 中兴通讯股份有限公司 Method and system for pre-warning service quality
CN101917288A (en) * 2010-08-04 2010-12-15 中兴通讯股份有限公司 Alarm processing method and network management system
CN102571407A (en) * 2010-12-30 2012-07-11 中国移动通信集团河北有限公司 Alarm correlation analysis method and device
CN103378980A (en) * 2012-04-16 2013-10-30 中兴通讯股份有限公司 Method for correlation analysis of layer network alarm with business and device thereof
US20150006459A1 (en) * 2013-06-29 2015-01-01 Huawei Technologies Co., Ltd. Alarm Correlation Analysis Method, Apparatus and System

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108183810A (en) * 2016-12-08 2018-06-19 中兴通讯股份有限公司 Multiple services parallel recovery method, apparatus and system under SDN frameworks
CN108183810B (en) * 2016-12-08 2019-06-04 中兴通讯股份有限公司 Parallel recovery method, device and system for multi-service under SDN architecture
CN109309577A (en) * 2017-07-27 2019-02-05 杭州达乎科技有限公司 Alert processing method, apparatus and system for SDN network
WO2020093959A1 (en) * 2018-11-06 2020-05-14 华为技术有限公司 Method and apparatus for diagnosing root cause
CN111865785A (en) * 2020-06-28 2020-10-30 烽火通信科技股份有限公司 SR-TP tunnel signal transmission method, device, server and storage medium
CN115941442A (en) * 2022-12-01 2023-04-07 中国联合网络通信集团有限公司 Business fault analysis method and device, electronic equipment and medium
CN115941442B (en) * 2022-12-01 2025-01-28 中国联合网络通信集团有限公司 Business failure analysis method, device, electronic equipment and medium

Also Published As

Publication number Publication date
WO2016119436A1 (en) 2016-08-04

Similar Documents

Publication Publication Date Title
CN105991332A (en) Alarm processing method and device
US9483343B2 (en) System and method of visualizing historical event correlations in a data center
US9311160B2 (en) Elastic cloud networking
US9100289B2 (en) Creating searchable and global database of user visible process traces
EP3451587B1 (en) Creating searchable and global database of user visible process traces
CA2676925C (en) Distributed network management system and method
US11228507B2 (en) Baselining service-tagged data from subservices of a service for service assurance
CN104104543B (en) Server managing system and method based on SNMP and IPMI protocol
US20150169353A1 (en) System and method for managing data center services
WO2017080161A1 (en) Alarm information processing method and device in cloud computing
CN103139248B (en) Machine frame system
CN105790972B (en) A controller and method for processing alarm correlation
CN114244676A (en) Intelligent IT integrated gateway system
CN106302076A (en) Set up the method in VXLAN tunnel, system and SDN controller
CN110752959A (en) An intelligent substation process layer physical link fault location system
US20190207805A1 (en) Node fault isolation
US20140325279A1 (en) Target failure based root cause analysis of network probe failures
US20150170037A1 (en) System and method for identifying historic event root cause and impact in a data center
Lee et al. Fault localization in NFV framework
CN114666373A (en) Internet of things terminal maintenance method and related equipment
Sankari et al. Network traffic analysis of cloud data centre
CN113268891B (en) Modeling method and device of operation and maintenance system
CN108476149B (en) Operation management and maintenance system
沖田 Integrated Network Operations and Management
Al-Rubaiey et al. Failure detection in virtual network environment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20161005

WD01 Invention patent application deemed withdrawn after publication