[go: up one dir, main page]

CN100426756C - Network management system for integrative supervision and management of application software system and host resource - Google Patents

Network management system for integrative supervision and management of application software system and host resource Download PDF

Info

Publication number
CN100426756C
CN100426756C CNB2006101411891A CN200610141189A CN100426756C CN 100426756 C CN100426756 C CN 100426756C CN B2006101411891 A CNB2006101411891 A CN B2006101411891A CN 200610141189 A CN200610141189 A CN 200610141189A CN 100426756 C CN100426756 C CN 100426756C
Authority
CN
China
Prior art keywords
management subsystem
network
subsystem
host
network management
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2006101411891A
Other languages
Chinese (zh)
Other versions
CN1968148A (en
Inventor
桂润堂
曾成勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CNB2006101411891A priority Critical patent/CN100426756C/en
Publication of CN1968148A publication Critical patent/CN1968148A/en
Application granted granted Critical
Publication of CN100426756C publication Critical patent/CN100426756C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention relates to a network manage system, used in detecting software and host, wherein it comprises device access module for building projection software system and public information mode of host, while the public information mode has logic network unit layer whose logic network units are used in each part of projection software system, with inquired standard interface; physical network unit layer with physical network units for projecting the hosts of each part, with inquired standard interface, and modular service layer for projecting the logic network unit and physical network unit. And the network manage system also comprises service network subsystem for using the standard interface of logic network unit layer to manage the software system, and an IP network manage subsystem for using the standard interface of physical network unit layer to manage host. The invention also provides relative network manage method.

Description

实现应用软件系统与主机资源综合监管的网管系统和方法 Network management system and method for realizing integrated supervision of application software system and host resources

技术领域 technical field

本发明涉及通信领域,更具体而言,涉及用于实现应用软件系统与主机资源综合监管的网管系统和方法。The present invention relates to the communication field, and more specifically, relates to a network management system and method for realizing comprehensive supervision of application software systems and host resources.

背景技术 Background technique

电信级网络中分布着数量众多种类各异的硬件设备和规模不一的应用软件系统,它们在电信管理网(TMN)的网元管理层(Network Element Management Layer)被抽象为网络单元(NetworkElement,简称网元),其中各型网络设备、服务器等硬件设备通常被称为为物理网元(Physical Network Element,缩写为PNE),位于IP层次;而各种应用软件系统则被抽象为逻辑网元(Logic NetworkElement,缩写为LNE)。逻辑网元一般由若干模块(Module)组成,每个模块又包含若干运行在服务器(主机)之上的进程实体。这些模块、进程实体可部署于一个或多台服务器(主机)之上,并且多个逻辑网元之间可共享这些资源。There are a large number of different hardware devices and application software systems of different scales distributed in the carrier-class network. They are abstracted as Network Elements (Network Elements) in the Network Element Management Layer of the Telecom Management Network (TMN). Network element for short), in which various network devices, servers and other hardware devices are usually called physical network elements (Physical Network Element, abbreviated as PNE), which are located at the IP level; and various application software systems are abstracted into logical network elements (Logic NetworkElement, abbreviated as LNE). A logical network element generally consists of several modules (Module), and each module contains several process entities running on a server (host). These modules and process entities can be deployed on one or more servers (hosts), and these resources can be shared among multiple logical network elements.

图1示出了根据相关技术的一个简单的网络管理模型。图中的NE可能是物理网元,也可能是逻辑网元。Fig. 1 shows a simple network management model according to the related art. The NEs in the figure may be physical network elements or logical network elements.

在该网络管理模型中,逻辑网元的实际部署情况可划分为逻辑网元(LNE)、模块(Module)、进程实体(Process)及主机四个层次,按照从上至下的顺序,每一层都对自己的下一层存在依赖。根据依赖的传递性,最上层的逻辑网元存在对最下层的主机的依赖,而这些主机在电信管理网的网元管理层被抽象为物理网元,因此存在逻辑网元对于物理网元的依赖。图2示出了根据相关技术的逻辑网元部署的层次结构。In this network management model, the actual deployment of logical network elements can be divided into four levels: logical network element (LNE), module (Module), process entity (Process) and host. According to the sequence from top to bottom, each Each layer is dependent on its next layer. According to the transitivity of dependencies, the logical network elements at the top layer have dependencies on the hosts at the bottom layer, and these hosts are abstracted as physical network elements in the network element management layer of the telecom management network, so there is a relationship between logical network elements and physical network elements rely. Fig. 2 shows a hierarchical structure of logical network element deployment according to related technologies.

在图2中,这种依赖具体表现为,主机的运行状况及处理性能(如CPU是否过于繁忙、内存是否够用、硬盘是否有足够剩余空间等)会直接影响软件系统整体的运行状况,因此保障软件系统部署运行所在主机的可用性、可靠性和性能,是保障软件系统正常运行的基础之一,所以有必要把对主机、进程及应用软件系统的监控管理结合起来。In Figure 2, this kind of dependence is embodied in that the operating status and processing performance of the host (such as whether the CPU is too busy, whether the memory is sufficient, whether the hard disk has enough free space, etc.) will directly affect the overall operating status of the software system, so Ensuring the availability, reliability, and performance of the host where the software system is deployed and running is one of the foundations for ensuring the normal operation of the software system, so it is necessary to combine the monitoring and management of the host, process, and application software system.

对于众多网络管理系统(Network Management System,以下简称NMS)的具体实现而言,一般是将主机及进程的管理纳入IP网管,而软件系统及其模块的管理则纳入与软件系统产品厂商紧密相关的业务网管,由这两类不同的网管系统分别进行监控管理。For the specific implementation of many Network Management Systems (NMS), the management of hosts and processes is generally included in the IP network management, while the management of software systems and their modules is included in the closely related software system product manufacturers. Service network management is monitored and managed by these two different network management systems.

业界对于主机本身的监管已经形成了一些较为成熟的工业标准及应用解决方案。根据RFC2790标准,定义了统一、公共的主机资源管理模型,确定了CPU、网络设备(网卡)、打印机、磁盘(硬盘、软盘、光盘等)、键盘、Modem、端口等硬件资源的管理接口;RFC1514标准则提供了存储实体(内存、虚拟内存、磁盘等等)的利用率的管理接口;各类型的主机一般都实现了以上标准中定义的内容,网管系统的实现者可以以较小的代价来实现主机监管功能。The industry has formed some relatively mature industrial standards and application solutions for the supervision of the host itself. According to the RFC2790 standard, a unified and public host resource management model is defined, and the management interface of hardware resources such as CPU, network device (network card), printer, disk (hard disk, floppy disk, CD, etc.), keyboard, Modem, port, etc. is determined; RFC1514 The standard provides a management interface for the utilization rate of storage entities (memory, virtual memory, disk, etc.); all types of hosts generally implement the content defined in the above standards, and implementers of the network management system can implement them at a relatively small cost. Realize the host supervision function.

下面将结合附图和实施例来说明相关技术中对应用软件系统和主机资源进行监管的网管解决方案。A network management solution for supervising application software systems and host resources in the related art will be described below with reference to the drawings and embodiments.

图3示出了根据相关技术提供的一种网管系统模块划分方案。在该方案中,网管系统模块划分如下:Fig. 3 shows a network management system module division scheme provided according to related technologies. In this solution, the network management system modules are divided as follows:

软件系统、进程与主机由集成在一个大网管系统中的业务网管与IP网管子系统分别进行管理,保持一致的界面特性及用户操作习惯,共享相同的基础功能子系统(如告警、性能、数据采集等子系统),仅满足最简单的界面集成。业务网管与IP网管子系统之间没有业务逻辑的关联性。The software system, process, and host are managed separately by the service network management and IP network management subsystems integrated in a large network management system, maintaining consistent interface features and user operating habits, and sharing the same basic functional subsystems (such as alarm, performance, data, etc.) collection and other subsystems), only the simplest interface integration is satisfied. There is no business logic correlation between the service network management and the IP network management subsystem.

然而,由于软件系统与主机等设备分别由不同的网管子系统进行监控管理,导致这两个子系统间不共享相关的管理信息,功能也不互通,所以丢失了软件系统与主机之间的依赖信息,主机运行状态和性能对于软件系统的影响,系统本身无法直接表达出来,只能通过人工去发现及判别这种影响。However, since the software system and the host computer are monitored and managed by different network management subsystems, the two subsystems do not share relevant management information and their functions do not communicate with each other, so the dependency information between the software system and the host computer is lost. , The impact of the host's operating status and performance on the software system cannot be directly expressed by the system itself, and can only be discovered and identified manually.

另外,由于这两个网管子系统实际上可能由不同的人员负责维护,所以导致这种影响无法被及时有效地发现出来,从而实际上无法有效缩短设备维护周期,及降低维护成本。In addition, since the two network management subsystems may actually be maintained by different personnel, this effect cannot be detected in a timely and effective manner, so that it is actually impossible to effectively shorten the equipment maintenance cycle and reduce maintenance costs.

一个比较有代表性的例子:例如,某主机因为磁盘剩余空间不足而产生告警,但是负责软件系统的维护人员并不知道该告警的产生,直至磁盘完全占满而导致软件系统运行异常而产生告警,所以,维护人员在该软件系统告警产生之前无法预警,在告警产生之后也无法迅速定位其根源。A more representative example: For example, a host generates an alarm because the remaining disk space is insufficient, but the maintenance personnel in charge of the software system do not know that the alarm is generated until the disk is completely full, causing the software system to run abnormally and generate an alarm , Therefore, the maintenance personnel cannot give an early warning before the software system alarm occurs, and cannot quickly locate the root cause of the alarm after the alarm is generated.

相关技术中还提供了另外一种网管解决方案。Another network management solution is also provided in the related art.

根据该方案,针对部署于软件系统中的软件系统代理(Agent)模块进行定制开发,使之能够收集、存储所在主机的CPU占用率、内存使用率、磁盘使用情况等主机监控信息,并提供相应接口,使得网管侧在接入管理目标应用软件及业务模块的同时,也能够访问这些信息;同时,软件系统预定义一些主机相关的告警,软件系统代理收集并上报这些告警;业务网管子系统也针对主机监控接口进行开发,提供主机监控的管理界面。也就是说,业务和硬件资源节点的关联性由网元侧实现。According to this scheme, the software system agent (Agent) module deployed in the software system is customized and developed, so that it can collect and store the host monitoring information such as the CPU usage rate, memory usage rate, and disk usage of the host, and provide corresponding interface, so that the network management side can access the information while accessing the management target application software and business modules; at the same time, the software system pre-defines some host-related alarms, and the software system agent collects and reports these alarms; the business network management subsystem also Develop for the host monitoring interface and provide a management interface for host monitoring. That is to say, the association between services and hardware resource nodes is realized by the network element side.

然而,该解决方案主要存在如下缺点:However, this solution mainly has the following disadvantages:

(1)无法利用已经标准化且相对成熟的主机管理接口,受管软件系统需要自行实现主机信息的采集、存储及上报功能,增加了应用软件及业务模块的复杂度及开发难度;(1) The standardized and relatively mature host management interface cannot be used, and the managed software system needs to realize the collection, storage and reporting of host information by itself, which increases the complexity and development difficulty of application software and business modules;

(2)缺乏相应的行业标准,各受管软件系统提供的主机监管功能不一致,接口不兼容;(2) There is a lack of corresponding industry standards, the host monitoring functions provided by each managed software system are inconsistent, and the interfaces are not compatible;

(3)网管系统开发者无法提供一个一致的主机监管模型,不得不为各种不同的受管软件系统定制相应的主机监管模型及界面,导致接入管理成本较高。(3) The network management system developer cannot provide a consistent host monitoring model, and has to customize corresponding host monitoring models and interfaces for various managed software systems, resulting in high access management costs.

由以上描述可知,上述相关技术的解决方案都存在着各自的问题,因此,人们需要一种新的网管解决方案,能够解决上述相关技术中的问题。It can be seen from the above description that the solutions of the above-mentioned related technologies have their own problems. Therefore, people need a new network management solution that can solve the problems in the above-mentioned related technologies.

发明内容 Contents of the invention

本发明旨在提供一种能够综合监管应用软件系统及其部署所在主机的解决方案,该方案解决了上述相关技术中存在的业务软件系统及业务模块的复杂度及开发难度大等问题。The present invention aims to provide a solution capable of comprehensively supervising the application software system and the host where it is deployed. This solution solves the problems of the complexity and development difficulty of the business software system and business modules in the above-mentioned related technologies.

根据本发明的一个方面,提供了一种网管系统,用于综合监管软件系统及其所部署主机,包括:设备接入模块,用于创建映射软件系统及其所部署主机的公共信息模型,公共信息模型包括:逻辑网元层,其所包括的各个逻辑网元用于映射软件系统的各个部分,具有可供查询的标准接口;物理网元层,其所包括的各个物理网元用于映射各个部分所部署的主机,具有可供查询的标准接口;以及模块服务层,用于映射逻辑网元与物理网元之间的对应关系;网管系统还包括:业务网管子系统,用于通过逻辑网元层的标准接口来监管软件系统;以及IP网管子系统,用于通过物理网元层的标准接口来监管主机。According to one aspect of the present invention, a network management system is provided, which is used for comprehensive supervision of the software system and its deployed host, including: a device access module, used to create a public information model of the mapping software system and its deployed host, public The information model includes: logical network element layer, each logical network element included in it is used to map various parts of the software system, and has a standard interface available for query; physical network element layer, each physical network element included in it is used for mapping The hosts deployed in each part have standard interfaces that can be queried; and the module service layer is used to map the corresponding relationship between logical network elements and physical network elements; the network management system also includes: business network management The standard interface of the network element layer is used to supervise the software system; and the IP network management subsystem is used to supervise the host computer through the standard interface of the physical network element layer.

在上述的网管系统中,设备接入模块,用于通过记录逻辑网元所部署主机的IP地址,并根据IP地址定位物理网元,在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。In the above-mentioned network management system, the device access module is used to record the IP address of the host where the logical network element is deployed, and locate the physical network element according to the IP address, create an association between the logical network element and the physical network element, and associate the Information is stored in the module service layer.

在上述的网管系统中,设备接入模块,用于通过主机IP地址查找与之相匹配的逻辑网元,并在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。In the above-mentioned network management system, the device access module is used to find the matching logical network element through the host IP address, create an association between the logical network element and the physical network element, and store the association information in the module service layer middle.

在上述的网管系统中,业务网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、计费管理子系统、业务管理子系统、告警子系统、数据采集子系统;IP网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、告警子系统、数据采集子系统。In the above network management system, the service network management subsystem includes at least one of the following: configuration management subsystem, security management subsystem, fault management subsystem, performance management subsystem, billing management subsystem, service management subsystem, alarm subsystem System, data acquisition subsystem; IP network management subsystem includes at least one of the following: configuration management subsystem, safety management subsystem, fault management subsystem, performance management subsystem, alarm subsystem, data acquisition subsystem.

在上述的网管系统中,业务网管子系统和IP网管子系统均包括告警、性能管理、配置管理子系统,告警、性能管理、配置管理子系统都包含预处理模块,这些预处理模块用于查询存储在公共信息模型中的逻辑网元与物理网元的关联信息,使得子系统能够识别逻辑网元与物理网元之间的关系。In the above-mentioned network management system, both the service network management subsystem and the IP network management subsystem include alarm, performance management, and configuration management subsystems, and the alarm, performance management, and configuration management subsystems all include preprocessing modules, which are used for query The association information of logical network elements and physical network elements stored in the public information model enables the subsystem to identify the relationship between logical network elements and physical network elements.

在上述的网管系统中,在主机发生硬件故障并发送告警后,IP网管子系统用于接收并处理该告警,IP网管子系统中的告警子系统的预处理模块根据存储在公共信息模型中的物理网元与逻辑网元的关联信息来查询哪些逻辑网元因此受到影响,并通知业务网管子系统进行相应处理,从而在业务网管子系统侧能够观察到已经发生在主机上的告警。In the above-mentioned network management system, after the host computer has a hardware failure and sends an alarm, the IP network management subsystem is used to receive and process the alarm, and the preprocessing module of the alarm subsystem in the IP network management subsystem The association information between physical network elements and logical network elements is used to query which logical network elements are affected, and notify the service network management subsystem to handle accordingly, so that the alarms that have occurred on the host can be observed on the service network management subsystem side.

在上述的网管系统中,主机上发生的告警如果可能导致软件系统出现运行异常,业务网管子系统会发出预警,并通知业务维护人员采取措施。In the above-mentioned network management system, if an alarm on the host may cause abnormal operation of the software system, the service network management subsystem will issue an early warning and notify the service maintenance personnel to take measures.

在上述的网管系统中,在逻辑网元上创建监控物理网元运行状况的监视任务或故障检测任务后,IP网管子系统中的性能管理子系统的预处理模块用于将相关参数发送给IP网管子系统,IP网管子系统用于根据这些参数采集物理网元的状态数据,并将已采集结果实时送到状态监控界面。In the above-mentioned network management system, after the monitoring task or fault detection task of monitoring the operation status of the physical network element is created on the logical network element, the preprocessing module of the performance management subsystem in the IP network management subsystem is used to send relevant parameters to the IP The network management subsystem, the IP network management subsystem is used to collect the status data of the physical network elements according to these parameters, and send the collected results to the status monitoring interface in real time.

在上述的网管系统中,业务网管子系统和IP网管子系统是基于管理领域划分,二者区分为业务拓扑视图与主机拓扑视图。In the above network management system, the service network management subsystem and the IP network management subsystem are divided based on the management field, and they are divided into a service topology view and a host topology view.

根据本发明的另一方面,提供了一种网管方法,用于综合监管软件系统及其所部署主机,包括以下步骤:创建映射软件系统及其所部署主机的公共信息模型,公共信息模型包括:逻辑网元层,其所包括的各个逻辑网元用于映射软件系统的各个部分,具有可供查询的标准接口;物理网元层,其所包括的各个物理网元用于映射各个部分所部署的主机,具有可供查询的标准接口;以及模块服务层,用于映射逻辑网元与物理网元之间的对应关系;业务网管子系统通过逻辑网元层的标准接口来监管软件系统;以及IP网管子系统通过物理网元层的标准接口来监管主机。According to another aspect of the present invention, a network management method is provided for comprehensive supervision of a software system and its deployed host, comprising the following steps: creating a public information model that maps the software system and its deployed host, the public information model includes: Logical network element layer, each logical network element it includes is used to map each part of the software system, and has a standard interface available for query; physical network element layer, each physical network element it includes is used to map each part deployed The host has a standard interface available for query; and the module service layer is used to map the corresponding relationship between logical network elements and physical network elements; the service network management subsystem supervises the software system through the standard interface of the logical network element layer; and The IP network management subsystem supervises the host through the standard interface of the physical network element layer.

在上述的网管方法中,创建公共信息模型中的物理网元层包括以下步骤:记录逻辑网元所部署主机的IP地址,并根据IP地址映射物理网元,然后在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。In the above-mentioned network management method, creating the physical network element layer in the public information model includes the following steps: record the IP address of the host where the logical network element is deployed, and map the physical network element according to the IP address, and then connect the logical network element and the physical network element Create an association between them, and store the association information in the module service layer.

在上述的网管方法中,创建公共信息模型中的逻辑网元层包括以下步骤:通过主机IP地址查找与之相匹配的逻辑网元,并在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。In the above-mentioned network management method, creating the logical network element layer in the public information model includes the following steps: searching for a matching logical network element through the host IP address, and creating an association between the logical network element and the physical network element, and Association information is stored in the module service layer.

在上述的网管方法中,业务网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、计费管理子系统、业务管理子系统、告警子系统、数据采集子系统;IP网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、告警子系统、数据采集子系统。In the above network management method, the service network management subsystem includes at least one of the following: configuration management subsystem, safety management subsystem, fault management subsystem, performance management subsystem, billing management subsystem, service management subsystem, alarm subsystem System, data acquisition subsystem; IP network management subsystem includes at least one of the following: configuration management subsystem, safety management subsystem, fault management subsystem, performance management subsystem, alarm subsystem, data acquisition subsystem.

在上述的网管方法中,业务网管子系统和IP网管子系统均包括告警、性能管理、配置管理子系统,告警、性能管理、配置管理子系统都包含预处理模块,监管主机包括以下步骤:这些预处理模块查询存储在公共信息模型中的逻辑网元与物理网元的关联信息,使得子系统能够识别逻辑网元与物理网元之间的关系。In the above-mentioned network management method, the service network management subsystem and the IP network management subsystem both include alarm, performance management, and configuration management subsystems, and the alarm, performance management, and configuration management subsystems all include preprocessing modules, and the supervisory host includes the following steps: The preprocessing module queries the association information of the logical network element and the physical network element stored in the public information model, so that the subsystem can identify the relationship between the logical network element and the physical network element.

在上述的网管方法中,监管主机包括以下步骤:在主机发生硬件故障并发送告警后,IP网管子系统接收并处理该告警,IP网管子系统中的告警子系统的预处理模块根据存储在公共信息模型中的物理网元与逻辑网元的关联信息来查询哪些逻辑网元因此受到影响,并通知业务网管子系统进行相应处理,从而在业务网管子系统侧能够观察到已经发生在主机上的告警。In the above-mentioned network management method, supervising the host includes the following steps: after a hardware failure occurs in the host and sends an alarm, the IP network management subsystem receives and processes the alarm, and the preprocessing module of the alarm subsystem in the IP network management subsystem is based on the data stored in the public In the information model, use the correlation information between physical network elements and logical network elements to query which logical network elements are affected, and notify the service network management subsystem to deal with it accordingly, so that the service network management subsystem can observe the events that have occurred on the host alarm.

在上述的网管方法中,监管主机包括以下步骤:主机上发生的告警如果可能导致软件系统出现运行异常,业务网管子系统会发出预警,并通知业务维护人员采取措施。In the above-mentioned network management method, supervising the host includes the following steps: if an alarm on the host may cause abnormal operation of the software system, the service network management subsystem will issue an early warning and notify the service maintenance personnel to take measures.

在上述的网管方法中,监管主机包括以下步骤:在逻辑网元上创建监控物理网元运行状况的监视任务或故障检测任务后,IP网管子系统中的性能管理子系统的预处理模块将相关参数发送给IP网管子系统,IP网管子系统根据这些参数采集物理网元的状态数据,并将已采集结果实时送到状态监控界面。In the above-mentioned network management method, the supervisory host includes the following steps: After creating a monitoring task or a fault detection task for monitoring the health of the physical network element on the logical network element, the preprocessing module of the performance management subsystem in the IP network management subsystem will The parameters are sent to the IP network management subsystem, and the IP network management subsystem collects the status data of the physical network elements according to these parameters, and sends the collected results to the status monitoring interface in real time.

在上述的网管方法中,业务网管子系统和IP网管子系统是基于管理领域划分,二者区分为业务拓扑视图与主机拓扑视图。In the above-mentioned network management method, the service network management subsystem and the IP network management subsystem are divided based on the management field, and the two are divided into a service topology view and a host topology view.

从以上的描述中,可以看出,本发明充分利用主机监管的标准接口,不需要应用软件系统进行大规模适配性的开发,网管系统实现较为简单。From the above description, it can be seen that the present invention makes full use of the standard interface supervised by the host, and does not require large-scale adaptability development of the application software system, and the implementation of the network management system is relatively simple.

具体来说,本发明实现了如下技术效果:Specifically, the present invention achieves the following technical effects:

通过在业务网管子系统中集成主机监管的功能以实现应用软件系统及其部署所在主机的综合监管后,可以有效降低因硬件故障导致的软件系统故障的产生率(通过及时预警),并提升该类故障的排查效率,降低应用软件系统的维护成本,降低网管系统及应用软件系统提供商的总体开发成本。By integrating the host supervision function in the service network management subsystem to realize the comprehensive supervision of the application software system and the host where it is deployed, the occurrence rate of software system failures caused by hardware failures can be effectively reduced (by timely early warning), and the system can be improved. Improve the troubleshooting efficiency of such faults, reduce the maintenance cost of the application software system, and reduce the overall development cost of the network management system and application software system providers.

网管系统不区分业务网管与IP网管子系统,仅区分不同的拓扑视图(业务拓扑视图与主机拓扑视图),在完成应用软件系统接入后同时提供这些不同类型的视图,并根据软件模块部署的IP地址自动执行对于相关主机的监控。The network management system does not distinguish between service network management and IP network management subsystems, but only distinguishes different topology views (service topology view and host topology view), and provides these different types of views at the same time after the application software system is connected. IP addresses automatically perform monitoring for associated hosts.

本发明的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在所写的说明书、权利要求书、以及附图中所特别指出的结构来实现和获得。Additional features and advantages of the invention will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

附图说明 Description of drawings

此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The accompanying drawings described here are used to provide a further understanding of the present invention and constitute a part of the application. The schematic embodiments of the present invention and their descriptions are used to explain the present invention and do not constitute improper limitations to the present invention. In the attached picture:

图1示出了根据相关技术的一个简单的网络管理模型;Fig. 1 shows a simple network management model according to the related art;

图2示出了根据相关技术的逻辑网元部署的层次结构;FIG. 2 shows a hierarchical structure of logical network element deployment according to related technologies;

图3示出了根据相关技术提供的一种网管系统模块划分方案;Fig. 3 shows a kind of network management system module division scheme provided according to the related art;

图4示出了根据本发明实施例的网管系统的方框图;Fig. 4 shows the block diagram of the network management system according to the embodiment of the present invention;

图5示出了根据本发明的一个实施例的网元管理信息模型;FIG. 5 shows a network element management information model according to an embodiment of the present invention;

图6示出了根据本发明的一个实施例的网元管理信息模型;FIG. 6 shows a network element management information model according to an embodiment of the present invention;

图7示出了根据本发明的一个实施例的网管系统;以及Figure 7 shows a network management system according to an embodiment of the present invention; and

图8示出了根据本发明实施例的网管方法的流程图。Fig. 8 shows a flowchart of a network management method according to an embodiment of the present invention.

具体实施方式 Detailed ways

下面将参考附图并结合实施例,来详细说明本发明。The present invention will be described in detail below with reference to the accompanying drawings and in combination with embodiments.

本发明提出,应用软件系统、主机仍然由集成在一个大网管系统中的业务网管与IP网管子系统分别进行管理,二者保持一致的界面特性及用户操作习惯,共享相同的基础功能模块(如告警、性能、数据采集等模块),以数据集成为核心,同时满足界面集成与功能集成。应用软件系统与主机管理信息模型之间的数据集成基于IP地址实现。The present invention proposes that the application software system and the host are still managed separately by the service network management and IP network management subsystems integrated in a large network management system, and the two maintain consistent interface characteristics and user operating habits, and share the same basic function modules (such as Alarm, performance, data acquisition and other modules), with data integration as the core, while satisfying interface integration and function integration. The data integration between the application software system and the host management information model is realized based on the IP address.

在原有系统中增加一个专门的设备接入子系统,建立公共的信息模型,并在其他子系统增加一些预处理模块,从而将主机和应用在模型和各个关联功能上完整地关联起来。Add a dedicated equipment access subsystem to the original system, establish a public information model, and add some preprocessing modules to other subsystems, so as to completely associate the host computer and the application in the model and each associated function.

图4示出了根据本发明实施例的网管系统的方框图。如图4所示,根据本发明实施例的网管系统400包括:Fig. 4 shows a block diagram of a network management system according to an embodiment of the present invention. As shown in FIG. 4, a network management system 400 according to an embodiment of the present invention includes:

设备接入模块406,用于创建映射软件系统及其所部署主机的公共信息模型,公共信息模型包括:逻辑网元层,其所包括的各个逻辑网元用于映射软件系统的各个部分,具有可供查询的标准接口;物理网元层,其所包括的各个物理网元用于映射各个部分所部署的主机,具有可供查询的标准接口;以及模块服务层,用于映射逻辑网元与物理网元之间的对应关系;The device access module 406 is used to create a public information model that maps the software system and its deployed host. The public information model includes: a logical network element layer, and each logical network element included in it is used to map various parts of the software system, with The standard interface available for query; the physical network element layer, each physical network element included in it is used to map the hosts deployed in each part, and has a standard interface available for query; and the module service layer, used for mapping logical network elements and Correspondence between physical network elements;

业务网管子系统404,用于通过逻辑网元层的标准接口来监管软件系统;以及The service network management subsystem 404 is used to supervise the software system through the standard interface of the logical network element layer; and

IP网管子系统402,用于通过物理网元层的标准接口来监管主机。The IP network management subsystem 402 is used for supervising the host through the standard interface of the physical network element layer.

可选地,设备接入模块406用于通过记录逻辑网元所部署主机的IP地址,并根据IP地址定位物理网元,在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。Optionally, the device access module 406 is configured to record the IP address of the host where the logical network element is deployed, locate the physical network element according to the IP address, create an association between the logical network element and the physical network element, and store the association information in In the module service layer.

可选地,设备接入模块406用于通过主机IP地址查找与之相匹配的逻辑网元,并在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。Optionally, the device access module 406 is configured to search for a matching logical network element through the host IP address, create an association between the logical network element and the physical network element, and store the association information in the module service layer.

可选地,业务网管子系统404包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、计费管理子系统、业务管理子系统、告警子系统、数据采集子系统;IP网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、告警子系统、数据采集子系统。Optionally, the service network management subsystem 404 includes at least one of the following: configuration management subsystem, security management subsystem, fault management subsystem, performance management subsystem, billing management subsystem, service management subsystem, alarm subsystem, Data acquisition subsystem; the IP network management subsystem includes at least one of the following: configuration management subsystem, safety management subsystem, fault management subsystem, performance management subsystem, alarm subsystem, and data acquisition subsystem.

可选地,业务网管子系统404和IP网管子系统402均包括告警、性能管理、配置管理子系统,告警、性能管理、配置管理子系统都包含预处理模块,这些预处理模块用于查询存储在公共信息模型中的逻辑网元与物理网元关联信息,使得子系统能够识别逻辑网元与物理网元之间的关系。Optionally, both the service network management subsystem 404 and the IP network management subsystem 402 include alarm, performance management, and configuration management subsystems, and the alarm, performance management, and configuration management subsystems all include preprocessing modules, and these preprocessing modules are used for querying and storing The association information between logical network elements and physical network elements in the public information model enables the subsystem to identify the relationship between logical network elements and physical network elements.

可选地,在主机发生硬件故障并发送告警后,IP网管子系统402用于接收并处理该告警,IP网管子系统402中的告警子系统的预处理模块根据存储在公共信息模型中的物理网元与逻辑网元的关联信息来查询哪些逻辑网元因此受到影响,并通知业务网管子系统进行相应处理,从而在业务网管子系统侧能够观察到已经发生在主机上的告警。Optionally, after a hardware fault occurs in the host and sends an alarm, the IP network management subsystem 402 is used to receive and process the alarm, and the preprocessing module of the alarm subsystem in the IP network management subsystem 402 is based on the physical The associated information between network elements and logical network elements is used to query which logical network elements are affected, and notify the service network management subsystem to handle accordingly, so that the alarms that have occurred on the host can be observed on the service network management subsystem side.

可选地,主机上发生的告警如果可能导致软件系统出现运行异常,业务网管子系统404会发出预警,并通知业务维护人员采取措施。Optionally, if an alarm on the host may cause abnormal operation of the software system, the service network management subsystem 404 will issue an early warning and notify service maintenance personnel to take measures.

可选地,在逻辑网元上创建监控物理网元运行状况的监视任务或故障检测任务后,IP网管子系统中的性能管理子系统的预处理模块用于将相关参数发送给IP网管子系统402,IP网管子系统402用于根据这些参数采集物理网元的状态数据,并将已采集结果实时送到状态监控界面。Optionally, after creating a monitoring task or a fault detection task for monitoring the operating status of a physical network element on a logical network element, the preprocessing module of the performance management subsystem in the IP network management subsystem is used to send relevant parameters to the IP network management subsystem 402. The IP network management subsystem 402 is used to collect status data of physical network elements according to these parameters, and send the collected results to the status monitoring interface in real time.

可选地,业务网管子系统404和IP网管子系统402是基于管理领域划分,二者区分为业务拓扑视图与主机拓扑视图。Optionally, the service network management subsystem 404 and the IP network management subsystem 402 are divided based on the management field, and they are divided into a service topology view and a host topology view.

图8示出了根据本发明实施例的网管方法的流程图,其包括以下步骤:Fig. 8 shows the flowchart of the network management method according to the embodiment of the present invention, and it comprises the following steps:

步骤S802,创建映射软件系统及其所部署主机的公共信息模型,公共信息模型包括:逻辑网元层,其所包括的各个逻辑网元用于映射软件系统的各个部分,具有可供查询的标准接口;物理网元层,其所包括的各个物理网元用于映射各个部分所部署的主机,具有可供查询的标准接口;以及模块服务层,用于映射逻辑网元与物理网元之间的对应关系;Step S802, creating a public information model for mapping the software system and its deployed hosts. The public information model includes: a logical network element layer, each logical network element included in it is used to map various parts of the software system, and has standards that can be queried Interface; physical network element layer, each physical network element included is used to map the hosts deployed in each part, with standard interfaces available for query; and module service layer, used to map between logical network elements and physical network elements corresponding relationship;

步骤S804,业务网管子系统通过逻辑网元层的标准接口来监管软件系统;以及Step S804, the service network management subsystem monitors the software system through the standard interface of the logical network element layer; and

步骤S806,IP网管子系统通过物理网元层的标准接口来监管主机。Step S806, the IP network management subsystem supervises the host through the standard interface of the physical network element layer.

可选地,创建公共信息模型中的物理网元层包括以下步骤:记录逻辑网元所部署主机的IP地址,并根据IP地址映射物理网元,然后在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。Optionally, creating the physical network element layer in the common information model includes the following steps: record the IP address of the host where the logical network element is deployed, and map the physical network element according to the IP address, and then create a network element between the logical network element and the physical network element. Association, store association information in the module service layer.

可选地,创建公共信息模型中的逻辑网元层包括以下步骤:通过主机IP地址查找与之相匹配的逻辑网元,并在逻辑网元与物理网元之间创建关联,将关联信息存储在模块服务层中。Optionally, creating the logical network element layer in the public information model includes the following steps: searching for a matching logical network element through the host IP address, creating an association between the logical network element and the physical network element, and storing the association information in the module service layer.

可选地,业务网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、计费管理子系统、业务管理子系统、告警子系统、数据采集子系统;IP网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、告警子系统、数据采集子系统。Optionally, the service network management subsystem includes at least one of the following: configuration management subsystem, security management subsystem, fault management subsystem, performance management subsystem, billing management subsystem, service management subsystem, alarm subsystem, data The acquisition subsystem; the IP network management subsystem includes at least one of the following: configuration management subsystem, safety management subsystem, fault management subsystem, performance management subsystem, alarm subsystem, and data acquisition subsystem.

可选地,业务网管子系统和IP网管子系统均包括告警、性能管理、配置管理子系统,告警、性能管理、配置管理子系统都包含预处理模块,监管主机包括以下步骤:这些预处理模块查询存储在公共信息模型中的逻辑网元与物理网元关联信息,使得子系统能够识别逻辑网元与物理网元之间的关系。Optionally, both the service network management subsystem and the IP network management subsystem include alarm, performance management, and configuration management subsystems, and the alarm, performance management, and configuration management subsystems all include preprocessing modules, and the supervisory host includes the following steps: These preprocessing modules Query the association information between logical network elements and physical network elements stored in the public information model, so that the subsystem can identify the relationship between logical network elements and physical network elements.

可选地,监管主机包括以下步骤:在主机发生硬件故障并发送告警后,IP网管子系统接收并处理该告警,IP网管子系统中的告警子系统的预处理模块根据存储在公共信息模型中的物理网元与逻辑网元的关联信息来查询哪些逻辑网元因此受到影响,并通知业务网管子系统进行相应处理,从而在业务网管子系统侧能够观察到已经发生在主机上的告警。Optionally, supervising the host computer includes the following steps: after a hardware failure occurs in the host computer and an alarm is sent, the IP network management subsystem receives and processes the alarm, and the preprocessing module of the alarm subsystem in the IP network management subsystem is stored in the public information model according to The associated information between physical network elements and logical network elements can be used to query which logical network elements are affected, and notify the service network management subsystem to handle accordingly, so that the alarms that have occurred on the host can be observed on the service network management subsystem side.

可选地,监管主机包括以下步骤:主机上发生的告警如果可能导致软件系统出现运行异常,业务网管子系统会发出预警,并通知业务维护人员采取措施。Optionally, supervising the host includes the following steps: if an alarm on the host may cause abnormal operation of the software system, the service network management subsystem will issue an early warning and notify service maintenance personnel to take measures.

可选地,监管主机包括以下步骤:在逻辑网元上创建监控物理网元运行状况的监视任务或故障检测任务后,IP网管子系统中的性能管理子系统的预处理模块将相关参数发送给IP网管子系统,IP网管子系统根据这些参数采集物理网元的状态数据,并将已采集结果实时送到状态监控界面。Optionally, the supervisory host includes the following steps: After creating a monitoring task or a fault detection task for monitoring the health of the physical network element on the logical network element, the preprocessing module of the performance management subsystem in the IP network management subsystem sends relevant parameters to IP network management subsystem, the IP network management subsystem collects status data of physical network elements according to these parameters, and sends the collected results to the status monitoring interface in real time.

可选地,业务网管子系统和IP网管子系统是基于管理领域划分,二者区分为业务拓扑视图与主机拓扑视图。Optionally, the service network management subsystem and the IP network management subsystem are divided based on the management field, and the two are divided into a service topology view and a host topology view.

下面对相应的网元管理信息模型进行详细描述。The corresponding network element management information model is described in detail below.

图5示出了根据本发明的一个实施例的网元管理信息模型。Fig. 5 shows a network element management information model according to an embodiment of the present invention.

如图5所示,网元管理信息模型分为四层:逻辑网元(LNE)/模块(Module)/服务(Service)/主机(Host);其中LNE是应用软件系统的映射,Service是进程实体的映射;Module对象存储有包含该对象的网元对象标识(NEID),每个Service对象都存储有包含该对象的模块标识(ModuleID),同时Service对象也存储有该服务运行所在的主机IP地址。Host对象是主机资源的映射。As shown in Figure 5, the network element management information model is divided into four layers: logical network element (LNE)/module (Module)/service (Service)/host (Host); where LNE is the mapping of the application software system, and Service is the process Mapping of entities; the Module object stores the network element object identifier (NEID) containing the object, each Service object stores the module identifier (ModuleID) containing the object, and the Service object also stores the host IP where the service runs address. The Host object is a map of host resources.

图6示出了根据本发明的一个实施例的网元管理信息模型。Fig. 6 shows a network element management information model according to an embodiment of the present invention.

对于部分结构不太复杂的应用软件系统而言,每个模块就是单独的一个进程实体。因此根据本发明的另一个实施例,如图6所示,网元管理信息模型也可以进行一定简化:Service层合并到Module层,则Module对象既存储有包含该对象的网元对象标识(NEID),也存储了该Module运行所在主机的IP地址。For some application software systems with less complex structures, each module is a separate process entity. Therefore, according to another embodiment of the present invention, as shown in FIG. 6, the network element management information model can also be simplified to a certain extent: the Service layer is merged into the Module layer, and the Module object stores the network element object identifier (NEID) containing the object. ), also stores the IP address of the host where the Module runs.

图7示出了根据本发明的一个实施例的网管系统。在该实施例中,应用软件系统及主机综合监管的基本流程如下:Fig. 7 shows a network management system according to an embodiment of the present invention. In this embodiment, the basic process of comprehensive supervision of the application software system and the host is as follows:

A、应用软件系统的代理提供应用软件系统各部分的部署信息(包括所在主机的IP地址);A. The agent of the application software system provides the deployment information of each part of the application software system (including the IP address of the host);

B、对于主机的监管基于业界的工业标准接口进行;B. The supervision of the host is based on the industry standard interface of the industry;

C、在将应用软件系统接入业务网管子系统进行管理时,设备接入子系统从代理处查询应用软件系统各部分的部署信息并存储到公共模型中;C. When the application software system is connected to the service network management subsystem for management, the device access subsystem queries the deployment information of each part of the application software system from the agent and stores it in the public model;

D、设备接入模块如果发现该应用软件系统部署所在的主机已经由IP网管接入,则在应用软件系统与主机抽象映射后的逻辑网元与物理网元间创建关系,标识逻辑网元对于物理网元的依赖,后续流程中各功能模块都会查询及使用这些关系进行一些关联操作;D. If the device access module finds that the host where the application software system is deployed has been accessed by the IP network management, it will create a relationship between the logical network element and the physical network element after the abstract mapping between the application software system and the host computer, and identify the logical network element for Dependence on physical network elements, each functional module in the subsequent process will query and use these relationships to perform some associated operations;

E、主机发生硬件故障并发送告警后,IP网管子系统接收并处理该告警,由预处理模块通知业务网管子系统,某软件系统部署所在的某主机发生故障,业务网管子系统接收该通知后重置目标软件系统对应的逻辑网元的告警状态;E. After the host has a hardware failure and sends an alarm, the IP network management subsystem receives and processes the alarm, and the preprocessing module notifies the service network management subsystem. If a host where a software system is deployed fails, the service network management subsystem receives the notification. Reset the alarm state of the logical network element corresponding to the target software system;

F、业务维护人员查询逻辑网元的告警状态时,可以观察到软件系统部署所在的主机上产生的硬件告警,如果某些硬件告警可能最终导致软件系统运行异常,则需要通知主机维护人员及时修复这些硬件告警;业务维护人员在对这些告警进行确认、清除操作时,相应的操作命令最终下发到相应的主机上;F. When business maintenance personnel query the alarm status of logical network elements, they can observe the hardware alarms generated on the host where the software system is deployed. If some hardware alarms may eventually lead to abnormal operation of the software system, they need to notify the host maintenance personnel to repair them in time These hardware alarms; when business maintenance personnel confirm and clear these alarms, the corresponding operation commands are finally issued to the corresponding hosts;

G、业务维护人员在逻辑网元上创建监控主机运行状况的监视任务或故障检测任务后,性能管理预处理模块将相关参数发送给IP网管子系统,IP网管子系统再根据这些参数采集目标主机的状态数据,并将已采集结果实时送到状态监控界面;业务维护人员一旦发现目标主机运行异常,则通知IP业务维护人员及时处理以恢复目标主机的正常运行;G. After business maintenance personnel create monitoring tasks or fault detection tasks for monitoring the running status of hosts on logical network elements, the performance management preprocessing module sends relevant parameters to the IP network management subsystem, and the IP network management subsystem collects target hosts based on these parameters status data, and send the collected results to the status monitoring interface in real time; once the service maintenance personnel find that the target host is running abnormally, they will notify the IP service maintenance personnel to deal with it in time to restore the normal operation of the target host;

H、如果软件系统某模块产生了特定的告警,业务维护人员首先查询该模块部署的主机上是否产生了一些特定的告警,并确认这些告警是否会引发目标告警;如确认软件故障由主机硬件故障引发,则通知主机维护人员进行修复。H. If a module of the software system generates specific alarms, the business maintenance personnel first check whether some specific alarms have been generated on the host where the module is deployed, and confirm whether these alarms will trigger target alarms; if the software failure is confirmed by the host hardware failure If triggered, the host maintainer is notified to fix it.

流程C与D对于应用软件系统等于相关主机设备接入网管的情况也一样适用,只是前者是根据应用软件系统模块部署所在主机IP来查询相关的物理网元,后者是通过主机IP来查询相关的逻辑网元。另外业务维护人员与主机维护人员可能是同一批人,以上流程对于此种情况也同样适用。Processes C and D are also applicable to the situation where the application software system is equal to the relevant host equipment connected to the network management, but the former queries the relevant physical network elements according to the host IP where the application software system module is deployed, and the latter queries the relevant physical network elements through the host IP. logical network element. In addition, the business maintenance personnel and the host maintenance personnel may be the same group of people, and the above process is also applicable to this situation.

实现应用软件系统及其部署所在主机综合监管的网管系统,除去配置管理、故障管理、性能管理、安全管理等传统功能子系统外,还包括公共模型管理模块、故障管理预处理模块、性能管理预处理模块以及扩展的设备接入模块,其中公共模型管理模块负责存储及管理各类NE、Module及Service等对象以及它们之间的关系,故障管理预处理模块负责将主机(物理网元)上产生的告警关联到相应的逻辑网元上,性能管理预处理模块负责将主机(物理网元)上预设置的部分性能统计任务及监控任务关联到相应的逻辑网元上,扩展的设备接入模块则负责在设备接入的过程中发现及完成物理网元与逻辑网元之间的关联。The network management system that realizes the comprehensive supervision of the application software system and the host where it is deployed includes, in addition to traditional functional subsystems such as configuration management, fault management, performance management, and security management, it also includes a public model management module, a fault management preprocessing The processing module and the extended device access module, among which the public model management module is responsible for storing and managing various objects such as NE, Module and Service and the relationship between them, and the fault management preprocessing module is responsible for the host (physical network element) The alarms are associated with the corresponding logical network elements, the performance management preprocessing module is responsible for associating part of the performance statistics tasks and monitoring tasks preset on the host (physical network elements) with the corresponding logical network elements, and the extended device access module It is responsible for discovering and completing the association between physical network elements and logical network elements during the process of device access.

从以上的描述中,可以看出,本发明充分利用主机监管的标准接口,不需要应用软件系统进行大规模适配性的开发,网管系统实现较为简单。From the above description, it can be seen that the present invention makes full use of the standard interface supervised by the host, and does not require large-scale adaptability development of the application software system, and the implementation of the network management system is relatively simple.

具体来说,本发明实现了如下技术效果:Specifically, the present invention achieves the following technical effects:

通过在业务网管子系统中集成主机监管的功能以实现应用软件系统及其部署所在主机的综合监管后,可以有效降低因硬件故障导致的软件系统故障的产生率(通过及时预警),并提升该类故障的排查效率,降低应用软件系统的维护成本,降低网管系统及应用软件系统提供商的总体开发成本。By integrating the host supervision function in the service network management subsystem to realize the comprehensive supervision of the application software system and the host where it is deployed, the occurrence rate of software system failures caused by hardware failures can be effectively reduced (by timely early warning), and the system can be improved. Improve the troubleshooting efficiency of such faults, reduce the maintenance cost of the application software system, and reduce the overall development cost of the network management system and application software system providers.

网管系统不区分业务网管与IP网管子系统,仅区分不同的拓扑视图(业务拓扑视图与主机拓扑视图),在完成应用软件系统接入后同时提供这些不同类型的视图,并根据软件模块部署的IP地址自动执行对于相关主机的监控。The network management system does not distinguish between service network management and IP network management subsystems, but only distinguishes different topology views (service topology view and host topology view), and provides these different types of views at the same time after the application software system is connected. IP addresses automatically perform monitoring for associated hosts.

显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。应该明白,这些具体实施中的变化对于本领域的技术人员来说是显而易见的,不脱离本发明的精神保护范围。Obviously, those skilled in the art should understand that each module or each step of the above-mentioned present invention can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network formed by multiple computing devices Optionally, they can be implemented with program codes executable by a computing device, so that they can be stored in a storage device and executed by a computing device, or they can be made into individual integrated circuit modules, or they can be integrated into Multiple modules or steps are fabricated into a single integrated circuit module to realize. As such, the present invention is not limited to any specific combination of hardware and software. It should be understood that changes in these specific implementations are obvious to those skilled in the art and do not depart from the spirit protection scope of the present invention.

以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims (18)

1.一种网管系统,用于综合监管软件系统及其所部署主机,其特征在于,包括:1. A network management system, used for integrated supervisory software system and deployed host computer thereof, is characterized in that, comprising: 设备接入模块,用于创建映射所述软件系统及其所部署主机的公共信息模型,所述公共信息模型包括:A device access module, configured to create a public information model that maps the software system and its deployed host, the public information model includes: 逻辑网元层,其所包括的各个逻辑网元用于映射所述软件系统的各个部分,具有可供查询的标准接口;Logical network element layer, each logical network element included in it is used to map various parts of the software system, and has a standard interface available for query; 物理网元层,其所包括的各个物理网元用于映射所述各个部分所部署的主机,具有可供查询的标准接口;以及The physical network element layer, each physical network element included in it is used to map the hosts deployed by the various parts, and has a standard interface available for query; and 模块服务层,用于映射所述逻辑网元与所述物理网元之间的对应关系;A module service layer, configured to map the correspondence between the logical network element and the physical network element; 所述网管系统还包括:The network management system also includes: 业务网管子系统,用于通过所述逻辑网元层的标准接口来监管所述软件系统;以及A service network management subsystem, configured to supervise the software system through the standard interface of the logical network element layer; and IP网管子系统,用于通过所述物理网元层的标准接口来监管所述主机。The IP network management subsystem is used for supervising the host through the standard interface of the physical network element layer. 2.根据权利要求1所述的网管系统,其特征在于,所述设备接入模块,用于通过记录所述逻辑网元所部署主机的IP地址,并根据所述IP地址定位所述物理网元,在所述逻辑网元与所述物理网元之间创建关联,将所述关联信息存储在所述模块服务层中。2. The network management system according to claim 1, wherein the device access module is configured to locate the physical network according to the IP address by recording the IP address of the host on which the logical network element is deployed. Create an association between the logical network element and the physical network element, and store the association information in the module service layer. 3.根据权利要求1所述的网管系统,其特征在于,所述设备接入模块,用于通过主机IP地址查找与之相匹配的所述逻辑网元,并在所述逻辑网元与所述物理网元之间创建关联,将所述关联信息存储在所述模块服务层中。3. The network management system according to claim 1, wherein the device access module is configured to search for the matching logical network element through the host IP address, and connect the logical network element with the logical network element. Create an association between the physical network elements, and store the association information in the module service layer. 4.根据权利要求1所述的网管系统,其特征在于,所述业务网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、计费管理子系统、业务管理子系统、告警子系统、数据采集子系统;所述IP网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、告警子系统、数据采集子系统。4. The network management system according to claim 1, wherein the service network management subsystem comprises at least one of the following: configuration management subsystem, safety management subsystem, fault management subsystem, performance management subsystem, billing Management subsystem, service management subsystem, alarm subsystem, data acquisition subsystem; the IP network management subsystem includes at least one of the following: configuration management subsystem, security management subsystem, fault management subsystem, performance management subsystem, Alarm subsystem, data acquisition subsystem. 5.根据权利要求1所述的网管系统,其特征在于,所述业务网管子系统和所述IP网管子系统均包括告警、性能管理、配置管理子系统,所述告警、性能管理、配置管理子系统都包含预处理模块,这些预处理模块用于查询存储在公共信息模型中的逻辑网元与物理网元的关联信息,使得所述子系统能够识别逻辑网元与物理网元之间的关系。5. The network management system according to claim 1, wherein the service network management subsystem and the IP network management subsystem all include an alarm, performance management, configuration management subsystem, and the alarm, performance management, configuration management The subsystems all include preprocessing modules, and these preprocessing modules are used to query the association information between logical network elements and physical network elements stored in the public information model, so that the subsystems can identify the relationship between logical network elements and physical network elements. relation. 6.根据权利要求5所述的网管系统,其特征在于,在所述主机发生硬件故障并发送告警后,所述IP网管子系统用于接收并处理该告警,所述IP网管子系统中的告警子系统的预处理模块根据存储在公共信息模型中的物理网元与逻辑网元的关联信息来查询哪些逻辑网元因此受到影响,并通知业务网管子系统进行相应处理,从而在业务网管子系统侧能够观察到已经发生在主机上的告警。6. The network management system according to claim 5, wherein, after a hardware failure occurs in the host and an alarm is sent, the IP network management subsystem is used to receive and process the alarm, and the IP network management subsystem in the IP network management subsystem The preprocessing module of the alarm subsystem inquires which logical network elements are affected according to the association information between physical network elements and logical network elements stored in the public information model, and notifies the service network management subsystem to deal with it accordingly. The system side can observe the alarms that have occurred on the host. 7.根据权利要求6所述的网管系统,其特征在于,所述主机上发生的告警如果可能导致所述软件系统出现运行异常,业务网管子系统会发出预警,并通知业务维护人员采取措施。7. The network management system according to claim 6, wherein, if the alarm generated on the host may cause the software system to run abnormally, the service network management subsystem will issue an early warning and notify service maintenance personnel to take measures. 8.根据权利要求5所述的网管系统,其特征在于,在逻辑网元上创建监控物理网元运行状况的监视任务或故障检测任务后,所述IP网管子系统中的性能管理子系统的预处理模块用于将相关参数发送给所述IP网管子系统,所述IP网管子系统用于根据这些参数采集物理网元的状态数据,并将已采集结果实时送到状态监控界面。8. The network management system according to claim 5, characterized in that, after the monitoring task or the fault detection task of monitoring the physical network element operating condition is created on the logical network element, the performance management subsystem in the IP network management subsystem The preprocessing module is used to send relevant parameters to the IP network management subsystem, and the IP network management subsystem is used to collect status data of physical network elements according to these parameters, and send the collected results to the status monitoring interface in real time. 9.根据权利要求1所述的网管系统,其特征在于,所述业务网管子系统和所述IP网管子系统是基于管理领域划分,二者区分为业务拓扑视图与主机拓扑视图。9. The network management system according to claim 1, characterized in that, the service network management subsystem and the IP network management subsystem are divided based on management fields, and the two are divided into a service topology view and a host topology view. 10.一种网管方法,用于综合监管软件系统及其所部署主机,其特征在于,包括以下步骤:10. A network management method, used for comprehensive supervision software system and the deployed host thereof, is characterized in that, comprising the following steps: 创建映射所述软件系统及其所部署主机的公共信息模型,所述公共信息模型包括:Create a common information model that maps the software system and its deployed host, the common information model includes: 逻辑网元层,其所包括的各个逻辑网元用于映射所述软件系统的各个部分,具有可供查询的标准接口;Logical network element layer, each logical network element included in it is used to map various parts of the software system, and has a standard interface available for query; 物理网元层,其所包括的各个物理网元用于映射所述各个部分所部署的主机,具有可供查询的标准接口;以及The physical network element layer, each physical network element included in it is used to map the hosts deployed by the various parts, and has a standard interface available for query; and 模块服务层,用于映射所述逻辑网元与所述物理网元之间的对应关系;A module service layer, configured to map the correspondence between the logical network element and the physical network element; 业务网管子系统通过所述逻辑网元层的标准接口来监管所述软件系统;以及The service network management subsystem supervises the software system through the standard interface of the logical network element layer; and IP网管子系统通过所述物理网元层的标准接口来监管所述主机。The IP network management subsystem supervises the host through the standard interface of the physical network element layer. 11.根据权利要求10所述的网管方法,其特征在于,创建所述公共信息模型中的物理网元层包括以下步骤:记录所述逻辑网元所部署主机的IP地址,并根据所述IP地址映射所述物理网元,然后在所述逻辑网元与所述物理网元之间创建关联,将所述关联信息存储在所述模块服务层中。11. The network management method according to claim 10, wherein creating the physical network element layer in the public information model comprises the following steps: recording the IP address of the host where the logical network element is deployed, and according to the IP address The address is mapped to the physical network element, and then an association is created between the logical network element and the physical network element, and the association information is stored in the module service layer. 12.根据权利要求10所述的网管方法,其特征在于,创建所述公共信息模型中的逻辑网元层包括以下步骤:通过主机IP地址查找与之相匹配的所述逻辑网元,并在所述逻辑网元与所述物理网元之间创建关联,将所述关联信息存储在所述模块服务层中。12. The network management method according to claim 10, characterized in that, creating the logical network element layer in the public information model comprises the following steps: searching for the matching logical network element by the host IP address, and An association is established between the logical network element and the physical network element, and the association information is stored in the module service layer. 13.根据权利要求10所述的网管方法,其特征在于,所述业务网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、计费管理子系统、业务管理子系统、告警子系统、数据采集子系统;所述IP网管子系统包括以下至少之一:配置管理子系统、安全管理子系统、故障管理子系统、性能管理子系统、告警子系统、数据采集子系统。13. The network management method according to claim 10, wherein the service network management subsystem comprises at least one of the following: configuration management subsystem, security management subsystem, fault management subsystem, performance management subsystem, billing Management subsystem, service management subsystem, alarm subsystem, data acquisition subsystem; the IP network management subsystem includes at least one of the following: configuration management subsystem, security management subsystem, fault management subsystem, performance management subsystem, Alarm subsystem, data acquisition subsystem. 14.根据权利要求10所述的网管方法,其特征在于,所述业务网管子系统和所述IP网管子系统均包括告警、性能管理、配置管理子系统,所述告警、性能管理、配置管理子系统都包含预处理模块,监管所述主机包括以下步骤:这些预处理模块查询存储在公共信息模型中的逻辑网元与物理网元的关联信息,使得所述子系统能够识别逻辑网元与物理网元之间的关系。14. The network management method according to claim 10, characterized in that, the service network management subsystem and the IP network management subsystem both include alarm, performance management, configuration management subsystems, and the alarm, performance management, configuration management The subsystems all include preprocessing modules, and supervising the host includes the following steps: These preprocessing modules query the association information between logical network elements and physical network elements stored in the public information model, so that the subsystems can identify logical network elements and physical network elements. The relationship between physical network elements. 15.根据权利要求14所述的网管方法,其特征在于,监管所述主机包括以下步骤:在所述主机发生硬件故障并发送告警后,所述IP网管子系统接收并处理该告警,所述IP网管子系统中的告警子系统的预处理模块根据存储在公共信息模型中的物理网元与逻辑网元的关联信息来查询哪些逻辑网元因此受到影响,并通知业务网管子系统进行相应处理,从而在业务网管子系统侧能够观察到已经发生在主机上的告警。15. The network management method according to claim 14, wherein supervising the host comprises the following steps: after a hardware failure occurs in the host and an alarm is sent, the IP network management subsystem receives and processes the alarm, the The preprocessing module of the alarm subsystem in the IP network management subsystem queries which logical network elements are affected based on the association information between physical network elements and logical network elements stored in the public information model, and notifies the service network management subsystem to handle accordingly , so that the alarms that have occurred on the host can be observed on the service network management subsystem side. 16.根据权利要求15所述的网管方法,其特征在于,监管所述主机包括以下步骤:所述主机上发生的告警如果可能导致所述软件系统出现运行异常,业务网管子系统会发出预警,并通知业务维护人员采取措施。16. The network management method according to claim 15, wherein supervising the host comprises the following steps: if an alarm occurring on the host may cause abnormal operation of the software system, the service network management subsystem will issue an early warning, And notify the service maintenance personnel to take measures. 17.根据权利要求14所述的网管方法,其特征在于,监管所述主机包括以下步骤:在逻辑网元上创建监控物理网元运行状况的监视任务或故障检测任务后,所述IP网管子系统中的性能管理子系统的预处理模块将相关参数发送给所述IP网管子系统,所述IP网管子系统根据这些参数采集物理网元的状态数据,并将已采集结果实时送到状态监控界面。17. The network management method according to claim 14, wherein supervising the host comprises the following steps: after creating a monitoring task or a fault detection task for monitoring the running status of a physical network element on a logical network element, the IP network management sub The preprocessing module of the performance management subsystem in the system sends relevant parameters to the IP network management subsystem, and the IP network management subsystem collects the status data of the physical network elements according to these parameters, and sends the collected results to the status monitoring in real time interface. 18.根据权利要求10所述的网管方法,其特征在于,所述业务网管子系统和所述IP网管子系统是基于管理领域划分,二者区分为业务拓扑视图与主机拓扑视图。18. The network management method according to claim 10, characterized in that, the service network management subsystem and the IP network management subsystem are divided based on management fields, and the two are divided into a service topology view and a host topology view.
CNB2006101411891A 2006-10-13 2006-10-13 Network management system for integrative supervision and management of application software system and host resource Expired - Fee Related CN100426756C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006101411891A CN100426756C (en) 2006-10-13 2006-10-13 Network management system for integrative supervision and management of application software system and host resource

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006101411891A CN100426756C (en) 2006-10-13 2006-10-13 Network management system for integrative supervision and management of application software system and host resource

Publications (2)

Publication Number Publication Date
CN1968148A CN1968148A (en) 2007-05-23
CN100426756C true CN100426756C (en) 2008-10-15

Family

ID=38076713

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006101411891A Expired - Fee Related CN100426756C (en) 2006-10-13 2006-10-13 Network management system for integrative supervision and management of application software system and host resource

Country Status (1)

Country Link
CN (1) CN100426756C (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101217397B (en) * 2007-12-27 2010-07-07 华为技术有限公司 A network management method, system and device
CN101488889B (en) * 2008-01-17 2011-04-20 中兴通讯股份有限公司 Detecting and recovering method for network management systematic engineering failure
CN102572868B (en) * 2010-12-27 2015-03-11 中国移动通信集团设计院有限公司 Wireless network information processing method and device as well as wireless network information management server
CN104104529A (en) * 2013-04-03 2014-10-15 中兴通讯股份有限公司 Network management method and system based on common information model
CN105740140A (en) * 2014-12-10 2016-07-06 中兴通讯股份有限公司 Software system failure diagnosis method, server and system
CN107517108A (en) * 2017-09-05 2017-12-26 合肥丹朋科技有限公司 System for managing application program of computer network

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1547357A (en) * 2003-12-16 2004-11-17 北京航空航天大学 A Dynamically Constructed Network Management Method
US6963916B1 (en) * 1998-12-31 2005-11-08 Qwest Communications International Inc. Network management system and graphical user interface
CN1741466A (en) * 2004-08-27 2006-03-01 华为技术有限公司 Centralized network management system alarm processing method and centralized network management system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6963916B1 (en) * 1998-12-31 2005-11-08 Qwest Communications International Inc. Network management system and graphical user interface
CN1547357A (en) * 2003-12-16 2004-11-17 北京航空航天大学 A Dynamically Constructed Network Management Method
CN1741466A (en) * 2004-08-27 2006-03-01 华为技术有限公司 Centralized network management system alarm processing method and centralized network management system

Also Published As

Publication number Publication date
CN1968148A (en) 2007-05-23

Similar Documents

Publication Publication Date Title
US8516106B2 (en) Use tag clouds to visualize components related to an event
US9658914B2 (en) Troubleshooting system using device snapshots
US7065566B2 (en) System and method for business systems transactions and infrastructure management
US6269401B1 (en) Integrated computer system and network performance monitoring
US7225250B1 (en) Method and system for predictive enterprise resource management
US7197489B1 (en) Methods and apparatus for maintaining object data for components in a network
US10924329B2 (en) Self-healing Telco network function virtualization cloud
CN106170947B (en) A kind of alarm information processing method, related equipment and system
CN100426756C (en) Network management system for integrative supervision and management of application software system and host resource
CN110175451A (en) A kind of method for safety monitoring and system based on electric power cloud
EP3520328A1 (en) Network health data aggregation service
WO2005069999A2 (en) Automated and embedded software reliability measurement and classification in network elements
CN106789412A (en) Method, the apparatus and system of monitoring information collection main website performance
CN109997337B (en) Visualization of network health information
CN112230847B (en) A method, system, terminal and storage medium for monitoring K8s storage volume
US7836156B1 (en) Method and apparatus for analyzing network infrastructure
CN105610648A (en) Operation and maintenance monitoring data collection method and server
CN107562601A (en) A kind of alarm method and device
US20250110818A1 (en) Computing cluster health reporting engine
WO2018010176A1 (en) Method and device for acquiring fault information
CN111371570B (en) A fault detection method and device for an NFV network
US20060053021A1 (en) Method for monitoring and managing an information system
US11237892B1 (en) Obtaining data for fault identification
CN114513398B (en) Network equipment alarm processing methods, devices, equipment and storage media
US7475076B1 (en) Method and apparatus for providing remote alert reporting for managed resources

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20081015

Termination date: 20121013