
CN109039714A - The management method and device of resource in cloud computing system - Google Patents

The management method and device of resource in cloud computing system

Info

Publication number
CN109039714A
Authority
CN
China
Prior art keywords
resource
performance
information
processing
alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810781903.6A
Other languages
Chinese (zh)
Inventor
张少杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201810781903.6A priority Critical patent/CN109039714A/en
Publication of CN109039714A publication Critical patent/CN109039714A/en
Pending legal-status Critical Current


Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08: Configuration management of networks or network elements
    • H04L41/0893: Assignment of logical groups to network elements
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06: Management of faults, events, alarms or notifications
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06: Management of faults, events, alarms or notifications
    • H04L41/0631: Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06: Management of faults, events, alarms or notifications
    • H04L41/069: Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00: Arrangements for monitoring or testing data switching networks
    • H04L43/08: Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805: Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817: Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00: Arrangements for monitoring or testing data switching networks
    • H04L43/16: Threshold monitoring

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention discloses a method and device for managing resources in a cloud computing system. The method includes: acquiring resource management information for each type of resource, where the resource management information includes a resource identifier and the performance parameters of the resource; collecting, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier; judging, according to a preset performance monitoring policy for the resource, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and obtaining a judgment result; and managing the resource according to the judgment result.

Description

Resource management method and device in cloud computing system

Technical field

The invention relates to the field of information processing, and in particular to a method and device for managing resources in a cloud computing system.

Background

Cloud computing is a model for adding, using, and delivering Internet-based services, and usually involves providing dynamically scalable and often virtualized resources over the Internet. "Cloud" is a metaphor for the network and the Internet. In diagrams, a cloud was formerly used to represent the telecommunications network, and later also came to represent an abstraction of the Internet and its underlying infrastructure. Cloud computing can give users access to computing power on the order of 10 trillion operations per second, which is enough to simulate nuclear explosions and to forecast climate change and market trends. Users connect to the data center from desktop computers, laptops, mobile phones, and similar devices, and run computations according to their own needs.

The most widely accepted definition of cloud computing at present is that of the U.S. National Institute of Standards and Technology (NIST): cloud computing is a pay-per-use model that provides available, convenient, on-demand network access to a shared pool of configurable computing resources (including networks, servers, storage, applications, and services) that can be rapidly provisioned with minimal management effort or interaction with the service provider. In the early stage of data center construction, the main work is to build the network, compute, and storage infrastructure and a unified resource management platform, so that resources are managed uniformly and offered as self-service. In later stages, users increasingly pay attention to how resources are actually used, so that they can plan resources and control their use more effectively.

As cloud computing develops, a cloud platform often has to manage many kinds of cloud resources, and anomalies of various kinds inevitably occur while those resources are in use. How to manage the health of multiple kinds of cloud resources has therefore become a problem to be solved. In current cloud platforms, cloud resource anomalies usually have to be handled manually; when the number of cloud resources is large, handling efficiency drops, which reduces the availability of the cloud platform.

Summary of the invention

To solve the above technical problem, the invention provides a method and device for managing resources in a cloud computing system that improve the efficiency of anomaly handling.

To achieve the object of the invention, the invention provides a method for managing resources in a cloud computing system, including:

acquiring resource management information for each type of resource, where the resource management information includes a resource identifier and the performance parameters of the resource;

collecting, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier;

judging, according to a preset performance monitoring policy for the resource, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and obtaining a judgment result;

managing the resource according to the judgment result.

The method further has the following feature: the method further includes:

receiving a management request for the performance monitoring policy of a resource, where the management request includes at least one of the following operations on the performance monitoring policy: add, modify, delete, enable, and disable;

processing the performance monitoring policy according to the management request.

The method further has the following feature: judging whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and obtaining a judgment result, includes:

judging whether the real-time state of the performance parameter corresponding to the resource identifier is a preset normal working state; and/or

judging whether the real-time state of the performance parameter corresponding to the resource identifier is within a preset threshold range.

The method further has the following feature: managing the resource according to the judgment result includes:

when the judgment result is that the resource is not in a healthy working state, issuing alarm information indicating a resource anomaly, and recording the alarm information.

The method further has the following feature: after the alarm information indicating a resource anomaly is issued and recorded, the method further includes:

obtaining, from pre-stored alarm handling policies corresponding to each item of performance information, the handling policy corresponding to the abnormal performance information;

handling the resource using the handling policy, and obtaining a handling result;

if the handling result is success, stopping the alarm operation and deleting the record of the alarm information.

A device for managing resources in a cloud computing system, including:

an acquisition module, configured to acquire resource management information for each type of resource, where the resource management information includes a resource identifier and the performance parameters of the resource;

a collection module, configured to collect, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier;

a judgment module, configured to judge, according to a preset performance monitoring policy for the resource, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and to obtain a judgment result;

a management module, configured to manage the resource according to the judgment result.

The device further has the following feature: the device further includes:

a receiving module, configured to receive a management request for the performance monitoring policy of a resource, where the management request includes at least one of the following operations on the performance monitoring policy: add, modify, delete, enable, and disable;

a processing module, configured to process the performance monitoring policy according to the management request.

The device further has the following feature:

the judgment module is specifically configured to judge whether the real-time state of the performance parameter corresponding to the resource identifier is a preset normal working state; and/or to judge whether the real-time state of the performance parameter corresponding to the resource identifier is within a preset threshold range.

The device further has the following feature: the management module includes:

an alarm unit, configured to issue alarm information indicating a resource anomaly, and to record the alarm information, when the judgment result is that the resource is not in a healthy working state.

The device further has the following feature: the management module further includes:

an obtaining unit, configured to obtain, from pre-stored alarm handling policies corresponding to each item of performance information, the handling policy corresponding to the abnormal performance information;

a handling unit, configured to handle the resource using the handling policy and to obtain a handling result;

an output unit, configured to stop the alarm operation and delete the record of the alarm information if the handling result is success.

In the embodiments provided by the invention, the resource management information of each type of resource is acquired, the real-time state of the performance parameters corresponding to each resource identifier is collected, and a preset performance monitoring policy is used to judge whether that real-time state is a healthy working state. The resource is then managed according to the judgment result. Abnormal cloud resources are thus handled automatically, which reduces the adverse effects of cloud resource anomalies and improves the availability of the cloud platform.

Other features and advantages of the invention are set forth in the description that follows, and in part will be apparent from the description or may be learned by practicing the invention. The objects and other advantages of the invention may be realized and attained by the structures particularly pointed out in the description, the claims, and the accompanying drawings.

Description of drawings

The accompanying drawings provide a further understanding of the technical solution of the invention and form part of the description. Together with the embodiments of the application, they serve to explain the technical solution of the invention and do not limit it.

Fig. 1 is a flowchart of the method for managing resources in a cloud computing system provided by the invention;

Fig. 2 is a structural diagram of the cloud-platform-based cloud resource health management system provided by the invention;

Fig. 3 is a structural diagram of the computer-readable storage medium provided by the invention.

Detailed description

To make the objects, technical solutions, and advantages of the invention clearer, embodiments of the invention are described in detail below with reference to the accompanying drawings. It should be noted that, where no conflict arises, the embodiments in this application and the features in those embodiments may be combined with one another arbitrarily.

The steps shown in the flowcharts of the drawings may be executed in a computer system, for example as a set of computer-executable instructions. Although a logical order is shown in the flowcharts, in some cases the steps shown or described may be executed in a different order.

Fig. 1 is a flowchart of the method for managing resources in a cloud computing system provided by the invention. The method shown in Fig. 1 includes:

Step 101: acquire resource management information for each type of resource, where the resource management information includes a resource identifier and the performance parameters of the resource;

Specifically, a resource in the sense of the invention may be a running device, such as a host or a virtual machine, or it may be a resource pool. One data table is maintained for each type of resource, and each data table contains the resource identifier (id) and the performance parameters. For a virtual machine resource, for example, the resource identifier is the virtual machine id, and the performance parameters include at least one of the virtual machine power state, the virtual machine network state, the virtual machine CPU load, and the virtual machine memory load.
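The per-type data table described in step 101 can be pictured as follows. This is only a minimal in-memory sketch; the names `ResourceRecord`, `resource_tables`, and `register`, and the parameter keys, are illustrative assumptions and do not appear in the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, Union

ParamValue = Union[str, float]


@dataclass
class ResourceRecord:
    """One row of a per-type table: a resource id plus its performance parameters."""
    resource_id: str
    params: Dict[str, ParamValue] = field(default_factory=dict)


# One table per resource type, keyed by resource id (hypothetical layout).
resource_tables: Dict[str, Dict[str, ResourceRecord]] = {}


def register(resource_type: str, record: ResourceRecord) -> None:
    """Insert or overwrite a record in the table for its resource type."""
    resource_tables.setdefault(resource_type, {})[record.resource_id] = record


register(
    "virtual_machine",
    ResourceRecord(
        "vm-001",
        {"power_state": "on", "network_state": "up", "cpu_load": 0.42, "memory_load": 0.63},
    ),
)
```

Keeping one table per resource type mirrors the text: hosts, virtual machines, and resource pools can each carry a different set of performance parameters.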

Step 102: collect, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier;

The time policy is either periodic or user-defined. A user-defined time policy may, for example, set a higher collection frequency during peak operating periods than during off-peak periods.
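A user-defined time policy of this kind could be expressed as a function that returns the wait before the next collection. The sketch below is an assumption for illustration: the peak window (09:00 to 18:00) and both intervals are made-up values, not figures from the patent.

```python
from datetime import time

# Hypothetical policy values chosen only for the example.
PEAK_START = time(9, 0)
PEAK_END = time(18, 0)
PEAK_INTERVAL_S = 30       # collect more often during the peak period
OFF_PEAK_INTERVAL_S = 300  # collect less often outside it


def collection_interval(now: time) -> int:
    """Return the number of seconds to wait before the next collection."""
    if PEAK_START <= now < PEAK_END:
        return PEAK_INTERVAL_S
    return OFF_PEAK_INTERVAL_S
```

A purely periodic policy is the special case in which both intervals are equal.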

Step 103: judge, according to a preset performance monitoring policy for the resource, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and obtain a judgment result;

Specifically, because the resource health information may include information on multiple performance parameters, each performance parameter can be judged one by one during the judgment operation, yielding a separate judgment result for each.

Step 104: manage the resource according to the judgment result.

Specifically, if the judgment results for the real-time states of all performance parameters corresponding to the resource ID are healthy, the resource is determined to be working normally; if any performance parameter is judged not to be in a healthy working state, the resource is determined to be working abnormally.
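The aggregation rule in step 104 (a resource is normal only if every parameter is healthy) can be sketched in a few lines. The function names and the dictionary shape are assumptions made for the example.

```python
from typing import Dict, List


def resource_is_normal(param_results: Dict[str, bool]) -> bool:
    """A resource works normally only if every parameter was judged healthy.

    Note: with no parameters at all, all() returns True, so an unmonitored
    resource is treated as normal in this sketch.
    """
    return all(param_results.values())


def abnormal_params(param_results: Dict[str, bool]) -> List[str]:
    """List the parameters that were judged unhealthy, in sorted order."""
    return sorted(name for name, healthy in param_results.items() if not healthy)
```

Returning the list of unhealthy parameters alongside the overall verdict makes it possible to look up a handling policy per parameter later on.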

In the method embodiment provided by the invention, the resource management information of each type of resource is acquired, the real-time state of the performance parameters corresponding to each resource identifier is collected, and a preset performance monitoring policy is used to judge whether that real-time state is a healthy working state. The resource is then managed according to the judgment result. Abnormal cloud resources are thus handled automatically, which reduces the adverse effects of cloud resource anomalies and improves the availability of the cloud platform.

The method embodiment provided by the invention is described further below:

When the judgment operation is performed, the real-time state of each performance parameter of the resource is compared with the information in the corresponding performance monitoring policy to determine whether that parameter is abnormal. In practice, the performance monitoring policy should match actual conditions as precisely as possible and should also meet users' individual requirements. Accordingly, in one method embodiment provided by the invention, the method further includes:

receiving a management request for the performance monitoring policy of a resource, where the management request includes at least one of the following operations on the performance monitoring policy: add, modify, delete, enable, and disable;

processing the performance monitoring policy according to the management request.

Specifically, when a new item of performance information is added to the performance parameter information of a resource, a new policy can be created through the add operation so that the new performance information is monitored;

when an existing performance monitoring policy is found not to match current operating conditions, it can be adjusted through the modify operation so that it matches them better;

when an item of performance information is deleted from the resource health information, the corresponding existing policy can be removed through the delete operation.

When one resource has several performance monitoring policies, only one of them may be in use at a given time, in which case the remaining policies need to be disabled. The enable and disable operations control whether an existing performance monitoring policy is active.
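The five operations on a performance monitoring policy can be illustrated with a small in-memory store. This is a sketch under assumptions: `MonitoringPolicy`, `PolicyStore`, the `handle` dispatcher, and the operation strings are invented for the example and are not the patent's interfaces.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class MonitoringPolicy:
    parameter: str        # performance parameter the policy watches
    judgment_value: object  # normal state or threshold (hypothetical field)
    enabled: bool = True


class PolicyStore:
    """Holds policies by name and applies add/modify/delete/enable/disable requests."""

    def __init__(self) -> None:
        self._policies: Dict[str, MonitoringPolicy] = {}

    def get(self, name: str) -> Optional[MonitoringPolicy]:
        return self._policies.get(name)

    def handle(self, operation: str, name: str,
               policy: Optional[MonitoringPolicy] = None) -> None:
        if operation == "add":
            if name in self._policies:
                raise KeyError(f"policy {name!r} already exists")
            if policy is None:
                raise ValueError("add requires a policy")
            self._policies[name] = policy
        elif operation == "modify":
            if name not in self._policies:
                raise KeyError(f"policy {name!r} does not exist")
            if policy is None:
                raise ValueError("modify requires a policy")
            self._policies[name] = policy
        elif operation == "delete":
            del self._policies[name]  # raises KeyError if the policy is unknown
        elif operation in ("enable", "disable"):
            self._policies[name].enabled = operation == "enable"
        else:
            raise ValueError(f"unknown operation {operation!r}")
```

For example, `store.handle("disable", "vm.disk")` keeps the policy in the store but takes it out of evaluation, which matches the case where only one of several policies for a resource is in use.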

In this way, the performance monitoring policy can be kept closely matched to actual conditions while also meeting users' individual requirements.

The resource anomaly judgment value takes one of two forms. One is the binary case, in which the corresponding cloud resource attribute has only two values, for example the power state of a virtual machine, which is either on or off. The other is the threshold case, for example the disk usage of a virtual machine.

To handle these two cases, in one method embodiment provided by the invention, judging whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and obtaining a judgment result, includes:

judging whether the real-time state of the performance parameter corresponding to the resource identifier is a preset normal working state; and/or judging whether the real-time state of the performance parameter corresponding to the resource identifier is within a preset threshold range.

If the real-time state of the performance parameter corresponding to the resource identifier is the normal working state, the resource health information is in a healthy working state; otherwise, it is not in a healthy working state, that is, an anomaly has occurred;

if the real-time state of the performance parameter corresponding to the resource identifier is within the threshold range, the resource health information is in a healthy working state; otherwise, it is not in a healthy working state, that is, an anomaly has occurred.

In this way, performance information can be compared in a targeted manner, abnormal cloud resources can be identified, and management efficiency is improved.
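The two judgment forms (a preset normal state, or a preset threshold range) can be sketched as two rule types with a single check function. The class names, the `is_healthy` function, and the inclusive bounds are assumptions for illustration; the patent does not say whether the range limits are inclusive.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class StateRule:
    """Binary case: the parameter is healthy only in one preset state."""
    normal_state: str


@dataclass(frozen=True)
class ThresholdRule:
    """Threshold case: the parameter is healthy inside [low, high] (assumed inclusive)."""
    low: float
    high: float


Rule = Union[StateRule, ThresholdRule]


def is_healthy(rule: Rule, value: Union[str, float]) -> bool:
    """Compare a real-time parameter value against its rule."""
    if isinstance(rule, StateRule):
        return value == rule.normal_state
    return rule.low <= float(value) <= rule.high
```

For instance, a power-state rule would be `StateRule("on")`, while a disk-usage rule might be `ThresholdRule(0.0, 0.9)`.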

When a cloud resource becomes abnormal, an alarm can be raised for that resource so that the user can manage it. In the method embodiment provided by the invention, managing the resource according to the judgment result includes:

when the judgment result is that the resource is not in a healthy working state, issuing alarm information indicating a resource anomaly, and recording the alarm information.

Specifically, the alarm may be an audible and visual alert, or it may be pushed over the network to the user's terminal. Descriptive information about the alarm, such as a description of the problem and handling suggestions, is recorded to help the user with maintenance and management and to serve as a reference for later management.

To further reduce labor costs and improve the efficiency of handling abnormal resources, after the alarm information indicating a resource anomaly is issued and recorded, the method further includes:

obtaining, from pre-stored alarm handling policies corresponding to each item of performance information, the handling policy corresponding to the abnormal performance information;

handling the resource using the handling policy, and obtaining a handling result;

if the handling result is success, stopping the alarm operation and deleting the record of the alarm information.

Specifically, alarm handling policies corresponding to the resource health information are stored in advance, and each alarm handling policy is labeled with the performance information it applies to. The performance information that triggered the alarm is determined first, the corresponding alarm handling policy is then looked up from that performance information, and the repair is carried out according to that policy. When the repair operation has finished, the handling result is output. If the repair succeeded, the alarm information is deleted and the user is notified that the alarm has been cleared; if the repair failed, the alarm information is retained and the user is informed that automatic repair did not succeed.

In this way, abnormal cloud resources can be handled and maintained automatically, which effectively reduces the cost of manual management.
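The alarm lifecycle described above (record the alarm, look up the handling policy by performance item, delete the record only on success) might look like the following sketch. `AlarmManager`, its method names, and the handler signature are assumptions; real handling policies would call the cloud platform rather than return a fixed value.

```python
from typing import Callable, Dict, Tuple

# A handling policy takes a resource id and returns True if the repair succeeded.
Handler = Callable[[str], bool]


class AlarmManager:
    def __init__(self, strategies: Dict[str, Handler]) -> None:
        self.strategies = strategies  # handling policies keyed by performance item
        self.alarms: Dict[Tuple[str, str], str] = {}  # (resource id, item) -> description

    def raise_alarm(self, resource_id: str, parameter: str, description: str) -> None:
        """Record the alarm; notification (sound, push message) is out of scope here."""
        self.alarms[(resource_id, parameter)] = description

    def handle(self, resource_id: str, parameter: str) -> bool:
        """Apply the policy for this item; clear the alarm record only on success."""
        strategy = self.strategies.get(parameter)
        succeeded = strategy(resource_id) if strategy is not None else False
        if succeeded:
            self.alarms.pop((resource_id, parameter), None)
        return succeeded
```

When `handle` returns `False`, the record stays in `alarms`, which corresponds to retaining the alarm and telling the user that automatic repair did not succeed.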

图2为本发明提供的基于云平台的云资源健康管理系统的结构图。图2所示系统包括资源信息收集装置、云资源健康评估装置、云资源健康评估规则装置、异常云资源处理装置和异常云资源处理策略装置。其中,云资源信息收集装置负责通过云平台的API(Application Programming Interface,应用程序编程接口)获取各种云资源的健康信息。云资源健康评估规则装置负责保存对多种云资源是否健康的评估标准。云资源健康评估装置负责对云资源是否异常进行判断,并获取云资源异常信息。异常云资源处理策略装置负责维护对异常云资源如何处理的策略。异常云资源处理装置负责完成对异常云资源的处理。FIG. 2 is a structural diagram of a cloud resource health management system based on a cloud platform provided by the present invention. The system shown in FIG. 2 includes a resource information collection device, a cloud resource health evaluation device, a cloud resource health evaluation rule device, an abnormal cloud resource processing device, and an abnormal cloud resource processing policy device. Wherein, the cloud resource information collection device is responsible for acquiring health information of various cloud resources through an API (Application Programming Interface, Application Programming Interface) of the cloud platform. The cloud resource health evaluation rule device is responsible for saving evaluation criteria for the health of various cloud resources. The cloud resource health evaluation device is responsible for judging whether the cloud resource is abnormal, and obtaining information about the abnormality of the cloud resource. The abnormal cloud resource processing strategy device is responsible for maintaining the strategy on how to handle abnormal cloud resources. The device for processing abnormal cloud resources is responsible for completing the processing of abnormal cloud resources.

The system works as follows: after obtaining information about different types of cloud resources, it evaluates that information against the cloud resource health evaluation rules to obtain detailed information about the abnormal cloud resources, and then applies the abnormal cloud resource processing policies to process those resources.

The system is implemented as follows:

1) The cloud resource information collection device periodically calls the cloud platform's cloud resource information API to obtain the health information of cloud resources of each resource type and stores it in a database. A separate health information data table is maintained for each type of resource, and each table includes the resource id and the resource health information fields. For example, for virtual machine resources, the fields of the health information table include the virtual machine id, power state, network state, CPU load, memory load, and so on.
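As an illustration of the per-type health table, the sketch below creates a virtual machine health table in an in-memory SQLite database and stores one sample. The table name `vm_health`, the column names and types, and the helper `store_vm_health` are assumptions made for this example; the text names the fields but does not specify a schema or a database product.

```python
import sqlite3

# In-memory database so the example leaves nothing behind.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE vm_health (
        vm_id         TEXT PRIMARY KEY,  -- resource id
        power_state   TEXT,              -- e.g. 'on' / 'off'
        network_state TEXT,              -- e.g. 'up' / 'down'
        cpu_load      REAL,              -- fraction of CPU in use
        memory_load   REAL               -- fraction of memory in use
    )
    """
)


def store_vm_health(sample: dict) -> None:
    """Insert or overwrite the latest health sample for one virtual machine."""
    conn.execute(
        "INSERT OR REPLACE INTO vm_health "
        "VALUES (:vm_id, :power_state, :network_state, :cpu_load, :memory_load)",
        sample,
    )
    conn.commit()


store_vm_health(
    {
        "vm_id": "vm-1",
        "power_state": "on",
        "network_state": "up",
        "cpu_load": 0.42,
        "memory_load": 0.63,
    }
)
row = conn.execute(
    "SELECT power_state, cpu_load FROM vm_health WHERE vm_id = ?", ("vm-1",)
).fetchone()
print(row)
```

Other resource types would get their own tables with their own health columns, in line with the one-table-per-type layout described above.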

2) Cloud resource health evaluation rules are maintained in the cloud resource health evaluation rule device, which supports adding, modifying, deleting, enabling, and disabling rules. A separate rule data table is maintained for each type of cloud resource, and each table includes the evaluated resource attribute, whether the rule is enabled, and the abnormality judgment value.

The abnormality judgment value takes one of two forms. In the binary case, the cloud resource attribute has only two possible values, such as a virtual machine's power state, which is either on or off. In the threshold case, a threshold is set for the attribute, such as a virtual machine's disk usage.

Operations on resource evaluation rules include adding, enabling, disabling, and modifying.
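The two kinds of abnormality judgment value can be captured in a small rule type. The sketch below is illustrative only: the class name `EvaluationRule`, the `kind` labels, and the choice that a value above the threshold counts as abnormal are assumptions, since the text does not say in which direction a threshold is applied.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class EvaluationRule:
    attribute: str                     # evaluated resource attribute
    enabled: bool                      # whether the rule is switched on
    kind: str                          # "binary" or "threshold" (labels assumed here)
    abnormal_value: Union[str, float]  # abnormality judgment value


def is_abnormal(rule: EvaluationRule, value: Union[str, float]) -> bool:
    """Return True when an enabled rule judges the observed value abnormal."""
    if not rule.enabled:
        return False
    if rule.kind == "binary":
        # Binary attribute: abnormal when it equals the configured abnormal state.
        return value == rule.abnormal_value
    if rule.kind == "threshold":
        # Assumption: values above the threshold are abnormal.
        return float(value) > float(rule.abnormal_value)
    raise ValueError(f"unknown rule kind: {rule.kind}")


power_rule = EvaluationRule("power_state", True, "binary", "off")
disk_rule = EvaluationRule("disk_usage", True, "threshold", 0.9)
print(is_abnormal(power_rule, "off"), is_abnormal(disk_rule, 0.75))
```

A disabled rule never reports an abnormality, which is one simple way to honor the enable and disable operations listed above.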

3) The cloud resource evaluation device evaluates the cloud resource health information against the cloud resource evaluation rules to obtain the abnormal cloud resource information. The abnormal cloud resource data table includes the resource type, the resource id, and the abnormal resource attribute.
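The evaluation step can be pictured as a scan of health rows against the enabled rules, emitting one record per violation with the three fields named above. In this sketch the function name `scan`, the dictionary layout of rows and rules, and the sample values are hypothetical.

```python
from typing import Any, Dict, List

Rule = Dict[str, Any]  # assumed keys: "attribute", "enabled", "is_abnormal"


def scan(resource_type: str, health_rows: List[Dict[str, Any]], rules: List[Rule]) -> List[Dict[str, str]]:
    """Return one abnormal-resource record per (resource, enabled rule) violation."""
    abnormal = []
    for row in health_rows:
        for rule in rules:
            if rule["enabled"] and rule["is_abnormal"](row[rule["attribute"]]):
                abnormal.append(
                    {
                        "resource_type": resource_type,
                        "resource_id": row["id"],
                        "abnormal_attribute": rule["attribute"],
                    }
                )
    return abnormal


vm_rows = [
    {"id": "vm-1", "power_state": "on", "disk_usage": 0.95},
    {"id": "vm-2", "power_state": "off", "disk_usage": 0.40},
]
vm_rules = [
    {"attribute": "power_state", "enabled": True, "is_abnormal": lambda v: v == "off"},
    {"attribute": "disk_usage", "enabled": True, "is_abnormal": lambda v: v > 0.9},
]
abnormal_records = scan("vm", vm_rows, vm_rules)
for record in abnormal_records:
    print(record)
```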

4) Abnormal cloud resource processing policies are maintained in the abnormal cloud resource processing policy device. The policy data table includes the cloud resource type, the abnormal cloud resource attribute, and the processing policy. The processing policies are predefined, and through them the abnormal cloud resource processing device can call the corresponding cloud resource processing API.

Operations on abnormal cloud resource processing policies include adding, enabling, disabling, and modifying.
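One way to picture the policy table is as a mapping from (resource type, abnormal attribute) to a predefined policy name, which in turn resolves to a platform call. Everything named below (`POLICY_TABLE`, `PLATFORM_API`, `restart_vm`, `expand_disk`, `resolve`) is hypothetical and stands in for whatever processing APIs a given cloud platform exposes.

```python
from typing import Callable, Dict, Optional, Tuple


def restart_vm(resource_id: str) -> str:
    """Stand-in for a cloud platform processing API (hypothetical)."""
    return f"restart requested for {resource_id}"


def expand_disk(resource_id: str) -> str:
    """Stand-in for a cloud platform processing API (hypothetical)."""
    return f"disk expansion requested for {resource_id}"


# Predefined policy names mapped to the calls that implement them.
PLATFORM_API: Dict[str, Callable[[str], str]] = {
    "restart_vm": restart_vm,
    "expand_disk": expand_disk,
}

# Policy table rows: (resource type, abnormal attribute) -> policy name and enabled flag.
POLICY_TABLE: Dict[Tuple[str, str], Dict[str, object]] = {
    ("vm", "power_state"): {"policy": "restart_vm", "enabled": True},
    ("vm", "disk_usage"): {"policy": "expand_disk", "enabled": False},
}


def resolve(resource_type: str, attribute: str) -> Optional[Callable[[str], str]]:
    """Return the platform call for an enabled policy, or None if none applies."""
    row = POLICY_TABLE.get((resource_type, attribute))
    if row is None or not row["enabled"]:
        return None
    return PLATFORM_API[str(row["policy"])]


handler = resolve("vm", "power_state")
print(handler("vm-2") if handler else "no enabled policy")
```

Keeping the policy name separate from the callable lets operators enable, disable, or modify table rows without touching the code that calls the platform.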

5) The abnormal cloud resource processing device combines the abnormal cloud resource information with the abnormal cloud resource processing policies to determine how each abnormal cloud resource should be handled, and calls the cloud platform's API to carry out the processing. After processing completes, it obtains the result of the operation from the cloud platform API and records it in the processing operation log.
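The final step, combining abnormal records with policies, calling the platform, and logging the outcome, might look like the sketch below. The function `process_abnormal`, the stub `fake_platform_api`, and the result strings are assumptions for illustration; a real deployment would call the actual cloud platform API and persist the log.

```python
from typing import Callable, Dict, List, Optional, Tuple


def process_abnormal(
    records: List[Dict[str, str]],
    policies: Dict[Tuple[str, str], str],
    call_api: Callable[[str, str], str],
) -> List[Dict[str, Optional[str]]]:
    """Apply the matching policy to each abnormal record and log the outcome."""
    operation_log = []
    for rec in records:
        action = policies.get((rec["resource_type"], rec["abnormal_attribute"]))
        if action is None:
            result = "skipped: no policy"
        else:
            # The platform API reports the result of the processing operation.
            result = call_api(action, rec["resource_id"])
        operation_log.append(
            {"resource_id": rec["resource_id"], "action": action, "result": result}
        )
    return operation_log


def fake_platform_api(action: str, resource_id: str) -> str:
    """Hypothetical stub that pretends only restarts succeed."""
    return "success" if action == "restart_vm" else "failed"


records = [
    {"resource_type": "vm", "resource_id": "vm-2", "abnormal_attribute": "power_state"},
    {"resource_type": "vm", "resource_id": "vm-1", "abnormal_attribute": "disk_usage"},
]
policies = {("vm", "power_state"): "restart_vm"}
log = process_abnormal(records, policies, fake_platform_api)
for entry in log:
    print(entry)
```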

The system embodiment provided by the present invention is mainly applied to cloud resource health management on a cloud platform. Managing the health of the platform's cloud resources with this scheme allows abnormal cloud resources to be processed automatically, which reduces the adverse effects of cloud resource abnormalities and improves the availability of the cloud platform. The scheme also supports health management for different kinds of cloud resources and is therefore readily extensible.

Based on the above system structure, the method provided by the present invention mainly includes the following steps:

Step 1. Obtain the health information of multiple kinds of cloud resources through the cloud platform's API;

Step 2. Create cloud resource health evaluation rules;

Step 3. The cloud resource evaluation device scans the cloud resource health information using the rules created in step 2 to obtain the abnormal cloud resource information;

Step 4. Create abnormal cloud resource processing policies;

Step 5. The abnormal cloud resource processing device calls the relevant cloud platform APIs to process the abnormal cloud resources according to the policies created in step 4.

The method provided by this application example automatically scans the cloud platform's cloud resource information according to the cloud resource health evaluation rules to obtain the abnormal cloud resource information, and then automatically processes the abnormal resources according to the abnormal cloud resource processing policies. With this processing flow, different types of abnormal cloud resources are handled automatically on the basis of the evaluation rules and processing policies, which improves management efficiency and reduces manual maintenance costs.

FIG. 3 is a structural diagram of the device for managing resources in a cloud computing system provided by the present invention. The device shown in FIG. 3 includes:

an acquisition module 301, configured to acquire resource management information for each kind of resource, wherein the resource management information includes a resource identifier and performance parameters of the resource;

a collection module 302, configured to collect, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier;

a judging module 303, configured to judge, according to a preset performance monitoring policy for the resources, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, and to obtain a judgment result;

a management module 304, configured to manage the resource according to the judgment result.
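The preset time policy used by the collection module 302 can be pictured as a fixed-interval polling loop. The sketch below is one possible reading, not the patented design: the function `collect_periodically`, its parameters, and the injected `sleep` callable are hypothetical, and a fixed interval is only one of many time policies the text would allow.

```python
import time
from typing import Any, Callable, Dict, Iterable, List


def collect_periodically(
    resource_ids: Iterable[str],
    read_state: Callable[[str], Any],
    interval_seconds: float,
    rounds: int,
    sleep: Callable[[float], None] = time.sleep,
) -> List[Dict[str, Any]]:
    """Sample every resource once per round, waiting interval_seconds between rounds."""
    ids = list(resource_ids)
    samples = []
    for round_index in range(rounds):
        for resource_id in ids:
            samples.append(
                {
                    "round": round_index,
                    "resource_id": resource_id,
                    "state": read_state(resource_id),
                }
            )
        if round_index < rounds - 1:
            # Wait only between rounds, not after the last one.
            sleep(interval_seconds)
    return samples


def read_fake_state(resource_id: str) -> Dict[str, float]:
    """Hypothetical reader standing in for a platform metrics call."""
    return {"cpu_load": 0.5}


samples = collect_periodically(["vm-1", "vm-2"], read_fake_state, interval_seconds=0.01, rounds=2)
print(len(samples))
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays; a production collector would more likely run under a scheduler.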

In the device embodiment provided by the present invention, the device further includes:

a receiving module, configured to receive a management request for a resource performance monitoring policy, wherein the management request includes at least one of the following operations on the performance monitoring policy: adding, modifying, deleting, enabling, and disabling;

a processing module, configured to process the performance monitoring policy according to the management request.
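The five management operations on performance monitoring policies can be sketched as a small in-memory store. The class `PolicyStore`, the request fields (`op`, `name`, `threshold`), and the operation labels are assumptions made for this example; the text specifies only which operations exist.

```python
from typing import Any, Dict


class PolicyStore:
    """In-memory store for performance monitoring policies (illustrative only)."""

    def __init__(self) -> None:
        self.policies: Dict[str, Dict[str, Any]] = {}

    def handle(self, request: Dict[str, Any]) -> None:
        """Apply one management request: add, modify, delete, enable, or disable."""
        op = request["op"]
        name = request["name"]
        if op == "add":
            self.policies[name] = {"threshold": request["threshold"], "enabled": True}
        elif op == "modify":
            self.policies[name]["threshold"] = request["threshold"]
        elif op == "delete":
            del self.policies[name]
        elif op in ("enable", "disable"):
            self.policies[name]["enabled"] = op == "enable"
        else:
            raise ValueError(f"unsupported operation: {op}")


store = PolicyStore()
store.handle({"op": "add", "name": "vm.cpu_load", "threshold": 0.9})
store.handle({"op": "modify", "name": "vm.cpu_load", "threshold": 0.8})
store.handle({"op": "disable", "name": "vm.cpu_load"})
print(store.policies)
```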

In the device embodiment provided by the present invention, the judging module 303 is specifically configured to judge whether the real-time state of the performance parameter corresponding to the resource identifier is a preset normal working state; and/or to judge whether that real-time state is within a preset threshold range.

In the device embodiment provided by the present invention, the management module 304 includes:

an alarm unit, configured to issue alarm information indicating a resource abnormality, and to record the alarm information, when the judgment result is that the resource is not in a healthy working state.

In the device embodiment provided by the present invention, the management module 304 further includes:

an acquisition unit, configured to obtain, from the pre-stored alarm processing policies corresponding to each item of performance information, the processing policy corresponding to the abnormal performance information;

a processing unit, configured to process the resource using the processing policy and obtain a processing result;

an output unit, configured to stop the alarm operation and delete the record of the alarm information if the processing result is success.

The device embodiment provided by the present invention acquires the resource management information of each kind of resource, collects the real-time state of the performance parameters corresponding to each resource identifier, judges according to the preset performance monitoring policy whether that real-time state is a healthy working state, and manages the resource according to the judgment result. Abnormal cloud resources are thereby processed automatically, which reduces the adverse effects of cloud resource abnormalities and improves the availability of the cloud platform. The scheme also supports health management for different kinds of cloud resources and is therefore readily extensible.

A person of ordinary skill in the art will understand that all or some of the steps of the above embodiments can be implemented as a computer program flow. The computer program can be stored in a computer-readable storage medium and executed on a corresponding hardware platform (such as a system, equipment, apparatus, or device); when executed, it performs one or a combination of the steps of the method embodiment.

Optionally, all or some of the steps of the above embodiments can also be implemented with integrated circuits: the steps can each be made into a separate integrated circuit module, or several of the modules or steps can be made into a single integrated circuit module. The present invention is thus not limited to any specific combination of hardware and software.

The devices, functional modules, and functional units in the above embodiments can be implemented with general-purpose computing devices; they can be concentrated on a single computing device or distributed over a network of multiple computing devices.

When the devices, functional modules, or functional units in the above embodiments are implemented as software functional modules and sold or used as independent products, they can be stored in a computer-readable storage medium. Such a storage medium may be a read-only memory, a magnetic disk, an optical disc, or the like.

The above are only specific embodiments of the present invention, and the scope of protection of the present invention is not limited to them. Any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the scope of protection of the present invention. The scope of protection of the present invention shall therefore be determined by the scope of protection of the claims.

Claims (10)

1. A method for managing resources in a cloud computing system, characterized by comprising: acquiring resource management information for each kind of resource, wherein the resource management information includes a resource identifier and performance parameters of the resource; collecting, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier; judging, according to a preset performance monitoring policy for the resources, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, to obtain a judgment result; and managing the resource according to the judgment result.

2. The method according to claim 1, characterized in that the method further comprises: receiving a management request for a resource performance monitoring policy, wherein the management request includes at least one of the following operations on the performance monitoring policy: adding, modifying, deleting, enabling, and disabling; and processing the performance monitoring policy according to the management request.

3. The method according to claim 1 or 2, characterized in that judging whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, to obtain a judgment result, comprises: judging whether the real-time state of the performance parameter corresponding to the resource identifier is a preset normal working state; and/or judging whether the real-time state of the performance parameter corresponding to the resource identifier is within a preset threshold range.

4. The method according to claim 1, characterized in that managing the resource according to the judgment result comprises: when the judgment result is that the resource is not in a healthy working state, issuing alarm information indicating a resource abnormality and recording the alarm information.

5. The method according to claim 4, characterized in that, after issuing the alarm information indicating a resource abnormality and recording the alarm information when the judgment result is that the resource is not in a healthy working state, the method further comprises: obtaining, from the pre-stored alarm processing policies corresponding to each item of performance information, the processing policy corresponding to the abnormal performance information; processing the resource using the processing policy to obtain a processing result; and, if the processing result is success, stopping the alarm operation and deleting the record of the alarm information.

6. A device for managing resources in a cloud computing system, characterized by comprising: an acquisition module, configured to acquire resource management information for each kind of resource, wherein the resource management information includes a resource identifier and performance parameters of the resource; a collection module, configured to collect, according to a preset time policy, the real-time state of the performance parameters corresponding to each resource identifier; a judging module, configured to judge, according to a preset performance monitoring policy for the resources, whether the real-time state of the performance parameters corresponding to each resource identifier is a healthy working state, to obtain a judgment result; and a management module, configured to manage the resource according to the judgment result.

7. The device according to claim 6, characterized in that the device further comprises: a receiving module, configured to receive a management request for a resource performance monitoring policy, wherein the management request includes at least one of the following operations on the performance monitoring policy: adding, modifying, deleting, enabling, and disabling; and a processing module, configured to process the performance monitoring policy according to the management request.

8. The device according to claim 6 or 7, characterized in that the judging module is specifically configured to judge whether the real-time state of the performance parameter corresponding to the resource identifier is a preset normal working state; and/or to judge whether the real-time state of the performance parameter corresponding to the resource identifier is within a preset threshold range.

9. The device according to claim 6, characterized in that the management module comprises: an alarm unit, configured to issue alarm information indicating a resource abnormality, and to record the alarm information, when the judgment result is that the resource is not in a healthy working state.

10. The device according to claim 9, characterized in that the management module further comprises: an acquisition unit, configured to obtain, from the pre-stored alarm processing policies corresponding to each item of performance information, the processing policy corresponding to the abnormal performance information; a processing unit, configured to process the resource using the processing policy and obtain a processing result; and an output unit, configured to stop the alarm operation and delete the record of the alarm information if the processing result is success.
CN201810781903.6A 2018-07-17 2018-07-17 The management method and device of resource in cloud computing system Pending CN109039714A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810781903.6A CN109039714A (en) 2018-07-17 2018-07-17 The management method and device of resource in cloud computing system


Publications (1)

Publication Number Publication Date
CN109039714A true CN109039714A (en) 2018-12-18

Family

ID=64642848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810781903.6A Pending CN109039714A (en) 2018-07-17 2018-07-17 The management method and device of resource in cloud computing system

Country Status (1)

Country Link
CN (1) CN109039714A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966561A (en) * 2020-07-20 2020-11-20 四川虹美智能科技有限公司 Monitoring method, device and system for intangible asset management system
CN112162912A (en) * 2020-10-23 2021-01-01 新华三大数据技术有限公司 Cloud resource monitoring method and system
CN115442262A (en) * 2022-08-01 2022-12-06 阿里巴巴(中国)有限公司 Resource evaluation method and device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160011894A1 (en) * 2014-07-11 2016-01-14 Vmware, Inc. Methods and apparatus to configure virtual resource managers for use in virtual server rack deployments for virtual computing environments
CN107612755A (en) * 2017-10-31 2018-01-19 郑州云海信息技术有限公司 The management method and its device of a kind of cloud resource
CN107733712A (en) * 2017-10-18 2018-02-23 郑州云海信息技术有限公司 The monitoring method and device of Service Source in cloud computing system


Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966561A (en) * 2020-07-20 2020-11-20 四川虹美智能科技有限公司 Monitoring method, device and system for intangible asset management system
CN112162912A (en) * 2020-10-23 2021-01-01 新华三大数据技术有限公司 Cloud resource monitoring method and system
CN115442262A (en) * 2022-08-01 2022-12-06 阿里巴巴(中国)有限公司 Resource evaluation method and device, electronic equipment and storage medium
CN115442262B (en) * 2022-08-01 2024-02-06 阿里巴巴(中国)有限公司 Resource evaluation method and device, electronic equipment and storage medium


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181218
