CN105959172B - Redundant network management method and platform for a cluster system - Google Patents

Redundant network management method and platform for a cluster system

Info

Publication number
CN105959172B
Authority
CN
China
Prior art keywords
communication network
network segment
host
group system
segment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610573169.5A
Other languages
Chinese (zh)
Other versions
CN105959172A (en)
Inventor
马怀旭
方浩
樊云龙
姜文涛
赵祯龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Beijing Electronic Information Industry Co Ltd
Original Assignee
Inspur Beijing Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Beijing Electronic Information Industry Co Ltd
Priority to CN201610573169.5A
Publication of CN105959172A
Application granted
Publication of CN105959172B
Active legal status
Anticipated expiration legal status

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0654 Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0659 Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities
    • H04L41/0661 Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities by reconfiguring faulty entities
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12 Discovery or management of network topologies
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823 Errors, e.g. transmission errors

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Small-Scale Networks (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The present application discloses a redundant network management method for a cluster system, comprising: in advance, using the main communication network segment and the standby communication network segment of the cluster system respectively to connect all hosts into communication networks, obtaining a main ring communication network and a standby ring communication network, with the main ring communication network serving as the working network; monitoring both communication networks in real time and, if both suffer a network interruption failure, monitoring the working status of the communication network segments inside each host in the current cluster system; and, if no host has failures on both its main communication network segment and its standby communication network segment, rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host, and switching the working network of the cluster system to that ring communication network. The present application reduces the probability that a host in the cluster system is isolated because of a network segment failure. The present application also discloses a corresponding redundant network management platform.

Description

Redundant network management method and platform for a cluster system
Technical field
The present invention relates to the technical field of cluster network monitoring, and in particular to a redundant network management method and platform for a cluster system.
Background
With the rapid development of computer and network technology, cluster systems, with their powerful computing capability and robust fault-tolerance mechanisms, have increasingly become a focus of the computer industry. To ensure the stability of the network in a cluster system, cluster management is usually carried out in a redundant network mode.
However, in traditional redundant network management for clusters, as soon as any network segment inside a host fails, that host is forcibly isolated, even if other network segments inside it can still work normally. This greatly increases the probability that hosts in the cluster system are isolated. Once a host is isolated, a corresponding business migration is triggered, and the migration itself adds to the overall load of the cluster system, which hinders improvement of cluster performance.
In summary, how to reduce the probability that a host in a cluster system is isolated because of a network segment failure is a problem that currently needs to be solved.
Summary of the invention
In view of this, the purpose of the present invention is to provide a redundant network management method and platform for a cluster system that reduce the probability that a host in the cluster system is isolated because of a network segment failure. The specific scheme is as follows:
A redundant network management method for a cluster system, where the communication network segments in the cluster system include a main communication network segment and a standby communication network segment; the method includes:
Using the main communication network segment and the standby communication network segment respectively, in advance, to connect all hosts in the cluster system into communication networks, obtaining a corresponding main ring communication network and standby ring communication network, with the main ring communication network serving as the working network of the cluster system;
Monitoring the main ring communication network and the standby ring communication network in real time and, if both are found to have a network interruption failure, monitoring the working status of the communication network segments inside each host in the current cluster system;
If the monitoring finds no host in the current cluster system whose main communication network segment and standby communication network segment have both failed, rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host, and switching the working network of the cluster system to that ring communication network.
Preferably, the redundant network management method further includes:
While monitoring the working status of the communication network segments inside each host in the current cluster system, if a host is found whose main communication network segment and standby communication network segment have both failed, isolating that host.
Preferably, the redundant network management method further includes:
After the isolation, rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host that has not been isolated, and switching the working network of the cluster system to that ring communication network.
Preferably, the redundant network management method further includes:
While monitoring the main ring communication network and the standby ring communication network in real time, if only the main ring communication network is found to have a network interruption failure, switching the working network of the cluster system to the standby ring communication network.
Preferably, before the process of rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host, the redundant network management method further includes:
Counting the total number of hosts in the current cluster system whose main communication network segment can work normally, to obtain a first quantity;
Counting the total number of hosts in the current cluster system whose standby communication network segment can work normally, to obtain a second quantity.
Preferably, the process of rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host includes:
When the first quantity is greater than or equal to the second quantity, rebuilding the ring communication network of the cluster system based on a first preset network construction principle;
Wherein the first preset network construction principle is specifically:
Using the main communication network segments of first-class hosts and the standby communication network segments of second-class hosts to rebuild the ring communication network; wherein the first-class hosts include hosts whose main and standby communication network segments can both currently work normally, and hosts of which only the main communication network segment can currently work normally; the second-class hosts include hosts of which only the standby communication network segment can currently work normally.
Preferably, the process of rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host includes:
When the first quantity is less than the second quantity, rebuilding the ring communication network of the cluster system based on a second preset network construction principle;
Wherein the second preset network construction principle is specifically:
Using the main communication network segments of third-class hosts and the standby communication network segments of fourth-class hosts to rebuild the ring communication network; wherein the third-class hosts include hosts of which only the main communication network segment can currently work normally; the fourth-class hosts include hosts whose main and standby communication network segments can both currently work normally, and hosts of which only the standby communication network segment can currently work normally.
The invention also discloses a redundant network management platform for a cluster system, where the communication network segments in the cluster system include a main communication network segment and a standby communication network segment; the redundant network management platform includes:
A communication network connection module, configured to use the main communication network segment and the standby communication network segment respectively, in advance, to connect all hosts in the cluster system into communication networks, obtaining a corresponding main ring communication network and standby ring communication network, with the main ring communication network serving as the working network of the cluster system;
A communication network monitoring module, configured to monitor the main ring communication network and the standby ring communication network in real time;
A host network segment monitoring module, configured to monitor the working status of the communication network segments inside each host in the current cluster system when the communication network monitoring module finds that both the main ring communication network and the standby ring communication network have a network interruption failure;
A communication network reconnection module, configured to, when the host network segment monitoring module finds no host in the current cluster system whose main communication network segment and standby communication network segment have both failed, rebuild the ring communication network of the cluster system using one currently working communication network segment inside each host, and switch the working network of the cluster system to that ring communication network.
Preferably, the redundant network management platform further includes:
A host isolation module, configured to isolate a host when the host network segment monitoring module finds that both the main communication network segment and the standby communication network segment inside that host have failed.
Preferably, the communication network reconnection module is further configured to, after the host isolation module performs the isolation, rebuild the ring communication network of the cluster system using one currently working communication network segment inside each host that has not been isolated, and switch the working network of the cluster system to that ring communication network.
It can be seen that, in the present invention, when both the main ring communication network and the standby ring communication network have a network interruption failure, the working status of the communication network segments inside each host is monitored in real time. If no host is found whose main and standby communication network segments have both failed, that is, if every host in the current cluster system has at least one communication network segment that can work normally, the ring communication network of the cluster system is rebuilt using one currently working communication network segment inside each host, and the working network of the cluster system is switched to that ring communication network. Thus, when both ring communication networks have a network interruption failure, a host that still has at least one working communication network segment can join the new ring communication network instead of being forcibly isolated. The present invention therefore reduces the probability that a host in the cluster system is isolated because of a network segment failure.
Brief description of the drawings
To explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from the provided drawings without creative effort.
Fig. 1 is a flowchart of a redundant network management method for a cluster system disclosed in an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a redundant network management platform for a cluster system disclosed in an embodiment of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
An embodiment of the invention discloses a redundant network management method for a cluster system, in which the communication network segments include a main communication network segment and a standby communication network segment; the method includes:
Step S11: in advance, use the main communication network segment and the standby communication network segment of the cluster system respectively to connect all hosts in the cluster system into communication networks, obtaining a corresponding main ring communication network and standby ring communication network, with the main ring communication network serving as the working network of the cluster system.
That is, the main communication network segment is used to establish communication connections between all hosts in the cluster system in sequence, which yields the main ring communication network; similarly, the standby communication network segment is used to establish communication connections between all hosts in sequence, which yields the standby ring communication network. The cluster system in this embodiment therefore has two communication networks: the main ring communication network and the standby ring communication network.
It can be understood that, when both communication networks of the cluster system can work normally, the cluster system only needs to use the main ring communication network as its current working network, while the standby ring communication network remains on standby.
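Step S11 can be illustrated with a minimal Python sketch. The patent does not specify any data structures, so the host names, the `Link` tuple and the `build_ring` helper below are hypothetical and serve only to show how connecting hosts in sequence yields two rings.

```python
from typing import List, Tuple

# Hypothetical link representation: (from_host, to_host, segment)
Link = Tuple[str, str, str]

def build_ring(hosts: List[str], segment: str) -> List[Link]:
    """Connect each host to the next one and the last host back to the first."""
    n = len(hosts)
    return [(hosts[i], hosts[(i + 1) % n], segment) for i in range(n)]

hosts = ["A", "B", "C", "D"]                  # illustrative host names
main_ring = build_ring(hosts, "main")         # main ring communication network
standby_ring = build_ring(hosts, "standby")   # standby ring communication network
working_network = main_ring                   # the standby ring stays on standby
```

Each ring here has one link per host, and the final link closes the loop from the last host back to the first.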
Step S12: monitor the main ring communication network and the standby ring communication network in real time; if both are found to have a network interruption failure, monitor the working status of the communication network segments inside each host in the current cluster system.
That is, when both the main ring communication network and the standby ring communication network are found to have a network interruption failure, the working status of the communication network segments inside each host is obtained by monitoring, in order to determine which hosts in the current cluster system have a network segment failure and, within those hosts, which communication network segment has failed.
Step S13: if the monitoring finds no host in the current cluster system whose main communication network segment and standby communication network segment have both failed, rebuild the ring communication network of the cluster system using one currently working communication network segment inside each host, and switch the working network of the cluster system to that ring communication network.
That is, when both the main ring communication network and the standby ring communication network have a network interruption failure, finding no host whose main and standby communication network segments have both failed means that every host in the current cluster system still has at least one communication network segment that can work normally. It also means that at least two hosts in the current cluster system have different types of network segment failure: one has a main communication network segment failure and another has a standby communication network segment failure. To repair the network interruption, the present invention rebuilds the ring communication network of the cluster system using one currently working communication network segment inside each host, and switches the working network of the cluster system to that ring communication network.
For example, suppose the network interruption failures of the main ring communication network and the standby ring communication network are caused by a main communication network segment failure inside host A and a standby communication network segment failure inside host C. During the subsequent network repair, host A can join the new ring communication network through its standby communication network segment, which still works normally, and host C can join through its main communication network segment, which still works normally. The other hosts can join the new ring communication network through either their main or their standby communication network segment; this embodiment preferentially has every host whose communication network segments all work normally join through its main communication network segment. In the traditional technology, by contrast, host A and host C would be forcibly isolated and could not take part in building the new network, which would cause a large amount of business migration in the cluster system. The embodiment of the present invention thus reduces the probability that a host in the cluster system is isolated because of a network segment failure and avoids frequent business migration between hosts in the cluster system.
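The host A and host C example can be sketched as follows. This is only an illustration under assumed names: the `(main_ok, standby_ok)` status tuple and the `choose_segments` function are hypothetical, and the rule of preferring the main segment for fully healthy hosts follows the preference stated for this embodiment.

```python
from typing import Dict, Tuple

# Hypothetical per-host status: (main_ok, standby_ok)
SegmentStatus = Tuple[bool, bool]

def choose_segments(status: Dict[str, SegmentStatus]) -> Dict[str, str]:
    """Pick one working segment per host, preferring the main segment."""
    chosen: Dict[str, str] = {}
    for host, (main_ok, standby_ok) in status.items():
        if main_ok:
            chosen[host] = "main"
        elif standby_ok:
            chosen[host] = "standby"
        else:
            # Step S13 only applies when no host has lost both segments.
            raise ValueError(f"host {host} has no working segment")
    return chosen

status = {
    "A": (False, True),   # main segment failed
    "B": (True, True),
    "C": (True, False),   # standby segment failed
    "D": (True, True),
}
plan = choose_segments(status)
# Host A joins via its standby segment; B, C and D join via their main segments.
```

With these inputs no host is left out of the new ring, whereas the traditional approach described above would isolate both A and C.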
It can be seen that, in the embodiment of the present invention, when both the main ring communication network and the standby ring communication network have a network interruption failure, the working status of the communication network segments inside each host is monitored in real time. If no host is found whose main and standby communication network segments have both failed, that is, if every host in the current cluster system has at least one communication network segment that can work normally, the ring communication network of the cluster system is rebuilt using one currently working communication network segment inside each host, and the working network of the cluster system is switched to that ring communication network. Thus, when both ring communication networks have a network interruption failure, a host that still has at least one working communication network segment can join the new ring communication network instead of being forcibly isolated. The embodiment of the present invention therefore reduces the probability that a host in the cluster system is isolated because of a network segment failure.
An embodiment of the invention discloses a specific redundant network management method for a cluster system. Compared with the previous embodiment, this embodiment further explains and optimizes the technical solution. Specifically:
Compared with the previous embodiment, the redundant network management method in this embodiment may further include: while monitoring the working status of the communication network segments inside each host in the current cluster system, if a host is found whose main communication network segment and standby communication network segment have both failed, isolating that host.
That is, if both the main and the standby communication network segments of a host are found to have failed, the host cannot establish any network communication connection. It can then be forcibly isolated, so that all communication connection requests sent to it by other hosts are cut off.
Further, the redundant network management method in this embodiment may also include: after the isolation, rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host that has not been isolated, and switching the working network of the cluster system to that ring communication network.
That is, after the isolation, each remaining host has at least one communication network segment that can work normally. One currently working communication network segment inside each of these hosts is then used to rebuild the ring communication network of the cluster system, and the working network of the cluster system is switched to that ring communication network.
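The isolation step can be sketched as a simple partition of the hosts. The function name `isolate_failed_hosts` and the `(main_ok, standby_ok)` status tuple are assumptions made for illustration; the patent does not describe how isolation is implemented.

```python
from typing import Dict, List, Tuple

# Hypothetical per-host status: (main_ok, standby_ok)
SegmentStatus = Tuple[bool, bool]

def isolate_failed_hosts(
    status: Dict[str, SegmentStatus],
) -> Tuple[List[str], Dict[str, SegmentStatus]]:
    """Split hosts into those to isolate (both segments failed) and the rest."""
    isolated = [h for h, (main_ok, standby_ok) in status.items()
                if not main_ok and not standby_ok]
    remaining = {h: s for h, s in status.items() if h not in isolated}
    return isolated, remaining

status = {
    "A": (False, True),
    "B": (False, False),  # both segments failed, so B is isolated
    "C": (True, False),
}
isolated, remaining = isolate_failed_hosts(status)
# Only the remaining hosts take part in rebuilding the ring.
```

Every host left in `remaining` has at least one working segment, which is the precondition for rebuilding the ring from the non-isolated hosts.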
In addition, the redundant network management method in this embodiment may also include: while monitoring the main ring communication network and the standby ring communication network in real time, if only the main ring communication network is found to have a network interruption failure, switching the working network of the cluster system to the standby ring communication network.
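The ring-level decisions described so far can be summarized in a small sketch. The `Action` enum and `decide` function are hypothetical names. The text does not say what happens when only the standby ring fails, so keeping the main ring as the working network in that case is an assumption of this sketch.

```python
from enum import Enum

class Action(Enum):
    KEEP_MAIN = "keep main ring as working network"
    SWITCH_TO_STANDBY = "switch working network to standby ring"
    INSPECT_HOST_SEGMENTS = "monitor per-host segments and rebuild ring"

def decide(main_ring_ok: bool, standby_ring_ok: bool) -> Action:
    """Map the state of the two rings to the next action."""
    if main_ring_ok:
        # Assumption: a failure of only the standby ring leaves the working network unchanged.
        return Action.KEEP_MAIN
    if standby_ring_ok:
        # Only the main ring is interrupted.
        return Action.SWITCH_TO_STANDBY
    # Both rings are interrupted: fall through to per-host segment monitoring.
    return Action.INSPECT_HOST_SEGMENTS
```

For instance, `decide(False, True)` selects the switch to the standby ring, while `decide(False, False)` leads to the per-host segment monitoring of step S12.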
In step S13 of the previous embodiment, before the process of rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host, the method further includes step S130. Specifically:
Step S130: count the total number of hosts in the current cluster system whose main communication network segment can work normally, to obtain a first quantity, and count the total number of hosts in the current cluster system whose standby communication network segment can work normally, to obtain a second quantity.
In this embodiment, depending on the size relationship between the first quantity and the second quantity, the subsequent process of building the new ring communication network also differs. Specifically:
When the first quantity is greater than or equal to the second quantity, the process in step S13 of the previous embodiment of rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host is specifically: rebuilding the ring communication network of the cluster system based on a first preset network construction principle;
Wherein the first preset network construction principle is specifically: using the main communication network segments of first-class hosts and the standby communication network segments of second-class hosts to rebuild the ring communication network; the first-class hosts include hosts whose main and standby communication network segments can both currently work normally, and hosts of which only the main communication network segment can currently work normally; the second-class hosts include hosts of which only the standby communication network segment can currently work normally.
When the first quantity is less than the second quantity, the process in step S13 of the previous embodiment of rebuilding the ring communication network of the cluster system using one currently working communication network segment inside each host is specifically: rebuilding the ring communication network of the cluster system based on a second preset network construction principle;
Wherein the second preset network construction principle is specifically: using the main communication network segments of third-class hosts and the standby communication network segments of fourth-class hosts to rebuild the ring communication network; the third-class hosts include hosts of which only the main communication network segment can currently work normally; the fourth-class hosts include hosts whose main and standby communication network segments can both currently work normally, and hosts of which only the standby communication network segment can currently work normally.
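Step S130 and the two preset construction principles can be sketched together. The names `assign_segments` and the `(main_ok, standby_ok)` status tuple are hypothetical. The sketch relies on the observation that the two principles differ only in which segment a fully healthy host uses: main under the first principle, standby under the second.

```python
from typing import Dict, Tuple

# Hypothetical per-host status: (main_ok, standby_ok)
SegmentStatus = Tuple[bool, bool]

def assign_segments(status: Dict[str, SegmentStatus]) -> Tuple[int, int, Dict[str, str]]:
    """Count working segments (step S130) and apply the matching principle."""
    first = sum(1 for main_ok, _ in status.values() if main_ok)         # first quantity
    second = sum(1 for _, standby_ok in status.values() if standby_ok)  # second quantity
    # First principle when first >= second, otherwise second principle.
    preferred = "main" if first >= second else "standby"
    plan: Dict[str, str] = {}
    for host, (main_ok, standby_ok) in status.items():
        if main_ok and standby_ok:
            plan[host] = preferred
        elif main_ok:
            plan[host] = "main"
        elif standby_ok:
            plan[host] = "standby"
        else:
            raise ValueError(f"host {host} must be isolated first")
    return first, second, plan

status = {
    "A": (False, True),
    "B": (True, True),
    "C": (False, True),
    "D": (True, False),
}
first, second, plan = assign_segments(status)
# first == 2 (B, D), second == 3 (A, B, C): the second principle applies,
# so the fully healthy host B joins via its standby segment.
```

Choosing the segment type that more hosts can still use keeps as many hosts as possible on the same segment in the rebuilt ring.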
Correspondingly, an embodiment of the present invention further discloses a redundant network management platform of a group system, in which the communication network segments of the group system include a principal communication network segment and a standby communication network segment. Referring to Fig. 2, the redundant network management platform includes:
A communication network connection module 21, configured to use the principal communication network segment and the standby communication network segment, respectively, to connect all hosts in the group system in advance, obtaining a corresponding main ring communication network and standby ring communication network, with the main ring communication network serving as the working network of the group system;
A communication network monitoring module 22, configured to monitor the main ring communication network and the standby ring communication network in real time;
A host network segment monitoring module 23, configured to monitor the working state of the communication network segments inside every host in the current group system when the communication network monitoring module detects that both the main ring communication network and the standby ring communication network currently have a network interruption failure;
A communication network reconnection module 24, configured to, when the host network segment monitoring module detects that no host in the current group system has a network segment failure on both its principal communication network segment and its standby communication network segment, rebuild the ring communication network in the group system using one currently working communication network segment inside every host in the group system, and switch the working network of the group system to that ring communication network.
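The check performed by modules 23 and 24 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the function name `rebuild_ring` is hypothetical, the ring is modelled as an ordered list in which each host links to the next one (the patent does not specify an ordering), and preferring the principal segment when both segments work is an arbitrary choice.

```python
from typing import Dict, List, Optional, Tuple

# (principal_ok, standby_ok) for one host; a hypothetical representation.
SegmentState = Tuple[bool, bool]


def rebuild_ring(states: Dict[str, SegmentState]) -> Optional[List[Tuple[str, str, str]]]:
    """Return ring links as (host, segment, next_host).

    Returns None when some host has lost both segments, because the
    reconnection condition of module 24 is then not met.
    """
    if any(not p and not s for p, s in states.values()):
        return None
    # One working segment per host; hosts are ordered by name for determinism.
    members = [(h, "principal" if p else "standby") for h, (p, s) in sorted(states.items())]
    n = len(members)
    # Each host links to its successor, and the last host links back to the first.
    return [(h, seg, members[(i + 1) % n][0]) for i, (h, seg) in enumerate(members)]
```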
It can be seen that, in this embodiment of the present invention, when both the main ring communication network and the standby ring communication network have a network interruption failure, the working state of the communication network segments inside every host is monitored in real time. If no host has a network segment failure on both its principal communication network segment and its standby communication network segment, that is, if every host in the current group system has at least one communication network segment that works normally, the ring communication network in the group system is rebuilt using one currently working communication network segment inside every host, and the working network of the group system is switched to that ring communication network. Thus, when both ring communication networks have failed, a host that still has at least one working communication network segment can be added to the new ring communication network instead of being forcibly isolated, so this embodiment reduces the probability that a host in the group system is isolated because of a network segment failure.
Further, the redundant network management platform in this embodiment may also include:
A host isolation module, configured to, when the host network segment monitoring module detects that some host in the current group system has a network segment failure on both its principal communication network segment and its standby communication network segment, isolate every host in the current group system whose principal and standby communication network segments have both failed.
In addition, the communication network reconnection module may be further configured to, after the host isolation module has performed the isolation, rebuild the ring communication network in the group system using one currently working communication network segment inside every host in the group system that has not been isolated, and switch the working network of the group system to that ring communication network.
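The isolation step can be sketched as a simple partition. The function name `isolate_failed_hosts` and the boolean-pair representation of segment state are assumptions made for illustration only.

```python
from typing import Dict, List, Tuple


def isolate_failed_hosts(
    states: Dict[str, Tuple[bool, bool]],
) -> Tuple[List[str], Dict[str, str]]:
    """Split hosts into isolated hosts and ring members.

    `states` maps a host name to (principal_ok, standby_ok). A host is
    isolated only when both of its segments have failed; every other host
    stays in the group and contributes one working segment.
    """
    isolated = sorted(h for h, (p, s) in states.items() if not p and not s)
    remaining = {
        h: ("principal" if p else "standby")  # preferring principal is an assumption
        for h, (p, s) in states.items()
        if p or s
    }
    return isolated, remaining
```

The hosts in `remaining` are the ones from which the new ring communication network would then be rebuilt.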
The redundant network management platform in this embodiment may further include a network direct switching module, configured to switch the working network of the group system to the standby ring communication network when the communication network monitoring module detects that only the main ring communication network has a network interruption failure.
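Taken together, the modules amount to a small decision procedure. The sketch below is one possible reading of it; the `Action` names and the `decide` function are hypothetical, and keeping the main ring whenever it is healthy (even if the standby ring has failed) is an assumption, since the text only states that the main ring serves as the working network.

```python
from enum import Enum
from typing import Dict, Tuple


class Action(Enum):
    KEEP_MAIN_RING = "keep_main_ring"
    SWITCH_TO_STANDBY_RING = "switch_to_standby_ring"
    REBUILD_RING = "rebuild_ring"
    ISOLATE_THEN_REBUILD = "isolate_then_rebuild"


def decide(
    main_ring_ok: bool,
    standby_ring_ok: bool,
    states: Dict[str, Tuple[bool, bool]],
) -> Action:
    """Choose the platform's reaction; `states` maps host -> (principal_ok, standby_ok)."""
    if main_ring_ok:
        return Action.KEEP_MAIN_RING
    if standby_ring_ok:
        # Only the main ring failed: direct switch to the standby ring.
        return Action.SWITCH_TO_STANDBY_RING
    # Both rings failed: inspect the segments inside every host.
    if any(not p and not s for p, s in states.values()):
        return Action.ISOLATE_THEN_REBUILD
    return Action.REBUILD_RING
```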
For the more detailed working process of each of the above modules, refer to the related content in the previous embodiments, which is not repeated here.
Finally, it should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise", and any other variant are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to the process, method, article, or device. Without further limitation, an element introduced by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes that element.
The redundant network management method and platform of a group system provided by the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the invention, and the description of the above embodiments is only intended to help in understanding the method of the invention and its core idea. Meanwhile, those of ordinary skill in the art may, following the idea of the invention, make changes to the specific implementation and the scope of application. In conclusion, the content of this specification should not be construed as limiting the invention.

Claims (10)

1. A redundant network management method of a group system, characterized in that the communication network segments in the group system include a principal communication network segment and a standby communication network segment; the method comprises:
Using the principal communication network segment and the standby communication network segment, respectively, to connect all hosts in the group system in advance, obtaining a corresponding main ring communication network and standby ring communication network, with the main ring communication network serving as the working network of the group system;
Monitoring the main ring communication network and the standby ring communication network in real time, and, if both the main ring communication network and the standby ring communication network are detected to currently have a network interruption failure, monitoring the working state of the communication network segments inside every host in the current group system;
If it is detected that no host in the current group system has a network segment failure on both its principal communication network segment and its standby communication network segment, rebuilding the ring communication network in the group system using one currently working communication network segment inside every host in the group system, and switching the working network of the group system to that ring communication network; wherein, while the ring communication network in the group system is being rebuilt, if neither the principal communication network segment nor the standby communication network segment of a host has a network segment failure, either the principal communication network segment or the standby communication network segment of that host is used to rebuild the ring communication network in the group system.
2. The redundant network management method of a group system according to claim 1, characterized by further comprising:
While monitoring the working state of the communication network segments inside every host in the current group system, if it is detected that a host in the current group system has a network segment failure on both its principal communication network segment and its standby communication network segment, isolating every host in the current group system whose principal and standby communication network segments have both failed.
3. The redundant network management method of a group system according to claim 2, characterized by further comprising:
After the isolation, rebuilding the ring communication network in the group system using one currently working communication network segment inside every host in the group system that has not been isolated, and switching the working network of the group system to that ring communication network.
4. The redundant network management method of a group system according to claim 1, characterized by further comprising:
While monitoring the main ring communication network and the standby ring communication network in real time, if it is detected that only the main ring communication network has a network interruption failure, switching the working network of the group system to the standby ring communication network.
5. The redundant network management method of a group system according to any one of claims 1 to 4, characterized in that, before the process of rebuilding the ring communication network in the group system using one currently working communication network segment inside every host in the group system, the method further comprises:
Counting the total number of hosts in the current group system whose principal communication network segment works normally, to obtain a first quantity;
Counting the total number of hosts in the current group system whose standby communication network segment works normally, to obtain a second quantity.
6. The redundant network management method of a group system according to claim 5, characterized in that the process of rebuilding the ring communication network in the group system using one currently working communication network segment inside every host in the group system comprises:
When the first quantity is greater than or equal to the second quantity, rebuilding the ring communication network in the group system based on a first default network establishment principle;
Wherein the first default network establishment principle is specifically:
The principal communication network segments of the first-class hosts and the standby communication network segments of the second-class hosts are used to rebuild the ring communication network; wherein the first-class hosts include the hosts whose principal and standby communication network segments both currently work normally, and the hosts whose principal communication network segment alone currently works normally; the second-class hosts include the hosts whose standby communication network segment alone currently works normally.
7. The redundant network management method of a group system according to claim 5, characterized in that the process of rebuilding the ring communication network in the group system using one currently working communication network segment inside every host in the group system comprises:
When the first quantity is less than the second quantity, rebuilding the ring communication network in the group system based on a second default network establishment principle;
Wherein the second default network establishment principle is specifically:
The principal communication network segments of the third-class hosts and the standby communication network segments of the fourth-class hosts are used to rebuild the ring communication network; wherein the third-class hosts include the hosts whose principal communication network segment alone currently works normally; the fourth-class hosts include the hosts whose principal and standby communication network segments both currently work normally, and the hosts whose standby communication network segment alone currently works normally.
8. A redundant network management platform of a group system, characterized in that the communication network segments in the group system include a principal communication network segment and a standby communication network segment; the redundant network management platform comprises:
A communication network connection module, configured to use the principal communication network segment and the standby communication network segment, respectively, to connect all hosts in the group system in advance, obtaining a corresponding main ring communication network and standby ring communication network, with the main ring communication network serving as the working network of the group system;
A communication network monitoring module, configured to monitor the main ring communication network and the standby ring communication network in real time;
A host network segment monitoring module, configured to monitor the working state of the communication network segments inside every host in the current group system when the communication network monitoring module detects that both the main ring communication network and the standby ring communication network currently have a network interruption failure;
A communication network reconnection module, configured to, when the host network segment monitoring module detects that no host in the current group system has a network segment failure on both its principal communication network segment and its standby communication network segment, rebuild the ring communication network in the group system using one currently working communication network segment inside every host in the group system, and switch the working network of the group system to that ring communication network; wherein, while the ring communication network in the group system is being rebuilt, if neither the principal communication network segment nor the standby communication network segment of a host has a network segment failure, either the principal communication network segment or the standby communication network segment of that host is used to rebuild the ring communication network in the group system.
9. The redundant network management platform of a group system according to claim 8, characterized by further comprising:
A host isolation module, configured to, when the host network segment monitoring module detects that a host in the current group system has a network segment failure on both its principal communication network segment and its standby communication network segment, isolate every host in the current group system whose principal and standby communication network segments have both failed.
10. The redundant network management platform of a group system according to claim 9, characterized in that the communication network reconnection module is further configured to, after the host isolation module has performed the isolation, rebuild the ring communication network in the group system using one currently working communication network segment inside every host in the group system that has not been isolated, and switch the working network of the group system to that ring communication network.
CN201610573169.5A 2016-07-19 2016-07-19 A kind of the redundant network management method and platform of group system Active CN105959172B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610573169.5A CN105959172B (en) 2016-07-19 2016-07-19 A kind of the redundant network management method and platform of group system

Publications (2)

Publication Number Publication Date
CN105959172A CN105959172A (en) 2016-09-21
CN105959172B true CN105959172B (en) 2019-01-18

Family

ID=56900408

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610573169.5A Active CN105959172B (en) 2016-07-19 2016-07-19 A kind of the redundant network management method and platform of group system

Country Status (1)

Country Link
CN (1) CN105959172B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107360041A (en) * 2017-08-18 2017-11-17 郑州云海信息技术有限公司 A kind of network management and device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1658578A (en) * 2005-04-05 2005-08-24 北京四方继保自动化股份有限公司 Non-break switchover method of double-network communication system
CN101036330A (en) * 2004-12-01 2007-09-12 思科技术公司 System and methods for detecting network failure
CN101079781A (en) * 2007-02-01 2007-11-28 北京东土科技股份有限公司 An implementation method for industrial Ethernet fast-speed redundancy
CN101137974A (en) * 2003-10-07 2008-03-05 思科技术公司 Enhanced switchover for mpls fast reroute
CN102394787A (en) * 2011-12-14 2012-03-28 重庆邮电大学 Dual-link redundancy control method based on EPA switch
CN104660386A (en) * 2015-03-03 2015-05-27 浪潮电子信息产业股份有限公司 Method for improving DB2 disaster recovery high availability based on Itanium platform
CN105681070A (en) * 2014-11-21 2016-06-15 中芯国际集成电路制造(天津)有限公司 Method and system for automatically collecting and analyzing computer cluster node information

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100389571C (en) * 2005-03-25 2008-05-21 华为技术有限公司 Method for detecting chain circuit fault between end-to-end notes in mixed network
CN101577719B (en) * 2009-06-09 2016-03-02 华为技术有限公司 A kind of double hot standby method, device and system
CN102780635B (en) * 2012-08-09 2015-09-09 华为技术有限公司 The method of pretection switch, TOR switch and system is realized based on TRILL network

Also Published As

Publication number Publication date
CN105959172A (en) 2016-09-21

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant