CN112118128B - Rapid identification method and recovery method for online executive fault - Google Patents
Rapid identification method and recovery method for online executive fault Download PDFInfo
- Publication number
- CN112118128B CN112118128B CN202010832329.XA CN202010832329A CN112118128B CN 112118128 B CN112118128 B CN 112118128B CN 202010832329 A CN202010832329 A CN 202010832329A CN 112118128 B CN112118128 B CN 112118128B
- Authority
- CN
- China
- Prior art keywords
- online
- input stream
- configuration information
- executive body
- feedback controller
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
- H04L41/0659—Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities
- H04L41/0661—Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities by reconfiguring faulty entities
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/085—Retrieval of network configuration; Tracking network configuration history
- H04L41/0859—Retrieval of network configuration; Tracking network configuration history by keeping history of different configuration generations or by rolling back to previous configuration versions
- H04L41/0863—Retrieval of network configuration; Tracking network configuration history by keeping history of different configuration generations or by rolling back to previous configuration versions by rolling back to previous configuration versions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Environmental & Geological Engineering (AREA)
- Debugging And Monitoring (AREA)
- Test And Diagnosis Of Digital Computers (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
The invention provides a method for quickly identifying faults of an on-line executive body, which comprises the following steps: the input agent module uploads the input stream or the hash value of the input stream to the feedback controller while distributing the input stream to the plurality of online executors, and the feedback controller stores the input stream or the hash value of the input stream into an input agent message queue according to a receiving sequence; after each online executive receives the input stream, uploading the hash value of the input stream or the input stream to a feedback controller, and storing the hash value of the input stream or the input stream into a corresponding online executive message queue by the feedback controller according to the receiving sequence; when the scheduler schedules the new execution body to be online, the feedback controller compares the input agent message queue with the input stream or the hash value of the input stream in each current online execution body message queue, and if the input stream or the hash value of the input stream is consistent with the hash value, the current online execution body is judged to have no fault; and if the current online executive bodies are inconsistent, judging that the current online executive bodies have faults.
Description
Technical Field
The invention relates to the field of mimicry switches, in particular to a method for quickly identifying and recovering faults of an online executive body.
Background
The existing mimic switch device is shown in fig. 1, and the device ensures the correctness of the arbitration result of the switch configuration through an arbitrator, and after the arbitration is completed, the hardware configuration process of the switch is completed through a command system. For the situation of bypassing the input agent module to directly attack the executive body, after the N-1 mode attack or the N mode attack is triggered, because the arbitrator cannot identify the N-1 mode attack or the N mode attack, the wrong arbitration result is output to the command system as a correct arbitration result, and therefore the hardware configuration process of the switch is influenced.
In order to solve the above problems, people always seek an ideal technical solution.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, and provides a quick identification method and a recovery method for online executive body faults.
In order to achieve the purpose, the technical scheme adopted by the invention is as follows: the invention provides a quick identification method of faults of an online executive body, which comprises the following steps:
the input agent module uploads the input stream or the hash value of the input stream to the feedback controller while distributing the input stream to a plurality of online executors, and the feedback controller stores the input stream or the hash value of the input stream into an input agent message queue according to a receiving sequence;
after each online executive receives the input stream, uploading the input stream or the hash value of the input stream to a feedback controller, and storing the hash value of the input stream or the input stream into a corresponding online executive message queue by the feedback controller according to a receiving sequence;
when the scheduler schedules the new execution body to be online, the feedback controller compares the input agent message queue with the input stream or the hash value of the input stream in each current online execution body message queue, and if the input stream or the hash value of the input stream is consistent with the hash value, the current online execution body is judged to have no fault; and if the current online executive bodies are inconsistent, judging that the current online executive bodies have faults.
Based on the above, the scheduler regularly schedules the offline of the online executive or schedules the offline of the online executive with a fault according to the arbitration result of the arbitrator, and simultaneously schedules the online of the new executive.
Based on the above, the configuration information includes configuration commands and/or configuration message information, and the configuration process data includes MD5 values of the configuration commands and/or configuration message information.
The second aspect of the invention provides a method for rapidly recovering a fault of a mimic switch, wherein the mimic switch adopts the method for rapidly identifying a fault of an online executive body to identify whether the current online executive body has a fault, wherein the input stream of the mimic switch is configuration information;
if no fault exists, the feedback controller carries out configuration synchronization on the new online executive body and the current online executive body;
if the fault exists, the feedback controller acquires a serial number corresponding to the first inconsistent content in the input agent message queue and the current online executive message queue, searches and acquires corresponding historical configuration information from the configuration information queue according to the acquired serial number, and performs configuration recovery on a new online executive by using the historical configuration information;
sending the current online executive body abnormal information to a scheduler, after receiving the current online executive body abnormal information, the scheduler offline the current online executive body, scheduling a new executive body online, and performing configuration recovery on the new online executive body by using the historical configuration information obtained by searching;
the configuration information queue is used for storing the historical configuration information of the mimicry switch uploaded by the input agent module.
Based on the above, after receiving the new configuration information, the input agent module is triggered to obtain the historical configuration information of the mimicry switch and uploads the historical configuration information to the feedback controller, and the feedback controller updates the configuration information queue according to the received historical configuration information.
Based on the above, the configuration and synchronization of the new online executor and the current online executor performed by the feedback controller specifically includes:
and the feedback controller acquires the last piece of historical configuration information from the configuration information queue and performs configuration recovery on the new online executive body by using the historical configuration information.
Based on the above, after the new online executive is configured and recovered, the data in the input agent message queue, the online executive message queue and the configuration information queue is cleared.
A third aspect of the present invention provides a computer apparatus comprising: the system comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor executes the program to realize the quick identification method of the online executive body fault.
A fourth aspect of the present invention provides a computer apparatus comprising: the system comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor executes the program to realize the rapid recovery method of the pseudo-switch failure.
A fifth aspect of the present invention provides a computer-readable storage medium, in which computer-executable instructions are stored, and when the computer-executable instructions are executed by a processor, the method for rapidly recovering from a fault of a pseudo switch is implemented.
Compared with the prior art, the invention has prominent substantive characteristics and remarkable progress, particularly:
1) the invention utilizes the input agent message list related to the input stream distributed by the input agent module and the online executive body message list related to the input stream received by the current online executive body to carry out backward comparison verification, and judges that the current online executive body has a fault when the verification result is inconsistent, thereby being capable of timely finding out N-mode attack or N-1-mode attack bypassing the input agent module, not waiting for the next online executive body to regularly go up and down or follow-up attack, and greatly reducing the damage effect generated when the system is attacked;
2) for the mimicry switch, when the current online fault body is judged to have a fault, historical reliable configuration information is searched by using the historical configuration information stored in the configuration information queue to perform configuration recovery rollback, so that the rapid recovery of the fault of the mimicry switch is facilitated.
Drawings
Figure 1 is a diagram of a simulated switch standard architecture.
Fig. 2 is a flow chart of the fast recognition method according to the present invention.
Fig. 3 is a flow chart of a fast recovery method according to the present invention.
Detailed Description
The technical solution of the present invention is further described in detail by the following embodiments.
Example 1
As shown in fig. 2, a method for quickly identifying a fault of an online executive, includes the following steps:
the input agent module uploads the input stream or the hash value of the input stream to the feedback controller while distributing the input stream to a plurality of online executors, and the feedback controller stores the input stream or the hash value of the input stream into an input agent message queue according to a receiving sequence;
after each online executive receives the input stream, uploading the hash value of the input stream or the input stream to a feedback controller, and storing the hash value of the input stream or the input stream into a corresponding online executive message queue by the feedback controller according to the receiving sequence;
when the scheduler schedules the new execution body to be online, the feedback controller compares the input agent message queue with the input stream or the hash value of the input stream in each current online execution body message queue, and if the input stream or the hash value of the input stream is consistent with the hash value, the current online execution body is judged to have no fault; and if the two are not consistent, judging that the current online executive body has a fault.
Specifically, the scheduler may schedule the new execution entity online at a timing that the online execution entity offline is scheduled according to a preset time interval, or schedule the online execution entity with a fault offline and schedule the new execution entity online according to the arbitration result of the arbitrator.
The invention utilizes the input agent message list related to the input stream distributed by the input agent module and the online executive message list related to the input stream received by the current online executive to carry out backward comparison verification, and judges that the current online executive has a fault when the verification result is inconsistent, thereby being capable of timely finding out the N-mode attack or N-1-mode attack bypassing the input agent module, avoiding waiting until the next online executive regularly goes up and down or the subsequent attack happens, and greatly reducing the harm effect generated when the system is attacked.
Example 2
As shown in fig. 3, this embodiment provides a method for quickly recovering from a fault of a mimic switch, where the mimic switch identifies whether a fault exists in a current online executor by using the method for quickly identifying a fault of an online executor described in embodiment 1, where an input stream of the mimic switch is configuration information, and specifically, the configuration information includes a configuration command and/or configuration message information;
if no fault exists, the feedback controller carries out configuration synchronization on the new online executive body and the current online executive body;
if the fault exists, the feedback controller acquires a serial number corresponding to the first inconsistent content in the input agent message queue and the current online executive message queue, searches and acquires corresponding historical configuration information from the configuration information queue according to the acquired serial number, and performs configuration recovery on a new online executive by using the historical configuration information;
sending current online executive body abnormal information to a scheduler, after receiving the current online executive body abnormal information, the scheduler offline the current online executive body, scheduling a new executive body online, and performing configuration recovery on the new online executive body by using the historical configuration information obtained by searching;
the configuration information queue is used for storing historical configuration information of the mimicry switch uploaded by the input agent module.
Specifically, after receiving new configuration information, the input agent module is triggered to acquire historical configuration information of the mimicry switch and upload the historical configuration information to the feedback controller, and the feedback controller updates the configuration information queue according to the received historical configuration information.
In this embodiment, the step of configuring and synchronizing the new online executor and the current online executor by the feedback controller specifically includes:
and the feedback controller acquires the last piece of historical configuration information from the configuration information queue and performs configuration recovery on the new online executive body by using the historical configuration information.
Specifically, after the new online executive is configured and recovered, the data in the input proxy message queue, the online executive message queue and the configuration information queue is cleared. By setting the storage time of the message queue to a scheduling period, backward verification and search can be performed quickly.
Specifically, after the online executive is offline, the scheduler may select whether to schedule a new executive from the executive pool to be online or to use the offline online executive as a new executive to be online according to the number of executables in the executive pool.
For the mimicry switch, when the current online fault body is judged to have a fault, historical reliable configuration information is searched by using the historical configuration information stored in the configuration information queue to perform configuration recovery rollback, so that the rapid recovery of the fault of the mimicry switch is facilitated.
Example 3
The present embodiment provides a computer device, including: the system comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor executes the program to realize the method for quickly identifying the fault of the online executive body in the embodiment 1.
Example 4
The present embodiment provides a computer device, including: the system comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor executes the program to realize the pseudo-switch failure fast recovery method of the embodiment 2.
Example 5
The present embodiment provides a computer-readable storage medium, in which computer-executable instructions are stored, and when the computer-executable instructions are executed by a processor, the method for rapidly recovering from a fault of a pseudo switch according to embodiment 2 is implemented.
Finally, it should be noted that the above examples are only used to illustrate the technical solutions of the present invention and not to limit the same; although the present invention has been described in detail with reference to preferred embodiments, those skilled in the art will understand that: modifications to the specific embodiments of the invention or equivalent substitutions for parts of the technical features may be made; without departing from the spirit of the present invention, it is intended to cover all aspects of the invention as defined by the appended claims.
Claims (8)
1. A method for quickly identifying faults of an on-line executive body is characterized by comprising the following steps:
the input agent module uploads the input stream or the hash value of the input stream to the feedback controller while distributing the input stream to a plurality of online executors, and the feedback controller stores the input stream or the hash value of the input stream into an input agent message queue according to a receiving sequence;
after each online executive receives the input stream, uploading the input stream or the hash value of the input stream to a feedback controller, and storing the hash value of the input stream or the input stream into a corresponding online executive message queue by the feedback controller according to a receiving sequence;
when the scheduler schedules the new execution body to be online, the feedback controller compares the input agent message queue with the input stream or the hash value of the input stream in each current online execution body message queue, and if the input stream or the hash value of the input stream is consistent with the hash value, the current online execution body is judged to have no fault; and if the two are not consistent, judging that the current online executive body has a fault.
2. The method for rapidly identifying faults of an online executive body according to claim 1, wherein the method comprises the following steps: the scheduler schedules the offline of the online executive body at regular time or schedules the offline of the online executive body with a fault according to the arbitration result of the arbitrator, and schedules the online of a new executive body.
3. A method for rapidly recovering a fault of a mimic switch is characterized by comprising the following steps: the mimicry switch adopts the method for rapidly identifying the faults of the online executive body as claimed in claim 1 or 2 to identify whether the current online executive body has faults or not, wherein the input stream of the mimicry switch is configuration information;
if no fault exists, the feedback controller carries out configuration synchronization on the new online executive body and the current online executive body: the feedback controller acquires the last piece of historical configuration information from the configuration information queue, and the new online executive body is subjected to configuration recovery by using the historical configuration information;
if the fault exists, the feedback controller acquires a serial number corresponding to the first inconsistent content in the input agent message queue and the current online executive message queue, searches and acquires corresponding historical configuration information from the configuration information queue according to the acquired serial number, and performs configuration recovery on a new online executive by using the historical configuration information;
sending current online executive body abnormal information to a scheduler, after receiving the current online executive body abnormal information, the scheduler offline the current online executive body, scheduling a new executive body online, and performing configuration recovery on the new online executive body by using the historical configuration information obtained by searching;
the configuration information queue is used for storing the historical configuration information of the mimicry switch uploaded by the input agent module.
4. The method for rapidly recovering from the fault of the mimic switch according to claim 3, wherein: after receiving new configuration information, the input agent module is triggered to acquire historical configuration information of the mimicry switch and uploads the historical configuration information to the feedback controller, and the feedback controller updates the configuration information queue according to the received historical configuration information.
5. The method for rapidly recovering from the fault of the mimic switch according to claim 3, wherein: and after the new online executive body is configured and recovered, clearing the data in the input agent message queue, the online executive body message queue and the configuration information queue.
6. A computer device, comprising: memory, processor, and computer program stored on the memory and executable on the processor, which when executed by the processor implements the method of fast recognition of faults in an online executable of claim 1 or 2.
7. A computer device, comprising: memory, a processor, and a computer program stored on the memory and executable on the processor, the processor implementing the method for rapid recovery from a simulated switch failure of any of claims 3-5 when executing the program.
8. A computer-readable storage medium having stored thereon computer-executable instructions which, when executed by a processor, implement the pseudo switch failure fast recovery method of any one of claims 3-5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010832329.XA CN112118128B (en) | 2020-08-18 | 2020-08-18 | Rapid identification method and recovery method for online executive fault |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010832329.XA CN112118128B (en) | 2020-08-18 | 2020-08-18 | Rapid identification method and recovery method for online executive fault |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112118128A CN112118128A (en) | 2020-12-22 |
CN112118128B true CN112118128B (en) | 2022-09-06 |
Family
ID=73804975
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010832329.XA Active CN112118128B (en) | 2020-08-18 | 2020-08-18 | Rapid identification method and recovery method for online executive fault |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112118128B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112668050A (en) * | 2020-12-31 | 2021-04-16 | 河南信大网御科技有限公司 | Execution body state recovery method and system based on configuration file |
CN112653707B (en) * | 2020-12-31 | 2022-08-16 | 河南信大网御科技有限公司 | Enhanced mimicry input agent |
CN113905011B (en) * | 2021-09-06 | 2023-08-04 | 河南信大网御科技有限公司 | Method and system for synchronizing Arp table of mimicry equipment |
CN115277163A (en) * | 2022-07-22 | 2022-11-01 | 杭州安司源科技有限公司 | Mimicry transformation method based on label |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5295258A (en) * | 1989-12-22 | 1994-03-15 | Tandem Computers Incorporated | Fault-tolerant computer system with online recovery and reintegration of redundant components |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106411937B (en) * | 2016-11-15 | 2017-12-29 | 中国人民解放军信息工程大学 | Zero-day attacks detection, analysis and response system and its method based on mimicry defence framework |
US10289460B2 (en) * | 2016-11-15 | 2019-05-14 | Microsoft Technology Licensing, Llc | System integration using configurable dataflow |
CN109325349A (en) * | 2018-08-15 | 2019-02-12 | 深圳市元征科技股份有限公司 | A kind of method for managing security, terminal device and computer readable storage medium |
CN110581852A (en) * | 2019-09-11 | 2019-12-17 | 河南信大网御科技有限公司 | Efficient mimicry defense system and method |
CN111181926B (en) * | 2019-12-13 | 2022-04-05 | 中国人民解放军战略支援部队信息工程大学 | A security device based on the idea of mimic defense and its operation method |
-
2020
- 2020-08-18 CN CN202010832329.XA patent/CN112118128B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5295258A (en) * | 1989-12-22 | 1994-03-15 | Tandem Computers Incorporated | Fault-tolerant computer system with online recovery and reintegration of redundant components |
Non-Patent Citations (1)
Title |
---|
Abordagem de Balanceamento de Negociacao entre Agentes em Sincronizacao de Trafego;Marcos Vinicius Pinheiro Dib等;《IEEE》;20070930;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN112118128A (en) | 2020-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112118128B (en) | Rapid identification method and recovery method for online executive fault | |
US20200012575A1 (en) | Method and system for data recovery in a data system | |
CN106547613B (en) | Task processing flow scheduling and fault recovery method | |
CN111400104B (en) | Data synchronization method and device, electronic equipment and storage medium | |
CN113656175B (en) | Method and equipment for training model based on distributed system | |
CN106445755A (en) | Method for automatically testing integral cabinet servers | |
WO2018149396A1 (en) | Service flow processing method and apparatus, readable storage medium and terminal device | |
US10274919B2 (en) | Method, device and computer program product for programming a plurality of control units | |
WO2023165512A1 (en) | Fault file storage method and related apparatus | |
US20160335170A1 (en) | Model checking device for distributed environment model, model checking method for distributed environment model, and medium | |
CN105335244B (en) | Method for application recovery | |
CN112395119B (en) | Abnormal data processing method, device, server and storage medium | |
JP4239989B2 (en) | Fault recovery system, fault recovery device, rule creation method, and fault recovery program | |
CN105045678A (en) | Database recovery method and apparatus | |
CN104683443A (en) | Linkage service system and linkage service method applied in integrated rail transit supervisory control system | |
CN112346842A (en) | Workflow task scheduling method based on directed acyclic graph | |
CN109063846B (en) | Machine learning operation method, device, equipment and storage medium | |
CN108924002B (en) | A method, device, device and storage medium for parsing performance data file | |
CN114720894B (en) | Battery charge and discharge test connection method and system | |
CN111814415B (en) | Efficient regression testing method for chip verification | |
CN110113395B (en) | Shared file system maintenance method and device | |
KR101610220B1 (en) | Method for software pipelining of multi-domain motion control, recording medium and device for performing the method | |
CN113760409A (en) | Service instance management method, device, equipment and storage medium | |
CN113296945A (en) | Continuous integration method, system, equipment and computer readable storage medium | |
EP4495819A1 (en) | Device, system and method for implementing a recurrent neural network to determine a given region of a build log that meets a fatal error criterion condition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |