[go: up one dir, main page]

CN119597708A - Equipment system and main/standby switching method thereof, and electronic equipment - Google Patents

Equipment system and main/standby switching method thereof, and electronic equipment Download PDF

Info

Publication number
CN119597708A
CN119597708A CN202411748941.3A CN202411748941A CN119597708A CN 119597708 A CN119597708 A CN 119597708A CN 202411748941 A CN202411748941 A CN 202411748941A CN 119597708 A CN119597708 A CN 119597708A
Authority
CN
China
Prior art keywords
device node
node
signal
equipment
standby switching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202411748941.3A
Other languages
Chinese (zh)
Inventor
胡安沙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Metabrain Intelligent Technology Co Ltd
Original Assignee
Suzhou Metabrain Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Metabrain Intelligent Technology Co Ltd filed Critical Suzhou Metabrain Intelligent Technology Co Ltd
Priority to CN202411748941.3A priority Critical patent/CN119597708A/en
Publication of CN119597708A publication Critical patent/CN119597708A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/78Architectures of general purpose stored program computers comprising a single central processing unit
    • G06F15/7867Architectures of general purpose stored program computers comprising a single central processing unit with reconfigurable architecture
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/14Handling requests for interconnection or transfer
    • G06F13/20Handling requests for interconnection or transfer for access to input/output bus
    • G06F13/24Handling requests for interconnection or transfer for access to input/output bus using interrupt
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computer Hardware Design (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention discloses an equipment system, a main/standby switching method thereof and electronic equipment, which relate to the technical field of computers and are applied to a programmable logic device on a control board, and comprise the steps of obtaining a target signal transmitted by a first equipment node on the equipment system; the method comprises the steps of determining whether a preset main/standby switching condition is met or not according to a target signal, generating a first hard interrupt signal and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that main/standby switching operation is about to be triggered, wherein the second equipment node is a equipment node which is in main/standby relation with the first equipment node, and executing main/standby switching operation between the first equipment node and the second equipment node after transmitting the hard interrupt signal to the second equipment node. The invention improves the efficiency of main/standby switching.

Description

Equipment system, main/standby switching method thereof and electronic equipment
Technical Field
The present invention relates to the field of computer technologies, and in particular, to an equipment system, a main/standby switching method thereof, and an electronic device.
Background
With the rapid development of information technology, servers are increasingly used in various fields. In some critical business scenarios, the reliability and availability requirements for servers are extremely high. Once a single-node server fails, service interruption may be caused, and huge losses are brought to users. To improve the reliability and availability of servers, dual node servers have evolved. The dual-node server is generally composed of two independent equipment nodes, and the cooperative work is realized through a certain technical means. Under normal conditions, two nodes can bear the service load together to improve the performance of the server, and when one node fails, the other node can automatically take over the service of the failed node to ensure the continuity of the service.
When the existing dual-node server performs active-standby switching, a series of complex detection and judgment processes are generally required, which can lead to longer switching time, and in some service scenes with higher requirements on real-time performance, the longer switching time can lead to service interruption and bring loss to users.
It can be seen how to improve the efficiency of active-standby switching is a problem that needs to be solved by those skilled in the art.
Disclosure of Invention
The embodiment of the invention aims to provide an equipment system, a main/standby switching method thereof and electronic equipment, and improves the efficiency of main/standby switching. The specific scheme is as follows:
In a first aspect, the present invention discloses a main/standby switching method, which is applied to a programmable logic device on a control board, and includes:
The method comprises the steps of acquiring a target signal transmitted by a first equipment node on an equipment system, wherein the first equipment node is a master equipment node or a slave equipment node, and the target signal is a heartbeat signal of the equipment node or a soft interrupt signal triggered by the equipment node;
Determining whether a preset main/standby switching condition is met currently or not based on the target signal;
If the preset main-equipment switching condition is met currently, generating a first hard interrupt signal, and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that main-equipment switching operation is about to be triggered;
And after the hard interrupt signal is transmitted to the second equipment node, executing the active-standby switching operation between the first equipment node and the second equipment node.
Optionally, the active-standby switching method further includes:
In the initial starting process of the equipment system, a low-level signal is transmitted to any equipment node in the equipment system through a pull-down resistor of the control board, so that the equipment node receiving the low-level signal judges the role of the equipment node as an initial main equipment node; and transmitting a high-level signal to any one of the device nodes in the device system through a pull-up resistor of the control board, so that the device node receiving the high-level signal determines the role of the device node as an initial slave device node.
Optionally, after the low level signal is transmitted to any device node in the device system through the pull-down resistor of the control board, the method further includes:
Recording the starting time of the equipment node which receives the low-level signal transmitted by the pull-down resistor;
If the time interval between the current starting time and the starting time is not equal to the preset delay starting interval, other equipment nodes are forbidden to be started, and if the first single board state position of the equipment node receiving the low-level signal is a normal position and the second single board state position of the other equipment node is an abnormal position, the equipment node receiving the low-level signal is determined to be an initial main equipment node;
and if the time interval between the current starting time and the starting time is equal to a preset delay starting interval, starting other equipment nodes, and if the first single board state position and the second single board state position of the other equipment nodes are both normal positions, determining the other equipment nodes as initial slave equipment nodes.
Optionally, acquiring a heartbeat signal transmitted by a first device node on the device system includes:
acquiring a heartbeat signal which is transmitted by the first equipment node and is obtained by superposing a type-coded signal and an operation state-coded signal according to different frequencies;
the type-coded signals and the running state-coded signals are signals obtained by the first equipment node through respectively carrying out signal coding on local hardware type signals and hardware running state signals according to different frequencies;
Correspondingly, the determining whether the preset active-standby switching condition is met based on the target signal includes:
Analyzing the heartbeat signal by utilizing different analysis clocks to obtain a state notification signal containing hardware running states corresponding to the hardware types in the first equipment node;
the state notification signal is sent to a second equipment node, and whether a first preset main equipment switching condition is met or not is determined based on the state notification signal;
the first preset master device switching condition is that the state notification signal characterizes that the hardware running state of the first device node meets a preset fault condition.
Optionally, the active-standby switching method further includes:
if a second hard interrupt signal transmitted by the first equipment node is received, detecting whether the state notification signal corresponding to the first equipment node meets a preset control right acquisition condition or not, wherein the second hard interrupt signal characterizes the first equipment node to trigger a request for acquiring the control right of a shared component;
If the state notification signal corresponding to the first equipment node meets the preset control right acquisition condition, disconnecting a path between a second equipment node and the shared component, and establishing the path between the first equipment node and the shared component;
And sending a third hard interrupt signal to the first equipment node and the second equipment node, wherein the third hard interrupt signal characterizes that the control right of the shared component belongs to the first equipment node.
Optionally, if the target signal is a soft interrupt signal, determining whether the current target signal meets a preset active/standby switching condition includes:
and determining whether a second preset main equipment switching condition is met or not currently based on the soft interrupt signal, wherein the second preset main equipment switching condition is that the soft interrupt signal characterizes the first equipment node to trigger a main equipment switching request.
Optionally, the active-standby switching method further includes:
monitoring whether the number of current service processing requests of the client is larger than a preset threshold value or not;
If the number of the current service processing requests is greater than the preset threshold, generating a fourth hard interrupt signal, and sending the fourth hard interrupt signal to the first equipment node and the second equipment node so as to inform the first equipment node and the second equipment node that the double-master switching operation is about to be triggered;
After the fourth hard interrupt signal is transmitted to the first device node and the second device node, an operation of switching both the first device node and the second device node to master device nodes is performed, so that each master device node processes the current service processing request.
Optionally, before the performing the active-standby switching operation between the first device node and the second device node, the method further includes:
Determining a current master device node and a current slave device node from the first device node and the second device node, and transmitting service data of the current master device node to the current slave device node through a preset target path, wherein the preset target path is a path between a first network card hung on a processor of the first device node and a second network card hung on a processor of the second device node;
Correspondingly, after the main/standby switching operation between the first device node and the second device node is executed, the method further includes:
and processing the current service processing request of the client by using the service data through the new master equipment node obtained after the master-slave switching operation.
In a second aspect, the present invention discloses a master/slave switching device, which is applied to a programmable logic device on a control board, and includes:
The system comprises a signal acquisition module, a signal processing module and a signal processing module, wherein the signal acquisition module is used for acquiring a target signal transmitted by a first equipment node on an equipment system, the first equipment node is a master equipment node or a slave equipment node, and the target signal is a heartbeat signal of the equipment node or a soft interrupt signal triggered by the equipment node;
the judging module is used for determining whether the preset main/standby switching condition is met currently or not based on the target signal;
The signal sending module is used for generating a first hard interrupt signal and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that the main-standby switching operation is about to trigger if the preset main-standby switching condition is met currently;
And the main-standby switching module is used for executing main-standby switching operation between the first equipment node and the second equipment node after the hard interrupt signal is transmitted to the second equipment node.
In a third aspect, the invention discloses an equipment system, which comprises a control board and equipment nodes respectively connected with the control board, wherein a programmable logic device is arranged on the control board, and the programmable logic device implements the steps of the main/standby switching method by executing a computer program.
In a fourth aspect, the present invention discloses an electronic device, comprising:
a memory for storing a computer program;
and the processor is used for executing the computer program to realize the steps of the main/standby switching method disclosed above.
In a fifth aspect, the present invention discloses a computer readable storage medium storing a computer program, where the computer program when executed by a processor implements the steps of the active/standby switching method disclosed above.
The programmable logic device applied to the control panel comprises the steps of obtaining a target signal transmitted by a first equipment node on an equipment system, wherein the first equipment node is a master equipment node or a slave equipment node, the target signal is a heartbeat signal of the equipment node or a soft interrupt signal triggered by the equipment node, determining whether a preset master-slave switching condition is met or not currently based on the target signal, generating a first hard interrupt signal and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that the master-slave switching operation is about to be triggered, the second equipment node is the equipment node which is in a master-slave relation with the first equipment node, and executing the master-slave switching operation between the first equipment node and the second equipment node after the hard interrupt signal is transmitted to the second equipment node.
The method has the advantages that the control board is newly added in the equipment system, the programmable logic device on the control board acquires the target signal of the first equipment node, when the current meeting of the preset main/standby switching condition is determined according to the target signal, the first hard interrupt signal is sent to the second equipment node to inform the second equipment node that the main/standby switching operation is about to be triggered, after the hard interrupt signal is transmitted to the second equipment node, the main/standby switching operation between the first equipment node and the second equipment node is executed, namely the main/standby relation between the first equipment node and the second equipment node is switched, and because the hard interrupt signal has the advantages of quick response and high priority, the quick main/standby switching can be realized, namely the current equipment node can be quickly switched to the next main equipment node and takes over the subsequent service processing work, and the continuity of the service and the stable operation of the system are ensured.
Drawings
For a clearer description of embodiments of the present invention, the drawings that are required to be used in the embodiments will be briefly described, it being apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to the drawings without inventive effort for those skilled in the art.
Fig. 1 is a flowchart of a main/standby switching method according to an embodiment of the present invention;
fig. 2 is a schematic diagram of connection between a specific device node and a control board according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a specific code superposition scheme according to embodiments of the present invention;
fig. 4 is a schematic diagram of a specific analysis waveform of a heartbeat signal according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of a specific device system connection according to an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of a main/standby switching device according to an embodiment of the present invention;
FIG. 7 is a schematic diagram of an apparatus system according to an embodiment of the present invention;
fig. 8 is a block diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without making any inventive effort are within the scope of the present invention.
With the rapid development of information technology, servers are increasingly used in various fields. In some critical business scenarios, the reliability and availability requirements for servers are extremely high. Once a single-node server fails, service interruption may be caused, and huge losses are brought to users. To improve the reliability and availability of servers, dual node servers have evolved. The dual-node server is generally composed of two independent equipment nodes, and the cooperative work is realized through a certain technical means. Under normal conditions, two nodes can bear the service load together to improve the performance of the server, and when one node fails, the other node can automatically take over the service of the failed node to ensure the continuity of the service.
When the existing dual-node server performs active-standby switching, a series of complex detection and judgment processes are generally required, which can lead to longer switching time, and in some service scenes with higher requirements on real-time performance, the longer switching time can lead to service interruption and bring loss to users.
The terms "comprising" and "having" in the description of the invention and in the above-described figures, as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements but may include other steps or elements not expressly listed.
In order to better understand the aspects of the present invention, the present invention will be described in further detail with reference to the accompanying drawings and detailed description.
Next, a main/standby switching scheme provided by the embodiment of the present invention is described in detail. Fig. 1 is a schematic diagram of a main/standby switching method applied to a programmable logic device on a control board, which includes:
step S11, a target signal transmitted by a first equipment node on an equipment system is acquired, wherein the first equipment node is a master equipment node or a slave equipment node, and the target signal is a heartbeat signal of the equipment node or a soft interrupt signal triggered by the equipment node.
The control board includes a Programmable logic device, which may be a Field-Programmable gate array (FPGA) or a complex Programmable logic device (Complex Programmable Logic Device, CPLD).
In the embodiment, the method further comprises the steps of transmitting a low-level signal to any equipment node in the equipment system through a pull-down resistor of the control board in the initial starting process of the equipment system, so that the equipment node receiving the low-level signal judges the role of the equipment node to be an initial master equipment node, and transmitting a high-level signal to any equipment node in the equipment system through a pull-up resistor of the control board, so that the equipment node receiving the high-level signal judges the role of the equipment node to be an initial slave equipment node.
For example, a specific schematic diagram of connection between a device node and a control board is shown in fig. 2, the device system includes a device node 0, a device node 1 and a control board, the control board includes a pull-down resistor R1 and a pull-up resistor R2, the control board is connected with two device nodes through a cable, an interaction signal between the two device nodes and a control signal between the device node and the control board can be transferred, in an initial starting process of the device system, the device node is a master device node or a slave device node, that is, the device system is initially started, a low level signal is transmitted to any device node in the device system through the pull-down resistor of the control board, so that the device node receiving the low level signal determines its role as an initial master device node, and the device node receiving the high level signal is determined as an initial slave device node through the pull-up resistor of the control board, specifically, the device node 0 receiving the low level signal is connected with the pull-down resistor R1 through the cable, so that the device node 1 receiving the high level signal is determined as an initial slave device node when the device node receives the high level signal, and the device node receives the low level signal, and the device node is initially started. Wherein the pull-up resistor, pull-down resistor may be coupled to a complex programmable logic device or baseboard management controller (Baseboard Management Controller, i.e., BMC) of the device NODE via the FM_NODE_0/1 signal.
In this embodiment, after the low level signal is transmitted to any device node in the device system through the pull-down resistor of the control board, the method further includes recording a start time of the device node that receives the low level signal transmitted by the pull-down resistor, prohibiting starting of other device nodes if a time interval between the current start time and the start time is not equal to a preset delay start interval, determining that the device node that receives the low level signal is an initial master device node if a first board state position of the device node that receives the low level signal is a normal position and a second board state position of the other device node is an abnormal position, starting other device nodes if a time interval between the current start time and the start time is equal to a preset delay start interval, and determining that the other device nodes are initial slave device nodes if the first board state position and the second board state position of the other device node are both normal positions.
Further, in order to make the device node that receives the low level signal transmitted by the pull-down resistor determine that the device node is the master device node, a delay start interval may also be set, that is, when the device node that receives the low level signal transmitted by the pull-down resistor starts, other device nodes cannot start, that is, the control board firstly transmits the low level to the device node connected to the pull-down resistor through the pull-down resistor, and determines the start time of the device node, because the other device nodes have not reached the start time, the first board state position of the device node that receives the low level signal is a normal position and the second board state position of the other device node is an abnormal position, the device node that receives the low level signal determines that its role is the initial master device node at this time, if the time interval between the current time and the start time is equal to the preset delay start interval, the other device nodes can start, and the other device nodes can detect that the first board state position and the second board state position of the other device nodes are both normal positions, so that the other device node determines that its role is the initial slave device node. The single board generally refers to an independent printed circuit board in an electronic device (such as a communication device, a server, etc.), and the single board state refers to the working state of the circuit board, including multiple conditions such as normal working, failure, standby, etc.
In a specific embodiment, acquiring a heartbeat signal transmitted by a first equipment node on an equipment system comprises acquiring a heartbeat signal obtained by overlapping a type-coded signal and an operation state-coded signal transmitted by the first equipment node according to different frequencies, wherein the type-coded signal and the operation state-coded signal are signals obtained by the first equipment node respectively carrying out signal coding on a local hardware type signal and a local hardware operation state signal according to different frequencies. And acquiring a heartbeat signal (HEART) transmitted by the first equipment node, wherein the heartbeat signal can reflect the hardware running state of the first equipment node, the first equipment node carries out signal coding on a local hardware type signal and a hardware running state signal according to different frequencies to obtain a type-coded signal and a running state-coded signal, and then the first equipment node carries out superposition on the type-coded signal and the running state-coded signal according to different frequencies to obtain the heartbeat signal. Furthermore, other status signals may be encoded according to different frequencies, for example, a specific encoding superposition schematic diagram shown in fig. 3, where a 1Hz square wave signal represents whether a substrate management controller of a first device node has a problem of hanging, etc., a 1/8Hz status handshake signal is superimposed on a 1Hz square wave basis, and is used to transmit a hardware running status signal and a hardware type signal of the first device node to a control board, where the hardware running status signal and the hardware type signal are, for example, running status signals of a memory type signal, running status signals of a hard disk type signal, and running status signals of a fan type signal, a section of 1/8Hz square wave is superimposed at a low level position on the 1Hz square wave signal, and each section of 1/8Hz signal has 4 bits, where the first 3 bits may represent a hardware type, and specific definition examples may refer to the following table, then the 0 th bit represents a hardware status, 1 (high level) represents a normal, and 0 (low level) represents an abnormal. For example, 0001 represents normal memory state, 0010 represents abnormal hard disk state, and the following are shown in table 1:
TABLE 1
Signal definition Bit[3:1]
Memory state 000
Hard disk state 001
Fan state 010
Temperature state 011
...... ......
In another embodiment, a soft Interrupt signal, specifically an INT (Interrupt) signal, transmitted by a first device node on the device system is acquired, where the signal is actively triggered by the first device node, that is, a programmable logic device on the control board may also receive the INT signal sent by the first device node. It will be appreciated that soft interrupts are a software-level interrupt mechanism that is actively initiated by software to handle some asynchronous events or tasks that require timely response but are of a relatively lower priority than hard interrupts.
And step S12, determining whether the preset active-standby switching condition is met or not based on the target signal.
In a specific embodiment, the determining whether the preset active/standby switching condition is currently met based on the target signal includes analyzing the heartbeat signal by using different analysis clocks to obtain a state notification signal including hardware operation states corresponding to hardware types in the first device node, sending the state notification signal to a second device node, and determining whether the first preset active/standby switching condition is currently met based on the state notification signal, wherein the first preset active/standby switching condition is that the state notification signal characterizes that the hardware operation state of the first device node meets a preset fault condition. It can be understood that, according to the heartbeat signal obtained by stacking different signals according to different frequencies, different analysis clocks are needed to analyze the heartbeat signal, specifically, two kinds of analysis clocks with different frequencies can be adopted, namely xHz and X/8Hz are adopted, two kinds of analysis clocks with different frequencies are respectively set in the control panel and are used for respectively analyzing data with the two kinds of frequencies, a specific heartbeat signal analysis waveform diagram shown in fig. 4 is taken as xHz, for example, the signal frequency is xHz, the analysis clock frequency is X/2Hz, the 10101 level of the waveform 1 can be just sampled by the rising edge of the signal with the waveform 2, and the waveform diagram is shown by a dotted line in fig. 4. When analyzing the heartbeat signal by using the analysis clock to obtain a state notification signal containing hardware operation states corresponding to the hardware types in the first equipment node, sending the state notification signal to the second equipment node so that the second equipment node can determine the operation state of the first equipment node, and determining whether the first preset main equipment switching condition is met or not according to the state notification signal, wherein the first preset main equipment switching condition is that the state notification signal indicates that the hardware operation state of the first equipment node meets a preset fault condition, namely, when the first equipment node has operation faults, the first preset main equipment switching condition is met, namely, the main equipment switching is required, and when the operation faults exist, the first equipment node is hard reset or manually pulled out, and the first equipment node is subjected to software abnormal restarting, so that the switching is triggered. For example, the module exception occupies the CPU for too long, and causes the hardware watchdog to restart the system, the data access exception, the instruction access exception and other system exceptions, and the like.
In another embodiment, if the target signal is a soft interrupt signal, the determining whether the preset main/standby switching condition is currently satisfied based on the target signal includes determining whether a second preset main/standby switching condition is currently satisfied based on the soft interrupt signal, where the second preset main/standby switching condition characterizes the soft interrupt signal as the first device node to trigger a main/standby switching request. When the target signal is an INT signal, determining whether a second preset main/standby switching condition is met currently based on the soft interrupt signal, namely whether the soft interrupt signal characterizes the first equipment node to trigger the main/standby switching request, wherein the main/standby switching is forced.
And step S13, if the preset main equipment switching condition is met currently, generating a first hard interrupt signal, and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that main equipment switching operation is about to trigger, wherein the second equipment node is an equipment node which is in a main equipment relation with the first equipment node.
If the first preset master-slave switching condition or the second preset master-slave switching condition is currently met, a first hard interrupt signal (Interrupt Request, i.e. IRQ) is generated, and the first hard interrupt signal is transmitted to a second device node to inform the second device node that the master-slave switching operation is about to be triggered, wherein the second device node is a device node which is in a master-slave relationship with the first device node, that is, if the first device node is the master device node, the second device node is a slave device node of the first device node, and if the first device node is the slave device node, the second device node is the master device node of the first device node, so that the reason for causing the master-slave switching can be initiated by the master device node or initiated by the slave device node.
Further, a hard interrupt is an interrupt request signal actively sent to a processor by a hardware device, such as an external device, a hardware module inside a chip, etc., and is used to inform the processor that an urgent event needs to be processed, and the processor may suspend a task currently being executed, and in turn execute a processing procedure related to the interrupt. The first hard interrupt signal mentioned above is specifically IRQ (Interrupt Request) signals, the IRQ signal may be in the form of GPIO (General-purpose input/output) or in the form of a bus such as I2C (Inter-INTEGRATED CIRCUIT), and by defining a certain communication protocol, communication between the baseboard management controllers of the two nodes and the programmable logic device on the control board is implemented, where the first hard interrupt signal may be used as a channel for the control board to notify the standby motherboard when the main/standby switching request interrupt occurs.
And step S14, after the hard interrupt signal is transmitted to the second equipment node, executing the active-standby switching operation between the first equipment node and the second equipment node.
The method comprises the steps of detecting whether a state notification signal corresponding to a first equipment node meets a preset control right acquisition condition or not if a second hard interrupt signal transmitted by the first equipment node is received, wherein the second hard interrupt signal represents that the first equipment node triggers an acquisition request of a control right of a shared component, disconnecting a path between the second equipment node and the shared component and establishing the path between the first equipment node and the shared component if the state notification signal corresponding to the first equipment node meets the preset control right acquisition condition, and transmitting a third hard interrupt signal to the first equipment node and the second equipment node, wherein the third hard interrupt signal represents that the control right of the shared component belongs to the first equipment node. The I2C buses of the two primary and secondary equipment nodes are connected to the programmable logic device of the control board, the buses are used for receiving a second hard interrupt signal transmitted by the first equipment node, the second hard interrupt signal characterizes the first equipment node to trigger a request for acquiring control rights of the shared component, so that whether a state notification signal corresponding to the first equipment node meets preset control rights acquiring conditions, such as whether the running state of the first equipment node is good or whether the shared component is controlled by the first equipment node at present or not is detected, if the state notification signal corresponding to the first equipment node meets the preset control rights acquiring conditions, a path between the second equipment node and the shared component is disconnected, and a path between the first equipment node and the shared component is established, so that the first equipment node can control the shared component. The common component can be a fan and a power supply, so that the control of the common component can be specifically control of the rotating speed of the fan and control of the power supply (Power supply unit, namely PSU) state, when the common component is controlled, a control instruction needs to be sent, the control instruction is generally transmitted through I2C, each I2C bus of a baseboard management controller of the two main equipment nodes and the two standby equipment nodes is connected to a programmable logic device of a control panel, one alternative switching function (data selector, namely MUX) is realized inside the programmable logic device through logic control, and the path between the second equipment node and the common component is switched to the first equipment node, so that the first equipment node can control the common component.
The method comprises the steps of monitoring whether the number of current business processing requests of a client is larger than a preset threshold, generating a fourth hard interrupt signal and sending the fourth hard interrupt signal to the first equipment node and the second equipment node to inform the first equipment node and the second equipment node that double-master switching operation is about to be triggered if the number of the current business processing requests is larger than the preset threshold, and executing operation of switching the first equipment node and the second equipment node into master equipment nodes after the fourth hard interrupt signal is transmitted to the first equipment node and the second equipment node so that each master equipment node processes the current business processing requests.
The two main boards of the double-equipment node can be designed into a main-standby relation, and can be designed into a double-main working mode, the reliability of service can be ensured by using the main-standby working mode, when equipment system processing resources are tense, the two main boards can be adjusted into the double-main mode, and the two main boards are involved in actual data and service processing, so that the resource utilization rate of a server is improved. The method comprises the steps of monitoring whether the number of current service processing requests of a client is larger than a preset threshold, generating a fourth hard interrupt signal if the number of the current service processing requests is larger than the preset threshold, sending the fourth hard interrupt signal to a first equipment node and a second equipment node to inform the first equipment node and the second equipment node that double-master switching operation is about to be triggered, executing operation of switching the first equipment node and the second equipment node into master equipment nodes after the fourth hard interrupt signal is transmitted to the first equipment node and the second equipment node so that each master equipment node can process the current service processing requests, wherein the fourth hard interrupt signal is an IRQ signal, the IRQ signal respectively transmits an instruction of a double-master working mode, telling the two equipment nodes about to enter the double-master mode from the master mode, detecting the health state of the other party through a handshake message signal and a heartbeat signal between the two equipment nodes, normally performing double-master mode switching when the first equipment node and the second equipment node are judged to be normal, executing the operation of switching the double-master mode, reflecting the communication state of the equipment nodes when the health state of any equipment node is not normal, and returning the IRQ signal to the control board through the normal switching mode, and reporting the service processing results to the control board can not be returned through the network switching mode.
In this embodiment, before executing the active-standby switching operation between the first device node and the second device node, the method further includes determining a current master device node and a current slave device node from the first device node and the second device node, and transmitting service data of the current master device node to the current slave device node through a preset target path, where the preset target path is a path between a first network card that is externally hung by a processor of the first device node and a second network card that is externally hung by a processor of the second device node. For example, as shown in fig. 5, a specific device system connection schematic diagram is shown, where a first network card is hung on a processor of a first device node, a second network card is hung on a processor of a second device node, where the first network card and the second network card may be OCP (Open Compute Project, open computing item) network cards, and data transmission between the processor of the device node and the network cards may use PCIe (PERIPHERAL COMPONENT INTERCONNECT EXPRESS is a high-speed serial computer expansion bus standard) bus, before performing a master-slave switching operation between the first device node and the second device node, it is further necessary to determine a current master device node and a current slave device node from the first device node and the second device node, and transmit service data of the current master device node to the current slave device node through a preset target path, that is, perform data smoothing processing, so that the current slave device node also has service data identical to the current master device node, and the preset target path is a path between the first network card and the second network card.
In this embodiment, after the performing the active-standby switching operation between the first device node and the second device node, the method further includes obtaining a new active device node through the active-standby switching operation, and processing a current service processing request of the client by using the service data. When the operation of main-standby switching is completed, a new main equipment node is obtained, and it can be understood that the new main equipment node is the slave equipment node before switching, that is, the new main equipment node already has service data, so that the new main equipment node can directly process the current service processing request of the client by using the service data, thereby ensuring that the service is uninterrupted and the service data is normal.
Furthermore, in this embodiment, a failover module may be further provided, where the failover module is a key point for ensuring that service data is normal when a dual-device node fails, and when a master device node fails, the failover module immediately switches a service to a slave device node, so as to ensure continuity of the service. The failover module first stops sending traffic requests to the primary node and forwards the traffic requests to the secondary node. Meanwhile, the failover module informs the client of updating the connection information so that the client can connect to the standby node to continue service processing. After the original master node is recovered from the fault, the fault switching module automatically switches the service back to the original master node and notifies the client to update the connection information. In a common manner of failover, such as in a dual device node architecture, traffic requests may be distributed using a load balancer, such as Nginx, HAProxy, which may forward requests to a master device node or a slave device node according to a preset policy, such as a poll, minimum connection, etc. policy. When the master equipment node fails, the load balancer can automatically switch the request to the slave equipment node to realize the failure switching.
For example, as shown in fig. 5, the baseboard management controller of each device node is connected to a physical layer integrated circuit (PHY) through a Reduced Gigabit Media Independent Interface (RGMII) MEDIA INDEPENDENT INTERFACE, a Serial Gigabit media independent interface (Serial Gigabit MEDIA INDEPENDENT INTERFACE, SGMII) or a media dependent interface (MEDIA DEPENDENT INTERFACE, MDI) of the physical layer integrated circuit is connected to a local area network switch board (LAN SWITCH board), a single-port network interface of the local area network switch board is connected to each network device, that is, the system is mainly implemented by designing a LAN SWITCH board, the LAN SWITCH board is essentially a local area network switch board card, the network connection capability can be extended, so that more network devices can be simultaneously connected, the baseboard management controllers of two device nodes are designed to output RGMII signals to be connected to PHY chips, then the PHY chips can be connected to LAN SWITCH boards through SGMII/MDI interfaces and cables, network data exchange is realized through LAN SWITCH chips on LAN SWITCH boards, network connection is realized through one RJ45 port, and thus the number of network devices can be simplified, and two network nodes can share one network interface.
When the current condition of the preset active-standby switching is met based on the target signal, generating an active-standby switching log according to the target signal, and when the first equipment node and the second equipment node in the equipment system are required to be maintained, determining the reason for executing the active-standby switching operation between the first equipment node and the second equipment node according to the target signal in the active-standby switching log, and determining the operation of the maintenance equipment system according to the reason, for example, if the reason represents that the active-standby switching operation is executed in response to the first equipment node triggering the active-standby switching request, the maintenance equipment system is not required, and if the reason represents that the hardware running state of the first equipment node meets the preset fault condition, the first equipment node is required to be maintained, so that the equipment system can be conveniently maintained according to the health state of the active-standby switching log monitoring equipment system.
The programmable logic device applied to the control panel comprises the steps of obtaining a target signal transmitted by a first equipment node on an equipment system, wherein the first equipment node is a master equipment node or a slave equipment node, the target signal is a heartbeat signal of the equipment node or a soft interrupt signal triggered by the equipment node, determining whether a preset master-slave switching condition is met or not currently based on the target signal, generating a first hard interrupt signal and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that the master-slave switching operation is about to be triggered, the second equipment node is the equipment node which is in a master-slave relation with the first equipment node, and executing the master-slave switching operation between the first equipment node and the second equipment node after the hard interrupt signal is transmitted to the second equipment node.
The method has the advantages that the control board is newly added in the equipment system, the programmable logic device on the control board acquires the target signal of the first equipment node, when the current meeting of the preset main/standby switching condition is determined according to the target signal, the first hard interrupt signal is sent to the second equipment node to inform the second equipment node that the main/standby switching operation is about to be triggered, after the hard interrupt signal is transmitted to the second equipment node, the main/standby switching operation between the first equipment node and the second equipment node is executed, namely the main/standby relation between the first equipment node and the second equipment node is switched, and because the hard interrupt signal has the advantages of quick response and high priority, the quick main/standby switching can be realized, namely the current equipment node can be quickly switched to the next main equipment node and takes over the subsequent service processing work, and the continuity of the service and the stable operation of the system are ensured.
Fig. 6 is a schematic structural diagram of a main/standby switching device according to an embodiment of the present invention, which is applied to a programmable logic device on a control board, and includes:
the signal acquisition module 11 is configured to acquire a target signal transmitted by a first device node on a device system, where the first device node is a master device node or a slave device node, and the target signal is a heartbeat signal of the device node or a soft interrupt signal triggered by the device node;
a judging module 12, configured to determine whether a preset active-standby switching condition is currently satisfied based on the target signal;
The signal sending module 13 is configured to generate a first hard interrupt signal if the preset main device switching condition is currently met, and transmit the first hard interrupt signal to a second device node to notify the second device node that main device switching operation is about to trigger;
And the active-standby switching module 14 is configured to perform an active-standby switching operation between the first device node and the second device node after transmitting the hard interrupt signal to the second device node.
The programmable logic device applied to the control panel comprises the steps of obtaining a target signal transmitted by a first equipment node on an equipment system, wherein the first equipment node is a master equipment node or a slave equipment node, the target signal is a heartbeat signal of the equipment node or a soft interrupt signal triggered by the equipment node, determining whether a preset master-slave switching condition is met or not currently based on the target signal, generating a first hard interrupt signal and transmitting the first hard interrupt signal to a second equipment node to inform the second equipment node that the master-slave switching operation is about to be triggered, the second equipment node is the equipment node which is in a master-slave relation with the first equipment node, and executing the master-slave switching operation between the first equipment node and the second equipment node after the hard interrupt signal is transmitted to the second equipment node.
The method has the advantages that the control board is newly added in the equipment system, the programmable logic device on the control board acquires the target signal of the first equipment node, when the current meeting of the preset main/standby switching condition is determined according to the target signal, the first hard interrupt signal is sent to the second equipment node to inform the second equipment node that the main/standby switching operation is about to be triggered, after the hard interrupt signal is transmitted to the second equipment node, the main/standby switching operation between the first equipment node and the second equipment node is executed, namely the main/standby relation between the first equipment node and the second equipment node is switched, and because the hard interrupt signal has the advantages of quick response and high priority, the quick main/standby switching can be realized, namely the current equipment node can be quickly switched to the next main equipment node and takes over the subsequent service processing work, and the continuity of the service and the stable operation of the system are ensured.
The embodiment of the application also discloses an equipment system, and fig. 7 is a schematic diagram of the equipment system according to an exemplary embodiment, wherein the equipment system comprises a control board and each equipment node respectively connected with the control board, and a programmable logic device is arranged on the control board and is used for realizing relevant steps in the main/standby switching method disclosed in any embodiment by executing a computer program. Specifically, the device node may control a common component, where the common component is specifically a fan, a PUS power supply, and the device node may send an interaction signal, a hard interrupt signal, a target signal, and the like between the device node and a programmable logic device on the control board, and the device node may also be connected to an external device, where the external device is, for example, a hard disk, a network card, a graphics processor (Graphics Processing Unit, i.e., GPU), and the like.
Further, the embodiment of the present application further discloses an electronic device, and fig. 8 is a block diagram of an electronic device according to an exemplary embodiment, where the content of the diagram is not to be considered as any limitation on the scope of use of the present application. The electronic device may comprise, in particular, at least one processor 21, at least one memory 22, a power supply 23, a communication interface 24, an input-output interface 25 and a communication bus 26. The memory 22 is configured to store a computer program, where the computer program is loaded and executed by the processor 21 to implement relevant steps in the active-standby switching method disclosed in any of the foregoing embodiments. In addition, the electronic device in the present embodiment may be specifically an electronic computer.
In this embodiment, the power supply 23 is configured to provide working voltages for each hardware device on the electronic device, the communication interface 24 is configured to create a data transmission channel with an external device for the electronic device, and the communication protocol to be followed is any communication protocol applicable to the technical solution of the present application, which is not specifically limited herein, and the input/output interface 25 is configured to obtain external input data or output data to the outside, where the specific interface type may be selected according to the needs of the specific application, which is not specifically limited herein.
The memory 22 may be a carrier for storing resources, such as a read-only memory, a random access memory, a magnetic disk, or an optical disk, and the resources stored thereon may include an operating system 221, a computer program 222, and the like, and the storage may be temporary storage or permanent storage.
The operating system 221 is used for managing and controlling various hardware devices on the electronic device and the computer program 222, which may be Windows Server, netware, unix, linux, etc. The computer program 222 may further include a computer program capable of performing other specific tasks in addition to the computer program capable of performing the active/standby switching method performed by the electronic device as disclosed in any of the foregoing embodiments.
Furthermore, the application also discloses a computer readable storage medium for storing a computer program, wherein the computer program realizes the main/standby switching method when being executed by a processor. For specific steps of the method, reference may be made to the corresponding contents disclosed in the foregoing embodiments, and no further description is given here.
In this specification, each embodiment is described in a progressive manner, and each embodiment is mainly described in a different point from other embodiments, so that the same or similar parts between the embodiments are referred to each other. For the device disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and the relevant points refer to the description of the method section.
Those of skill would further appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative elements and steps are described above generally in terms of functionality in order to clearly illustrate the interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. The software modules may be disposed in Random Access Memory (RAM), memory, read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
Finally, it is further noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises an element.
While the foregoing has been provided to illustrate the principles and embodiments of the present application, specific examples have been provided herein to assist in understanding the principles and embodiments of the present application, and are intended to be in no way limiting, for those of ordinary skill in the art will, in light of the above teachings, appreciate that the principles and embodiments of the present application may be varied in any way.

Claims (10)

1.一种主备倒换方法,其特征在于,应用于控制板上的可编程逻辑器件,包括:1. A method for active/standby switching, characterized in that a programmable logic device applied to a control board comprises: 获取设备系统上的第一设备节点传输的目标信号;所述第一设备节点为主设备节点或从设备节点,所述目标信号为设备节点的心跳信号或设备节点触发的软中断信号;Acquire a target signal transmitted by a first device node on the device system; the first device node is a master device node or a slave device node, and the target signal is a heartbeat signal of the device node or a soft interrupt signal triggered by the device node; 基于所述目标信号确定当前是否满足预设主备倒换条件;Determining whether a preset active/standby switching condition is currently met based on the target signal; 若当前满足所述预设主备倒换条件,则生成第一硬中断信号,并将所述第一硬中断信号传输至第二设备节点,以向所述第二设备节点通知主备倒换操作即将触发;所述第二设备节点为与所述第一设备节点互为主备关系的设备节点;If the preset active-standby switching condition is currently met, a first hard interrupt signal is generated, and the first hard interrupt signal is transmitted to a second device node to notify the second device node that an active-standby switching operation is about to be triggered; the second device node is a device node in an active-standby relationship with the first device node; 在将所述硬中断信号传输至所述第二设备节点后,执行所述第一设备节点和所述第二设备节点之间的主备倒换操作。After the hard interrupt signal is transmitted to the second device node, a master-slave switching operation is performed between the first device node and the second device node. 2.根据权利要求1所述的主备倒换方法,其特征在于,还包括:2. The active/standby switching method according to claim 1, further comprising: 在设备系统初始启动过程中,通过所述控制板的下拉电阻向所述设备系统中的任一设备节点传输低电平信号,以使接收到所述低电平信号的设备节点将自身的角色判定为初始的主设备节点;以及,通过所述控制板的上拉电阻向所述设备系统中的任一设备节点传输高电平信号,以使接收到所述高电平信号的设备节点将自身的角色判定为初始的从设备节点。During the initial startup of the device system, a low-level signal is transmitted to any device node in the device system through the pull-down resistor of the control board, so that the device node receiving the low-level signal determines its own role as the initial master device node; and a high-level signal is transmitted to any device node in the device system through the pull-up resistor of the control board, so that the device node receiving the high-level signal determines its own role as the initial slave device node. 3.根据权利要求2所述的主备倒换方法,其特征在于,所述通过所述控制板的下拉电阻向所述设备系统中的任一设备节点传输低电平信号之后,还包括:3. The active/standby switching method according to claim 2, characterized in that after transmitting a low-level signal to any device node in the device system through the pull-down resistor of the control board, the method further comprises: 记录接收到所述下拉电阻传输的低电平信号的设备节点的启动时间;Recording the startup time of the device node that receives the low level signal transmitted by the pull-down resistor; 若当前与所述启动时间之间的时间间隔不等于预设延迟启动间隔,则禁止启动其他设备节点,如果接收到所述低电平信号的设备节点的第一单板状态位置为正常位置且其他设备节点的第二单板状态位置为异常位置,则确定接收到所述低电平信号的设备节点为初始的主设备节点;If the time interval between the current time and the start time is not equal to the preset delayed start interval, then starting other device nodes is prohibited, and if the first single board state position of the device node receiving the low level signal is a normal position and the second single board state position of other device nodes is an abnormal position, then the device node receiving the low level signal is determined to be the initial master device node; 若当前与所述启动时间之间的时间间隔等于预设延迟启动间隔,则启动其他设备节点,如果所述第一单板状态位置和其他设备节点的第二单板状态位置均为正常位置,则确定其他设备节点为初始的从设备节点。If the time interval between the current time and the start time is equal to the preset delayed start interval, other device nodes are started. If the first single board status position and the second single board status position of other device nodes are both normal positions, other device nodes are determined to be initial slave device nodes. 4.根据权利要求1所述的主备倒换方法,其特征在于,获取设备系统上的第一设备节点传输的心跳信号,包括:4. The active/standby switching method according to claim 1, wherein obtaining a heartbeat signal transmitted by a first device node on the device system comprises: 获取所述第一设备节点传输的按照不同频率将类型编码后信号和运行状态编码后信号进行叠加后得到的心跳信号;Acquire a heartbeat signal transmitted by the first device node, obtained by superimposing a type-encoded signal and an operation status-encoded signal according to different frequencies; 其中,所述类型编码后信号和所述运行状态编码后信号为所述第一设备节点分别将本地的硬件类型信号和硬件运行状态信号按照不同频率进行信号编码后得到的信号;The type-encoded signal and the running status-encoded signal are signals obtained by respectively encoding the local hardware type signal and the hardware running status signal by the first device node according to different frequencies; 相应的,所述基于所述目标信号确定当前是否满足预设主备倒换条件,包括:Accordingly, determining whether a preset active/standby switching condition is currently met based on the target signal includes: 利用不同的解析时钟对所述心跳信号进行解析,以得到包含所述第一设备节点中与各硬件类型分别对应的硬件运行状态的状态通知信号;parsing the heartbeat signal using different parsing clocks to obtain a status notification signal including the hardware operating status corresponding to each hardware type in the first device node; 将所述状态通知信号发送至第二设备节点,并基于所述状态通知信号确定当前是否满足第一预设主备倒换条件;Sending the status notification signal to the second device node, and determining whether a first preset active-standby switching condition is currently met based on the status notification signal; 其中,所述第一预设主备倒换条件为所述状态通知信号表征所述第一设备节点的硬件运行状态满足预设故障条件。The first preset active-standby switching condition is that the status notification signal indicates that the hardware operating status of the first device node meets a preset fault condition. 5.根据权利要求4所述的主备倒换方法,其特征在于,还包括:5. The active/standby switching method according to claim 4, further comprising: 若接收所述第一设备节点传输的第二硬中断信号,则检测所述第一设备节点对应的所述状态通知信号是否满足预设控制权获取条件;其中,所述第二硬中断信号表征所述第一设备节点触发共用部件的控制权的获取请求;If a second hard interrupt signal transmitted by the first device node is received, detecting whether the status notification signal corresponding to the first device node meets a preset control right acquisition condition; wherein the second hard interrupt signal represents a request for acquiring the control right of a common component triggered by the first device node; 如果与所述第一设备节点对应的所述状态通知信号满足所述预设控制权获取条件,则断开第二设备节点与所述共用部件之间的通路,并建立所述第一设备节点与所述共用部件之间的通路;If the state notification signal corresponding to the first device node satisfies the preset control right acquisition condition, disconnecting the path between the second device node and the common component, and establishing the path between the first device node and the common component; 向所述第一设备节点和所述第二设备节点发送第三硬中断信号;其中,所述第三硬中断信号表征所述共用部件的控制权归属于所述第一设备节点。A third hard interrupt signal is sent to the first device node and the second device node, wherein the third hard interrupt signal indicates that control of the common component belongs to the first device node. 6.根据权利要求1所述的主备倒换方法,其特征在于,若所述目标信号为软中断信号,则所述基于所述目标信号确定当前是否满足预设主备倒换条件,包括:6. The active/standby switching method according to claim 1, wherein if the target signal is a soft interrupt signal, then determining whether a preset active/standby switching condition is currently met based on the target signal comprises: 基于所述软中断信号确定当前是否满足第二预设主备倒换条件;其中,所述第二预设主备倒换条件为所述软中断信号表征所述第一设备节点触发主备倒换请求。Based on the soft interrupt signal, determine whether a second preset active-standby switching condition is currently met; wherein the second preset active-standby switching condition is that the soft interrupt signal indicates that the first device node triggers an active-standby switching request. 7.根据权利要求1所述的主备倒换方法,其特征在于,还包括:7. The active/standby switching method according to claim 1, further comprising: 监测客户端当前业务处理请求的数量是否大于预设阈值;Monitor whether the number of current business processing requests from the client is greater than a preset threshold; 若所述当前业务处理请求的数量大于所述预设阈值,则生成第四硬中断信号,并向所述第一设备节点和所述第二设备节点发送所述第四硬中断信号,以向所述第一设备节点和所述第二设备节点通知双主切换操作即将触发;If the number of the current service processing requests is greater than the preset threshold, a fourth hard interrupt signal is generated, and the fourth hard interrupt signal is sent to the first device node and the second device node to notify the first device node and the second device node that a dual-master switching operation is about to be triggered; 在将所述第四硬中断信号传输至所述第一设备节点和所述第二设备节点后,执行将所述第一设备节点和所述第二设备节点均切换为主设备节点的操作,以便各所述主设备节点处理所述当前业务处理请求。After transmitting the fourth hard interrupt signal to the first device node and the second device node, an operation of switching both the first device node and the second device node to master device nodes is performed so that each of the master device nodes processes the current service processing request. 8.根据权利要求1所述的主备倒换方法,其特征在于,所述执行所述第一设备节点和所述第二设备节点之间的主备倒换操作之前,还包括:8. The active/standby switching method according to claim 1, characterized in that before performing the active/standby switching operation between the first device node and the second device node, the method further comprises: 从所述第一设备节点和所述第二设备节点中确定当前主设备节点和当前从设备节点,并将所述当前主设备节点的业务数据通过预设目标通路传输至所述当前从设备节点;所述预设目标通路为所述第一设备节点的处理器外挂的第一网卡与所述第二设备节点的处理器外挂的第二网卡之间的通路;Determine a current master device node and a current slave device node from the first device node and the second device node, and transmit the service data of the current master device node to the current slave device node through a preset target path; the preset target path is a path between a first network card externally mounted on a processor of the first device node and a second network card externally mounted on a processor of the second device node; 相应的,所述执行所述第一设备节点和所述第二设备节点之间的主备倒换操作之后,还包括:Correspondingly, after performing the master-slave switchover operation between the first device node and the second device node, the method further includes: 通过主备倒换操作后得到的新主设备节点,并利用所述业务数据对客户端的当前业务处理请求进行处理。The new master device node is obtained after the master-slave switch operation, and uses the service data to process the current service processing request of the client. 9.一种设备系统,其特征在于,包括控制板以及分别与所述控制板连接的各设备节点;其中,所述控制板上设置有可编程逻辑器件,所述可编程逻辑器件通过执行计算机程序以实现如权利要求1至8任一项所述的主备倒换方法。9. A device system, characterized in that it includes a control board and device nodes respectively connected to the control board; wherein a programmable logic device is provided on the control board, and the programmable logic device implements the master-slave switching method as described in any one of claims 1 to 8 by executing a computer program. 10.一种电子设备,其特征在于,包括:10. An electronic device, comprising: 存储器,用于存储计算机程序;Memory for storing computer programs; 处理器,用于执行所述计算机程序以实现如权利要求1至8任意一项所述主备倒换方法的步骤。A processor is used to execute the computer program to implement the steps of the active-standby switching method as claimed in any one of claims 1 to 8.
CN202411748941.3A 2024-11-30 2024-11-30 Equipment system and main/standby switching method thereof, and electronic equipment Pending CN119597708A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411748941.3A CN119597708A (en) 2024-11-30 2024-11-30 Equipment system and main/standby switching method thereof, and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411748941.3A CN119597708A (en) 2024-11-30 2024-11-30 Equipment system and main/standby switching method thereof, and electronic equipment

Publications (1)

Publication Number Publication Date
CN119597708A true CN119597708A (en) 2025-03-11

Family

ID=94830149

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202411748941.3A Pending CN119597708A (en) 2024-11-30 2024-11-30 Equipment system and main/standby switching method thereof, and electronic equipment

Country Status (1)

Country Link
CN (1) CN119597708A (en)

Similar Documents

Publication Publication Date Title
US7490264B2 (en) Method for error handling in a dual adaptor system where one adaptor is a master
US7930425B2 (en) Method of effectively establishing and maintaining communication linkages with a network interface controller
US5784617A (en) Resource-capability-based method and system for handling service processor requests
US9430266B2 (en) Activating a subphysical driver on failure of hypervisor for operating an I/O device shared by hypervisor and guest OS and virtual computer system
US7676616B2 (en) Method, apparatus and program storage device for providing asynchronous status messaging in a data storage system
US9489230B1 (en) Handling of virtual machine migration while performing clustering operations
US20080162984A1 (en) Method and apparatus for hardware assisted takeover
US20090070639A1 (en) Administering Correlated Error Logs In A Computer System
US20080307042A1 (en) Information processing system, information processing method, and program
US20090077412A1 (en) Administering A System Dump On A Redundant Node Controller In A Computer System
JP2000293497A (en) Generation system for cluster node relief signal
US20090055679A1 (en) Recovery Of A Redundant Node Controller In A Computer System
CN114003445B (en) BMC I2C monitoring function test method, system, terminal and storage medium
CN111209265B (en) Database switching method and terminal equipment
WO2020088351A1 (en) Method for sending device information, computer device and distributed computer device system
CN116074187A (en) Main-standby link switching method and device, electronic equipment and storage medium
CN118656245A (en) A method, device, electronic device and medium for handling server exceptions
CN118354034A (en) Redundancy monitoring system, redundancy monitoring method, electronic equipment and readable storage medium
CN118606117A (en) A four-controller interconnected mirroring system, data transmission method, device and medium
CN119597708A (en) Equipment system and main/standby switching method thereof, and electronic equipment
CN116974809A (en) Fault information acquisition method and device, baseboard management controller, system and medium
CN110874473A (en) Virus detection method, device and system, cloud service system and storage medium
CN113746655B (en) A backup state determination method, device and system
CN100361047C (en) Method and device for improving bus transmission reliability
CN112912848A (en) Power supply request management method in cluster operation process

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination