CN113961401A - Server fan redundancy control system and method - Google Patents
Server fan redundancy control system and method Download PDFInfo
- Publication number
- CN113961401A CN113961401A CN202111263544.3A CN202111263544A CN113961401A CN 113961401 A CN113961401 A CN 113961401A CN 202111263544 A CN202111263544 A CN 202111263544A CN 113961401 A CN113961401 A CN 113961401A
- Authority
- CN
- China
- Prior art keywords
- fan
- cpld
- bmc
- pwm
- efuse
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/18—Error detection or correction of the data by redundancy in hardware using passive fault-masking of the redundant circuits
- G06F11/181—Eliminating the failing redundant component
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Cooling Or The Like Of Electrical Apparatus (AREA)
Abstract
The invention provides a server fan redundancy control system and a method, wherein two paths of PWM signals are designed, one path of PWM signal is used as a standby, when a fan is abnormal, the fan can be switched to the standby PWM signal without sensing, the PWM failure problem can be eliminated, a CPLD is used for sending a fan power supply enabling signal to control the power-down and power-up of an EFUSE, the fan is controlled to be restarted, the probability failure problem of the fan is eliminated, and the fan failure condition can be covered and the fault repair can be carried out through a fan failure processing circuit. In addition, the delay circuit is added to the in-place signal, so that the damage of a fan circuit caused by hot plugging of the fan can be avoided, and overcurrent protection is provided for the fan.
Description
Technical Field
The invention relates to the technical field of server heat dissipation, in particular to a server fan redundancy control system and a server fan redundancy control method.
Background
The fan is an extremely important component of the server, and plays a role in heat dissipation of the whole system, and once the fan goes wrong, the whole system is fatally affected. The conventional fan redundancy system only considers the failure of PWM pulse output, but neglects the problem of probabilistic operation failure of the fan. The traditional PWM pulse switching is not solved aiming at the probability operation failure of the fan.
However, once the fan fails to operate, it is difficult to determine whether the PWM output fails or the fan itself fails with probability, so a solution for completely solving the problem of the failure of the fan operation is urgently needed, so as to ensure the heat dissipation reliability of the entire server system.
Disclosure of Invention
The invention aims to provide a server fan redundancy control system and a server fan redundancy control method, and aims to solve the problem that in the prior art, whether the server fan is abnormal or the PWM output fails or the fan operates and fails is difficult to judge, so that the purposes of processing the PWM pulse failure condition and the fan abnormality caused by the probabilistic failure of the fan can be realized, and the heat dissipation reliability of a storage server is improved.
In order to achieve the above technical object, the present invention provides a server fan redundancy control system, including:
BMC, CPLD, EFUSE and fan;
PWM signals BMC _ PWM1, BMC _ PWM2, an EFUSE enable signal BMC _ EFUSE _ EN and a PWM switching signal FAN _ PWM _ SWITCH are connected between the BMC and the CPLD;
PWM signals CPLD _ PWM1 and CPLD _ PWM2 are connected between the CPLD and the FAN, and the FAN is in-place signal FAN _ PRESENT;
a FAN rotating speed signal FAN _ TACH is connected between the BMC and the FAN;
a FAN POWER supply enabling signal FAN _ POWER _ ENABLE is connected between the CPLD and the EFUSE, and a FAN POWER-on signal P12V _ FAN is connected between the EFUSE and the FAN.
Preferably, the CPLD powers up and down the fan through the EFUSE, when the fan is in place, the CPLD delays for 1s and then controls the EFUSE to power up the fan, and when the fan is not in place, the CPLD controls the EFUSE to power down the fan.
Preferably, when the FAN rotation speed is abnormal, the BMC controls the CPLD to SWITCH the PWM signal source of the FAN through the PWM switching signal FAN _ PWM _ SWITCH.
Preferably, when the FAN is abnormal after the PWM signal source is switched, the CPLD sends a FAN POWER supply ENABLE signal FAN _ POWER _ ENABLE to control the EFUSE to POWER on and off to restart the FAN.
The invention also provides a server fan redundancy control method, which comprises the following operations:
the BMC monitors whether the FAN rotating speed is abnormal or not through a FAN rotating speed signal FAN _ TACH;
when the rotating speed of the fan is abnormal, the BMC controls the CPLD to switch the PWM signal, uses the standby PWM signal source as a rotating speed control signal source of the fan and verifies whether the PWM output is invalid or not;
when the rotating speed of the FAN is still not recovered to be normal, the BMC sends a FAN POWER supply enabling signal FAN _ POWER _ ENABLE to control the EFUSE to be powered off and powered on through the CPLD so as to restart the FAN and verify whether the FAN is in probabilistic failure;
and when the rotating speed of the fan is still abnormal, reporting the fault information of the fan.
Preferably, the CPLD powers on and off the fan through the EFUSE, when the fan is in place, the CPLD delays for 1s and then controls the EFUSE to power on the fan, and when the fan is not in place, the CPLD controls the EFUSE to power off the fan, so that overcurrent protection is provided for hot plugging of the fan.
Preferably, before the BMC is started, the CPLD controls the fan to run at full speed, and after the BMC is started, the BMC outputs a PWM signal to the CPLD, and the CPLD controls the fan to run at a certain speed.
Preferably, the BMC outputs two paths of PWM signals, one path of PWM signal is used for controlling the rotating speed of the fan, the other path of PWM signal is used for standby, and the two paths of PWM signals are the same.
The effect provided in the summary of the invention is only the effect of the embodiment, not all the effects of the invention, and one of the above technical solutions has the following advantages or beneficial effects:
compared with the prior art, the invention designs two paths of PWM signals, one path of PWM signal is used as a standby, when the fan is abnormal, the fan can be switched to the standby PWM signal without sensing, further the problem of PWM failure can be solved, and the CPLD sends a fan power supply enabling signal to control the EFUSE to be powered off and powered on, further the fan is controlled to be restarted, further the problem of probabilistic failure of the fan is solved, and the failure of the fan can be covered and the fault can be repaired through the fan self failure processing circuit, so that the invention can not only process the PWM pulse failure condition, but also process the fan abnormality caused by the probabilistic failure of the fan, and greatly improves the heat dissipation reliability of the storage server. In addition, the delay circuit is added to the in-place signal, so that the damage of a fan circuit caused by hot plugging of the fan can be avoided, and overcurrent protection is provided for the fan.
Drawings
FIG. 1 is a block diagram of a server fan redundancy control system according to an embodiment of the present invention;
fig. 2 is a flowchart of a server fan redundancy control method according to an embodiment of the present invention.
Detailed Description
In order to clearly explain the technical features of the present invention, the following detailed description of the present invention is provided with reference to the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different features of the invention. To simplify the disclosure of the present invention, the components and arrangements of specific examples are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and procedures are omitted so as to not unnecessarily limit the invention.
The following describes a system and a method for controlling redundancy of a server fan according to embodiments of the present invention in detail with reference to the accompanying drawings.
As shown in fig. 1, the present invention discloses a server fan redundancy control system, which includes:
BMC, CPLD, EFUSE and fan;
PWM signals BMC _ PWM1, BMC _ PWM2, an EFUSE enable signal BMC _ EFUSE _ EN and a PWM switching signal FAN _ PWM _ SWITCH are connected between the BMC and the CPLD;
PWM signals CPLD _ PWM1 and CPLD _ PWM2 are connected between the CPLD and the FAN, and the FAN is in-place signal FAN _ PRESENT;
a FAN rotating speed signal FAN _ TACH is connected between the BMC and the FAN;
a FAN POWER supply enabling signal FAN _ POWER _ ENABLE is connected between the CPLD and the EFUSE, and a FAN POWER-on signal P12V _ FAN is connected between the EFUSE and the FAN.
In the embodiment of the invention, 4 groups of signals are connected between the BMC and the CPLD, namely a PWM signal BMC _ PWM1, a BMC _ PWM2, an EFUSE enable signal BMC _ EFUSE _ EN and a PWM switching signal FAN _ PWM _ SWITCH; three groups of signals are connected between the CPLD and the FAN, namely PWM signals CPLD _ PWM1 and CPLD _ PWM2, and a FAN in-place signal FAN _ PRESENT; a FAN rotating speed signal FAN _ TACH is connected between the BMC and the FAN; a FAN POWER supply enabling signal FAN _ POWER _ ENABLE is connected between the CPLD and the EFUSE, and a FAN POWER-on signal P12V _ FAN is connected between the EFUSE and the FAN.
The BMC outputs two paths of PWM signals, one path of PWM signal is used for controlling the rotating speed of the fan, and the other path of PWM signal is used for standby. The FAN speed signal FAN _ TACH is transmitted to the BMC for FAN speed information acquisition and abnormal information judgment. The CPLD powers on and off the fan through the EFUSE, when the fan is in place, the CPLD delays for 1s and then controls the EFUSE to power on the fan, and when the fan is not in place, the CPLD controls the EFUSE to power off the fan. By the arrangement, EFUSE overcurrent protection in the hot plug process can be effectively avoided.
When the BMC monitors that the FAN rotating speed is abnormal, firstly, whether the abnormality is a PWM output failure condition is judged, namely, a PWM switching signal FAN _ PWM _ SWITCH is triggered to the CPLD, and after the CPLD receives the PWM switching signal sent by the BMC, the control signal of the FAN is switched to a standby PWM signal, so that the switching of the PWM signal is completed. And after the switching, the BMC continuously monitors the rotating speed of the fan, and if the rotating speed of the fan still does not return to normal, whether the abnormality is the probabilistic failure condition of the fan is continuously judged.
When the FAN is judged to be in probabilistic failure, the CPLD sends a FAN POWER supply enabling signal FAN _ POWER _ ENABLE to control the EFUSE to be powered on or powered off for recovery.
Through the operation, the fan failure caused by PWM failure can be effectively processed, and the failure of the fan which is probabilistically failed can also be effectively processed.
After the server is powered on and before the BMC is started, the CPLD controls the fan to run at full speed, and after the BMC is started, the BMC outputs a PWM signal to the CPLD, and the CPLD controls the fan to run at a certain speed. The BMC monitors the rotating speed and the state of the fan through a fan rotating speed signal sent by the fan, when the BMC monitors that the rotating speed of the fan is abnormal, the BMC informs the CPLD to switch the PWM signal, and the CPLD receives the PWM switching signal sent by the BMC to complete PWM signal switching. Starting from the PWM switching signal sent by the BMC, if the fan rotating speed is still abnormal within 10s, the PWM failure problem is eliminated. And then the BMC sends an EFUSE enabling signal BMC _ EFUSE _ EN to the CPLD, the CPLD controls the EFUSE to complete powering off the fan firstly, the fan is powered on again after waiting for 2s, the BMC continuously monitors the rotating speed of the fan, and if the rotating speed of the fan is still abnormal at the moment, the problem of probability failure of the fan is solved, and fault information is reported.
According to the embodiment of the invention, two PWM signals are designed, one PWM signal is used as a standby, when the fan is abnormal, the fan can be switched to the standby PWM signal without sensing, so that the problem of PWM failure can be solved, the power-down and power-up of the EFUSE are controlled by sending a fan power supply enabling signal through the CPLD, so that the fan is restarted, the problem of probabilistic failure of the fan is solved, the failure condition of the fan can be covered through a fan self-failure processing circuit, and the fault can be repaired, so that the fan failure processing circuit can not only deal with the PWM pulse failure condition, but also deal with the fan abnormality caused by the probabilistic failure of the fan, and the heat dissipation reliability of the storage server is greatly improved. In addition, the delay circuit is added to the in-place signal, so that the damage of a fan circuit caused by hot plugging of the fan can be avoided, and overcurrent protection is provided for the fan.
As shown in fig. 2, an embodiment of the present invention further discloses a server fan redundancy control method, where the method includes the following operations:
the BMC monitors whether the FAN rotating speed is abnormal or not through a FAN rotating speed signal FAN _ TACH;
when the rotating speed of the fan is abnormal, the BMC controls the CPLD to switch the PWM signal, uses the standby PWM signal source as a rotating speed control signal source of the fan and verifies whether the PWM output is invalid or not;
when the rotating speed of the FAN is still not recovered to be normal, the BMC sends a FAN POWER supply enabling signal FAN _ POWER _ ENABLE to control the EFUSE to be powered off and powered on through the CPLD so as to restart the FAN and verify whether the FAN is in probabilistic failure;
and when the rotating speed of the fan is still abnormal, reporting the fault information of the fan.
4 groups of signals are connected between the BMC and the CPLD, namely a PWM signal BMC _ PWM1, a BMC _ PWM2, an EFUSE enable signal BMC _ EFUSE _ EN and a PWM switching signal FAN _ PWM _ SWITCH; three groups of signals are connected between the CPLD and the FAN, namely PWM signals CPLD _ PWM1 and CPLD _ PWM2, and a FAN in-place signal FAN _ PRESENT; a FAN rotating speed signal FAN _ TACH is connected between the BMC and the FAN; a FAN POWER supply enabling signal FAN _ POWER _ ENABLE is connected between the CPLD and the EFUSE, and a FAN POWER-on signal P12V _ FAN is connected between the EFUSE and the FAN.
The BMC outputs two paths of PWM signals, one path of PWM signal is used for controlling the rotating speed of the fan, and the other path of PWM signal is used for standby. The FAN speed signal FAN _ TACH is transmitted to the BMC for FAN speed information acquisition and abnormal information judgment. The CPLD powers on and off the fan through the EFUSE, when the fan is in place, the CPLD delays for 1s and then controls the EFUSE to power on the fan, and when the fan is not in place, the CPLD controls the EFUSE to power off the fan. By the arrangement, EFUSE overcurrent protection in the hot plug process can be effectively avoided.
When the BMC monitors that the FAN rotating speed is abnormal, firstly, whether the abnormality is a PWM output failure condition is judged, namely, a PWM switching signal FAN _ PWM _ SWITCH is triggered to the CPLD, and after the CPLD receives the PWM switching signal sent by the BMC, the control signal of the FAN is switched to a standby PWM signal, so that the switching of the PWM signal is completed. And after the switching, the BMC continuously monitors the rotating speed of the fan, and if the rotating speed of the fan still does not return to normal, whether the abnormality is the probabilistic failure condition of the fan is continuously judged.
When the FAN is judged to be in probabilistic failure, the CPLD sends a FAN POWER supply enabling signal FAN _ POWER _ ENABLE to control the EFUSE to be powered on or powered off for recovery.
Through the operation, the fan failure caused by PWM failure can be effectively processed, and the failure of the fan which is probabilistically failed can also be effectively processed.
After the server is powered on and before the BMC is started, the CPLD controls the fan to run at full speed, and after the BMC is started, the BMC outputs a PWM signal to the CPLD, and the CPLD controls the fan to run at a certain speed. The BMC monitors the rotating speed and the state of the fan through a fan rotating speed signal sent by the fan, when the BMC monitors that the rotating speed of the fan is abnormal, the BMC informs the CPLD to switch the PWM signal, and the CPLD receives the PWM switching signal sent by the BMC to complete PWM signal switching. Starting from the PWM switching signal sent by the BMC, if the fan rotating speed is still abnormal within 10s, the PWM failure problem is eliminated. And then the BMC sends an EFUSE enabling signal BMC _ EFUSE _ EN to the CPLD, the CPLD controls the EFUSE to complete powering off the fan firstly, the fan is powered on again after waiting for 2s, the BMC continuously monitors the rotating speed of the fan, and if the rotating speed of the fan is still abnormal at the moment, the problem of probability failure of the fan is solved, and fault information is reported.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111263544.3A CN113961401A (en) | 2021-10-28 | 2021-10-28 | Server fan redundancy control system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111263544.3A CN113961401A (en) | 2021-10-28 | 2021-10-28 | Server fan redundancy control system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113961401A true CN113961401A (en) | 2022-01-21 |
Family
ID=79467850
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111263544.3A Withdrawn CN113961401A (en) | 2021-10-28 | 2021-10-28 | Server fan redundancy control system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113961401A (en) |
-
2021
- 2021-10-28 CN CN202111263544.3A patent/CN113961401A/en not_active Withdrawn
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI490682B (en) | Server protecting system | |
CN105114346A (en) | Device and method for automatically controlling fan during power-on of server | |
CN102868850B (en) | A kind of video camera heater | |
CN104808758B (en) | Electronic device capable of automatic resetting and automatic resetting method thereof | |
CN106528374A (en) | Rack-level storage device and cooling control system thereof | |
CN104564764A (en) | Server system | |
WO2021120388A1 (en) | Pressure protection circuit, control method, and computer-readable storage medium | |
CN111984471A (en) | Cabinet power BMC redundancy management system and method | |
CN113961401A (en) | Server fan redundancy control system and method | |
CN204406385U (en) | The management devices of computer system | |
CN110099542B (en) | Method for ensuring normal operation of power supply device in electronic equipment | |
CN108491052B (en) | Self-recovery method for hot plug fan | |
CN108279761B (en) | A server power supply circuit and control method supporting fan hot swap | |
CN206773636U (en) | A kind of excessively stream self- recoverage fan back board structure | |
CN103454016A (en) | System temperature sensor fault processing method based on server | |
CN111416719B (en) | Server device capable of waking up through network and method thereof | |
CN112131055B (en) | Multi-mode three-machine dynamic fault-tolerant system | |
CN102645290A (en) | Processing method aiming at system temperature sensor fault based on server | |
CN113741663B (en) | Server structure, method and storage medium for starting throttle mechanism | |
TWI482422B (en) | Fan control system and fan control method | |
CN116153347B (en) | A thermal protection circuit for a solid state hard disk | |
US12047070B2 (en) | Main board, hot plug control signal generator, and control signal generating method thereof | |
CN113141287B (en) | Node load short circuit self-isolation circuit and method and multi-connected circuit | |
CN111664105A (en) | Server device capable of controlling fan and control method thereof | |
CN118040869A (en) | Driving power supply architecture of motor controller, motor controller and control method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20220121 |
|
WW01 | Invention patent application withdrawn after publication |