CN116048233A - Power architecture, power monitoring method and device - Google Patents
Power architecture, power monitoring method and device Download PDFInfo
- Publication number
- CN116048233A CN116048233A CN202310235604.3A CN202310235604A CN116048233A CN 116048233 A CN116048233 A CN 116048233A CN 202310235604 A CN202310235604 A CN 202310235604A CN 116048233 A CN116048233 A CN 116048233A
- Authority
- CN
- China
- Prior art keywords
- hard disk
- voltage
- interrupt
- controller
- slot
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/28—Supervision thereof, e.g. detecting power-supply failure by out of limits supervision
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/30—Means for acting in the event of power-supply failure or interruption, e.g. power-supply fluctuations
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
技术领域technical field
本发明实施例涉及电源设计技术领域,特别是涉及一种电源架构、一种电源监控方法、一种电源监控装置、一种电子设备以及一种计算机可读存储介质。Embodiments of the present invention relate to the technical field of power supply design, and in particular, relate to a power supply architecture, a power supply monitoring method, a power supply monitoring device, an electronic device, and a computer-readable storage medium.
背景技术Background technique
近年来,伴随着5G(5th Generation Mobile Communication Technology 第五代移动通信技术)、云计算大数据以及AI(Artificial Intelligence 人工智能)技术的发展,产生的大量应用数据需要完整地存储起来,这些数据包括热数据、温数据、冷数据和极冷数据,其中热数据被调用操作的频率是最高的,极冷数据的操作频率是最低的。针对这些不同类型的数据需要设备研发厂商制造出可靠性极高的存储设备。In recent years, with the development of 5G (5th Generation Mobile Communication Technology), cloud computing big data, and AI (Artificial Intelligence artificial intelligence) technology, a large amount of application data needs to be stored completely. These data include Hot data, warm data, cold data, and extremely cold data, among them, hot data has the highest frequency of operation, and extremely cold data has the lowest operation frequency. For these different types of data, device R&D manufacturers are required to manufacture highly reliable storage devices.
目前传统的存储服务器设计中,存储数据的硬盘的电源架构存在一个问题,即:所有机械硬盘(Hard Disk Drive 简称HDD)在和磁盘阵列(Redundant ArraysofIndependent Disks 简称Raid)控制器之间有大数据的快速交互或者突发(burst)时,机械硬盘会从背板抽取大电流,进而单个硬盘和整个干路之间会有较大的压降(电压迅速跌落)产生,而在电源工程师仿真阶段,无法准确地做到或者无法无限逼近这种真实的业务场景,因此如果单个硬盘的电源仿真裕量不足,会导致设计出来的服务器某些时候存在掉盘的风险,进而对应用数据造成一定的破坏,如果单个硬盘电源仿真裕量过大,那么所有硬盘同时和磁盘阵列控制器之间有大数据的突发交互时,所有硬盘加起来后所消耗的电流会非常大,导致过度设计,进而导致干路的可恢复保险丝(electronic Fuse简称eFuse)选型规格过大,硬件设计整体成本上升。基于此,这种传统的电源设计架构对服务器系统厂商来说带来了一定的挑战。In the current traditional storage server design, there is a problem in the power structure of the hard disk that stores data, that is, all mechanical hard disks (Hard Disk Drive, HDD) and the disk array (Redundant Arrays of Independent Disks, Raid) controller have large data During fast interaction or burst (burst), the mechanical hard disk will draw a large current from the backplane, and then there will be a large voltage drop (rapid drop in voltage) between a single hard disk and the entire main circuit. In the simulation stage of the power supply engineer, This kind of real business scenario cannot be accurately achieved or infinitely approached. Therefore, if the power simulation margin of a single hard disk is insufficient, the designed server will sometimes have the risk of disk failure, which will cause certain damage to application data. , if the simulation margin of a single hard disk power supply is too large, then when all hard disks interact with the disk array controller at the same time with a burst of large data, the current consumed by all the hard disks will be very large, resulting in over-design, which will lead to The resettable fuse (electronic Fuse, eFuse for short) of the main circuit is too large in size, and the overall cost of hardware design increases. Based on this, this traditional power supply design architecture brings certain challenges to server system manufacturers.
发明内容Contents of the invention
本发明实施例是提供一种电源架构、一种电源监控方法、一种电源监控装置、一种电子设备以及一种计算机可读存储介质,以解决或部分解决电源设计架构中机械硬盘在和磁盘阵列控制器之间有大数据的快速交互或者突发时而产生较大的压降的情况下,可能会导致产生掉盘或对应用数据造成破坏,以及对架构过度设计导致成本上升的问题。Embodiments of the present invention provide a power supply framework, a power supply monitoring method, a power supply monitoring device, an electronic device, and a computer-readable storage medium, so as to solve or partially solve the mechanical hard disk and magnetic disk problems in the power supply design architecture. In the case of fast interaction of large data or sudden large voltage drop between array controllers, it may cause disk failure or damage to application data, as well as cost increase due to over-design of the architecture.
本发明实施例公开了一种电源架构,包括:The embodiment of the present invention discloses a power architecture, including:
设置在背板上的电压监控芯片,所述电压监控芯片与设置在主板上的基板管理控制器连接;所述电压监控芯片用于监控所述背板上至少一个硬盘插槽的电压。A voltage monitoring chip arranged on the backboard, the voltage monitoring chip is connected with a baseboard management controller arranged on the main board; the voltage monitoring chip is used for monitoring the voltage of at least one hard disk slot on the backboard.
可选地,所述背板还设置有可恢复保险丝,所述可恢复保险丝和设置在所述主板上的电压调节芯片连接;用于将所述电压调节芯片向所述可恢复保险丝发送的电压转换为所述硬盘插槽所需要的电压。Optionally, the backplane is also provided with a resettable fuse, and the resettable fuse is connected to the voltage regulation chip provided on the main board; it is used to send the voltage sent by the voltage regulation chip to the resettable fuse. Convert to the voltage required by the hard disk slot.
可选地,所述电压监控芯片设置在远离所述背板上的可恢复保险丝的所述硬盘插槽旁。Optionally, the voltage monitoring chip is arranged next to the hard disk slot away from the resettable fuse on the backplane.
可选地,所述主板上还设置有顺次连接的中央处理器、平台控制器与至少一个闪存颗粒。Optionally, the main board is further provided with a central processing unit, a platform controller and at least one flash memory particle connected in sequence.
可选地,所述主板上还设置有磁盘阵列控制器,所述中央处理器通过第一链路和所述磁盘阵列控制器连接,所述磁盘阵列控制器用于将所述第一链路转换为第二链路,并通过所述第二链路连接所述硬盘插槽中的硬盘。Optionally, a disk array controller is also provided on the motherboard, the central processing unit is connected to the disk array controller through a first link, and the disk array controller is used to convert the first link to is a second link, and is connected to the hard disk in the hard disk slot through the second link.
可选地,所述背板上还设置有电源连接器,所述电源连接器通过所述可恢复保险丝为所述硬盘供电。Optionally, a power connector is further arranged on the backplane, and the power connector supplies power to the hard disk through the resettable fuse.
可选地,所述硬盘插槽的电压为所述硬盘插槽中的硬盘的工作电压。Optionally, the voltage of the hard disk slot is the working voltage of the hard disk in the hard disk slot.
本发明实施例还提供了一种电源监控方法,所述方法包括:The embodiment of the present invention also provides a power monitoring method, the method comprising:
通过电压监控芯片监控硬盘插槽的电压;Monitor the voltage of the hard disk slot through the voltage monitoring chip;
通过基板管理控制器判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;judging by the baseboard management controller whether to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller;
通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令。An instruction to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller is issued by the central processing unit.
可选地,在所述通过电压监控芯片监控硬盘插槽的电压步骤之后,所述方法还包括:Optionally, after the step of monitoring the voltage of the hard disk slot through the voltage monitoring chip, the method further includes:
当所述电压监控芯片监控所述硬盘插槽的电压低于预设电压阈值时,通过所述电压监控芯片向所述基板管理控制器发送中断信号。When the voltage monitoring chip monitors that the voltage of the hard disk slot is lower than a preset voltage threshold, an interrupt signal is sent to the baseboard management controller through the voltage monitoring chip.
可选地,所述通过基板管理控制器判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程步骤包括:Optionally, the step of judging whether to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller through the baseboard management controller includes:
向所述电压监控芯片发送电压请求以获取所述电压监控芯片监控的所述硬盘插槽的电压,并将所述基板管理控制器获取到的所述硬盘插槽的电压作为校验电压;Sending a voltage request to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, and using the voltage of the hard disk slot obtained by the baseboard management controller as a verification voltage;
根据所述校验电压判断所述电压监控芯片发送的所述中断信号是否为误报以判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;judging whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage, so as to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller;
当所述中断信号为非误报时,确定中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the interrupt signal is not a false positive, it is determined to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller.
可选地,所述根据所述校验电压判断所述电压监控芯片发送的所述中断信号是否为误报步骤,包括:Optionally, the step of judging whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage includes:
若所述校验电压小于预设电压阈值,则确定所述中断信号为非误报。If the verification voltage is less than a preset voltage threshold, it is determined that the interruption signal is not a false alarm.
可选地,在所述当所述中断信号为非误报时,确定中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程步骤之后,所述方法还包括:Optionally, after the step of determining to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller when the interrupt signal is not a false positive, the method further includes:
通过所述基板管理控制器向平台控制器发送所述中断信号对应的中断方式;其中,所述中断方式为中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的方式;The baseboard management controller sends an interrupt mode corresponding to the interrupt signal to the platform controller; wherein the interrupt mode is a mode of interrupting the interactive process between the hard disk in the hard disk slot and the disk array controller;
当所述平台控制器接收到所述中断方式时,通过所述平台控制器向中央处理器上报所述中断方式。When the platform controller receives the interrupt mode, the platform controller reports the interrupt mode to the central processing unit.
可选地,所述通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤,包括:Optionally, the step of sending instructions to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller through the central processing unit includes:
当所述中央处理器接收到所述中断方式时,通过所述中央处理器向磁盘阵列控制器下发所述中断方式对应的中断指令以使所述磁盘阵列控制器中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller so that the disk array controller interrupts the operation of the hard disk slot. Interaction process between hard disk and disk array controller.
可选地,所述中央处理器上设置有计算机管理控制程序,所述计算机管理控制程序用于向所述磁盘阵列控制器下发所述中断方式对应的中断指令。Optionally, the central processing unit is provided with a computer management control program, and the computer management control program is used to issue an interrupt command corresponding to the interrupt mode to the disk array controller.
可选地,所述基板管理控制器设置有针对所述基板管理控制器的固件,所述方法还包括:Optionally, the baseboard management controller is provided with firmware for the baseboard management controller, and the method further includes:
当所述基板管理控制器运行所述基板管理控制器的固件时,通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号。When the baseboard management controller runs the firmware of the baseboard management controller, the firmware monitors whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller.
可选地,所述方法还包括:Optionally, the method also includes:
在所述基板管理控制器的固件中设置针对所述中断信号的优先级;setting a priority for the interrupt signal in firmware of the baseboard management controller;
当所述基板管理控制器接收到所述中断信号时,根据所述优先级对所述中断信号进行处理。When the baseboard management controller receives the interrupt signal, it processes the interrupt signal according to the priority.
可选地,所述方法还包括:Optionally, the method also includes:
若所述中断信号为误报,则重新执行所述通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号的步骤。If the interrupt signal is a false positive, re-execute the step of monitoring whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller through the firmware.
可选地,所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程为所述磁盘阵列控制器与所述硬盘插槽中的硬盘进行数据交互,在所述通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤之后,所述方法还包括:Optionally, the interaction process between the hard disk in the hard disk slot and the disk array controller is data interaction between the disk array controller and the hard disk in the hard disk slot. After the instruction step of the hard disk of the hard disk slot and the interactive process of the disk array controller, the method also includes:
将所述磁盘阵列控制器与所述硬盘插槽中硬盘之间停止交互的数据存储于所述磁盘阵列控制器。The data of the stop interaction between the disk array controller and the hard disk in the hard disk slot is stored in the disk array controller.
本发明实施例还提供了一种电源监控装置,所述装置包括:The embodiment of the present invention also provides a power monitoring device, which includes:
电压监控模块,用于通过电压监控芯片监控硬盘插槽的电压;The voltage monitoring module is used to monitor the voltage of the hard disk slot through the voltage monitoring chip;
进程判断模块,用于通过基板管理控制器判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;A process judging module, configured to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller through the baseboard management controller;
指令下发模块,用于通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令。The instruction sending module is used to send an instruction to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller through the central processing unit.
可选地,所述装置还包括:Optionally, the device also includes:
信号发送模块,用于当所述电压监控芯片监控所述硬盘插槽的电压低于预设电压阈值时,通过所述电压监控芯片向所述基板管理控制器发送中断信号。A signal sending module, configured to send an interrupt signal to the baseboard management controller through the voltage monitoring chip when the voltage of the hard disk slot monitored by the voltage monitoring chip is lower than a preset voltage threshold.
可选地,所述进程判断模块具体用于:Optionally, the process judging module is specifically used for:
向所述电压监控芯片发送电压请求以获取所述电压监控芯片监控的所述硬盘插槽的电压,并将所述基板管理控制器获取到的所述硬盘插槽的电压作为校验电压;Sending a voltage request to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, and using the voltage of the hard disk slot obtained by the baseboard management controller as a verification voltage;
根据所述校验电压判断所述电压监控芯片发送的所述中断信号是否为误报以判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;judging whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage, so as to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller;
当所述中断信号为非误报时,确定中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the interrupt signal is not a false positive, it is determined to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller.
可选地,所述装置还包括:Optionally, the device also includes:
非误报确定模块,用于若所述校验电压小于预设电压阈值,则确定所述中断信号为非误报。A non-false alarm determining module, configured to determine that the interruption signal is not a false alarm if the verification voltage is lower than a preset voltage threshold.
可选地,所述装置还包括:Optionally, the device also includes:
中断方式发送模块,用于通过所述基板管理控制器向平台控制器发送所述中断信号对应的中断方式;其中,所述中断方式为中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的方式;An interrupt mode sending module, configured to send an interrupt mode corresponding to the interrupt signal to the platform controller through the baseboard management controller; wherein, the interrupt mode is to interrupt the interaction between the hard disk in the hard disk slot and the disk array controller the manner of the process;
中断方式上报模块,用于当所述平台控制器接收到所述中断方式时,通过所述平台控制器向中央处理器上报所述中断方式。The interrupt mode reporting module is configured to report the interrupt mode to the central processing unit through the platform controller when the platform controller receives the interrupt mode.
可选地,所述指令下发模块具体用于:Optionally, the instruction issuing module is specifically used for:
当所述中央处理器接收到所述中断方式时,通过所述中央处理器向磁盘阵列控制器下发所述中断方式对应的中断指令以使所述磁盘阵列控制器中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller so that the disk array controller interrupts the operation of the hard disk slot. Interaction process between hard disk and disk array controller.
可选地,所述基板管理控制器设置有针对所述基板管理控制器的固件,所述装置还包括:Optionally, the baseboard management controller is provided with firmware for the baseboard management controller, and the device further includes:
信号监听模块,用于当所述基板管理控制器运行所述基板管理控制器的固件时,通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号。A signal monitoring module, configured to monitor whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller through the firmware when the baseboard management controller is running the firmware of the baseboard management controller.
可选地,所述装置还包括:Optionally, the device also includes:
优先级设置模块,用于在所述基板管理控制器的固件中设置针对所述中断信号的优先级;a priority setting module, configured to set the priority for the interrupt signal in the firmware of the baseboard management controller;
信号处理模块,用于当所述基板管理控制器接收到所述中断信号时,根据所述优先级对所述中断信号进行处理。A signal processing module, configured to process the interrupt signal according to the priority when the baseboard management controller receives the interrupt signal.
可选地,所述装置还包括:Optionally, the device also includes:
监听重新执行模块,用于若所述中断信号为误报,则重新执行所述通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号的步骤。A monitoring and re-executing module, configured to re-execute the step of monitoring whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller through the firmware if the interrupt signal is a false positive.
可选地,所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程为所述磁盘阵列控制器与所述硬盘插槽中的硬盘进行数据交互,所述装置还包括:Optionally, the interaction process between the hard disk in the hard disk slot and the disk array controller is data interaction between the disk array controller and the hard disk in the hard disk slot, and the device further includes:
数据存储模块,用于将所述磁盘阵列控制器与所述硬盘插槽中硬盘之间停止交互的数据存储于所述磁盘阵列控制器。The data storage module is configured to store in the disk array controller the data of the stop interaction between the disk array controller and the hard disk in the hard disk slot.
本发明实施例还公开了一种电子设备,包括处理器、通信接口、存储器和通信总线,其中,所述处理器、所述通信接口以及所述存储器通过所述通信总线完成相互间的通信;The embodiment of the present invention also discloses an electronic device, including a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory complete mutual communication through the communication bus;
所述存储器,用于存放计算机程序;The memory is used to store computer programs;
所述处理器,用于执行存储器上所存放的程序时,实现如本发明实施例所述的电源监控方法。The processor is used to implement the power monitoring method according to the embodiment of the present invention when executing the program stored in the memory.
本发明实施例还公开了一种计算机可读存储介质,其上存储有指令,当由一个或多个处理器执行时,使得所述处理器执行如本发明实施例所述的电源监控方法。The embodiment of the present invention also discloses a computer-readable storage medium, on which instructions are stored, and when executed by one or more processors, the processors execute the power monitoring method according to the embodiment of the present invention.
本发明实施例包括以下优点:Embodiments of the present invention include the following advantages:
在本发明实施例中,电源架构包括设置在背板上的电压监控芯片,电压监控芯片与设置在主板上的基板管理控制器连接;电压监控芯片用于监控背板上至少一个硬盘插槽的电压。在本发明实施例中,通过设置在背板上的电压监控芯片,能够实时监测到背板上硬盘插槽的电压,可以有效地降低存储系统掉盘故障率并有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉。In the embodiment of the present invention, the power supply architecture includes a voltage monitoring chip arranged on the backplane, and the voltage monitoring chip is connected to a baseboard management controller arranged on the motherboard; the voltage monitoring chip is used to monitor the voltage of at least one hard disk slot on the backplane Voltage. In the embodiment of the present invention, the voltage of the hard disk slots on the backplane can be monitored in real time through the voltage monitoring chip on the backplane, which can effectively reduce the failure rate of disk failure in the storage system and effectively protect user data. The reliability of the whole server can be greatly improved, and at the same time, the architecture mentioned in the embodiment of the present invention is simple, easy to implement, high in realizability, and low in cost.
附图说明Description of drawings
图1是本发明实施例中提供的一种电源架构示意图之一;FIG. 1 is one of a schematic diagram of a power supply architecture provided in an embodiment of the present invention;
图2是本发明实施例中提供的一种电源架构示意图之二;Fig. 2 is a second schematic diagram of a power supply architecture provided in an embodiment of the present invention;
图3是本发明实施例中提供的一种电源监控方法的步骤流程图;FIG. 3 is a flowchart of steps of a power monitoring method provided in an embodiment of the present invention;
图4是本发明实施例中提供的一种电源监控方法的流程示意图;FIG. 4 is a schematic flowchart of a power monitoring method provided in an embodiment of the present invention;
图5是本发明实施例中提供的一种电源监控装置的结构框图;Fig. 5 is a structural block diagram of a power monitoring device provided in an embodiment of the present invention;
图6是本发明实施例中提供的一种计算机可读存储介质的结构示意图;FIG. 6 is a schematic structural diagram of a computer-readable storage medium provided in an embodiment of the present invention;
图7是实现本发明各个实施例的一种电子设备的硬件结构示意图。Fig. 7 is a schematic diagram of a hardware structure of an electronic device implementing various embodiments of the present invention.
具体实施方式Detailed ways
为使本发明的上述目的、特征和优点能够更加明显易懂,下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.
为了使本领域技术人员更好地理解本发明实施例的技术方案,下面对本发明实施例中涉及的部分技术特征进行解释、说明:In order to enable those skilled in the art to better understand the technical solutions of the embodiments of the present invention, some technical features involved in the embodiments of the present invention are explained below:
基板管理控制器(Baseboard Management Controller 简称BMC),其为嵌入在计算机(通常是服务器)主板上的专用微控制器,BMC负责管理系统管理软件和平台硬件之间的接口。Baseboard Management Controller (BMC for short), which is a dedicated microcontroller embedded on the motherboard of a computer (usually a server), BMC is responsible for managing the interface between the system management software and the platform hardware.
时钟产生器(Clock Generator),其为同步时序电路产生时钟信号的器件或装置。Clock Generator (Clock Generator), which is a device or device that generates clock signals for synchronous sequential circuits.
时钟扇出器(Clock buffer),其通常是指基于非PLL(Phase-locked loops锁相环)的扇出型缓冲器,是一种将一路时钟源信号通过频率复制生成多路时钟信号的器件,通常时钟扇出器还兼具有时钟分配,格式转换和电平转换的功能。A clock fanout (Clock buffer), which usually refers to a fanout buffer based on a non-PLL (Phase-locked loops phase-locked loop), is a device that generates multiple clock signals through frequency replication of one clock source signal , usually the clock fan-out also has the functions of clock distribution, format conversion and level conversion.
高速串行计算机扩展总线(peripheral component interconnect express 简称PCIE),其为一种高速串行计算机扩展总线标准,PCIE属于高速串行点对点双通道高带宽传输,所连接的设备分配独享通道带宽,不共享总线带宽,主要支持主动电源管理,错误报告,端对端的可靠性传输,热插拔以及服务质量等功能。High-speed serial computer expansion bus (peripheral component interconnect express referred to as PCIE), which is a high-speed serial computer expansion bus standard, PCIE belongs to high-speed serial point-to-point dual-channel high-bandwidth transmission, and the connected devices are allocated exclusive channel bandwidth, no Shared bus bandwidth mainly supports functions such as active power management, error reporting, end-to-end reliable transmission, hot swapping, and quality of service.
固件(firmware 简称FW),其为一种写入可擦写存储器中的底层软件,在芯片上电后运行固件来初始化芯片及其外围设备。Firmware (firmware referred to as FW), which is a low-level software written in rewritable memory, runs the firmware to initialize the chip and its peripherals after the chip is powered on.
中央处理器(Central Processing Unit 简称CPU),其作为计算机系统的运算和控制核心,是信息处理、程序运行的最终执行单元。The central processing unit (Central Processing Unit referred to as CPU), as the computing and control core of the computer system, is the final execution unit of information processing and program operation.
图形处理单元(Graphics Processing Unit 简称GPU),其又称显示核心、视觉处理器、显示芯片,是一种专门在个人电脑、工作站、游戏机和一些移动设备(如平板电脑、智能手机等)上做图像和图形相关运算工作的微处理器。Graphics Processing Unit (Graphics Processing Unit referred to as GPU), also known as display core, visual processor, display chip, is a special graphics processing unit on personal computers, workstations, game consoles and some mobile devices (such as tablets, smartphones, etc.) A microprocessor that does image and graphics-related operations.
可编程门阵列(Field Programmable Gate Array 简称FPGA),其为在可编程阵列逻辑、通用阵列逻辑等可编程器件的基础上进一步发展的产物。它是作为专用集成电路领域中的一种半定制电路而出现的,既解决了定制电路的不足,又克服了原有可编程器件门电路数有限的缺点。Programmable gate array (Field Programmable Gate Array referred to as FPGA), which is a product of further development on the basis of programmable devices such as programmable array logic and general array logic. It appeared as a semi-custom circuit in the field of application-specific integrated circuits, which not only solved the shortcomings of custom circuits, but also overcome the shortcomings of the limited number of original programmable device gates.
系统管理总线(System Management Bus 简称SMBUS),其应用于移动电脑和桌面电脑系统中的低速率通讯,希望可以通过一条廉价并且功能强大的总线(由两条线组成),来控制主板上的设备并收集相应的信息。System Management Bus (SMBUS for short), which is used for low-speed communication in mobile computers and desktop computer systems, hopes to control the devices on the motherboard through a cheap and powerful bus (composed of two lines) and collect relevant information.
作为一种示例,在目前传统的的存储服务器设计中,CPU和PCH(PlatformController Hub 平台控制器)通过DMI(Direct Media Interface 直接媒体接口)链路互连,PCH通过SPI(Serial Peripheral Interface 串行外设接口)链路和第一Flash(闪存)颗粒互连,第一闪存颗粒中存储了BIOS(Basic Input Output System 基本输入输出系统)的FW(固件),BMC(基板管理控制器)也是通过SPI链路和第二闪存颗粒互连,第二闪存颗粒中存储了BMC的FW(固件),CPU通过PCIE链路和Raid控制器互连,Raid控制器可以用于协议转换,具体地,将PCIE链路转换成SAS/SATA(硬盘接口类型)链路,然后通过SAS/SATA链路连接到主板上的连接器,再通过线缆(cable)连接到背板(Backplane)上的连接器,最后连接到背板上的机械硬盘(HDD);主板上的电压调节芯片(VR Chip)可以输出P12V的电源连接到主板上的连接器,再通过线缆连接到背板上的保险丝(eFuse),保险丝可以将P12V的电源转换为背板上插槽上插的机械硬盘所需要的P12V和P5V电源;同时,BMC和PCH之间可以通过IPMI(Intelligent Platform Management Interface 智能平台管理接口)链路互连,BMC再通过I2C(Inter-Integrated Circuit 双向二线制同步串行总线)链路连接到主板上的连接器,再通线缆连接到背板上的保险丝,对背板上的干路的电源进行电压和电流监控。As an example, in the current traditional storage server design, CPU and PCH (PlatformController Hub platform controller) are interconnected through DMI (Direct Media Interface Direct Media Interface) link, and PCH is connected through SPI (Serial Peripheral Interface) Interface) link and the first Flash (flash memory) particles are interconnected, the FW (firmware) of the BIOS (Basic Input Output System) is stored in the first flash memory particles, and the BMC (baseboard management controller) is also connected through the SPI The link is interconnected with the second flash memory particle, the FW (firmware) of the BMC is stored in the second flash memory particle, the CPU is interconnected with the Raid controller through the PCIE link, and the Raid controller can be used for protocol conversion, specifically, the PCIE The link is converted into a SAS/SATA (hard disk interface type) link, and then connected to the connector on the motherboard through the SAS/SATA link, and then connected to the connector on the backplane (Backplane) through the cable (cable), and finally Connect to the mechanical hard disk (HDD) on the backplane; the voltage regulation chip (VR Chip) on the motherboard can output P12V power to the connector on the motherboard, and then connect to the fuse (eFuse) on the backplane through a cable. The fuse can convert the P12V power to the P12V and P5V power required by the mechanical hard disk inserted in the slot on the backplane; at the same time, the BMC and PCH can be interconnected through the IPMI (Intelligent Platform Management Interface) link , BMC is connected to the connector on the motherboard through the I2C (Inter-Integrated Circuit bidirectional two-wire synchronous serial bus) link, and then connected to the fuse on the backplane through the cable, and the power supply of the main circuit on the backplane is controlled. voltage and current monitoring.
但是这种存储服务器上背板的电源架构还在开发阶段,由于机箱内以及背板上空间的问题,一般会将背板上传输P12V的电源连接器放置在背板的两端,电源连接器再经过PCB(Process Control Block进程管理块)传输,分别给插槽上的机械硬盘供电,具体每个插槽上的压降以及电流通路由电源工程师提前进行仿真,在目前传统的存储服务器设计中,存储数据的硬盘的电源架构存在一个问题,即:所有机械硬盘在和磁盘阵列控制器之间有大数据的快速交互或者突发时,机械硬盘会从背板抽取大电流,进而单个硬盘和整个干路之间会有较大的压降(电压迅速跌落)产生,而电源工程师仿真阶段,无法准确的做到或者无法无限逼近这种真实的业务场景,因此如果单个硬盘的电源仿真裕量不足,会导致设计出来的服务器某些时候存在掉盘的风险,进而对应用数据造成一定的破坏,如果单个硬盘电源仿真裕量过大,那么所有硬盘同时和磁盘阵列控制器之间有大数据的突发交互时,所有硬盘加起来后所消耗的电流会非常大,导致过度设计,进而导致干路的保险丝选型规格过大,硬件设计整体成本上升。基于此,这种传统的电源设计架构对服务器系统厂商来说带来了一定的挑战。However, the power architecture of the backplane on this storage server is still in the development stage. Due to space issues in the chassis and on the backplane, the power connectors for transmitting P12V on the backplane are generally placed at both ends of the backplane. Then through the PCB (Process Control Block process management block) transmission, the mechanical hard disk on the slot is powered respectively. The specific voltage drop and current path on each slot are simulated in advance by the power engineer. In the current traditional storage server design , there is a problem in the power structure of the hard disk that stores data, that is: when all mechanical hard disks interact with the disk array controller quickly or suddenly, the mechanical hard disk will draw a large current from the backplane, and then a single hard disk and There will be a large voltage drop (rapid voltage drop) between the entire main circuit, and the simulation stage of the power supply engineer cannot accurately achieve or infinitely approach this real business scenario, so if the power supply simulation margin of a single hard disk Insufficient, it will lead to the risk of disk failure in the designed server at certain times, which will cause certain damage to application data. If the power simulation margin of a single hard disk is too large, then there will be large data between all hard disks and the disk array controller at the same time. When there is a sudden interaction of all hard disks, the current consumed by all the hard disks will be very large, resulting in over-design, which in turn leads to excessive selection of fuses for the main circuit, and an increase in the overall cost of hardware design. Based on this, this traditional power supply design architecture brings certain challenges to server system manufacturers.
对此,本发明的核心发明点之一在于电源架构包括设置在背板上的电压监控芯片,电压监控芯片与设置在主板上的基板管理控制器连接,其中,电压监控芯片用于监控背板上至少一个硬盘插槽的电压。在本发明实施例中,通过设置在背板上的电压监控芯片,能够实时监测到背板上硬盘插槽的电压,从而可以根据实时监测到的硬盘插槽的电压有效地降低存储系统掉盘故障率并有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉。In this regard, one of the core inventive points of the present invention is that the power supply architecture includes a voltage monitoring chip set on the backplane, and the voltage monitoring chip is connected to a baseboard management controller set on the mainboard, wherein the voltage monitoring chip is used to monitor the backplane voltage of at least one hard disk slot on the In the embodiment of the present invention, the voltage of the hard disk slots on the backplane can be monitored in real time through the voltage monitoring chip on the backplane, so that the disk failure of the storage system can be effectively reduced according to the real-time monitored voltage of the hard disk slots. The failure rate and effective protection of user data can further improve the reliability of the server as a whole. At the same time, the architecture mentioned in the embodiments of the present invention is simple, easy to implement, high in realizability, and low in cost.
参照图1,示出了本发明实施例中提供的一种电源架构示意图之一;电源架构包括设置在背板上的电压监控芯片,所述电压监控芯片与设置在主板上的基板管理控制器连接;所述电压监控芯片用于监控所述背板上至少一个硬盘插槽的电压。Referring to FIG. 1 , it shows one of the schematic diagrams of a power supply architecture provided in an embodiment of the present invention; the power supply architecture includes a voltage monitoring chip arranged on a backplane, and the voltage monitoring chip and a baseboard management controller arranged on a mainboard connected; the voltage monitoring chip is used to monitor the voltage of at least one hard disk slot on the backboard.
其中,电源架构包括主板和背板,背板上设置有电压监控芯片,主板上设置有基板管理控制器;电压监控芯片与设置在主板上的基板管理控制器连接,电压监控芯片用于监控背板上至少一个硬盘插槽的电压。Among them, the power supply architecture includes a mainboard and a backplane. The backplane is provided with a voltage monitoring chip, and the mainboard is provided with a baseboard management controller. The voltage monitoring chip is connected with the baseboard management controller provided on the mainboard. The voltage of at least one hard disk slot on the board.
对于电压监控芯片,其可以为ADC(Analog-to-digital converter模拟数字转换器),其可以用于监控硬盘插槽的电压;其中,硬盘插槽上可以插入机械硬盘,对于硬盘插槽的电压,其可以理解为硬盘插槽上的机械硬盘在正常工作时的电压,例如,在通常的情况下,机械硬盘的电压可以为12V或偏高于12V(如12.1V或12.2V),可以理解的是,对于机械硬盘正常工作的电压取值,本领域技术人员可以根据实际情况进行调整,本发明实施例对此不作限制。For the voltage monitoring chip, it can be an ADC (Analog-to-digital converter analog-to-digital converter), which can be used to monitor the voltage of the hard disk slot; wherein, a mechanical hard disk can be inserted into the hard disk slot, and the voltage of the hard disk slot , which can be understood as the voltage of the mechanical hard disk on the hard disk slot during normal operation. For example, under normal circumstances, the voltage of the mechanical hard disk can be 12V or higher than 12V (such as 12.1V or 12.2V). It can be understood It is worth noting that those skilled in the art can adjust the voltage value for the normal operation of the mechanical hard disk according to the actual situation, which is not limited in the embodiment of the present invention.
对于机械硬盘,其为一种插放于背板上的硬盘插槽上的硬盘,可以用于和磁盘阵列控制器进行数据的交互。As for the mechanical hard disk, it is a hard disk inserted into the hard disk slot on the backplane, which can be used for data interaction with the disk array controller.
可选地,所述背板还设置有可恢复保险丝,所述可恢复保险丝和设置在所述主板上的电压调节芯片连接;用于将所述电压调节芯片向所述可恢复保险丝发送的电压转换为所述硬盘插槽所需要的电压。Optionally, the backplane is also provided with a resettable fuse, and the resettable fuse is connected to the voltage regulation chip provided on the main board; it is used to send the voltage sent by the voltage regulation chip to the resettable fuse. Convert to the voltage required by the hard disk slot.
其中,对于可恢复保险丝,在本发明实施例中,保险丝为一种可恢复的保险丝(eFuse),可实现性高,成本低,不同于一般的保险丝,一般的保险丝在过流后就熔断了,不可恢复,因而具备一定的可行性,而本发明实施例中的保险丝为可恢复保险丝,可恢复保险丝可以用于将电压调节芯片向可恢复保险丝发送的电压转换为硬盘插槽所需要的电源。其中,硬盘插槽所需要的电压通常指的是硬盘插槽上机械硬盘正常工作时的电压,可以理解的是,硬盘插槽的电压为机械硬盘的工作电压。Among them, for the recoverable fuse, in the embodiment of the present invention, the fuse is a recoverable fuse (eFuse), which has high feasibility and low cost, and is different from the general fuse, which is blown after overcurrent , is not recoverable, so it has certain feasibility, and the fuse in the embodiment of the present invention is a recoverable fuse, and the recoverable fuse can be used to convert the voltage sent by the voltage regulation chip to the recoverable fuse into the power required by the hard disk slot . Wherein, the voltage required by the hard disk slot usually refers to the voltage of the mechanical hard disk on the hard disk slot when it is working normally. It can be understood that the voltage of the hard disk slot is the working voltage of the mechanical hard disk.
在具体实现中,背板还设置有可恢复保险丝,可恢复保险丝和设置在主板上的电压调节芯片连接,可恢复保险丝用于将电压调节芯片向可恢复保险丝发送的电压转换为硬盘插槽所需要的电压。In a specific implementation, the backplane is also provided with a resettable fuse, which is connected to the voltage regulation chip arranged on the main board, and the resettable fuse is used to convert the voltage sent by the voltage regulation chip to the resettable fuse into the voltage required by the hard disk slot. required voltage.
可选地,所述电压监控芯片设置在远离所述背板上的可恢复保险丝的所述硬盘插槽旁。Optionally, the voltage monitoring chip is arranged next to the hard disk slot away from the resettable fuse on the backplane.
在具体实现中,电压监控芯片设置在远离背板上的可恢复保险丝的硬盘插槽旁,可以理解的是,电压监控芯片(可称为ADC)是设置在距离可恢复保险丝最远的硬盘插槽的旁边的一个芯片,由于与可恢复保险丝的距离最远,因此该距离最远的硬盘插槽分到的电压或产生的电流就会比较小,即距离可恢复保险丝最远的硬盘插槽上机械硬盘的工作电压可能会出现低于电压监控芯片设置的预设电压阈值,则该硬盘插槽所产生的压降会是最大的,因此,当电压监控芯片检测到机械硬盘的工作电压出现低于电压监控芯片设置的预设电压阈值时,其他的硬盘插槽也同样会出现此类压降的问题。In a specific implementation, the voltage monitoring chip is set next to the hard disk slot away from the resettable fuse on the backplane. A chip next to the slot, because the distance from the resettable fuse is the farthest, the voltage or current generated by the farthest hard disk slot will be relatively small, that is, the hard disk slot farthest from the resettable fuse If the operating voltage of the mechanical hard disk on the computer may be lower than the preset voltage threshold set by the voltage monitoring chip, the voltage drop generated by the hard disk slot will be the largest. Therefore, when the voltage monitoring chip detects that the operating voltage of the mechanical hard disk has When it is lower than the preset voltage threshold set by the voltage monitoring chip, other hard disk slots will also have this kind of voltage drop problem.
可选地,主板上还设置有顺次连接的中央处理器、平台控制器与至少一个闪存颗粒。Optionally, the main board is further provided with a central processing unit, a platform controller and at least one flash memory particle connected in sequence.
其中,闪存颗粒可以包含第一闪存颗粒,第二闪存颗粒,其中,平台控制器和第一闪存颗粒连接,基板管理控制器和第二闪存颗粒连接。Wherein, the flash memory particle may include a first flash memory particle and a second flash memory particle, wherein the platform controller is connected to the first flash memory particle, and the baseboard management controller is connected to the second flash memory particle.
其中,第一闪存颗粒中存储有BIOS的FW(基本输入输出系统的固件),其也可以表示为PCH的FW(平台控制器的固件),其中,BIOS 的FW的作用是在系统上电开机后对底层硬件进行初始化,比如PCIE接口和链路初始化等,BIOS将上层驱动和底层硬件屏蔽了起来,从而不需要上层驱动程序过多地关注底层硬件的实现细节;其次,第二闪存颗粒中存储有BMC的FW(基板管理控制器的固件),BMC的FW指的是BMC要运行起来的驱动、OS Kernel(操作系统)和应用层程序的一个包,BMC是一个嵌入式操作系统,主要作用是让BMC运行起来进行整系统管理的功能,比如风扇散热等。通过利用固件对数据进行了完成的保护,提高了服务器整机的可靠性。Among them, the FW of the BIOS (firmware of the basic input and output system) is stored in the first flash memory particle, which can also be expressed as the FW of the PCH (firmware of the platform controller), wherein the function of the FW of the BIOS is to power on the system Finally, the underlying hardware is initialized, such as PCIE interface and link initialization, etc., and the BIOS shields the upper layer driver and the underlying hardware, so that the upper layer driver does not need to pay too much attention to the implementation details of the underlying hardware; secondly, in the second flash memory particle The FW of the BMC (the firmware of the baseboard management controller) is stored. The FW of the BMC refers to a package of the driver, the OS Kernel (operating system) and the application layer program that the BMC needs to run. The BMC is an embedded operating system. The function is to make the BMC run to perform the functions of the whole system management, such as fan cooling and so on. By using the firmware to complete the protection of the data, the reliability of the whole server is improved.
对于中央处理器,其可以为CPU,其作为计算机系统的运算和控制核心,是信息处理、程序运行的最终执行单元。其中,平台控制器(PCH)可以挂载一些低速设备和部分高速设备,通过DMI总线和CPU互联,控制上电和下电的时序、系统管理等功能,其还可以用于将高级别中断方式上报给CPU中运行的OS(operating system 操作系统)。As for the central processing unit, it may be a CPU, which is the computing and control core of the computer system and the final execution unit for information processing and program operation. Among them, the platform controller (PCH) can mount some low-speed devices and some high-speed devices, and interconnect with the CPU through the DMI bus to control the timing of power-on and power-off, system management and other functions. Report to the OS (operating system operating system) running in the CPU.
其中,CPU上运行有计算机管理控制程序,计算机管理控制程序也可以理解为一种操作系统,其可以为OS,OS可以处理一些高级别的事件,示例性地,平台控制器(PCH)可以将事件以高级别中断方式上报给CPU上运行的OS,OS可以对该事件进行处理。需要说明的是,可能因为距离背板上电源连接器最远的硬盘槽位的电压正在或者即将跌落出机械硬盘正常运行的电压范围外,因此其他一些硬盘槽位的电压也会出现同样的问题,因此该事件是最高级别的事件,需要OS优先处理。Wherein, a computer management control program runs on the CPU, and the computer management control program can also be understood as an operating system, which can be an OS, and the OS can handle some high-level events. For example, the platform controller (PCH) can The event is reported to the OS running on the CPU in the form of a high-level interrupt, and the OS can process the event. It should be noted that maybe because the voltage of the hard disk slot farthest from the power connector on the back panel is or is about to drop out of the normal operating voltage range of the mechanical hard disk, the voltage of some other hard disk slots will also have the same problem. , so this event is the highest-level event and requires priority processing by the OS.
可选地,所述主板上还设置有磁盘阵列控制器,所述中央处理器通过第一链路和所述磁盘阵列控制器连接,所述磁盘阵列控制器用于将所述第一链路转换为第二链路,并通过所述第二链路连接所述硬盘插槽中的硬盘。Optionally, a disk array controller is also provided on the motherboard, the central processing unit is connected to the disk array controller through a first link, and the disk array controller is used to convert the first link to is a second link, and is connected to the hard disk in the hard disk slot through the second link.
其中,第一链路可以为PCIE链路,第二链路可以为SAS/SATA(硬盘接口类型)链路,通常SAS/SATA链路为机械硬盘常用的链路。Wherein, the first link may be a PCIE link, and the second link may be a SAS/SATA (hard disk interface type) link. Usually, the SAS/SATA link is a link commonly used by mechanical hard disks.
在具体实现中,中央处理器通过第一链路和磁盘阵列控制器连接,磁盘阵列控制器用于将第一链路转换为第二链路,并通过第二链路连接硬盘插槽中的硬盘,具体地,中央处理器通过PCIE链路和磁盘阵列控制器连接,磁盘阵列控制器可以将PCIE链路转换成SAS/SATA链路,然后通过SAS/SATA链路连接到硬盘插槽中的硬盘。In a specific implementation, the central processing unit is connected to the disk array controller through the first link, and the disk array controller is used to convert the first link into the second link, and connect the hard disk in the hard disk slot through the second link Specifically, the central processing unit is connected to the disk array controller through a PCIE link, and the disk array controller can convert the PCIE link into a SAS/SATA link, and then connect to the hard disk in the hard disk slot through the SAS/SATA link .
可选地,背板上还设置有电源连接器,电源连接器通过可恢复保险丝为硬盘供电。Optionally, a power connector is also provided on the backplane, and the power connector supplies power to the hard disk through a resettable fuse.
为了使本领域技术人员更好地理解本发明实施例的技术方案,下面通过一个例子进行示例性说明:In order to enable those skilled in the art to better understand the technical solutions of the embodiments of the present invention, an example is used to illustrate the following:
参照图2,示出了本发明实施例中提供的一种电源架构示意图之二;如图2所示,与传统设计方案比较,在本发明所提及的电源架构中,新增部分如下:假设图2中Slotn(硬盘插槽n)这个槽位距离可恢复保险丝最远,则Slotn槽位的压降也最大,新的电源架构仅需要在该槽位旁边放置一颗电压监控芯片对Slotn槽位的电压进行监控。由图2可知,电源架构包括主板和背板,主板包含中央处理器、平台控制器、基板管理控制器、电压调节芯片、磁盘阵列控制器、电压调节芯片和连接器,背板包含多个硬盘插槽,硬盘插槽上插有机械硬盘,此外,背板上还包括电源连接器和连接器,具体地,电压监控芯片可以通过I2C链路连接到背板上的连接器,进一步连接到上行主板上的基板管理控制器芯片,同时,电压监控芯片可以发出硬件init(中断)信号,将中断信号连接到背板的连接器上,通过cable(线缆)连接到主板连接器,再连接到主板上的基板管理控制器芯片,中央处理器和平台控制器通过DMI链路互连,PCH通过SPI链路和第一闪存颗粒互连,基板管理控制器也是通过SPI链路和第二闪存颗粒互连,中央处理器通过PCIE链路和磁盘阵列控制器互连,磁盘阵列控制器可以用于将PCIE链路转换成SAS/SATA链路,然后通过SAS/SATA链路连接到主板上的连接器,再通过线缆(cable)连接到背板(Backplane)上的连接器,最后连接到背板上的机械硬盘;主板上的电压调节芯片(VR Chip)可以输出P12V的电源连接到主板上的连接器,再通过线缆连接到背板上的可恢复保险丝(eFuse),保险丝可以将P12V的电源转换为背板上插槽上插的机械硬盘所需要的P12V和P5V电源;同时,基板管理控制器和平台控制器之间可以通过IPMI(Intelligent Platform Management Interface 智能平台管理接口)链路互连,基板管理控制器再通过I2C链路连接到主板上的连接器,再通线缆连接到背板上的保险丝,对背板上的干路的电源进行电压和电流监控。Referring to Figure 2, it shows the second schematic diagram of a power supply architecture provided in the embodiment of the present invention; as shown in Figure 2, compared with the traditional design scheme, in the power supply architecture mentioned in the present invention, the new parts are as follows: Assuming that the slot Slotn (hard disk slot n) in Figure 2 is the farthest away from the resettable fuse, the voltage drop in the slot Slotn is also the largest, and the new power architecture only needs to place a voltage monitoring chip next to the slot for Slotn The voltage of the slot is monitored. As can be seen from Figure 2, the power supply architecture includes a motherboard and a backplane. The motherboard includes a central processing unit, a platform controller, a baseboard management controller, a voltage regulator chip, a disk array controller, a voltage regulator chip and connectors, and the backplane includes multiple hard disks. The mechanical hard disk is inserted into the hard disk slot. In addition, the backplane also includes power connectors and connectors. Specifically, the voltage monitoring chip can be connected to the connector on the backplane through an I2C link, and further connected to the upstream The baseboard management controller chip on the motherboard, at the same time, the voltage monitoring chip can send a hardware init (interrupt) signal, connect the interrupt signal to the connector on the backplane, connect to the motherboard connector through a cable (cable), and then connect to the The baseboard management controller chip on the motherboard, the central processing unit and the platform controller are interconnected through the DMI link, the PCH is interconnected with the first flash memory particle through the SPI link, and the baseboard management controller is also connected with the second flash memory particle through the SPI link Interconnection, the central processing unit is interconnected with the disk array controller through the PCIE link, the disk array controller can be used to convert the PCIE link into a SAS/SATA link, and then connect to the connection on the motherboard through the SAS/SATA link Then connect to the connector on the backplane (Backplane) through the cable (cable), and finally connect to the mechanical hard disk on the backplane; the voltage regulation chip (VR Chip) on the motherboard can output P12V power to connect to the motherboard connector, and then connected to the recoverable fuse (eFuse) on the backplane through a cable, the fuse can convert the P12V power supply to the P12V and P5V power supply required by the mechanical hard disk inserted in the slot on the backplane; at the same time, the substrate The management controller and the platform controller can be interconnected through the IPMI (Intelligent Platform Management Interface Intelligent Platform Management Interface) link, and the baseboard management controller is connected to the connector on the motherboard through the I2C link, and then connected to the The fuse on the backplane monitors the voltage and current of the power supply of the main circuit on the backplane.
对于电压调节芯片,其主要作用为将输入的某一个电压转换为输出电压,示例性地,通过电压调节芯片将P12V转换为P3V3。As for the voltage regulating chip, its main function is to convert a certain input voltage into an output voltage. For example, the voltage regulating chip converts P12V into P3V3.
在本发明实施例中,电源架构包括设置在背板上的电压监控芯片,电压监控芯片与设置在主板上的基板管理控制器连接,电压监控芯片用于监控背板上至少一个硬盘插槽的电压。在本发明实施例中,通过设置在背板上的电压监控芯片,能够实时监测到背板上硬盘插槽的电压,可以有效地降低存储系统掉盘故障率并有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉。In an embodiment of the present invention, the power supply architecture includes a voltage monitoring chip arranged on the backplane, the voltage monitoring chip is connected to a baseboard management controller arranged on the motherboard, and the voltage monitoring chip is used to monitor the voltage of at least one hard disk slot on the backplane. Voltage. In the embodiment of the present invention, the voltage of the hard disk slots on the backplane can be monitored in real time through the voltage monitoring chip on the backplane, which can effectively reduce the failure rate of disk failure in the storage system and effectively protect user data. The reliability of the whole server can be greatly improved, and at the same time, the architecture mentioned in the embodiment of the present invention is simple, easy to implement, high in realizability, and low in cost.
此外,电压监控芯片是设置在距离可恢复保险丝最远的硬盘插槽的旁边的一个芯片,由于与可恢复保险丝的距离最远,因此该距离最远的硬盘插槽分到的电压或产生的电流就会比较小,即距离可恢复保险丝最远的硬盘插槽上机械硬盘的工作电压可能会出现低于电压监控芯片设置的预设电压阈值,则该硬盘插槽所产生的压降会是最大的,因此,当电压监控芯片检测到机械硬盘的工作电压出现低于电压监控芯片设置的预设电压阈值时,其他的硬盘插槽也同样会出现此类压降的问题,从而无需通过电压监控芯片监控每一个硬盘插槽的电压,能够迅速发现问题和节省了发现问题的时间。In addition, the voltage monitoring chip is a chip set next to the hard disk slot farthest from the resettable fuse. Since the distance from the resettable fuse is the farthest, the voltage distributed or generated by the hard disk slot farthest The current will be relatively small, that is, the operating voltage of the mechanical hard disk on the hard disk slot farthest from the recoverable fuse may be lower than the preset voltage threshold set by the voltage monitoring chip, and the voltage drop generated by the hard disk slot will be The biggest, therefore, when the voltage monitoring chip detects that the operating voltage of the mechanical hard disk is lower than the preset voltage threshold set by the voltage monitoring chip, other hard disk slots will also have this kind of voltage drop problem, so there is no need to pass the voltage The monitoring chip monitors the voltage of each hard disk slot, which can quickly find problems and save the time of finding problems.
参照图3,示出了本发明实施例中提供的一种电源监控方法的步骤流程图,具体可以包括如下步骤:Referring to FIG. 3 , it shows a flow chart of the steps of a power monitoring method provided in an embodiment of the present invention, which may specifically include the following steps:
步骤301,通过电压监控芯片监控硬盘插槽的电压;
对于电压监控芯片,其可以为ADC(Analog-to-digital converter模拟数字转换器),其可以用于监控硬盘插槽的电压。For the voltage monitoring chip, it can be an ADC (Analog-to-digital converter), which can be used to monitor the voltage of the hard disk slot.
其中,硬盘插槽上可以插入机械硬盘,对于机械硬盘,其为一种插放于背板上的硬盘插槽上的硬盘,可以用于和磁盘阵列控制器进行数据的交互。Wherein, a mechanical hard disk can be inserted into the hard disk slot, and the mechanical hard disk is a hard disk inserted into the hard disk slot on the backplane, which can be used for data interaction with the disk array controller.
对于硬盘插槽的电压,其可以理解为硬盘插槽上的机械硬盘在正常工作时的电压,例如,在通常的情况下,机械硬盘的电压可以为12V或偏高于12V(如12.1V或12.2V),可以理解的是,对于机械硬盘正常工作的电压取值,本领域技术人员可以根据实际情况进行调整,本发明实施例对此不作限制。As for the voltage of the hard disk slot, it can be understood as the voltage of the mechanical hard disk on the hard disk slot when it is working normally. For example, under normal circumstances, the voltage of the mechanical hard disk can be 12V or higher than 12V (such as 12.1V or 12.2V), it can be understood that those skilled in the art can adjust the voltage value for the normal operation of the mechanical hard disk according to the actual situation, which is not limited in the embodiment of the present invention.
在具体实现中,可以通过电压监控芯片监控硬盘插槽的电压。In a specific implementation, the voltage of the hard disk slot can be monitored through a voltage monitoring chip.
步骤302,通过基板管理控制器判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;
对于交互进程,其为硬盘插槽的硬盘和磁盘阵列控制器的数据交互的过程。As for the interactive process, it is a process of data interaction between the hard disk in the hard disk slot and the disk array controller.
在具体实现中,可以通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。In a specific implementation, the baseboard management controller may determine whether to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller.
步骤303,通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令。In
对于中央处理器,其可以为CPU,其作为计算机系统的运算和控制核心,是信息处理、程序运行的最终执行单元。其中,平台控制器(PCH)可以挂载一些低速设备和部分高速设备,通过DMI总线和CPU互联,控制上电和下电的时序、系统管理等功能,其还可以用于将高级别中断方式上报给CPU中运行的OS。As for the central processing unit, it may be a CPU, which is the computing and control core of the computer system and the final execution unit for information processing and program operation. Among them, the platform controller (PCH) can mount some low-speed devices and some high-speed devices, and interconnect with the CPU through the DMI bus to control the timing of power-on and power-off, system management and other functions. Report to the OS running on the CPU.
CPU上运行有计算机管理控制程序,其中,计算机管理控制程序也可以理解为一种操作系统,其可以为OS,OS可以处理一些高级别的事件,示例性地,平台控制器可以将事件以高级别中断方式上报给CPU上运行的OS,OS可以对该事件进行处理。需要说明的是,可能因为距离背板上电源连接器最远的硬盘槽位的电压正在或者即将跌落出机械硬盘正常运行的电压范围外,因此其他一些硬盘槽位的电压也会出现同样的问题,因此该事件是最高级别的事件,需要OS优先处理。A computer management control program runs on the CPU, wherein the computer management control program can also be understood as an operating system, which can be an OS, and the OS can handle some high-level events. For example, the platform controller can process events in high-level The level interrupt is reported to the OS running on the CPU, and the OS can process the event. It should be noted that maybe because the voltage of the hard disk slot farthest from the power connector on the back panel is or is about to drop out of the normal operating voltage range of the mechanical hard disk, the voltage of some other hard disk slots will also have the same problem. , so this event is the highest-level event and requires priority processing by the OS.
在本发明实施例中,通过电压监控芯片监控硬盘插槽的电压,通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令。在本发明实施例中,通过电压监控芯片监控硬盘插槽的电压,并通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,以及通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令,可以快速中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,有效地降低因数据交互过程中出现压降而导致存储系统掉盘故障率增大的问题,有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉。In the embodiment of the present invention, the voltage of the hard disk slot is monitored by the voltage monitoring chip, the baseboard management controller judges whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller, and the interrupt hard disk slot is issued by the central processing unit. Instructions for the interaction process of the hard disk and disk array controller. In the embodiment of the present invention, the voltage of the hard disk slot is monitored by the voltage monitoring chip, and the baseboard management controller judges whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller, and the interrupt hard disk is issued by the central processing unit. The instruction of the interactive process between the hard disk in the slot and the disk array controller can quickly interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller, effectively reducing the failure of the storage system due to the voltage drop during the data interaction process. The problem of increased data rate can be effectively protected, and the reliability of the whole server can be further improved. At the same time, the architecture mentioned in the embodiment of the present invention is simple, easy to implement, high in realizability, and low in cost.
在一种可选实施例中,在所述通过电压监控芯片监控硬盘插槽的电压步骤之后,所述方法还包括:In an optional embodiment, after the step of monitoring the voltage of the hard disk slot through the voltage monitoring chip, the method further includes:
当所述电压监控芯片监控所述硬盘插槽的电压低于预设电压阈值时,通过所述电压监控芯片向所述基板管理控制器发送中断信号。When the voltage monitoring chip monitors that the voltage of the hard disk slot is lower than a preset voltage threshold, an interrupt signal is sent to the baseboard management controller through the voltage monitoring chip.
对于预设电压阈值,其可以理解为是电压监控芯片设置的预设电压阈值,在通常的情况下,在电压监控芯片设置预设电压阈值时,通常会设置得略高于机械硬盘在正常工作时的电压,示例性地,可以设置为略高于P12V和P5V。需要说明的是,电压监控芯片设置预设电压阈值通常会设置得略高于机械硬盘在正常工作时的电压的原因主要为:当电压监控芯片侦测到的机械硬盘工作时的电压低于预设电压阈值后,距离保险丝的最远的硬盘插槽或槽位的P12V和P5V的电压暂时还在会机械硬盘正常工作的电压范围内,但是在接下来的很快时间内,距离保险丝的最远的硬盘插槽的电压会跌出机械硬盘正常工作的电压范围外,而电压监控芯片设置的电压阈值略高于机械硬盘正常工作的电压后,当机械硬盘和磁盘阵列控制器之间有大数据的快速交互时,电压迅速跌落,在这段时间内,需要基板管理控制器和平台控制器的固件做一些操作来进行数据完整性的保护。As for the preset voltage threshold, it can be understood as the preset voltage threshold set by the voltage monitoring chip. Under normal circumstances, when the voltage monitoring chip sets the preset voltage threshold, it is usually set slightly higher than the mechanical hard disk in normal operation. The voltage at this time, for example, can be set to be slightly higher than P12V and P5V. It should be noted that the preset voltage threshold set by the voltage monitoring chip is usually set slightly higher than the voltage of the mechanical hard disk during normal operation. After setting the voltage threshold, the voltages of P12V and P5V in the hard disk slot or slot farthest from the fuse are temporarily still within the voltage range for the mechanical hard disk to work normally. The voltage of the remote hard disk slot will drop out of the normal working voltage range of the mechanical hard disk, and the voltage threshold set by the voltage monitoring chip is slightly higher than the normal working voltage of the mechanical hard disk. During the fast interaction of data, the voltage drops rapidly. During this period, the firmware of the baseboard management controller and the platform controller need to do some operations to protect the data integrity.
在具体实现中,当磁盘阵列控制器和机械硬盘之间有数据突发或发生大数据交互时,如上所述肯定会有压降的情况发生,即会产生机械硬盘的电压迅速下降的情况,但是因为电压监控芯片的预设电压阈值稍微高点,所以在机械硬盘的电压还没跌到正常工作电压之下以前,电压监控芯片就发出了中断信号给到基板管理控制器,然后基板管理控制器和平台控制器结合磁盘阵列控制器对数据做一定的保护。In the specific implementation, when there is a data burst or large data interaction between the disk array controller and the mechanical hard disk, there will definitely be a voltage drop as described above, that is, the voltage of the mechanical hard disk will drop rapidly. However, because the preset voltage threshold of the voltage monitoring chip is slightly higher, before the voltage of the mechanical hard disk drops below the normal working voltage, the voltage monitoring chip sends an interrupt signal to the baseboard management controller, and then the baseboard management control The controller and the platform controller combine with the disk array controller to protect the data to a certain extent.
在一种示例中,假设机械硬盘正常工作的最低电压是11.4V,低于11.4V时数据会丢失,然后机械硬盘正常工作时的电压是12V或者略高于12V(比如12.2V)等,假设将电压监控芯片(ADC)的预设电压阈值设置为11.6V,当机械硬盘和Raid控制器之间有大数据的快速交互时,机械硬盘的电压会从12V或者12.2V往下跌,跌到11.6V时ADC会发出中断信号给到BMC,接下来机械硬盘的电压从11.6V继续往下跌,但还没跌到11.4V时,数据已经被保护起来了。In one example, assume that the minimum voltage for the normal operation of the mechanical hard disk is 11.4V, and data will be lost when it is lower than 11.4V, and then the normal working voltage of the mechanical hard disk is 12V or slightly higher than 12V (such as 12.2V), etc., assuming Set the preset voltage threshold of the voltage monitoring chip (ADC) to 11.6V. When there is fast interaction of large data between the mechanical hard disk and the Raid controller, the voltage of the mechanical hard disk will drop from 12V or 12.2V to 11.6V At V, the ADC will send an interrupt signal to the BMC, and then the voltage of the mechanical hard disk will continue to drop from 11.6V, but before it drops to 11.4V, the data has been protected.
其中,背板上的电源连接器和背板之间走电(通电)的铜皮的阻抗是固定的,当突然有大电流通过时,会产生压降,对于压降,其为一种电压迅速下降的情况,当有Burst(突发)大量数据突发读写时机械硬盘需要很大的电流,因而会产生压降。需要说明的是,正常数据交互或者非突发数据时,通常不会有压降的情况发生。Among them, the impedance of the copper skin between the power connector on the backplane and the backplane is fixed. When a sudden large current passes through, a voltage drop will occur. For the voltage drop, it is a voltage In the case of rapid decline, when there is a burst of burst (burst) large amounts of data read and write, the mechanical hard disk needs a lot of current, which will cause a voltage drop. It should be noted that during normal data exchange or non-burst data, there is usually no voltage drop.
需要说明的是,电压监控芯片的设置通常是在距离保险丝最远的硬盘插槽的旁边设置的一个芯片,由于与保险丝的距离最远,因此该距离最远的硬盘插槽分到的电压或产生的电流就会比较小,因此距离保险丝最远的硬盘插槽上机械硬盘的工作电压可能会出现低于电压监控芯片设置的预设电压阈值,因此该硬盘插槽所产生的压降会是最大的,因此,当电压监控芯片检测到机械硬盘的工作电压出现低于电压监控芯片设置的预设电压阈值时,其他的硬盘插槽也同样会出现此类压降的问题。It should be noted that the voltage monitoring chip is usually set next to the hard disk slot farthest from the fuse. Since the distance from the fuse is the farthest, the voltage distributed by the hard disk slot farthest or The generated current will be relatively small, so the operating voltage of the mechanical hard disk on the hard disk slot farthest from the fuse may be lower than the preset voltage threshold set by the voltage monitoring chip, so the voltage drop generated by the hard disk slot will be Most importantly, when the voltage monitoring chip detects that the operating voltage of the mechanical hard disk is lower than the preset voltage threshold set by the voltage monitoring chip, other hard disk slots will also have such voltage drop problems.
其中,对于中断信号,其可以为电压监控芯片检测到硬盘插槽上的机械硬盘的工作电压出现低于电压监控芯片设置的预设电压阈值的时候,电压监控芯片向基板管理控制器发送的中断机械硬盘和磁盘阵列控制器的数据交互的信号。Wherein, for the interrupt signal, it may be an interrupt sent by the voltage monitoring chip to the baseboard management controller when the voltage monitoring chip detects that the operating voltage of the mechanical hard disk on the hard disk slot is lower than the preset voltage threshold set by the voltage monitoring chip. The data exchange signal between the mechanical hard disk and the disk array controller.
可选地,所述基板管理控制器设置有针对所述基板管理控制器的固件,所述方法还包括:Optionally, the baseboard management controller is provided with firmware for the baseboard management controller, and the method further includes:
当所述基板管理控制器运行所述基板管理控制器的固件时,通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号。When the baseboard management controller runs the firmware of the baseboard management controller, the firmware monitors whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller.
其中,基板管理控制器的固件指的是基板管理控制器要运行起来的驱动、操作系统和应用层程序的一个包,基板管理控制器是一个嵌入式操作系统,主要作用是让基板管理控制器运行起来进行整系统管理的功能,比如风扇散热等;在本发明实施例中,基板管理控制器的固件用于监听电压监控芯片是否向基板管理控制器发送中断信号。Among them, the firmware of the baseboard management controller refers to a package of drivers, operating systems and application layer programs for the baseboard management controller to run. The baseboard management controller is an embedded operating system whose main function is to enable the baseboard management controller to It runs to manage the whole system, such as fan cooling; in the embodiment of the present invention, the firmware of the baseboard management controller is used to monitor whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller.
在具体实现中,当基板管理控制器运行基板管理控制器的固件时,通过固件监听电压监控芯片是否向基板管理控制器发送中断信号。In a specific implementation, when the baseboard management controller runs the firmware of the baseboard management controller, the firmware monitors whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller.
在本发明实施例中,当电压监控芯片监控硬盘插槽的电压低于预设电压阈值时,通过电压监控芯片向基板管理控制器发送中断信号。In the embodiment of the present invention, when the voltage of the hard disk slot monitored by the voltage monitoring chip is lower than a preset voltage threshold, an interrupt signal is sent to the baseboard management controller through the voltage monitoring chip.
可选地,所述通过基板管理控制器判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程步骤包括:Optionally, the step of judging whether to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller through the baseboard management controller includes:
向所述电压监控芯片发送电压请求以获取所述电压监控芯片监控的所述硬盘插槽的电压,并将所述基板管理控制器获取到的所述硬盘插槽的电压作为校验电压;Sending a voltage request to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, and using the voltage of the hard disk slot obtained by the baseboard management controller as a verification voltage;
根据所述校验电压判断所述电压监控芯片发送的所述中断信号是否为误报以判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;judging whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage, so as to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller;
当所述中断信号为非误报时,确定中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the interrupt signal is not a false positive, it is determined to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller.
其中,电压请求为基板管理控制器向电压监控芯片发送的请求,该请求为用于获取电压监控芯片所监控的硬盘插槽的电压,即硬盘插槽中硬盘的电压,进一步地,将基板管理控制器获取到的硬盘插槽的电压作为校验电压;其中,校验电压用于判断电压监控芯片发送的中断信号是否为误报以判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。Wherein, the voltage request is a request sent by the baseboard management controller to the voltage monitoring chip. The request is used to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, that is, the voltage of the hard disk in the hard disk slot. Further, the baseboard management The voltage of the hard disk slot obtained by the controller is used as the verification voltage; wherein, the verification voltage is used to judge whether the interrupt signal sent by the voltage monitoring chip is a false positive to judge whether to interrupt the interaction between the hard disk in the hard disk slot and the disk array controller process.
可选地,若所述校验电压小于预设电压阈值,则确定所述中断信号为非误报。Optionally, if the verification voltage is less than a preset voltage threshold, it is determined that the interruption signal is not a false alarm.
对于预设电压阈值,其可以理解为是电压监控芯片设置的预设电压阈值,在通常的情况下,在电压监控芯片设置预设电压阈值时,通常会设置得略高于机械硬盘在正常工作时的电压,示例性地,可以设置为略高于P12V和P5V。需要说明的是,电压监控芯片设置预设电压阈值通常会设置得略高于机械硬盘在正常工作时的电压的原因主要为:当电压监控芯片侦测到的机械硬盘工作时的电压低于预设电压阈值后,距离保险丝的最远的硬盘插槽或槽位的P12V和P5V的电压暂时还在会机械硬盘正常工作的电压范围内,但是在接下来的很快时间内,距离保险丝的最远的硬盘插槽的电压会跌出机械硬盘正常工作的电压范围外,而电压监控芯片设置的电压阈值略高于机械硬盘正常工作的电压后,当机械硬盘和磁盘阵列控制器之间有大数据的快速交互时,电压迅速跌落,在这段时间内,需要基板管理控制器和平台控制器的固件做一些操作来进行数据完整性的保护。As for the preset voltage threshold, it can be understood as the preset voltage threshold set by the voltage monitoring chip. Under normal circumstances, when the voltage monitoring chip sets the preset voltage threshold, it is usually set slightly higher than the mechanical hard disk in normal operation. The voltage at this time, for example, can be set to be slightly higher than P12V and P5V. It should be noted that the preset voltage threshold set by the voltage monitoring chip is usually set slightly higher than the voltage of the mechanical hard disk during normal operation. After setting the voltage threshold, the voltages of P12V and P5V in the hard disk slot or slot farthest from the fuse are temporarily still within the voltage range for the mechanical hard disk to work normally. The voltage of the remote hard disk slot will drop out of the normal working voltage range of the mechanical hard disk, and the voltage threshold set by the voltage monitoring chip is slightly higher than the normal working voltage of the mechanical hard disk. During the fast interaction of data, the voltage drops rapidly. During this period, the firmware of the baseboard management controller and the platform controller need to do some operations to protect the data integrity.
在具体实现中,当磁盘阵列控制器和机械硬盘之间有数据突发或发生大数据交互时,如上所述肯定会有压降的情况发生,即会产生机械硬盘的电压迅速下降的情况,但是因为电压监控芯片的预设电压阈值稍微高点,所以在机械硬盘的电压还没跌到正常工作电压之下以前,电压监控芯片就发出了中断信号给到基板管理控制器,然后基板管理控制器和平台控制器结合磁盘阵列控制器对数据做一定的保护。In the specific implementation, when there is a data burst or large data interaction between the disk array controller and the mechanical hard disk, there will definitely be a voltage drop as described above, that is, the voltage of the mechanical hard disk will drop rapidly. However, because the preset voltage threshold of the voltage monitoring chip is slightly higher, before the voltage of the mechanical hard disk drops below the normal working voltage, the voltage monitoring chip sends an interrupt signal to the baseboard management controller, and then the baseboard management control The controller and the platform controller combine with the disk array controller to protect the data to a certain extent.
在一种示例中,假设机械硬盘正常工作的最低电压是11.4V,低于11.4V时数据会丢失,然后机械硬盘正常工作时的电压是12V或者略高于12V(比如12.2V)等,假设将电压监控芯片(ADC)的预设电压阈值设置为11.6V,当机械硬盘和Raid控制器之间有大数据的快速交互时,机械硬盘的电压会从12V或者12.2V往下跌,跌到11.6V时ADC会发出中断信号给到BMC,接下来机械硬盘的电压从11.6V继续往下跌,但还没跌到11.4V时,数据已经被保护起来了。In one example, assume that the minimum voltage for the normal operation of the mechanical hard disk is 11.4V, and data will be lost when it is lower than 11.4V, and then the normal working voltage of the mechanical hard disk is 12V or slightly higher than 12V (such as 12.2V), etc., assuming Set the preset voltage threshold of the voltage monitoring chip (ADC) to 11.6V. When there is fast interaction of large data between the mechanical hard disk and the Raid controller, the voltage of the mechanical hard disk will drop from 12V or 12.2V to 11.6V At V, the ADC will send an interrupt signal to the BMC, and then the voltage of the mechanical hard disk will continue to drop from 11.6V, but before it drops to 11.4V, the data has been protected.
其中,背板上的电源连接器和背板之间走电(通电)的铜皮的阻抗是固定的,当突然有大电流通过时,会产生压降,对于压降,其为一种电压迅速下降的情况,当有Burst(突发)大量数据突发读写时机械硬盘需要很大的电流,因而会产生压降。需要说明的是,正常数据交互或者非突发数据时,通常不会有压降的情况发生。Among them, the impedance of the copper skin between the power connector on the backplane and the backplane is fixed. When a sudden large current passes through, a voltage drop will occur. For the voltage drop, it is a voltage In the case of rapid decline, when there is a burst of burst (burst) large amounts of data read and write, the mechanical hard disk needs a lot of current, which will cause a voltage drop. It should be noted that during normal data exchange or non-burst data, there is usually no voltage drop.
需要说明的是,电压监控芯片的设置通常是在距离保险丝最远的硬盘插槽的旁边设置的一个芯片,由于与保险丝的距离最远,因此该距离最远的硬盘插槽分到的电压或产生的电流就会比较小,因此距离保险丝最远的硬盘插槽上机械硬盘的工作电压可能会出现低于电压监控芯片设置的预设电压阈值,因此该硬盘插槽所产生的压降会是最大的,因此,当电压监控芯片检测到机械硬盘的工作电压出现低于电压监控芯片设置的预设电压阈值时,其他的硬盘插槽也同样会出现此类压降的问题。It should be noted that the voltage monitoring chip is usually set next to the hard disk slot farthest from the fuse. Since the distance from the fuse is the farthest, the voltage distributed by the hard disk slot farthest or The generated current will be relatively small, so the operating voltage of the mechanical hard disk on the hard disk slot farthest from the fuse may be lower than the preset voltage threshold set by the voltage monitoring chip, so the voltage drop generated by the hard disk slot will be Most importantly, when the voltage monitoring chip detects that the operating voltage of the mechanical hard disk is lower than the preset voltage threshold set by the voltage monitoring chip, other hard disk slots will also have such voltage drop problems.
可选地,在所述当所述中断信号为非误报时,确定中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程步骤之后,所述方法还包括:Optionally, after the step of determining to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller when the interrupt signal is not a false positive, the method further includes:
通过所述基板管理控制器向平台控制器发送所述中断信号对应的中断方式;其中,所述中断方式为中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的方式;The baseboard management controller sends an interrupt mode corresponding to the interrupt signal to the platform controller; wherein the interrupt mode is a mode of interrupting the interactive process between the hard disk in the hard disk slot and the disk array controller;
当所述平台控制器接收到所述中断方式时,通过所述平台控制器向中央处理器上报所述中断方式。When the platform controller receives the interrupt mode, the platform controller reports the interrupt mode to the central processing unit.
对于中断方式,其为中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的方式;中断方式的级别可以为高级别的中断方式,高级别的中断方式可以理解为将中断的事件设置为优先级别最高的,这样就可以对该事件进行优先执行。For the interrupt mode, it is a way to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller; the level of the interrupt mode can be a high-level interrupt mode, and the high-level interrupt mode can be understood as setting the interrupted event as a priority The highest level, so that the event can be executed first.
可选地,在所述基板管理控制器的固件中设置针对所述中断信号的优先级;当所述基板管理控制器接收到所述中断信号时,根据所述优先级对所述中断信号进行处理。Optionally, a priority for the interrupt signal is set in firmware of the baseboard management controller; when the baseboard management controller receives the interrupt signal, the interrupt signal is processed according to the priority deal with.
对于优先级,其可以为中断方式的最高级别,高级别的中断方式可以理解为将中断的事件设置为优先级别最高的,这样就可以对该事件进行优先执行;示例性地,假设有一个进程需要被执行时,这个进程需要进行排队,即按照一定的算法(比如等待的时间长短)去安排哪个进程先被送入中央处理器的内核中去执行,但是当某一个进程被设置了优先级比较高时,比如本发明实施例中提及到的需要中断的事件,则会被优先执行,不需要排队执行。As for the priority, it can be the highest level of the interrupt method, and the high-level interrupt method can be understood as setting the interrupt event as the highest priority, so that the event can be executed preferentially; for example, suppose there is a process When it needs to be executed, this process needs to be queued, that is, to arrange which process is sent to the core of the central processing unit for execution first according to a certain algorithm (such as the length of waiting time), but when a certain process is set with a priority When it is relatively high, such as the events that need to be interrupted mentioned in the embodiment of the present invention, they will be executed with priority and do not need to be queued for execution.
在具体实现中,在基板管理控制器的固件中设置针对中断信号的优先级,当基板管理控制器接收到中断信号时,根据优先级对中断信号进行处理。In a specific implementation, the priority for the interrupt signal is set in the firmware of the baseboard management controller, and when the baseboard management controller receives the interrupt signal, it processes the interrupt signal according to the priority.
在一种示例中,当基板管理控制器接收到电压监控芯片发送过来的中断信号时,基板管理控制器会连续向电压监控芯片发送电压请求以获取电压监控芯片监控的硬盘插槽的电压,即基板管理控制器会进行连续采样,假设连续采样若干次(比如20次)采样到的硬盘插槽的电压都是低于电压监控芯片设置的电压阈值时,则中断信号为非误报,可以确定中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,接着可以通过基板管理控制器向平台控制器发送中断信号对应的中断方式,当平台控制器接收到中断方式时,通过平台控制器向中央处理器上报中断方式;如果基板管理控制器是采样到1次电压是低于电压监控芯片设置的电压阈值,其他的19次不是低于电压监控芯片设置的电压阈值,则属于误报,则基板管理控制器不需要向平台控制器发送中断信号对应的中断方式,即不需要中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。对于判断中断信号是否为误报的方法,可以理解为存在一种滤波机制或采样机制。本领域技术人员可以对采样的次数进行调整,本发明实施例对此不作限制。In one example, when the baseboard management controller receives the interrupt signal sent by the voltage monitoring chip, the baseboard management controller will continuously send a voltage request to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, namely The baseboard management controller will perform continuous sampling. Assuming that the voltage of the hard disk slots sampled several times (for example, 20 times) is lower than the voltage threshold set by the voltage monitoring chip, the interrupt signal is not a false alarm, and it can be determined that Interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller, and then send the corresponding interrupt mode to the platform controller through the baseboard management controller. The processor reports the interrupt mode; if the baseboard management controller samples 1 voltage that is lower than the voltage threshold set by the voltage monitoring chip, and the other 19 times are not lower than the voltage threshold set by the voltage monitoring chip, it is a false alarm. The management controller does not need to send an interrupt mode corresponding to the interrupt signal to the platform controller, that is, the interactive process between the hard disk in the hard disk slot and the disk array controller does not need to be interrupted. Regarding the method for judging whether the interrupt signal is a false alarm, it can be understood that there is a filtering mechanism or a sampling mechanism. Those skilled in the art may adjust the sampling times, which is not limited in this embodiment of the present invention.
可选地,若中断信号为误报,则重新执行上述通过固件监听电压监控芯片是否向基板管理控制器发送中断信号的步骤。Optionally, if the interrupt signal is a false positive, re-execute the above-mentioned step of monitoring whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller through the firmware.
可选地,所述通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤,包括:Optionally, the step of sending instructions to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller through the central processing unit includes:
当所述中央处理器接收到所述中断方式时,通过所述中央处理器向磁盘阵列控制器下发所述中断方式对应的中断指令以使所述磁盘阵列控制器中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller so that the disk array controller interrupts the operation of the hard disk slot. Interaction process between hard disk and disk array controller.
其中,对于中断指令,其为中央处理器上运行的OS(操作系统)向磁盘阵列控制器下发的中断指令,该指令要求磁盘阵列控制器立即停止和背板上所有机械硬盘的数据交互。其中,数据交互主要可以理解为主板上的磁盘阵列控制器和背板上的所有机械硬盘之间的数据交互。Among them, the interrupt command is an interrupt command issued by the OS (operating system) running on the central processing unit to the disk array controller, which requires the disk array controller to immediately stop data interaction with all mechanical hard disks on the backplane. Among them, the data interaction can be mainly understood as the data interaction between the disk array controller on the motherboard and all the mechanical hard disks on the backplane.
在具体实现中,当中央处理器接收到中断方式时,通过中央处理器向磁盘阵列控制器下发中断方式对应的中断指令以使磁盘阵列控制器中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。In a specific implementation, when the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller to make the disk array controller interrupt the hard disk in the hard disk slot and the disk array controller. interactive process.
在一种可选实施例中,所述中央处理器上设置有计算机管理控制程序,所述计算机管理控制程序用于向所述磁盘阵列控制器下发所述中断方式对应的中断指令。In an optional embodiment, the central processing unit is provided with a computer management control program, and the computer management control program is configured to issue an interrupt command corresponding to the interrupt mode to the disk array controller.
在具体实现中,中央处理器上设置有计算机管理控制程序,计算机管理控制程序也可以为一种操作系统OS,当中央处理器接收到平台控制器上报的中断方式时,通过中央处理器的计算机管理控制程序OS向磁盘阵列控制器下发中断指令以使磁盘阵列控制器停止与机械硬盘之间的数据交互。In a specific implementation, the central processing unit is provided with a computer management control program, which can also be an operating system OS. When the central processing unit receives the interrupt mode reported by the platform controller, the computer of the central processing unit The management control program OS sends an interrupt command to the disk array controller to stop the data interaction between the disk array controller and the mechanical hard disk.
在本发明实施例中,当电压监控芯片监控硬盘插槽的电压低于预设电压阈值时,通过电压监控芯片向基板管理控制器发送中断信号,当基板管理控制器接收到电压监控芯片发送过来的中断信号时,基板管理控制器会连续向电压监控芯片发送电压请求以获取电压监控芯片监控的硬盘插槽的电压,并将基板管理控制器获取到的硬盘插槽的电压作为校验电压,进而根据校验电压判断电压监控芯片发送的中断信号是否为误报以判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,当中断信号为非误报时,确定中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,并且,通过基板管理控制器向平台控制器发送中断信号对应的中断方式,当平台控制器接收到中断方式时,通过平台控制器向中央处理器上报中断方式以中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,当中央处理器接收到中断方式时,通过中央处理器向磁盘阵列控制器下发中断方式对应的中断指令以使磁盘阵列控制器中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。在本发明实施例中,通过电压监控芯片监控硬盘插槽的电压,并通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,以及通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令,可以快速中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,有效地降低因数据交互过程中出现压降而导致存储系统掉盘故障率增大的问题,有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉。In the embodiment of the present invention, when the voltage of the hard disk slot monitored by the voltage monitoring chip is lower than the preset voltage threshold, the voltage monitoring chip sends an interrupt signal to the baseboard management controller. When the interrupt signal is interrupted, the baseboard management controller will continuously send voltage requests to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, and use the voltage of the hard disk slot obtained by the baseboard management controller as the verification voltage. Then judge whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage to judge whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller. When the interrupt signal is not a false alarm, determine whether to interrupt the hard disk in the hard disk slot The interaction process with the disk array controller, and the interrupt mode corresponding to the interrupt signal sent by the baseboard management controller to the platform controller. When the platform controller receives the interrupt mode, the platform controller reports the interrupt mode to the central processing unit for Interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller. When the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller to make the disk array controller interrupt the hard disk. Slots of hard disks and disk array controllers interact with each other. In the embodiment of the present invention, the voltage of the hard disk slot is monitored by the voltage monitoring chip, and the baseboard management controller judges whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller, and the interrupt hard disk is issued by the central processing unit. The instruction of the interactive process between the hard disk in the slot and the disk array controller can quickly interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller, effectively reducing the failure of the storage system due to the voltage drop during the data interaction process. The problem of increased data rate can be effectively protected, and the reliability of the whole server can be further improved. At the same time, the architecture mentioned in the embodiment of the present invention is simple, easy to implement, high in realizability, and low in cost.
在一种可选实施例中,所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程为所述磁盘阵列控制器与所述硬盘插槽中的硬盘进行数据交互,在所述通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤之后,所述方法还包括:In an optional embodiment, the interaction process between the hard disk in the hard disk slot and the disk array controller is data interaction between the disk array controller and the hard disk in the hard disk slot. After the controller issues an instruction step of interrupting the interaction process between the hard disk in the hard disk slot and the disk array controller, the method also includes:
将所述磁盘阵列控制器与所述硬盘插槽中硬盘之间停止交互的数据存储于所述磁盘阵列控制器。The data of the stop interaction between the disk array controller and the hard disk in the hard disk slot is stored in the disk array controller.
在具体实现中,硬盘插槽的硬盘和磁盘阵列控制器的交互进程为磁盘阵列控制器与硬盘插槽中的硬盘进行数据交互,在通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤之后,将磁盘阵列控制器与硬盘插槽中硬盘之间停止交互的数据存储于磁盘阵列控制器,有效地保护用户数据,从而避免了压降过大而导致掉盘或数据出现破坏的情况,能够进一步地提高服务器的整机的可靠性。In the specific implementation, the interactive process between the hard disk in the hard disk slot and the disk array controller is that the disk array controller performs data interaction with the hard disk in the hard disk slot, and the hard disk in the hard disk slot and the disk array are interrupted by the central processing unit. After the instruction step of the interaction process of the controller, the data of the stop interaction between the disk array controller and the hard disk in the hard disk slot is stored in the disk array controller, which effectively protects user data, thus avoiding the loss caused by excessive voltage drop. If the disk or data is damaged, the reliability of the whole server can be further improved.
在本发明实施例中,当电压监控芯片监控硬盘插槽的电压低于预设电压阈值时,通过电压监控芯片向基板管理控制器发送中断信号,当基板管理控制器接收到电压监控芯片发送过来的中断信号时,基板管理控制器会连续向电压监控芯片发送电压请求以获取电压监控芯片监控的硬盘插槽的电压,并将基板管理控制器获取到的硬盘插槽的电压作为校验电压,进而根据校验电压判断电压监控芯片发送的中断信号是否为误报以判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,当中断信号为非误报时,确定中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,并且,通过基板管理控制器向平台控制器发送中断信号对应的中断方式,当平台控制器接收到中断方式时,通过平台控制器向中央处理器上报中断方式以中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,当中央处理器接收到中断方式时,通过中央处理器向磁盘阵列控制器下发中断方式对应的中断指令以使磁盘阵列控制器中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。在本发明实施例中,通过电压监控芯片监控硬盘插槽的电压,并通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,以及通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令,可以快速中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,有效地降低因数据交互过程中出现压降而导致存储系统掉盘故障率增大的问题,有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉,同时,在通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤之后,将磁盘阵列控制器与硬盘插槽中硬盘之间停止交互的数据存储于磁盘阵列控制器,有效地保护用户数据,从而避免了压降过大而导致掉盘或数据出现破坏的情况,能够进一步地提高服务器的整机的可靠性。In the embodiment of the present invention, when the voltage of the hard disk slot monitored by the voltage monitoring chip is lower than the preset voltage threshold, the voltage monitoring chip sends an interrupt signal to the baseboard management controller. When the interrupt signal is interrupted, the baseboard management controller will continuously send voltage requests to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, and use the voltage of the hard disk slot obtained by the baseboard management controller as the verification voltage. Then judge whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage to judge whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller. When the interrupt signal is not a false alarm, determine whether to interrupt the hard disk in the hard disk slot The interaction process with the disk array controller, and the interrupt mode corresponding to the interrupt signal sent by the baseboard management controller to the platform controller. When the platform controller receives the interrupt mode, the platform controller reports the interrupt mode to the central processing unit for Interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller. When the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller to make the disk array controller interrupt the hard disk. Slots of hard disks and disk array controllers interact with each other. In the embodiment of the present invention, the voltage of the hard disk slot is monitored by the voltage monitoring chip, and the baseboard management controller judges whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller, and the interrupt hard disk is issued by the central processing unit. The instruction of the interactive process between the hard disk in the slot and the disk array controller can quickly interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller, effectively reducing the failure of the storage system due to the voltage drop during the data interaction process. rate increase, effectively protect user data, and can further improve the reliability of the server as a whole. At the same time, the architecture mentioned in the embodiment of the present invention is simple, feasible, and low in cost. At the same time, through After the central processing unit issues an instruction step of interrupting the interactive process between the hard disk in the hard disk slot and the disk array controller, the data of the stop interaction between the disk array controller and the hard disk in the hard disk slot is stored in the disk array controller, effectively User data is protected, thereby avoiding disk loss or data damage caused by excessive voltage drop, and can further improve the reliability of the server as a whole.
为了使本领域技术人员更好地理解本发明实施例的技术方案,下面通过一个例子进行示例性说明:In order to enable those skilled in the art to better understand the technical solutions of the embodiments of the present invention, an example is used to illustrate the following:
参照图4,示出了本发明实施例中提供的一种电源监控方法的流程示意图;具体可以包括如下步骤:Referring to FIG. 4 , it shows a schematic flowchart of a power monitoring method provided in an embodiment of the present invention; specifically, the following steps may be included:
步骤一:在整系统上电后,基板管理控制器运行基板管理控制器对应的固件,基板管理控制器会监听背板上的电压监控芯片是否有发过来硬件的中断信号(init信号),同时在基板管理控制器的固件中,将此硬件的中断信号的优先级设置为最高,当基板管理控制器收到电压监控芯片发过来的中断信号后,立即执行相应的动作,不会进行操作系统的进程间的时间片轮转。Step 1: After the whole system is powered on, the baseboard management controller runs the firmware corresponding to the baseboard management controller. The baseboard management controller will monitor whether the voltage monitoring chip on the backplane has an interrupt signal (init signal) from the hardware, In the firmware of the baseboard management controller, the priority of the interrupt signal of this hardware is set to the highest. When the baseboard management controller receives the interrupt signal from the voltage monitoring chip, it will immediately execute the corresponding action without operating the system. Time slice rotation between processes.
步骤二:当基板管理控制器监听到电压监控芯片发出了中断信号后,基板管理控制器立即通过I2C链路去访问背板上的电压监控芯片,获取电压监控芯片监控到的电压值,并使用这个电压值进行判决电压监控芯片上报的中断信号是否为误报,如果是误报,则基板管理控制器忽略本次的中断信号,软件流程继续回到基板管理控制器侦听电压监控芯片是否发出有效的中断信号;当基板管理控制器判决监控到的机械硬盘的电压确实低于电压监控芯片的预设电压阈值,接下来基板管理控制器会将该事件通过IPMI链路以高级别中断的方式告知平台控制器。Step 2: When the baseboard management controller detects that the voltage monitoring chip sends an interrupt signal, the baseboard management controller immediately accesses the voltage monitoring chip on the backplane through the I2C link, obtains the voltage value monitored by the voltage monitoring chip, and uses This voltage value is used to determine whether the interrupt signal reported by the voltage monitoring chip is a false alarm. If it is a false alarm, the baseboard management controller ignores the interrupt signal this time, and the software process continues back to the baseboard management controller to listen to whether the voltage monitoring chip sends Effective interrupt signal; when the baseboard management controller judges that the monitored voltage of the mechanical hard disk is indeed lower than the preset voltage threshold of the voltage monitoring chip, then the baseboard management controller will use the IPMI link to interrupt the event at a high level Inform Platform Controller.
步骤三:当平台控制器接收到基板管理控制器的中断方式时,平台控制器将该事件以高级别的中断方式上报给中央处理器上运行的OS(操作系统),因为距离背板上电源连接器最远的Slotn槽位的电压正在或者即将跌落出机械硬盘正常运行的电压范围外,因此其他一些Slot(硬盘插槽)槽位的电压也会出现同样的问题,因此该事件是最高级别的事件,需要OS优先处理;OS接下来会向磁盘阵列控制器下发指令,该指令要求磁盘阵列控制器立即停止和背板上所有机械硬盘的数据交互,并且将正在交互的数据存储到磁盘阵列控制器上。Step 3: When the platform controller receives the interrupt mode of the baseboard management controller, the platform controller reports the event to the OS (operating system) running on the central processing unit in a high-level interrupt mode, because the distance from the power supply on the backplane The voltage of the Slotn slot farthest from the connector is or is about to fall out of the normal operating voltage range of the mechanical hard disk, so the voltage of some other Slot (hard disk slot) slots will also have the same problem, so this event is the highest level The event needs to be processed first by the OS; the OS will then issue an instruction to the disk array controller, which requires the disk array controller to immediately stop the data interaction with all the mechanical hard disks on the backplane, and store the interacting data to the disk array controller.
步骤四:磁盘阵列控制器执行步骤三中所述的操作。Step 4: The disk array controller executes the operations described in Step 3.
步骤五:结束。Step five: end.
在本发明实施例中,通过电压监控芯片监控硬盘插槽的电压,通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令,具体地,当电压监控芯片监控硬盘插槽的电压低于预设电压阈值时,通过电压监控芯片向基板管理控制器发送中断信号,当基板管理控制器接收到电压监控芯片发送过来的中断信号时,基板管理控制器会连续向电压监控芯片发送电压请求以获取电压监控芯片监控的硬盘插槽的电压,并将基板管理控制器获取到的硬盘插槽的电压作为校验电压,进而根据校验电压判断电压监控芯片发送的中断信号是否为误报以判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,当中断信号为非误报时,确定中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,并且,通过基板管理控制器向平台控制器发送中断信号对应的中断方式,当平台控制器接收到中断方式时,通过平台控制器向中央处理器上报中断方式以中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,当中央处理器接收到中断方式时,通过中央处理器向磁盘阵列控制器下发中断方式对应的中断指令以使磁盘阵列控制器中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程。在本发明实施例中,通过电压监控芯片监控硬盘插槽的电压,并通过基板管理控制器判断是否中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,以及通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令,可以快速中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程,有效地降低因数据交互过程中出现压降而导致存储系统掉盘故障率增大的问题,有效地保护用户数据,能够进一步地提高服务器的整机的可靠性,同时,本发明实施例提及的架构简单易行、可实现性高以及成本低廉。In the embodiment of the present invention, the voltage of the hard disk slot is monitored by the voltage monitoring chip, the baseboard management controller judges whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller, and the interrupt hard disk slot is issued by the central processing unit. Instructions for the interactive process of the hard disk and the disk array controller, specifically, when the voltage monitoring chip monitors the voltage of the hard disk slot is lower than the preset voltage threshold, the voltage monitoring chip sends an interrupt signal to the baseboard management controller, when the baseboard management When the controller receives the interrupt signal sent by the voltage monitoring chip, the baseboard management controller will continuously send voltage requests to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring The voltage of the slot is used as the verification voltage, and then according to the verification voltage, it is judged whether the interrupt signal sent by the voltage monitoring chip is a false alarm to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller. Time reporting, determine the interactive process of interrupting the hard disk in the hard disk slot and the disk array controller, and send the corresponding interrupt mode to the platform controller through the baseboard management controller, when the platform controller receives the interrupt mode, through the platform control The controller reports the interrupt mode to the central processing unit to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller. When the central processor receives the interrupt mode, the central processing unit sends the interrupt mode corresponding to the disk array controller Command to make the disk array controller interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller. In the embodiment of the present invention, the voltage of the hard disk slot is monitored by the voltage monitoring chip, and the baseboard management controller judges whether to interrupt the interactive process of the hard disk in the hard disk slot and the disk array controller, and the interrupt hard disk is issued by the central processing unit. The instruction of the interactive process between the hard disk in the slot and the disk array controller can quickly interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller, effectively reducing the failure of the storage system due to the voltage drop during the data interaction process. The problem of increased data rate can be effectively protected, and the reliability of the whole server can be further improved. At the same time, the architecture mentioned in the embodiment of the present invention is simple, easy to implement, high in realizability, and low in cost.
同时,在通过中央处理器下发中断硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令步骤之后,将磁盘阵列控制器与硬盘插槽中硬盘之间停止交互的数据存储于磁盘阵列控制器,有效地保护用户数据,从而避免了压降过大而导致掉盘或数据出现破坏的情况,能够进一步地提高服务器的整机的可靠性。At the same time, after the central processing unit issues an instruction step of interrupting the interactive process between the hard disk in the hard disk slot and the disk array controller, the data of stopping the interaction between the disk array controller and the hard disk in the hard disk slot is stored in the disk array controller. The server can effectively protect user data, thereby avoiding the situation of disk loss or data damage caused by excessive voltage drop, and can further improve the reliability of the server as a whole.
需要说明的是,对于方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明实施例并不受所描述的动作顺序的限制,因为依据本发明实施例,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作并不一定是本发明实施例所必须的。It should be noted that, for the method embodiment, for the sake of simple description, it is expressed as a series of action combinations, but those skilled in the art should know that the embodiment of the present invention is not limited by the described action sequence, because According to the embodiment of the present invention, certain steps may be performed in other orders or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification belong to preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.
参照图5,示出了本发明实施例中提供的一种电源监控装置的结构框图,具体可以包括如下模块:Referring to FIG. 5 , it shows a structural block diagram of a power monitoring device provided in an embodiment of the present invention, which may specifically include the following modules:
电压监控模块501,用于通过电压监控芯片监控硬盘插槽的电压;A voltage monitoring module 501, configured to monitor the voltage of the hard disk slot through a voltage monitoring chip;
进程判断模块502,用于通过基板管理控制器判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;A process judging module 502, configured to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller through the baseboard management controller;
指令下发模块503,用于通过中央处理器下发中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的指令。The instruction issuing module 503 is configured to issue an instruction to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller through the central processing unit.
在一种可选实施例中,所述装置还包括:In an optional embodiment, the device also includes:
信号发送模块,用于当所述电压监控芯片监控所述硬盘插槽的电压低于预设电压阈值时,通过所述电压监控芯片向所述基板管理控制器发送中断信号。A signal sending module, configured to send an interrupt signal to the baseboard management controller through the voltage monitoring chip when the voltage of the hard disk slot monitored by the voltage monitoring chip is lower than a preset voltage threshold.
在一种可选实施例中,所述进程判断模块502具体用于:In an optional embodiment, the process judging module 502 is specifically configured to:
向所述电压监控芯片发送电压请求以获取所述电压监控芯片监控的所述硬盘插槽的电压,并将所述基板管理控制器获取到的所述硬盘插槽的电压作为校验电压;Sending a voltage request to the voltage monitoring chip to obtain the voltage of the hard disk slot monitored by the voltage monitoring chip, and using the voltage of the hard disk slot obtained by the baseboard management controller as a verification voltage;
根据所述校验电压判断所述电压监控芯片发送的所述中断信号是否为误报以判断是否中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程;judging whether the interrupt signal sent by the voltage monitoring chip is a false alarm according to the verification voltage, so as to judge whether to interrupt the interactive process between the hard disk in the hard disk slot and the disk array controller;
当所述中断信号为非误报时,确定中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the interrupt signal is not a false positive, it is determined to interrupt the interaction process between the hard disk in the hard disk slot and the disk array controller.
在一种可选实施例中,所述装置还包括:In an optional embodiment, the device also includes:
非误报确定模块,用于若所述校验电压小于预设电压阈值,则确定所述中断信号为非误报。A non-false alarm determining module, configured to determine that the interruption signal is not a false alarm if the verification voltage is lower than a preset voltage threshold.
在一种可选实施例中,所述装置还包括:In an optional embodiment, the device also includes:
中断方式发送模块,用于通过所述基板管理控制器向平台控制器发送所述中断信号对应的中断方式;其中,所述中断方式为中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程的方式;An interrupt mode sending module, configured to send an interrupt mode corresponding to the interrupt signal to the platform controller through the baseboard management controller; wherein, the interrupt mode is to interrupt the interaction between the hard disk in the hard disk slot and the disk array controller the manner of the process;
中断方式上报模块,用于当所述平台控制器接收到所述中断方式时,通过所述平台控制器向中央处理器上报所述中断方式。The interrupt mode reporting module is configured to report the interrupt mode to the central processing unit through the platform controller when the platform controller receives the interrupt mode.
在一种可选实施例中,所述指令下发模块503具体用于:In an optional embodiment, the instruction issuing module 503 is specifically configured to:
当所述中央处理器接收到所述中断方式时,通过所述中央处理器向磁盘阵列控制器下发所述中断方式对应的中断指令以使所述磁盘阵列控制器中断所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程。When the central processing unit receives the interrupt mode, the central processing unit sends an interrupt command corresponding to the interrupt mode to the disk array controller so that the disk array controller interrupts the operation of the hard disk slot. Interaction process between hard disk and disk array controller.
在一种可选实施例中,所述基板管理控制器设置有针对所述基板管理控制器的固件,所述装置还包括:In an optional embodiment, the baseboard management controller is provided with firmware for the baseboard management controller, and the device further includes:
信号监听模块,用于当所述基板管理控制器运行所述基板管理控制器的固件时,通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号。A signal monitoring module, configured to monitor whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller through the firmware when the baseboard management controller is running the firmware of the baseboard management controller.
在一种可选实施例中,所述装置还包括:In an optional embodiment, the device also includes:
优先级设置模块,用于在所述基板管理控制器的固件中设置针对所述中断信号的优先级;a priority setting module, configured to set the priority for the interrupt signal in the firmware of the baseboard management controller;
信号处理模块,用于当所述基板管理控制器接收到所述中断信号时,根据所述优先级对所述中断信号进行处理。A signal processing module, configured to process the interrupt signal according to the priority when the baseboard management controller receives the interrupt signal.
在一种可选实施例中,所述装置还包括:In an optional embodiment, the device also includes:
监听重新执行模块,用于若所述中断信号为误报,则重新执行所述通过所述固件监听所述电压监控芯片是否向所述基板管理控制器发送中断信号的步骤。A monitoring and re-executing module, configured to re-execute the step of monitoring whether the voltage monitoring chip sends an interrupt signal to the baseboard management controller through the firmware if the interrupt signal is a false positive.
在一种可选实施例中,所述硬盘插槽的硬盘和磁盘阵列控制器的交互进程为所述磁盘阵列控制器与所述硬盘插槽中的硬盘进行数据交互,所述装置还包括:In an optional embodiment, the interaction process between the hard disk in the hard disk slot and the disk array controller is data interaction between the disk array controller and the hard disk in the hard disk slot, and the device further includes:
数据存储模块,用于将所述磁盘阵列控制器与所述硬盘插槽中硬盘之间停止交互的数据存储于所述磁盘阵列控制器。The data storage module is configured to store in the disk array controller the data of the stop interaction between the disk array controller and the hard disk in the hard disk slot.
对于装置实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。As for the device embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and for related parts, please refer to the part of the description of the method embodiment.
另外,本发明实施例还提供了一种电子设备,包括:处理器,存储器,存储在存储器上并可在处理器上运行的计算机程序,该计算机程序被处理器执行时实现上述电源监控方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。In addition, an embodiment of the present invention also provides an electronic device, including: a processor, a memory, and a computer program stored in the memory and operable on the processor. When the computer program is executed by the processor, the above power monitoring method is implemented. Each process of the example, and can achieve the same technical effect, in order to avoid repetition, will not repeat them here.
图6是本发明实施例中提供的一种计算机可读存储介质的结构示意图。Fig. 6 is a schematic structural diagram of a computer-readable storage medium provided in an embodiment of the present invention.
本发明实施例还提供了一种计算机可读存储介质601,计算机可读存储介质601上存储有计算机程序,计算机程序被处理器执行时实现上述电源监控方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。其中,所述的计算机可读存储介质601,如只读存储器(Read-Only Memory,简称ROM)、随机存取存储器(Random AccessMemory,简称RAM)、磁碟或者光盘等。The embodiment of the present invention also provides a computer-
图7为实现本发明各个实施例的一种电子设备的硬件结构示意图。FIG. 7 is a schematic diagram of a hardware structure of an electronic device implementing various embodiments of the present invention.
该电子设备700包括但不限于:射频单元701、网络模块702、音频输出单元703、输入单元704、传感器705、显示单元706、用户输入单元707、接口单元708、存储器709、处理器710、以及电源711等部件。本领域技术人员可以理解,图7中示出的电子设备结构并不构成对电子设备的限定,电子设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。在本发明实施例中,电子设备包括但不限于手机、平板电脑、笔记本电脑、掌上电脑、车载终端、可穿戴设备、以及计步器等。The
应理解的是,本发明实施例中,射频单元701可用于收发信息或通话过程中,信号的接收和发送,具体的,将来自基站的下行数据接收后,给处理器710处理;另外,将上行的数据发送给基站。通常,射频单元701包括但不限于天线、至少一个放大器、收发信机、耦合器、低噪声放大器、双工器等。此外,射频单元701还可以通过无线通信系统与网络和其他设备通信。It should be understood that, in the embodiment of the present invention, the
电子设备通过网络模块702为用户提供了无线的宽带互联网访问,如帮助用户收发电子邮件、浏览网页和访问流式媒体等。The electronic device provides users with wireless broadband Internet access through the
音频输出单元703可以将射频单元701或网络模块702接收的或者在存储器709中存储的音频数据转换成音频信号并且输出为声音。而且,音频输出单元703还可以提供与电子设备700执行的特定功能相关的音频输出(例如,呼叫信号接收声音、消息接收声音等等)。音频输出单元703包括扬声器、蜂鸣器以及受话器等。The
输入单元704用于接收音频或视频信号。输入单元704可以包括图形处理器(Graphics Processing Unit,GPU)7041和麦克风7042,图形处理器7041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。处理后的图像帧可以显示在显示单元706上。经图形处理器7041处理后的图像帧可以存储在存储器709(或其它存储介质)中或者经由射频单元701或网络模块702进行发送。麦克风7042可以接收声音,并且能够将这样的声音处理为音频数据。处理后的音频数据可以在电话通话模式的情况下转换为可经由射频单元701发送到移动通信基站的格式输出。The
电子设备700还包括至少一种传感器705,比如光传感器、运动传感器以及其他传感器。具体地,光传感器包括环境光传感器及接近传感器,其中,环境光传感器可根据环境光线的明暗来调节显示面板7061的亮度,接近传感器可在电子设备700移动到耳边时,关闭显示面板7061和/或背光。作为运动传感器的一种,加速计传感器可检测各个方向上(一般为三轴)加速度的大小,静止时可检测出重力的大小及方向,可用于识别电子设备姿态(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等;传感器705还可以包括指纹传感器、压力传感器、虹膜传感器、分子传感器、陀螺仪、气压计、湿度计、温度计、红外线传感器等,在此不再赘述。The
显示单元706用于显示由用户输入的信息或提供给用户的信息。显示单元706可包括显示面板7061,可以采用液晶显示器(Liquid Crystal Display,LCD)、有机发光二极管(Organic Light-Emitting Diode, OLED)等形式来配置显示面板7061。The
用户输入单元707可用于接收输入的数字或字符信息,以及产生与电子设备的用户设置以及功能控制有关的键信号输入。具体地,用户输入单元707包括触控面板7071以及其他输入设备7072。触控面板7071,也称为触摸屏,可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触控面板7071上或在触控面板7071附近的操作)。触控面板7071可包括触摸检测装置和触摸控制器两个部分。其中,触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给触摸控制器;触摸控制器从触摸检测装置上接收触摸信息,并将它转换成触点坐标,再送给处理器710,接收处理器710发来的命令并加以执行。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触控面板7071。除了触控面板7071,用户输入单元707还可以包括其他输入设备7072。具体地,其他输入设备7072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆,在此不再赘述。The
进一步的,触控面板7071可覆盖在显示面板7061上,当触控面板7071检测到在其上或附近的触摸操作后,传送给处理器710以确定触摸事件的类型,随后处理器710根据触摸事件的类型在显示面板7061上提供相应的视觉输出。虽然在图7中,触控面板7071与显示面板7061是作为两个独立的部件来实现电子设备的输入和输出功能,但是在某些实施例中,可以将触控面板7071与显示面板7061集成而实现电子设备的输入和输出功能,具体此处不做限定。Furthermore, the
接口单元708为外部装置与电子设备700连接的接口。例如,外部装置可以包括有线或无线头戴式耳机端口、外部电源(或电池充电器)端口、有线或无线数据端口、存储卡端口、用于连接具有识别模块的装置的端口、音频输入/输出(I/O)端口、视频I/O端口、耳机端口等等。接口单元708可以用于接收来自外部装置的输入(例如,数据信息、电力等等)并且将接收到的输入传输到电子设备700内的一个或多个元件或者可以用于在电子设备700和外部装置之间传输数据。The
存储器709可用于存储软件程序以及各种数据。存储器709可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据手机的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器709可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。The
处理器710是电子设备的控制中心,利用各种接口和线路连接整个电子设备的各个部分,通过运行或执行存储在存储器709内的软件程序和/或模块,以及调用存储在存储器709内的数据,执行电子设备的各种功能和处理数据,从而对电子设备进行整体监控。处理器710可包括一个或多个处理单元;优选的,处理器710可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器710中。The
电子设备700还可以包括给各个部件供电的电源711(比如电池),优选的,电源711可以通过电源管理系统与处理器710逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。The
另外,电子设备700包括一些未示出的功能模块,在此不再赘述。In addition, the
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It should be noted that, in this document, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not preclude the presence of additional identical elements in the process, method, article, or apparatus comprising that element.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本发明各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is better implementation. Based on this understanding, the essence of the technical solution of the present invention or the part that contributes to the prior art can be embodied in the form of software products, and the computer software products are stored in a storage medium (such as ROM/RAM, disk, CD-ROM), including several instructions to make a terminal (which can be a mobile phone, computer, server, air conditioner, or network device, etc.) execute the methods described in various embodiments of the present invention.
上面结合附图对本发明的实施例进行了描述,但是本发明并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本发明的启示下,在不脱离本发明宗旨和权利要求所保护的范围情况下,还可做出很多形式,均属于本发明的保护之内。Embodiments of the present invention have been described above in conjunction with the accompanying drawings, but the present invention is not limited to the above-mentioned specific implementations, and the above-mentioned specific implementations are only illustrative, rather than restrictive, and those of ordinary skill in the art will Under the enlightenment of the present invention, without departing from the gist of the present invention and the protection scope of the claims, many forms can also be made, all of which belong to the protection of the present invention.
本领域普通技术人员可以意识到,结合本发明实施例中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Those of ordinary skill in the art can appreciate that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed in the embodiments of the present invention can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present invention.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described system, device and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.
在本申请所提供的实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the embodiments provided in this application, it should be understood that the disclosed devices and methods may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components can be combined or May be integrated into another system, or some features may be ignored, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。If the functions described above are realized in the form of software function units and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the essence of the technical solution of the present invention or the part that contributes to the prior art or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in various embodiments of the present invention. The aforementioned storage medium includes: various media capable of storing program codes such as U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以权利要求的保护范围为准。The above is only a specific embodiment of the present invention, but the scope of protection of the present invention is not limited thereto. Anyone skilled in the art can easily think of changes or substitutions within the technical scope disclosed in the present invention. Should be covered within the protection scope of the present invention. Therefore, the protection scope of the present invention should be based on the protection scope of the claims.
Claims (21)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310235604.3A CN116048233A (en) | 2023-03-13 | 2023-03-13 | Power architecture, power monitoring method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310235604.3A CN116048233A (en) | 2023-03-13 | 2023-03-13 | Power architecture, power monitoring method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116048233A true CN116048233A (en) | 2023-05-02 |
Family
ID=86133407
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310235604.3A Pending CN116048233A (en) | 2023-03-13 | 2023-03-13 | Power architecture, power monitoring method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116048233A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN119025172A (en) * | 2024-10-23 | 2024-11-26 | 苏州元脑智能科技有限公司 | A power supply data acquisition method, system, readable storage medium and program product |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1900726A (en) * | 2005-07-22 | 2007-01-24 | 贵州南方汇通世华微硬盘有限公司 | Hard disc driver and its work pressure detecting method |
CN104598844A (en) * | 2014-12-31 | 2015-05-06 | 湖南国科微电子有限公司 | Power failure protection method and device for preventing corruption of system files of solid-state disk |
CN106873745A (en) * | 2017-02-10 | 2017-06-20 | 郑州云海信息技术有限公司 | A kind of system and method for realizing backboard voltage monitoring |
CN115098324A (en) * | 2022-06-21 | 2022-09-23 | 苏州浪潮智能科技有限公司 | Hard disk maintenance method, device, device and storage medium |
CN115756620A (en) * | 2022-11-29 | 2023-03-07 | 苏州浪潮智能科技有限公司 | A method, system, and related equipment for starting a server NVME hard disk |
-
2023
- 2023-03-13 CN CN202310235604.3A patent/CN116048233A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1900726A (en) * | 2005-07-22 | 2007-01-24 | 贵州南方汇通世华微硬盘有限公司 | Hard disc driver and its work pressure detecting method |
CN104598844A (en) * | 2014-12-31 | 2015-05-06 | 湖南国科微电子有限公司 | Power failure protection method and device for preventing corruption of system files of solid-state disk |
CN106873745A (en) * | 2017-02-10 | 2017-06-20 | 郑州云海信息技术有限公司 | A kind of system and method for realizing backboard voltage monitoring |
CN115098324A (en) * | 2022-06-21 | 2022-09-23 | 苏州浪潮智能科技有限公司 | Hard disk maintenance method, device, device and storage medium |
CN115756620A (en) * | 2022-11-29 | 2023-03-07 | 苏州浪潮智能科技有限公司 | A method, system, and related equipment for starting a server NVME hard disk |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN119025172A (en) * | 2024-10-23 | 2024-11-26 | 苏州元脑智能科技有限公司 | A power supply data acquisition method, system, readable storage medium and program product |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101565357B1 (en) | Systems, methods, and apparatuses for handling timeouts | |
KR102074018B1 (en) | Systems, methods, and apparatuses for synchronizing port entry into a low power state | |
US9645954B2 (en) | Embedded microcontroller and buses | |
CN115733549B (en) | PCIE network card, switching method of interface modes of PCIE network card, electronic equipment and storage medium | |
CN117806857B (en) | Fault location information generation method, device, electronic device and storage medium | |
WO2024131133A1 (en) | Data processing method and apparatus for electronic device, and electronic device and storage medium | |
CN115756941B (en) | Automatic device repairing method and device, electronic device and storage medium | |
CN116594493A (en) | Hard disk device control method and device, electronic device and storage medium | |
WO2024187824A1 (en) | Usb circuit, and switching method and apparatus for usb controller | |
CN116048233A (en) | Power architecture, power monitoring method and device | |
CN115543062B (en) | Server interrupt operation execution method and device | |
US10571992B2 (en) | Electronic device having a controller to enter a low power mode | |
CN213365379U (en) | Server mainboard and one-way server | |
CN115525511A (en) | Server management method, system, electronic device and storage medium | |
CN116737497A (en) | Data acquisition method and device, electronic equipment and storage medium | |
CN118964044B (en) | Memory expansion system and method, electronic device and storage medium | |
CN213987485U (en) | Mainboard supporting real-time automatic repair of USB (universal serial bus) and multiple serial ports and USB interfaces | |
CN116756069A (en) | Solid state drive NVMe control method, device, electronic equipment and storage medium | |
CN115934435A (en) | Method and device for detecting storage performance of equipment, electronic equipment and storage medium | |
CN115509334A (en) | Hard disk power supply system, method, electronic device and storage medium | |
CN116113205A (en) | Liquid cooling control method and device, electronic equipment and storage medium | |
CN116841594A (en) | Refreshing method and device for basic input/output system | |
CN102478897A (en) | Computer system | |
TW201510929A (en) | Video output controlling system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20230502 |