CN102467425A - Method for obtaining a fault signal of a storage device by using a baseboard management controller

Info

Publication number: CN102467425A
Authority: CN (China)
Prior art keywords: storage device, BMC, signal, detector, fault signal
Legal status: Pending
Application number: CN201010539115XA
Other languages: Chinese (zh)
Inventors: 陈志伟, 卢晓芬
Current assignee: Inventec Corp
Original assignee: Inventec Corp
Priority date: 2010-11-05
Filing date: 2010-11-05
Publication date: 2012-05-23
Application filed by: Inventec Corp

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

A method for obtaining a fault signal of a storage device by using a baseboard management controller (BMC) comprises the following steps: defining a detector monitored by the BMC, where the value of the detector indicates the state of the storage device; converting a storage device fault signal generated by a storage device controller into an electrical signal by a hardware encoding means; and reading the electrical signal with an input/output expander and assigning the value read as the value of the detector. Once the electrical signal has been generated, the method lights the corresponding LED group and informs the BMC through the detector. The hardware-controlled disk-failure indicator mechanism is thereby integrated into the events managed by the BMC, which unifies the management interface and improves management efficiency.

Description

Method for Obtaining a Fault Signal of a Storage Device Using a Baseboard Management Controller

Technical Field

The invention relates to a method for obtaining a fault signal of a storage device, and in particular to a method for obtaining a fault signal of a storage device by using a baseboard management controller (BMC).

Background

With the spread of computers and the rapid development of network technology, the services that ordinary computers or equipment can provide are no longer sufficient, which led to the development of server technology. A server is a computer platform suited to network workloads; it can connect to various network systems and provide application services to the computers connected through them. Most servers have large-capacity storage devices in order to provide services such as multimedia playback, network drives, or enterprise databases. The storage device is therefore a critical server component: once it fails, the server, and the services delivered to customers, are seriously affected.

To manage servers, the Intelligent Platform Management Interface (IPMI) was developed. An administrator can monitor a server through IPMI and the baseboard management controller (BMC) installed in the server. In current servers, however, when a storage device fails, independently operating hardware issues the fault signal and lights an indicator on the server; the fault signal is not passed to the management software through IPMI. In other words, the existing fault signal is decoded and controlled directly by hardware. As a result, existing servers cannot integrate the two parallel mechanisms, fault signaling and management, and cannot efficiently notify the administrator of a fault event.

Summary of the Invention

To solve the above problem, the invention provides a method for obtaining a fault signal of a storage device by using a baseboard management controller (BMC). The method applies to a server that has a BMC and a storage device, and includes: defining a detector monitored by the BMC, where the value of the detector indicates the state of the storage device; using a hardware encoding means to convert a storage device fault signal, generated by a storage device controller of the storage device, into an electrical signal; and reading the electrical signal with an input/output expander (I/O expander) and assigning the value read as the value of the detector.

According to one embodiment, the hardware encoding means may be a complex programmable logic device (CPLD), and the I/O expander reads the electrical signal from the CPLD. The storage device controller monitors the state of the storage device and sends the storage device fault signal to the CPLD accordingly. The storage device controller may send the electrical signal to the CPLD over serial general-purpose input/output (SGPIO). The detector of the BMC may read the electrical signal from the I/O expander over an inter-integrated circuit (I2C) bus.

In addition, the method may further include: having the CPLD light (drive) a light-emitting diode (LED) group corresponding to the storage device according to the electrical signal.

According to one embodiment, the method may also include: executing a storage device management program according to the value of the detector.

The storage device management program may include notifying a remote management program connected to the BMC through an Intelligent Platform Management Bus (IPMB). It may also include suspending at least one storage unit of the storage device according to the electrical signal. The storage device may include multiple storage units, and the electrical signal corresponds to these storage units.

In summary, once the method has generated the electrical signal, the corresponding LED group is lit and the BMC is informed through the detector. The hardware-controlled disk-failure indicator mechanism is thereby integrated into the events managed by the BMC, which unifies the management interface and improves management efficiency.

The invention is described in detail below with reference to the accompanying drawings and specific embodiments, which are not intended to limit the invention.

Brief Description of the Drawings

FIG. 1 is a schematic diagram of a server according to one embodiment;

FIG. 2 is a flowchart of a method for obtaining a fault signal of a storage device by using a baseboard management controller according to one embodiment;

FIG. 3 is a flowchart of a method for obtaining a fault signal of a storage device by using a baseboard management controller according to another embodiment;

FIG. 4 is a schematic diagram of a server according to another embodiment.

Reference numerals:

20 Server

21 Baseboard management controller

212 Detector

22 Storage device

222, 222a, 222b, 222c Storage unit

23 South bridge

232 Storage device controller

25 I/O expander

24 Hardware encoding means

26 LED group

262, 262a, 262b, 262c LED indicator

30 Remote computer

32 Remote management program

Detailed Description

The detailed features and advantages of the invention are described in the embodiments below in enough detail for any person skilled in the art to understand the technical content of the invention and implement it accordingly. From the content disclosed in this specification, the claims, and the drawings, any person skilled in the art can readily understand the related objects and advantages of the invention.

The invention concerns a method for obtaining a fault signal of a storage device by using a baseboard management controller (BMC), applicable to a server that has a BMC and a storage device.

Please refer to FIG. 1, a schematic diagram of a server according to one embodiment. The server 20 includes a BMC 21, a storage device 22, and a south bridge 23 of a central processing unit (CPU, not shown); the south bridge 23 is electrically connected to the storage device 22 through a storage device controller 232. The storage device 22 may be, for example, any of various large-capacity hard disks or a redundant array of inexpensive disks (RAID) system. The server 20 may also be connected over a network to a remote computer 30, and the remote computer 30 can manage the server 20 through a remote management program 32 and the BMC 21.

The server 20 may support the Intelligent Platform Management Interface (IPMI) and run an operating system on the hardware described above. The server 20 may use a Unix-like operating system such as Linux or FreeBSD, Microsoft Windows Server 2003, a disk operating system (DOS), or an Extensible Firmware Interface (EFI) system. The server 20 may also be any server product of any brand; the invention is not limited in this respect.

In more detail, IPMI is a standard architecture for server management platforms. It comprises five components: the BMC 21, a system interface, non-volatile storage, an Intelligent Platform Management Bus (IPMB), and an Intelligent Chassis Management Bus (ICMB). The most important of these is the BMC 21. The BMC 21 is like an independent computer with its own processor, memory, and other resources. It operates entirely on its own resources and does not occupy other resources of the hardware modules of the server 20. For example, the remote computer 30 may use Hewlett-Packard's iLO system, Dell's iDRAC system, or Intel's ESB2 system.

Referring to FIG. 2 together with FIG. 1, FIG. 2 is a flowchart of a method for obtaining a fault signal of a storage device by using a BMC according to one embodiment. First, a detector (sensor) 212 monitored by the BMC 21 is defined, where the value of the detector 212 indicates the state of the storage device 22, for example whether it has failed (step S100). The BMC 21 may itself include several other detectors that monitor hardware such as the CPU of the server 20; the detector 212 is defined in addition to them in step S100.
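Step S100 can be pictured as adding one more entry to the set of detectors the BMC already tracks. The following Python sketch is only a software model under stated assumptions: the `Detector` class, the detector numbers, and the names are hypothetical and do not come from the patent or from any BMC firmware.

```python
from dataclasses import dataclass


@dataclass
class Detector:
    """A BMC-monitored detector whose value mirrors a hardware state."""
    number: int      # hypothetical detector number
    name: str        # hypothetical human-readable name
    value: int = 0   # 0 is taken to mean "no fault reported"

    def is_faulted(self) -> bool:
        return self.value != 0


# Detectors the BMC is assumed to have already (CPU, fan, and so on).
detectors = {
    0x01: Detector(0x01, "CPU Temp"),
    0x02: Detector(0x02, "Fan 1"),
}

# Step S100: define the additional detector for the storage device.
storage_detector = Detector(0x30, "Storage Fault")
detectors[storage_detector.number] = storage_detector
```

Keeping the storage detector in the same table as the others is what lets later steps treat a disk failure like any other BMC-managed event.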

The storage device controller 232 continuously monitors the state of the storage device 22. When the storage device 22 fails, the storage device controller 232 generates a storage device fault signal and sends it to a hardware encoding means 24 of the server 20. Using the hardware encoding means 24, the server 20 converts the storage device fault signal issued by the storage device controller 232 into an electrical signal (step S110).

The hardware encoding means 24 may be a complex programmable logic device (CPLD). A CPLD may contain multiple programmable array logic (PAL) blocks and is used to implement various arithmetic and combinational logic. The interconnections between the PAL blocks can also be planned and programmed. This all-in-one integration lets a CPLD implement circuits that would otherwise take thousands, or even hundreds of thousands, of logic gates. In this embodiment the CPLD is programmed to perform the functions required of the hardware encoding means 24. For example, the CPLD may receive the signal sent by the storage device controller 232 over serial general-purpose input/output (SGPIO) and re-encode it as the electrical signal.
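The patent does not describe the SGPIO frame layout or the CPLD logic. The following Python sketch only models the re-encoding in software, assuming three bits per drive in the order activity, locate, fail (a layout commonly used with SGPIO). The constants and the function name `fault_bits` are hypothetical.

```python
from typing import List, Sequence

BITS_PER_DRIVE = 3  # assumed: activity, locate, fail
FAIL_OFFSET = 2     # assumed position of the fail bit within each group


def fault_bits(stream: Sequence[int], drives: int) -> List[int]:
    """Pick the fail bit of each drive out of a serial bit stream."""
    if len(stream) < BITS_PER_DRIVE * drives:
        raise ValueError("bit stream shorter than expected")
    return [stream[BITS_PER_DRIVE * i + FAIL_OFFSET] for i in range(drives)]


# Three drives; only the second one reports a failure.
stream = [1, 0, 0,  0, 0, 1,  1, 0, 0]
print(fault_bits(stream, 3))  # [0, 1, 0]
```

In real hardware this selection would be combinational logic inside the CPLD rather than a function call; the sketch only shows which information survives the re-encoding.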

Next, an input/output expander (I/O expander) 25 reads the electrical signal, and the value read is assigned as the value of the detector (step S120). That is, the I/O expander 25 reads the electrical signal from the CPLD, and the detector 212 of the BMC 21 in turn reads the electrical signal from the I/O expander 25. The BMC 21 and the I/O expander 25 may be connected by an inter-integrated circuit (I2C) bus. The detector 212 periodically reads the electrical signal over the I2C bus and takes its content as its own value.
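Step S120 amounts to one register read over I2C followed by an assignment. The following Python sketch uses a stand-in bus object so that it runs without hardware. The class `FakeI2CBus`, the address `0x20`, the register `0x00`, and the three-bit mask are all assumptions made for illustration; the patent names no specific expander part or register map.

```python
EXPANDER_ADDR = 0x20  # hypothetical 7-bit I2C address of the I/O expander
INPUT_REG = 0x00      # hypothetical input-port register
UNIT_MASK = 0b111     # assumed: three storage units, one bit each


class FakeI2CBus:
    """Stand-in for an I2C master; returns preset register bytes."""

    def __init__(self, registers):
        self._registers = registers  # {(address, register): byte}

    def read_byte_data(self, address, register):
        return self._registers[(address, register)] & 0xFF


def read_detector_value(bus) -> int:
    """Read the expander input port and keep only the storage-unit bits."""
    raw = bus.read_byte_data(EXPANDER_ADDR, INPUT_REG)
    return raw & UNIT_MASK


# The CPLD is assumed to be driving 0b010 onto the expander inputs.
bus = FakeI2CBus({(EXPANDER_ADDR, INPUT_REG): 0b010})
detector_value = read_detector_value(bus)  # value assigned to detector 212
```

On a real BMC the bus object would be replaced by the platform's own I2C driver; only the read-then-assign pattern is taken from the text.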

Please refer to FIG. 3, a flowchart of a method for obtaining a fault signal of a storage device by using a BMC according to another embodiment. The method may further have the hardware encoding means 24 (for example, the CPLD) light (that is, drive) a light-emitting diode (LED) group 26 corresponding to the storage device 22 according to the electrical signal (step S130).

Please also refer to FIG. 4, a schematic diagram of a server according to another embodiment. The storage device 22 may include multiple storage units 222, for example storage units 222a, 222b, and 222c; the LED group 26 may include the same number of LED indicators 262, for example LED indicators 262a, 262b, and 262c. The re-encoded electrical signal corresponds to these storage units 222 and is used to light the LED indicators 262. Step S130 lets the administrator of the server 20 readily see the fault condition of the storage device 22.

The method may also execute a storage device management program according to the value of the detector 212 (step S140). The BMC 21 may log the failure event of the storage device 22 and handle it according to the storage device management program. The program may suspend at least one failed storage unit 222 of the storage device 22 according to the electrical signal, or notify the remote management program 32 of the remote computer 30 connected to the BMC 21 through the IPMB.
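The actions listed for step S140 (log the event, suspend the failed unit, notify the remote side) can be sketched as one small routine. This Python model is an assumption-laden illustration: the function name, the callback parameters `suspend` and `notify`, and the bit order (leftmost bit maps to the first unit) are hypothetical choices, and the sketch performs all three actions although the text allows any subset.

```python
from typing import Callable, List


def run_storage_management(
    value: int,
    units: List[str],
    suspend: Callable[[str], None],
    notify: Callable[[List[str]], None],
    event_log: List[str],
) -> List[str]:
    """Log, suspend, and report every unit whose bit is set in `value`."""
    n = len(units)
    # Assumed bit order: the most significant of the n bits is units[0].
    failed = [u for i, u in enumerate(units) if (value >> (n - 1 - i)) & 1]
    for unit in failed:
        event_log.append(f"storage unit {unit} failed")
        suspend(unit)
    if failed:
        notify(failed)
    return failed


log: List[str] = []
suspended: List[str] = []
sent: List[List[str]] = []
result = run_storage_management(
    0b010, ["222a", "222b", "222c"], suspended.append, sent.append, log
)
print(result)  # ['222b']
```

Passing the suspend and notify actions in as callbacks keeps the sketch independent of how a given platform actually reaches the drives or the IPMB.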

Note that the order in which steps S130 and S140 are executed is not restricted.

The following is an example of the method in actual operation.

For example, when the storage unit 222b fails, the storage device controller 232 issues a storage device fault signal accordingly. The hardware encoding means 24 receives the storage device fault signal and converts it into an electrical signal. For instance, a binary sequence may represent all the storage units 222, one bit per unit, with "0" meaning normal; the electrical signal "010" then indicates that only the storage unit 222b has currently failed. The hardware encoding means 24 then passes the electrical signal to the I/O expander 25 and, according to the "010" signal, lights the LED indicator 262b red. The LED indicators 262a and 262c, which correspond to the storage units 222a and 222c that have not failed, may stay off or stay lit green to indicate a normal state.
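The "010" example can be reproduced in a few lines. This Python sketch models the encoding and the LED mapping in software only; the function names are hypothetical, and the choice of green (rather than off) for healthy units is one of the two options the text allows.

```python
UNITS = ["222a", "222b", "222c"]
LEDS = {"222a": "262a", "222b": "262b", "222c": "262c"}


def encode_faults(failed):
    """One character per unit, left to right: '1' = failed, '0' = normal."""
    return "".join("1" if unit in failed else "0" for unit in UNITS)


def led_colors(signal, healthy="green"):
    """Map each bit of the signal to the color of the matching LED."""
    return {
        LEDS[unit]: ("red" if bit == "1" else healthy)
        for unit, bit in zip(UNITS, signal)
    }


signal = encode_faults({"222b"})
print(signal)              # 010
print(led_colors(signal))  # {'262a': 'green', '262b': 'red', '262c': 'green'}
```

Calling `led_colors(signal, healthy="off")` models the other permitted behavior, in which indicators for healthy units stay unlit.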

The BMC 21 periodically reads the value of the detector 212, which can be regarded as reading the value output by the I/O expander 25 through the detector 212. If every bit of the detector 212 value is 0, everything is normal. When the BMC 21 receives an electrical signal whose value is not 0, it executes the storage device management program to notify the administrator via the network and the remote computer 30. The administrator can then promptly go and repair or replace the failed storage unit 222b.
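The periodic check described above can be modeled as a loop over successive detector readings. One detail in this Python sketch goes beyond the text and is an assumption: the handler fires only when the value changes to a new nonzero value, so that a persistent fault does not trigger a notification on every poll. The function name `poll` is hypothetical.

```python
def poll(readings, on_fault):
    """Call on_fault each time the detector value changes to a nonzero value."""
    previous = 0
    for value in readings:
        if value != 0 and value != previous:
            on_fault(value)
        previous = value


calls = []
# Successive detector values: normal, normal, 222b fails, still failed,
# replaced, then two units fail at once.
poll([0b000, 0b000, 0b010, 0b010, 0b000, 0b110], calls.append)
print(calls)  # [2, 6]
```

A firmware loop would read the detector on a timer instead of iterating over a list; the list stands in for those timed reads so the behavior can be checked directly.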

In summary, after the method generates the electrical signal with the hardware encoding means, the signal is used not only to light the corresponding LED group but also to inform the BMC through the detector. In other words, the disk-failure indicator mechanism that used to be controlled independently by hardware is integrated into the events managed by the BMC, which unifies the management interface. This resolves the disorganized management of the prior art, in which separate mechanisms ran in parallel without coordination, so that the server can be managed more simply and efficiently and the administrator is notified efficiently when a fault event occurs.

The invention may of course have various other embodiments. Those skilled in the art can make corresponding changes and modifications according to the invention without departing from its spirit and essence, and all such changes and modifications fall within the scope of protection of the appended claims.

Claims (10)

1. A method for obtaining a fault signal of a storage device by using a baseboard management controller (BMC), applicable to a server having a BMC and a storage device, characterized in that the method comprises: defining a detector monitored by the BMC, wherein the value of the detector indicates the state of the storage device; converting, by a hardware encoding means, a storage device fault signal generated by a storage device controller of the storage device into an electrical signal; and reading the electrical signal by an input/output expander, and assigning the value of the electrical signal read as the value of the detector.

2. The method according to claim 1, characterized in that the hardware encoding means is a complex programmable logic device (CPLD), and the input/output expander reads the electrical signal from the CPLD.

3. The method according to claim 2, characterized in that the storage device controller monitors the state of the storage device and accordingly sends the storage device fault signal to the CPLD.

4. The method according to claim 3, characterized in that the storage device controller sends the electrical signal to the CPLD through serial general-purpose input/output.

5. The method according to claim 2, characterized in that it further comprises: causing the CPLD to light a light-emitting diode group corresponding to the storage device according to the electrical signal.

6. The method according to claim 1, characterized in that the detector of the BMC reads the electrical signal from the input/output expander through an inter-integrated circuit bus.

7. The method according to claim 1, characterized in that it further comprises: executing a storage device management program according to the value of the detector.

8. The method according to claim 7, characterized in that the storage device management program comprises: notifying a remote management program connected to the BMC through an Intelligent Platform Management Bus (IPMB).

9. The method according to claim 7, characterized in that the storage device management program comprises: suspending at least one storage unit of the storage device according to the electrical signal.

10. The method according to claim 1, characterized in that the storage device comprises a plurality of storage units, and the electrical signal corresponds to the storage units.
CN201010539115XA 2010-11-05 2010-11-05 Method for obtaining fault signal of storage device by using baseboard management controller Pending CN102467425A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010539115XA CN102467425A (en) 2010-11-05 2010-11-05 Method for obtaining fault signal of storage device by using baseboard management controller

Publications (1)

Publication Number Publication Date
CN102467425A true CN102467425A (en) 2012-05-23

Family

ID=46071089

Country Status (1)

Country Link
CN (1) CN102467425A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1746859A (en) * 2004-09-09 2006-03-15 英业达股份有限公司 An intelligent platform event alarm system and method thereof
CN101000567A (en) * 2006-01-13 2007-07-18 英业达股份有限公司 Labeling system and method
CN101350737A (en) * 2007-07-19 2009-01-21 宏正自动科技股份有限公司 Intelligent platform management interface system, electronic equipment and memory sharing method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120523