CN115359834B - A disk arbitration area detection method, device, equipment and readable storage medium - Google Patents
A disk arbitration area detection method, device, equipment and readable storage medium Download PDFInfo
- Publication number
- CN115359834B CN115359834B CN202211269936.5A CN202211269936A CN115359834B CN 115359834 B CN115359834 B CN 115359834B CN 202211269936 A CN202211269936 A CN 202211269936A CN 115359834 B CN115359834 B CN 115359834B
- Authority
- CN
- China
- Prior art keywords
- disk
- arbitration
- target
- node
- area
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11C—STATIC STORES
- G11C29/00—Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
- G11C29/56—External testing equipment for static stores, e.g. automatic test equipment [ATE]; Interfaces therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2205—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2273—Test methods
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
技术领域technical field
本申请涉及计算机技术领域,特别涉及一种盘仲裁区域检测方法、装置、设备及可读存储介质。The present application relates to the field of computer technology, in particular to a disk arbitration area detection method, device, equipment and readable storage medium.
背景技术Background technique
目前,存储系统中需要设置仲裁盘来记录仲裁信息,基于该仲裁信息可方便确定存储系统中的主控节点,或者使存储系统分裂为多个。当前可以对存储系统中的RAID(Redundant Arraysof Independent Disks,磁盘阵列)成员盘进行检测,以根据检测到的盘健康状况确定哪些盘可被选为仲裁盘。在检测各RAID成员盘时,系统既需要承担IO业务,又需要承担检测任务,因此系统总体负载可能较大,会降低系统整体性能。At present, an arbitration disk needs to be set in the storage system to record arbitration information. Based on the arbitration information, the master control node in the storage system can be conveniently determined, or the storage system can be split into several. Currently, member disks of a RAID (Redundant Arrays of Independent Disks, disk array) in a storage system may be detected, so as to determine which disks may be selected as arbitration disks according to the detected health status of the disks. When detecting each RAID member disk, the system needs to undertake both IO services and detection tasks. Therefore, the overall system load may be large, which will reduce the overall system performance.
因此,如何平衡盘检测任务和IO业务对存储系统的压力,是本领域技术人员需要解决的问题。Therefore, how to balance the pressure of the disk detection task and the IO service on the storage system is a problem to be solved by those skilled in the art.
发明内容Contents of the invention
有鉴于此,本申请的目的在于提供一种盘仲裁区域检测方法、装置、设备及可读存储介质,以平衡盘检测任务和IO业务对存储系统的压力。其具体方案如下:In view of this, the purpose of the present application is to provide a disk arbitration area detection method, device, equipment and readable storage medium, so as to balance the pressure of the disk detection task and the IO service on the storage system. The specific plan is as follows:
第一方面,本申请提供了一种盘仲裁区域检测方法,包括:In a first aspect, the present application provides a disk arbitration area detection method, including:
遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测所述目标磁盘的目标节点;其中,所述目标节点为所述多控存储系统中的任一节点;Traversing each disk in the multi-control storage system, if currently traversing to the target disk, then determine a target node for detecting the target disk based on a preset load balancing strategy; wherein, the target node is the target node in the multi-control storage system any node of
利用所述目标节点检测所述目标磁盘中的仲裁区域对于IO的响应;Using the target node to detect the response of the arbitration area in the target disk to the IO;
若所述响应为IO报错消息且所述目标磁盘不是系统仲裁盘,则为所述目标磁盘添加仲裁区域故障的标记信息。If the response is an IO error message and the target disk is not a system quorum disk, add the flag information of the quorum area failure to the target disk.
可选地,所述基于预设负载均衡策略确定用于检测所述目标磁盘的目标节点,包括:Optionally, the determining the target node for detecting the target disk based on a preset load balancing policy includes:
从所述预设负载均衡策略中获取所述多控存储系统中的各节点与其需要检测的磁盘之间的对应关系;Acquire the correspondence between each node in the multi-control storage system and the disks to be detected from the preset load balancing strategy;
按照所述对应关系确定用于检测所述目标磁盘的目标节点。A target node for detecting the target disk is determined according to the corresponding relationship.
可选地,所述对应关系的构建过程包括:Optionally, the process of constructing the correspondence includes:
将所述多控存储系统中的所有磁盘均衡分配至所述多控存储系统中的各节点,得到分配结果;All the disks in the multi-control storage system are evenly allocated to each node in the multi-control storage system, and an allocation result is obtained;
按照所述分配结果建立所述对应关系。The corresponding relationship is established according to the allocation result.
可选地,所述利用所述目标节点检测所述目标磁盘中的仲裁区域对于IO的响应,包括:Optionally, using the target node to detect the response of the arbitration area in the target disk to the IO includes:
利用所述目标节点读取所述仲裁区域中的任一段连续地址中存储的数据;using the target node to read data stored in any segment of consecutive addresses in the arbitration area;
若读取数据失败,则确定所述仲裁区域对于IO的响应为所述IO报错消息。If the data reading fails, it is determined that the arbitration area responds to the IO by reporting an IO error message.
可选地,所述确定所述仲裁区域对于IO的响应为所述IO报错消息之前,还包括:Optionally, before determining that the response of the arbitration area to the IO is the IO error message, the method further includes:
对当前连续地址执行修复操作,若当前连续地址修复失败,则执行所述确定所述仲裁区域对于IO的响应为所述IO报错消息的步骤。Perform a repair operation on the current continuous address, and if the repair of the current continuous address fails, perform the step of determining that the response of the arbitration area to the IO is the IO error message.
可选地,所述对当前连续地址执行修复操作,包括:Optionally, performing the repair operation on the current continuous address includes:
将随机生成数据写入当前连续地址;Write randomly generated data into the current continuous address;
从当前连续地址读取所述随机生成数据;reading the randomly generated data from the current continuous address;
若读取到的随机生成数据与写入的随机生成数据一致,则确定当前连续地址修复成功;否则,确定当前连续地址修复失败。If the read randomly generated data is consistent with the written randomly generated data, it is determined that the repair of the current continuous address is successful; otherwise, it is determined that the repair of the current continuous address fails.
可选地,还包括:Optionally, also include:
若读取数据成功或当前连续地址修复成功,则在当前连续地址的尾地址不是所述仲裁区域的结束地址时,利用所述目标节点读取与当前连续地址相邻的下一段连续地址中存储的数据,并判断是否成功读取数据。If the read data is successful or the current continuous address is successfully repaired, when the tail address of the current continuous address is not the end address of the arbitration area, the target node is used to read the data stored in the next continuous address adjacent to the current continuous address. , and judge whether the data is read successfully.
可选地,还包括:Optionally, also include:
若读取数据时获取到所述目标磁盘的离线消息或当前连续地址的尾地址是所述仲裁区域的结束地址,则等待下一检测时间点,以便到达下一检测时间点时,遍历到下一磁盘。If the offline message of the target disk is obtained when reading data or the tail address of the current continuous address is the end address of the arbitration area, then wait for the next detection time point, so that when the next detection time point is reached, traverse to the next a disk.
可选地,还包括:Optionally, also include:
若所述响应为所述IO报错消息且所述目标磁盘是所述系统仲裁盘,则生成所述系统仲裁盘的故障提示消息。If the response is the IO error message and the target disk is the system arbitration disk, generate a failure prompt message for the system arbitration disk.
可选地,所述生成所述系统仲裁盘的故障提示消息之后,还包括:Optionally, after generating the fault prompt message of the system arbitration disk, the method further includes:
收集所述多控存储系统中各磁盘的所述标记信息;Collecting the tag information of each disk in the multi-control storage system;
基于各磁盘的所述标记信息重新选择所述系统仲裁盘。The system arbitration disk is reselected based on the tag information of each disk.
可选地,所述基于各磁盘的所述标记信息重新选择所述系统仲裁盘,包括:Optionally, the reselecting the system arbitration disk based on the label information of each disk includes:
将没有所述标记信息的磁盘作为候选盘,并基于仲裁盘选择策略在各候选盘中选择所述系统仲裁盘。A disk without the tag information is used as a candidate disk, and the system arbitration disk is selected from among the candidate disks based on an arbitration disk selection policy.
可选地,所述利用所述目标节点检测所述目标磁盘中的仲裁区域对于IO的响应,包括:Optionally, using the target node to detect the response of the arbitration area in the target disk to the IO includes:
利用所述目标节点下发读请求和/或写请求至所述仲裁区域;using the target node to issue a read request and/or write request to the arbitration area;
获取所述仲裁区域对于所述读请求和/或所述写请求的处理结果;Obtain a processing result of the arbitration area for the read request and/or the write request;
将所述仲裁区域对于所述读请求和/或所述写请求的处理结果作为所述仲裁区域对于IO的响应。Taking the processing result of the arbitration area for the read request and/or the write request as the arbitration area's response to the IO.
第二方面,本申请提供了一种盘仲裁区域检测装置,包括:In a second aspect, the present application provides a disk arbitration area detection device, including:
遍历模块,用于遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测所述目标磁盘的目标节点;其中,所述目标节点为所述多控存储系统中的任一节点;The traversal module is used to traverse each disk in the multi-control storage system. If the target disk is currently traversed, the target node for detecting the target disk is determined based on a preset load balancing strategy; wherein the target node is the Any node in the multi-control storage system;
检测模块,用于利用所述目标节点检测所述目标磁盘中的仲裁区域对于IO的响应;A detection module, configured to use the target node to detect the response of the arbitration area in the target disk to IO;
标记模块,用于若所述响应为IO报错消息且所述目标磁盘不是系统仲裁盘,则为所述目标磁盘添加仲裁区域故障的标记信息。The marking module is configured to add the marking information of the arbitration area failure to the target disk if the response is an IO error message and the target disk is not a system arbitration disk.
可选地,遍历模块具体用于:Optionally, the traversal module is specifically used to:
从所述预设负载均衡策略中获取所述多控存储系统中的各节点与其需要检测的磁盘之间的对应关系;按照所述对应关系确定用于检测所述目标磁盘的目标节点。Obtain the corresponding relationship between each node in the multi-control storage system and the disk to be detected from the preset load balancing strategy; determine the target node for detecting the target disk according to the corresponding relationship.
可选地,所述对应关系的构建过程包括:Optionally, the process of constructing the correspondence includes:
将所述多控存储系统中的所有磁盘均衡分配至所述多控存储系统中的各节点,得到分配结果;按照所述分配结果建立所述对应关系。All disks in the multi-control storage system are evenly allocated to each node in the multi-control storage system to obtain an allocation result; and the corresponding relationship is established according to the allocation result.
可选地,检测模块包括:Optionally, the detection module includes:
读IO检测单元,用于利用所述目标节点读取所述仲裁区域中的任一段连续地址中存储的数据;若读取数据失败,则确定所述仲裁区域对于IO的响应为所述IO报错消息。The read IO detection unit is used to use the target node to read the data stored in any continuous address in the arbitration area; if the read data fails, it is determined that the response of the arbitration area to the IO is an error report for the IO information.
可选地,检测模块还包括:Optionally, the detection module also includes:
修复单元,用于在所述确定所述仲裁区域对于IO的响应为所述IO报错消息之前,对当前连续地址执行修复操作,若当前连续地址修复失败,则执行所述确定所述仲裁区域对于IO的响应为所述IO报错消息的步骤。A repair unit, configured to perform a repair operation on the current continuous address before determining that the response of the arbitration area to the IO is the IO error message, and if the repair of the current continuous address fails, perform the determination of the arbitration area for the The IO response is the step of reporting an IO error message.
可选地,修复单元具体用于:Optionally, the repair unit is specifically for:
将随机生成数据写入当前连续地址;从当前连续地址读取所述随机生成数据;若读取到的随机生成数据与写入的随机生成数据一致,则确定当前连续地址修复成功;否则,确定当前连续地址修复失败。Write the randomly generated data into the current continuous address; read the randomly generated data from the current continuous address; if the read randomly generated data is consistent with the written randomly generated data, then determine that the current continuous address is repaired successfully; otherwise, determine The current consecutive address repair failed.
可选地,检测模块还包括:Optionally, the detection module also includes:
循环单元,用于若读取数据成功或当前连续地址修复成功,则在当前连续地址的尾地址不是所述仲裁区域的结束地址时,利用所述目标节点读取与当前连续地址相邻的下一段连续地址中存储的数据,并判断是否成功读取数据。The loop unit is used to read the next address adjacent to the current continuous address by using the target node when the tail address of the current continuous address is not the end address of the arbitration area if the data is read successfully or the current continuous address is repaired successfully. Data stored in a continuous address, and judge whether the data is read successfully.
可选地,还包括:Optionally, also include:
定时模块,用于若读取数据时获取到所述目标磁盘的离线消息或当前连续地址的尾地址是所述仲裁区域的结束地址,则等待下一检测时间点,以便到达下一检测时间点时,遍历到下一磁盘。The timing module is used to wait for the next detection time point so as to reach the next detection time point if the offline message of the target disk is obtained when reading data or the tail address of the current continuous address is the end address of the arbitration area , traverse to the next disk.
可选地,还包括:Optionally, also include:
提示模块,用于若所述响应为所述IO报错消息且所述目标磁盘是所述系统仲裁盘,则生成所述系统仲裁盘的故障提示消息。A prompt module, configured to generate a failure prompt message of the system arbitration disk if the response is the IO error message and the target disk is the system arbitration disk.
可选地,还包括:Optionally, also include:
仲裁盘更换模块,用于在所述生成所述系统仲裁盘的故障提示消息之后,收集所述多控存储系统中各磁盘的所述标记信息;基于各磁盘的所述标记信息重新选择所述系统仲裁盘。The arbitration disk replacement module is configured to collect the label information of each disk in the multi-control storage system after the generation of the failure prompt message of the system arbitration disk; reselect the disk based on the label information of each disk. System quorum disk.
可选地,仲裁盘更换模块具体用于:Optionally, the arbitration disk replacement module is specifically used for:
将没有所述标记信息的磁盘作为候选盘,并基于仲裁盘选择策略在各候选盘中选择所述系统仲裁盘。A disk without the tag information is used as a candidate disk, and the system arbitration disk is selected from among the candidate disks based on an arbitration disk selection policy.
可选地,检测模块具体用于:Optionally, the detection module is specifically used for:
利用所述目标节点下发读请求和/或写请求至所述仲裁区域;using the target node to issue a read request and/or write request to the arbitration area;
获取所述仲裁区域对于所述读请求和/或所述写请求的处理结果;Obtain a processing result of the arbitration area for the read request and/or the write request;
将所述仲裁区域对于所述读请求和/或所述写请求的处理结果作为所述仲裁区域对于IO的响应。Taking the processing result of the arbitration area for the read request and/or the write request as the arbitration area's response to the IO.
第三方面,本申请提供了一种电子设备,包括:In a third aspect, the present application provides an electronic device, including:
存储器,用于存储计算机程序;memory for storing computer programs;
处理器,用于执行所述计算机程序,以实现前述公开的盘仲裁区域检测方法。A processor, configured to execute the computer program, so as to implement the aforementioned disclosed disk arbitration area detection method.
第四方面,本申请提供了一种可读存储介质,用于保存计算机程序,其中,所述计算机程序被处理器执行时实现前述公开的盘仲裁区域检测方法。In a fourth aspect, the present application provides a readable storage medium for storing a computer program, wherein, when the computer program is executed by a processor, the above disclosed disk arbitration area detection method is implemented.
通过以上方案可知,本申请提供了一种盘仲裁区域检测方法,包括:遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测所述目标磁盘的目标节点;其中,所述目标节点为所述多控存储系统中的任一节点;利用所述目标节点检测所述目标磁盘中的仲裁区域对于IO的响应;若所述响应为IO报错消息且所述目标磁盘不是系统仲裁盘,则为所述目标磁盘添加仲裁区域故障的标记信息。From the above solutions, it can be seen that the present application provides a disk arbitration area detection method, including: traversing each disk in the multi-control storage system, if the current traversal to the target disk, then based on the preset load balancing strategy to determine the method for detecting the target The target node of the disk; wherein, the target node is any node in the multi-control storage system; using the target node to detect the response of the arbitration area in the target disk to IO; if the response is an IO error message and the target disk is not the system quorum disk, add the flag information of the quorum area failure to the target disk.
可见,本申请通过遍历多控存储系统中的各磁盘来检测各磁盘的健康状况,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测目标磁盘的目标节点,由此可使系统中的所有节点均衡承担系统中各盘的检测任务,不至于某一节点承担的检测任务较多,而影响了该节点上的IO业务。在具体检测各个盘时,利用上述步骤确定的目标节点检测目标磁盘中的仲裁区域对于IO的响应;若响应为IO报错消息且目标磁盘不是系统仲裁盘,则为目标磁盘添加仲裁区域故障的标记信息,后续便可以依据此标记信息为系统选择仲裁盘。并且,本申请对系统中的每个磁盘都进行检测,也就是说:系统中的每个磁盘都有概率被选为仲裁盘,选择范围并不局限于系统中的RAID成员盘。因为存储系统中的非RAID成员盘的业务负载压力一般小,更适合被选为仲裁盘。可见,本申请使系统中的所有节点均衡承担系统中各盘的检测任务,能够平衡盘检测任务和IO业务对存储系统的压力,并且系统中负载较小的非RAID成员盘也有可能被选为仲裁盘,可优化仲裁盘的选择策略。It can be seen that the present application detects the health status of each disk by traversing each disk in the multi-control storage system. If the target disk is currently traversed, the target node for detecting the target disk is determined based on the preset load balancing strategy, so that the All nodes in the system undertake the detection tasks of each disk in the system in a balanced manner, so that a certain node does not undertake more detection tasks, which affects the IO business on this node. When specifically detecting each disk, use the target node determined by the above steps to detect the response of the arbitration area in the target disk to IO; if the response is an IO error message and the target disk is not a system arbitration disk, add a flag for the arbitration area failure to the target disk Information, and then you can select the quorum disk for the system based on this tag information. Moreover, the present application detects each disk in the system, that is to say, each disk in the system has a probability of being selected as an arbitration disk, and the selection range is not limited to the RAID member disks in the system. Because the service load pressure of the non-RAID member disk in the storage system is generally small, it is more suitable to be selected as the arbitration disk. It can be seen that this application enables all nodes in the system to bear the detection tasks of each disk in the system in a balanced manner, which can balance the disk detection tasks and the pressure of the IO business on the storage system, and non-RAID member disks with less load in the system may also be selected as Arbitration disk, which can optimize the selection strategy of the arbitration disk.
相应地,本申请提供的一种盘仲裁区域检测装置、设备及可读存储介质,也同样具有上述技术效果。Correspondingly, the disk arbitration area detection device, equipment and readable storage medium provided by the present application also have the above-mentioned technical effects.
附图说明Description of drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only It is an embodiment of the present application, and those skilled in the art can also obtain other drawings according to the provided drawings without creative work.
图1为本申请公开的一种盘仲裁区域检测方法流程图;FIG. 1 is a flow chart of a disk arbitration area detection method disclosed in the present application;
图2为本申请公开的另一种盘仲裁区域检测方法流程图;FIG. 2 is a flow chart of another disk arbitration area detection method disclosed in the present application;
图3为本申请公开的一种verify IO的结果处理示意图;Fig. 3 is a schematic diagram of result processing of a verify IO disclosed in the present application;
图4为本申请公开的一种write verify指令的结果处理示意图;Fig. 4 is a schematic diagram of the result processing of a write verify command disclosed in the present application;
图5为本申请公开的一种盘仲裁区域检测装置示意图;FIG. 5 is a schematic diagram of a disc arbitration area detection device disclosed in the present application;
图6为本申请公开的一种电子设备示意图。FIG. 6 is a schematic diagram of an electronic device disclosed in the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.
目前,可以对存储系统中的RAID成员盘进行检测,以根据检测到的盘健康状况确定哪些盘可被选为仲裁盘。在检测各RAID成员盘时,系统既需要承担IO业务,又需要承担检测任务,因此系统总体负载可能较大,会降低系统整体性能。为此,本申请提供了一种盘仲裁区域检测方案,能够平衡盘检测任务和IO业务对存储系统的压力,优化仲裁盘的选择策略。Currently, RAID member disks in the storage system can be detected to determine which disks can be selected as arbitration disks according to the detected disk health status. When detecting each RAID member disk, the system needs to undertake both IO services and detection tasks. Therefore, the overall system load may be large, which will reduce the overall system performance. For this reason, the present application provides a disk arbitration area detection scheme, which can balance the disk detection task and the pressure of the IO service on the storage system, and optimize the selection strategy of the arbitration disk.
参见图1所示,本申请实施例公开了一种盘仲裁区域检测方法,包括:Referring to Figure 1, the embodiment of the present application discloses a disk arbitration area detection method, including:
S101、遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测目标磁盘的目标节点。S101. Traverse each disk in the multi-control storage system, and if a target disk is currently traversed, determine a target node for detecting the target disk based on a preset load balancing policy.
需要说明的是,多控存储系统包括多个节点以及多个磁盘,并且,所有磁盘被每个节点可见。为了使各个节点分担各磁盘的检测任务,可以将各磁盘均衡分配给相应节点,以使各个节点均衡承担各磁盘的检测任务。假设共有2个节点、10个磁盘,那么可使每个节点承担5个磁盘的检测任务。其中,目标节点为多控存储系统中的任一节点。It should be noted that the multi-control storage system includes multiple nodes and multiple disks, and all disks are visible to each node. In order to make each node share the detection task of each disk, each disk may be distributed to corresponding nodes in a balanced manner, so that each node undertakes the detection task of each disk in a balanced manner. Assuming that there are 2 nodes and 10 disks, each node can undertake the detection task of 5 disks. Wherein, the target node is any node in the multi-controller storage system.
在一种实施方式中,基于预设负载均衡策略确定用于检测目标磁盘的目标节点,包括:从预设负载均衡策略中获取多控存储系统中的各节点与其需要检测的磁盘之间的对应关系;按照对应关系确定用于检测目标磁盘的目标节点。其中,对应关系的构建过程包括:将多控存储系统中的所有磁盘均衡分配至多控存储系统中的各节点,得到分配结果;按照分配结果建立对应关系。In one embodiment, determining the target node for detecting the target disk based on the preset load balancing strategy includes: obtaining the correspondence between each node in the multi-control storage system and the disk to be detected from the preset load balancing strategy relationship; determine the target node for detecting the target disk according to the corresponding relationship. Wherein, the process of establishing the corresponding relationship includes: all the disks in the multi-control storage system are evenly allocated to each node in the multi-control storage system to obtain the allocation result; and the corresponding relationship is established according to the allocation result.
S102、利用目标节点检测目标磁盘中的仲裁区域对于IO的响应。S102. Utilize the target node to detect the response of the arbitration area in the target disk to the IO.
需要说明的是,每个磁盘中都有一部分空间用作仲裁区域,其他空间用来存储业务数据。若某一磁盘被选为系统仲裁盘,那么该磁盘中的仲裁区域用来存储仲裁数据。若某一磁盘没有被选为系统仲裁盘,那么该磁盘中的仲裁区域不存储任何数据。仲裁区域可位于每个磁盘的存储空间尾部,例如:在每个磁盘的末尾预留512M大小作为仲裁区域。仲裁区域的大小可根据实际情况进行调整。It should be noted that part of the space in each disk is used as the arbitration area, and the other space is used to store business data. If a certain disk is selected as the system quorum disk, the quorum area in the disk is used to store quorum data. If a certain disk is not selected as the system quorum disk, the quorum area of the disk does not store any data. The arbitration area can be located at the end of the storage space of each disk, for example: 512M is reserved at the end of each disk as the arbitration area. The size of the arbitration area can be adjusted according to the actual situation.
在本实施例中,检测仲裁区域对于IO的响应的目的是:检测仲裁区域的读写功能是否有问题。为此,利用目标节点检测目标磁盘中的仲裁区域对于IO的响应,包括:利用目标节点下发读请求和/或写请求至仲裁区域;获取仲裁区域对于读请求和/或写请求的处理结果;将仲裁区域对于读请求和/或写请求的处理结果作为仲裁区域对于IO的响应。可见,仲裁区域对于IO的响应包括:读IO的响应和/或写IO的响应。若读IO的响应或写IO的响应有问题,则认为仲裁区域对于IO的响应出错,故确定响应为IO报错消息。In this embodiment, the purpose of detecting the response of the arbitration area to the IO is to detect whether there is a problem with the read-write function of the arbitration area. To this end, the target node is used to detect the response of the arbitration area in the target disk to the IO, including: using the target node to send a read request and/or write request to the arbitration area; obtaining the processing result of the arbitration area for the read request and/or write request ; Use the processing result of the arbitration area for the read request and/or write request as the arbitration area's response to the IO. It can be seen that the response of the arbitration area to the IO includes: the response of the read IO and/or the response of the write IO. If there is a problem with the response of the read IO or the response of the write IO, it is considered that the arbitration area has an error in the response to the IO, so it is determined that the response is an IO error message.
由于仲裁区域是一个整块的存储空间,而该整块存储空间中可能有个别地址故障,为了检测出该整块存储空间中的个别故障地址,可以分区域检测仲裁区域。在一种实施方式中,利用目标节点检测目标磁盘中的仲裁区域对于IO的响应,包括:利用目标节点读取仲裁区域中的任一段连续地址中存储的数据;若读取数据失败,则确定仲裁区域对于IO的响应为IO报错消息。若从当前连续地址读不出数据,则表明当前连续地址故障,因此确定仲裁区域对于IO的响应出错。Since the arbitration area is a whole block of storage space, and there may be individual address faults in the entire block of storage space, in order to detect individual faulty addresses in the entire block of storage space, the arbitration area can be detected in subregions. In one embodiment, using the target node to detect the response of the arbitration area in the target disk to the IO includes: using the target node to read the data stored in any continuous address in the arbitration area; if the read data fails, determine The response of the arbitration area to the IO is an IO error message. If no data can be read from the current continuous address, it indicates that the current continuous address is faulty, so it is determined that the arbitration area responds incorrectly to the IO.
当然,由于数据读取出错存在偶然性,因此在某一连续地址读不出数据时,可以尝试进行修复操作;在修复失败后,才认为仲裁区域对于IO的响应出错;若修改成功,则对仲裁区域中的其他地址继续检测。在一种实施方式中,确定仲裁区域对于IO的响应为IO报错消息之前,还包括:对当前连续地址执行修复操作,若当前连续地址修复失败,则执行确定仲裁区域对于IO的响应为IO报错消息的步骤。其中,对当前连续地址执行修复操作,包括:将随机生成数据写入当前连续地址;从当前连续地址读取随机生成数据;若读取到的随机生成数据与写入的随机生成数据一致,则确定当前连续地址修复成功;否则,确定当前连续地址修复失败。Of course, because data reading errors are accidental, you can try to perform a repair operation when a certain continuous address cannot read data; after the repair fails, it is considered that the response of the arbitration area to IO is wrong; if the modification is successful, the arbitration area Other addresses in the zone continue to be detected. In one embodiment, before determining that the response of the arbitration area to the IO is an IO error message, it also includes: performing a repair operation on the current continuous address, and if the repair of the current continuous address fails, then performing a determination that the response of the arbitration area to the IO is an IO error report message steps. Wherein, the repair operation is performed on the current continuous address, including: writing the randomly generated data into the current continuous address; reading the randomly generated data from the current continuous address; if the read randomly generated data is consistent with the written randomly generated data, then It is determined that the repair of the current continuous address is successful; otherwise, it is determined that the repair of the current continuous address fails.
在一种实施方式中,若读取数据成功或当前连续地址修复成功,则在当前连续地址的尾地址不是仲裁区域的结束地址时,利用目标节点读取与当前连续地址相邻的下一段连续地址中存储的数据,并判断是否成功读取数据,并根据数据读取成功与否的结果进行进一步处理。In one embodiment, if the read data is successful or the current continuous address is successfully repaired, when the tail address of the current continuous address is not the end address of the arbitration area, the target node is used to read the next continuous address adjacent to the current continuous address. The data stored in the address, and judge whether the data is successfully read, and further process according to the result of the success of the data read.
当然,存储系统中的磁盘也可能因故障等原因离线,因此若读取数据时获取到目标磁盘的离线消息或当前连续地址的尾地址是仲裁区域的结束地址,则等待下一检测时间点,以便到达下一检测时间点时,遍历到下一磁盘。可见,本实施例能够对系统中的所有磁盘进行定时遍历,例如:检测完当前磁盘后,经过3秒后,再检测下一磁盘。Of course, the disk in the storage system may also be offline due to failure or other reasons. Therefore, if the offline message of the target disk is obtained when reading data or the tail address of the current continuous address is the end address of the arbitration area, wait for the next detection time point. In order to traverse to the next disk when the next detection time point is reached. It can be seen that this embodiment can periodically traverse all the disks in the system, for example: after the current disk is detected, the next disk will be detected after 3 seconds.
由于系统仲裁盘中记录的仲裁信息非常重要,因此当确认系统仲裁盘的仲裁区域有问题后,需要及时告知系统进行系统仲裁盘的更换,因此在一种实施方式中,若响应为IO报错消息且目标磁盘是系统仲裁盘,则生成系统仲裁盘的故障提示消息。Since the arbitration information recorded in the system arbitration disk is very important, after confirming that there is a problem with the arbitration area of the system arbitration disk, the system needs to be notified in time to replace the system arbitration disk. Therefore, in an implementation mode, if the response is an IO error message And if the target disk is the system arbitration disk, a fault prompt message of the system arbitration disk is generated.
在一种实施方式中,生成系统仲裁盘的故障提示消息之后,还包括:收集多控存储系统中各磁盘的标记信息;基于各磁盘的标记信息重新选择系统仲裁盘。其中,基于各磁盘的标记信息重新选择系统仲裁盘,包括:将没有标记信息的磁盘作为候选盘,并基于仲裁盘选择策略在各候选盘中选择系统仲裁盘,以避免新选择的系统仲裁盘的仲裁区域有问题。其中,仲裁盘选择策略可按照现有相关技术制定。In an implementation manner, after generating the failure prompt message of the system arbitration disk, the method further includes: collecting label information of each disk in the multi-controller storage system; and reselecting the system arbitration disk based on the label information of each disk. Wherein, re-selecting the system arbitration disk based on the label information of each disk includes: using disks without label information as candidate disks, and selecting the system arbitration disk among the candidate disks based on the arbitration disk selection strategy, so as to avoid the newly selected system arbitration disk There is a problem with the arbitration area of . Wherein, the arbitration disk selection strategy may be formulated according to existing related technologies.
系统中的磁盘可以为SAS(serial attached scsi,scsi总线协议串行标准)盘或NVMe(Non-Volatile Memory express,闪存类存储协议)盘。当磁盘可以为SAS盘时,目标节点采用scsi总线命令检测磁盘中的仲裁区域对于IO的响应;当磁盘可以为NVMe盘时,目标节点采用NVMe总线命令检测磁盘中的仲裁区域对于IO的响应。The disks in the system can be SAS (serial attached scsi, scsi bus protocol serial standard) disks or NVMe (Non-Volatile Memory express, flash storage protocol) disks. When the disk can be a SAS disk, the target node uses the scsi bus command to detect the response of the arbitration area in the disk to IO; when the disk can be an NVMe disk, the target node uses the NVMe bus command to detect the response of the arbitration area in the disk to the IO.
S103、若响应为IO报错消息且目标磁盘不是系统仲裁盘,则为目标磁盘添加仲裁区域故障的标记信息。S103. If the response is an IO error message and the target disk is not a system quorum disk, add flag information of a quorum area failure to the target disk.
可见,本实施例通过遍历多控存储系统中的各磁盘来检测各磁盘的健康状况,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测目标磁盘的目标节点,由此可使系统中的所有节点均衡承担系统中各盘的检测任务,不至于某一节点承担的检测任务较多,而影响了该节点上的IO业务。在具体检测各个盘时,利用上述步骤确定的目标节点检测目标磁盘中的仲裁区域对于IO的响应;若响应为IO报错消息且目标磁盘不是系统仲裁盘,则为目标磁盘添加仲裁区域故障的标记信息,后续便可以依据此标记信息为系统选择仲裁盘。并且,本申请对系统中的每个磁盘都进行检测,也就是说:系统中的每个磁盘都有概率被选为仲裁盘,选择范围并不局限于系统中的RAID成员盘。因为存储系统中的非RAID成员盘的业务负载压力一般小,更适合被选为仲裁盘。可见,本实施例使系统中的所有节点均衡承担系统中各盘的检测任务,能够平衡盘检测任务和IO业务对存储系统的压力,并且系统中负载较小的非RAID成员盘也有可能被选为仲裁盘,可优化仲裁盘的选择策略。It can be seen that in this embodiment, the health status of each disk is detected by traversing each disk in the multi-control storage system. If the target disk is currently traversed, the target node for detecting the target disk is determined based on the preset load balancing strategy, so that Make all the nodes in the system undertake the detection tasks of each disk in the system in a balanced manner, so that a certain node does not undertake more detection tasks, which affects the IO business on this node. When specifically detecting each disk, use the target node determined by the above steps to detect the response of the arbitration area in the target disk to IO; if the response is an IO error message and the target disk is not a system arbitration disk, add a flag for the arbitration area failure to the target disk Information, and then you can select the quorum disk for the system based on this tag information. Moreover, the present application detects each disk in the system, that is to say, each disk in the system has a probability of being selected as an arbitration disk, and the selection range is not limited to the RAID member disks in the system. Because the service load pressure of the non-RAID member disk in the storage system is generally small, it is more suitable to be selected as the arbitration disk. It can be seen that in this embodiment, all the nodes in the system can balance the detection tasks of the disks in the system, which can balance the disk detection tasks and the pressure of the IO business on the storage system, and the non-RAID member disks with smaller loads in the system may also be selected. As an arbitration disk, the selection strategy of the arbitration disk can be optimized.
下面以SAS盘和scsi总线命令为例进一步介绍本申请。本实施例的方案总体逻辑请参照图2,如图2所示,本实施例设置了磁盘巡检任务和执行巡检任务的定时器,其中,巡检任务的执行线程为scrub_scan fibre,定时器设定:同一节点对于不同磁盘的检测任务间隔3秒;不同节点对于不同磁盘的检测也间隔3秒。In the following, the present application will be further introduced by taking the SAS disk and the scsi bus command as examples. Please refer to Figure 2 for the overall logic of the solution in this embodiment. As shown in Figure 2, this embodiment sets the disk inspection task and the timer for executing the inspection task, wherein the execution thread of the inspection task is scrub_scan fiber, and the timer Setting: The interval between detection tasks of the same node for different disks is 3 seconds; the interval of detection tasks for different disks of different nodes is also 3 seconds.
具体的,按照定时器的设定,按照磁盘号遍历存储系统中的所有受管磁盘,每3s取下一块磁盘,启动该盘的仲裁区域的检查,巡检执行线程为scrub_scan fibre。如果磁盘号存在、且巡检时间到、且用于检测当前盘的节点上没有正在执行的巡检任务,则对当前盘的仲裁区域进行IO下发,并且所下发IO进入队列,等待系统调度。如果磁盘号不存在、或巡检时间到但所确定的节点不用于检测当前盘,那么针对下一磁盘进行检测。Specifically, according to the setting of the timer, traverse all the managed disks in the storage system according to the disk number, remove a disk every 3s, and start the inspection of the arbitration area of the disk, and the inspection execution thread is scrub_scan fiber. If the disk number exists, the inspection time is up, and there is no inspection task being executed on the node used to detect the current disk, then the IO will be issued to the arbitration area of the current disk, and the issued IO will enter the queue and wait for the system scheduling. If the disk number does not exist, or the polling time is up but the determined node is not used to detect the current disk, then the next disk is detected.
其中,为了平衡系统中所有节点的IO负载压力,设计如下策略,以为各个盘选择检测节点。假设双控系统中(集群中有两个节点),所有受管磁盘对双控节点都可见,节点包括D0和D1,所有受管磁盘的盘号driveD从D0开始依次递增。节点D0执行drive D为偶数的磁盘,节点D1执行drive D为奇数的磁盘,如此各个磁盘的检测任务便被D0和D1这两个节点均衡分担,以避免节点在检测各个盘时,影响节点上的IO业务。当然,当集群中只有一个节点工作时,所有盘的巡检任务都在这个节点执行。因此对于每个节点,可以判断当前节点是否应该执行当前盘的本次巡检任务,若是,则进入下一步,否则,本次执行结束,等待3s后对下一块盘操作。Among them, in order to balance the IO load pressure of all nodes in the system, the following strategy is designed to select detection nodes for each disk. Assume that in a dual-control system (there are two nodes in the cluster), all managed disks are visible to the dual-control nodes, including D0 and D1, and the disk numbers driveD of all managed disks increase sequentially from D0. Node D0 executes the disk whose drive D is an even number, and node D1 executes the disk whose drive D is an odd number. In this way, the detection tasks of each disk are shared by the two nodes D0 and D1 in a balanced manner, so as to avoid affecting the detection of each disk when the node detects each disk. IO business. Of course, when only one node in the cluster is working, all disk inspection tasks are performed on this node. Therefore, for each node, it can be judged whether the current node should execute the inspection task of the current disk, if so, enter the next step, otherwise, this execution ends, wait for 3s and then operate on the next disk.
由于本实施例设定磁盘为SAS盘,故使用scsi verify指令可检测被下发IO的目标地址是否可读,每次下发IO所读取的数据长度为512byte,盘的仲裁区域为512M。具体的,可以设定盘的仲裁区域的last->lba(尾地址)、本次IO读取的数据长度length、下一次IO读取的地址next_lba。队列中已下发的IO被节点中的线程scrub_submit fibre处理,以确定本次IO是否能成功读取数据。其中,scrub_submit fibre处理IO的过程请参见图3,如图3所示,线程scrub_submit fibre从队列取出一个verify IO,将verify IO提交给指定盘,盘上处理IO并返回结果(即verify IO结果),该结果再次异步调度回scrub_scan fibre,scrub_scan fibre确认verify IO结果。当返回结果为success时,说明磁盘当前被下发IO的目标地址状态良好,那么根据next_lba和last->lba的大小关系判断盘仲裁区域的巡检是否执行完毕。若当前读完成地址等于last->lba,则认为盘仲裁区域巡检结束;若当前读完成地址不等于last->lba,则重新计算next->lba=当前读完成地址+length,并再次执行IO下发流程。当返回结果是offline时,说明磁盘离线,不再需要巡检,那么该盘的巡检任务立即停止。当返回结果为其它错误时,说明当前被下发IO的目标地址疑似坏块,若该盘是存储系统的仲裁盘,则立即通知集群更换仲裁盘,并结束该盘的巡检任务;若不是系统的仲裁盘,则提交write verify指令,以尝试修复。Since the disk is set as a SAS disk in this embodiment, the scsi verify command can be used to detect whether the target address of the issued IO is readable. The length of the data read by each issued IO is 512 bytes, and the arbitration area of the disk is 512M. Specifically, it is possible to set the last->lba (tail address) of the arbitration area of the disk, the length of data read by this IO, and the address next_lba of the next IO read. The issued IO in the queue is processed by the thread scrub_submit fiber in the node to determine whether the IO can successfully read the data. Among them, please refer to Figure 3 for the process of scrub_submit fiber processing IO. As shown in Figure 3, the thread scrub_submit fiber takes a verify IO from the queue, submits the verify IO to the specified disk, processes the IO on the disk and returns the result (that is, the verify IO result) , the result is dispatched back to the scrub_scan fiber asynchronously again, and the scrub_scan fiber confirms the verify IO result. When the returned result is success, it means that the target address of the disk to which the IO is currently delivered is in good condition, then judge whether the inspection of the disk arbitration area is completed according to the size relationship between next_lba and last->lba. If the current read completion address is equal to last->lba, it is considered that the inspection of the disk arbitration area is over; if the current read completion address is not equal to last->lba, recalculate next->lba=current read completion address+length, and execute again IO distribution process. When the returned result is offline, it means that the disk is offline and inspection is no longer needed, and the inspection task of this disk will stop immediately. When the returned result is other errors, it means that the target address of the currently issued IO is suspected of being a bad block. If the disk is the arbitration disk of the storage system, immediately notify the cluster to replace the arbitration disk and end the inspection task of the disk; if not system's arbitration disk, submit the write verify command to try to restore it.
其中,使用write verify指令对疑似坏块写入二进制数据,并对比读出和写入的数据是否一致,以尝试修复。修复流程请参见图4,如图4所示,当得到write verify指令的处理结果后,根据此结果确定是否修复成功。当write verify指令的处理结果为success时,坏块修复成功,继续提交下一块地址的verify IO请求。当write verify指令的处理结果是offline时,说明磁盘离线,不再需要巡检,巡检任务立即停止。当write verify指令的处理结果为其它错误时,坏块修复失败,标记该盘为WRITE VERIFY ERROR(写证明错误),同时通知集群,以便存储系统的仲裁盘选择机制将此结果作为选择权重。Among them, use the write verify command to write binary data to the suspected bad block, and compare whether the read and written data are consistent, so as to try to repair. Please refer to Figure 4 for the repair process. As shown in Figure 4, after the processing result of the write verify command is obtained, it is determined whether the repair is successful based on the result. When the processing result of the write verify command is success, the bad block has been repaired successfully, and the verify IO request for the next block address will continue to be submitted. When the processing result of the write verify command is offline, it means that the disk is offline and inspection is no longer required, and the inspection task stops immediately. When the processing result of the write verify command is other errors, the bad block repair fails, the disk is marked as WRITE VERIFY ERROR (write proof error), and the cluster is notified at the same time, so that the arbitration disk selection mechanism of the storage system can use this result as the selection weight.
可见,本实施例依据scsi verify指令和write verify指令,实现了存储系统中所有受管磁盘的仲裁区域的坏块检测,可以及时发现盘仲裁区域的坏块,保证仲裁盘数据的可靠性,能够为存储系统的仲裁选择机制提供可靠参考因素。其中,本实施例针对每个盘的仲裁区域,以地址块为单位滑动下发verify指令检测是否坏块,并通过下发write verify指令尝试坏块修复,可在检测到某一坏块后立即停止当前盘的检测,每次IO的数据体量小且可避免检测盘的整个仲裁区域,检测效率较高。It can be seen that, according to the scsi verify command and the write verify command, this embodiment realizes the bad block detection in the arbitration area of all managed disks in the storage system, can find the bad block in the disk arbitration area in time, ensures the reliability of the arbitration disk data, and can Provide reliable reference factors for the arbitration selection mechanism of the storage system. Among them, this embodiment is aimed at the arbitration area of each disk, and the verify command is issued in units of address blocks to detect whether there is a bad block, and the bad block is tried to be repaired by issuing the write verify command, and a certain bad block can be detected immediately. The detection of the current disk is stopped, the data volume of each IO is small and the detection of the entire arbitration area of the disk can be avoided, and the detection efficiency is high.
下面对本申请实施例提供的一种盘仲裁区域检测装置进行介绍,下文描述的一种盘仲裁区域检测装置与上文描述的一种盘仲裁区域检测方法可以相互参照。The following is an introduction to a device for detecting an arbitration area of a disk provided by an embodiment of the present application. The device for detecting an arbitration area of a disk described below may be referred to with the method for detecting an arbitration area of a disk described above.
参见图5所示,本申请实施例公开了一种盘仲裁区域检测装置,包括:Referring to Fig. 5, the embodiment of the present application discloses a disk arbitration area detection device, including:
遍历模块501,用于遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测目标磁盘的目标节点;其中,目标节点为多控存储系统中的任一节点;The
检测模块502,用于利用目标节点检测目标磁盘中的仲裁区域对于IO的响应;A
标记模块503,用于若响应为IO报错消息且目标磁盘不是系统仲裁盘,则为目标磁盘添加仲裁区域故障的标记信息。The marking
在一种实施方式中,遍历模块具体用于:In one embodiment, the traversal module is specifically used for:
从预设负载均衡策略中获取多控存储系统中的各节点与其需要检测的磁盘之间的对应关系;按照对应关系确定用于检测目标磁盘的目标节点。The corresponding relationship between each node in the multi-control storage system and the disk to be detected is obtained from the preset load balancing strategy; and the target node for detecting the target disk is determined according to the corresponding relationship.
在一种实施方式中,对应关系的构建过程包括:In one embodiment, the process of constructing the corresponding relationship includes:
将多控存储系统中的所有磁盘均衡分配至多控存储系统中的各节点,得到分配结果;按照分配结果建立对应关系。All the disks in the multi-controller storage system are evenly allocated to each node in the multi-controller storage system to obtain the allocation result; and a corresponding relationship is established according to the allocation result.
在一种实施方式中,检测模块包括:In one embodiment, the detection module includes:
读IO检测单元,用于利用目标节点读取仲裁区域中的任一段连续地址中存储的数据;若读取数据失败,则确定仲裁区域对于IO的响应为IO报错消息。The read IO detection unit is used to use the target node to read the data stored in any continuous address in the arbitration area; if the read data fails, it is determined that the response of the arbitration area to the IO is an IO error message.
在一种实施方式中,检测模块还包括:In one embodiment, the detection module also includes:
修复单元,用于在确定仲裁区域对于IO的响应为IO报错消息之前,对当前连续地址执行修复操作,若当前连续地址修复失败,则执行确定仲裁区域对于IO的响应为IO报错消息的步骤。The repair unit is used to perform a repair operation on the current continuous address before determining that the response of the arbitration area to the IO is an IO error message, and if the repair of the current continuous address fails, perform the step of determining that the response of the arbitration area to the IO is an IO error message.
在一种实施方式中,修复单元具体用于:In one embodiment, the repair unit is specifically used for:
将随机生成数据写入当前连续地址;从当前连续地址读取随机生成数据;若读取到的随机生成数据与写入的随机生成数据一致,则确定当前连续地址修复成功;否则,确定当前连续地址修复失败。Write the randomly generated data into the current continuous address; read the randomly generated data from the current continuous address; if the read randomly generated data is consistent with the written randomly generated data, it is determined that the current continuous address is repaired successfully; otherwise, it is determined that the current continuous address Address repair failed.
在一种实施方式中,检测模块还包括:In one embodiment, the detection module also includes:
循环单元,用于若读取数据成功或当前连续地址修复成功,则在当前连续地址的尾地址不是仲裁区域的结束地址时,利用目标节点读取与当前连续地址相邻的下一段连续地址中存储的数据,并判断是否成功读取数据。The loop unit is used to read data in the next continuous address adjacent to the current continuous address by using the target node when the end address of the current continuous address is not the end address of the arbitration area if the data is successfully read or the current continuous address is repaired successfully. Stored data, and judge whether the data is read successfully.
在一种实施方式中,还包括:In one embodiment, it also includes:
定时模块,用于若读取数据时获取到目标磁盘的离线消息或当前连续地址的尾地址是仲裁区域的结束地址,则等待下一检测时间点,以便到达下一检测时间点时,遍历到下一磁盘。The timing module is used to wait for the next detection time point if the offline message of the target disk is obtained when reading data or the tail address of the current continuous address is the end address of the arbitration area, so that when the next detection time point is reached, traverse to next disk.
在一种实施方式中,还包括:In one embodiment, it also includes:
提示模块,用于若响应为IO报错消息且目标磁盘是系统仲裁盘,则生成系统仲裁盘的故障提示消息。The prompt module is configured to generate a fault prompt message of the system arbitration disk if the response is an IO error message and the target disk is the system arbitration disk.
在一种实施方式中,还包括:In one embodiment, it also includes:
仲裁盘更换模块,用于在生成系统仲裁盘的故障提示消息之后,收集多控存储系统中各磁盘的标记信息;基于各磁盘的标记信息重新选择系统仲裁盘。The arbitration disk replacement module is used to collect the label information of each disk in the multi-controller storage system after generating the failure prompt message of the system arbitration disk; and reselect the system arbitration disk based on the label information of each disk.
在一种实施方式中,仲裁盘更换模块具体用于:In an implementation manner, the arbitration disk replacement module is specifically used for:
将没有标记信息的磁盘作为候选盘,并基于仲裁盘选择策略在各候选盘中选择系统仲裁盘。The disks without tag information are used as candidate disks, and the system arbitration disk is selected among the candidate disks based on the arbitration disk selection policy.
在一种实施方式中,检测模块具体用于:In one embodiment, the detection module is specifically used for:
利用目标节点下发读请求和/或写请求至仲裁区域;Use the target node to issue read requests and/or write requests to the arbitration area;
获取仲裁区域对于读请求和/或写请求的处理结果;Obtain the processing result of the arbitration area for the read request and/or write request;
将仲裁区域对于读请求和/或写请求的处理结果作为仲裁区域对于IO的响应。The processing result of the arbitration area for the read request and/or the write request is used as the response of the arbitration area for the IO.
其中,关于本实施例中各个模块、单元更加具体的工作过程可以参考前述实施例中公开的相应内容,在此不再进行赘述。For the more specific working process of each module and unit in this embodiment, reference may be made to the corresponding content disclosed in the foregoing embodiments, and details are not repeated here.
可见,本实施例提供了一种盘仲裁区域检测装置,能够平衡盘检测任务和IO业务对存储系统的压力,优化仲裁盘的选择策略。It can be seen that this embodiment provides a disk arbitration area detection device, which can balance the disk detection task and the pressure of the IO service on the storage system, and optimize the selection strategy of the arbitration disk.
下面对本申请实施例提供的一种电子设备进行介绍,下文描述的一种电子设备与上文描述的一种盘仲裁区域检测方法及装置可以相互参照。An electronic device provided by an embodiment of the present application is introduced below, and the electronic device described below and the method and device for detecting an arbitration area of a disk described above may refer to each other.
参见图6所示,本申请实施例公开了一种电子设备,包括:Referring to Figure 6, the embodiment of the present application discloses an electronic device, including:
存储器601,用于保存计算机程序;
处理器602,用于执行所述计算机程序,以实现上述任意实施例公开的方法。The
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测目标磁盘的目标节点;其中,目标节点为多控存储系统中的任一节点;利用目标节点检测目标磁盘中的仲裁区域对于IO的响应;若响应为IO报错消息且目标磁盘不是系统仲裁盘,则为目标磁盘添加仲裁区域故障的标记信息。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: traverse each disk in the multi-control storage system, and if the target disk is currently traversed, determine the The target node of the target disk; wherein, the target node is any node in the multi-control storage system; utilize the target node to detect the response of the arbitration area in the target disk to IO; if the response is an IO error message and the target disk is not a system arbitration disk, Then add the flag information of the quorum area failure to the target disk.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:从预设负载均衡策略中获取多控存储系统中的各节点与其需要检测的磁盘之间的对应关系;按照对应关系确定用于检测目标磁盘的目标节点。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: Obtain the corresponding relationship between each node in the multi-control storage system and the disk to be detected from the preset load balancing strategy; The correspondence determines the target node for detecting the target disk.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:将多控存储系统中的所有磁盘均衡分配至多控存储系统中的各节点,得到分配结果;按照分配结果建立对应关系。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: all the disks in the multi-control storage system are evenly distributed to each node in the multi-control storage system, and the distribution result is obtained; Correspondence.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:利用目标节点读取仲裁区域中的任一段连续地址中存储的数据;若读取数据失败,则确定仲裁区域对于IO的响应为IO报错消息。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: use the target node to read the data stored in any continuous address in the arbitration area; if the read data fails, determine the arbitration area The response to IO is an IO error message.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:对当前连续地址执行修复操作,若当前连续地址修复失败,则执行确定仲裁区域对于IO的响应为IO报错消息的步骤。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: perform a repair operation on the current continuous address, and if the repair of the current continuous address fails, then execute and determine that the response of the arbitration area to the IO is an IO error message A step of.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:将随机生成数据写入当前连续地址;从当前连续地址读取随机生成数据;若读取到的随机生成数据与写入的随机生成数据一致,则确定当前连续地址修复成功;否则,确定当前连续地址修复失败。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: writing the randomly generated data into the current continuous address; reading the randomly generated data from the current continuous address; if the read randomly generated data If it is consistent with the written randomly generated data, it is determined that the repair of the current continuous address is successful; otherwise, it is determined that the repair of the current continuous address fails.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:若读取数据成功或当前连续地址修复成功,则在当前连续地址的尾地址不是仲裁区域的结束地址时,利用目标节点读取与当前连续地址相邻的下一段连续地址中存储的数据,并判断是否成功读取数据。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: if the data is successfully read or the current continuous address is successfully repaired, when the tail address of the current continuous address is not the end address of the arbitration area, The target node is used to read the data stored in the next continuous address adjacent to the current continuous address, and judge whether the data is successfully read.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:若读取数据时获取到目标磁盘的离线消息或当前连续地址的尾地址是仲裁区域的结束地址,则等待下一检测时间点,以便到达下一检测时间点时,遍历到下一磁盘。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: if the offline message of the target disk is obtained when reading data or the tail address of the current continuous address is the end address of the arbitration area, then wait The next detection time point, so as to traverse to the next disk when the next detection time point is reached.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:若响应为IO报错消息且目标磁盘是系统仲裁盘,则生成系统仲裁盘的故障提示消息。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: if the response is an IO error message and the target disk is the system arbitration disk, then generate a system arbitration disk failure prompt message.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:收集多控存储系统中各磁盘的标记信息;基于各磁盘的标记信息重新选择系统仲裁盘。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps may be implemented: collecting label information of each disk in the multi-control storage system; and reselecting the system arbitration disk based on the label information of each disk.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:将没有标记信息的磁盘作为候选盘,并基于仲裁盘选择策略在各候选盘中选择系统仲裁盘。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: using disks without tag information as candidate disks, and selecting the system arbitration disk among the candidate disks based on the arbitration disk selection strategy.
在本实施例中,存储器保存的计算机程序被处理器执行时,可以实现以下步骤:利用目标节点下发读请求和/或写请求至仲裁区域;获取仲裁区域对于读请求和/或写请求的处理结果;将仲裁区域对于读请求和/或写请求的处理结果作为仲裁区域对于IO的响应。In this embodiment, when the computer program stored in the memory is executed by the processor, the following steps can be implemented: use the target node to send a read request and/or write request to the arbitration area; obtain the arbitration area's response to the read request and/or write request Processing result: the processing result of the arbitration area for the read request and/or write request is used as the arbitration area's response to the IO.
进一步的,本申请实施例还提供了一种服务器来作为上述电子设备。该服务器,具体可以包括:至少一个处理器、至少一个存储器、电源、通信接口、输入输出接口和通信总线。其中,所述存储器用于存储计算机程序,所述计算机程序由所述处理器加载并执行,以实现前述任一实施例公开的盘仲裁区域检测方法中的相关步骤。Further, the embodiment of the present application also provides a server as the above-mentioned electronic device. The server may specifically include: at least one processor, at least one memory, a power supply, a communication interface, an input and output interface, and a communication bus. Wherein, the memory is used to store a computer program, and the computer program is loaded and executed by the processor, so as to implement relevant steps in the disk arbitration area detection method disclosed in any of the foregoing embodiments.
本实施例中,电源用于为服务器上的各硬件设备提供工作电压;通信接口能够为服务器创建与外界设备之间的数据传输通道,其所遵循的通信协议是能够适用于本申请技术方案的任意通信协议,在此不对其进行具体限定;输入输出接口,用于获取外界输入数据或向外界输出数据,其具体的接口类型可以根据具体应用需要进行选取,在此不进行具体限定。In this embodiment, the power supply is used to provide working voltage for each hardware device on the server; the communication interface can create a data transmission channel between the server and external devices, and the communication protocol it follows is applicable to the technical solution of this application Any communication protocol is not specifically limited here; the input/output interface is used to obtain external input data or output data to the external, and its specific interface type can be selected according to specific application needs, and is not specifically limited here.
另外,存储器作为资源存储的载体,可以是只读存储器、随机存储器、磁盘或者光盘等,其上所存储的资源包括操作系统、计算机程序及数据等,存储方式可以是短暂存储或者永久存储。In addition, memory, as a resource storage carrier, can be read-only memory, random access memory, magnetic disk or optical disk, etc. The resources stored on it include operating system, computer program and data, etc., and the storage method can be temporary storage or permanent storage.
其中,操作系统用于管理与控制服务器上的各硬件设备以及计算机程序,以实现处理器对存储器中数据的运算与处理,其可以是Windows Server、Netware、Unix、Linux等。计算机程序除了包括能够用于完成前述任一实施例公开的盘仲裁区域检测方法的计算机程序之外,还可以进一步包括能够用于完成其他特定工作的计算机程序。数据除了可以包括虚拟机等数据外,还可以包括虚拟机的开发商信息等数据。Wherein, the operating system is used to manage and control various hardware devices and computer programs on the server, so as to realize the calculation and processing of the data in the memory by the processor, which may be Windows Server, Netware, Unix, Linux, etc. In addition to the computer program that can be used to complete the method for detecting the disk arbitration area disclosed in any of the foregoing embodiments, the computer program may further include a computer program that can be used to complete other specific tasks. In addition to data such as the virtual machine, the data may also include data such as developer information of the virtual machine.
进一步的,本申请实施例还提供了一种终端来作为上述电子设备。该终端具体可以包括但不限于智能手机、平板电脑、笔记本电脑或台式电脑等。Further, the embodiment of the present application also provides a terminal as the above-mentioned electronic device. The terminal specifically may include, but is not limited to, a smart phone, a tablet computer, a notebook computer or a desktop computer, and the like.
通常,本实施例中的终端包括有:处理器和存储器。Generally, the terminal in this embodiment includes: a processor and a memory.
其中,处理器可以包括一个或多个处理核心,比如4核心处理器、8核心处理器等。处理器可以采用DSP(Digital Signal Processing,数字信号处理)、FPGA(Field-Programmable Gate Array,现场可编程门阵列)、PLA(Programmable Logic Array,可编程逻辑阵列)中的至少一种硬件形式来实现。处理器也可以包括主处理器和协处理器,主处理器是用于对在唤醒状态下的数据进行处理的处理器,也称CPU(Central Processing Unit,中央处理器);协处理器是用于对在待机状态下的数据进行处理的低功耗处理器。在一些实施例中,处理器可以在集成有GPU(Graphics Processing Unit,图像处理器),GPU用于负责显示屏所需要显示的内容的渲染和绘制。一些实施例中,处理器还可以包括AI(ArtificialIntelligence,人工智能)处理器,该AI处理器用于处理有关机器学习的计算操作。Wherein, the processor may include one or more processing cores, such as a 4-core processor, an 8-core processor, and the like. The processor can be realized by at least one hardware form of DSP (Digital Signal Processing, digital signal processing), FPGA (Field-Programmable Gate Array, field programmable gate array), PLA (Programmable Logic Array, programmable logic array) . The processor can also include a main processor and a coprocessor, the main processor is a processor for processing data in the wake-up state, also called a CPU (Central Processing Unit, central processing unit); Low-power processor for processing data in standby state. In some embodiments, the processor may be integrated with a GPU (Graphics Processing Unit, image processor), and the GPU is used for rendering and drawing content that needs to be displayed on the display screen. In some embodiments, the processor may further include an AI (Artificial Intelligence, artificial intelligence) processor, where the AI processor is configured to process computing operations related to machine learning.
存储器可以包括一个或多个计算机可读存储介质,该计算机可读存储介质可以是非暂态的。存储器还可包括高速随机存取存储器,以及非易失性存储器,比如一个或多个磁盘存储设备、闪存存储设备。本实施例中,存储器至少用于存储以下计算机程序,其中,该计算机程序被处理器加载并执行之后,能够实现前述任一实施例公开的由终端侧执行的盘仲裁区域检测方法中的相关步骤。另外,存储器所存储的资源还可以包括操作系统和数据等,存储方式可以是短暂存储或者永久存储。其中,操作系统可以包括Windows、Unix、Linux等。数据可以包括但不限于应用程序的更新信息。The memory may include one or more computer-readable storage media, which may be non-transitory. Memory may also include high-speed random access memory, and non-volatile memory, such as one or more magnetic disk storage devices, flash memory storage devices. In this embodiment, the memory is at least used to store the following computer program, wherein, after the computer program is loaded and executed by the processor, it can implement the relevant steps in the disk arbitration area detection method performed by the terminal side disclosed in any of the foregoing embodiments . In addition, the resources stored in the memory may also include an operating system and data, etc., and the storage method may be temporary storage or permanent storage. Wherein, the operating system may include Windows, Unix, Linux and so on. Data may include, but is not limited to, application update information.
在一些实施例中,终端还可包括有显示屏、输入输出接口、通信接口、传感器、电源以及通信总线。In some embodiments, the terminal may further include a display screen, an input/output interface, a communication interface, a sensor, a power supply, and a communication bus.
下面对本申请实施例提供的一种可读存储介质进行介绍,下文描述的一种可读存储介质与上文描述的一种盘仲裁区域检测方法、装置及设备可以相互参照。A readable storage medium provided by an embodiment of the present application is introduced below. The readable storage medium described below and the method, device, and device for detecting an arbitration area of a disk described above may refer to each other.
一种可读存储介质,用于保存计算机程序,其中,所述计算机程序被处理器执行时实现前述实施例公开的盘仲裁区域检测方法。A readable storage medium is used for saving a computer program, wherein when the computer program is executed by a processor, the disk arbitration area detection method disclosed in the foregoing embodiments is implemented.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:遍历多控存储系统中的各磁盘,若当前遍历到目标磁盘,则基于预设负载均衡策略确定用于检测目标磁盘的目标节点;其中,目标节点为多控存储系统中的任一节点;利用目标节点检测目标磁盘中的仲裁区域对于IO的响应;若响应为IO报错消息且目标磁盘不是系统仲裁盘,则为目标磁盘添加仲裁区域故障的标记信息。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: traversing each disk in the multi-control storage system, if the target disk is currently traversed, then determine based on the preset load balancing strategy The target node used to detect the target disk; where the target node is any node in the multi-control storage system; use the target node to detect the response of the arbitration area in the target disk to IO; if the response is an IO error message and the target disk is not a system If the quorum disk is used, add the flag information of the quorum area failure to the target disk.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:从预设负载均衡策略中获取多控存储系统中的各节点与其需要检测的磁盘之间的对应关系;按照对应关系确定用于检测目标磁盘的目标节点。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: Obtain the correspondence between each node in the multi-control storage system and the disk to be detected from the preset load balancing strategy relationship; determine the target node for detecting the target disk according to the corresponding relationship.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:将多控存储系统中的所有磁盘均衡分配至多控存储系统中的各节点,得到分配结果;按照分配结果建立对应关系。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: all the disks in the multi-control storage system are evenly allocated to each node in the multi-control storage system, and the allocation results are obtained; according to The allocation results establish a corresponding relationship.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:利用目标节点读取仲裁区域中的任一段连续地址中存储的数据;若读取数据失败,则确定仲裁区域对于IO的响应为IO报错消息。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: use the target node to read the data stored in any continuous address in the arbitration area; if the reading of the data fails, then It is determined that the response of the arbitration area to the IO is an IO error message.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:对当前连续地址执行修复操作,若当前连续地址修复失败,则执行确定仲裁区域对于IO的响应为IO报错消息的步骤。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: perform a repair operation on the current continuous address, and if the repair of the current continuous address fails, then execute and determine that the response of the arbitration area to the IO is Steps for IO error messages.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:将随机生成数据写入当前连续地址;从当前连续地址读取随机生成数据;若读取到的随机生成数据与写入的随机生成数据一致,则确定当前连续地址修复成功;否则,确定当前连续地址修复失败。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: writing the randomly generated data into the current continuous address; reading the randomly generated data from the current continuous address; if the read If the randomly generated data is consistent with the written randomly generated data, it is determined that the repair of the current continuous address is successful; otherwise, it is determined that the repair of the current continuous address fails.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:若读取数据成功或当前连续地址修复成功,则在当前连续地址的尾地址不是仲裁区域的结束地址时,利用目标节点读取与当前连续地址相邻的下一段连续地址中存储的数据,并判断是否成功读取数据。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: if the data is successfully read or the current continuous address is successfully repaired, the end address of the current continuous address is not the end of the arbitration area address, use the target node to read the data stored in the next continuous address adjacent to the current continuous address, and judge whether the data is successfully read.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:若读取数据时获取到目标磁盘的离线消息或当前连续地址的尾地址是仲裁区域的结束地址,则等待下一检测时间点,以便到达下一检测时间点时,遍历到下一磁盘。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: if the offline message of the target disk is obtained when reading data or the tail address of the current continuous address is the end address of the arbitration area , then wait for the next detection time point, so as to traverse to the next disk when the next detection time point is reached.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:若响应为IO报错消息且目标磁盘是系统仲裁盘,则生成系统仲裁盘的故障提示消息。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: if the response is an IO error message and the target disk is the system arbitration disk, generate a system arbitration disk failure prompt message.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:收集多控存储系统中各磁盘的标记信息;基于各磁盘的标记信息重新选择系统仲裁盘。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: collecting label information of each disk in the multi-control storage system; reselecting the system arbitration disk based on the label information of each disk.
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:将没有标记信息的磁盘作为候选盘,并基于仲裁盘选择策略在各候选盘中选择系统仲裁盘。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: use a disk without tag information as a candidate disk, and select the system arbitration disk among the candidate disks based on the arbitration disk selection strategy .
在本实施例中,可读存储介质保存的计算机程序被处理器执行时,可以实现以下步骤:利用目标节点下发读请求和/或写请求至仲裁区域;获取仲裁区域对于读请求和/或写请求的处理结果;将仲裁区域对于读请求和/或写请求的处理结果作为仲裁区域对于IO的响应。In this embodiment, when the computer program stored in the readable storage medium is executed by the processor, the following steps can be implemented: use the target node to send a read request and/or write request to the arbitration area; obtain the arbitration area for the read request and/or The processing result of the write request; the processing result of the arbitration area for the read request and/or the write request is used as the response of the arbitration area to the IO.
本申请涉及的“第一”、“第二”、“第三”、“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的实施例能够以除了在这里图示或描述的内容以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法或设备固有的其它步骤或单元。"First", "second", "third", "fourth" and the like referred to in the present application, if any, are used to distinguish similar objects and not necessarily to describe a specific order or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion, e.g. a process, method or apparatus comprising a series of steps or elements is not necessarily limited to those steps or elements explicitly listed , but may include other steps or elements not explicitly listed or inherent to the process, method or apparatus.
需要说明的是,在本申请中涉及“第一”、“第二”等的描述仅用于描述目的,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本申请要求的保护范围之内。It should be noted that the descriptions in this application involving "first", "second" and so on are for descriptive purposes only, and should not be understood as indicating or implying their relative importance or implicitly indicating the number of indicated technical features . Thus, the features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In addition, the technical solutions of the various embodiments can be combined with each other, but it must be based on the realization of those skilled in the art. When the combination of technical solutions is contradictory or cannot be realized, it should be considered that the combination of technical solutions does not exist , nor within the scope of protection required by the present application.
本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其它实施例的不同之处,各个实施例之间相同或相似部分互相参见即可。Each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same or similar parts of each embodiment can be referred to each other.
结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理器执行的软件模块,或者二者的结合来实施。软件模块可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的可读存储介质中。The steps of the methods or algorithms described in connection with the embodiments disclosed herein may be directly implemented by hardware, software modules executed by a processor, or a combination of both. Software modules can be placed in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other Any other known readable storage medium.
本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。In this paper, specific examples are used to illustrate the principles and implementation methods of the application. The descriptions of the above embodiments are only used to help understand the method and core idea of the application; meanwhile, for those of ordinary skill in the art, according to the application There will be changes in the specific implementation and scope of application. In summary, the content of this specification should not be construed as limiting the application.
Claims (10)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211269936.5A CN115359834B (en) | 2022-10-18 | 2022-10-18 | A disk arbitration area detection method, device, equipment and readable storage medium |
PCT/CN2023/115982 WO2024082834A1 (en) | 2022-10-18 | 2023-08-30 | Disk arbitration area detection method and apparatus, device, and nonvolatile readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211269936.5A CN115359834B (en) | 2022-10-18 | 2022-10-18 | A disk arbitration area detection method, device, equipment and readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115359834A CN115359834A (en) | 2022-11-18 |
CN115359834B true CN115359834B (en) | 2023-03-24 |
Family
ID=84008797
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211269936.5A Active CN115359834B (en) | 2022-10-18 | 2022-10-18 | A disk arbitration area detection method, device, equipment and readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115359834B (en) |
WO (1) | WO2024082834A1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115359834B (en) * | 2022-10-18 | 2023-03-24 | 苏州浪潮智能科技有限公司 | A disk arbitration area detection method, device, equipment and readable storage medium |
CN118427130B (en) * | 2024-07-01 | 2024-09-24 | 杭州华澜微电子股份有限公司 | SAS expander, arbitration method and device thereof, and SAS transmission subsystem |
CN118838546B (en) * | 2024-07-08 | 2025-04-29 | 无锡众星微系统技术有限公司 | Performance balanced scheduling method, device, equipment and storage medium for HBA IOC |
CN118567576B (en) * | 2024-07-31 | 2024-10-29 | 浪潮电子信息产业股份有限公司 | Multi-control memory system and data storage method, device, medium and product thereof |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4505763B2 (en) * | 2007-01-31 | 2010-07-21 | ヒューレット−パッカード デベロップメント カンパニー エル.ピー. | Managing node clusters |
US8443231B2 (en) * | 2010-04-12 | 2013-05-14 | Symantec Corporation | Updating a list of quorum disks |
US10782898B2 (en) * | 2016-02-03 | 2020-09-22 | Surcloud Corp. | Data storage system, load rebalancing method thereof and access control method thereof |
CN105872031B (en) * | 2016-03-26 | 2019-06-14 | 天津书生云科技有限公司 | Storage System |
US9766965B2 (en) * | 2015-11-25 | 2017-09-19 | Salesforce.Com, Inc. | System and method for monitoring and detecting faulty storage devices |
CN107273231A (en) * | 2016-04-07 | 2017-10-20 | 阿里巴巴集团控股有限公司 | Distributed memory system hard disk tangles fault detect, processing method and processing device |
CN106980468A (en) * | 2017-03-03 | 2017-07-25 | 杭州宏杉科技股份有限公司 | Method and device for triggering RAID array reconstruction |
CN111813604B (en) * | 2020-07-17 | 2022-06-10 | 济南浪潮数据技术有限公司 | Data recovery method, system and related device of fault storage equipment |
CN114064374A (en) * | 2021-11-12 | 2022-02-18 | 中国建设银行股份有限公司 | Fault detection method and system based on distributed block storage |
CN115359834B (en) * | 2022-10-18 | 2023-03-24 | 苏州浪潮智能科技有限公司 | A disk arbitration area detection method, device, equipment and readable storage medium |
-
2022
- 2022-10-18 CN CN202211269936.5A patent/CN115359834B/en active Active
-
2023
- 2023-08-30 WO PCT/CN2023/115982 patent/WO2024082834A1/en unknown
Also Published As
Publication number | Publication date |
---|---|
WO2024082834A1 (en) | 2024-04-25 |
CN115359834A (en) | 2022-11-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115359834B (en) | A disk arbitration area detection method, device, equipment and readable storage medium | |
US11163472B2 (en) | Method and system for managing storage system | |
JP2005301497A (en) | Storage management device, restore method and program thereof | |
US10445295B1 (en) | Task-based framework for synchronization of event handling between nodes in an active/active data storage system | |
US10852982B2 (en) | Method, system, and computer program product for managing storage system | |
CN108536548A (en) | A kind of processing method of Bad Track, device and computer storage media | |
US11385823B2 (en) | Method, electronic device and computer program product for rebuilding disk array | |
WO2019210844A1 (en) | Anomaly detection method and apparatus for storage device, and distributed storage system | |
CN103019880A (en) | Data verification method, storage device and storage system | |
CN116185311B (en) | Upgrade migration method, downgrade migration method and device of redundant array of independent disks | |
JP6540334B2 (en) | SYSTEM, INFORMATION PROCESSING DEVICE, AND INFORMATION PROCESSING METHOD | |
JP6069962B2 (en) | Information processing apparatus, area release control program, and area release control method | |
CN114816270A (en) | Storage system global caching method, device and medium | |
CN115562593A (en) | Abnormal RAID member disk processing method, device and medium | |
JP2013239117A (en) | Computer, data storage method, data storage program, and information processing system | |
CN116974489A (en) | Data processing method, device and system, electronic equipment and storage medium | |
CN117149550A (en) | Solid state disk performance detection method and device and electronic equipment | |
CN116244127A (en) | Hard disk detection method, device, equipment and storage medium | |
CN115202589A (en) | Placement group member selection method, apparatus, device and readable storage medium | |
CN107422990A (en) | A kind of dynamic migration method and device | |
CN107329856A (en) | A kind of SDS dispositions methods and system based on RAID | |
US12332744B2 (en) | Method and system for media error recovery | |
JP7694171B2 (en) | Conformity assessment system, conformity assessment program, and conformity assessment method | |
CN110008064A (en) | A kind of RAID restoration methods, device, equipment and medium | |
CN113918092B (en) | Method and system for distributing storage space |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |