CN105635300A - Method and device for realizing high availability of NAS sharing service - Google Patents
Method and device for realizing high availability of NAS sharing service Download PDFInfo
- Publication number
- CN105635300A CN105635300A CN201610022076.3A CN201610022076A CN105635300A CN 105635300 A CN105635300 A CN 105635300A CN 201610022076 A CN201610022076 A CN 201610022076A CN 105635300 A CN105635300 A CN 105635300A
- Authority
- CN
- China
- Prior art keywords
- controller
- nas
- shared service
- delaying
- configuration information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 239000011159 matrix material Substances 0.000 claims description 24
- 230000001360 synchronised effect Effects 0.000 claims description 13
- 230000000903 blocking effect Effects 0.000 claims description 4
- 238000013475 authorization Methods 0.000 claims description 3
- 230000001934 delay Effects 0.000 claims 2
- 230000008569 process Effects 0.000 description 21
- 238000010586 diagram Methods 0.000 description 6
- 230000007246 mechanism Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1097—Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1001—Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
- H04L67/1034—Reaction to server failures by a load balancer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1095—Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域technical field
本发明涉及存储技术领域,特别是涉及一种实现NAS共享业务高可用的方法及装置。The invention relates to the field of storage technology, in particular to a method and device for realizing high availability of NAS shared services.
背景技术Background technique
目前,在多控存储系统中,当其中一个控制器因软硬件故障造成宕机时,要保证故障控制器上NAS共享业务能继续为用户提供服务,就必须将原控制器所管理的存储设备和NAS共享业务切换到其他控制器上。在常规多控存储系统控制器切换过程中,用户正在运行的NAS共享业务必须中断,当等待控制器切换完成后,需要手动重新发起原来的NAS共享业务,在宕机过程中没有保存到硬盘的共享数据也会有相应的丢失,如此不能保证NAS共享业务的有效切换,NAS共享业务切换的有效性和高效性很低。At present, in a multi-controller storage system, when one of the controllers is down due to software and hardware failures, to ensure that the NAS sharing service on the faulty controller can continue to provide services for users, the storage device managed by the original controller must be Shared services with the NAS are switched to other controllers. During the controller switching process of a conventional multi-controller storage system, the running NAS sharing service of the user must be interrupted. After waiting for the controller switching to complete, the original NAS sharing service needs to be manually re-initiated. Shared data will also be lost accordingly, so effective switching of NAS sharing services cannot be guaranteed, and the effectiveness and efficiency of NAS sharing service switching are very low.
发明内容Contents of the invention
本发明的目的是提供一种实现NAS共享业务高可用的方法及装置,以实现提高NAS共享业务切换的有效性和高效性。The purpose of the present invention is to provide a method and device for realizing high availability of NAS shared services, so as to improve the effectiveness and efficiency of switching of NAS shared services.
为解决上述技术问题,本发明提供一种实现NAS共享业务高可用的方法,该方法包括:In order to solve the above technical problems, the present invention provides a method for realizing high availability of NAS shared services, the method comprising:
在多控存储系统中的任意一个控制器建立NAS共享业务时,将NAS共享业务的配置信息的副本保存在多控存储系统的每一个控制器中作为备份;When any controller in the multi-control storage system establishes a NAS sharing service, save a copy of the configuration information of the NAS sharing service in each controller of the multi-control storage system as a backup;
当一个控制器因故障造成宕机时,依据各控制器的负载状况进行分析判断,从除了宕机控制器之外的所有控制器中选择出一个控制器作为接管控制器;When a controller is down due to failure, analyze and judge according to the load status of each controller, and select a controller from all controllers except the down controller as the takeover controller;
利用所述接管控制器中存储的宕机控制器建立的NAS共享业务的配置信息的副本,使所述接管控制器依据所述副本建立起相应的NAS共享业务。Using the copy of the configuration information of the NAS shared service created by the downtime controller stored in the takeover controller, the takeover controller is made to establish the corresponding NAS shared service according to the copy.
优选的,所述利用所述接管控制器中存储的宕机控制器的NAS共享业务的配置信息的副本之前,还包括:Preferably, before using the copy of the NAS shared service configuration information of the down controller stored in the takeover controller, the method further includes:
检查全局磁盘矩阵管理权限表,将宕机控制器负责的存储空间的接管权限授权给所述接管控制器,由所述接管控制器接管宕机控制器负责的存储空间。Check the global disk matrix management authority table, authorize the takeover authority of the storage space in charge of the downtime controller to the takeover controller, and the takeover controller takes over the storage space in charge of the downtime controller.
优选的,所述方法还包括:Preferably, the method also includes:
当所述宕机控制器经过修复重新恢复正常后,在多控存储系统的所有的控制器中同步NAS共享业务配置信息;When the downtime controller returns to normal after being repaired, synchronize the NAS sharing service configuration information among all controllers of the multi-control storage system;
阻塞接管控制器上正在进行的NAS共享业务,归还属于宕机控制器的存储池和磁盘矩阵空间;Block the ongoing NAS sharing business on the takeover controller, and return the storage pool and disk matrix space belonging to the down controller;
依据宕机控制器中同步的NAS共享业务配置信息,在宕机控制器上建立NAS共享业务,继续执行被阻塞的NAS共享业务。According to the NAS sharing service configuration information synchronized in the downtime controller, the NAS sharing service is established on the downtime controller, and the blocked NAS sharing service is continued to be executed.
优选的,所述在多控存储系统的所有的控制器中同步NAS共享业务配置信息之后,还包括:Preferably, after synchronizing the NAS shared service configuration information among all the controllers of the multi-control storage system, it further includes:
在多控存储系统的所有的控制器中同步全局磁盘矩阵管理权限信息和全局用户管理信息。Synchronize global disk matrix management authority information and global user management information among all controllers in the multi-controller storage system.
优选的,所述继续执行被阻塞的NAS共享业务之后,还包括:修改全局磁盘阵列管理权限信息,取消接管控制器的接管权限。Preferably, after continuing to execute the blocked NAS sharing service, the method further includes: modifying the global disk array management authority information, and canceling the takeover authority of the takeover controller.
本发明还提供一种实现NAS共享业务高可用的装置,该装置包括:The present invention also provides a device for realizing high availability of NAS shared services, the device comprising:
副本保存模块,用于在多控存储系统中的任意一个控制器建立NAS共享业务时,将NAS共享业务的配置信息的副本保存在多控存储系统的每一个控制器中作为备份;The copy saving module is used to save a copy of the configuration information of the NAS sharing service in each controller of the multi-control storage system as a backup when any controller in the multi-control storage system establishes a NAS shared service;
选择模块,用于当一个控制器因故障造成宕机时,依据各控制器的负载状况进行分析判断,从除了宕机控制器之外的所有控制器中选择出一个控制器作为接管控制器;The selection module is used to analyze and judge according to the load status of each controller when a controller is down due to a fault, and select a controller from all controllers except the downtime controller as the takeover controller;
业务建立模块,用于利用所述接管控制器中存储的宕机控制器建立的NAS共享业务的配置信息的副本,使所述接管控制器依据所述副本建立起相应的NAS共享业务。The service establishment module is configured to use the copy of configuration information of the NAS shared service created by the downtime controller stored in the takeover controller, so that the takeover controller establishes the corresponding NAS shared service according to the copy.
优选的,所述装置还包括:Preferably, the device also includes:
授权模块,用于检查全局磁盘矩阵管理权限表,将宕机控制器负责的存储空间的接管权限授权给所述接管控制器,由所述接管控制器接管宕机控制器负责的存储空间。The authorization module is used to check the global disk matrix management authority table, authorize the takeover authority of the storage space in charge of the downtime controller to the takeover controller, and the takeover controller takes over the storage space in charge of the downtime controller.
优选的,所述装置还包括:Preferably, the device also includes:
第一同步模块,用于当所述宕机控制器经过修复重新恢复正常后,在多控存储系统的所有的控制器中同步NAS共享业务配置信息;The first synchronization module is used to synchronize NAS sharing service configuration information among all controllers of the multi-control storage system after the downtime controller is repaired and restored to normal;
阻塞模块,用于阻塞接管控制器上正在进行的NAS共享业务,归还属于宕机控制器的存储池和磁盘矩阵空间;The blocking module is used to block the ongoing NAS sharing business on the takeover controller, and return the storage pool and disk matrix space belonging to the down controller;
执行模块,用于依据宕机控制器中同步的NAS共享业务配置信息,在宕机控制器上建立NAS共享业务,继续执行被阻塞的NAS共享业务。The execution module is configured to establish a NAS sharing service on the downtime controller according to the NAS sharing service configuration information synchronized in the downtime controller, and continue to execute the blocked NAS sharing service.
优选的,所述装置还包括:Preferably, the device also includes:
第二同步模块,用于在多控存储系统的所有的控制器中同步全局磁盘矩阵管理权限信息和全局用户管理信息。The second synchronization module is used for synchronizing the global disk matrix management authority information and the global user management information among all the controllers of the multi-controller storage system.
优选的,所述装置还包括:Preferably, the device also includes:
取消模块,用于修改全局磁盘阵列管理权限信息,取消接管控制器的接管权限。The cancellation module is used to modify the management authority information of the global disk array, and cancel the takeover authority of the takeover controller.
本发明所提供的一种实现NAS共享业务高可用的方法及装置,在多控存储系统中的任意一个控制器建立NAS共享业务时,将NAS共享业务的配置信息的副本保存在多控存储系统的每一个控制器中作为备份;当一个控制器因故障造成宕机时,依据各控制器的负载状况进行分析判断,从除了宕机控制器之外的所有控制器中选择出一个控制器作为接管控制器;利用所述接管控制器中存储的宕机控制器建立的NAS共享业务的配置信息的副本,使所述接管控制器依据所述副本建立起相应的NAS共享业务。可见,接管宕机控制器的NAS共享业务的过程中,利用NAS共享业务的的配置信息的副本来建立中断的NAS共享业务,优化了NAS共享服务接管流程,减少接管的耗时,保证用户NAS共享业务不会因控制器切换而中断,并且保证切换在整个控制器切换过程中不会造成共享数据的任何丢失,因此提高NAS共享业务切换的有效性和高效性。A method and device for realizing high availability of NAS shared services provided by the present invention, when any controller in a multi-control storage system establishes a NAS shared service, saves a copy of the configuration information of the NAS shared service in the multi-controlled storage system Each of the controllers is used as a backup; when a controller is down due to a fault, analyze and judge according to the load status of each controller, and select a controller from all the controllers except the down controller as the backup. A takeover controller: using a copy of the configuration information of the NAS sharing service created by the downtime controller stored in the takeover controller, so that the takeover controller establishes a corresponding NAS sharing service according to the copy. It can be seen that in the process of taking over the NAS sharing service of the down controller, the copy of the configuration information of the NAS sharing service is used to establish the interrupted NAS sharing service, which optimizes the takeover process of the NAS sharing service, reduces the time-consuming takeover, and ensures that the user NAS Shared services will not be interrupted due to controller switching, and it is guaranteed that the switching will not cause any loss of shared data during the entire controller switching process, thereby improving the effectiveness and efficiency of NAS shared service switching.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only It is an embodiment of the present invention, and those skilled in the art can also obtain other drawings according to the provided drawings without creative work.
图1为本发明所提供的一种实现NAS共享业务高可用的方法的流程图;FIG. 1 is a flow chart of a method for realizing high availability of NAS sharing services provided by the present invention;
图2为每个控制器均正常工作时NAS共享业务的示意图;FIG. 2 is a schematic diagram of NAS shared services when each controller is working normally;
图3为当一个控制器出现宕机时NAS共享业务的示意图;FIG. 3 is a schematic diagram of NAS sharing services when a controller is down;
图4为本发明所提供的一种实现NAS共享业务高可用的装置的结构示意图。FIG. 4 is a schematic structural diagram of an apparatus for realizing high availability of NAS shared services provided by the present invention.
具体实施方式detailed description
本发明的核心是提供一种实现NAS共享业务高可用的方法及装置,以实现提高NAS共享业务切换的有效性和高效性。The core of the present invention is to provide a method and device for realizing high availability of NAS shared services, so as to improve the effectiveness and efficiency of switching of NAS shared services.
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to enable those skilled in the art to better understand the solutions of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
请参考图1,图1为本发明所提供的一种实现NAS共享业务高可用的方法的流程图,该方法包括:Please refer to FIG. 1. FIG. 1 is a flow chart of a method for realizing high availability of NAS sharing services provided by the present invention. The method includes:
S11:在多控存储系统中的任意一个控制器建立NAS共享业务时,将NAS共享业务的配置信息的副本保存在多控存储系统的每一个控制器中作为备份;S11: When any controller in the multi-control storage system establishes a NAS sharing service, save a copy of the configuration information of the NAS sharing service in each controller of the multi-control storage system as a backup;
其中,NAS共享业务的配置信息即为NAS客户端的配置文件。多控存储系统的每一个控制器中都存储有任意一个控制器建立的NAS共享业务的配置信息的副本。Wherein, the configuration information of the NAS sharing service is the configuration file of the NAS client. Each controller of the multi-controller storage system stores a copy of the configuration information of the NAS sharing service established by any controller.
S12:当一个控制器因故障造成宕机时,依据各控制器的负载状况进行分析判断,从除了宕机控制器之外的所有控制器中选择出一个控制器作为接管控制器;S12: When a controller is down due to a fault, analyze and judge according to the load status of each controller, and select a controller from all controllers except the down controller as the takeover controller;
其中,宕机控制器为宕机的控制器。Wherein, the downtime controller is a downtime controller.
S13:利用接管控制器中存储的宕机控制器建立的NAS共享业务的配置信息的副本,使接管控制器依据副本建立起相应的NAS共享业务。S13: Utilize the copy of the configuration information of the NAS sharing service created by the downtime controller stored in the takeover controller, so that the takeover controller establishes a corresponding NAS sharing service according to the copy.
可见,该方法在接管宕机控制器的NAS共享业务的过程中,利用NAS共享业务的的配置信息的副本来建立中断的NAS共享业务,优化了NAS共享服务接管流程,减少接管的耗时,保证用户NAS共享业务不会因控制器切换而中断,并且保证切换在整个控制器切换过程中不会造成共享数据的任何丢失,因此提高NAS共享业务切换的有效性和高效性。It can be seen that in the process of taking over the NAS sharing service of the downtime controller, the method uses a copy of the configuration information of the NAS sharing service to establish the interrupted NAS sharing service, optimizes the NAS sharing service takeover process, and reduces the time-consuming takeover. It ensures that user NAS sharing services will not be interrupted due to controller switching, and ensures that switching will not cause any loss of shared data during the entire controller switching process, thereby improving the effectiveness and efficiency of NAS sharing service switching.
需要说明的是,本发明中的所描述的NAS共享业务主要指SMB共享业务和NFS共享业务。It should be noted that the NAS sharing service described in the present invention mainly refers to the SMB sharing service and the NFS sharing service.
其中,利用接管控制器中存储的宕机控制器的NAS共享业务的配置信息的副本之前,检查全局磁盘矩阵管理权限表,将宕机控制器负责的存储空间的接管权限授权给接管控制器,由接管控制器接管宕机控制器负责的存储空间。Wherein, before using the copy of configuration information of the NAS shared service of the downtime controller stored in the takeover controller, check the global disk matrix management authority table, and authorize the takeover authority of the storage space that the downtime controller is responsible for to the takeover controller, The takeover controller takes over the storage space that the down controller is responsible for.
优选的,所述方法还包括以下步骤:Preferably, the method further comprises the steps of:
S21:当宕机控制器经过修复重新恢复正常后,在多控存储系统的所有的控制器中同步NAS共享业务配置信息;S21: When the downtime controller returns to normal after being repaired, synchronize the NAS shared service configuration information among all controllers of the multi-controller storage system;
S22:阻塞接管控制器上正在进行的NAS共享业务,归还属于宕机控制器的存储池和磁盘矩阵空间;S22: block the ongoing NAS sharing business on the takeover controller, and return the storage pool and disk matrix space belonging to the down controller;
S23:依据宕机控制器中同步的NAS共享业务配置信息,在宕机控制器上建立NAS共享业务,继续执行被阻塞的NAS共享业务。S23: According to the NAS sharing service configuration information synchronized in the downtime controller, establish a NAS sharing service on the downtime controller, and continue to execute the blocked NAS sharing service.
其中,在多控存储系统的所有的控制器中同步NAS共享业务配置信息之后,在多控存储系统的所有的控制器中同步全局磁盘矩阵管理权限信息和全局用户管理信息。Wherein, after the NAS shared service configuration information is synchronized among all the controllers of the multi-control storage system, the global disk matrix management authority information and the global user management information are synchronized among all the controllers of the multi-control storage system.
其中,继续执行被阻塞的NAS共享业务之后,修改全局磁盘阵列管理权限信息,取消接管控制器的接管权限。Wherein, after continuing to execute the blocked NAS sharing service, modify the management authority information of the global disk array, and cancel the takeover authority of the takeover controller.
基于上述优选,可见,该方法通过优化NAS共享服务接管和回切的流程减少接管和回切过程的耗时,以及对特定NAS共享协议特性的实现,保证用户NAS共享业务不会因控制器切换而中断。并且通过建立全局磁盘矩阵权限表和全局用户配置表,以及全局配置信息和各控制器NAS共享业务的备份同步机制,保证NAS共享业务的配置一致性。通过特定数据保护模块的实现和特定读写模式,以及控制器切换过程中的特定操作保证存储数据的一致性。另外,NAS共享业务的接管和回切过程对NAS共享业务使用者透明,提高了用户体验。Based on the above optimization, it can be seen that this method reduces the time-consuming process of takeover and switchback by optimizing the process of NAS shared service takeover and switchback, and realizes the characteristics of a specific NAS sharing protocol to ensure that users' NAS shared services will not be switched due to controllers. And interrupt. And by establishing the global disk matrix permission table and global user configuration table, as well as the global configuration information and the backup synchronization mechanism of NAS shared services of each controller, the configuration consistency of NAS shared services is guaranteed. The consistency of stored data is guaranteed through the implementation of specific data protection modules, specific read-write modes, and specific operations during controller switching. In addition, the takeover and switchback process of the NAS sharing service is transparent to the users of the NAS sharing service, which improves user experience.
总之,该方法保证NAS共享业务的有效切换,保证用户NAS共享业务在控制器切换过程中不中断,用户NAS业务只会经历短暂的延迟便恢复正常,并且保证切换在整个控制器切换过程中不会造成共享数据的任何丢失。In short, this method ensures the effective switching of NAS shared services, ensures that user NAS shared services are not interrupted during the controller switching process, user NAS services will only return to normal after a short delay, and ensures that switching is not interrupted during the entire controller switching process. Any loss of shared data will result.
基于本发明所提供的实现NAS共享业务高可用的方法,具体的,首先进行NAS共享业务建立过程。用户在建立NAS共享业务时,选择同步模式进行建立,在这种模式下当共享数据写入到存储池后才会返回到用户的调用接口,这样虽然会有一定量的写入性能损耗,但是数据的安全性由存储池提供保护。Based on the method for realizing high availability of NAS shared services provided by the present invention, specifically, firstly, a process of establishing NAS shared services is performed. When the user establishes the NAS sharing service, he chooses the synchronous mode to establish it. In this mode, the shared data will be returned to the user's call interface only after it is written into the storage pool. Although there will be a certain amount of write performance loss, the data Security is provided by storage pools.
其中,多控系统维护一个各控制器对共用矩阵管理权限的全局配置信息,该配置信息在各控制器中保存多个副本。在正常情况下各控制器只负责为部分特定存储空间提供NAS共享服务,当处于控制器接管状态时,接管控制器负责宕机控制器存储空间上的NAS共享服务。Wherein, the multi-control system maintains a global configuration information of each controller's authority to manage the shared matrix, and multiple copies of the configuration information are saved in each controller. Under normal circumstances, each controller is only responsible for providing NAS sharing services for some specific storage spaces. When the controller is in the takeover state, the takeover controller is responsible for NAS sharing services on the storage space of the down controller.
新建立的NAS共享业务的配置信息在本地建立后,会根据算法规则写入到多个控制器的副本中,防止当前控制器宕机后配置信息丢失。同时相关后台程序会定时检查各副本间是否一致,如果不一致则进行同步。After the configuration information of the newly established NAS sharing service is established locally, it will be written to the copies of multiple controllers according to the algorithm rules to prevent the loss of configuration information after the current controller goes down. At the same time, the relevant background program will regularly check whether the copies are consistent, and if they are inconsistent, they will be synchronized.
多控存储系统采用全局用户配置的方式,在每个控制器上对用户账户和权限的操作,都会同时被写入全局用户配置信息里,该配置信息会在多个控制器上有多个副本中。同时会有系统后台程序负责定时检查各副本是否一致,如果不一致则根据算法规则进行同步。The multi-control storage system adopts the global user configuration method, and the operations on user accounts and permissions on each controller will be written into the global user configuration information at the same time, and the configuration information will have multiple copies on multiple controllers middle. At the same time, there will be a system background program responsible for regularly checking whether the copies are consistent, and if they are inconsistent, they will be synchronized according to the algorithm rules.
然后进行控制器的接管过程,当一个控制器出现严重的软硬件错误宕机后,系统会阻塞正在进行的NAS共享业务,即SMB共享和NFS共享的业务,进入如下接管流程:Then carry out the controller takeover process. When a controller has a serious software and hardware error and goes down, the system will block the ongoing NAS sharing business, that is, the SMB sharing and NFS sharing business, and enter the following takeover process:
(1)多控系统会根据各控制器负载等判断条件,选择出一个最合适的接管控制器。并负责检查全局磁盘矩阵管理权限表,将宕机控制器负责的存储空间赋予接管控制器接管权限;(1) The multi-control system will select the most suitable takeover controller according to the judgment conditions such as the load of each controller. It is also responsible for checking the global disk matrix management authority table, and granting the takeover authority to the takeover controller for the storage space that the downtime controller is responsible for;
(2)接管控制器接到命令后,参照全局磁盘矩阵管理权限表,将宕机控制器负责存储空间的存储池接管过来;(2) After the takeover controller receives the command, refer to the global disk matrix management authority table, and take over the storage pool that the downtime controller is responsible for the storage space;
(3)然后根据备份的宕机控制器的NAS共享业务的配置信息即NAS共享业务的配置文件,从接管的存储池中挂载相应文件系统,然后建立起相应的NAS共享业务;(3) Then according to the configuration information of the NAS shared service of the downtime controller of the backup, that is, the configuration file of the NAS shared service, mount the corresponding file system from the storage pool taken over, and then set up the corresponding NAS shared service;
(4)放开阻塞的网络,恢复原NAS共享业务。(4) Unblock the blocked network and restore the original NAS sharing service.
其中,多控存储系统通过对SMB3.0协议和NFSv3协议特性的支持,在控制器的接管过程中,阻塞的NAS共享业务不会中断,接管完成后继续进行数据的传输。Among them, the multi-control storage system supports the SMB3.0 protocol and NFSv3 protocol features. During the takeover process of the controller, the blocked NAS sharing service will not be interrupted, and the data transmission will continue after the takeover is completed.
最后进行控制器的回切过程。当宕机控制器经过故障修复重新恢复正常后,同步全局磁盘矩阵管理权限信息、全局用户管理信息和NAS共享业务配置信息。多控系统通知接管控制器进行回切操作,具体过程为:接管控制器阻塞正在进行的NAS共享业务;卸载相关文件系统,并归还原控制器上的存储池和磁盘矩阵存储空间;在原控制器上,接管磁盘矩阵空间和存储池,并挂载起相应文件系统;根据同步的最新NAS共享业务的配置信息即NAS共享业务的配置文件,建立起NAS共享服务,继续阻塞的NAS共享业务;最后多控存储系统修改全局磁盘矩阵管理权限信息,取消接管控制器的接管权限。Finally, the switchback process of the controller is performed. When the down controller returns to normal after fault repair, the global disk matrix management authority information, global user management information and NAS shared service configuration information will be synchronized. The multi-control system notifies the takeover controller to perform the switchback operation. The specific process is: the takeover controller blocks the ongoing NAS sharing service; First, take over the disk matrix space and storage pool, and mount the corresponding file system; according to the configuration information of the latest NAS sharing service that is synchronized, that is, the configuration file of the NAS sharing service, establish the NAS sharing service and continue the blocked NAS sharing service; finally The multi-control storage system modifies the global disk matrix management authority information, and cancels the takeover authority of the takeover controller.
可见,通过配置全局用户管理和磁盘矩阵信息,以及NAS共享业务的配置信息的备份同步机制,为控制器切换时NAS共享业务的接管和回切提供了基础支持。NAS共享业务的接管和回切过程中的操作和特定流程提高了NAS共享业务切换的稳定性和高效性。图2为每个控制器均正常工作时NAS共享业务的示意图;图3为当一个控制器出现宕机时NAS共享业务的示意图。图2中,NAS客户端D与控制器A相连,和控制器A之间有共享链接,控制器A与NAS客户端D之间进行业务共享,控制器A共享业务即控制器A共享的业务,NAS客户端D能够获取控制器A共享的业务,控制器A对磁盘disk1和磁盘disk2进行管理,控制器A能够对磁盘disk1和磁盘disk2中的数据进行共享。图3中,当控制器A出现宕机时,控制器B接管控制器A共享业务,即接管控制器A的共享业务,磁盘disk1和磁盘disk2均由控制器B接管,由控制器B管理磁盘disk3和磁盘disk4。It can be seen that by configuring the global user management and disk matrix information, as well as the backup and synchronization mechanism of the configuration information of the NAS shared service, basic support is provided for the takeover and switchback of the NAS shared service when the controller is switched. The operations and specific procedures during the takeover and switchback of the NAS shared service improve the stability and efficiency of the NAS shared service handover. FIG. 2 is a schematic diagram of a NAS sharing service when each controller is working normally; FIG. 3 is a schematic diagram of a NAS sharing service when a controller fails. In Figure 2, the NAS client D is connected to the controller A, and there is a shared link with the controller A, and the service is shared between the controller A and the NAS client D, and the shared service of the controller A is the service shared by the controller A , the NAS client D can obtain the services shared by the controller A, the controller A manages the disk disk1 and the disk disk2, and the controller A can share the data in the disk disk1 and the disk disk2. In Figure 3, when controller A goes down, controller B takes over the shared business of controller A, that is, takes over the shared business of controller A, and both disk1 and disk2 are taken over by controller B, which manages the disks disk3 and disk disk4.
请参考图4,图4为本发明所提供的一种实现NAS共享业务高可用的装置的结构示意图,该装置包括:Please refer to FIG. 4. FIG. 4 is a schematic structural diagram of a device for achieving high availability of NAS sharing services provided by the present invention. The device includes:
副本保存模块101,用于在多控存储系统中的任意一个控制器建立NAS共享业务时,将NAS共享业务的配置信息的副本保存在多控存储系统的每一个控制器中作为备份;The copy saving module 101 is used to save a copy of the configuration information of the NAS sharing service in each controller of the multi-control storage system as a backup when any controller in the multi-control storage system establishes a NAS shared service;
选择模块102,用于当一个控制器因故障造成宕机时,依据各控制器的负载状况进行分析判断,从除了宕机控制器之外的所有控制器中选择出一个控制器作为接管控制器;The selection module 102 is used to analyze and judge according to the load status of each controller when a controller is down due to a fault, and select a controller as a takeover controller from all controllers except the downtime controller ;
业务建立模块103,用于利用接管控制器中存储的宕机控制器建立的NAS共享业务的配置信息的副本,使接管控制器依据副本建立起相应的NAS共享业务。The service establishment module 103 is configured to use the copy of the configuration information of the NAS shared service created by the downtime controller stored in the takeover controller, so that the takeover controller establishes the corresponding NAS shared service according to the copy.
优选的,所述装置还包括:Preferably, the device also includes:
授权模块,用于检查全局磁盘矩阵管理权限表,将宕机控制器负责的存储空间的接管权限授权给接管控制器,由接管控制器接管宕机控制器负责的存储空间。The authorization module is used to check the global disk matrix management authority table, authorize the takeover authority of the storage space in charge of the downtime controller to the takeover controller, and the takeover controller takes over the storage space in charge of the downtime controller.
所述装置还包括:The device also includes:
第一同步模块,用于当宕机控制器经过修复重新恢复正常后,在多控存储系统的所有的控制器中同步NAS共享业务配置信息;The first synchronization module is used to synchronize NAS sharing service configuration information among all controllers of the multi-control storage system after the downtime controller is repaired and restored to normal;
阻塞模块,用于阻塞接管控制器上正在进行的NAS共享业务,归还属于宕机控制器的存储池和磁盘矩阵空间;The blocking module is used to block the ongoing NAS sharing business on the takeover controller, and return the storage pool and disk matrix space belonging to the down controller;
执行模块,用于依据宕机控制器中同步的NAS共享业务配置信息,在宕机控制器上建立NAS共享业务,继续执行被阻塞的NAS共享业务。The execution module is configured to establish a NAS sharing service on the downtime controller according to the NAS sharing service configuration information synchronized in the downtime controller, and continue to execute the blocked NAS sharing service.
所述装置还包括:The device also includes:
第二同步模块,用于在多控存储系统的所有的控制器中同步全局磁盘矩阵管理权限信息和全局用户管理信息。The second synchronization module is used for synchronizing the global disk matrix management authority information and the global user management information among all the controllers of the multi-controller storage system.
所述装置还包括:The device also includes:
取消模块,用于修改全局磁盘阵列管理权限信息,取消接管控制器的接管权限。The cancellation module is used to modify the management authority information of the global disk array, and cancel the takeover authority of the takeover controller.
本发明所提供的一种实现NAS共享业务高可用的装置,副本保存模块在多控存储系统中的任意一个控制器建立NAS共享业务时,将NAS共享业务的配置信息的副本保存在多控存储系统的每一个控制器中作为备份;选择模块在当一个控制器因故障造成宕机时,依据各控制器的负载状况进行分析判断,从除了宕机控制器之外的所有控制器中选择出一个控制器作为接管控制器;业务建立模块利用接管控制器中存储的宕机控制器建立的NAS共享业务的配置信息的副本,使接管控制器依据副本建立起相应的NAS共享业务。可见,该装置在接管宕机控制器的NAS共享业务的过程中,利用NAS共享业务的的配置信息的副本来建立中断的NAS共享业务,优化了NAS共享服务接管流程,减少接管的耗时,保证用户NAS共享业务不会因控制器切换而中断,并且保证切换在整个控制器切换过程中不会造成共享数据的任何丢失,因此提高NAS共享业务切换的有效性和高效性。In the device for realizing high availability of NAS sharing services provided by the present invention, when any controller in the multi-control storage system establishes NAS sharing services, the copy saving module saves a copy of the configuration information of the NAS sharing services in the multi-control storage Each controller in the system is used as a backup; when a controller is down due to a fault, the selection module analyzes and judges according to the load status of each controller, and selects the controller from all controllers except the downtime controller. One controller acts as a takeover controller; the service establishment module utilizes a copy of configuration information of the NAS shared service created by the downtime controller stored in the takeover controller, so that the takeover controller establishes a corresponding NAS shared service according to the copy. It can be seen that in the process of taking over the NAS sharing service of the down controller, the device uses the copy of the configuration information of the NAS sharing service to establish the interrupted NAS sharing service, optimizes the NAS sharing service takeover process, and reduces the time-consuming takeover. It ensures that user NAS sharing services will not be interrupted due to controller switching, and ensures that switching will not cause any loss of shared data during the entire controller switching process, thereby improving the effectiveness and efficiency of NAS sharing service switching.
以上对本发明所提供的一种实现NAS共享业务高可用的方法及装置进行了详细介绍。本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想。应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以对本发明进行若干改进和修饰,这些改进和修饰也落入本发明权利要求的保护范围内。A method and device for realizing high availability of NAS sharing services provided by the present invention have been introduced in detail above. In this paper, specific examples are used to illustrate the principle and implementation of the present invention, and the descriptions of the above embodiments are only used to help understand the method and core idea of the present invention. It should be pointed out that for those skilled in the art, without departing from the principle of the present invention, some improvements and modifications can be made to the present invention, and these improvements and modifications also fall within the protection scope of the claims of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610022076.3A CN105635300A (en) | 2016-01-13 | 2016-01-13 | Method and device for realizing high availability of NAS sharing service |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610022076.3A CN105635300A (en) | 2016-01-13 | 2016-01-13 | Method and device for realizing high availability of NAS sharing service |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105635300A true CN105635300A (en) | 2016-06-01 |
Family
ID=56049771
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610022076.3A Pending CN105635300A (en) | 2016-01-13 | 2016-01-13 | Method and device for realizing high availability of NAS sharing service |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105635300A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1742337A2 (en) * | 2005-07-07 | 2007-01-10 | Power Integrations, Inc. | Method and apparatus for conditional response to a fault condition in a switching power supply |
CN101739220A (en) * | 2009-02-25 | 2010-06-16 | 浪潮电子信息产业股份有限公司 | A Design Method of Multi-controller Storage Array |
CN102426512A (en) * | 2011-11-09 | 2012-04-25 | 浪潮电子信息产业股份有限公司 | A virtualization-based implementation method of storage dual-controller disk array |
-
2016
- 2016-01-13 CN CN201610022076.3A patent/CN105635300A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1742337A2 (en) * | 2005-07-07 | 2007-01-10 | Power Integrations, Inc. | Method and apparatus for conditional response to a fault condition in a switching power supply |
CN101739220A (en) * | 2009-02-25 | 2010-06-16 | 浪潮电子信息产业股份有限公司 | A Design Method of Multi-controller Storage Array |
CN102426512A (en) * | 2011-11-09 | 2012-04-25 | 浪潮电子信息产业股份有限公司 | A virtualization-based implementation method of storage dual-controller disk array |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10713135B2 (en) | Data disaster recovery method, device and system | |
US10146453B2 (en) | Data migration using multi-storage volume swap | |
US8676760B2 (en) | Maintaining data integrity in data servers across data centers | |
US9280430B2 (en) | Deferred replication of recovery information at site switchover | |
CN104794028B (en) | A kind of disaster tolerance processing method, device, primary data center and preliminary data center | |
US8793456B2 (en) | Automated migration to a new target volume via merged bitmaps to maintain consistency | |
US7085956B2 (en) | System and method for concurrent logical device swapping | |
WO2019085875A1 (en) | Configuration modification method for storage cluster, storage cluster and computer system | |
CN111162941B (en) | Method for automatically managing virtual IP (Internet protocol) in Kubernetes environment | |
CN105224637A (en) | A kind of based on PostgreSQL database active and standby/the comprehensive method of cluster application | |
CN102761566A (en) | Method and device for migrating virtual machine | |
JP2003517651A (en) | Highly available file server | |
CN110674539B (en) | Hard disk protection device, method and system | |
US20150317223A1 (en) | Method and system for handling failures by tracking status of switchover or switchback | |
CN103617269A (en) | Disaster tolerance pipe connecting method and disaster tolerance pipe connecting system | |
CN106168915A (en) | Distributed system architecture data process consistency ensuring method | |
CN104572355A (en) | Asynchronous remote data copying method and device | |
CN108733808B (en) | Big data software system switching method, system, terminal equipment and storage medium | |
CN107357800A (en) | A kind of database High Availabitity zero loses solution method | |
CN107621996A (en) | Cache backup method, fault handling method, system, device and storage medium | |
CN106250048B (en) | Manage the method and device of storage array | |
CN114337944A (en) | A system-level active-standby redundancy general control method | |
CN117827544B (en) | Hot backup system, method, electronic device and storage medium | |
CN105677519A (en) | Resource access method and device | |
CN105635300A (en) | Method and device for realizing high availability of NAS sharing service |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160601 |
|
RJ01 | Rejection of invention patent application after publication |