JP6396849B2

JP6396849B2 - Generator matrix configuration apparatus and generator matrix configuration method

Info

Publication number: JP6396849B2
Application number: JP2015110148A
Authority: JP
Inventors: 由唯吉田; 喜秀外村; 孝之仲地; 白井　大介; 大介白井; 藤井　竜也; 竜也藤井
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2015-05-29
Filing date: 2015-05-29
Publication date: 2018-09-26
Anticipated expiration: 2035-05-29
Also published as: JP2016224679A

Description

本発明は、分散ストレージシステムにおいてデータを符号化する非正則の生成行列を構成するための生成行列構成装置及び生成行列構成方法に関する。 The present invention relates to a generator matrix configuration device and a generator matrix configuration method for configuring an irregular generator matrix for encoding data in a distributed storage system.

近年の映像の高品質化およびアプリケーションの扱うデータ量の増大により、映像サービスの提供に必要なストレージの規模が拡大し、需要に応じてハードウェアの規模を最適化可能なクラウドストレージの利用が進んでいる。ストレージシステムには高い可用性、長期の耐久性が求められるが、ディスク容量の増加によりＲＡＩＤ（ＲｅｄｕｎｄａｎｔＡｒｒａｙｏｆＩｎｄｅｐｅｎｄｅｎｔＤｉｓｋ）による冗長化を行うシステムでは複数ディスク故障時の信頼性が低くなることが知られている（例えば、非特許文献１参照）。そのため、複数故障に対する信頼性を高める冗長化技術が検討されている。 With the recent increase in video quality and the amount of data handled by applications, the scale of storage required to provide video services has expanded, and the use of cloud storage that can optimize the scale of hardware according to demand has advanced. It is out. Storage systems are required to have high availability and long-term durability, but it is known that the reliability of multiple disk failures is reduced in a system that performs redundancy by RAID (Redundant Array of Independent Disk) due to an increase in disk capacity. (For example, refer nonpatent literature 1). Therefore, a redundancy technique for improving the reliability against a plurality of failures has been studied.

現在ＨＡＤＯＯＰ（登録商標）では、冗長性の担保のため３倍の複製を保存する方式が採用されている。しかし、複製により高可用性を実現するにはレプリカ数を増やす必要があり、ストレージの容量効率が下がることから大規模なストレージシステムへの適用は経済的ではない。ディスクの容量増加や計算資源の高性能化を背景に、消失訂正符号を用いたストレージ容量効率の高い冗長化手法が注目されている。消失訂正符号を適用することで複製と同等の耐久性および可用性を格段に低いネットワークコストおよびストレージコストで実現できることが知られており（例えば、非特許文献２参照）、消失訂正符号を利用した分散ストレージサービスが提供されている。 Currently, HADOOP (registered trademark) employs a method of storing three times as many copies for ensuring redundancy. However, in order to achieve high availability by replication, it is necessary to increase the number of replicas, and the capacity efficiency of the storage is lowered, so that it is not economical to apply to a large-scale storage system. With the background of increased disk capacity and higher performance of computing resources, attention is focused on a redundancy method with high storage capacity efficiency using erasure correction codes. It is known that by applying an erasure correction code, durability and availability equivalent to duplication can be realized at a significantly lower network cost and storage cost (for example, see Non-Patent Document 2), and dispersion using an erasure correction code Storage service is provided.

ＣＬＥＶＥＲＳＡＦＥ（登録商標）では、リードソロモン符号（ＲＳ符号）をベースとした符号化技術を用いて分散ストレージの冗長化を行う（例えば、非特許文献３参照）。非組織符号であるＲＳ符号の場合、データをｋ分割し符号化したｎ個の断片を保存し、ｎ個中ｋ個の断片を集めることでデータを復元する。この場合ストレージ容量の利用効率は高くなるが、故障ディスクの復旧時に他のディスクから送るデータの総量が大きくなる、またはガロア体の計算が必要なため復旧に係る計算量が多くなることが課題である。そこで、故障ディスク修復に必要な通信量を最小化可能な再生成符号（ＭＢＲ符号）が提案された（例えば、非特許文献４参照）。 In CLEVERSAFE (registered trademark), distributed storage is made redundant by using an encoding technique based on a Reed-Solomon code (RS code) (for example, see Non-Patent Document 3). In the case of an RS code, which is a non-systematic code, data is restored by collecting k pieces of data obtained by dividing the data into k pieces and collecting k pieces of the n pieces. In this case, the storage capacity utilization efficiency is high, but the problem is that the total amount of data sent from other disks at the time of recovery of the failed disk becomes large, or the calculation amount related to recovery increases because the calculation of Galois field is required. is there. Therefore, a regenerated code (MBR code) that can minimize the amount of communication necessary for repairing the failed disk has been proposed (see, for example, Non-Patent Document 4).

ＭＢＲ符号は再生成符号においてノードを再構成するために送るデータの総量を最小化した符号であり、ストレージ容量の利用効率を最適化したＭＳＲ符号も提案されている。しかし、再生成符号ではノード数が多くなるという課題があり、ディスク修復時にデータを送るヘルパーノードの数を最小化したピラミッド符号が提案された（例えば、非特許文献５参照）。しかし、これらは密行列に基づく符号であるため、符号化に伴う計算量が大きいことが課題となっている。 The MBR code is a code that minimizes the total amount of data that is sent to reconfigure the node in the regenerated code, and an MSR code that optimizes the utilization efficiency of storage capacity has also been proposed. However, there is a problem that the number of nodes increases in the regenerated code, and a pyramid code in which the number of helper nodes that send data at the time of disk restoration is minimized has been proposed (for example, see Non-Patent Document 5). However, since these are codes based on a dense matrix, there is a problem that the amount of calculation accompanying encoding is large.

一方で、ＸＯＲ演算による符号化で計算負荷を抑えたＦｌａｔＸＯＲ符号が提案されている（例えば、非特許文献６参照）。しかし、ＨＤＣｏｍｂｉｎａｔｉｏｎｃｏｄｅｓなどのＦｌａｔＸＯＲ符号では生成行列の構成が限られており、ディスク復旧時の通信量が最適化されていなかった。そこで、非正則構成の生成行列を用いたＦｌａｔＸＯＲ符号により通信量を削減する手法が提案された（例えば、非特許文献７参照）。しかし現在までのところ、通信量を最小化する非正則構成のＦｌａｔＸＯＲ符号は取り得る符号長が数十までの範囲に限定されており、分散数の大きいストレージシステムには適用できないという課題がある。 On the other hand, there has been proposed a Flat XOR code in which the calculation load is suppressed by encoding using an XOR operation (see, for example, Non-Patent Document 6). However, in the Flat XOR code such as HD Combination codes, the configuration of the generator matrix is limited, and the communication amount at the time of disk restoration has not been optimized. In view of this, a method has been proposed in which the communication amount is reduced by a Flat XOR code using a generation matrix having an irregular structure (see, for example, Non-Patent Document 7). However, to date, the non-regular configuration of Flat XOR code that minimizes the amount of communication is limited to a range of possible code lengths of up to several tens, and there is a problem that it cannot be applied to a storage system with a large number of distributions. .

Ｊ．Ｒｅｓｃｈ，Ｉ．Ｖｏｌｖｏｓｋｉ，“Ｒｅｌｉａｂｉｌｉｔｙｍｏｄｅｌｓｆｏｒｈｉｇｈｌｙｆａｕｌｔ−ｔｏｌｅｒａｎｔｓｔｏｒａｇｅｓｙｓｔｅｍｓ，” ａｒＸｉｖＣｏｍｐｕｔｅｒＳｃｉｅｎｃｅＤｉｓｔｒｉｂｕｔｅｄ，Ｐａｒａｌｌｅｌ，ａｎｄＣｌｕｓｔｅｒＣｏｍｐｕｔｉｎｇ，Ｏｃｔ．２０１３．J. et al. Resch, I. et al. Volvoski, “Reliability models for high fault-tolerant storage systems,” arXiv Computer Science Distributed, Parallel Computing, and Cluster Computing. 2013. Ｗ．Ｈａｋｉｍ，Ｊ．Ｄ．Ｋｕｂｉａｔｏｗｉｃｚ，“Ｅｒａｓｕｒｅｃｏｄｉｎｇｖｓ．ｒｅｐｌｉｃａｔｉｏｎ：ａｑｕａｎｔｉｔａｔｉｖｅｃｏｍｐａｒｉｓｏｎ，” ＩＰＴＰＳＰｒｏｃｅｅｄｉｎｇｓ，Ｃａｍｂｒｉｄｇｅ，ＵＳＡ，Ｍａｒ．２００２．W. Hakim, J .; D. Kubiatowitz, “Erasure coding vs. replication: a quantitative comparison,” IPTPS Proceedings, Cambridge, USA, Mar. 2002. Ｈ．Ｌａｈｋａｒ，Ｍ．Ｃ．Ｒ，“Ｔｏｗａｒｄｓｈｉｇｈｓｅｃｕｒｉｔｙａｎｄｆａｕｌｔｔｏｌｅｒａｎｔｄｉｓｐｅｒｓｅｄｓｔｏｒａｇｅｓｙｓｔｅｍｗｉｔｈｏｐｔｉｍｉｚｅｄｉｎｆｏｒｍａｔｉｏｎｄｉｓｐｅｒｓａｌａｌｇｏｒｉｔｈｍ，” ＩｎｔｅｒｎａｔｉｏｎａｌＪｏｕｒｎａｌｏｆＡｄｖａｎｃｅｄＲｅｓｅａｒｃｈｉｎＣｏｍｐｕｔｅｒＳｃｉｅｎｃｅｖｏｌ．２Ｉｓｓｕｅ３，Ｊｕｌｙ２０１４．H. Lahkar, M .; C. R, “Towards high security and fault tolerant dispersed storage system with optimized information dispersal algorithm, and“ Individual Journal of Revolution ”. 2 Issue 3, July 2014. Ａ．Ｇ．Ｄｉｍａｋｉｓ，Ｐ．Ｂ．Ｇｏｄｆｒｅｙ，Ｙ．Ｗｕ，Ｍ．Ｗａｉｎｗｒｉｇｈｔ，Ｋ．Ｒａｍｃｈａｎｄｒａｎ，“Ｎｅｔｗｏｒｋｃｏｄｉｎｇｆｏｒｄｉｓｔｒｉｂｕｔｅｄｓｔｏｒａｇｅｓｙｓｔｅｍｓ，” ＩＥＥＥＴｒａｎｓ．ＩｎｆｏｒｍａｔｉｏｎＴｈｅｏｒｙ，Ｓｅｐｔ．２０１０．A. G. Dimakis, P.A. B. Godfrey, Y .; Wu, M .; Wainwright, K.M. Ramchandran, “Network coding for distributed storage systems,” IEEE Trans. Information Theory, Sept. 2010. Ｃ．Ｈｕａｎｇ，Ｍ．Ｃｈｅｎ，Ｊ．Ｌｉ，“Ｐｙｒａｍｉｄｃｏｄｅｓ：ｆｌｅｘｉｂｌｅｓｃｈｅｍｅｓｔｏｔｒａｄｅｓｐａｃｅｆｏｒａｃｃｅｓｓｅｆｆｉｃｉｅｎｃｙｉｎｒｅｌｉａｂｌｅｄａｔａｓｔｏｒａｇｅｓｙｓｔｅｍｓ，” ＳｉｘｔｈＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＳｙｍｐｏｓｉｕｍｏｎＮｅｔｗｏｒｋＣｏｍｐｕｔｉｎｇａｎｄＡｐｐｌｉｃａｔｉｏｎｓ，２００７，Ｃａｍｂｒｉｄｇｅ，ＵＳＡ，Ｊｕｌｙ２００７．C. Huang, M .; Chen, J. et al. Li, “Pyramid codes: flexible schemes to trade space for access efficiency, Reliable data storage systems, and“ Sixth IEEE International Symposium. ” Ｋ．Ｇｒｅｅｎａｎ，Ｘ．Ｌｉ，Ｗ．Ｊ．Ｊ，“ＦｌａｔＸＯＲ−ｂａｓｅｄｅｒａｓｕｒｅｃｏｄｅｓｉｎｓｔｏｒａｇｅｓｙｓｔｅｍｓ：Ｃｏｎｓｔｒｕｃｔｉｏｎｓ，ｅｆｆｉｃｅｎｔｒｅｃｏｖｅｒｙ，ａｎｄｔｒａｄｅｏｆｆｓ，” ＭａｓｓＳｔｏｒａｇｅＳｙｓｔｅｍｓａｎｄＴｅｃｈｎｏｌｏｇｉｅｓ，２０１０ＩＥＥＥ２６ｔｈＳｙｍｐｏｓｉｕｍ，ＩｎｃｌｉｎｅＶｉｌｌａｇｅ，ＵＳＡ，Ｍａｙ，２０１０．K. Greenan, X. et al. Li, W .; J. et al. J, “Flat XOR-based erase codes in storage systems: Constructions, effective recovery, and tradeoffs,” Mass Storage Systems, 10I, E. 吉田由唯，外村喜秀，白井大介，藤井竜也，“分散ストレージシステムのための通信量を削減するＦｌａｔＸＯＲに基づく消失訂正符号の非正則構成，”２０１４信学ソ大（基礎・境界），Ｔｏｋｕｓｈｉｍａ，Ｊａｐａｎ，Ｓｅｐｔ．２０１４．Yoshida Yui, Toshimura Yoshihide, Shirai Daisuke, Fujii Tatsuya, “Irregular composition of erasure correction codes based on Flat XOR to reduce the amount of communication for distributed storage systems,” 2014 Shingaku Sodai (Basic / Boundary) , Tokyo, Japan, Sept. 2014.

従来のストレージシステムに適用されるＦｌａｔＸＯＲ符号は、故障ディスク修復時に必要な通信量が最小化されていなかった。そこで、行重みに偏りを持たせた非正則構成の生成行列を利用しＲｅｃｏｖｅｒｙＥｑｕａｔｉｏｎＡｌｇｏｒｉｔｈｍを用いた復号を行うことで、通信量を削減することができる。 The Flat XOR code applied to the conventional storage system does not minimize the amount of communication required for repairing the failed disk. Therefore, the amount of communication can be reduced by performing decoding using the recovery equation algorithm using a generation matrix having an irregular configuration with biased row weights.

また、符号長ｎ、分割数ｋ、列重みｗのＦｌａｔＸＯＲ符号を構成するとき、符号長が大きい場合にＦｌａｔＸＯＲ符号の生成行列を探索すると、ｎ−ｋ及びｗの組み合わせからｋ種類を選ぶ組み合わせ爆発が起こり、ｋ＝_ｎ−ｋＣ_ｗ以外の分割数について実現可能な計算時間で符号を決定することができない。この場合、ＦｌａｔＸＯＲ符号の生成行列は行重みおよび列重みを一定とする正則構成となり、演算負荷・通信量の小さい非正則構成の生成行列は構成できなかった。 Further, when a Flat XOR code having a code length n, a division number k, and a column weight w is configured, when a code matrix of a Flat XOR code is searched when the code length is large, k types are selected from combinations of n−k and w. A combination explosion occurs and the code cannot be determined with a feasible calculation time for a number of divisions other than k = _n−k C _w . In this case, the generation matrix of the Flat XOR code has a regular configuration in which the row weight and the column weight are constant, and a generation matrix having a non-regular configuration with a small calculation load / communication amount cannot be configured.

そこで、本発明は、ｎ＝１００程度の大きな符号長であってもＦｌａｔＸＯＲ符号の非正則構成の生成行列を構成可能にすることを目的とする。 Therefore, an object of the present invention is to make it possible to construct a non-regular configuration generator matrix of a Flat XOR code even with a large code length of about n = 100.

本発明は、生成行列の部分的な領域に規則的に１を配置し、残りの領域を探索するという制限を設けることとした。 In the present invention, 1 is regularly arranged in a partial region of the generator matrix, and the restriction of searching the remaining region is provided.

具体的には、本発明に係る生成行列構成装置は、
データの分割数及びデータを格納するディスク数に基づいてゼロ行列を作成するゼロ行列作成部と、
前記ゼロ行列における所定行において、各列に配置する「１」の個数が一定値となるように、前記所定行の各列に「１」又は「０」を配置する所定行構成部と、
前記ゼロ行列における前記所定行を除く全ての行において、各列に配置する「１」の個数が予め定められた符号の最小距離から前記一定値を差し引いた値となり、かつ「１」を配置する行の組み合わせが列ごとに異なるように、前記所定行を除く全ての行の各列に「１」又は「０」を配置する行列構成部と、
を備える。 Specifically, the generator matrix construction device according to the present invention is:
A zero matrix creation unit that creates a zero matrix based on the number of data divisions and the number of disks storing data;
A predetermined row configuration unit that arranges “1” or “0” in each column of the predetermined row so that the number of “1” arranged in each column has a constant value in the predetermined row in the zero matrix;
Arranged in all rows, is the value obtained by subtracting the predetermined value from the minimum distance of the code in which the number is predetermined in the "1" to place in each column, and a "1", except for the predetermined row before Symbol zero matrix A matrix configuration unit that arranges “1” or “0” in each column of all rows except the predetermined row so that the combination of rows to be different is different for each column;
Is provided.

本発明に係る生成行列構成装置では、前記所定行が２以上の場合、前記行列構成部が、前記ゼロ行列における前記所定行を除く全ての行における複数の列において、「１」を配置する行の組み合わせが同じになる場合、当該列同士で前記所定行における「１」を配置する行の組み合わせが異なるように、前記所定行を除く全ての行の各列に「１」又は「０」を配置してもよい。 In the generator matrix configuration device according to the present invention, when the predetermined row is 2 or more, the matrix configuration unit arranges “1” in a plurality of columns in all rows except the predetermined row in the zero matrix. If the combination of the same, "1" is as combinations of different row disposed at the predetermined row in the column between the "1" or "0" in each column of all rows except the predetermined row You may arrange.

具体的には、本発明に係る生成行列構成装置は、
データの分割数及びデータを格納するディスク数に基づいてゼロ行列を作成するゼロ行列作成部と、
前記ゼロ行列における特定の１行において、行に配置する「１」の個数が一定値以上となるように、前記特定の１行の各列に「１」又は「０」を配置する特定行構成部と、
前記特定の１行に「１」が配置された列については、前記ゼロ行列における前記特定の１行を除く全ての行において、各列に配置する「１」の個数が予め定められた符号の最小距離から１を差し引いた値となり、かつ「１」を配置する行の組み合わせが列ごとに異なるように、前記特定の１行を除く全ての行の各列に「１」又は「０」を配置し、
前記特定の１行に「０」が配置された列については、前記ゼロ行列における前記特定の１行を除く全ての行において、各列に配置する「１」の個数が前記符号の最小距離に等しい値となり、かつ「１」を配置する行の組み合わせが列ごとに異なるように、前記特定の１行を除く全ての行の各列に「１」又は「０」を配置する行列構成部と、
を備える。 Specifically, the generator matrix construction device according to the present invention is:
A zero matrix creation unit that creates a zero matrix based on the number of data divisions and the number of disks storing data;
Specific row configuration in which “1” or “0” is arranged in each column of the specific one row so that the number of “1” arranged in the row is equal to or greater than a certain value in the specific row in the zero matrix And
For the column in which “1” is arranged in the specific row, the number of “1” s to be arranged in each column in all rows except the specific row in the zero matrix is a predetermined code. The value obtained by subtracting 1 from the minimum distance , and “1” or “0” in each column of all rows except the specific one row so that the combination of rows in which “1” is arranged is different for each column. Place and
Regarding the column in which “0” is arranged in the specific row, the number of “1” arranged in each column is the minimum distance of the code in all the rows except the specific row in the zero matrix. It becomes equal, and "1" so that the combination of rows to place differs for each row, and the matrix component to place "1" or "0" in each column in every row except the particular one line ,
Is provided.

本発明に係る生成行列構成装置では、前記ゼロ行列は、データの分割数に等しい列数を有し、かつデータを格納するディスク数から前記データの分割数を差し引いて求められるパリティディスク数に等しい行数を有してもよい。 In the generator matrix configuration device according to the present invention, the zero matrix has the number of columns equal to the number of data divisions , and is equal to the number of parity disks obtained by subtracting the number of data divisions from the number of disks storing data. You may have the number of rows .

具体的には、本発明に係る生成行列構成方法は、
生成行列構成装置が実行する生成行列構成方法であって、
データの分割数及びデータを格納するディスク数に基づいてゼロ行列を作成するゼロ行列作成ステップと、
前記ゼロ行列における所定行において、各列に配置する「１」の個数が一定値となるように、前記所定行の各列に「１」又は「０」を配置する所定行構成ステップと、
前記ゼロ行列における前記所定行を除く全ての行において、各列に配置する「１」の個数が予め定められた符号の最小距離から前記一定値を差し引いた値となり、かつ「１」を配置する行の組み合わせが列ごとに異なるように、前記所定行を除く全ての行の各列に「１」又は「０」を配置する行列構成ステップと、
を実行する。 Specifically, the generator matrix construction method according to the present invention is:
A generator matrix configuration method executed by a generator matrix configuration apparatus,
Creating a zero matrix based on the number of data divisions and the number of disks storing the data; and
A predetermined row configuration step of arranging “1” or “0” in each column of the predetermined row so that the number of “1” arranged in each column has a constant value in the predetermined row in the zero matrix;
Arranged in all rows, is the value obtained by subtracting the predetermined value from the minimum distance of the code in which the number is predetermined in the "1" to place in each column, and a "1", except for the predetermined row before Symbol zero matrix A matrix construction step of arranging “1” or “0” in each column of all rows except the predetermined row so that the combination of rows to be different is different for each column;
Execute.

具体的には、本発明に係る生成行列構成方法は、
生成行列構成装置が実行する生成行列構成方法であって、
データの分割数及びデータを格納するディスク数に基づいてゼロ行列を作成するゼロ行列作成ステップと、
前記ゼロ行列における特定の１行において、行に配置する「１」の個数が一定値以上となるように、前記特定の１行の各列に「１」又は「０」を配置する特定行構成ステップと、
前記特定の１行に「１」が配置された列については、前記ゼロ行列における前記特定の１行を除く全ての行において、各列に配置する「１」の個数が予め定められた符号の最小距離から１を差し引いた値となり、かつ「１」を配置する行の組み合わせが列ごとに異なるように、前記特定の１行を除く全ての行の各列に「１」又は「０」を配置し、
前記特定の１行に「０」が配置された列については、前記ゼロ行列における前記特定の１行を除く全ての行において、各列に配置する「１」の個数が前記符号の最小距離に等しい値となり、かつ「１」を配置する行の組み合わせが列ごとに異なるように、前記特定の１行を除く全ての行の各列に「１」又は「０」を配置する行列構成ステップと、
を実行する。 Specifically, the generator matrix construction method according to the present invention is:
A generator matrix configuration method executed by a generator matrix configuration apparatus,
Creating a zero matrix based on the number of data divisions and the number of disks storing the data; and
Specific row configuration in which “1” or “0” is arranged in each column of the specific one row so that the number of “1” arranged in the row is equal to or greater than a certain value in the specific row in the zero matrix Steps,
For the column in which “1” is arranged in the specific row, the number of “1” s to be arranged in each column in all rows except the specific row in the zero matrix is a predetermined code. The value obtained by subtracting 1 from the minimum distance , and “1” or “0” in each column of all rows except the specific one row so that the combination of rows in which “1” is arranged is different for each column. Place and
Regarding the column in which “0” is arranged in the specific row, the number of “1” arranged in each column is the minimum distance of the code in all the rows except the specific row in the zero matrix. It becomes equal, and "1" so that the combination of rows to place differs for each column, a matrix arrangement step of placing the "1" or "0" in each column in every row except the particular one line ,
Execute.

本発明は、ｎ＝１００程度の大きな符号長であってもＦｌａｔＸＯＲ符号の非正則構成の生成行列の構成が可能であるため、多地点に分散したストレージシステムにおいて疎行列を用いた符号により小さい演算負荷で冗長化を行い、正則ＦｌａｔＸＯＲ符号およびＲＳ符号と比べてディスク復旧時の通信量および演算量を削減することができる。 Since the present invention can construct a non-regular configuration generator matrix of a Flat XOR code even with a large code length of about n = 100, it is smaller than a code using a sparse matrix in a multi-point distributed storage system. Redundancy is performed with a calculation load, and the communication amount and the calculation amount at the time of disk restoration can be reduced as compared with the regular Flat XOR code and the RS code.

実施形態に係る符号構成装置の概略を示す。1 shows an outline of a code configuration device according to an embodiment. 第１の構成法によって構成した生成行列Ｇの一例を示す。An example of the generator matrix G configured by the first configuration method is shown. 第２の構成法によって構成した生成行列Ｇの一例を示す。An example of the generator matrix G configured by the second configuration method is shown. 第３の構成法によって構成した生成行列Ｇの一例を示す。An example of the generator matrix G configured by the third configuration method is shown. Ｉ＝１のときの修復式の集合ＲＥの一例を示す。An example of a set RE of repair formulas when I = 1 is shown. Ｉ＝２のときの行の選択例を示す。An example of selecting a row when I = 2 is shown. Ｉ＝２のときの修復式の集合ＲＥの一例を示す。An example of a set RE of repair formulas when I = 2 is shown. Ｉ＝３のときの行の選択例を示す。An example of row selection when I = 3 is shown. Ｉ＝３のときの修復式の集合ＲＥの一例を示す。An example of a set RE of repair formulas when I = 3 is shown. 実施例１に係る非正則構成のＦｌａｔＸＯＲ符号と比較例に係る正則構成のＦｌａｔＸＯＲ符号との平均通信量の比較結果を示す。The comparison result of the average traffic of the Flat XOR code of the irregular structure which concerns on Example 1 and the Flat XOR code of the regular structure which concerns on a comparative example is shown. 実施例１に係る非正則構成のＦｌａｔＸＯＲ符号と比較例に係るＲＳ符号との通信量の比較結果を示す。The comparison result of the traffic of the Flat XOR code of the irregular structure which concerns on Example 1, and RS code which concerns on a comparative example is shown. 実施例２に係る非正則構成のＦｌａｔＸＯＲ符号と比較例に係るＲＳ符号とのＸＯＲ計算回数の比較結果を示す。The comparison result of the XOR calculation frequency of the Flat XOR code | symbol of the irregular structure which concerns on Example 2, and RS code | cord | chord concerning a comparative example is shown. 実施例２に係る非正則構成のＦｌａｔＸＯＲ符号と比較例に係るＲＳ符号との平均通信量の比較結果を示す。The comparison result of the average traffic of the Flat XOR code of the irregular structure which concerns on Example 2, and RS code which concerns on a comparative example is shown. 実施例３に係る非正則構成のＦｌａｔＸＯＲ符号と比較例に係る正則構成のＦｌａｔＸＯＲ符号とのＸＯＲ計算回数の比較結果を示す。10 shows comparison results of the number of XOR calculations between a non-regular configuration Flat XOR code according to Example 3 and a regular configuration Flat XOR code according to a comparative example. 実施例３に係る非正則構成のＦｌａｔＸＯＲ符号と比較例に係る正則構成のＦｌａｔＸＯＲ符号との平均通信量の比較結果を示す。The comparison result of the average traffic of the Flat XOR code of the irregular structure which concerns on Example 3, and the Flat XOR code of the regular structure which concerns on a comparative example is shown.

以下、本発明の実施形態について、図面を参照しながら詳細に説明する。なお、本発明は、以下に示す実施形態に限定されるものではない。これらの実施の例は例示に過ぎず、本発明は当業者の知識に基づいて種々の変更、改良を施した形態で実施することができる。なお、本明細書及び図面において符号が同じ構成要素は、相互に同一のものを示すものとする。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In addition, this invention is not limited to embodiment shown below. These embodiments are merely examples, and the present invention can be implemented in various modifications and improvements based on the knowledge of those skilled in the art. In the present specification and drawings, the same reference numerals denote the same components.

従来のＦｌａｔＸＯＲ符号は、（ｎ−ｋ）及びｗの組み合わせからｋ種類を選ぶ数の行列から生成行列を選択する必要があり、符号長が大きいときに組合せ爆発となるため、とりうる符号長が限定されていた。本実施形態に係る発明は、符号長が大きい場合に広範囲の分割数でＦｌａｔＸＯＲ符号の非正則構成の生成行列Ｇを構成可能とする。これにより本実施形態に係る発明は、生成行列の部分的な領域に規則的に各列ｐ個の１を配置し、残りの領域にｗ−ｐ個の１を配置する問題に制限することで探索を可能とする。 Since the conventional Flat XOR code needs to select a generator matrix from the number of matrices that select k types from the combination of (n−k) and w, and a combination explosion occurs when the code length is large, the possible code length Was limited. The invention according to the present embodiment makes it possible to construct a non-regular configuration generation matrix G of Flat XOR codes with a wide range of division numbers when the code length is large. As a result, the invention according to this embodiment is limited to the problem of regularly arranging 1 p in each column in a partial region of the generator matrix and placing wp 1s in the remaining region. Allows searching.

本実施形態に係るストレージシステムは、符号化装置と、複数のディスクと、復号装置と、を備える。図１に、本実施形態に係る符号化装置の概略を示す。符号化装置１０は、冗長化の対象となるデータを符号化する。複数のディスク２０は、冗長化の対象となるデータを格納する。復号装置（不図示）は、非正則生成行列Ｇを用いて、複数のディスクに格納されているデータの修復を行う。 The storage system according to the present embodiment includes an encoding device, a plurality of disks, and a decoding device. FIG. 1 shows an outline of an encoding apparatus according to this embodiment. The encoding device 10 encodes data to be made redundant. The plurality of disks 20 store data to be made redundant. A decoding device (not shown) uses the irregular generator matrix G to restore data stored in a plurality of disks.

符号化装置１０は、データ分割部１１と、符号化部１２と、データ分配部１３と、を備える。データ分割部１１は、冗長化するデータをｋ個に分割する。符号化部１２は、非正則構成の生成行列Ｇを用いて、ｋ個に分割したデータを符号化し、（ｎ−ｋ）個の冗長データおよびｋ個に分割したデータを含むｎ個の符号化データを生成する。データ分配部１３は、符号化部１２の符号化したｎ個のデータを、各ディスク２０に送信する。 The encoding device 10 includes a data dividing unit 11, an encoding unit 12, and a data distribution unit 13. The data dividing unit 11 divides data to be redundant into k pieces. The encoding unit 12 encodes the data divided into k pieces using the generation matrix G having a non-regular configuration, and encodes n pieces of data including (n−k) redundant data and k pieces of data. Generate data. The data distribution unit 13 transmits the n pieces of data encoded by the encoding unit 12 to each disk 20.

符号化装置は、コンピュータを、各機能部として機能させることで実現してもよい。この場合、符号化装置内のＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）が、記憶部（不図示）に記憶されたコンピュータプログラムを実行することで、各構成を実現する。図１では、ディスク２０の数と生成する符号化データの数が等しい場合を示すが、ディスク２０の数と生成する符号化データの数は同一でなくともよい。例えば、分散した各ディスクに複数の符号化データを保存するなどの使い方をすることも考えられる。また、ディスク２０は地理的に分散して配置され、ネットワークにより接続されていてもよい。 The encoding device may be realized by causing a computer to function as each functional unit. In this case, each configuration is realized by a CPU (Central Processing Unit) in the encoding apparatus executing a computer program stored in a storage unit (not shown). Although FIG. 1 shows the case where the number of disks 20 is equal to the number of encoded data to be generated, the number of disks 20 and the number of encoded data to be generated may not be the same. For example, it is conceivable to use a plurality of encoded data on each distributed disk. The disks 20 may be arranged geographically distributed and connected by a network.

（生成行列構成装置）
本実施形態に係る符号化部１２は、生成行列構成装置として機能し、非正則構成の生成行列Ｇを構成する。本実施形態では、ストレージシステムを構成する全ディスク数を符号長に等しいｎ、データの分割数をｋ、パリティディスク数をｎ−ｋ、符号の最小距離をｗとする。ｎ−ｋ行ｋ列のゼロ行列をＺとする。 (Generator matrix construction device)
The encoding unit 12 according to the present embodiment functions as a generator matrix configuration device and configures a generator matrix G having a non-regular configuration. In this embodiment, the total number of disks constituting the storage system is n equal to the code length, the number of data divisions is k, the number of parity disks is nk, and the minimum code distance is w. Let Z be a zero matrix of nk rows and k columns.

（ａ）全探索可能な範囲の非正則の生成行列Ｇの第１の構成方法
符号長ｎがある程度小さく、ｎ−ｋ及びｗの組み合わせからｋ種類を選ぶ全組み合わせについて全探索可能である場合、ｋ−ｎ行ｎ列のゼロ行列Ｚを作成し、Ｚの各列について探索した組み合わせに相当する行の値を１とする。構成した行列の中で、任意のｆ列｛ｆ＝１，２，…，ｗ｝を修復可能な行列を生成行列Ｇとし、最も平均通信量の小さい生成行列を選択する。 (A) First configuration method of non-regular generator matrix G in a fully searchable range When the code length n is small to some extent and k search is possible for all combinations in which k types are selected from combinations of nk and w, A zero matrix Z of k−n rows and n columns is created, and a row value corresponding to a combination searched for each column of Z is set to 1. Among the constructed matrices, a matrix capable of repairing an arbitrary f column {f = 1, 2,..., W} is set as a generation matrix G, and a generation matrix having the smallest average traffic is selected.

図２に、第１の構成法によって構成した生成行列Ｇの一例を示す。ｎ＝１６、ｋ＝１０、ｗ＝３の場合を示す。生成行列の各列について、ｎ−ｋ行からｗ行を選ぶ組合せは列内で１を配置する行の組合せとなる。ｎ−ｋ＝６、ｗ＝３の場合、各列内の１の配置パターン数は図に示すように_ｎ−ｋＣ_ｗとなり、２０通り存在する。２０通りの配置パターンからｋ列分を選択することで生成行列を構成するため、とりうる生成行列候補数は_２０Ｃ_ｋとなる。ｋ＝１０であるため、とりうる生成行列候補数は１８４７５６通りとなる。ｆ＝２とすると、その中の２つの生成行列候補である行列（１）および行列（１８４７５６）が、生成行列Ｇとなる。 FIG. 2 shows an example of the generator matrix G configured by the first configuration method. The case where n = 16, k = 10, and w = 3 is shown. For each column of the generator matrix, the combination of selecting w rows from nk rows is a combination of rows in which 1 is arranged in the column. When n−k = 6 and w = 3, the number of one arrangement pattern in each column is _n−k C _{w as} shown in the figure, and there are 20 types. Since the generation matrix is configured by selecting k columns from the 20 arrangement patterns, the number of possible generation matrix candidates is ₂₀ C _k . Since k = 10, the number of possible generation matrix candidates is 184756. When f = 2, the matrix (1) and the matrix (184756) which are two generation matrix candidates among them are the generation matrix G.

（ｂ）全探索不可能な範囲の非正則の生成行列Ｇの第２の構成方法
符号長ｎが大きい場合、ｎ−ｋ及びｗの組み合わせからｋ種類を選ぶ数が非常に大きくなるため、生成行列Ｇの第１の構成法を用いて行列を構成することができない。そこで、生成行列を２つの領域に分割し、列重みを割り振ることで探索範囲に制約をつけ、非正則の生成行列を構成する。具体的には、以下の手順で非正則の生成行列Ｇを構成する。 (B) Second configuration method of non-regular generator matrix G in a range where full search is impossible When code length n is large, the number of k types to be selected from the combination of n−k and w becomes very large. A matrix cannot be constructed using the first construction method of the matrix G. Therefore, the generation matrix is divided into two regions, and the search range is restricted by assigning column weights, thereby forming a non-regular generation matrix. Specifically, the non-regular generator matrix G is constructed by the following procedure.

具体的には、符号化部１２は、ゼロ行列作成ステップを実行するゼロ行列作成部と、所定行構成ステップを実行する所定行構成部と、行列構成ステップを実行する行列構成部と、を備える。ゼロ行列作成ステップにおいてステップＳ１０１を実行し、所定行構成ステップにおいてステップＳ１０２及びＳ１０３を実行し、行列構成ステップにおいてステップＳ１０４及びＳ１０５を実行する。 Specifically, the encoding unit 12 includes a zero matrix creation unit that executes a zero matrix creation step, a predetermined row configuration unit that executes a predetermined row configuration step, and a matrix configuration unit that executes a matrix configuration step. . Step S101 is executed in the zero matrix creation step, steps S102 and S103 are executed in the predetermined row configuration step, and steps S104 and S105 are executed in the matrix configuration step.

ステップＳ１０１：ｎ−ｋ行ｋ列のゼロ行列Ｚを作成する。 Step S101: A zero matrix Z of nk rows and k columns is created.

例えば、ｎ＝１６、ｋ＝１０、ｗ＝３の場合、図３に示すように、ゼロ行列Ｚは６行１０列となる。 For example, when n = 16, k = 10, and w = 3, the zero matrix Z has 6 rows and 10 columns as shown in FIG.

ステップＳ１０２：行重みに偏りを持たせるため、１行目からｘ行目までの行重みをｐとおき、式１を満たす［ｘ，ｐ］をすべて求める。
（数１）
_ｘＣ_ｐ×_{ｎ−ｋ−ｘ}Ｃ_ｗ−ｐ≧ｋ（式１）
ただし、ｘ＞ｐである。 Step S102: In order to give bias to the row weights, the row weights from the first row to the x-th row are set as p, and all [x, p] satisfying Expression 1 are obtained.
(Equation 1)
_{_{_{x C p × n-k-}}} x C w-p ≧ k ( Equation 1)
However, x> p.

例えば、ｎ＝１６、ｋ＝１０、ｗ＝３の場合、［ｘ，ｐ］の組み合わせは［１，１］、［２，１］、［４，２］、［５，２］、［６，３］となる。［１，１］の場合は１行目、［２，１］の場合は２行目が所定行となる。 For example, when n = 16, k = 10, and w = 3, the combinations of [x, p] are [1,1], [2,1], [4,2], [5,2], [6 , 3]. In the case of [1, 1], the first row is designated, and in the case of [2, 1], the second row is designated.

ステップＳ１０３：Ｚの１行目からｘ行目までの範囲に列重みｐとなるように１を配置した行列をＺ３とする。_ｘＣ_ｐの組み合わせにより各列において１となる行を決定する。 Step S103: A matrix in which 1 is arranged in the range from the first row to the x-th row of Z so as to have the column weight p is defined as Z3. _The row which becomes 1 in each column is determined by the combination of _x C _p .

例えば、列重みがｐ＝１であるとする。
ｘ＝１のとき、Ｚ３１に示すように、１行目に「１１１１１１１１１１」が配置されている。
ｘ＝２のとき、Ｚ３２に示すように、１行目に「１０１０１０１０１０」が配置され、２行目に「０１０１０１０１０１」が配置されている。 For example, assume that the column weight is p = 1.
When x = 1, “1111111111” is arranged in the first row as indicated by Z31.
When x = 2, as shown in Z32, “1010101010” is arranged in the first row, and “0101010101” is arranged in the second row.

ステップＳ１０４：Ｚ３のｘ＋１行目からｎ−ｋ行目までの範囲に列重みｗ−ｐとなるように以下の方法で１を配置する。_{ｎ−ｋ−ｘ}Ｃ_ｗ−ｐ≦ｋの場合、１列目から_{ｎ−ｋ−ｘ}Ｃ_ｗ−ｐ行目までの範囲に、_{ｎ−ｋ−ｘ}Ｃ_ｗ−ｐの組み合わせにより各列において１となる行を決定した行列Ｂを置き、ｒ（_{ｎ−ｋ−ｘ}Ｃ_ｗ−ｐ）＋１列目から２ｒ（_{ｎ−ｋ−ｘ}Ｃ_ｗ−ｐ）列目にＢをｒ列｛ｒ＝１，２，…，ｘ−１｝循環させた行列Ｂ_ｒを置く。Ｂｒの列数合計がｋを超える場合はランダムにｋ列を選択しＺ３の１列目からｋ列目に配置する。 Step S104: 1 is arranged by the following method so that the column weight wp is in the range from the x + 1th row to the nk row of Z3. _In the case of nk _−xC wp ≦ k, 1 in each column in the range from the first column to the _nk−xC _wp row by the combination of _nk−xC _wp Place the matrix B to determine the rows _{_{to, r (n-k-x}} C w-p) +1 row _{_{2r (n-k-x C}} w-p) B into th column r columns {r = 1 , 2,..., X−1} and put the matrix B _r that is circulated. When the total number of Br columns exceeds k, k columns are randomly selected and arranged in the first to kth columns of Z3.

例えば、列重みｐ＝１であるとすると、ｗ＝３であるため、列重みｗ−ｐは２となる。
ｘ＝１のとき、Ｚ４１に示すように、２行目から６行目までの各列に「１」を２つずつ配置する。
ｘ＝２のとき、Ｚ４２に示すように、１列目から順に、３行目から６行目までの各列に「１」を２つずつ配置する。このとき、各列に「１」を２つずつ配置する組み合わせは６列目で終了する。この場合、７列目以降は、１行目及び２行目の「１」の配置との組み合わせが１列目から６列目までと異なるように、２行目から６行目までの各列に「１」を２つずつ配置する。 For example, if the column weight p = 1, w = 3, so the column weight w−p is 2.
When x = 1, two “1” s are arranged in each column from the second row to the sixth row as indicated by Z41.
When x = 2, as shown in Z42, two “1” s are arranged in each column from the third row to the sixth row in order from the first column. At this time, the combination in which two “1” s are arranged in each column ends in the sixth column. In this case, each column from the second row to the sixth row is different from the seventh column so that the combination of the arrangement of “1” in the first row and the second row is different from the first column to the sixth column. Two “1” s are arranged in each.

ステップＳ１０５：構成した行列の中で任意のｆ｛ｆ＝１，２，…，ｗ｝列を修復可能な行列を生成行列Ｇとする。例えば、Ｚ４１及びＺ４２の両方が生成行列Ｇとなる。 Step S105: A matrix that can repair an arbitrary f {f = 1, 2,..., W} column in the constructed matrix is set as a generation matrix G. For example, both Z41 and Z42 are the generator matrix G.

以上説明したように、生成行列Ｇの第２の構成方法を用いることで、非正則の生成行列Ｇを構成することができる。なお、本実施形態では「所定行」が１行目及び２行目である例を示したが、「所定行」は２行目以降の任意の行であってもよい。 As described above, the non-regular generation matrix G can be configured by using the second configuration method of the generation matrix G. In the present embodiment, an example in which the “predetermined row” is the first row and the second row is shown, but the “predetermined row” may be an arbitrary row after the second row.

（ｃ）全探索不可能な範囲の非正則の生成行列Ｇの第３の構成方法
符号長ｎが大きく、ステップＳ１０２において式１を満たす［ｘ，ｐ］の組合せが［ｘ，ｐ］＝［ｎ−ｋ，ｗ］のみである場合等において、_ｘＣ_ｐの組合せ数が大きくなり、生成行列Ｇの第２の構成法で行列を構成することができない。そこで、生成行列の１行目の行重みが平均行重み以上となるように構成することで、残りｎ−ｋ−１行の探索範囲を制限し、非正則の生成行列Ｇを構成する。具体的には、以下の手順で非正則の生成行列Ｇを構成する。 (C) Third Method of Constructing Non-Regular Generator Matrix G That Cannot Be Fully Searched A combination of [x, p] that satisfies the formula 1 in step S102 is [x, p] = [ In the case of only n−k, w], the number of combinations of _x C _p becomes large, and the matrix cannot be configured by the second configuration method of the generator matrix G. Therefore, the search range of the remaining n−k−1 rows is limited by configuring the row weight of the first row of the generation matrix to be equal to or greater than the average row weight, and the irregular generation matrix G is configured. Specifically, the non-regular generator matrix G is constructed by the following procedure.

具体的には、符号化部１２は、ゼロ行列作成ステップを実行するゼロ行列作成部と、特定行構成ステップを実行する特定行構成部と、行列構成ステップを実行する行列構成部と、を備える。ゼロ行列作成ステップにおいてステップＳ２０１を実行し、特定行構成ステップにおいてステップＳ２０２及びＳ２０３を実行し、行列構成ステップにおいてステップＳ２０４及びＳ２０５を実行する。 Specifically, the encoding unit 12 includes a zero matrix creation unit that executes a zero matrix creation step, a specific row configuration unit that executes a specific row configuration step, and a matrix configuration unit that executes a matrix configuration step. . Step S201 is executed in the zero matrix creation step, steps S202 and S203 are executed in the specific row configuration step, and steps S204 and S205 are executed in the matrix configuration step.

ステップＳ２０１：ｎ−ｋ行ｋ列のゼロ行列Ｚを作成する。 Step S201: A zero matrix Z of nk rows and k columns is created.

例えば、ｎ＝１６、ｋ＝１０、ｗ＝３の場合、図４に示すように、ゼロ行列Ｚは６行１０列となる。 For example, when n = 16, k = 10, and w = 3, the zero matrix Z has 6 rows and 10 columns as shown in FIG.

ステップＳ２０２：生成行列の平均行重みＥを式２により求める。
（数２）
Ｅ＝ｗ×ｋ／（ｎ−ｋ）（式２） Step S202: The average row weight E of the generator matrix is obtained by Equation 2.
(Equation 2)
E = w × k / (n−k) (Formula 2)

例えば、ｎ＝１６、ｋ＝１０、ｗ＝３の場合、Ｅ＝５となる。 For example, when n = 16, k = 10, and w = 3, E = 5.

ステップＳ２０３：Ｚの１行目の行重みがＷ｛Ｗ＝Ｅ＋１，Ｅ＋２，…，ｋ｝となるように、_ｋＣ_Ｗの組合せにより１行目の１となる列を配置した行列をＺ’とする。１行目に１を配置した列の集合をＡとする。 Step S203: A matrix in which 1 column of the first row is arranged by a combination of _k C _W so that the row weight of the first row of Z becomes W {W = E + 1, E + 2,. And A set of columns in which 1 is arranged in the first row is A.

例えば、Ｅ＝５である場合、１行目の行重みＷはＷ｛６，７，８，９，１０｝となる。
１行目の行重みがＷ＝６のとき、１行目に１を配置する列の組合せを_ｋＣ_ｗにより決定する。例えば、Ａ＝｛１，２，３，４，５，６｝のとき、Ｚ３６に示すように、１行目の１列目から６列目までに「１」を配置する。また、１行目の行重みがＷ＝７のとき、１行目に１を配置する列の組合せを_ｋＣ_ｗにより決定する。例えば、Ａ＝｛１，２，３，７，８，９，１０｝のとき、Ｚ３７に示すように、１行目の１列目から３列目までに「１」を配置し、１行目の７列目から１０列目までに「１」を配置する。 For example, when E = 5, the row weight W of the first row is W {6, 7, 8, 9, 10}.
When the row weight of the first row is W = 6, a combination of columns in which 1 is arranged in the first row is determined by _k C _w . For example, when A = {1, 2, 3, 4, 5, 6}, “1” is arranged from the first column to the sixth column of the first row as indicated by Z36. When the row weight of the first row is W = 7, a combination of columns in which 1 is arranged in the first row is determined by _k C _w . For example, when A = {1, 2, 3, 7, 8, 9, 10}, as shown in Z37, “1” is arranged from the first column to the third column of the first row, and one row “1” is arranged from the seventh column to the tenth column.

ステップＳ２０４：Ｚ’の２行目からｎ−ｋ行目までの範囲に１を配置する。Ａに含まれる列は、_{ｎ−ｋ−１}Ｃ_ｗ−１の組合せにより１となる行を決定する。_{ｎ−ｋ−１}Ｃ_ｗ−１＞Ｗの場合は、ｎ−ｋ−１及びｗ−１の組み合わせからＷ種類を選び、各列を構成する。Ａに含まれない列は、_{ｎ−ｋ−１}Ｃ_ｗの組合せにより１となる行を決定する（図３）。_{ｎ−ｋ−１}Ｃ_ｗ＞ｋ−Ｗの場合は、ｎ−ｋ−１及びｗの組み合わせからｋ−Ｗ種類を選び各列を構成する。 Step S204: 1 is arranged in the range from the second row of Z ′ to the nk row. The column included in A determines the row which becomes 1 by the combination of _n−k−1 C _w−1 . _{When n−k−1} C _w−1 > W, the W type is selected from the combination of n−k−1 and w−1, and each column is configured. For a column not included in A, a row that is 1 is determined by a combination of _n−k−1 C _w (FIG. 3). _{When n−k−1} C _w > k−W, k−W types are selected from combinations of n−k−1 and w to configure each column.

例えば、Ｗ＝６のとき、Ｚ４６に示すように、Ａに含まれる１列目から６列目までは_５Ｃ_２の組合せにより決定される行に「１」を配置し、Ａに含まれない７列目から１０列目までは_５Ｃ_３の組合せにより決定される行に「１」を配置する。
例えば、Ｗ＝７のとき、Ｚ４７に示すように、Ａに含まれる１列目から３列目まで及び７列目から１０列目までは_５Ｃ_２の組合せにより決定される行に「１」を配置し、Ａに含まれない４列目から６列目までは_５Ｃ_３の組合せにより決定される行に「１」を配置する。 For example, when W = 6, as shown in Z46, “1” is arranged in the row determined by the combination of ₅ C ₂ from the first column to the sixth column included in A, and is not included in A From the seventh column to the tenth column, “1” is arranged in a row determined by the combination of ₅ C ₃ .
For example, when W = 7, as indicated by Z47, the first to third columns and the seventh to tenth columns included in A are “1” in the rows determined by the combination of ₅ C _2. And from the fourth column to the sixth column not included in A, “1” is arranged in a row determined by the combination of ₅ C ₃ .

ステップＳ２０５：構成した行列の中で任意のｆ｛ｆ＝１，２，…，ｗ｝列を修復可能な行列を生成行列Ｇとする。例えば、Ｚ４６及びＺ４７の両方が生成行列Ｇとなる。構成した生成行列の中で最も平均通信量の小さい行列を選択する。 Step S205: A matrix that can repair an arbitrary f {f = 1, 2,..., W} column in the constructed matrix is set as a generation matrix G. For example, both Z46 and Z47 are the generator matrix G. A matrix having the smallest average traffic is selected from the generated generation matrices.

以上説明したように、生成行列Ｇの第３の構成方法を用いることで、非正則の生成行列Ｇを構成することができる。なお、本実施形態では「特定の１行」が１行目である例を示したが、「特定の１行」は２行目以降の任意の行であってもよい。 As described above, the non-regular generation matrix G can be configured by using the third configuration method of the generation matrix G. In the present embodiment, an example in which “specific one line” is the first line is shown, but “specific one line” may be an arbitrary line after the second line.

（復号装置）
復号装置は、生成行列Ｇにｎ−ｋ行ｎ−ｋ列の単位行列を水平に接続しパリティ検査行列Ｈとする。
（数３）
Ｈ＝［ＧＩ］ (Decryption device)
The decoding apparatus horizontally connects a unit matrix of nk rows and nk columns to the generator matrix G to obtain a parity check matrix H.
(Equation 3)
H = [GI]

ＲｅｃｏｖｅｒｙＥｑｕａｔｉｏｎＡｌｇｏｒｉｔｈｍにより、パリティ検査行列Ｈのｎ−ｋ行からＩ｛Ｉ＝１，２，…，ｎ−ｋ｝行を選び選択した行のＸＯＲにより得られる修復式の集合をＲＥとする。本実施形態では、全ディスク数をｎ、データの分割数をｋ、パリティディスク数をｎ−ｋ、符号の最小距離をｗ、同時故障ディスク数をｆ｛ｆ＝１，２，３｝、故障ディスクの集合をＦ｛Ｆ＝Ｆ_１，…，Ｆ_ｆ｝とする。 Let RE be a set of restoration formulas obtained by XOR of selected rows by selecting I {I = 1, 2,..., Nk} rows from nk rows of the parity check matrix H by the Recovery Equation Algorithm. In this embodiment, the total number of disks is n, the number of data divisions is k, the number of parity disks is nk, the minimum code distance is w, the number of simultaneous failed disks is f {f = 1, 2, 3}, Let F {F = F ₁ ,..., F _f } be a set of disks.

図５に、Ｉ＝１のときの修復式の集合ＲＥの一例を示す。本実施形態では、ｎ＝１６、ｋ＝１０、ｗ＝３であり、生成行列Ｇは６行１０列である。この場合、生成行列Ｇに６行６列の単位行列を水平に接続する。 FIG. 5 shows an example of a set RE of repair formulas when I = 1. In this embodiment, n = 16, k = 10, and w = 3, and the generator matrix G has 6 rows and 10 columns. In this case, a 6 × 6 unit matrix is connected horizontally to the generator matrix G.

図６に、Ｉ＝２のときの行の選択例を示す。図７に、Ｉ＝２のときの修復式の集合ＲＥの一例を示す。パリティ検査行列Ｈの１行目及び２行目を選択した場合、図７の１行目に示すように、Ｉ＝２のときのパリティ検査行列Ｈは、図５に示すパリティ検査行列Ｈの１行目と２行目のＸＯＲとなる。Ｉ＝２のときの行の選択のバリエーションは１５通りあるため、Ｉ＝２のパリティ検査行列Ｈは１５行となる。 FIG. 6 shows an example of row selection when I = 2. FIG. 7 shows an example of a set RE of repair formulas when I = 2. When the first and second rows of the parity check matrix H are selected, as shown in the first row of FIG. 7, the parity check matrix H when I = 2 is 1 of the parity check matrix H shown in FIG. XOR for the second and second rows. Since there are 15 variations of row selection when I = 2, the parity check matrix H of I = 2 is 15 rows.

図８に、Ｉ＝３のときの行の選択例を示す。図９に、Ｉ＝３のときの修復式の集合ＲＥの一例を示す。パリティ検査行列Ｈの１、２及び３行目を選択した場合、図９の１行目に示すように、Ｉ＝３のときのパリティ検査行列Ｈは、図５に示すパリティ検査行列Ｈの１、２及び３行目のＸＯＲとなる。Ｉ＝３のときの行の選択のバリエーションは２０とおりあるため、Ｉ＝３のパリティ検査行列Ｈは２０行となる。 FIG. 8 shows an example of row selection when I = 3. FIG. 9 shows an example of a set RE of repair formulas when I = 3. When the first, second and third rows of the parity check matrix H are selected, the parity check matrix H when I = 3 is 1 of the parity check matrix H shown in FIG. XOR of the second and third rows. Since there are 20 variations of row selection when I = 3, the parity check matrix H of I = 3 is 20 rows.

ｆ＝１の場合、修復式の集合ＲＥの中でＦ_１列目の値が１となる行の集合ＲＥ_Ｆを求め、ＲＥ_Ｆの中で最も行重みの小さい行をＦ_１の修復式とする。例えば、図５、図７、図９のように修復式の集合ＲＥを求めたとき、列１に対応するディスクが故障した場合の修復式を求めるためには、まずＲＥの１列目の値が１となるＩ＝１の１，２，３行目およびＩ＝２の３，４，５，７，８，９，１０，１１，１２行目およびＩ＝３の１，８，９，１３，１４，１５，１６，１７，１８行目の集合を抜き出しＲＥ_Ｆとする。ＲＥ_Ｆの行重みは順に１１，５，５，８，８，８，８，８，８，８，９，８，７，７，７，９，９，９，９，９，９であることから、最も行重みの小さいＩ＝１の２，３行目のいずれかを修復式とする。Ｉ＝１の２行目を修復式とすると、値が１となる列は１，２，３，４，１２であることから、２，３，４，１２列目に相当するディスクのデータのＸＯＲにより、１列目に相当する故障ディスクのデータを修復する。 For f = 1, determined a set RE _F line the value of F ₁ row becomes 1 in collective RE repair type, a small line most row degree in the RE _F and repair formula F ₁ To do. For example, when the repair formula set RE is obtained as shown in FIG. 5, FIG. 7, and FIG. 9, in order to obtain the repair formula when the disk corresponding to the column 1 fails, first the value of the first column of the RE 1 = 1, 1, 2, 3 rows and I = 2 3, 4, 5, 7, 8, 9, 10, 11, 12 and I = 3 1, 8, 9, and RE _F pulled out a set of 13,14,15,16,17,18 line. Line weight of RE _F in turn is 11,5,5,8,8,8,8,8,8,8,9,8,7,7,7,9,9,9,9,9,9 Therefore, one of the second and third rows with I = 1 having the smallest row weight is set as a restoration formula. Assuming that the second row of I = 1 is a restoration formula, the columns with values of 1 are 1, 2, 3, 4 and 12, so the data of the disk corresponding to the 2nd, 3rd, 4th and 12th columns The data of the failed disk corresponding to the first column is repaired by XOR.

修復式において、故障列Ｆ_１を除く値が１の列に対応するディスクに保存したデータのＸＯＲにより、故障ディスクＦ_１のデータを修復する。
このとき、修復式の行重みから故障列に相当する１を引いた値を修復時の参照ディスク数、参照ディスク数から１を引いた値を修復時のＸＯＲ回数、参照ディスク数とディスク容量の積を修復に必要な通信量とする。修復式において値が１の行に該当するディスクのデータのＸＯＲ計算により、故障ディスクに保存されていたデータを修復する。 In the repair formula, the data of the failed disk F ₁ is repaired by XOR of the data stored in the disk corresponding to the column whose value is 1 except for the failed column F ₁ .
At this time, the value obtained by subtracting 1 corresponding to the failure column from the row weight of the restoration formula is the number of reference disks at the time of restoration, and the value obtained by subtracting 1 from the number of reference disks is the number of XORs at the time of restoration, the number of reference disks and the disk capacity. The product is the amount of communication necessary for restoration. Data stored in the failed disk is repaired by XOR calculation of the data of the disk corresponding to the row whose value is 1 in the repair formula.

ｆ＝２の場合、ＲＥの中でＦ｛Ｆ＝Ｆ_１，Ｆ_２｝のＦ_１列目の値が１、Ｆ_２列目の値が０となる行の集合ＲＥ_Ｆを求め、最も行重みの小さい式をＦ_１の修復式とする。ただし、Ｆ_１およびＦ_２は入れ替えてもよい。ディスクＦ_１を修復後、ｆ＝１の場合と同様にディスクＦ_２の修復式を求め、ディスクＦ_２を修復する。 For f = 2, obtains the _{_{F {F = F 1, F}} 2} set RE _F line the value of _{F 1} row becomes 1, _{F 2} column value is 0 in the RE, most row small formula weighted and repair formula F _1. However, F ₁ and F ₂ may be interchanged. After repair the disk _{F 1,} as in the case of f = 1 obtains a repair disc _{F 2,} to repair the disk _{F 2.}

ｆ＝３の場合、ＲＥの中でＦ｛Ｆ＝Ｆ_１，Ｆ_２，Ｆ_３｝のＦ_１列目の値が１、Ｆ_２、Ｆ_３列目の値が０となる行の集合ＲＥ_Ｆを求め、最も行重みの小さい式をＦ_１の修復式とする。ただし、Ｆ_１、Ｆ_２およびＦ_３は入れ替えてもよい。ディスクＦ_１を修復後、ｆ＝２の場合と同様にＦ_２，Ｆ_３の修復式を求め、ディスクＦ_２，Ｆ_３を修復する。 In the case of f = 3, a set of rows RE in which the value of the F _1st column of F {F = F ₁ , F ₂ , F ₃ } is 1 and the value of the F ₂ , F ₃ column is 0 in the RE. seeking _F, the most row weight small equations repair formula F _1. _{However, _F 1, F} ₂ and _{F 3} may be interchanged. After repair the disk _{F 1,} as in the case of f = 2 obtains the repair type _{_F} 2, _F _3, to repair the disk _{_F} 2, _F _3.

ストレージシステムを構成するｎディスクの中でｆ｛ｆ＝１，２，…，ｗ｝ディスクが故障する全組み合わせについて、ＲｅｃｏｖｅｒｙＥｑｕａｔｉｏｎＡｌｇｏｒｉｔｈｍにより修復式を求め、ＢｅｌｉｅｆＰｒｏｐａｇａｔｉｏｎによる復号を行い、ｆディスク修復に必要な通信量の平均値を計算する。 For all combinations in which the f {f = 1, 2,..., W} disk fails among the n disks constituting the storage system, the recovery equation is obtained by the recovery equation algorithm, and decryption is performed by the Belief Propagation to restore the f disk. Calculate the average amount of traffic required.

なお、通信量を最小とする生成行列Ｇが１通りに決定した場合にはそれを記録するとともに、故障ディスクの組合せごとに予め修復式の探索を行い、通信量が最小となる修復式を記録しておくことで、復号毎に修復式の探索を行うことを省略することができる。 In addition, when one generation matrix G that minimizes the traffic is determined, it is recorded, and a repair formula is searched for in advance for each combination of failed disks, and a repair formula that minimizes the traffic is recorded. By doing so, it is possible to omit searching for a repair formula for each decoding.

［第１実施例］
一例として、ストレージシステムを構成するディスク数ｎおよび分割数ｋが小さく、（ｎ−ｋ）及びｗの組み合わせからｋ種類を選ぶ組み合わせを全通り探索可能な場合について、通信量を削減する符号化手法を示す。 [First embodiment]
As an example, an encoding method for reducing the amount of communication in the case where the number of disks n and the number of divisions k constituting the storage system are small and a combination of selecting k types from combinations of (nk) and w can be searched. Indicates.

ディスク容量１（ＴＢ）、ストレージを構成する全ディスク数１２≦ｎ≦２４、データ分割数６≦ｋ≦１８、最大同時故障数ｗ＝３として冗長化を行う場合、符号長ｎ、分割数ｋ、列重みｗのＦｌａｔＸＯＲ符号を用いてデータの分散保存を行う（図１）。 When redundancy is performed with the disk capacity 1 (TB), the total number of disks constituting the storage 12 ≦ n ≦ 24, the data division number 6 ≦ k ≦ 18, and the maximum simultaneous failure number w = 3, the code length n and the division number k The data is distributed and stored using the Flat XOR code having the column weight w (FIG. 1).

前述の非正則の生成行列Ｇの第１の構成方法の手順に従い構成した生成行列Ｇを用いて符号化した非正則構成のＦｌａｔＸＯＲ符号の中で通信量が最小の行列を探索し（図２）、正則構成のＦｌａｔＸＯＲ符号の中で通信量が最小の行列を用いた場合とディスクの故障確率により重み付けした平均通信量を比較した。 A matrix with the smallest traffic is searched for in a non-regular Flat XOR code encoded using the generation matrix G configured according to the procedure of the first configuration method of the irregular generator matrix G described above (FIG. 2). ), The average communication amount weighted by the failure probability of the disk was compared with the case where the matrix having the smallest communication amount in the flat XOR code of the regular configuration was used.

図１０に、本実施例における正則構成のＦｌａｔＸＯＲ符号との平均通信量の比較結果を示す。縦軸は、正則構成のＦｌａｔＸＯＲ符号を用いて測定した平均通信量に対する、本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いて測定した平均通信量の比を示す。本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いた場合、いずれの符号長ｎにおいても、より小さい通信量でデータを保護することができる。 FIG. 10 shows a comparison result of the average communication amount with the flat XOR code having a regular configuration according to the present embodiment. The vertical axis represents the ratio of the average communication amount measured using the non-regular configuration Flat XOR code according to the present embodiment to the average communication amount measured using the regular configuration Flat XOR code. When the non-regular Flat XOR code according to the present embodiment is used, data can be protected with a smaller communication amount at any code length n.

また、図１１に、ＲＳ符号を用いた場合の通信量の比較結果を示す。縦軸は、Ｒｅｅｄ−Ｓｏｌｏｍｏｎ符号を用いて測定した平均通信量に対する、本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いて測定した平均通信量の比を示す。本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いた場合、いずれの符号長ｎにおいても、より小さい通信量でデータを保護することができる。 Further, FIG. 11 shows a comparison result of the traffic when the RS code is used. The vertical axis indicates the ratio of the average traffic volume measured using the Flat XOR code of the irregular configuration according to this embodiment to the average traffic volume measured using the Reed-Solomon code. When the non-regular Flat XOR code according to the present embodiment is used, data can be protected with a smaller communication amount at any code length n.

［第２実施例］
多地点にデータを分散保存するストレージシステムにおいて、ディスク数ｎおよび分割数ｋが大きく、（ｎ−ｋ）及びｗの組み合わせからｋ種類を選ぶ全組み合わせを探索不可能な場合について、ＲＳ符号と比較して通信量を削減する符号化手法を示す。 [Second Embodiment]
Compared with RS code in a storage system that distributes and stores data at multiple points, where the number of disks n and the number of partitions k are large and it is not possible to search for all combinations in which k types are selected from the combinations of (n−k) and w Thus, an encoding method for reducing the communication amount will be described.

ディスク容量１（ＴＢ）、ストレージを構成する全ディスク数３６≦ｎ≦１１６、データ分割数２０≦ｋ≦１００、最大同時故障数ｗ＝３として冗長化を行う場合、符号長ｎ、分割数ｋ、列重みｗのＦｌａｔＸＯＲ符号を用いてデータの分散保存を行う（図１）。 When redundancy is performed with the disk capacity 1 (TB), the total number of disks constituting the storage 36 ≦ n ≦ 116, the data division number 20 ≦ k ≦ 100, and the maximum simultaneous failure number w = 3, the code length n and the division number k The data is distributed and stored using the Flat XOR code having the column weight w (FIG. 1).

前述の非正則生成行列Ｇの第２の構成方法の手順に従い構成した生成行列Ｇを用いて符号化した非正則構成のＦｌａｔＸＯＲ符号と、ＲＳ符号を用いた場合のそれぞれについて、故障ディスク数ｆごとに、修復に必要なＸＯＲ計算回数およびディスクの故障確率により重み付けした平均通信量を算出して比較した。 For each of the non-regular configuration Flat XOR code encoded using the generation matrix G configured according to the procedure of the second configuration method of the non-regular generation matrix G and the RS code, the number of failed disks f Each time, the average communication traffic weighted by the number of XOR calculations required for repair and the failure probability of the disk was calculated and compared.

図１２に、ＲＳ符号とのＸＯＲ計算回数の比較結果を示す。縦軸は、ＲＳ符号を用いた場合の平均ＸＯＲ計算回数に対する、本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いた場合の平均ＸＯＲ計算回数の比を示す。 FIG. 12 shows a comparison result of the number of XOR calculations with the RS code. The vertical axis represents the ratio of the average number of XOR calculations when the non-regular configuration Flat XOR code according to the present embodiment is used to the average number of XOR calculations when the RS code is used.

図１３に、ＲＳ符号との平均通信量の比較結果を示す。縦軸は、ＲＳ符号を用いて測定した平均通信量に対する、本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いて測定した平均通信量の比を示す。 FIG. 13 shows a comparison result of the average traffic with the RS code. The vertical axis represents the ratio of the average traffic measured using the Flat XOR code of the irregular configuration according to the present embodiment to the average traffic measured using the RS code.

図１２及び図１３に示すように、非正則構成の符号を用いた場合、より少ないＸＯＲ計算回数および通信量でデータを保護することができることが分かる。 As shown in FIGS. 12 and 13, it is understood that data can be protected with a smaller number of XOR calculations and communication traffic when using a code with a non-regular configuration.

［第３実施例］
多地点にデータを分散保存するストレージシステムにおいて、ディスク数ｎおよび分割数ｋが大きく、（ｎ−ｋ）及びｗの組み合わせからｋ種類を選ぶ全組み合わせを探索不可能な場合について、正則構成のＦｌａｔＸＯＲ符号と比較して通信量を削減する符号化手法を示す。 [Third embodiment]
In a storage system in which data is distributed and stored at multiple points, the regular number of flats is used in the case where the number of disks n and the number of divisions k are large and it is impossible to search for all combinations in which k types are selected from the combinations of (n−k) and w. An encoding method for reducing the communication amount as compared with the XOR code will be described.

ディスク容量１（ＴＢ）、全ディスク数２６≦ｎ≦１３０、データ分割数１８≦ｋ≦１１３、最大同時故障数ｗ＝３として冗長化を行う場合、符号長ｎ、分割数ｋ、列重みｗの非正則構成ＦｌａｔＸＯＲ符号を非正則構成の生成行列Ｇの第２の構成方法の手順に従って構成する。また、このとき分割数ｋ＝_ｎ−ｋＣ_ｗにおいて正則構成のＦｌａｔＸＯＲ符号の生成行列が１通りに決定される。 When redundancy is performed with the disk capacity 1 (TB), the total number of disks 26 ≦ n ≦ 130, the data division number 18 ≦ k ≦ 113, and the maximum simultaneous failure number w = 3, the code length n, the division number k, and the column weight w The non-regular configuration Flat XOR code is configured according to the procedure of the second configuration method of the non-regular configuration generator matrix G. At this time, the generation matrix of the flat XOR code having the regular configuration is determined in one way with the division number k = _n−k C _w .

非正則構成の生成行列Ｇおよびｋ＝_ｎ−ｋＣ_ｗの正則構成の生成行列Ｇを用いた場合について、ｆディスクの修復に必要なＸＯＲ計算回数およびディスクの故障確率により重み付けした平均通信量を比較した。 In the case of using a non-regular configuration generator matrix G and a regular configuration generation matrix G of k = _n−k C _w , the average communication amount weighted by the number of XOR calculations necessary for repairing the f disk and the failure probability of the disk Compared.

図１４に、正則構成のＦｌａｔＸＯＲ符号とのＸＯＲ計算回数の比較結果を示す。縦軸は、正則構成のＦｌａｔＸＯＲ符号を用いた場合の平均ＸＯＲ計算回数に対する、本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いた場合の平均ＸＯＲ計算回数の比を示す。 FIG. 14 shows a comparison result of the number of XOR calculations with a regular configuration Flat XOR code. The vertical axis indicates the ratio of the average number of XOR calculations when the non-regular configuration Flat XOR code according to the present embodiment is used to the average number of XOR calculations when the regular configuration Flat XOR code is used.

図１５に、正則構成のＦｌａｔＸＯＲ符号との平均通信量の比較結果を示す。縦軸は、正則構成のＦｌａｔＸＯＲ符号を用いて測定した平均通信量に対する、本実施形態に係る非正則構成のＦｌａｔＸＯＲ符号を用いて測定した平均通信量の比を示す。 FIG. 15 shows a comparison result of the average communication amount with the flat XOR code having a regular configuration. The vertical axis represents the ratio of the average communication amount measured using the non-regular configuration Flat XOR code according to the present embodiment to the average communication amount measured using the regular configuration Flat XOR code.

図１４及び図１５に示すように、非正則構成の符号を用いることで、同じ符号長の場合に正則構成の符号より少ないＸＯＲ計算回数および通信量でデータを保護することができる。 As shown in FIGS. 14 and 15, by using a code with a non-regular configuration, data can be protected with a smaller number of XOR calculations and a smaller traffic than a code with a regular configuration when the code length is the same.

以上説明したように、本実施形態に係る非正則構成の生成行列Ｇを用いて符号化したＦｌａｔＸＯＲ符号を用いることで、ＲＳ符号および正則ＦｌａｔＸＯＲ符号と比べ、故障ディスク修復時の通信量および計算負荷を低減する符号を構成することができる。また、従来は非正則のＦｌａｔＸＯＲ符号を適用できなかった符号長１００程度までの場合に広範囲の分割数で非正則符号を構成することができるため、分散数が大きなストレージシステムにおいて計算負荷を抑えた消失訂正符号の適用が可能となる。 As described above, by using the Flat XOR code encoded using the non-regular configuration generator matrix G according to the present embodiment, compared with the RS code and the regular Flat XOR code, the communication amount at the time of failure disk repair and A code that reduces the computational load can be configured. In addition, since the irregular code can be configured with a wide range of division numbers when the code length is up to about 100, where the irregular Flat XOR code could not be applied conventionally, the calculation load is suppressed in a storage system with a large number of distributions. The erasure correction code can be applied.

また、本実施形態では、符号化においてＦｌａｔＸＯＲ符号を用いる例を示したが、本発明は非正則構成の生成行列Ｇを用いた任意の符号化に適用できる。例えば、疎グラフに基づき、ＸＯＲによる符号化を行い、ＲｅｃｏｖｅｒｙＥｑｕａｔｉｏｎＡｌｇｏｒｉｔｈｍによる復号が可能な他の組織符号にも適用可能である。そのような組織符号としては、例えば、ＬＤＰＣ（ＬｏｗＤｅｎｓｉｔｙＰａｒｉｔｙＣｈｅｃｋ）符号が例示できる。 In the present embodiment, an example in which the Flat XOR code is used in the encoding has been described. However, the present invention can be applied to any encoding using a generation matrix G having a non-regular configuration. For example, the present invention can also be applied to other systematic codes that can be encoded by XOR based on a sparse graph and decoded by Recovery Equation Algorithm. As such a systematic code, for example, an LDPC (Low Density Parity Check) code can be exemplified.

本発明は情報通信産業に適用することができる。 The present invention can be applied to the information communication industry.

１０：符号化装置
１１：データ分割部
１２：符号化部
１３：データ分配部
２０：ディスク 10: Encoding device 11: Data dividing unit 12: Encoding unit 13: Data distributing unit 20: Disk

Claims

A zero matrix creation unit that creates a zero matrix based on the number of data divisions and the number of disks storing data;
A predetermined row configuration unit that arranges “1” or “0” in each column of the predetermined row so that the number of “1” arranged in each column has a constant value in the predetermined row in the zero matrix;
Arranged in all rows, is the value obtained by subtracting the predetermined value from the minimum distance of the code in which the number is predetermined in the "1" to place in each column, and a "1", except for the predetermined row before Symbol zero matrix A matrix configuration unit that arranges “1” or “0” in each column of all rows except the predetermined row so that the combination of rows to be different is different for each column;
A generator matrix construction device comprising:

When the predetermined row is 2 or more,
When the matrix configuration unit has the same combination of rows in which “1” is arranged in a plurality of columns in all rows except the predetermined row in the zero matrix, “1” in the predetermined row between the columns. Arrange “1” or “0” in each column of all rows except the predetermined row so that the combinations of the rows in which are arranged are different.
The generator matrix construction device according to claim 1.

A zero matrix creation unit that creates a zero matrix based on the number of data divisions and the number of disks storing data;
Specific row configuration in which “1” or “0” is arranged in each column of the specific one row so that the number of “1” arranged in the row is equal to or greater than a certain value in the specific row in the zero matrix And
For the column in which “1” is arranged in the specific row, the number of “1” s to be arranged in each column in all rows except the specific row in the zero matrix is a predetermined code. The value obtained by subtracting 1 from the minimum distance , and “1” or “0” in each column of all rows except the specific one row so that the combination of rows in which “1” is arranged is different for each column. Place and
Regarding the column in which “0” is arranged in the specific row, the number of “1” arranged in each column is the minimum distance of the code in all the rows except the specific row in the zero matrix. It becomes equal, and "1" so that the combination of rows to place differs for each row, and the matrix component to place "1" or "0" in each column in every row except the particular one line ,
A generator matrix construction device comprising:

The zero matrix, have equal number of columns to the number of divided data, and has a number of rows equal to the number of parity disks obtained from the number of disks for storing data by subtracting the number of divisions of the data, claims 1 to 3 The generator matrix construction device according to any one of the above.

A generator matrix configuration method executed by a generator matrix configuration apparatus,
Creating a zero matrix based on the number of data divisions and the number of disks storing the data; and
A predetermined row configuration step of arranging “1” or “0” in each column of the predetermined row so that the number of “1” arranged in each column has a constant value in the predetermined row in the zero matrix;
Arranged in all rows, is the value obtained by subtracting the predetermined value from the minimum distance of the code in which the number is predetermined in the "1" to place in each column, and a "1", except for the predetermined row before Symbol zero matrix A matrix construction step of arranging “1” or “0” in each column of all rows except the predetermined row so that the combination of rows to be different is different for each column;
A generator matrix construction method for executing

A generator matrix configuration method executed by a generator matrix configuration apparatus,
Creating a zero matrix based on the number of data divisions and the number of disks storing the data; and
Specific row configuration in which “1” or “0” is arranged in each column of the specific one row so that the number of “1” arranged in the row is equal to or greater than a certain value in the specific row in the zero matrix Steps,
For the column in which “1” is arranged in the specific row, the number of “1” s to be arranged in each column in all rows except the specific row in the zero matrix is a predetermined code. The value obtained by subtracting 1 from the minimum distance , and “1” or “0” in each column of all rows except the specific one row so that the combination of rows in which “1” is arranged is different for each column. Place and
Regarding the column in which “0” is arranged in the specific row, the number of “1” arranged in each column is the minimum distance of the code in all the rows except the specific row in the zero matrix. It becomes equal, and "1" so that the combination of rows to place differs for each column, a matrix arrangement step of placing the "1" or "0" in each column in every row except the particular one line ,
A generator matrix construction method for executing