JPS63318628A

JPS63318628A - Horizontally dividing system for data base

Info

Publication number: JPS63318628A
Application number: JP62155731A
Authority: JP
Inventors: Tatsuo Minohara; 箕原　辰夫; Shunichiro Nakamura; 俊一郎中村
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1987-06-23
Filing date: 1987-06-23
Publication date: 1988-12-27
Anticipated expiration: 2010-09-06
Also published as: JPH0782451B2

Abstract

PURPOSE:To contrive the equalization of the number of tuples stored in a disk device automatically, by dividing a page into two, when tapple storing pages are full, storing the tuple in one of them, and sending one page to the disk device having the smallest number of pages. CONSTITUTION:It is assumed that a tuple in which key values of an integer value are between 1-100 is stored in a page 3c in a disk device 2e. In case of storing the tuple having a key value of the key values between 1-100, in this page 3c, since the page in full, it is divided into two. Subsequently, the tuple having a key value between 1-50 is stored in the original page 3c, and the tuple having a key value between 51-100 is stored in the other page 3d. A disk 2f is constituted of the tuple of a relation to which its tuple selected in order to store a page 3d belongs, and its number of pages is the smallest. After the page has been divided, a clustered index is reorganized, an operation for determining the disk device to be stored, and the page is repeated, and unless the page is full, the tuple is stored in the determined page.

Description

【発明の詳細な説明】ｒ産業上の利用分野〕この発明ハ、リレーショナルデータベース管理システム
におけるデータベースの格納方式に関する本のである。[Detailed Description of the Invention] r Industrial Application Field] This invention is a book related to a database storage method in a relational database management system.

（従来の技術〕リレーシロナルデータベース管理システムにおいて、デ
ータベースは第３図に示したような表の集まりである。(Prior Art) In a relay serial database management system, a database is a collection of tables as shown in FIG.

個々の表はリレーション（５）と呼ばれ９表の各項目は
アトＩＪビュート（６）、各アトリビユートに実際にデ
ータが入った１つのレコードはダブル（７）と呼ばれる
。Each table is called a relation (5), each item in the nine tables is called an atto IJ butte (6), and one record that actually contains data in each attribute is called a double (7).

次に、第４図に示すような複数のデータ処理装置ｉ　１
８ａｌ〜（８ｄ）がネッ°トワーク（９）を介して、複
数のディスク装置１２ｇ）〜（２ｊ）に接続されている
ようなシステム上で動作するリレーシロナルデータベー
ス管理システムにおいて、１つのりレーションを複数の
ディスク装置（２ｇ）〜（２Ｊ）に分割して格納してお
けば、複数のデータ処理装置ｔ８ａＪ　〜１８ｄ）　Ｊ
’ｔ　、並列にディスク装置１２ｇ）　〜１２ｊ）に格
納されているリレーションのデータにアクセスすること
ができる。このとき、リレーシヲンをダブル単位で横に
分割１．ていくことを水平分割化と呼ぶ。Next, a plurality of data processing devices i 1 as shown in FIG.
In a relay serial database management system that operates on a system in which disk devices 8al to 8d are connected to a plurality of disk devices 12g) to 2j via a network (9), one By dividing and storing data in multiple disk devices (2g) to (2J), multiple data processing devices t8aJ to 18d) J
't, relation data stored in the disk devices 12g) to 12j) can be accessed in parallel. At this time, divide the relation horizontally into double units.1. This process is called horizontal partitioning.

この水平分割化において、格納するりレーションがクラ
スタードインデックスを有していない場合は、各ダブル
を挿入する際、その時点で一番少ない量のダブルを保持
しているディスク装置へそのダブルを格納すれば、その
リレーションに対して各ディスク装置が持つダブルの量
がほぼ均等になるような形でリレーションを格納してお
くことができる。In this horizontal partitioning, if the storage partition does not have a clustered index, when each double is inserted, it is stored in the disk device that holds the least amount of doubles at that time. Then, the relation can be stored in such a way that the amount of doubles that each disk device has for that relation is approximately equal.

このように均等にリレーションを分割すると、複数のデ
ータ処理装置（８ａ）〜（８ｄ）が、各ディスク装置（
２ｇ）〜（２ｊ）から、　１１は等しい時間でデータを
読み込むことができるので、あるデータ処理装置がデー
タを既に読み終ってしまったのに、他のデータ処理装置
がまだデータを読み終わっていないということがなくな
り、処理速度が向上できる。If the relations are divided equally in this way, a plurality of data processing devices (8a) to (8d) will be divided into each disk device (8a) to (8d).
From 2g) to (2j), 11 can read data in equal time, so even if one data processing device has already finished reading the data, another data processing device has not finished reading the data yet. This eliminates this problem, and the processing speed can be improved.

ｒ発明が解決しようとする問題点〕従来の水平分割化方式は以上のように構成されているが
、これはクラスタードインデックスを持っていないリレ
ーションに対する分割化方式なので、クラスタードイン
デックスを持つリレーションを分割化することができず
、クラスタードインデックスを利用することによって。Problems to be Solved by the Invention] The conventional horizontal partitioning method is configured as described above, but since this is a partitioning method for relations that do not have a clustered index, By using a clustered index, which cannot be partitioned.

リレーション内のダブルに高速にアクセスすることがで
きるような処理を実現することができないという問題点
があった。There is a problem in that it is not possible to implement processing that allows high-speed access to doubles within a relation.

この発明は、上記のような問題点を解消するためになさ
れたもので、クラスタードインデックスを持つリレーシ
ョンを、均等に分割できるようなデータベースの水平分
割化方式を得ることを目的としている。This invention was made to solve the above-mentioned problems, and aims to provide a horizontal database partitioning method that can evenly partition relations with clustered indexes.

（問題点を解決するための手段〕この発明に係るデータベースの水平分割化方式は、Ｂ−
ｔｒｅｅで構成されるクラスタードインデックスを持つ
リレーションを、複数のディスク装置に格納するとき罠
、ディスク装置内の物理的なページがそのリレーション
のダブルによって酒杯になったとき、そのページを２つ
に分割させて９分割したページのうち片方のページヲ、
ソのリレーションに対するダブルを保持するページが一
番少ないディスク装置に格納することによって、データ
ベースの水平分割を均等に行なえるようにしたものであ
る。(Means for Solving the Problems) The database horizontal partitioning method according to the present invention is B-
This is a trap when storing a relation with a clustered index consisting of a tree on multiple disk devices, and when a physical page in the disk device becomes a cup due to the double of that relation, the page is divided into two. One of the pages divided into 9 parts,
The database can be evenly divided horizontally by storing the pages that hold doubles for each relation in the disk device that has the least number of pages.

[Effect]

この発明におけるデータベースの水平分割化方式は、ク
ラスタードインデックスを持つリレーションのダブルを
ディスク装置に格納する際、そのダブルを格納するペー
ジが満杯のとき。In the horizontal database partitioning method of this invention, when a double of a relation with a clustered index is stored in a disk device, the page storing the double is full.

ページを２分割し、そのいずれかにそのダブルを格納し
た後１片方のページを一番少ないページを持つディスク
装置へ送ることにより、自動的にディスク装置に格納さ
れるダブルの数がほぼ均等である分割状態を維持する。By dividing a page into two, storing the double in one of them, and then sending one page to the disk device with the least number of pages, the number of doubles automatically stored in the disk device is approximately equal. Maintain a certain division state.

Ｃ発明の実施例〕以下、この発明の一実施例を図について説明する。＠１
図において、（１）はクラスタードインデックスである
。クラスタードインデックスは＊　Ｃｏｍｐｕｔｅｒ　
５ｃｉｅｎｃｅ　Ｐｒｅｓｓ　Ｉｎｃ、社が版権を持つ
Ｊ、ｒ）、　Ｕｌ　１ｍａｎが著した”　Ｐｒ１ｎｅｉ
ｐｌｅ　ｏｆ　ＤａｔａｂａｓｅＳｙｓｔｅｍｓ　’　
を邦訳　日本コンビエータ協会版権、国井利泰訳「デー
タベース・システムの原理」）の第２．４節で説明され
ている。Ｂ−ｔｒｅｅで構成されているインデックスで
、かつ、クラスタードインデックスを持つリレーション
の各ダブルは、クラスタードインデックスが指定されて
いるア）　＋７ピエートの値に基いて、物理的に格納さ
れている状態においてもソートされている。C Embodiment of the Invention] Hereinafter, an embodiment of the invention will be described with reference to the drawings. @1
In the figure, (1) is a clustered index. Clustered index is * Computer
Copyrighted by 5science Press Inc.
ple of Database Systems'
It is explained in Section 2.4 of ``Principles of Database Systems'' (Japanese translation, copyright of the Japan Combiator Association, translated by Toshiyasu Kunii). In an index composed of a B-tree, each double of a relation with a clustered index is physically stored based on the value of a) +7 pietes, where the clustered index is specified. are also sorted.

各ダブルは、ディスク装Ｋ　１２ａ）〜（２ｄ）の物理
的な記憶単位であるページ（３ａ）〜（３ｅ）の中に。Each double is in a page (3a) to (3e) which is a physical storage unit of the disk drive K 12a) to (2d).

ノートされた順番で格納されている。第１図では、ｌ−
１０００の範囲の整数値を持つアトＩＪビエートに、ク
ラスタードインデックスが付いているときの格納例を示
ｔ７たものであるが１例えばキー傳が７３よシも小さい
値を持つダブルは、ディスク装置（２ａ）のページ（３
ａ）に、７３から１８６の間のキー値金持つダブルは、
ディスク装＠　ｔ２ｂｌのページ（コ市）に格納されて
いる。このように、ダブルのキー値を指定すれば゛、ク
ラスタードインデックスの持つポインタ（４１によって
、ディスク装置とページが決定される。They are stored in the order they were noted. In Figure 1, l-
t7 shows an example of storage when a clustered index is attached to an integer value in the range of 1000. Page (2a) (3
In a), a double with a key value between 73 and 186 is
It is stored on the disk installation @ t2bl page (Ko City). In this way, if a double key value is specified, the disk device and page are determined by the pointer (41) of the clustered index.

クラスタードインデックスを持つリレーションのダブル
をディスク装置に格納するときは。When storing doubles of a relation with a clustered index on a disk device.

第５図のフローチャートに示すように、ダブルのキー値
によυ、クラスタードインデックスを参照して格納すべ
きディスク装置及びページを決め＋１０１）、決定さｈ
たページにそのダブルを格納したら、ページがあふれる
かどうか調査する（１０２）、この調査により、あふれ
るようならばページの内容を２分する。２分されたうち
の片方のページは、一番少ないダブル数を持つディスク
装置へ転送される＋１０３）。！２図は、この様子を示
したもので、ディスク装＠　１２ｅ）中のページ（３ｃ
）の中に、整数値のキー値が１〜１０００間のダブルが
格納されているとき、１〜１００間の値を持つダブルを
このページ（３ｃ）に格納しようとした場合、ページが
満杯なので、２分され１元のページ１３ｃ）　Ｋ　Ｆｉ
１〜５０間のキー値を持つダブルが、もう一方のページ
（３ｄ）には、５１〜１００間のキー値を持つダブルが
格納されるようになったことを示している。ディスク装
置（２ｆ）は、ページ（３ｄ）を格納するために選ばれ
た、そのダブルが属するリレーションのダブルから構成
されるページの数が一番少ないものである。ページを分
割後は、クラスタードインデックスを再編成しく１０４
）、ステップ＋１０１１からの動作を繰シ返す。ステッ
プ（１０２１で、ページがあふれないという結果が検出
されれば、ダブルをステップ（１０１）において決定さ
れたページに格納する。As shown in the flowchart in Figure 5, the disk device and page to be stored are determined by referring to the clustered index by the double key value υ+101), and the
After storing the double in the page, it is investigated whether the page overflows (102).If the double is stored in the page, the contents of the page are divided into two if the page overflows. One of the divided pages is transferred to the disk device with the smallest number of doubles +103). ! Figure 2 shows this situation, and the page (3c
) stores a double with an integer key value between 1 and 1000, and if you try to store a double with a value between 1 and 100 on this page (3c), the page is full, so , 2 parts and 1 original page 13c) K Fi
This shows that doubles with key values between 1 and 50 are now stored in the other page (3d), and doubles with key values between 51 and 100 are now stored. The disk device (2f) is selected to store the page (3d) and has the least number of pages composed of doubles of the relation to which the double belongs. After dividing the page, reorganize the clustered index104
), the operations from step +1011 are repeated. In step (1021), if it is detected that the page does not overflow, the double is stored in the page determined in step (101).

なお、上記の実施例では、第４図に示す如くネットワー
ク（９）によって、ディスク装置（２ｇ）〜（２ｊ）と
データ処理装置１８ａ）〜（８ｄ）を結んでいるという
システム構成について説明したが、このネットワーク（
９）の形ＷＡは９例えばリング型や単一バス型等でもよ
く、一般にこれらに類したものであれば上記実施例と同
様の効果１［する。In the above embodiment, a system configuration was described in which the disk devices (2g) to (2j) and the data processing devices 18a) to (8d) are connected by a network (9) as shown in FIG. , this network (
The shape WA of 9) may be, for example, a ring type or a single bus type, and in general, if it is similar to these, it will have the same effect as the above embodiment.

（発明の効果〕以上のように、この発明によれば、クラスタードインデ
ックスを有する場合にも、リレーションの水平分割を均
等に行なうことができるようＫしたので、複数のデータ
処理装置が複数のディスク装置を同時にアクセスできる
ようなシステムにおいて、クラスターインデックスを利
用した高速処理の他に、レコードの全件サーチ処理等に
おいても各データ処理装置が等しい時間でデータを読み
込むことができ、全体として最小の時間でデータを読み
込むことができるという効果がある。(Effects of the Invention) As described above, according to the present invention, relations can be horizontally divided evenly even when a clustered index is provided, so that multiple data processing devices can process multiple disks. In a system that allows devices to be accessed simultaneously, in addition to high-speed processing using cluster indexes, each data processing device can read data in the same amount of time during all-record search processing, reducing the overall time to a minimum. This has the effect of being able to read data with .

[Brief explanation of the drawing]

第１図はこの発明の一実施例によるクラスタードインデ
ックスを持つリレーションの分割状態の例會示す説明図
、第２図はこの発明の一実施例によるページの分割の例
を示す説明図、第３図はりレーシッナルデータベースに
おけるリレーションの例を示す説明図、第４図はこの発
明を実施するために必要なシステム構成の例を示す説明
図、第５図はこの発明の一実施例によるプログラムのフ
ローチャートを示す説明図である。、（１＋１ｉクラス
タートインデツクス、　　（２ａＪ〜（２ｆ）はディス
ク装置、　　１３ａ）　〜（３ｅ）はページ、（４）は
インデックスのポインタ、（５）はリレーション、（６
）社アトリビエート、（７］はダブル、　　（８ａｌ〜
（８ｄ）はデータ処理装置、（９）はネットワークであ
る。なお９図中同一符号は、各々同−又は相当部分を示す。FIG. 1 is an explanatory diagram showing an example of how a relation with a clustered index is divided according to an embodiment of the present invention, FIG. 2 is an explanatory diagram showing an example of page division according to an embodiment of the present invention, and FIG. FIG. 4 is an explanatory diagram showing an example of a relation in a relational database, FIG. 4 is an explanatory diagram showing an example of a system configuration necessary to implement this invention, and FIG. 5 is a flowchart of a program according to an embodiment of this invention. FIG. , (1+1i cluster index, (2aJ to (2f) are disk devices, 13a) to (3e) are pages, (4) is an index pointer, (5) is a relation, (6
) Company Attribute, (7] is double, (8al~
(8d) is a data processing device, and (9) is a network. Note that the same reference numerals in Figure 9 indicate the same or corresponding parts.

Claims

[Claims]

In a relational database management system in which one relation in a database is horizontally divided into double units and stored on multiple disk devices, when a relation with a clustered index is stored on multiple disk devices, the disk When a physical page in the device becomes full with doubles for that relation, the page is split into two, and one of the pages is used as the page that holds the double for that relation. A database horizontal partitioning method characterized in that the amount of doubles stored in each disk device is made almost equal by storing data in the smallest disk device.