JP2910847B2

JP2910847B2 - Array type disk controller

Info

Publication number: JP2910847B2
Application number: JP9285752A
Authority: JP
Inventors: 辰夫的場; 圭一依光
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-09-20
Filing date: 1997-10-17
Publication date: 1999-06-23
Anticipated expiration: 2017-10-17
Also published as: JPH1083259A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、入出力装置を上位
装置からの命令に基づいて制御する入出力制御装置に関
し、特に入出力装置の異常発生時には異常のある入出力
装置に代えて使用される予備用入出力装置の制御を含む
監視制御の改善に関する。近年の計算機システムにおけ
る高信頼化の要求に伴い、入出力サブシステムでの高信
頼化が要求されている。このため、予備用入出力装置を
有する入出力サブシステムが提供されている。この入出
力サブシステムでは予備用入出力装置は通常時には使用
されず、他の現用の入出力装置に読み取りエラーや故障
等の異常が発生したときに初めて使用される。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an input / output control device for controlling an input / output device based on a command from a higher-level device. Monitoring control including control of a backup input / output device. With the recent demand for higher reliability in computer systems, higher reliability is required for input / output subsystems. For this reason, an input / output subsystem having a spare input / output device has been provided. In this input / output subsystem, the spare input / output device is not used in normal times, but is used only when an abnormality such as a reading error or a failure occurs in another current input / output device.

【０００２】このとき予備用入出力装置が正常に動作し
て予備用入出力装置としての役割を果たすためには、事
前に予備用入出力装置の正常／異常を監視し、ハードエ
ラーや読み取りエラー等の予防保守情報を上位装置に提
供する手段を入出力サブシステムにおいて持つ必要があ
る。At this time, in order for the spare I / O device to operate normally and fulfill the role of the spare I / O device, the normal / abnormal state of the spare I / O device is monitored in advance, and hardware errors and read errors are monitored. It is necessary for the input / output subsystem to have a means for providing preventive maintenance information such as the above to the host device.

【０００３】[0003]

【従来の技術】異常のある入出力装置に代えて予備用入
出力装置を予防保守情報を上位装置に提供する入出力制
御装置として、従来よりアレー形記憶装置であるアレー
型磁気ディスク制御装置が知られている。このアレー型
磁気ディスク制御装置は、上位装置とのインタフェース
制御部、下位装置とのインタフェース制御部、プロセッ
サ及び各種レジスタを有し、下位装置として複数台のデ
ータディスク装置、１台のパリティ用ディスク装置及び
１台の予備用ディスク装置が接続され、これらを同期回
転制御している。2. Description of the Related Art As an input / output control device for providing preventive maintenance information to a host device using a spare input / output device instead of a faulty input / output device, an array type magnetic disk control device, which is an array type storage device, has conventionally been used. Are known. This array type magnetic disk controller has an interface controller for an upper device, an interface controller for a lower device, a processor, and various registers. As the lower device, a plurality of data disk devices and one parity disk device are provided. And one spare disk device are connected and synchronously controlled.

【０００４】アレー型磁気ディスク制御装置は複数台の
データディスク装置及びパリティ用ディスク装置の各々
からデータ及びパリティを並行して読み出し、データの
異常の有無を検査する。データに異常がない通常時はデ
ータはこのまま上位装置に転送される。しかし、データ
に異常有りと検出されたときは、アレー型磁気ディスク
装置は異常のあるデータを読み出した１台のディスク装
置以外の、残りの正常なデータディスク装置とパリティ
ディスク装置から読み出されたデータ及びパリティに基
づいて、正しいデータを復元して上位装置に転送する。The array type magnetic disk controller reads data and parity from each of a plurality of data disk devices and parity disk devices in parallel, and checks for data abnormality. Normally, when there is no abnormality in the data, the data is transferred to the host device as it is. However, when it is detected that the data has an abnormality, the array-type magnetic disk drive reads data from the remaining normal data disk device and the parity disk device other than the one disk device that has read the abnormal data. Based on the data and parity, correct data is restored and transferred to the host device.

【０００５】現用の入出力装置であるデータディスク装
置の１台又はパリティ用ディスク装置に何らかの理由で
読み取りエラーや故障等の異常が規定回数以上発生した
ときは、その段階で残りの正常なデータディスク装置と
パリティ用ディスク装置から、異常なデータディスク装
置又は異常なパリティ用ディスク装置に書かれている全
てのデータを自動的に予備用ディスク装置に復元し、こ
の後は現用入出力装置として異常発生ディスク装置に代
えて使用することにより、所要の処理動作を続行させ、
システムダウンを防止することができる。If an error such as a reading error or a failure occurs for a certain number of times in one of the data disk devices or the parity disk device which is the current input / output device, the remaining normal data disk is not restored. From the device and parity disk unit, all data written on the abnormal data disk unit or abnormal parity disk unit is automatically restored to the spare disk unit. By using it in place of a disk device, required processing operations can be continued,
System down can be prevented.

【０００６】[0006]

【発明が解決しようとする課題】しかし、予備用ディス
ク装置自体にハードエラーや読み取りエラーがあって
も、そのことが上位装置には全くわからず、異常発生デ
ィスク装置に代えて初めて予備用ディスク装置を入出力
装置として使用するときに、予備用ディスク装置自体の
ハードエラーや読み取りエラー等が発見され、予備用デ
ィスク装置としての役割を果たせないことがある。However, even if there is a hard error or a read error in the spare disk device itself, the error is not known to the upper-level device at all, and the spare disk device is replaced by the spare disk device for the first time. When using as an input / output device, a hardware error, a reading error, or the like of the spare disk device itself may be found, and the spare disk device may not function as a spare disk device.

【０００７】そこで、従来複数のユニットからなる通常
系と、通常系のユニットの予備ユニットを有する装置に
おいて、予備ユニットに対し通常系の任意のユニットと
同一の入力信号を与え、予備ユニットと上記任意のユニ
ットの両出力信号を比較することにより、予備ユニット
を監視する方式が知られている（特開昭５６−７２３５
９号公報）。しかし、この監視方式は通常系の現用ユニ
ットと同時に常時予備ユニットを稼動する必要があるた
め、前記した入出力制御装置では制御する入出力装置が
実質的に１台増えることと等価であり、上位装置の負荷
が大きい。Therefore, in a conventional apparatus having a normal system including a plurality of units and a spare unit of the normal system unit, the same input signal as that of an arbitrary unit of the normal system is given to the spare unit, and the spare unit and the above optional unit are supplied. A method of monitoring a spare unit by comparing both output signals of the two units is known (JP-A-56-7235).
No. 9). However, this monitoring method requires that the standby unit always be operated at the same time as the normal working unit. Therefore, the input / output control device described above is equivalent to substantially increasing the number of input / output devices to be controlled by one. The load on the device is large.

【０００８】また、入出力制御装置の異常を時間監視タ
イムテーブルに従って監視する方式（特開昭６２−２１
２８５６号公報）も知られているが、このものは予備用
入出力装置に対しての異常を監視するものではなく、ま
た処理の空き時間を利用したものでもない。更に、従来
より中央処理装置（ＣＰＵ）が現用と予備用の二重化構
成された装置において、予備系のＣＰＵでもソフトウェ
アを走行させて実動作に近い形態で診断を行なう監視方
式（特開平２−９３９５３号公報）及び予備系のＣＰＵ
を一定周期で診断を行なう監視方式（特開昭６２−９０
０６８号公報）なども従来より知られている。しかし、
上記のＣＰＵが二重化された装置の各監視方式はいずれ
も処理の空き時間を利用して予備ＣＰＵの監視を行なう
ものではない。Further, a method of monitoring an abnormality of the input / output control device according to a time monitoring time table (Japanese Patent Laid-Open No. 62-21)
No. 2856) is also known, but this does not monitor an abnormality in the spare input / output device, nor does it use a process idle time. Further, in a conventional system in which a central processing unit (CPU) is configured in a duplex configuration for working and standby, a monitoring system in which a standby CPU runs software and diagnoses in a manner similar to actual operation (Japanese Patent Laid-Open No. 2-93953). Publication) and a standby CPU
Monitoring system for diagnosing the data at regular intervals (Japanese Patent Laid-Open No. 62-90)
No. 068) has been conventionally known. But,
None of the above-mentioned monitoring methods of the apparatus having a duplicated CPU monitor the spare CPU using the idle time of the processing.

【０００９】本発明は以上の点に鑑みてなされたもの
で、予備用入出力装置及び／又は現用の入出力装置の状
態を監視する際、上位装置からの命令の処理実行を行な
っていない処理空き時間を利用し、特に予備用ディスク
装置（入出力装置）を使用しないときであっても、電源
を常時投入して予備用ディスク装置を常時レディ状態に
しておき、予備用ディスク装置を含めた各装置の異常判
定が行えるアレイ型ディスク制御装置を提供することに
ある。SUMMARY OF THE INVENTION The present invention has been made in view of the above, and when monitoring the status of a spare input / output device and / or a current input / output device, a process that does not execute a command from a host device. Even when the spare disk device (input / output device) is not used, the power is always turned on and the spare disk device is always ready, and the spare disk device is included. An object of the present invention is to provide an array-type disk control device capable of determining abnormality of each device.

【００１０】[0010]

【課題を解決するための手段】請求項１に記載の発明
は、上位装置との間で転送されるデータと当該データか
ら作成した冗長データとがそれぞれ格納される複数台の
ディスク装置（３０）と、一台の予備用ディスク装置
（３１）とを含んで成る論理デバイスが接続される下位
装置インタフェース制御部（２６）を有し、該下位装置
インタフェース制御部は、上位装置（１０）に接続さ
れ、前記上位装置（１０）から発行される前記複数台の
ディスク装置（３０）及び前記予備用ディスク装置（３
１）への動作命令を受けるチャネルインタフェース制御
部（２１）と、該チャネルインタフェース制御部を介し
て上位装置から入力されたコマンドに基づき、前記デー
タを格納する複数台のディスク装置の読み出し及び書き
込み動作を制御するとともに、前記データを格納する論
理テバイスに含まれる複数台のディスク装置が正常な場
合には前記データを格納する複数台のディスク装置を制
御し、前記データを格納する論理デバイスに含まれる複
数台のディスク装置のいずれかにエラーが発見された場
合には正常なディスク装置及び予備用ディスク装置を制
御する動作制御部と、前記論理デバイス中の複数のディ
スク装置及び予備用ディスク装置のそれぞれに対し電力
を供給する電力供給手段と、を備え、当該論理デバイス
中の全てのディスク装置を、リードライトコマンド及び
シークコマンドの実行が可能な状態に常に保持しておく
こと、を特徴とするアレイ型ディスク制御装置である。According to the first aspect of the present invention, there are provided a plurality of disk devices (30) for storing data transferred to and from a higher-level device and redundant data created from the data, respectively. And a lower-level device interface controller (26) to which a logical device including a spare disk device (31) is connected. The lower-level device interface controller connects to a higher-level device (10). And the plurality of disk devices (30) and the spare disk device (3) issued from the higher-level device (10).
A channel interface controller (21) for receiving an operation command to 1), and read / write operations of a plurality of disk devices for storing the data based on a command input from a higher-level device via the channel interface controller. When the plurality of disk devices included in the logical device storing the data are normal, the plurality of disk devices storing the data are controlled and included in the logical device storing the data. An operation control unit that controls a normal disk unit and a spare disk unit when an error is found in any of the plurality of disk units; and an operation control unit that controls the plurality of disk units and the spare disk unit in the logical device. Power supply means for supplying power to the logical device. The device, always be held in the execution state capable of read and write commands and seek command is an array type disk control apparatus according to claim.

【００１１】上記構成によれば、全てのディスク装置に
電力を供給するのみならず、上位装置からアクセスがあ
った時には直ちに応答して動作できるレディ状態、すな
わちリードライトコマンド及びシークコマンドの実行が
可能な状態に常に保持しておくことで、冗長性のある、
すなわち、予備ディスク装置を常に正常な状態にしてお
くアレイ型ディスク制御装置が実現できる。この点で、
待機中の予備装置に常に電源を供給するのみで予備装置
をレディ状態に保持することをしない公知のホットスタ
ート方式とは異なるものである。すなわち、単に予備装
置は、通電されているだけでは動作不可能であり、正常
に動作するためにはレディ状態になっている必要があ
る。本発明は予備ディスク装置を含めた各装置をレディ
状態とすることで、これらの異常判定が行えるようにし
たものである。According to the above configuration, not only is it possible to supply power to all the disk devices, but also it is possible to execute a ready state in which the disk device can operate immediately in response to an access from a host device, that is, to execute a read / write command and a seek command. By keeping it in a consistent state,
That is, it is possible to realize an array-type disk controller that always keeps the spare disk device in a normal state. In this regard,
This is different from the known hot start method in which the standby apparatus is always supplied with power only and the standby apparatus is not kept ready. That is, the standby device cannot operate simply by being energized, but must be in a ready state to operate normally. The present invention makes it possible to judge these abnormalities by putting each device including the spare disk device in a ready state.

【００１２】[0012]

【発明の実施の形態】本発明をより詳細に説明するため
に、以下添付図面に従って説明する。図１は本発明の原
理構成図を示す。図１において、入出力制御装置２０は
上位装置インタフェース制御部２１、検査手段２２、制
御手段２３、判定手段２４、予防保守情報格納用レジス
タ２５、下位装置インタフェース制御部２６及びパトロ
ール用タイマレジスタ２７を有する。上位装置インタフ
ェース制御部２１は上位装置１０から発行された命令を
受信する。下位装置インタフェース制御部２６は複数台
の入出力装置３０と１台の予備用入出力装置３１に夫々
接続されている。検査手段２２は入出力装置３０よりデ
ータ及び誤り検出符号を並行して読み出し、読み出した
データと誤り検出符号とを用いて誤り検査を行ない、入
出力装置３０のデータの正常性を検査する。DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be described in more detail with reference to the accompanying drawings. FIG. 1 shows a principle configuration diagram of the present invention. In FIG. 1, the input / output control device 20 includes a higher-level device interface control unit 21, an inspection unit 22, a control unit 23, a determination unit 24, a preventive maintenance information storage register 25, a lower-level device interface control unit 26, and a patrol timer register 27. Have. The host device interface control unit 21 receives a command issued from the host device 10. The lower-level device interface control unit 26 is connected to a plurality of input / output devices 30 and one spare input / output device 31, respectively. The checking means 22 reads the data and the error detection code from the input / output device 30 in parallel, performs an error check using the read data and the error detection code, and checks the normality of the data of the input / output device 30.

【００１３】制御手段２３は、検査手段２２の検査結果
がすべての入出力装置３０の読み出しデータが正常であ
ることを示しているときは、入出力装置３０のみを選択
し、上位装置１０からの命令に従い、入出力制御装置３
０の動作を制御する。従って、このときは予備用入出力
装置３１は使用されない。一方、検査手段２２の検査結
果が複数台の入出力装置３０のうち１台の入出力装置の
読み出しデータに異常があり、これが規定回数を越えた
場合、下位装置インタフェース制御部２６は異常発生し
た入出力装置以外の残りの正常な入出力装置から読み出
されるデータ及び誤り検出符号から、異常発生入出力装
置に書き込まれている全てのデータを復元し、復元した
データを予備用入出力装置３１に書き込む。従って、以
後、予備用入出力装置３１は異常発生入出力装置の代わ
りに現用の入出力装置として使用される。The control means 23 selects only the input / output device 30 when the inspection result of the inspection means 22 indicates that the read data of all the input / output devices 30 is normal, and According to the instruction, the input / output control device 3
0 operation is controlled. Therefore, at this time, the spare input / output device 31 is not used. On the other hand, if the inspection result of the inspection means 22 indicates that the read data of one input / output device among the plurality of input / output devices 30 has an abnormality, and this number exceeds a specified number, the lower-level device interface control unit 26 has generated an abnormality. From the data read from the remaining normal input / output devices other than the input / output device and the error detection code, all data written to the abnormal input / output device is restored, and the restored data is transferred to the spare input / output device 31. Write. Therefore, the spare input / output device 31 is used as a working input / output device instead of the abnormal input / output device.

【００１４】判定手段２４は上位装置１０から上位装置
インタフェース制御部２１を介して入力される命令の処
理実行を制御手段２３が行なっていない処理空き時間
に、下位装置インタフェース制御部２６を介して入出力
装置及び予備用入出力装置３１又は予備用入出力装置３
１を選択し、選択した入出力装置３０及び予備用入出力
装置３１又は予備用入出力装置３１が正常動作するか否
か判定する。The judging means 24 inputs, via the lower-level device interface control unit 26, an idle time during which the control unit 23 does not execute the processing of the command input from the higher-level device 10 via the higher-level device interface control unit 21. Output device and spare I / O device 31 or spare I / O device 3
1 is selected, and it is determined whether or not the selected I / O device 30 and backup I / O device 31 or backup I / O device 31 operate normally.

【００１５】判定手段２４により異常と判定された時
は、その異常に関する情報が予防保守情報格納用レジス
タ２５に格納される。予防保守情報として後に上位装置
１０へ通知するためである。また、前記判定手段２４が
判定を行なう処理空き時間は、パトロール用タイマレジ
スタ２７の計数値から判定される。上記の判定手段２４
は予備用入出力装置３１に対してデータの書き込み、あ
るいはシーク動作終了後に予備用入出力装置３１から送
出される信号がハードエラーを示しているときは異常と
判定する。また、判定手段２４は予備用入出力装置３１
に対してデータの読み出しを実行し、その読み出しデー
タに読み取りエラーがあるときは異常と判定する。When it is determined by the determining means 24 that an abnormality has occurred, information relating to the abnormality is stored in the preventive maintenance information storage register 25. This is to notify the host device 10 later as preventive maintenance information. The idle time for which the determination means 24 makes a determination is determined from the count value of the patrol timer register 27. The above determination means 24
Is determined to be abnormal when a signal sent from the spare input / output device 31 after writing data or seeking operation to the spare input / output device 31 indicates a hard error. Further, the determination means 24 includes a spare input / output device 31.
Is read, and if the read data has a read error, it is determined to be abnormal.

【００１６】かかる構成の入出力制御装置２０によれ
ば、予備用入出力装置３１が使用されないときであって
も、処理空き時間を利用して予備用入出力装置３１の異
常の有無を検出できる。また、予備用入出力装置３１に
異常があるときは予防保守情報格納用レジスタ２５から
の予防保守情報が上位装置１０に通知されるので、上位
装置１０はその予防保守情報に基づき予防用入出力装置
３１自体のハードエラーや読み取りエラーを確認でき
る。According to the input / output control device 20 having such a configuration, even when the standby input / output device 31 is not used, it is possible to detect the presence or absence of the abnormality of the standby input / output device 31 by utilizing the idle time of processing. . Further, when there is an abnormality in the spare input / output device 31, the preventive maintenance information from the preventive maintenance information storage register 25 is notified to the host device 10. A hardware error or a reading error of the device 31 itself can be confirmed.

【００１７】また、入出力制御装置２０によれば、予備
用入出力装置３１に限らず、入出力装置３０の異常の有
無も検出できる。この場合、判定手段２４は入出力装置
３０に対してシーク動作を実行した後、入出力装置３０
からハードエラー信号が入力されたとき異常と判定す
る。この入出力制御装置２０は例えば図２に２０₁，２
０₂で示す如く２組設け、上位装置１０を共有し、か
つ、複数の論理デバイス４０₀〜４０_Nを共有するよう
にしてもよい。ここで、論理デバイス４０₀〜４０_Nの
各々は複数台の入出力装置３０と１台の予備用入出力装
置３１とからなる。According to the input / output control device 20, not only the spare input / output device 31 but also the presence / absence of an abnormality of the input / output device 30 can be detected. In this case, the determination unit 24 performs a seek operation on the input / output
Is determined to be abnormal when a hard error signal is input from the CPU. The output control unit 20 in FIG. 2, for example 20 _1, 2
0 ₂ provided two sets as shown, to share the high-level equipment 10, and may share a plurality of logical devices 40 ₀ to 40 _N. Here, consisting of the logical device 40 _0-40 each of _N and a plurality of input-output device 30 one spare output device 31. of.

【００１８】かかる構成によれば、入出力制御装置２０
₁と２０₂が、上位装置１０よりの命令に基づき夫々異
なる論理デバイスを同時に選択して制御することができ
る。次に図３の一実施例について説明する。同図中、ア
レー型磁気ディスク制御装置６０は前記入出力制御装置
２０に相当する。アレー型磁気ディスク制御装置６０は
上位装置インタフェース制御部２１に相当するチャネル
インタフェース制御部６１、動作制御部６２ａ、判定制
御部６２ｂ、下位装置インタフェース制御部２６に相当
する下位装置インタフェース制御部６３、予防保守情報
格納用レジスタ６４、パトロール用タイマレジスタ６５
及び回転同期制御部６６を有する。According to this configuration, the input / output control device 20
₁ and 20 _2, respectively different logical devices on the basis of the instructions of the from the host device 10 can be simultaneously selected and controlled. Next, one embodiment of FIG. 3 will be described. In the figure, an array type magnetic disk controller 60 corresponds to the input / output controller 20. The array type magnetic disk controller 60 includes a channel interface controller 61 corresponding to the upper device interface controller 21, an operation controller 62a, a determination controller 62b, a lower device interface controller 63 corresponding to the lower device interface controller 26, and a prevention device. Maintenance information storage register 64, patrol timer register 65
And a rotation synchronization control unit 66.

【００１９】動作制御部６２ａは前記した検査手段２
２、制御手段２３を構成する回路部で、判定制御部６２
ｂは前記した判定手段２４を構成する回路部で、いずれ
もプロセッサ６２１、コントロールストレージ６２２、
システムリセットタイマレジスタ６２３、動作時間監視
タイマレジスタ６２４、シークアドレスレジスタ６２
５、論理デバイスアドレスレジスタ６２６、エラー検出
フラグレジスタ６２７及びデータ転送制御部６２８から
なる。The operation control unit 62a is provided with the inspection unit 2 described above.
2. A circuit part constituting the control means 23, and a judgment control part 62
Reference numeral b denotes a circuit unit constituting the above-described determination means 24, each of which includes a processor 621, a control storage 622,
System reset timer register 623, operating time monitoring timer register 624, seek address register 62
5, a logical device address register 626, an error detection flag register 627, and a data transfer control unit 628.

【００２０】プロセッサ６２１はマイクロプログラム制
御方式のアレー型磁気ディスク制御装置６０全体を制御
する。また、コントロールストレージ６２２はプロセッ
サ６２１のインストラクションコードが格納されてお
り、またファームウェア制御用テーブルとしても使われ
る。データ転送制御部６２８はチャネルインタフェース
制御部６１と下位装置インタフェース制御部６３との間
のデータ転送を制御し、また転送データのパリティを生
成し、それを下位装置インタフェース制御部６３へ転送
する。The processor 621 controls the entire array type magnetic disk controller 60 of the microprogram control system. The control storage 622 stores the instruction code of the processor 621 and is also used as a firmware control table. The data transfer control unit 628 controls data transfer between the channel interface control unit 61 and the lower device interface control unit 63, generates a parity of the transfer data, and transfers the parity to the lower device interface control unit 63.

【００２１】下位装置インタフェース制御部６３は♯０
〜♯９のディスク制御部６３０〜６３９を有する。ま
た、チャネルインタフェース制御部６１は制御レジスタ
群を有しており、上位装置５０との間で双方向のデータ
通信を行なう。上位装置５０は中央処理装置（ＣＰＵ）
で構成されている。また、回転同期制御部６６は後述の
論理デバイス７０₀〜７０_N内の各１０台の磁気ディス
ク装置の磁気ディスクを同期回転させている。The lower-level device interface controller 63 outputs $ 0
To # 9 disk control units 630 to 639. The channel interface control section 61 has a control register group, and performs bidirectional data communication with the host device 50. The host device 50 is a central processing unit (CPU)
It is composed of Further, the rotation synchronization control unit 66 synchronously rotates the magnetic disks of ten magnetic disk devices in the logical devices 70 ₀ to 70 _N to be described later.

【００２２】一方、ディスク制御部６３０〜６３９はＮ
＋１個の論理デバイス７０₀〜７０ _Nに夫々接続されて
いる。論理デバイス７０₀〜７０_Nは夫々同一構成で、
１０台の磁気ディスク装置７００〜７０９からなる。こ
のうち、磁気ディスク装置７００〜７０７はデータディ
スク装置で、磁気ディスクにデータが予め書き込まれて
おり、また任意に書き込み、読み出し可能な構成とされ
ている。On the other hand, the disk controllers 630 to 639
+1 logical device 70₀~ 70 _NConnected to each other
I have. Logical device 70₀~ 70_NHave the same configuration,
It comprises ten magnetic disk devices 700 to 709. This
Of these, the magnetic disk devices 700 to 707
Data is written to the magnetic disk in advance
And can be written and read arbitrarily.
ing.

【００２３】また、磁気ディスク装置７０８はパリティ
用ディスク装置で、記録再生する磁気ディスク（以下、
これをパリティディスクという）にデータディスク装置
７００〜７０７に書き込まれたデータのパリティビット
が書き込まれている。更に、磁気ディスク装置７０９は
予備用ディスク装置で、記録再生する磁気ディスク（以
下、これを予備ディスクという）にデータ又はパリティ
は初期時には書き込まれていない。論理デバイス７０₀
〜７０_Nは同時刻では１台の論理デバイスのみアレー型
磁気ディスク制御装置６０で制御される。A magnetic disk device 708 is a disk device for parity, and is a magnetic disk for recording and reproducing (hereinafter, referred to as a magnetic disk).
The parity bits of the data written to the data disk devices 700 to 707 are written in a parity disk. Further, the magnetic disk device 709 is a spare disk device, and data or parity is not written in a magnetic disk for recording / reproducing (hereinafter referred to as a spare disk) at the initial stage. Logical device 70 ₀
To 70 _N is controlled by the array-type magnetic disk controller 60 only logical device one at the same time.

【００２４】なお、図３に示すアレー型磁気ディスク制
御装置６０は、図２に示したように２台並列に設けら
れ、上位装置５０を共有し、かつ、論理デバイス７０₀
〜７０ _Nを夫々制御できるように接続されている。この
ような構成のアレー型磁気ディスク制御装置６０は、上
位装置５０からの命令に基づくデータを例えば８バイト
同時に、データディスク装置７００〜７０７に書き込
み、かつ、それらのデータのパリティもパリティ用ディ
スク装置７０８に書き込み、またそれらを読み出すよう
にしているため、データの転送スピードを１台の磁気デ
ィスク装置を使用した場合に比し高速にできるという特
長がある。The array type magnetic disk drive shown in FIG.
Two control devices 60 are provided in parallel as shown in FIG.
And the logical device 70₀
~ 70 _NAre connected so that each can be controlled. this
The array-type magnetic disk control device 60 having such a configuration is
8 bytes of data based on the instruction from the position device 50
At the same time, write to data disk units 700 to 707
And the parity of the data is
Write to and read them from disk device 708
Data transfer speed to one magnetic disk
The feature is that it can be operated at a higher speed than when a disk device is used.
There is a head.

【００２５】しかも、上位装置５０からの命令実行時に
上記の９台のディスク装置７００〜７０８のいずれかで
読み取りエラーや故障の異常が発生したことがわかった
場合、その異常発生のあるディスク装置の内容を残りの
正常な８台のディスク装置のデータから磁気ディスクサ
ブシステム内で自動的に復元して予備用ディスク装置７
０９に復元データを書き込むことができるため、信頼性
が高いという特長がある。Further, when it is found that a read error or a failure has occurred in any of the nine disk devices 700 to 708 during execution of an instruction from the host device 50, the disk device having the failure is identified. The contents are automatically restored in the magnetic disk subsystem from the data of the remaining normal eight disk devices, and the spare disk device 7 is restored.
Since the restoration data can be written in the data 09, there is a feature that the reliability is high.

【００２６】例えば、８台の磁気ディスクに書き込まれ
ているデータをＤ₀〜Ｄ₇、１台のパリティ用ディスク
装置７０８のパリティディスクに書き込まれているパリ
ティをＰ₀とし、このうちデータディスク装置７０２の
磁気ディスクで異常が発生した場合、読み取れないデー
タＤ₂を、正常に読み取りができた他のデータＤ₀，Ｄ
₁，Ｄ₃〜Ｄ₇及びパリティＰ₀から復元する。そし
て、データディスク装置７０２で規定回数以上の異常が
発生した場合、データディスク装置７０２に書き込まれ
ている全てのデータを残りの正常な８台のディスク装置
の中から磁気ディスクサブシステム内で自動的に復元し
て予備ディスクに書き込み、以後データディスク装置７
０２の代りに予備用ディスク装置７０９が用いられる。For example, the data written on the eight magnetic disks are D _{0 to} D ₇ , the parity written on the parity disk of one parity disk unit 708 is P _0, and the data disk unit If abnormalities in 702 of the magnetic disk occurs, the data D ₂ unreadable, other data D ₀ which can read normally, D
₁ , D ₃ -D ₇ and parity P ₀ . Then, when an abnormality occurs more than the specified number of times in the data disk device 702, all the data written in the data disk device 702 are automatically extracted from the remaining eight normal disk devices in the magnetic disk subsystem. And write it to the spare disk.
A spare disk device 709 is used in place of 02.

【００２７】このようなアレー型磁気ディスク制御装置
６０を用いた入出力サブシステムにおいて、主として上
位装置５０からの命令の非実行時に行なう各実施例の動
作について説明する。まず、本発明の第１実施例の動作
について図４と共に説明する。本実施例は、図３中のシ
ステムリセットタイマレジスタ６２３、動作時間監視タ
イマレジスタ６２４、シークアドレスレジスタ６２５、
論理デバイスアドレスレジスタ６２６及びエラー検出フ
ラグレジスタ６２７は使用せず、予備用ディスク装置７
０９のパトロール動作だけを実行する。図４において、
まずプロセッサ６２１は上位装置５０からの命令が有る
かどうか判定する（ステップ８１）。ここで、中央処理
装置（ＣＰＵ）を含む上位装置５０がアレー型磁気ディ
スク制御装置６０を介して一つの論理デバイスにデータ
の書き込み、読み出し等の命令を発行すると、アレー型
磁気ディスク制御装置６０はディスク制御部６３０〜６
３９を介してディスク装置７００〜７０８の９台の磁気
ディスクに同時にデータ及びパリティの書き込み、読み
出しを実行する。In the input / output subsystem using the array type magnetic disk controller 60, the operation of each embodiment mainly performed when an instruction from the host device 50 is not executed will be described. First, the operation of the first embodiment of the present invention will be described with reference to FIG. In this embodiment, the system reset timer register 623, the operating time monitoring timer register 624, the seek address register 625,
The logical device address register 626 and the error detection flag register 627 are not used, and the spare disk device 7
Only the patrol operation of 09 is executed. In FIG.
First, the processor 621 determines whether there is an instruction from the host device 50 (step 81). Here, when the host device 50 including the central processing unit (CPU) issues a data write / read command to one logical device via the array type magnetic disk controller 60, the array type magnetic disk controller 60 Disk control units 630-6
Data and parity are simultaneously written and read to and from nine magnetic disks of the disk devices 700 to 708 via the disk 39.

【００２８】すなわち、プロセッサ６２１は１つの命令
を実行する毎に図４に示すように上位装置５０からの命
令が有るかどうかチェックし（ステップ８１）、上記の
命令を検出すると、次にパトロール動作によりエラーを
検出した論理デバイスへの命令かどうか判定する（ステ
ップ８２）。通常は否であるので、上位装置５０からの
命令を実行し（ステップ８３）ステップ８１へ戻る。That is, the processor 621 checks whether or not there is an instruction from the host device 50 every time one instruction is executed as shown in FIG. 4 (step 81). It is determined whether the command is for the logical device that has detected the error (step 82). Normally, this is not the case, so the command from the host device 50 is executed (step 83), and the process returns to step 81.

【００２９】ここで、データが例えば８バイトであるも
のとすると、書き込み時は各１バイトのデータがディス
ク制御部６３０〜６３７を介してデータディスク装置７
００〜７０７へ並列に出力され、また、これら８バイト
のデータからデータ転送制御部６２８で生成されたパリ
ティがディスク制御部６３８を介してパリティ用ディス
ク装置７０８へ出力される。読み出し時はディスク装置
７００〜７０８で再生されたデータ及びパリティがディ
スク制御部６３０〜６３８を介してデータ制御転送部６
２８へ入力される。Here, assuming that the data is, for example, 8 bytes, at the time of writing, 1-byte data is transferred to the data disk device 7 via the disk control units 630 to 637.
The parity is output in parallel to 00 to 707, and the parity generated by the data transfer control unit 628 from the 8-byte data is output to the parity disk device 708 via the disk control unit 638. At the time of reading, the data and parity reproduced by the disk devices 700 to 708 are transferred to the data control transfer unit 6 via the disk control units 630 to 638.
28.

【００３０】一方、ステップ８１で上位装置５０からの
命令が無いと判定されたときは、上位装置５０に対し割
込み処理が必要か否か判定する（ステップ８４）。例え
ばコマンドチェインのため、シーク終了後のデバイスか
らのデバイスエンド通知に基づく上位装置５０に対する
割込み処理などの割込み処理が必要か否か判定する。割
り込みが必要な場合は上位装置５０に対して割込み処理
を実行する（ステップ８５）。割り込みが不要な場合は
パトロール用タイマレジスタ６５の読み出し（ステップ
８６）、インクリメント（ステップ８７）、格納（ステ
ップ８８）を順次行ない、パトロール用タイマレジスタ
６５の上記インクリメント後の値が所定値を越えたか
（すなわち規定時間に相当する値を越えたか）否かの判
定を行なう（ステップ８９）。ステップ８９で所定値を
越えていないと判定された場合は再びステップ８１へ戻
る。On the other hand, when it is determined in step 81 that there is no command from the host device 50, it is determined whether or not an interrupt process is necessary for the host device 50 (step 84). For example, for a command chain, it is determined whether or not an interrupt process such as an interrupt process for the host device 50 based on a device end notification from the device after the seek is completed is necessary. If an interrupt is required, an interrupt process is executed for the host device 50 (step 85). If an interrupt is not required, reading (step 86), incrementing (step 87), and storing (step 88) of the patrol timer register 65 are sequentially performed to determine whether the value of the patrol timer register 65 after the increment exceeds a predetermined value. It is determined whether or not the value has exceeded the value corresponding to the specified time (step 89). If it is determined in step 89 that the value does not exceed the predetermined value, the process returns to step 81 again.

【００３１】一方、ステップ８９で所定値を越えている
と判定されたときは、ステップ９０へ進み、予備用ディ
スク装置７０９に対してパトロール動作を実行する（ス
テップ９０）。具体的には、プロセッサ６２１が自分で
予め定めた特定パターンのデータを発生し、それをデー
タ転送制御部６２８を介してディスク制御部６３９に入
力し、ここでエラー検出コードを特定パターンのデータ
に基づいて生成して上記特定パターンのデータに付加す
る。そして、これらの特定パターンのデータ及びエラー
検出コードを論理デバイスアドレスレジスタ６２６で
指定された論理デバイス中の予備用ディスク装置６３９
の１トラックに書き込み、それを読み出す。On the other hand, if it is determined in step 89 that the value exceeds the predetermined value, the flow advances to step 90 to execute a patrol operation on the spare disk device 709 (step 90). Specifically, the processor 621 generates data of a specific pattern predetermined by itself, inputs the data to the disk control unit 639 via the data transfer control unit 628, and converts the error detection code into data of the specific pattern. It is generated on the basis of the above and added to the data of the specific pattern. The data of the specific pattern and the error detection code are stored in the spare disk device 639 in the logical device designated by the logical device address register 626.
, And read it out.

【００３２】予備用ディスク装置６３９は上記のパトロ
ール動作（リード、ライト動作）が正常に行なわれなか
ったときはハードエラー信号をディスク制御部６３９へ
送出する。また、リード動作により予備用ディスクから
読み出し出力されたデータ及びエラー検出コードはディ
スク制御部６３９を介してデータ転送制御部６２８へ入
力される。プロセッサ６２１は上記のハードエラー信号
の入力の有無及び上記読み出しデータ等の読み取りエラ
ーがあったか否かによりパトロール動作が正常終了した
か否か判定する（ステップ９１）。エラーが無かったと
きはプロセッサ６２１は正常終了と判定して、コントロ
ールストレージ６２２内にあるパトロール動作によるエ
ラー検出フラグをリセットする（ステップ９２）。The spare disk unit 639 sends a hard error signal to the disk control unit 639 when the above patrol operation (read / write operation) is not performed normally. The data and error detection code read and output from the spare disk by the read operation are input to the data transfer control unit 628 via the disk control unit 639. The processor 621 determines whether or not the patrol operation has been normally completed based on whether or not the hard error signal has been input and whether or not there has been a read error of the read data or the like (step 91). If there is no error, the processor 621 determines that the operation has been completed normally, and resets an error detection flag for the patrol operation in the control storage 622 (step 92).

【００３３】一方、ステップ９１でエラー発生と判定し
たときは、プロセッサ６２１は例えばシークが正常に終
了しないなどのハードエラー、あるいは読み取りエラー
等のエラー状態の詳細情報を収集して予防保守情報格納
用レジスタ６４へ格納し（ステップ９３）、コントロー
ルストレージ６２２内にあるパトロール動作によるエラ
ー検出フラグをセットする（ステップ９４）。On the other hand, if it is determined in step 91 that an error has occurred, the processor 621 collects detailed information of a hard error such as a failure of the seek to end normally, or an error state such as a reading error, and stores the information for storing preventive maintenance information. The data is stored in the register 64 (step 93), and an error detection flag for the patrol operation in the control storage 622 is set (step 94).

【００３４】ステップ９２又は９４の処理が終了する
と、ステップ８１に戻る。以下、上記と同様にして上位
装置５０からの命令が無い処理空き時間に（ステップ８
１）、パトロール用タイマレジスタ６５により判定され
る規定時間毎に（ステップ８６〜８９）、各論理デバイ
スの予備用ディスク装置７０９の予備ディスクの１トラ
ックに対して上記の特定パターンのデータ及びエラー検
出コードの書込みと読み出し及びエラー検出等が行なわ
れていく（ステップ９０）。When the processing in step 92 or 94 is completed, the flow returns to step 81. Thereafter, in the same manner as described above, during the processing idle time when there is no command from the host device 50 (step 8).
1) At a specified time determined by the patrol timer register 65 (steps 86 to 89), the data of the specific pattern and the error detection for one track of the spare disk of the spare disk unit 709 of each logical device are detected. Code writing and reading, error detection, and the like are performed (step 90).

【００３５】論理デバイス７０_Nの最初の１トラックへ
の上記の書込みと読み出しが終了すると、次に論理デバ
イス７０₀の予備用ディスク装置７０９の予備ディスク
の次の１トラックに対して上記の特定パターンのデータ
及びエラー検出コードの書き込みと読み出し及びエラー
検出等が行なわれる。以下、上記と同様の動作が繰り返
される。The logical device 70 When the above writing and reading of _N to the first track is finished, then the logic device 70 ₀ the specific pattern for the next track of the spare disk of the spare disk device 709 The writing and reading of the data and the error detection code, the error detection, and the like are performed. Hereinafter, the same operation as described above is repeated.

【００３６】その後、パトロール動作によりエラーが検
出された予備ディスクが属する論理デバイスに対して上
位装置５０から命令が発行されたときに（ステップ８
１，８２）、プロセッサ６２１は予防保守情報格納用レ
ジスタ６４から格納してある予防保守情報を読み出し
て、それをチャネルインタフェース制御部６１を介して
上位装置５０に送出し、予備用ディスク装置のエラー情
報を通知した後（ステップ９５）、ステップ８１に戻
る。Thereafter, when an instruction is issued from the higher-level device 50 to the logical device to which the spare disk in which the error has been detected by the patrol operation belongs (step 8).
1, 82), the processor 621 reads the preventive maintenance information stored from the preventive maintenance information storage register 64, sends it out to the upper-level device 50 via the channel interface control unit 61, and outputs the error of the spare disk device. After the information is notified (step 95), the process returns to step 81.

【００３７】これにより、本実施例によれば、予備用デ
ィスク装置７０９自体にハードエラーや読み取りエラー
等があった場合には、アレー型磁気ディスク制御装置６
０から通知される予防保守情報に基づき、事前に予備用
ディスク装置７０９自体のハードエラーや読み取りエラ
ー等に対する修理や交換が実施可能となる。これによ
り、予備用ディスク装置７０９を異常が発生したディス
ク装置の代わりとして使用する際に、予備用ディスク装
置７０９を直ちに使用できる。Thus, according to the present embodiment, if there is a hard error, a reading error or the like in the spare disk device 709 itself, the array type magnetic disk controller 6
Based on the preventive maintenance information notified from 0, repair or replacement of the spare disk unit 709 itself for a hard error, a reading error, or the like can be performed in advance. This allows the spare disk device 709 to be used immediately when the spare disk device 709 is used as a substitute for the disk device in which an abnormality has occurred.

【００３８】次に本発明の第２実施例の動作について、
図５及び図６のフローチャートと共に説明する。本実施
例は予備用ディスク装置７０９に対するパトロール動作
だけでなく、現用の入出力装置であるデータディスク装
置７００〜７０７及びパリティ用ディスク装置７０８に
対してもパトロール動作を実行するようにしたものであ
る。Next, the operation of the second embodiment of the present invention will be described.
This will be described with reference to the flowcharts of FIGS. In the present embodiment, the patrol operation is performed not only on the spare disk device 709 but also on the data disk devices 700 to 707 and the parity disk device 708 which are the current input / output devices. .

【００３９】図５において、まずアレー型磁気ディスク
制御装置６０に電源が投入されると、図５のステップ１
０１にてパワーオンシーケンスが実行される。このパワ
ーオンシーケンスにより前記した各種レジスタ６４，６
５，６２３〜６２７の初期化などが行なわれる。また、
上記電源投入により、論理デバイス７０₀〜７０_Nに電
源が投入され、予備用ディスク装置７０９も他のディス
ク装置７００〜７０８と同様に電源が投入される。In FIG. 5, when power is first supplied to the array type magnetic disk controller 60, step 1 in FIG.
At 01, a power-on sequence is executed. By this power-on sequence, the various registers 64, 6
5, 623 to 627 are initialized. Also,
By turning on the power, the logical devices 70 ₀ to 70 _N are turned on, and the spare disk unit 709 is also turned on similarly to the other disk units 700 to 708.

【００４０】続いて、プロセッサ６２１は上位装置５０
からの命令が有るかどうか判定する（ステップ１０
２）。ここで、中央処理装置（ＣＰＵ）を含む上位装置
５０がアレー型磁気ディスク制御装置６０を介して一つ
の論理デバイスにデータの書き込み、読み出し等の命令
を発行すると、アレー型磁気ディスク制御装置６０はデ
ィスク制御部６３０〜６３９を介してディスク装置７０
０〜７０８の９台の磁気ディスクに同時にデータ及びパ
リティの書き込み、読み出しを実行する。Subsequently, the processor 621 is connected to the host device 50.
(Step 10)
2). Here, when the host device 50 including the central processing unit (CPU) issues a data write / read command to one logical device via the array type magnetic disk controller 60, the array type magnetic disk controller 60 The disk device 70 via the disk control units 630-639
Data and parity are simultaneously written and read from nine magnetic disks 0 to 708.

【００４１】すなわち、プロセッサ６２１は１つの命令
を実行する毎に図５に示すように上位装置５０からの命
令が有るかどうかチェックし（ステップ１０２）、上記
の命令を検出すると、次にパトロール動作によりエラー
を検出した論理デバイスへの命令かどうか判定する（ス
テップ１０３）。通常は否であるので、上位装置５０か
らの命令を実行し（ステップ１０４）、動作時間監視タ
イマレジスタ６２４を初期化した後（ステップ１０
５）、ステップ１０２へ戻る。That is, the processor 621 checks whether or not there is an instruction from the host device 50 every time one instruction is executed as shown in FIG. 5 (step 102). It is determined whether the command is for the logical device that has detected the error (step 103). Normally, this is not the case, so that an instruction from the host device 50 is executed (step 104), and the operation time monitoring timer register 624 is initialized (step 10).
5) Return to step 102.

【００４２】一方、ステップ１０２で上位装置５０から
の命令が無いと判定されたときは、上位装置５０に対
し、コントロールユニットエンド（ＣＵＥｎｄ）、デ
バイスエンド等の割込み処理が必要か否か判定する（ス
テップ１０６）。割り込みが必要な場合は上位装置５０
に対して割込み処理を実行する（ステップ１０７）。割
り込みが不要な場合はパトロール用タイマレジスタ６５
のインクリメント（ステップ１０８）、システムリセッ
トタイマレジスタ６２３のインクリメント（ステップ１
０９）を順次行ない、システムリセットタイマレジスタ
６２３の値が１０分に相当する所定値を越えたか否かの
判定（ステップ１１０）と、パトロール用タイマレジス
タ６５の値が所定値を越えたか（すなわち規定時間であ
る１秒に相当する値を越えたか）否かの判定（ステップ
１１１）とを順次行なう。ステップ１１０及び１１１の
いずれも所定値を越えていない場合は再びステップ１０
２へ戻る。上記のパトロール用タイマレジスタ６５によ
り定められる１秒は、後述する如く、論理デバイスの選
択周期であり、これは上位装置５０からの命令処理に支
障をきたさないように、経験により定められた時間であ
る。On the other hand, if it is determined in step 102 that there is no command from the host device 50, it is determined whether or not an interrupt process such as a control unit end (CU End) or a device end is required for the host device 50. (Step 106). If an interrupt is required, host device 50
(Step 107). If no interrupt is required, patrol timer register 65
(Step 108), the system reset timer register 623 is incremented (step 1).
09) is sequentially performed to determine whether the value of the system reset timer register 623 has exceeded a predetermined value corresponding to 10 minutes (step 110), and whether the value of the patrol timer register 65 has exceeded the predetermined value (that is, the specified value). (Step 111) is sequentially determined. If neither of Steps 110 and 111 exceeds the predetermined value, Step 10 is performed again.
Return to 2. One second determined by the patrol timer register 65 is a logical device selection cycle, as described later, and is a time determined by experience so as not to hinder the instruction processing from the host device 50. is there.

【００４３】一方、ステップ１１０及び１１１のいずれ
の判定時も所定値を越えていると判定されたときは、ス
テップ１１２へ進み、保守パネル上に設けられたハード
ウェアスイッチの設定又は上位装置５０からの命令によ
り、パトロール動作が禁止されているかの判定を行な
う。パトロール動作が禁止されている場合はステップ１
０２へ戻り、他方、パトロール動作が禁止されていない
通常の場合は、上位装置５０からの命令に対し、コント
ロールユニットビジー（ＣｕＢｕｓｙ）を返すモード
をセットし（ステップ１１３）、その後パトロール動作
実行用論理デバイスアドレスレジスタ６２６の内容を読
み出し（ステップ１１４）、当該論理デバイスを選択す
る（ステップ１１５）。On the other hand, if it is determined that the value exceeds the predetermined value in any of the determinations in steps 110 and 111, the process proceeds to step 112, in which the setting of the hardware switches provided on the maintenance panel or the Is determined whether or not the patrol operation is prohibited. Step 1 if patrol is prohibited
02, on the other hand, in the normal case where the patrol operation is not prohibited, the mode for returning the control unit busy (Cu Busy) is set for the command from the host device 50 (step 113). The contents of the logical device address register 626 are read (step 114), and the logical device is selected (step 115).

【００４４】続いて、当該論理デバイスが使用可能状態
か判定する（ステップ１１６）。これは当該論理デバイ
スの電源が入っていなくても、ステップ１１５による論
理デバイスの選択は行なわれるので、当該論理デバイス
からのステータスをプロセッサ６２１が見て当該論理デ
バイスが使用可能状態かをチェックするのである。当該
論理デバイスが使用可能状態であるときは当該論理デバ
イスが他系から使用中か判定される（ステップ１１
７）。すなわち、図２に示したようにアレー型磁気ディ
スク制御装置６０が２台若しくはそれ以上同時に使用さ
れているような場合において、当該論理デバイスが他系
のアレー型磁気ディスク制御装置により既に使用中のと
きは使用できないからである。Subsequently, it is determined whether the logical device is usable (step 116). This is because, even if the power of the logical device is not turned on, the logical device is selected in step 115, and the processor 621 checks the status from the logical device to check whether the logical device is usable. is there. If the logical device is available, it is determined whether the logical device is being used by another system (step 11).
7). That is, when two or more array-type magnetic disk controllers 60 are used simultaneously as shown in FIG. 2, the logical device is already in use by another array-type magnetic disk controller. This is because sometimes it cannot be used.

【００４５】当該論理デバイスが使用可能状態でないと
き（ステップ１１６）、又は当該論理デバイスが他系か
ら使用中のとき（ステップ１１７）は、当該論理デバイ
スの選択状態を解除し（ステップ１１８）、パトロール
動作実行用論理デバイスアドレスレジスタ６２６をイン
クリメントして格納する（ステップ１１９）。すなわ
ち、当該論理デバイスの選択をあきらめて、次の論理デ
バイスを選択するようにする。その後、パトロール用タ
イマレジスタ６５及び動作時間監視タイマレジスタ６２
４を夫々初期化して（ステップ１２０）、ステップ１０
２へ戻る。When the logical device is not in a usable state (step 116) or when the logical device is being used by another system (step 117), the selected state of the logical device is released (step 118) and patrol is performed. The operation execution logical device address register 626 is incremented and stored (step 119). That is, the selection of the logical device is abandoned, and the next logical device is selected. Thereafter, the patrol timer register 65 and the operation time monitoring timer register 62
4 are initialized (Step 120), and Step 10
Return to 2.

【００４６】他方、当該論理デバイスが使用可能状態
で、かつ、他系から使用中でないとき（ステップ１１
６，１１７）は、図６のステップ１２１へ進み、当該論
理デバイス内の物理デバイス（すなわち、データディス
ク装置７００〜７０７、パリティ用ディスク装置７０８
及び予備用ディスク装置７０９）の状態を読み取る（ス
テップ１２１）。On the other hand, when the logical device is in a usable state and is not being used by another system (step 11).
6, 117) proceeds to step 121 in FIG. 6, and proceeds to the physical device (ie, the data disk devices 700 to 707 and the parity disk device 708) in the logical device.
Then, the state of the spare disk device 709) is read (step 121).

【００４７】続いて、当該論理デバイスに対して２０秒
の間にコマンドが実行されたか否か判定される（ステッ
プ１２２）。「２０秒」か否かは動作時間監視タイマレ
ジスタ６２４の値が、２０秒に相当する値（回数）にな
ったか否かで判断する。また、「２０秒」の値自体は経
験値で、必ずしもこの時間に限定されるものではない。Subsequently, it is determined whether a command has been executed for the logical device within 20 seconds (step 122). Whether or not “20 seconds” is determined based on whether or not the value of the operation time monitoring timer register 624 has reached a value (number of times) corresponding to 20 seconds. The value of “20 seconds” itself is an empirical value, and is not necessarily limited to this time.

【００４８】２０秒間の間にコマンドが実行されないと
きは、パトロール動作モードを論理デバイスモードとす
る（ステップ１２３）。この論理デバイスモードは、現
用のデータディスク装置７００〜７０７及びパリティ用
ディスク装置７０８及び予備用ディスク装置７０９を
使用するモードである。このモードに切換わると、次に
パトロール動作実行用シークアドレスレジスタ６２５を
インクリメントした後（ステップ１２４）、上記の論理
デバイス（すなわち、ディスク装置７００〜７０８）の
夫々のヘッドを現在の位置から一旦一定距離移動して再
び元の位置に戻すシーク動作を実行する（ステップ１２
５）。このシーク動作は、パトロールシーク動作であっ
て、ヘッドを長時間一定位置に静止しておくと、ヘッド
にゴミが付着し、記録再生動作に悪影響を与えるので、
ゴミのヘッドへの付着を防止するために行なわれる。If no command is executed within 20 seconds, the patrol operation mode is set to the logical device mode (step 123). This logical device mode uses the current data disk devices 700 to 707, the parity disk device 708, and the spare disk device 709. When the mode is switched to this mode, the seek address register 625 for executing the patrol operation is incremented next (step 124), and then the respective heads of the above logical devices (that is, the disk devices 700 to 708) are once fixed from the current position. A seek operation is performed to move the distance and return to the original position again (step 12).
5). This seek operation is a patrol seek operation, and if the head is kept at a fixed position for a long time, dust adheres to the head and adversely affects the recording / reproducing operation.
This is performed to prevent dust from adhering to the head.

【００４９】上記のパトロールシーク動作が正常に終了
しない場合は、正常終了しないディスク装置から対応す
るディスク制御部へハードエラー信号が送出される。そ
こで、プロセッサ６２１はこのハードエラー信号の入力
の有無からエラー発生の有無を検出し（ステップ１２
６）、エラー検出時は後述のステップ１３８の処理を実
行し、エラー非検出時は後述のステップ１４０の処理を
実行する。If the patrol seek operation does not end normally, a hard error signal is sent from the disk device that does not end normally to the corresponding disk control unit. Therefore, the processor 621 detects whether an error has occurred from the presence or absence of the input of the hard error signal (step 12).
6) When an error is detected, the process of step 138 described below is executed. When no error is detected, the process of step 140 described below is executed.

【００５０】他方、２０秒の間にコマンドが実行された
とステップ１２２で判定されたときは、予備用ディスク
装置７０９が動作可能状態（待機状態）かを確認した後
（ステップ１２７）、パトロール動作モードを物理デバ
イスモードとする（ステップ１２８）。この物理デバイ
スモードは予備用ディスク装置７０９のみを使用するモ
ードである。このモードに切換わると、次にパトロール
動作実行用シークアドレスレジスタ６２５をインクリメ
ントした後（ステップ１２９）、予備用ディスク装置７
０９に対しシーク動作を実行する（ステップ１３０）。On the other hand, if it is determined in step 122 that the command has been executed within 20 seconds, it is checked whether the spare disk unit 709 is in an operable state (standby state) (step 127), and the patrol operation mode is set. Is set to the physical device mode (step 128). This physical device mode is a mode in which only the spare disk device 709 is used. When the mode is switched to this mode, the seek address register 625 for executing the patrol operation is incremented next (step 129), and then the spare disk device 7
A seek operation is executed for the step 09 (step 130).

【００５１】このシーク動作はライト時に予め行なわれ
るシーク動作であって、本実施例の対象とする予備用デ
ィスク装置７０９がライト時及びリード時の夫々に先立
ってヘッドを一定距離移動させるシーク動作が構造的に
必要であるためである。しかし、このシーク動作によ
り、前記パトロールシークと同様にヘッドへのゴミの付
着も防止される。このシーク動作が正常に終了しなかっ
た場合は予備用ディスク装置７０９がディスク制御部６
３９へハードエラー信号を送出する。そこで、次のステ
ップ１３１ではこのハードエラー信号が入力されるとき
はエラー発生と判断し、ハードエラー信号が入力されな
いときは正常と判定する。This seek operation is a seek operation that is performed in advance at the time of writing, and the seek operation in which the spare disk device 709 of the present embodiment moves the head by a predetermined distance prior to each of writing and reading. This is because it is structurally necessary. However, this seek operation prevents dust from adhering to the head as in the case of the patrol seek. If the seek operation has not been completed normally, the spare disk device
A hard error signal is sent to 39. Therefore, in the next step 131, when this hard error signal is input, it is determined that an error has occurred, and when the hard error signal is not input, it is determined that it is normal.

【００５２】正常の場合は予備用ディスク装置７０９に
対して前記した特定パターンのデータとエラー検出コー
ドのライト動作を実行する（ステップ１３２）。予備用
ディスク装置６３９は上記のライト動作が正常に行なわ
れなかったときはシーク動作と同様にハードエラー信号
をディスク制御部６３９へ送出するので、プロセッサ６
２１はこのハードエラー信号の入力の有無からエラー発
生か否かをチェックする（ステップ１３３）。続いて、
エラーが無いときはリード動作に先立つシーク動作を予
備用ディスク装置１３４に対して実行し（ステップ１３
４）、上記エラー検出コード及びデータを書き込んだ１
トラックの先頭位置にヘッドを移動した後、ハードエラ
ー信号の有無によりエラー発生か否かをチェックし（ス
テップ１３５）、エラーが無いときはリード動作を実行
し（ステップ１３６）、上記のデータ及びエラー検出コ
ードを再生する。If the data is normal, the write operation of the data of the specific pattern and the error detection code is executed for the spare disk device 709 (step 132). When the above-mentioned write operation is not performed normally, the spare disk device 639 sends a hard error signal to the disk control unit 639 similarly to the seek operation.
21 checks whether or not an error has occurred based on the presence or absence of the input of the hard error signal (step 133). continue,
If there is no error, a seek operation prior to the read operation is executed for the spare disk device 134 (step 13).
4), 1 where the above error detection code and data are written
After moving the head to the head position of the track, it is checked whether or not an error has occurred based on the presence or absence of a hard error signal (step 135). If there is no error, a read operation is executed (step 136), and the above data and error are detected. Play the detection code.

【００５３】このリード動作終了時にハードエラーが有
るときはハードエラー信号がディスク制御部６３９に供
給される。また、リード動作により予備ディスクから
読み出し出力されたデータ及びエラー検出コードはディ
スク制御部６３９を介してデータ転送制御部６２８へ入
力される。プロセッサ６２１は上記のハードエラー信号
の入力の有無及び読み取りエラーがあったか否か判定
する（ステップ１３７）。エラーが無かったときはプロ
セッサ６２１は正常終了と判定して、エラー検出フラグ
レジスタ６２７のエラー検出フラグをリセットする（ス
テップ１４０）。If there is a hard error at the end of the read operation, a hard error signal is supplied to the disk controller 639. The data and the error detection code read out from the spare disk by the read operation are input to the data transfer control unit 628 via the disk control unit 639. The processor 621 determines whether or not the hard error signal has been input and whether or not there has been a read error (step 137). If there is no error, the processor 621 determines that the processing has been completed normally, and resets the error detection flag of the error detection flag register 627 (step 140).

【００５４】一方、ステップ１２６，１３１，１３３，
１３５又は１３７でエラー発生と判定したときは、プロ
セッサ６２１は例えばシークが正常に終了しないなどの
ハードエラー、あるいは読み取りエラー等のエラー状態
の詳細情報を収集して予防保守情報格納用レジスタ６４
へ格納し（ステップ１３８）、エラー検出フラグレジス
タ６２７のエラー検出フラグをセットする（ステップ１
３９）。On the other hand, steps 126, 131, 133,
If it is determined at 135 or 137 that an error has occurred, the processor 621 collects detailed information on an error state such as a hard error such as a case where the seek does not end normally or a reading error, and stores the preventive maintenance information storage register 64.
(Step 138), and sets the error detection flag of the error detection flag register 627 (step 1).
39).

【００５５】ステップ１３９又は１４０の処理が終了す
ると、又はステップ１２７で予備用ディスク装置７０９
がパトロール動作可能な状態にないと判定されたとき
は、ＣｕＢｕｓｙ状態を解除した後（ステップ１４
１）、図４のステップ１１８〜１２０により現在の論理
デバイスの選択状態を解除した後パトロール動作実行完
了用論理デバイスアドレスレジスタ６２６をインクリメ
ントして次の論理デバイスを指定し、またパトロール用
タイマレジスタ６５及び動作時間監視タイマレジスタ１
２０を夫々初期化した後ステップ１０２へ戻る。When the processing in step 139 or 140 is completed, or in step 127, the spare disk unit 709
If it is determined that the state is not in a state where patrol operation is possible, the Cu Busy state is released (step 14).
1), after canceling the current logical device selection state in steps 118 to 120 of FIG. 4, the patrol operation execution completion logical device address register 626 is incremented to designate the next logical device, and the patrol timer register 65 And operating time monitoring timer register 1
After the initialization of each of the variables 20, the process returns to step 102.

【００５６】以下、上記と同様にして上位装置５０から
の命令が無い処理空き時間に（ステップ１０２）、パト
ロール用タイマレジスタ６５により判定される規定時間
（通常１秒）毎に（ステップ１１１）、各論理デバイス
の予備用ディスク装置７０９の予備ディスクの１トラッ
クに対して上記の特定パターンのデータ及びエラー検出
コードの書込みと読み出し及びエラー検出等が行なわれ
ていく（ステップ１２９〜１３７）。Thereafter, in the same manner as described above, during the processing idle time when there is no command from the host device 50 (step 102), every prescribed time (normally 1 second) determined by the patrol timer register 65 (step 111), Writing and reading of the data of the specific pattern and the error detection code, error detection, and the like are performed on one track of the spare disk of the spare disk unit 709 of each logical device (steps 129 to 137).

【００５７】論理デバイス７０_Nの予備用ディスク装置
７０９の最初の１トラックへの上記の書込みと読み出し
が終了すると、次に論理デバイス７０₀の予備用ディス
ク装置７０９の予備ディスクの次の１トラックに対して
上記の特定パターンのデータ及びエラー検出コードの書
き込みと読み出し及びエラー検出等が行なわれる。以
下、上記と同様の動作が繰り返される。[0057] When the writing and reading to the first track of the logical device 70 _N of the spare disk device 709 is completed, the next following track of the spare disk of logical devices 70 ₀ of the spare disk device 709 On the other hand, writing and reading of the data of the specific pattern and the error detection code, error detection, and the like are performed. Hereinafter, the same operation as described above is repeated.

【００５８】ここで、１秒後に論理デバイスが選択され
たときに予備ディスクにデータが書き込み及び読み出し
されるトラックは、前回のトラックの隣接トラックでは
なく、一定トラック数離れたトラックとされている。す
なわち、予備ディスクの記録面を複数のゾーンに分割
し、その各分割ゾーンの代表トラックにデータが書き込
まれるようにステップ１２９におけるシークアドレスレ
ジスタ６２５のインクリメント数が予め定められてい
る。これにより、短時間で予備ディスクの正常性の判定
ができる。Here, when a logical device is selected one second later, the track on which data is written and read from the spare disk is not a track adjacent to the previous track but a track separated by a certain number of tracks. That is, the recording surface of the spare disk is divided into a plurality of zones, and the number of increments of the seek address register 625 in step 129 is determined in advance so that data is written to the representative track of each divided zone. Thus, the normality of the spare disk can be determined in a short time.

【００５９】その後、パトロール動作によりエラーが検
出された予備ディスクが属する論理デバイスに対して上
位装置５０から命令が発行されたときに（ステップ１０
２，１０３）、プロセッサ６２１は予防保守情報格納用
レジスタ６４から格納してある予防保守情報を読み出し
て、それをチャネルインタフェース制御部６１を介して
上位装置５０に送出し、予備用ディスク装置のエラー情
報を通知した後（ステップ１４２）、エラー検出フラグ
レジスタ６２７内のパトロール動作によるエラー検出フ
ラグをリセットし（ステップ１４３）、ステップ１０２
に戻る。Thereafter, when an instruction is issued from the higher-level device 50 to the logical device to which the spare disk in which the error has been detected by the patrol operation belongs (step 10).
2, 103), the processor 621 reads the stored preventive maintenance information from the preventive maintenance information storage register 64, sends the readout preventive maintenance information to the upper-level device 50 via the channel interface control unit 61, and outputs the error of the spare disk device. After notifying the information (step 142), the error detection flag by the patrol operation in the error detection flag register 627 is reset (step 143), and step 102
Return to

【００６０】これにより、本実施例によれば、予備用デ
ィスク装置７０９自体にハードエラーや読み取りエラー
等があった場合は勿論のこと、現用のデータディスク
装置７００〜７０７、パリティ用ディスク装置にハード
エラーがあった場合もアレー型磁気ディスク制御装置６
０から通知される予防保守情報に基づき、事前にハード
エラーや読み取りエラー等に対する修理や交換が実施可
能となる。Thus, according to the present embodiment, not only when the spare disk unit 709 itself has a hard error or a read error, but also when the current data disk units 700 to 707 and the parity disk unit are used. The array-type magnetic disk controller 6 is used even when an error occurs.
Based on the preventive maintenance information notified from 0, repair or replacement for a hardware error, a reading error, or the like can be performed in advance.

【００６１】なお、本発明は上記の実施例に限定される
ものではなく、例えばパトロール用タイマレジスタ６５
及び予防保守情報格納用レジスタ６４はハードウェアで
なくともよく、プロセッサ６２１がファームウェアによ
ってタイマ機能を実行したり、コントロールストレージ
６２２内に予防保守情報を格納してもよい。The present invention is not limited to the above-described embodiment. For example, the patrol timer register 65
The register 64 for storing preventive maintenance information may not be hardware, and the processor 621 may execute a timer function by firmware, or may store the preventive maintenance information in the control storage 622.

【００６２】[0062]

【発明の効果】以上のように、本発明に係るアレイ型デ
ィスク制御装置は、複数台の入出力装置と１台の予備用
入出力装置からなる論理デバイスに接続され、論理デバ
イス内の複数台の入出力装置と同時に制御して上位装置
からのデータの書き込みや読み出しを並行して行なうた
めに、高速のデータ転送に適しており、また異常発生の
ある１台の入出力装置の代りに使用される予備用入出力
装置を電源投入により常に監視してその予防保守情報を
上位装置に提供するようにしているため、高信頼性であ
り、高信頼化が特に要求される計算機システムに適用し
て好適である。As described above, the array-type disk controller according to the present invention is connected to a logical device consisting of a plurality of input / output devices and one spare input / output device. It is suitable for high-speed data transfer because data is written and read from the host device in parallel by controlling at the same time as the input / output device, and it is used in place of one input / output device with an error. The standby I / O device is always monitored when the power is turned on, and its preventive maintenance information is provided to the host device. Therefore, the I / O device can be applied to computer systems that require high reliability and high reliability. It is suitable.

【００６３】また、上記効果を有するアレー形記憶シス
テムを提供することができる。Further, it is possible to provide an array type storage system having the above effects.

[Brief description of the drawings]

【図１】本発明の基本原理構成を示す図である。FIG. 1 is a diagram showing a basic principle configuration of the present invention.

【図２】上位装置及び複数の論理デバイスを共有する２
台の入出力制御装置の構成図である。FIG. 2 illustrates a host device and a plurality of logical devices that are shared.
FIG. 3 is a configuration diagram of one input / output control device.

【図３】本発明になる入出力制御装置の一実施例の構成
図である。FIG. 3 is a configuration diagram of an embodiment of an input / output control device according to the present invention.

【図４】本発明の第１実施例の動作説明用フローチャー
トである。FIG. 4 is a flowchart for explaining the operation of the first embodiment of the present invention.

【図５】本発明の第２実施例の動作説明用フローチャー
ト（その１）である。FIG. 5 is a flowchart (part 1) for explaining the operation of the second embodiment of the present invention.

【図６】本発明の第２実施例の動作説明用フローチャー
ト（その２）である。FIG. 6 is a flowchart (part 2) for explaining the operation of the second embodiment of the present invention.

[Explanation of symbols]

１０上位装置２０入出力制御装置２１上位装置インタフェース制御部２２検査手段２３制御手段２４判定手段２５予防保守情報格納用レジスタ２６下位装置インタフェース制御部２７パトロール用タイマレジスタ３０入出力装置３１予備用入出力装置 DESCRIPTION OF SYMBOLS 10 High-order apparatus 20 I / O control apparatus 21 High-order apparatus interface control part 22 Inspection means 23 Control means 24 Judgment means 25 Preventive maintenance information storage register 26 Subordinate equipment interface control part 27 Patrol timer register 30 I / O device 31 Reserved I / O apparatus

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06F 3/06 ──────────────────────────────────────────────────続き Continued on front page (58) Field surveyed (Int.Cl. ⁶ , DB name) G06F 3/06

Claims

(57) [Claims]

1. A plurality of disk devices (30) each storing data transferred to and from a higher-level device and redundant data created from the data, and one spare disk device (31). A lower-level device interface control unit (26) to which a logical device comprising: is connected, and the lower-level device interface control unit is connected to a higher-level device (10) and is issued from the higher-level device (10). A channel interface controller (21) for receiving operation commands to the plurality of disk devices (30) and the spare disk device (31); and a command input from a higher-level device via the channel interface controller. A logical device that controls read and write operations of a plurality of disk devices that store the data, and that stores the data. If the plurality of disk devices included in the device are normal, the plurality of disk devices that store the data are controlled, and an error is found in one of the plurality of disk devices included in the logical device that stores the data. An operation control unit that controls a normal disk device and a spare disk device in a case where it is performed; and a power supply unit that supplies power to each of the plurality of disk devices and the spare disk device in the logical device. An array-type disk controller, comprising: always keeping all disk devices in the logical device in a state in which a read / write command and a seek command can be executed.