JP2000339186A

JP2000339186A - Automatic reconnection method for cluster system monitoring terminal and its system

Info

Publication number: JP2000339186A
Application number: JP11153049A
Authority: JP
Inventors: Hidehiko Masumoto; 英彦升本
Original assignee: NEC Software Chubu Ltd
Current assignee: NEC Solution Innovators Ltd
Priority date: 1999-05-31
Filing date: 1999-05-31
Publication date: 2000-12-08

Abstract

PROBLEM TO BE SOLVED: To automatically reconnect a cluster system monitoring terminal with a new master without the wasteful use of any resource, or the interruption of any operator when a failure is generated in a master host computer with which the cluster system monitoring terminal is connected. SOLUTION: When a master host computer 1 is connected with a cluster system monitoring terminal 4, an address adr of the cluster system monitoring terminal 4 is obtained and stored in a shared region 3. When a failure is generated in the master, this failure is detected by a slave host computer 2, and the slave host computer 2 is switched to a master host computer, and connected with the cluster system monitoring terminal 4 corresponding to the address adr read from the shared region 3.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、複数のホストコン
ピュータで構成されるクラスタシステムのホストコンピ
ュータ障害時におけるクラスタシステム監視端末の自動
再接続方法および自動再接続システムに関する。The present invention relates to a method and system for automatically reconnecting a cluster system monitoring terminal when a host computer fails in a cluster system comprising a plurality of host computers.

【０００２】[0002]

【従来の技術】従来のクラスタシステム内で障害が発生
した場合の障害復旧、運用継続を自動的に行う方法の一
例が、特開平０５−３４６８６８号に記載されている。
この方法は、端末装置から稼動系および待機系のすべて
のホストコンピュータそれぞれに対してセッションを張
る。そして、稼動系のホストコンピュータで障害が発生
した場合には、待機系のホストコンピュータは端末装置
との間に予め張られている待機系のセッションを使用し
て稼動系のホストコンピュータから業務を引継ぐ。2. Description of the Related Art An example of a conventional method for automatically recovering a failure and continuing operation when a failure occurs in a cluster system is described in Japanese Patent Application Laid-Open No. 05-346868.
In this method, a session is established from a terminal device to each of the active and standby host computers. When a failure occurs in the active host computer, the standby host computer takes over the work from the active host computer using a standby session established in advance with the terminal device. .

【０００３】しかし、この方法の場合には、実際に必要
なセッションは端末装置と１台の稼動系のホストコンピ
ュータとの間のセッションだけであるにもかかわらず、
ホストコンピュータ台数分のセッションを張るため、資
源を無駄に使用することになる。However, in this method, although the session actually required is only a session between the terminal device and one active host computer,
Since sessions are set up for the number of host computers, resources are wasted.

【０００４】以上述べたように、従来の端末装置または
クラスタシステム監視端末の自動再接続方法または自動
再接続システムにおいては、待機系のセッションを予め
張るため、資源を無駄に使用するという問題があった。As described above, in the conventional automatic reconnection method or automatic reconnection system for a terminal device or a cluster system monitoring terminal, there is a problem that resources are wasted since a standby system session is established in advance. Was.

【０００５】[0005]

【発明が解決しようとする課題】本発明は、上記の問題
に鑑みてなされたもので、クラスタシステム監視端末が
接続されているマスタのホストコンピュータに障害が発
生した場合に、待機系のセッションを予め張るといった
資源を無駄に使用することなく、また操作員の介入なし
に、クラスタシステム監視端末を新たにマスタに切り替
わったホストコンピュータに自動的に再接続し監視を継
続することを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above-mentioned problems, and is intended to reduce a standby session when a failure occurs in a master host computer to which a cluster system monitoring terminal is connected. An object of the present invention is to automatically reconnect a cluster system monitoring terminal to a host computer that has newly been switched to a master and continue monitoring without wasteful use of resources such as setting up in advance and without intervention of an operator.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するため
に、請求項１に記載の発明は、複数のホストコンピュー
タと共有メモリで構成され前記ホストコンピュータの中
の１台がマスタに他のホストコンピュータがスレーブに
設定されたクラスタシステムを監視するクラスタシステ
ム監視端末を１台以上備えたクラスタシステム監視端末
の自動再接続方法において、前記クラスタシステム監視
端末が前記マスタのホストコンピュータに接続されたと
きに、前記マスタのホストコンピュータは該クラスタシ
ステム監視端末のアドレスを取得して該アドレスを前記
共有メモリの共有領域に格納し、前記マスタのホストコ
ンピュータに障害が発生した場合に、前記スレーブのホ
ストコンピュータは該障害を検出して予め設定された順
序にしたがってスレーブからマスタに切り替わった後、
該マスタに切り替わったホストコンピュータは前記共有
領域に格納されている前記クラスタシステム監視端末の
アドレスを読み出して該アドレスに対応した前記クラス
タシステム監視端末と接続するように制御することを特
徴とする。In order to achieve the above object, the invention according to claim 1 comprises a plurality of host computers and a shared memory, wherein one of the host computers serves as a master and another as a master. In the automatic reconnection method of a cluster system monitoring terminal provided with one or more cluster system monitoring terminals in which a computer monitors a cluster system set as a slave, when the cluster system monitoring terminal is connected to the master host computer, The master host computer acquires the address of the cluster system monitoring terminal, stores the address in a shared area of the shared memory, and when a failure occurs in the master host computer, the slave host computer Detects the fault and scans according to the preset order. After switching from over blanking the master,
The host computer switched to the master reads an address of the cluster system monitoring terminal stored in the shared area and performs control so as to connect to the cluster system monitoring terminal corresponding to the address.

【０００７】請求項２に記載の発明は、前記共有メモリ
の共有領域は、請求項１に記載のクラスタシステム監視
端末の自動再接続方法において、前記各クラスタシステ
ム監視端末毎に予め決められた領域であることを特徴と
する。According to a second aspect of the present invention, in the method of automatically reconnecting a cluster system monitoring terminal according to the first aspect, the shared area of the shared memory is an area predetermined for each of the cluster system monitoring terminals. It is characterized by being.

【０００８】請求項３に記載の発明は、前記共有メモリ
の共有領域は、請求項１に記載のクラスタシステム監視
端末において、前記全てのクラスタシステム監視端末に
共通な予め決められた領域であることを特徴とする。According to a third aspect of the present invention, in the cluster system monitoring terminal according to the first aspect, the shared area of the shared memory is a predetermined area common to all the cluster system monitoring terminals. It is characterized by.

【０００９】請求項４に記載の発明は、複数のホストコ
ンピュータと共有メモリで構成され、前記ホストコンピ
ュータの中の１台がマスタに他のホストコンピュータが
スレーブに設定されたクラスタシステムを監視するクラ
スタシステム監視端末を１台以上備えたクラスタシステ
ム監視端末の自動再接続システムにおいて、前記クラス
タシステムの状態を管理するクラスタシステム管理手段
と、前記クラスタシステム監視端末との接続を行う接続
手段と、前記クラスタシステム監視端末のアドレスを前
記共有メモリの共有領域に格納および読み出す手段と、
前記共有メモリの共有領域から読み出した前記アドレス
に対応した前記クラスタシステム監視端末との接続を行
う接続手段と、を少なくとも有するホストコンピュータ
と、前記全てのホストコンピュータからアクセス可能な
共有メモリと、前記ホストコンピュータとの接続を行う
接続手段と、前記クラスタシステムの状態を監視するク
ラスタシステム監視手段と、を少なくとも有する前記ク
ラスタシステム監視端末とを具備し、前記マスタのホス
トコンピュータに障害が発生した場合に、前記スレーブ
のホストコンピュータは該障害を検出して予め設定され
た順序にしたがってスレーブからマスタに切り替わり、
前記共有メモリの共有領域に格納されたアドレスに対応
したクラスタシステム監視端末と接続するように制御す
ることを特徴とする。According to a fourth aspect of the present invention, there is provided a cluster system comprising a plurality of host computers and a shared memory, wherein one of the host computers monitors a cluster system in which one host is set as a master and another host computer is set as a slave. In an automatic reconnection system for a cluster system monitoring terminal having at least one system monitoring terminal, a cluster system management means for managing a state of the cluster system, a connection means for connecting to the cluster system monitoring terminal, Means for storing and reading the address of the system monitoring terminal in the shared area of the shared memory;
A host computer having at least connection means for connecting to the cluster system monitoring terminal corresponding to the address read from the shared area of the shared memory; a shared memory accessible from all the host computers; A connection means for connecting to a computer, and a cluster system monitoring terminal having at least a cluster system monitoring means for monitoring the state of the cluster system, when a failure occurs in the master host computer, The host computer of the slave detects the fault and switches from the slave to the master according to a preset order,
It is characterized in that control is performed so as to connect to a cluster system monitoring terminal corresponding to the address stored in the shared area of the shared memory.

【００１０】請求項５に記載の発明は、前記共有メモリ
の共有領域は、請求項１に記載のクラスタシステム監視
端末の自動再接続システムにおいて、前記各クラスタシ
ステム監視端末毎に予め決められた領域であることを特
徴とする。According to a fifth aspect of the present invention, in the automatic reconnection system for a cluster system monitoring terminal according to the first aspect, the shared area of the shared memory is an area predetermined for each of the cluster system monitoring terminals. It is characterized by being.

【００１１】請求項６に記載の発明は、前記共有メモリ
の共有領域は、請求項１に記載のクラスタシステム監視
端末の自動再接続システムにおいて、前記全てのクラス
タシステム監視端末に共通な予め決められた領域である
ことを特徴とする。According to a sixth aspect of the present invention, in the automatic reconnection system for a cluster system monitoring terminal according to the first aspect, the shared area of the shared memory is predetermined in common to all the cluster system monitoring terminals. Characterized in that the region is

【００１２】[0012]

【発明の実施の形態】以下、本発明の実施の形態による
クラスタシステム監視端末の自動再接続方法および自動
再接続システムを図１を参照して説明する。図１は同実
施の形態によるクラスタシステム監視端末の自動再接続
システムのブロック図である。図１において、１，２は
クラスタシステムを構成するホストコンピュータであ
り、一方がマスタに他方がスレーブに設定される。３は
ホストコンピュータ１とホストコンピュータ２の間での
データの受け渡しに使用する共有メモリの共有領域であ
り、ホストコンピュータ１およびホストコンピュータ２
からデータの読み書きができる手段を備えている。４は
マスタのホストコンピュータに接続され、クラスタシス
テムを監視するクラスタシステム監視端末である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, an automatic reconnection method and an automatic reconnection system for a cluster system monitoring terminal according to an embodiment of the present invention will be described with reference to FIG. FIG. 1 is a block diagram of an automatic reconnection system for a cluster system monitoring terminal according to the embodiment. In FIG. 1, reference numerals 1 and 2 denote host computers constituting a cluster system, one set as a master and the other set as a slave. Reference numeral 3 denotes a shared area of a shared memory used for transferring data between the host computer 1 and the host computer 2. The host computer 1 and the host computer 2
It has a means to read and write data from. Reference numeral 4 denotes a cluster system monitoring terminal connected to the master host computer and monitoring the cluster system.

【００１３】１１，２１はホストコンピュータ１，２の
情報を保持しクラスタシステムの管理を行うクラスタシ
ステム管理手段、１２，２２はそれぞれホストコンピュ
ータ１，２の情報を収集するクラスタ情報収集手段であ
る。また、マスタのホストコンピュータ側のクラスタ情
報収集手段１２または２２は、スレーブのクラスタ情報
収集手段２２または１２からスレーブのホストコンピュ
ータの情報も収集し、クラスタシステム全体の情報をク
ラスタシステム監視端末４に送信する手段も有する。１
３，２３は予め決められた共有領域３からクラスタ監視
端末４のアドレスを読み込むアドレス読込手段、１４，
２４は予め決められた共有領域３にクラスタ監視端末４
のアドレスを格納するアドレス格納手段、１５，２５は
それぞれホストコンピュータ１，２とクラスタ監視端末
４との接続を行う接続手段、１６，２６はクラスタ監視
端末４からの接続要求を受け付ける待受け手段である。Reference numerals 11 and 21 denote cluster system management means for holding information on the host computers 1 and 2 and managing the cluster system, and reference numerals 12 and 22 denote cluster information collection means for collecting information on the host computers 1 and 2, respectively. Further, the cluster information collecting means 12 or 22 on the master host computer side also collects information on the slave host computer from the cluster information collecting means 22 or 12 on the slave, and transmits information on the entire cluster system to the cluster system monitoring terminal 4. There is also a means to do. 1
Address reading means for reading the address of the cluster monitoring terminal 4 from the predetermined shared area 3;
Reference numeral 24 denotes a cluster monitoring terminal 4 in a predetermined shared area 3.
Address storage means for storing the address of the cluster monitoring terminal 4, and connection means 15 and 25 for connecting the host computers 1 and 2 to the cluster monitoring terminal 4, and standby means 16 for receiving a connection request from the cluster monitoring terminal 4. .

【００１４】４１はクラスタシステムより情報を取得し
クラスタシステムの監視を行うクラスタシステム監視手
段、４２はクラスタシステム監視端末４とホストコンピ
ュータ１，２との接続を行う接続手段、４３はホストコ
ンピュータ１，２からの接続要求を受け付ける待受け手
段である。Reference numeral 41 denotes cluster system monitoring means for acquiring information from the cluster system and monitoring the cluster system; 42, connection means for connecting the cluster system monitoring terminal 4 to the host computers 1 and 2; It is a waiting means for receiving a connection request from the server 2.

【００１５】次に、本発明の実施形態の動作を図１を参
照して説明する。具体的には、ホストコンピュータ１が
マスタにホストコンピュータ２がスレーブに設定されて
いて、クラスタシステム監視端末４がマスタのホストコ
ンピュータ１に接続されクラスタシステムの監視を行っ
ている状態で、ホストコンピュータ１に障害が発生し、
マスタがホストコンピュータ１からホストコンピュータ
２に切り替った場合のクラスタシステム監視端末４の自
動再接続動作について説明する。Next, the operation of the embodiment of the present invention will be described with reference to FIG. Specifically, in a state where the host computer 1 is set as the master and the host computer 2 is set as the slave, and the cluster system monitoring terminal 4 is connected to the master host computer 1 and monitors the cluster system, Fails,
The automatic reconnection operation of the cluster system monitoring terminal 4 when the master switches from the host computer 1 to the host computer 2 will be described.

【００１６】操作員はクラスタシステムを監視したり情
報を収集するため、クラスタシステム監視端末４からク
ラスタシステム監視端末４をマスタのホストコンピュー
タ１に接続するように操作する。これにより、クラスタ
システム監視手段４１は、ホストコンピュータ１の待受
け手段１６に対して接続要求を行う。The operator operates the cluster system monitoring terminal 4 to connect the cluster system monitoring terminal 4 to the master host computer 1 in order to monitor the cluster system and collect information. As a result, the cluster system monitoring unit 41 issues a connection request to the standby unit 16 of the host computer 1.

【００１７】クラスタシステム監視手段４１からの接続
要求をホストコンピュータ１の待受け手段１６で受け付
けると、接続手段１５はクラスタ情報収集手段１２をク
ラスタシステム監視手段４１と接続する。そして、待受
け手段１６は、クラスタシステム監視手段４１から受信
したクラスタシステム監視端末４のアドレスａｄｒをア
ドレス格納手段１４に渡す。アドレス格納手段１４は受
け取ったクラスタシステム監視端末４のアドレスａｄｒ
を予め決められた共有領域３に格納する。また、クラス
タ情報収集手段１２は、クラスタシステム監視端末４か
らの要求にしたがって、収集したクラスタシステムの情
報をクラスタシステム監視手段４１に送信する。When the connection request from the cluster system monitoring means 41 is received by the standby means 16 of the host computer 1, the connection means 15 connects the cluster information collecting means 12 to the cluster system monitoring means 41. Then, the standby unit 16 passes the address adr of the cluster system monitoring terminal 4 received from the cluster system monitoring unit 41 to the address storage unit 14. The address storage means 14 receives the address adr of the received cluster system monitoring terminal 4.
Is stored in a predetermined shared area 3. Further, the cluster information collecting means 12 transmits the collected information of the cluster system to the cluster system monitoring means 41 according to the request from the cluster system monitoring terminal 4.

【００１８】そして、ホストコンピュータ１で障害が発
生しマスタのホストコンピュータとしての処理が行えな
くなると、スレーブのホストコンピュータ２のクラスタ
システム管理手段２１はホストコンピュータ１の障害を
検知してマスタの切り換えを行い、ホストコンピュータ
２がマスタとなる。また、クラスタシステム管理手段２
１は、ホストコンピュータ２がマスタに切り替わったこ
とをクラスタ情報収集手段２２に通知する。When a failure occurs in the host computer 1 and the processing as the master host computer cannot be performed, the cluster system management means 21 of the slave host computer 2 detects the failure of the host computer 1 and switches the master. Then, the host computer 2 becomes the master. Cluster system management means 2
1 notifies the cluster information collecting means 22 that the host computer 2 has been switched to the master.

【００１９】次に、クラスタ情報収集手段２２は、予め
決められた共有領域３に格納されているクラスタシステ
ム監視端末４のアドレスａｄｒをアドレス読込手段２３
により読み出し、このアドレスａｄｒを接続手段２５に
渡す。接続手段２５は受け取ったアドレスａｄｒに対応
したクラスタシステム監視端末４に対して接続要求を行
う。Next, the cluster information collecting means 22 reads the address adr of the cluster system monitoring terminal 4 stored in the predetermined shared area 3 into the address reading means 23.
And passes the address adr to the connection means 25. The connection means 25 issues a connection request to the cluster system monitoring terminal 4 corresponding to the received address adr.

【００２０】一方、クラスタシステム監視端末４がホス
トコンピュータ１に接続され、クラスタシステムを監視
している間にホストコンピュータ１に障害が発生する
と、クラスタシステム監視端末４とホストコンピュータ
１の間の通信が切断される。クラスタシステム監視手段
４１は、通信が切断されたことを検知すると、待受け手
段４３にクラスタシステムからの接続を待ち合わせるよ
うに要求する。On the other hand, when the cluster system monitoring terminal 4 is connected to the host computer 1 and a failure occurs in the host computer 1 while monitoring the cluster system, communication between the cluster system monitoring terminal 4 and the host computer 1 is established. Be cut off. When detecting that the communication has been disconnected, the cluster system monitoring means 41 requests the waiting means 43 to wait for a connection from the cluster system.

【００２１】そして、待受け手段４３がホストコンピュ
ータ２の接続手段２５から接続要求を受け付けると、接
続手段４２は接続手段２５との間で接続処理を行い、ク
ラスタシステム監視端末４はマスタに切り替わったホス
トコンピュータ２に接続される。これにより、クラスタ
システム監視端末４はクラスタシステムの監視を再開す
る。その結果、操作員は一時的な処理遅延のみでクラス
タシステムの監視を継続することができる。When the standby unit 43 receives a connection request from the connection unit 25 of the host computer 2, the connection unit 42 performs a connection process with the connection unit 25, and the cluster system monitoring terminal 4 switches the host that has been switched to the master. Connected to computer 2. As a result, the cluster system monitoring terminal 4 resumes monitoring the cluster system. As a result, the operator can continue monitoring the cluster system only with a temporary processing delay.

【００２２】次に、本発明の他の実施の形態を図２を参
照して説明する。図２はクラスタシステム監視端末が２
台存在する場合のクラスタシステム監視端末の自動再接
続システムを示す図である。図２において、図１と同一
部分には同一符号を付してその説明を省略する。図２に
おいて、５はクラスタシステム監視端末であり、３１，
３２はそれぞれクラスタシステム監視端末４，５のアド
レスａｄｒ１，ａｄｒ２を格納する共有メモリの共有領
域である。Next, another embodiment of the present invention will be described with reference to FIG. FIG. 2 shows two cluster system monitoring terminals.
FIG. 11 is a diagram illustrating an automatic reconnection system of a cluster system monitoring terminal when there are multiple monitoring terminals; 2, the same parts as those in FIG. 1 are denoted by the same reference numerals, and the description thereof will be omitted. In FIG. 2, reference numeral 5 denotes a cluster system monitoring terminal;
Reference numeral 32 denotes a shared area of a shared memory for storing addresses adr1 and adr2 of the cluster system monitoring terminals 4 and 5, respectively.

【００２３】次に、本発明の他の実施形態の動作につい
て図２を参照して説明する。具体的には、ホストコンピ
ュータ１がマスタにホストコンピュータ２がスレーブに
設定されていて、クラスタシステム監視端末４，５がマ
スタのホストコンピュータ１に接続されクラスタシステ
ムの監視を行っている状態で、ホストコンピュータ１に
障害が発生し、マスタがホストコンピュータ１からホス
トコンピュータ２に切り替った場合のクラスタシステム
監視端末４，５の自動再接続動作について説明する。Next, the operation of another embodiment of the present invention will be described with reference to FIG. Specifically, in a state where the host computer 1 is set as the master and the host computer 2 is set as the slave, and the cluster system monitoring terminals 4 and 5 are connected to the master host computer 1 and monitor the cluster system, An automatic reconnection operation of the cluster system monitoring terminals 4 and 5 when a failure occurs in the computer 1 and the master is switched from the host computer 1 to the host computer 2 will be described.

【００２４】上記した本発明の実施形態と同様の動作に
より、クラスタシステム監視端末４がホストコンピュー
タ１に接続されると、クラスタ情報収集手段２２は受け
取ったクラスタシステム監視端末４のアドレスａｄｒ１
をアドレス格納手段１４により共有領域３１に格納す
る。同様に、クラスタシステム監視端末５がホストコン
ピュータ１に接続されると、クラスタ情報収集手段２２
は受け取ったクラスタシステム監視端末５のアドレスａ
ｄｒ２をアドレス格納手段１４により共有領域３２に格
納する。When the cluster system monitoring terminal 4 is connected to the host computer 1 by the same operation as the above-described embodiment of the present invention, the cluster information collecting means 22 receives the address adr1 of the received cluster system monitoring terminal 4.
Is stored in the shared area 31 by the address storage means 14. Similarly, when the cluster system monitoring terminal 5 is connected to the host computer 1, the cluster information collecting means 22
Is the address a of the received cluster system monitoring terminal 5
Dr2 is stored in the shared area 32 by the address storage means 14.

【００２５】そして、マスタのホストコンピュータ１で
障害が発生すると、クラスタシステム管理機能２１はこ
の障害を検知し、スレーブのホストコンピュータ２はマ
スタに切り替わる。クラスタ情報収集手段２２は、クラ
スタシステム監視端末４，５のアドレスａｄｒ１，ａｄ
ｒ２をそれぞれ共有領域３１，３２からアドレス読込手
段２３により読み出して接続手段２５に渡す。接続手段
２５は受け取ったアドレスａｄｒ１，ａｄｒ２に対応し
たクラスタシステム監視端末４，５に対して接続要求を
行う。クラスタシステム監視端末４，５は、マスタに切
り替わったホストコンピュータ２からの接続要求を受け
付けると、それぞれホストコンピュータ２との間で接続
処理を行い、ホストコンピュータ２に再接続されてクラ
スタシステムの監視を継続する。When a failure occurs in the master host computer 1, the cluster system management function 21 detects this failure, and the slave host computer 2 switches to the master. The cluster information collecting means 22 stores the addresses adr1, ad of the cluster system monitoring terminals 4, 5.
r2 is read from the shared areas 31 and 32 by the address reading means 23 and passed to the connection means 25. The connection means 25 issues a connection request to the cluster system monitoring terminals 4 and 5 corresponding to the received addresses adr1 and adr2. Upon receiving the connection request from the host computer 2 switched to the master, the cluster system monitoring terminals 4 and 5 respectively perform connection processing with the host computer 2 and are reconnected to the host computer 2 to monitor the cluster system. continue.

【００２６】なお、ホストコンピュータが３台以上の場
合および／またはクラスタシステム監視端末が３台以上
の場合についても、同様に、本発明を適用し実施するこ
とができる。The present invention can be applied and implemented similarly when there are three or more host computers and / or when there are three or more cluster system monitoring terminals.

【００２７】[0027]

【発明の効果】以上説明したように、この発明によれ
ば、マスタのホストコンピュータにクラスタシステム監
視端末が接続されたときに、クラスタシステム監視端末
のアドレスを取得して共有メモリの共有領域に格納して
おき、マスタのホストコンピュータに障害が発生した場
合に、マスタの切り替えによりスレーブからマスタに切
り替わったホストコンピュータは、共有領域に格納され
ているクラスタシステム監視端末のアドレスを読み出
し、そのアドレスに対応したクラスタシステム監視端末
に接続するようにしたので、スレーブのホストコンピュ
ータとの間にセッションを予め張っておく必要がなく資
源の無駄使いを回避できる。また、操作員の介入を必要
とすることなく、クラスタシステム監視端末をマスタに
切り替わったホストコンピュータに再接続し、クラスタ
システムの監視を継続することができる。As described above, according to the present invention, when the cluster system monitoring terminal is connected to the master host computer, the address of the cluster system monitoring terminal is acquired and stored in the shared area of the shared memory. If a failure occurs in the master host computer, the host computer switched from the slave to the master by switching the master reads the address of the cluster system monitoring terminal stored in the shared area and responds to that address. Since the connection is made to the cluster system monitoring terminal, it is not necessary to establish a session with the slave host computer in advance, and wasteful use of resources can be avoided. Further, the cluster system monitoring terminal can be reconnected to the host computer that has been switched to the master, and the monitoring of the cluster system can be continued without requiring the intervention of an operator.

【図面の簡単な説明】[Brief description of the drawings]

【図１】この発明の一実施形態によるクラスタシステ
ム監視端末の自動再接続システムの構成を示すブロック
図である。FIG. 1 is a block diagram showing a configuration of an automatic reconnection system for a cluster system monitoring terminal according to an embodiment of the present invention.

【図２】この発明の他の実施形態によるクラスタシス
テム監視端末の自動再接続システムの構成を示すブロッ
ク図である。FIG. 2 is a block diagram showing a configuration of an automatic reconnection system for a cluster system monitoring terminal according to another embodiment of the present invention.

[Explanation of symbols]

１，２…ホストコンピュータ３，３１，３２…共
有領域４，５…クラスタシステム監視端末１１，２１
…クラスタシステム管理手段１２，２２…クラスタ情報収集手段１３，２３
…アドレス読込手段１４，２４…アドレス格納手段１５，２５…接続手
段１６，２６…待受け手段４１…クラスタシステム監視
手段４２…接続手段４３…待受け手段1, 2 ... host computer 3, 31, 32 ... shared area 4, 5 ... cluster system monitoring terminal 11, 21
... cluster system management means 12, 22 ... cluster information collection means 13, 23
... Address reading means 14,24 ... Address storage means 15,25 ... Connection means 16,26 ... Standing means 41 ... Cluster system monitoring means 42 ... Connection means 43 ... Standing means

Claims

[Claims]

1. A cluster system monitoring terminal comprising a plurality of host computers and a shared memory, wherein one of the host computers monitors a cluster system in which one of the host computers is a master and another host computer is a slave.
In the automatic reconnection method for a cluster system monitoring terminal provided with at least one cluster system monitoring terminal, when the cluster system monitoring terminal is connected to the master host computer, the master host computer acquires an address of the cluster system monitoring terminal. The address is stored in a shared area of the shared memory, and when a failure occurs in the master host computer, the slave host computer detects the failure and switches from the slave to the master in a preset order. After that, the host computer switched to the master reads the address of the cluster system monitoring terminal stored in the shared area, and controls to connect to the cluster system monitoring terminal corresponding to the address. Cluster system monitoring Automatic re-connection method of the youngest.

2. The method according to claim 1, wherein the shared area of the shared memory is an area predetermined for each of the cluster system monitoring terminals.

3. The method according to claim 1, wherein the shared area of the shared memory is a predetermined area common to all the cluster system monitoring terminals. .

4. A cluster system monitoring terminal comprising at least one host computer and a shared memory, wherein at least one cluster system monitoring terminal monitors a cluster system in which one of the host computers is set as a master and the other host computer is set as a slave. An automatic reconnection system for a cluster system monitoring terminal, comprising: a cluster system management unit that manages a state of the cluster system; a connection unit that connects to the cluster system monitoring terminal; and an address of the cluster system monitoring terminal. A host computer having at least: means for storing and reading data in and out of the shared area of the shared memory; and connecting means for connecting to the cluster system monitoring terminal corresponding to the address read from the shared area of the shared memory. Access from the host computer The cluster system monitoring terminal having at least: an accessible shared memory; connection means for connecting to the host computer; and cluster system monitoring means for monitoring the state of the cluster system. When a failure occurs in the host computer, the slave host computer detects the failure and switches from the slave to the master according to a preset order, and the cluster corresponding to the address stored in the shared area of the shared memory. An automatic reconnection system for a cluster system monitoring terminal, which is controlled to connect to a system monitoring terminal.

5. The automatic reconnection system for a cluster system monitoring terminal according to claim 1, wherein the shared area of the shared memory is an area predetermined for each of the cluster system monitoring terminals.

6. The automatic reconnection system for a cluster system monitoring terminal according to claim 1, wherein the shared area of the shared memory is a predetermined area common to all the cluster system monitoring terminals. .