JP2015132979A

JP2015132979A - Information processing device and control method thereof, control program and recording medium

Info

Publication number: JP2015132979A
Application number: JP2014004007A
Authority: JP
Inventors: 保利鈴木; Yasutoshi Suzuki; 朋美大塚; Tomomi Otsuka
Original assignee: Fujitsu Ltd; Fujitsu Social Science Labs Ltd
Current assignee: Fujitsu Ltd; Fujitsu Social Science Labs Ltd
Priority date: 2014-01-14
Filing date: 2014-01-14
Publication date: 2015-07-23

Abstract

PROBLEM TO BE SOLVED: To provide an information processing device capable of reducing the load of collecting a memory dump.SOLUTION: A guest driver 121 notifies a host OS 11 of the occurrence of a fault by generating an interrupt in a virtual bus 21 allocated to a virtual machine 124 in which a guest OS 12 with the fault occurring therein operates. The host OS 11 acquires process identification information from a virtualization ID/process list 117 on the basis of a virtualization ID of the virtual bus 21 with the interrupt generated, acquires the virtual machine 124 in which the guest OS 12 with the fault occurring operates from a virtual machine/process list 118 on the basis of the process identification information, and collects contents of a memory 123 in a storage state storage area 114.

Description

本発明は、情報処理装置及びその制御方法、制御プログラム並びに記録媒体に関する。 The present invention relates to an information processing apparatus, a control method thereof, a control program, and a recording medium.

複数の仮想マシンが動作する情報処理システムにおいては、例えば、仮想計算機（仮想マシン）はホストＯＳ（オペレーティングシステム）のプロセスとして実行され、仮想マシン上でゲストＯＳ上が動作する。ゲストＯＳにおいてクラッシュが発生して処理が停止する等した場合、メモリの内容とログ情報とを取得して、クラッシュの原因が調査される。 In an information processing system in which a plurality of virtual machines operate, for example, a virtual machine (virtual machine) is executed as a process of a host OS (operating system), and the guest OS operates on the virtual machine. When a crash occurs in the guest OS and processing stops, the contents of the memory and log information are acquired, and the cause of the crash is investigated.

例えば、仮想計算機システムにおいて、ホスト仮想計算機が、ゲスト仮想計算機についてのシステム情報を退避する退避位置であって、ホスト仮想計算機が使用するログ部における退避位置を予め定めるゲスト格納定義を備え、仮想計算機モニタが、ゲスト仮想計算機を監視して、ゲスト仮想計算機に発生した障害を検出するパニック監視管理部と、パニック監視管理部によりゲスト仮想計算機における障害が検出された場合、ゲスト格納定義に基づいて、退避位置からシステム情報を採取するログ採取部とを備えることが提案されている。 For example, in a virtual machine system, the host virtual machine has a guest storage definition for prescribing a save position in a log unit used by the host virtual machine, which is a save position for saving system information about the guest virtual machine. The monitor monitors the guest virtual machine and detects a failure that occurred in the guest virtual machine. When a failure in the guest virtual machine is detected by the panic monitoring manager, based on the guest storage definition, It has been proposed to include a log collection unit that collects system information from the retreat position.

特開２０１０−０８６１８１号公報JP 2010-086181 A

例えば、ゲスト仮想マシンのメモリが２ＧＢの容量を持つ場合、メモリダンプを採取して格納するためには、磁気ディスク装置のような記憶装置における記憶領域（以下、ディスク領域）として、２ＧＢを各々の仮想マシンに予め割り当てておく必要がある。従って、例えば１０個の仮想マシンが存在する場合、１０個の仮想マシンの全てについてメモリダンプを採取するとすれば、２０ＧＢのディスク領域が必要となる。 For example, if the memory of a guest virtual machine has a capacity of 2 GB, in order to collect and store a memory dump, 2 GB is used as a storage area (hereinafter referred to as a disk area) in a storage device such as a magnetic disk device. It is necessary to assign to the virtual machine in advance. Therefore, for example, if there are 10 virtual machines, if a memory dump is collected for all 10 virtual machines, a disk area of 20 GB is required.

しかし、情報処理システムで使用される記憶装置は高価である。従って、発生が予測できないゲストＯＳのクラッシュに備えて、大量のディスク領域を獲得しておくことは、メモリダンプ採取の負担（コスト）の観点からは好ましくない。そこで、メモリダンプの採取のためのディスク領域を減らしてしまうと、実際にクラッシュが発生しても、メモリダンプが採取されない場合があり、結果的に原因調査が長期化する。 However, the storage device used in the information processing system is expensive. Therefore, acquiring a large amount of disk area in preparation for a guest OS crash that cannot be predicted is not preferable from the viewpoint of memory dump collection burden (cost). Therefore, if the disk area for collecting the memory dump is reduced, even if a crash actually occurs, the memory dump may not be collected. As a result, the cause investigation becomes longer.

本発明は、一側面によれば、メモリダンプの採取の負担を軽減することが可能な情報処理装置を提供することを目的とする。 An object of the present invention, according to one aspect, is to provide an information processing apparatus that can reduce the burden of collecting a memory dump.

情報処理装置は、一側面によれば、複数の仮想マシンが動作する情報処理装置であって、第１の記憶手段と、第２の記憶手段と、保存手段とを含む。第１の記憶手段は、前記複数の仮想マシンを制御する仮想マシン制御部と前記複数の仮想マシンそれぞれとを結ぶ複数の仮想バスを識別する複数の仮想バス識別情報と、前記複数の仮想マシンにそれぞれ対応した前記仮想マシン制御部のプロセスを識別する複数のプロセス識別情報との対応関係である第１対応情報を記憶する。第２の記憶手段は、前記複数のプロセス識別情報と、前記複数の仮想マシンを識別する複数の仮想マシン識別情報との対応関係である第２対応情報を記憶する。保存手段は、前記複数の仮想マシンに含まれる第１の仮想マシンで動作する第１のＯＳより前記第１の仮想マシンに対応する第１の仮想バスに前記第１のＯＳの障害に対応した割り込みがあった場合に、前記第１対応情報および前記第２対応情報に基づき第１の仮想マシンを特定し、前記第１の仮想マシン上で動作する前記第１のＯＳが使用するメモリ領域の内容を保存する。 According to one aspect, the information processing apparatus is an information processing apparatus on which a plurality of virtual machines operate, and includes a first storage unit, a second storage unit, and a storage unit. The first storage means includes a plurality of virtual bus identification information for identifying a plurality of virtual buses connecting the virtual machine control unit for controlling the plurality of virtual machines and the plurality of virtual machines, and the plurality of virtual machines. First correspondence information that is a correspondence relationship with a plurality of process identification information for identifying processes of the corresponding virtual machine control units is stored. The second storage means stores second correspondence information that is a correspondence relationship between the plurality of process identification information and the plurality of virtual machine identification information for identifying the plurality of virtual machines. The storage means copes with the failure of the first OS on the first virtual bus corresponding to the first virtual machine from the first OS operating on the first virtual machine included in the plurality of virtual machines. When there is an interrupt, the first virtual machine is identified based on the first correspondence information and the second correspondence information, and the memory area used by the first OS operating on the first virtual machine Save the contents.

情報処理装置は、一側面によれば、メモリダンプの採取の負担を軽減することができる。 According to one aspect, the information processing apparatus can reduce the burden of collecting a memory dump.

情報処理システムの構成の一例を示す図である。It is a figure which shows an example of a structure of an information processing system. 情報処理システムの構成の一例を示す図であるIt is a figure showing an example of composition of an information processing system 情報処理システムの説明図である。It is explanatory drawing of an information processing system. 情報処理システムの説明図である。It is explanatory drawing of an information processing system. 識別情報作成処理フローである。It is an identification information creation processing flow. 識別情報作成処理フローである。It is an identification information creation processing flow. 識別情報作成処理フローである。It is an identification information creation processing flow. 識別情報作成処理フローである。It is an identification information creation processing flow. ダンプ採取処理フローである。It is a dump collection processing flow. ダンプ採取処理フローである。It is a dump collection processing flow. ダンプ採取処理フローである。It is a dump collection processing flow.

図１は、情報処理システムの構成の一例を示す図である。 FIG. 1 is a diagram illustrating an example of a configuration of an information processing system.

複数の仮想マシンが動作するシステム（情報処理システム）は、複数のＯＳ（オペレーティングシステム）１、ハイパーバイザ２、ハードウェア３を備える情報処理装置、換言すれば、物理計算機である。複数のＯＳ１は、例えば、１個のホストＯＳ１１と、複数のゲストＯＳ１２とを含む。ホストＯＳ１１は、例えば、図２を参照して後述する仮想ＣＰＵ１１９上で動作する。ゲストＯＳ１２は、例えば、図２を参照して後述する仮想マシン１２４上で動作する。 A system (information processing system) on which a plurality of virtual machines operate is an information processing apparatus including a plurality of OSs (operating systems) 1, a hypervisor 2, and hardware 3, in other words, a physical computer. The plurality of OSs 1 include, for example, one host OS 11 and a plurality of guest OSs 12. For example, the host OS 11 operates on a virtual CPU 119 described later with reference to FIG. The guest OS 12 operates on, for example, a virtual machine 124 described later with reference to FIG.

ハードウェア３は、物理装置、例えば、物理ＣＰＵと、物理メモリと、物理Ｉ／Ｏ装置、換言すれば、物理入出力装置とを含む。物理メモリは主記憶装置（主メモリ）であり、物理Ｉ／Ｏ装置は例えば磁気ディスク装置のような周辺記憶装置である。 The hardware 3 includes a physical device, for example, a physical CPU, a physical memory, a physical I / O device, in other words, a physical input / output device. The physical memory is a main storage device (main memory), and the physical I / O device is a peripheral storage device such as a magnetic disk device.

ハイパーバイザ２は、ハードウェア３に含まれる物理的なＣＰＵ、物理的なメモリ、物理的なＩ／Ｏ装置等の物理装置を仮想化する。そして、ハイパーバイザ２は、仮想化された物理装置、換言すれば、仮想装置に対する要求を物理装置に転送し、また、物理装置からの応答を仮想装置に転送する。これにより、ハイパーバイザ２は仮想マシン１２４を構成する。 The hypervisor 2 virtualizes physical devices such as a physical CPU, a physical memory, and a physical I / O device included in the hardware 3. The hypervisor 2 transfers a virtualized physical device, in other words, a request for the virtual device to the physical device, and transfers a response from the physical device to the virtual device. As a result, the hypervisor 2 configures the virtual machine 124.

図２は、情報処理システムの構成の一例を示す図である。なお、図２においては、情報処理システムは複数の仮想マシン１２４及び複数のゲストＯＳ１２を含むが、１個の仮想マシン１２４及び１個のゲストＯＳ１２のみを示す。 FIG. 2 is a diagram illustrating an example of the configuration of the information processing system. In FIG. 2, the information processing system includes a plurality of virtual machines 124 and a plurality of guest OS 12, but only one virtual machine 124 and one guest OS 12 are shown.

情報処理システムは、１個のホストＯＳ１１と、複数の仮想マシン１２４とを含む。ホストＯＳ１１は、仮想マシン制御部であり、複数の仮想マシン１２４を制御する。各々の仮想マシン１２４上では、ゲストＯＳ１２が動作する。 The information processing system includes one host OS 11 and a plurality of virtual machines 124. The host OS 11 is a virtual machine control unit and controls a plurality of virtual machines 124. On each virtual machine 124, the guest OS 12 operates.

ホストＯＳ１１は、仮想ＣＰＵ１１９上で動作し、ダンプ採取ツール１１１と、ダンプ採取サービス部１１２と、ホストドライバ１１３と、保存状態格納領域１１４と、ダンプ格納領域１１５とを含む。図１の例では、ダンプ採取ツール１１１が、ダンプ採取サービス部１１２と、ホストドライバ１１３とを含む。ダンプ採取ツール１１１は、ダンプ採取サービス部１１２及びホストドライバ１１３とは別に設けるようにしてもよい。ゲストＯＳ１２は、仮想マシン１２４上で動作し、ゲストドライバ１２１を含む。仮想マシン１２４は、仮想デバイス１２２と、メモリ１２３と、仮想ＣＰＵ１２５とを含む。ハイパーバイザ２は、仮想バス２１を含む。 The host OS 11 operates on the virtual CPU 119 and includes a dump collection tool 111, a dump collection service unit 112, a host driver 113, a saved state storage area 114, and a dump storage area 115. In the example of FIG. 1, the dump collection tool 111 includes a dump collection service unit 112 and a host driver 113. The dump collection tool 111 may be provided separately from the dump collection service unit 112 and the host driver 113. The guest OS 12 operates on the virtual machine 124 and includes a guest driver 121. The virtual machine 124 includes a virtual device 122, a memory 123, and a virtual CPU 125. The hypervisor 2 includes a virtual bus 21.

ホストＯＳ１１は、オペレーティングシステム又は制御プログラムであり、ハイパーバイザ２を呼び出して、仮想マシン１２４に仮想化されたＣＰＵ、メモリ、Ｉ／Ｏ装置等の資源を配分する。ホストＯＳ１１は、仮想マシン１２４の実行を制御する仮想マシン制御部である。ホストＯＳ１１は、例えば、ホストマシン上で動作する。 The host OS 11 is an operating system or a control program, and calls the hypervisor 2 to allocate resources such as a virtualized CPU, memory, and I / O device to the virtual machine 124. The host OS 11 is a virtual machine control unit that controls execution of the virtual machine 124. The host OS 11 operates on a host machine, for example.

ホストマシンは、例えば、ハイパーバイザ２上に１個だけ存在する特権的な仮想マシンである。ホストマシンである特権的な仮想マシンは、他の仮想マシン１２４、換言すれば、ゲストＯＳ１２が動作する仮想マシン１２４と異なり、直接、物理装置にアクセスすることができる。従って、ホストマシンである特権的な仮想マシンは、ゲストＯＳ１２が動作する仮想マシン１２４とは、動作が全く異なる。 The host machine is, for example, a privileged virtual machine that exists only on the hypervisor 2. A privileged virtual machine that is a host machine can directly access a physical device, unlike other virtual machines 124, in other words, a virtual machine 124 in which the guest OS 12 operates. Therefore, the privileged virtual machine that is the host machine is completely different from the virtual machine 124 in which the guest OS 12 operates.

また、ホストマシンは、例えば、物理マシンそれ自体であってもよい。また、ホストＯＳ１１、換言すれば、仮想マシン１２４の実行を制御する仮想マシン制御部は、仮想マシン上で動作する必要はなく、ハイパーバイザ２上で動作する制御プログラムや、仮想ＣＰＵ１１９上ではなく物理ＣＰＵ上で動作する制御プログラムであってもよい。 The host machine may be a physical machine itself, for example. In addition, the host OS 11, in other words, the virtual machine control unit that controls the execution of the virtual machine 124 does not need to operate on the virtual machine, and does not operate on the control program that operates on the hypervisor 2 or on the virtual CPU 119. It may be a control program that runs on the CPU.

以上から、図２においては、仮想マシン制御部であるホストＯＳ１１が動作するホストマシンの図示を省略している。一方、仮想マシン制御部であるホストＯＳ１１が、ハイパーバイザ２により仮想化される仮想ＣＰＵ１１９上で動作しているように図示しているが、ホストＯＳ１１は、前述したように、仮想ＣＰＵ１１９ではなく、直接、物理ＣＰＵ上で動作しても、又は、物理ＣＰＵに対応する論理ＣＰＵ上で動作してもよい。 From the above, in FIG. 2, illustration of a host machine on which the host OS 11 that is a virtual machine control unit operates is omitted. On the other hand, the host OS 11 that is the virtual machine control unit is illustrated as operating on the virtual CPU 119 that is virtualized by the hypervisor 2, but the host OS 11 is not the virtual CPU 119, as described above. It may operate directly on a physical CPU or may operate on a logical CPU corresponding to the physical CPU.

仮想マシン１２４は、ホストＯＳ１１によって割り当てられた仮想装置、換言すれば、ゲストマシンとして構成される。仮想マシン１２４は、ホストＯＳ１１上のプロセス、具体的には、子プロセスとして実行される。仮想マシン１２４上でゲストＯＳ１２が動作する。なお、仮想マシン１２４上にゲストＯＳ１２が存在しなくてもよい。 The virtual machine 124 is configured as a virtual device assigned by the host OS 11, in other words, a guest machine. The virtual machine 124 is executed as a process on the host OS 11, specifically, as a child process. The guest OS 12 operates on the virtual machine 124. Note that the guest OS 12 does not have to exist on the virtual machine 124.

ゲストＯＳ１２は、動作している仮想マシン１２４上、換言すれば、仮想ＣＰＵ１２５及び仮想化されたメモリ１２３上で動作し、仮想化されたＩ／Ｏ装置、例えば仮想デバイス１２２にアクセスすることができる。ゲストＯＳ１２は、ホストＯＳ１１又はハイパーバイザ２を経由して、仮想装置に対するアクセスを物理装置に転送し、アクセスの結果を物理装置から返信される。 The guest OS 12 operates on the operating virtual machine 124, in other words, the virtual CPU 125 and the virtualized memory 123, and can access a virtualized I / O device, for example, the virtual device 122. . The guest OS 12 transfers access to the virtual device to the physical device via the host OS 11 or the hypervisor 2, and the access result is returned from the physical device.

ゲストＯＳ１２において、ゲストドライバ１２１は、ホストＯＳ１１に障害の発生を通知する通知手段であり、複数のゲストＯＳ１２の各々に設けられる。ゲストドライバ１２１は、自己の属するゲストＯＳ１２、換言すれば、対応するゲストＯＳ１２に障害が発生した場合に、ホストＯＳ１１に、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１に割り込みを発生することにより、障害の発生を通知する。換言すれば、仮想マシン１２４は、その上で動作するゲストＯＳ１２に障害が発生した場合に、自己に割り当てられた仮想バス２１に割り込みを発生することにより、障害の発生を通知する。ゲストドライバ１２１は、複数のゲストＯＳ１２の各々に設けられる。ゲストドライバ１２１は、例えば、カーネルドライバであり、ゲストＯＳ１２に設けられる。 In the guest OS 12, the guest driver 121 is a notification unit that notifies the host OS 11 of the occurrence of a failure, and is provided in each of the plurality of guest OSs 12. When a failure occurs in the guest OS 12 to which the guest driver 121 belongs, in other words, in the corresponding guest OS 12, the virtual bus 21 assigned to the host machine 11 and the virtual machine 124 in which the failed guest OS 12 operates. The occurrence of a fault is notified by generating an interrupt. In other words, when a failure occurs in the guest OS 12 that operates on the virtual machine 124, the virtual machine 124 notifies the occurrence of the failure by generating an interrupt to the virtual bus 21 assigned to itself. The guest driver 121 is provided in each of the plurality of guest OSs 12. The guest driver 121 is a kernel driver, for example, and is provided in the guest OS 12.

具体的には、ゲストドライバ１２１は、ゲストＯＳ１２のパニック出口としてゲストＯＳ１２に登録される。例えば、ゲストドライバ１２１は、ゲストＯＳ１２の起動時に呼び出されて、ゲストＯＳ１２の予め定められた位置、具体的にはパニック出口に組み込まれる。そして、ゲストドライバ１２１は、ゲストＯＳ１２にクラッシュが発生した場合に、ゲストＯＳ１２から呼び出され、仮想マシンとハードウェアとの間を結ぶ仮想バス２１に割り込みを発生させる。 Specifically, the guest driver 121 is registered in the guest OS 12 as a panic exit of the guest OS 12. For example, the guest driver 121 is called when the guest OS 12 is activated, and is incorporated in a predetermined position of the guest OS 12, specifically, a panic exit. The guest driver 121 is called from the guest OS 12 when a crash occurs in the guest OS 12, and generates an interrupt on the virtual bus 21 connecting the virtual machine and the hardware.

仮想デバイス１２２は、複数の仮想マシン１２４の各々に設けられる記憶装置である。仮想デバイス１２２は、自己が接続された仮想マシン１２４がホストＯＳ１１のプロセスとして実行される場合における当該プロセスを識別するプロセス識別情報を格納する。換言すれば、仮想デバイス１２２は、自己が接続された仮想マシン１２４に対応した、仮想マシン制御部であるホストＯＳ１１のプロセスを識別するプロセス識別情報を格納する。 The virtual device 122 is a storage device provided in each of the plurality of virtual machines 124. The virtual device 122 stores process identification information for identifying the process when the virtual machine 124 to which the virtual device 122 is connected is executed as a process of the host OS 11. In other words, the virtual device 122 stores process identification information that identifies the process of the host OS 11 that is the virtual machine control unit corresponding to the virtual machine 124 to which the virtual device 122 is connected.

プロセス識別情報は、例えば、ホストＯＳ１１のダンプ採取サービス部１１２により、仮想デバイス１２２に書き込まれる。プロセス識別情報の書込みは、例えば、仮想マシン１２４の生成時に実行される。これは、プロセス識別情報が仮想マシン１２４に対応して定まるためである。 For example, the process identification information is written to the virtual device 122 by the dump collection service unit 112 of the host OS 11. The process identification information is written, for example, when the virtual machine 124 is created. This is because the process identification information is determined corresponding to the virtual machine 124.

実際には、仮想デバイス１２２としては、例えば、ＤＶＤ（ＤｉｊｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）装置が用いられる。例えば、仮想デバイス１２２は、仮想マシン１２４に動的に組み込まれる。換言すれば、仮想デバイス１２２は、仮想マシン１２４に動的に組み込むことができる記憶装置であればよい。これにより、仮想マシン１２４の生成に応じて、生成される仮想マシン１２４に対応するプロセスのプロセス識別情報を仮想デバイス１２２に書き込み、生成される仮想マシン１２４で動作するゲストＯＳ１２のゲストドライバ１２１にプロセス識別情報を通知することができる。 Actually, as the virtual device 122, for example, a DVD (Digital Versatile Disc) apparatus is used. For example, the virtual device 122 is dynamically incorporated into the virtual machine 124. In other words, the virtual device 122 may be a storage device that can be dynamically incorporated into the virtual machine 124. Thereby, according to the generation of the virtual machine 124, the process identification information of the process corresponding to the generated virtual machine 124 is written in the virtual device 122, and the process is stored in the guest driver 121 of the guest OS 12 operating on the generated virtual machine 124. Identification information can be notified.

例えば、プロセス識別情報を書き込んだ仮想ＤＶＤが作成され、また、仮想化されたＤＶＤ読み込み装置が作成される。仮想ＤＶＤが挿入された仮想ＤＶＤ読み込み装置が、仮想デバイス１２２として、起動される仮想マシン１２４に接続される。 For example, a virtual DVD in which process identification information is written is created, and a virtualized DVD reader is created. The virtual DVD reading device into which the virtual DVD is inserted is connected as a virtual device 122 to the virtual machine 124 to be activated.

仮想デバイス１２２は、ＤＶＤ装置に限られず、他の種々の記憶装置を利用することができる。例えば、仮想マシン１２４の定義に追加することができるデバイス又はリソースであって、プロセス化された仮想マシン１２４の仮想デバイス１２２に対してデータを書き込むことができ、かつ、仮想デバイス１２２からゲストドライバ１２１がデータを読み出すことができるデバイスであればよい。 The virtual device 122 is not limited to a DVD device, and other various storage devices can be used. For example, a device or resource that can be added to the definition of the virtual machine 124, data can be written to the virtual device 122 of the processed virtual machine 124, and the guest driver 121 can be written from the virtual device 122. Any device that can read data is acceptable.

メモリ１２３は、ハードウェア３に含まれる主メモリの一部である。メモリ１２３は、仮想化されて複数の仮想マシン１２４の各々に含まれ、ゲストＯＳ１２が使用するメモリ領域である。 The memory 123 is a part of the main memory included in the hardware 3. The memory 123 is a memory area that is virtualized and included in each of the plurality of virtual machines 124 and is used by the guest OS 12.

ハイパーバイザ２において、仮想バス２１は、複数の仮想マシン１２４の各々と物理ＣＰＵとの間を、例えば論理的に結ぶ仮想的な割り込み信号線である。ゲストＯＳ１２は、障害が発生すると、自己が動作している仮想マシン１２４の仮想ＣＰＵに、実行不可能な命令を実行させる。これにより、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４における実行例外が発生するので、最終的に、物理ＣＰＵが仮想バス２１への割り込みを捕捉することになる。従って、仮想バス２１は、ゲストＯＳ１２を実行中の仮想マシン１２４から物理ＣＰＵへの割り込みを発生することになる。一方、前述したように、ホストＯＳ１１は、特権的な仮想マシン又は物理マシンであるホストマシン上で動作するので、物理ＣＰＵにより捕捉された仮想バス２１への割り込みを処理する。結果として、仮想バス２１は、複数の仮想マシン１２４の各々とホストＯＳ１１との間を論理的に結ぶことになる。換言すれば、複数の仮想バス２１は、仮想マシン制御部と複数の仮想マシンそれぞれとを結ぶ。複数の仮想バス２１は、ハイパーバイザ２により、複数の仮想マシン１２４の各々に１対１に割り当てられ、仮想バス識別情報を与えられる。仮想バス２１への割り込みは、物理ＣＰＵへの物理的な割り込み信号線への割り込み、換言すれば、ハードウェア割り込みと同一の優先度、換言すれば、最優先の割り込みとされる。 In the hypervisor 2, the virtual bus 21 is a virtual interrupt signal line that logically connects each of the plurality of virtual machines 124 and the physical CPU, for example. When a failure occurs, the guest OS 12 causes the virtual CPU of the virtual machine 124 in which the guest OS 12 is operating to execute an instruction that cannot be executed. As a result, an execution exception occurs in the virtual machine 124 in which the guest OS 12 in which the failure has occurred operates, so that the physical CPU eventually captures an interrupt to the virtual bus 21. Therefore, the virtual bus 21 generates an interrupt from the virtual machine 124 executing the guest OS 12 to the physical CPU. On the other hand, as described above, the host OS 11 operates on a host machine that is a privileged virtual machine or physical machine, and therefore processes an interrupt to the virtual bus 21 captured by the physical CPU. As a result, the virtual bus 21 logically connects each of the plurality of virtual machines 124 and the host OS 11. In other words, the plurality of virtual buses 21 connect the virtual machine control unit and each of the plurality of virtual machines. The plurality of virtual buses 21 are assigned one by one to each of the plurality of virtual machines 124 by the hypervisor 2 and given virtual bus identification information. The interrupt to the virtual bus 21 is an interrupt to the physical interrupt signal line to the physical CPU, in other words, the same priority as the hardware interrupt, in other words, the highest priority interrupt.

仮想マシン制御部であるホストＯＳ１１は、ゲストＯＳ１２に障害が発生した場合に、後述する第１対応情報および第２対応情報に基づいて、ゲストＯＳ１２が使用するメモリ１２３の内容を保存する。 When a failure occurs in the guest OS 12, the host OS 11 serving as the virtual machine control unit stores the contents of the memory 123 used by the guest OS 12 based on first correspondence information and second correspondence information described later.

例えば、複数の仮想マシン１２４に含まれるある仮想マシン１２４で動作するゲストＯＳ１２に障害が発生したとする。障害が発生したゲストＯＳ１２が動作する仮想マシン１２４を、第１の仮想マシン１２４ということとする。 For example, it is assumed that a failure has occurred in the guest OS 12 operating on a certain virtual machine 124 included in the plurality of virtual machines 124. The virtual machine 124 on which the guest OS 12 in which the failure has occurred operates is referred to as the first virtual machine 124.

複数の仮想マシン１２４の各々は、対応するゲストＯＳ１２に障害が発生した場合に、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４、換言すれば、第１の仮想マシン１２４に割り当てられた仮想バス２１に割り込みを発生させることにより、仮想マシン制御部であるホストＯＳ１１に障害の発生を通知する。 Each of the plurality of virtual machines 124, when a failure occurs in the corresponding guest OS 12, the virtual machine 124 in which the failed guest OS 12 operates, in other words, the virtual bus allocated to the first virtual machine 124. By causing 21 to generate an interrupt, the occurrence of a failure is notified to the host OS 11 which is the virtual machine control unit.

ホストＯＳ１１は、複数の仮想マシン１２４に含まれる第１の仮想マシン１２４で動作するゲストＯＳ１２より、第１の仮想マシン１２４に対応する第１の仮想バス２１に、ゲストＯＳ１２の障害に対応した割り込みがあった場合に、第１対応情報および第２対応情報に基づき第１の仮想マシン１２４を特定し、特定した第１の仮想マシン１２４上で動作するゲストＯＳ１２が使用するメモリ１２３の内容を保存する。第１対応情報は、後述するように、複数の仮想バスを識別する複数の仮想バス識別情報と、複数の仮想マシンにそれぞれ対応した仮想マシン制御部のプロセスを識別する複数のプロセス識別情報との対応関係である。第２対応情報は、後述するように、複数のプロセス識別情報と、複数の仮想マシンを識別する複数の仮想マシン識別情報との対応関係である。 The host OS 11 receives an interrupt corresponding to the failure of the guest OS 12 from the guest OS 12 operating in the first virtual machine 124 included in the plurality of virtual machines 124 to the first virtual bus 21 corresponding to the first virtual machine 124. If there is, the first virtual machine 124 is identified based on the first correspondence information and the second correspondence information, and the contents of the memory 123 used by the guest OS 12 operating on the identified first virtual machine 124 are saved. To do. As will be described later, the first correspondence information includes a plurality of virtual bus identification information for identifying a plurality of virtual buses and a plurality of process identification information for identifying processes of the virtual machine control unit respectively corresponding to the plurality of virtual machines. It is correspondence. As will be described later, the second correspondence information is a correspondence relationship between a plurality of process identification information and a plurality of virtual machine identification information for identifying a plurality of virtual machines.

実際には、ホストＯＳ１１において、ダンプ採取ツール１１１が、メモリダンプの保存手段であり、ゲストＯＳ１２のクラッシュ時に、ゲストＯＳ１２のクラッシュ時におけるゲストＯＳ１２が使用するメモリ１２３の内容の保存を行う。メモリダンプは、保存されたメモリ１２３の内容を、予め定められたフォーマットに変換したものである。保存されたメモリ１２３の内容は、メモリダンプに変換可能であり、ダンプ採取ツール１１１によりメモリダンプに変換される。従って、メモリダンプ処理は、情報処理システムのホストＯＳ１１により実行される。 In practice, in the host OS 11, the dump collection tool 111 is a memory dump storage unit that stores the contents of the memory 123 used by the guest OS 12 when the guest OS 12 crashes. The memory dump is obtained by converting the stored contents of the memory 123 into a predetermined format. The stored contents of the memory 123 can be converted into a memory dump and converted into a memory dump by the dump collection tool 111. Accordingly, the memory dump process is executed by the host OS 11 of the information processing system.

例えば、ダンプ採取ツール１１１は、ホストＯＳ１１に設けられる。ダンプ採取ツール１１１は、通知手段であるゲストドライバ１２１から障害の発生を通知された場合に、割り込みが発生した仮想バス２１の仮想バス識別情報、例えば仮想化ＩＤに基づいて、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を求める。複数の仮想マシン１２４の各々に対応するプロセスは、仮想マシン制御部であるホストＯＳ１１のプロセスである。 For example, the dump collection tool 111 is provided in the host OS 11. When the dump collection tool 111 is notified of the occurrence of a failure from the guest driver 121 as a notification means, the guest having a failure is generated based on the virtual bus identification information of the virtual bus 21 in which the interrupt has occurred, for example, the virtualization ID. The process identification information of the process corresponding to the virtual machine 124 on which the OS 12 operates is obtained. A process corresponding to each of the plurality of virtual machines 124 is a process of the host OS 11 which is a virtual machine control unit.

更に、ダンプ採取ツール１１１は、求めたプロセス識別情報に基づいて、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４、具体的には、仮想マシン１２４の仮想マシン名を求める。更に、ダンプ採取ツール１１１は、求めた障害が発生したゲストＯＳ１２が動作する仮想マシン１２４のメモリ１２３の内容を、記憶装置に保存する。換言すれば、求めた障害が発生したゲストＯＳが使用するメモリ１２３の内容が、記憶装置に採取される。記憶装置は、例えば、ホストＯＳ１１がアクセス可能であり、メモリ１２３とは異なる記憶装置である。 Furthermore, the dump collection tool 111 obtains the virtual machine 124 on which the guest OS 12 in which the failure has occurred, specifically, the virtual machine name of the virtual machine 124 is obtained based on the obtained process identification information. Further, the dump collection tool 111 saves the contents of the memory 123 of the virtual machine 124 in which the guest OS 12 in which the requested failure has occurred in a storage device. In other words, the contents of the memory 123 used by the guest OS in which the requested failure has occurred are collected in the storage device. For example, the storage device can be accessed by the host OS 11 and is a storage device different from the memory 123.

具体的には、ダンプ採取ツール１１１は、ゲストドライバ１２１から障害の発生を通知された場合に、割り込みが発生した仮想バス２１の仮想化ＩＤに基づいて、第１の記憶手段である仮想化ＩＤ／プロセス一覧１１７から、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を求める。更に、ダンプ採取ツール１１１は、求めたプロセス識別情報に基づいて、第２の記憶手段である仮想マシン／プロセス一覧１１８から、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４、具体的には、仮想マシン１２４の仮想マシン名を求める。仮想化ＩＤ／プロセス一覧１１７及び仮想マシン／プロセス一覧１１８については後述する。仮想化ＩＤ／プロセス一覧１１７と仮想マシン／プロセス一覧１１８とを合せて仮想マシン一覧ということがある。 Specifically, when the dump collection tool 111 is notified of the occurrence of a failure from the guest driver 121, the virtualization ID that is the first storage unit is based on the virtualization ID of the virtual bus 21 in which the interrupt has occurred. / From the process list 117, process identification information of the process corresponding to the virtual machine 124 in which the guest OS 12 in which the failure has occurred operates is obtained. Further, the dump collection tool 111 determines, based on the obtained process identification information, from the virtual machine / process list 118 as the second storage unit, the virtual machine 124 in which the guest OS 12 in which the failure has occurred, specifically, The virtual machine name of the virtual machine 124 is obtained. The virtualization ID / process list 117 and the virtual machine / process list 118 will be described later. The virtualization ID / process list 117 and the virtual machine / process list 118 may be collectively referred to as a virtual machine list.

実際には、ダンプ採取ツール１１１において、ホストドライバ１１３は、ゲストＯＳ１２からのクラッシュの通知を検出して、ダンプ採取サービス部１１２にゲストＯＳ１２のクラッシュを通知する。ゲストＯＳ１２からのクラッシュの通知は、後述するように、仮想マシンとハードウェアとの間を論理的に結ぶ仮想バス２１への割り込みを捕捉することにより検出される。ホストドライバ１１３は、例えば、カーネルドライバであり、ホストＯＳ１１に設けられる。 Actually, in the dump collection tool 111, the host driver 113 detects a crash notification from the guest OS 12, and notifies the dump collection service unit 112 of the crash of the guest OS 12. The notification of the crash from the guest OS 12 is detected by capturing an interrupt to the virtual bus 21 that logically connects the virtual machine and the hardware, as will be described later. The host driver 113 is a kernel driver, for example, and is provided in the host OS 11.

具体的には、ホストドライバ１１３は、通知手段であるゲストドライバ１２１から障害の発生を通知された場合に、ゲストドライバ１２１からの割り込みが発生した仮想バス２１の仮想化ＩＤに基づいて、仮想化ＩＤ／プロセス一覧１１７から、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を求める。ホストドライバ１１３は、求めたプロセスをダンプ採取サービス部１１２に通知する。 Specifically, when the host driver 113 is notified of the occurrence of a failure from the guest driver 121 as a notification unit, the host driver 113 performs virtualization based on the virtualization ID of the virtual bus 21 in which the interrupt from the guest driver 121 has occurred. From the ID / process list 117, process identification information of a process corresponding to the virtual machine 124 in which the guest OS 12 in which the failure has occurred operates is obtained. The host driver 113 notifies the dump collection service unit 112 of the obtained process.

また、ダンプ採取ツール１１１において、ダンプ採取サービス部１１２は、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４を特定して、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４が使用するメモリ１２３の内容を保存する。ダンプ採取サービス部１１２は、ホストＯＳ１１が提供するサービスプログラムである。 In the dump collection tool 111, the dump collection service unit 112 identifies the virtual machine 124 in which the guest OS 12 in which the failure has occurred operates, and the memory 123 used by the virtual machine 124 in which the guest OS 12 in which the failure has occurred operates. Save the contents. The dump collection service unit 112 is a service program provided by the host OS 11.

具体的には、ダンプ採取サービス部１１２は、ホストドライバ１１３が求めたプロセス識別情報に基づいて、仮想マシン／プロセス一覧１１８から、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４、具体的には、仮想マシン１２４の仮想マシン名を求める。そして、ダンプ採取サービス部１１２は、求めた仮想マシン名を有する仮想マシン１２４のメモリ１２３、換言すれば、障害が発生したゲストＯＳ１２が使用するメモリ１２３の内容を、記憶装置に採取する。ゲストＯＳ１２が使用するメモリ１２３は、ゲストＯＳ１２が動作する仮想マシン１２４に含まれるメモリである。 Specifically, the dump collection service unit 112 determines, based on the process identification information obtained by the host driver 113, from the virtual machine / process list 118, the virtual machine 124 in which the failed guest OS 12 operates, specifically, The virtual machine name of the virtual machine 124 is obtained. Then, the dump collection service unit 112 collects the contents of the memory 123 of the virtual machine 124 having the obtained virtual machine name, in other words, the contents of the memory 123 used by the guest OS 12 in which a failure has occurred. The memory 123 used by the guest OS 12 is a memory included in the virtual machine 124 on which the guest OS 12 operates.

例えば、ダンプ採取サービス部１１２は、障害が発生したゲストＯＳ１２が使用するメモリ１２３にアクセスして、障害発生時におけるメモリ１２３の内容をコピーして、保存状態格納領域１１４に保存する。そして、ダンプ採取サービス部１１２は、保存状態格納領域１１４に保存したメモリ１２３の内容を読みだしてダンプファイルに変換し、ダンプ格納領域１１５に格納する。 For example, the dump collection service unit 112 accesses the memory 123 used by the guest OS 12 in which the failure has occurred, copies the contents of the memory 123 at the time of the failure, and saves it in the storage state storage area 114. Then, the dump collection service unit 112 reads the contents of the memory 123 stored in the storage state storage area 114, converts the contents into a dump file, and stores the dump file in the dump storage area 115.

また、ダンプ採取サービス部１１２は、仮想マシン１２４の停止手段であって、通知手段であるゲストドライバ１２１によって障害の発生を通知された場合に、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４を停止させる。また、ダンプ採取サービス部１１２は、仮想マシン１２４の再開手段であって、停止させた仮想マシン１２４を再開又は再起動させる。実際には、ダンプ採取サービス部１１２は、仮想マシン１２４の停止や再開を、ハイパーバイザ２に依頼する。ハイパーバイザ２は、ダンプ採取サービス部１１２から仮想マシン１２４の停止や再開の依頼を受けると、これに応じて、仮想マシン１２４を停止し再開する。 The dump collection service unit 112 is a means for stopping the virtual machine 124. When the guest driver 121, which is a notification means, notifies the occurrence of the failure, the dump collection service unit 112 displays the virtual machine 124 on which the guest OS 12 in which the failure has occurred operates. Stop. The dump collection service unit 112 is a restart unit for the virtual machine 124 and restarts or restarts the stopped virtual machine 124. Actually, the dump collection service unit 112 requests the hypervisor 2 to stop or restart the virtual machine 124. When receiving a request to stop or restart the virtual machine 124 from the dump collection service unit 112, the hypervisor 2 stops and restarts the virtual machine 124 accordingly.

図２の例では、ダンプ採取ツール１１１がダンプ採取サービス部１１２とホストドライバ１１３とを含むが、ダンプ採取ツール１１１を、ダンプ採取サービス部１１２及びホストドライバ１１３と別個に設けるようにしてもよい。 In the example of FIG. 2, the dump collection tool 111 includes the dump collection service unit 112 and the host driver 113. However, the dump collection tool 111 may be provided separately from the dump collection service unit 112 and the host driver 113.

例えば、ダンプ採取サービス部１１２はホストＯＳ１１のサービスプログラムであり、ホストドライバ１１３はホストＯＳ１１のカーネルドライバであり、ダンプ採取ツール１１１は、例えばホストＯＳ１１上で動作するアプリケーションとして設けられる。これにより、利用者は、ホストＯＳ１１の改変によることなく、メモリ１２３の内容を取得しメモリダンプを実行することができる。 For example, the dump collection service unit 112 is a service program of the host OS 11, the host driver 113 is a kernel driver of the host OS 11, and the dump collection tool 111 is provided as an application that operates on the host OS 11, for example. Thereby, the user can acquire the contents of the memory 123 and execute a memory dump without changing the host OS 11.

保存状態格納領域１１４は、ハードウェア３に含まれる磁気ディスク装置のような記憶装置の領域である。磁気ディスク装置のような記憶装置は、ホストＯＳ１１がアクセス可能であり、メモリ１２３とは異なる記憶装置である。保存状態格納領域１１４は、１個の領域であり、複数の仮想マシン１２４に共通である。 The storage state storage area 114 is an area of a storage device such as a magnetic disk device included in the hardware 3. A storage device such as a magnetic disk device can be accessed by the host OS 11 and is a storage device different from the memory 123. The saved state storage area 114 is one area and is common to a plurality of virtual machines 124.

ダンプ格納領域１１５は、ハードウェア３に含まれる磁気ディスク装置のような記憶装置の領域である。磁気ディスク装置のような記憶装置は、ホストＯＳ１１がアクセス可能であり、メモリ１２３とは異なる記憶装置である。ダンプ格納領域１１５は、１個の領域であり、複数の仮想マシン１２４に共通である。 The dump storage area 115 is an area of a storage device such as a magnetic disk device included in the hardware 3. A storage device such as a magnetic disk device can be accessed by the host OS 11 and is a storage device different from the memory 123. The dump storage area 115 is a single area and is common to a plurality of virtual machines 124.

図３は、情報処理システムの説明図である。図３は、ホストＯＳ１１の実行する処理と、ゲストＯＳ１２の実行する処理とを表す。具体的には、図３は、ゲストＯＳ１２のゲストドライバ１２１が実行する処理と、ホストＯＳ１１のホストドライバ１１３及びダンプ採取サービス部１１２が実行する処理とを表す。なお、図３においては、１個のゲストＯＳ１２のみを示す。 FIG. 3 is an explanatory diagram of the information processing system. FIG. 3 shows processing executed by the host OS 11 and processing executed by the guest OS 12. Specifically, FIG. 3 illustrates processing executed by the guest driver 121 of the guest OS 12 and processing executed by the host driver 113 and the dump collection service unit 112 of the host OS 11. In FIG. 3, only one guest OS 12 is shown.

ダンプ採取サービス部１１２は、ダンプ採取サービスの初期処理において、ゲスト制御処理の前処理を実行する。ゲスト制御処理の前処理は、仮想マシン１２４の仮想デバイス１２２にプロセス識別情報を書き込む処理である。書き込まれるプロセス識別情報は、指定された仮想マシン１２４に対応するホストＯＳ１１のプロセスを示す。仮想マシン１２４に対応するプロセスは、仮想マシン１２４毎に定められる。 The dump collection service unit 112 performs a pre-process of the guest control process in the initial process of the dump collection service. The pre-process for the guest control process is a process for writing process identification information in the virtual device 122 of the virtual machine 124. The written process identification information indicates the process of the host OS 11 corresponding to the designated virtual machine 124. A process corresponding to the virtual machine 124 is determined for each virtual machine 124.

仮想マシン管理サービス部１１６は、複数の仮想マシン１２４を管理するＶＭＭ（ＶｉｒｔｕａｌＭａｃｈｉｎｅＭｏｎｉｔｏｒ）であり、仮想マシン１２４の開始、停止、削除、再開等の処理を行う。仮想マシン管理サービス部１１６は、ダンプ採取サービス部１１２からプロセス識別情報の書き込みを依頼されると、指定された仮想マシン１２４の仮想デバイス１２２へ、プロセス識別情報を書き込む。 The virtual machine management service unit 116 is a VMM (Virtual Machine Monitor) that manages a plurality of virtual machines 124, and performs processes such as starting, stopping, deleting, and restarting the virtual machine 124. When requested by the dump collection service unit 112 to write process identification information, the virtual machine management service unit 116 writes the process identification information to the virtual device 122 of the designated virtual machine 124.

例えば、ダンプ採取サービス部１１２は、情報処理システムの仮想マシン管理サービス部１１６を呼び出す。呼び出された仮想マシン管理サービス部１１６は、例えば仮想化されたＤＶＤ装置のような仮想デバイス１２２を用意して、ホストマシンに接続する。ホストマシン上で動作するホストＯＳ１１のダンプ採取サービス部１１２は、用意された仮想ＤＶＤ装置内の書込み可能なＤＶＤに、プロセス識別情報を書き込む。この後、仮想マシン管理サービス部１１６は、仮想ＤＶＤ装置をホストマシンから切断して、ゲストマシンに接続する。接続されるゲストマシンは、そのゲストマシンに対応するホストＯＳ１１のプロセスであって、書き込まれるプロセス識別情報を有するプロセスに対応する。 For example, the dump collection service unit 112 calls the virtual machine management service unit 116 of the information processing system. The called virtual machine management service unit 116 prepares a virtual device 122 such as a virtualized DVD device and connects it to the host machine. The dump collection service unit 112 of the host OS 11 running on the host machine writes process identification information to a writable DVD in the prepared virtual DVD device. Thereafter, the virtual machine management service unit 116 disconnects the virtual DVD device from the host machine and connects it to the guest machine. The guest machine to be connected corresponds to a process of the host OS 11 corresponding to the guest machine and having process identification information to be written.

ゲストＯＳ１２のゲストドライバ１２１は、ゲストドライバ１２１の初期処理として、プロセス識別取得処理を実行する。プロセス識別取得処理は、仮想デバイス１２２からプロセス識別情報を取得する処理である。これにより、仮想マシン１２４に対応するプロセスを外部から識別する情報が取得される。 The guest driver 121 of the guest OS 12 executes a process identification acquisition process as an initial process of the guest driver 121. The process identification acquisition process is a process for acquiring process identification information from the virtual device 122. Thereby, information for identifying the process corresponding to the virtual machine 124 from the outside is acquired.

初期処理の後、ゲストドライバ１２１は、クラッシュ検知処理を開始、換言すれば、ゲストＯＳ１２のパニック出口として登録される。クラッシュ検知処理は、クラッシュ発生時に、ゲストＯＳ１２から呼び出される処理である。この呼び出しのために、ゲストドライバ１２１は、ゲストＯＳ１２のパニック出口として登録される。クラッシュの検知は、実際には、ゲストＯＳ１２自体が種々の手段により実行する。従って、ゲストＯＳ１２は、自己のクラッシュを検知すると、自己のパニック出口として登録されたゲストドライバ１２１を呼び出す。 After the initial process, the guest driver 121 starts the crash detection process, in other words, is registered as a panic exit of the guest OS 12. The crash detection process is a process called from the guest OS 12 when a crash occurs. For this call, the guest driver 121 is registered as a panic exit of the guest OS 12. In practice, the guest OS 12 itself performs the crash detection by various means. Therefore, when the guest OS 12 detects its own crash, the guest OS 12 calls the guest driver 121 registered as its own panic exit.

また、ゲストドライバ１２１は、クラッシュ通知処理を実行する。クラッシュ通知処理は、クラッシュの発生時に、換言すれば、ゲストＯＳ１２から呼び出された場合に、ゲストＯＳ１２が動作する仮想マシン１２４と、仮想マシン制御部であるホストＯＳ１１との間を論理的に結ぶ仮想バス２１に割り込みを発生する処理である。クラッシュ通知処理は、例えば、クラッシュ検知処理に続いて実行される。 Further, the guest driver 121 executes a crash notification process. The crash notification process is a virtual link that logically connects between the virtual machine 124 in which the guest OS 12 operates and the host OS 11 as the virtual machine control unit when a crash occurs, in other words, when called from the guest OS 12. This is a process for generating an interrupt on the bus 21. The crash notification process is executed following the crash detection process, for example.

仮想バス２１への割り込みは、ゲストドライバ１２１が動作する仮想マシン１２４に割り当てられた仮想バス２１に発生する。仮想バス２１は、例えば、ハイパーバイザ２により、仮想マシン１２４の開始時に、仮想マシン１２４毎に割り当てられる。従って、ゲストＯＳ１２のクラッシュは、個々のゲストＯＳ１２が動作している仮想マシン１２４に割り当てられた仮想バス２１により、ホストドライバ１１３に通知される。 The interruption to the virtual bus 21 occurs in the virtual bus 21 assigned to the virtual machine 124 on which the guest driver 121 operates. The virtual bus 21 is assigned to each virtual machine 124 by the hypervisor 2 when the virtual machine 124 is started, for example. Accordingly, the crash of the guest OS 12 is notified to the host driver 113 through the virtual bus 21 assigned to the virtual machine 124 in which each guest OS 12 is operating.

また、ゲストドライバ１２１は、ダンプ出力抑止処理を実行する。ダンプ抑止処理は、クラッシュ発生時に、換言すれば、ゲストＯＳ１２から呼び出された場合に、ゲストＯＳ１２によるメモリ１２３のダンプ採取を抑止する処理である。ダンプ出力抑止処理は、例えば、クラッシュ通知処理に続いて又はクラッシュ通知処理と並行して実行される。 In addition, the guest driver 121 executes dump output suppression processing. The dump suppression process is a process of suppressing dumping of the memory 123 by the guest OS 12 when a crash occurs, in other words, when called from the guest OS 12. The dump output suppression process is executed, for example, following the crash notification process or in parallel with the crash notification process.

ゲストＯＳ１２は、通常、ゲストＯＳ１２のメモリ１２３のメモリダンプを採取する機能であるダンプ採取機能を含むので、ゲストＯＳ１２のクラッシュ時に、メモリダンプが採取されてしまう。そこで、ゲストドライバ１２１は、クラッシュ発生時に、ゲストＯＳ１２によるメモリ１２３のダンプ採取を抑止する。 Since the guest OS 12 usually includes a dump collection function that is a function of collecting the memory dump of the memory 123 of the guest OS 12, a memory dump is collected when the guest OS 12 crashes. Therefore, the guest driver 121 suppresses dumping of the memory 123 by the guest OS 12 when a crash occurs.

一方、ホストＯＳ１１のホストドライバ１１３は、割り込み捕捉処理を実行する。割り込み捕捉処理は、ゲストドライバ１２１による仮想バス２１への割り込みを捕捉する処理である。換言すれば、割り込み捕捉処理は、ゲストドライバ１２１からのクラッシュ通知を受信する受信処理である。ゲストドライバ１２１からのクラッシュ通知のための割り込みの優先度は最優先である。そこで、ホストドライバ１１３は、割り込み捕捉処理を繰り返し実行する。割り込み捕捉処理は、例えば、最初のゲストＯＳ１２の開始の後に実行される。 On the other hand, the host driver 113 of the host OS 11 executes an interrupt capturing process. The interrupt capturing process is a process for capturing an interrupt to the virtual bus 21 by the guest driver 121. In other words, the interrupt capturing process is a receiving process for receiving a crash notification from the guest driver 121. The priority of the interrupt for the crash notification from the guest driver 121 is the highest priority. Therefore, the host driver 113 repeatedly executes the interrupt capturing process. The interrupt capturing process is executed after the start of the first guest OS 12, for example.

また、ホストドライバ１１３は、クラッシュ判定処理を開始、換言すれば、ホストＯＳ１１の予め定められた位置に登録する。クラッシュ判定処理は、クラッシュ判定時に、ホストＯＳ１１から呼び出される処理である。この呼び出しのために、ホストドライバ１１３は、ホストＯＳ１１の予め定められた位置に登録される。クラッシュ判定処理は、仮想バス２１への全ての割り込みを調べて、ゲストＯＳ１２のクラッシュを通知する割り込みと、それ以外の割り込みとを判別する処理である。クラッシュ判定処理は、例えば、割り込み捕捉処理に続いて実行される。 Further, the host driver 113 starts the crash determination process, in other words, registers at a predetermined position of the host OS 11. The crash determination process is a process called from the host OS 11 when a crash is determined. For this call, the host driver 113 is registered at a predetermined position of the host OS 11. The crash determination process is a process for checking all interrupts to the virtual bus 21 and determining an interrupt notifying the guest OS 12 of a crash and other interrupts. The crash determination process is executed, for example, following the interrupt capturing process.

また、ホストドライバ１１３は、プロセス識別情報解決処理を実行する。プロセス識別情報解決処理は、クラッシュが発生したゲストＯＳ１２がどの仮想マシン１２４に対応するプロセスで動作しているかを解決する処理、換言すれば、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を求める処理である。プロセス識別情報解決処理は、例えば、割り込みがゲストＯＳ１２のクラッシュを通知する割り込みであると判別された場合に、クラッシュ判定処理に続いて実行される。 In addition, the host driver 113 executes process identification information resolution processing. The process identification information resolution process is a process for resolving the process corresponding to which virtual machine 124 the guest OS 12 in which the crash has occurred, in other words, corresponding to the virtual machine 124 in which the guest OS 12 in which the crash has occurred is operating. This is a process for obtaining process identification information of a process to be performed. The process identification information resolution process is executed following the crash determination process, for example, when it is determined that the interrupt is an interrupt for notifying the guest OS 12 of a crash.

また、ホストドライバ１１３は、クラッシュ発生通知処理を実行する。クラッシュ発生通知処理は、ゲストＯＳ１２にクラッシュが発生したことを、ダンプ採取サービス部１１２に通知する処理である。クラッシュ発生通知処理は、例えば、割り込みがゲストＯＳ１２のクラッシュを通知する割り込みであると判定された場合に、プロセス識別情報解決処理に続いて実行される。クラッシュ発生通知処理において、プロセス識別情報解決処理により求められた、クラッシュが発生したゲストＯＳ１２が動作するプロセスのプロセス識別情報が、ダンプ採取サービス部１１２に通知される。 In addition, the host driver 113 executes a crash occurrence notification process. The crash occurrence notification process is a process for notifying the dump collection service unit 112 that a crash has occurred in the guest OS 12. The crash occurrence notification process is executed following the process identification information resolution process, for example, when it is determined that the interrupt is an interrupt for notifying the guest OS 12 of a crash. In the crash occurrence notification process, the process identification information of the process in which the guest OS 12 in which the crash has occurred, obtained by the process identification information resolution process, is notified to the dump collection service unit 112.

一方、ホストＯＳ１１のダンプ採取サービス部１１２は、メモリ内容保存処理、換言すれば、メモリ状態保存処理を実行する。メモリ状態保存処理は、クラッシュが発生したゲストＯＳ１２が使用するメモリ１２３の内容を保存する処理である。ゲストＯＳ１２が使用するメモリ１２３は、ゲストＯＳ１２が動作する仮想マシン１２４に含まれる仮想化されたメモリ、換言すれば、仮想メモリである。メモリ状態保存処理は、例えば、ダンプ採取サービス部１１２がゲストＯＳ１２にクラッシュが発生した通知をホストドライバ１１３から受け取った場合に実行される。 On the other hand, the dump collection service unit 112 of the host OS 11 executes a memory content saving process, in other words, a memory state saving process. The memory state saving process is a process for saving the contents of the memory 123 used by the guest OS 12 in which the crash has occurred. The memory 123 used by the guest OS 12 is a virtualized memory included in the virtual machine 124 on which the guest OS 12 operates, in other words, a virtual memory. The memory state saving process is executed, for example, when the dump collection service unit 112 receives a notification from the host driver 113 that the guest OS 12 has crashed.

メモリ状態保存処理により、クラッシュが発生したゲストＯＳ１２が使用するメモリ１２３の内容がコピーされて、予め定められた記憶領域、換言すれば、保存状態格納領域１１４に格納される。保存状態格納領域１１４は、例えば、ダンプ採取ツール１１１により予め定められる。例えば、利用者は、ホストＯＳ１１上で動作するダンプ採取ツール１１１に端末からアクセスして、保存状態格納領域１１４を予め定めることができる。 Through the memory state saving process, the contents of the memory 123 used by the guest OS 12 in which the crash has occurred are copied and stored in a predetermined storage area, in other words, in the saved state storage area 114. The storage state storage area 114 is determined in advance by the dump collection tool 111, for example. For example, the user can access the dump collection tool 111 operating on the host OS 11 from the terminal and determine the storage state storage area 114 in advance.

メモリ状態保存処理は、メモリ１２３の内容をコピーして保存状態格納領域１１４に格納するのみであるので、ダンプファイルの作成と比べて短時間で処理を終了することができる。また、メモリ状態保存処理が終了すると、ダンプファイルの作成を待たずに、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４を停止し、再開することができる。 Since the memory state saving process only copies the contents of the memory 123 and stores it in the saving state storage area 114, the process can be completed in a shorter time than the creation of the dump file. When the memory state saving process is completed, the virtual machine 124 on which the guest OS 12 in which the crash occurred can be stopped and restarted without waiting for the creation of the dump file.

メモリ内容保存処理の後、ダンプ採取サービス部１１２は、メモリ内容保存処理の後処理として、ダンプ変換処理を実行する。ダンプ変換処理は、保存状態格納領域１１４に格納したメモリ１２３の内容を、予め定められたフォーマットのダンプファイルに変換して保存する処理である。ダンプ変換処理は、メモリ状態保存処理の実行の後において、ゲスト制御処理の実行の後に又はゲスト制御処理の実行と並行して、ゲスト制御処理の実行の障害とならないタイミングで実行される。 After the memory content saving process, the dump collection service unit 112 executes a dump conversion process as a post process of the memory content saving process. The dump conversion process is a process of converting the contents of the memory 123 stored in the storage state storage area 114 into a dump file having a predetermined format and storing it. The dump conversion process is executed after the execution of the memory state saving process, at a timing that does not cause an obstacle to the execution of the guest control process after the guest control process or in parallel with the guest control process.

ダンプ変換処理により、保存状態格納領域１１４に格納したメモリ１２３の内容がダンプファイルに変換されて、予め定められた記憶領域、換言すれば、ダンプ格納領域１１５に格納される。ダンプ格納領域１１５は、例えば、ダンプ採取ツール１１１により予め定められる。例えば、利用者は、ホストＯＳ１１上で動作するダンプ採取ツール１１１に端末からアクセスして、ダンプ格納領域１１５を予め定めることができる。 By the dump conversion process, the contents of the memory 123 stored in the saved state storage area 114 are converted into a dump file and stored in a predetermined storage area, in other words, the dump storage area 115. The dump storage area 115 is predetermined by the dump collection tool 111, for example. For example, the user can access the dump collection tool 111 operating on the host OS 11 from the terminal and determine the dump storage area 115 in advance.

ダンプ変換処理は、例えば、ゲスト制御処理の実行の障害とならないタイミングで実行される。換言すれば、ゲスト制御処理がダンプ変換処理に優先して実行される。これにより、メモリ状態保存処理の終了の後に、ゲスト制御処理を実行することができる。 The dump conversion process is executed, for example, at a timing that does not hinder the execution of the guest control process. In other words, the guest control process is executed with priority over the dump conversion process. As a result, the guest control process can be executed after the end of the memory state saving process.

また、ダンプ採取サービス部１１２は、ゲスト制御処理を実行する。ゲスト制御処理は、クラッシュが発生したゲストＯＳ１２を実行中の仮想マシン１２４の状態を制御する処理である。例えば、ダンプ採取サービス部１１２は、情報処理システムの仮想マシン管理サービス部１１６を呼び出し、呼び出した仮想マシン管理サービス部１１６に、指定したゲストＯＳ１２が使用する仮想デバイス１２２の制御を依頼する。従って、ゲスト制御処理は、実際には、仮想マシン管理サービス部１１６により実行される。 Further, the dump collection service unit 112 executes guest control processing. The guest control process is a process for controlling the state of the virtual machine 124 that is executing the guest OS 12 in which the crash has occurred. For example, the dump collection service unit 112 calls the virtual machine management service unit 116 of the information processing system and requests the called virtual machine management service unit 116 to control the virtual device 122 used by the specified guest OS 12. Accordingly, the guest control process is actually executed by the virtual machine management service unit 116.

ゲスト制御処理は、ダンプ採取サービス部１１２がゲストＯＳ１２にクラッシュが発生した通知を受け取った場合に、メモリ状態保存処理に続いて、ダンプ変換処理に優先して実行される。ゲスト制御処理により、ゲストＯＳ１２が動作する仮想マシン１２４の状態は、そのままの状態とされるか、停止や再開の状態とされる。 The guest control process is executed prior to the dump conversion process following the memory state saving process when the dump collection service unit 112 receives a notification that a crash has occurred in the guest OS 12. By the guest control process, the state of the virtual machine 124 on which the guest OS 12 operates is left as it is, or is stopped or restarted.

なお、図３において説明した処理は、各々、ゲストドライバ１２１、ホストドライバ１１３又はダンプ採取サービス部１１２のルーチン又はサブルーチンとして実現するようにしてもよい。 3 may be realized as a routine or a subroutine of the guest driver 121, the host driver 113, or the dump collection service unit 112, respectively.

図４は、情報処理システムの説明図である。 FIG. 4 is an explanatory diagram of the information processing system.

図４において、例えば、仮想マシン１２４Ｂ、換言すれば、仮想マシンＢは、プロセスＹとして実行され、仮想マシン１２４Ｂ上でゲストＯＳ１２Ｂが動作する。仮想マシン１２４Ｂには、仮想ＣＰＵ１２５Ｂ、換言すれば、仮想ＣＰＵ−１が割り当てられている。仮想マシン１２４Ｂに対応するゲストＯＳ１２及びゲストドライバ１２１を、各々、ゲストＯＳ１２Ｂ及びゲストドライバ１２１Ｂと表す。仮想マシンＡ及び仮想マシンＣについても同様である。図４においては、仮想マシンＢにおいて、クラッシュが発生するものとする。 In FIG. 4, for example, the virtual machine 124B, in other words, the virtual machine B is executed as the process Y, and the guest OS 12B operates on the virtual machine 124B. A virtual CPU 125B, in other words, virtual CPU-1 is assigned to the virtual machine 124B. The guest OS 12 and the guest driver 121 corresponding to the virtual machine 124B are represented as a guest OS 12B and a guest driver 121B, respectively. The same applies to virtual machine A and virtual machine C. In FIG. 4, it is assumed that a crash occurs in the virtual machine B.

前述したように、ホストＯＳ１１が複数のゲストＯＳ１２についてのメモリダンプを実行するためには、ホストＯＳ１１が、クラッシュしたゲストＯＳ１２がどの仮想マシン１２４で動作中であるのかを知る必要がある。 As described above, in order for the host OS 11 to perform a memory dump for a plurality of guest OSs 12, the host OS 11 needs to know which virtual machine 124 the crashed guest OS 12 is operating on.

ここで、情報処理システムにおいて、仮想マシン１２４は情報処理システム又はホストＯＳ１１におけるプロセスとして存在する。このため、仮想マシン１２４のメモリ１２３の内容は、仮想マシン１２４の外部からも取得可能である。従って、クラッシュが発生した際のメモリ１２３の内容も仮想マシン１２４の外部から採取でき、メモリダンプを生成することができる。 Here, in the information processing system, the virtual machine 124 exists as a process in the information processing system or the host OS 11. Therefore, the contents of the memory 123 of the virtual machine 124 can be acquired from outside the virtual machine 124. Therefore, the contents of the memory 123 when a crash occurs can be collected from outside the virtual machine 124, and a memory dump can be generated.

しかし、ゲストＯＳ１２から見える仮想マシン１２４は１個だけであり、一方、情報処理システム上には複数の仮想マシン１２４が存在する。例えば、ホストＯＳ１１から見た場合に、１個の仮想マシン１２４は１つのプロセスであり、その上でゲストＯＳ１２が動作する。情報処理システムから見た仮想マシン１２４は、情報処理システムが管理する１セットの仮想ハードウェア（ＣＯＰ、メモリ、ストレージ、ネットワークデバイス等）の集合である。そして、情報処理システムにおいて、仮想ハードウェアは、仮想マシン１２４上のホストＯＳ１１及びゲストＯＳ１２と厳密に分離されている。このため、例えば、ゲストＯＳ１２は、そのゲストＯＳ１２自身がどの仮想マシン１２４上で動作するかを知ることはできない。 However, only one virtual machine 124 is visible from the guest OS 12, while there are a plurality of virtual machines 124 on the information processing system. For example, when viewed from the host OS 11, one virtual machine 124 is one process on which the guest OS 12 operates. The virtual machine 124 viewed from the information processing system is a set of a set of virtual hardware (COP, memory, storage, network device, etc.) managed by the information processing system. In the information processing system, the virtual hardware is strictly separated from the host OS 11 and the guest OS 12 on the virtual machine 124. For this reason, for example, the guest OS 12 cannot know on which virtual machine 124 the guest OS 12 itself operates.

以上から、ゲストＯＳ１２のクラッシュの発生を通知する場合、ゲストＯＳ１２の動作する仮想マシン１２４を情報処理システム上で識別可能な情報を、合わせて通知することができる必要がある。 From the above, when notifying the occurrence of a crash of the guest OS 12, it is necessary to be able to notify information that can identify the virtual machine 124 on which the guest OS 12 operates on the information processing system.

そこで、ダンプ採取サービス部１１２、ホストドライバ１１３、ゲストドライバ１２１が設けられる。ダンプ採取サービス部１１２は、仮想マシン１２４のプロセスを識別する情報、換言すれば、プロセス識別情報を持つ。ホストドライバ１１３は、仮想化システムの仮想バス２１上で仮想ハードウェアを識別する情報、換言すれば、仮想化ＩＤを持つ。ゲストドライバ１２１は、それ自体は、プロセス識別情報及び仮想化ＩＤを持たない。 Therefore, a dump collection service unit 112, a host driver 113, and a guest driver 121 are provided. The dump collection service unit 112 has information for identifying the process of the virtual machine 124, in other words, process identification information. The host driver 113 has information for identifying virtual hardware on the virtual bus 21 of the virtualization system, in other words, a virtualization ID. The guest driver 121 itself does not have process identification information and virtualization ID.

そして、図４に示すように、例えば、ホストドライバ１１３に、仮想化ＩＤ／プロセス一覧１１７が設けられる。仮想化ＩＤ／プロセス一覧１１７は、第１の記憶手段であり、ホストＯＳ１１に設けられ、複数の仮想バス識別情報と複数のプロセス識別情報との対応関係である第１対応情報を記憶する。仮想バス識別情報は、複数の仮想マシン１２４の各々に割り当てられた複数の仮想バス２１を識別する識別情報であり、一意に定まる。プロセス識別情報は、複数の仮想マシン１２４の各々に対応するホストＯＳ１１のプロセスを識別する識別情報であり、一意に定まる。従って、プロセス識別情報は、複数の仮想マシン１２４にそれぞれ対応した仮想マシン制御部のプロセスを識別する。仮想バス識別情報は、例えば、仮想化ＩＤであり、ハイパーバイザ２において複数の仮想バス２１を識別する情報である。 As shown in FIG. 4, for example, the virtualization ID / process list 117 is provided in the host driver 113. The virtualization ID / process list 117 is a first storage unit and is provided in the host OS 11 and stores first correspondence information that is a correspondence relationship between a plurality of virtual bus identification information and a plurality of process identification information. The virtual bus identification information is identification information for identifying the plurality of virtual buses 21 assigned to each of the plurality of virtual machines 124, and is uniquely determined. The process identification information is identification information for identifying a process of the host OS 11 corresponding to each of the plurality of virtual machines 124, and is uniquely determined. Therefore, the process identification information identifies the process of the virtual machine control unit corresponding to each of the plurality of virtual machines 124. The virtual bus identification information is, for example, a virtualization ID, and is information for identifying a plurality of virtual buses 21 in the hypervisor 2.

また、図４に示すように、例えば、ダンプ採取サービス部１１２に、仮想マシン／プロセス一覧１１８が設けられる。仮想マシン／プロセス一覧１１８は、第２の記憶手段であり、ホストＯＳ１１に設けられ、複数のプロセス識別情報と複数のマシン識別情報との対応関係である第２対応情報を記憶する。マシン識別情報は、複数の仮想マシン１２４を識別する識別情報である。マシン識別情報は、例えば、仮想マシン１２４の仮想マシン名であり、一意に定まる。 As shown in FIG. 4, for example, a virtual machine / process list 118 is provided in the dump collection service unit 112. The virtual machine / process list 118 is a second storage unit, is provided in the host OS 11, and stores second correspondence information that is a correspondence relationship between a plurality of process identification information and a plurality of machine identification information. The machine identification information is identification information for identifying a plurality of virtual machines 124. The machine identification information is, for example, the virtual machine name of the virtual machine 124 and is uniquely determined.

最初に、ホストＯＳ１１が複数のゲストＯＳ１２についてのメモリダンプを実行するために、以下のような前処理が実行される。 First, in order for the host OS 11 to execute a memory dump for a plurality of guest OSs 12, the following pre-processing is executed.

ホストＯＳ１１のダンプ採取サービス部１１２は、仮想マシン１２４に対応するプロセスのプロセス識別情報を、複数の仮想マシン１２４の各々に送る。複数の仮想マシン１２４の各々は、ホストＯＳ１１から受け取ったプロセス識別情報を、仮想デバイス１２２に格納する。 The dump collection service unit 112 of the host OS 11 sends process identification information of a process corresponding to the virtual machine 124 to each of the plurality of virtual machines 124. Each of the plurality of virtual machines 124 stores the process identification information received from the host OS 11 in the virtual device 122.

複数の仮想マシン１２４の各々のゲストドライバ１２１は、仮想デバイス１２２に格納されたプロセス識別情報を、ホストＯＳ１１のホストドライバ１１３に送る。 Each guest driver 121 of the plurality of virtual machines 124 sends the process identification information stored in the virtual device 122 to the host driver 113 of the host OS 11.

ホストＯＳ１１のホストドライバ１１３は、複数の仮想マシン１２４の各々から受け取ったプロセス識別情報を用いて、仮想化ＩＤ／プロセス一覧１１７に、複数の仮想マシン１２４の各々に割り当てられた複数の仮想バス２１を識別する仮想バス識別情報、換言すれば、仮想化ＩＤと、複数の仮想マシン１２４であるプロセスを識別するプロセス識別情報との対応関係である第１対応情報を記憶する。更に、ホストＯＳ１１のダンプ採取サービス部１１２は、仮想マシン／プロセス一覧１１８に、複数の仮想マシン１２４であるプロセスを識別するプロセス識別情報と、複数の仮想マシン１２４の各々の識別情報、換言すれば、仮想マシン名との対応関係である第１対応情報を記憶する。 The host driver 113 of the host OS 11 uses the process identification information received from each of the plurality of virtual machines 124 to store the plurality of virtual buses 21 assigned to each of the plurality of virtual machines 124 in the virtualization ID / process list 117. In other words, the first correspondence information that is a correspondence relationship between the virtualization ID and the process identification information that identifies the processes that are the plurality of virtual machines 124 is stored. Furthermore, the dump collection service unit 112 of the host OS 11 includes, in the virtual machine / process list 118, process identification information for identifying processes that are a plurality of virtual machines 124, and identification information for each of the plurality of virtual machines 124, in other words, First correspondence information that is a correspondence relationship with the virtual machine name is stored.

次に、情報処理システムの運用において、ホストＯＳ１１が、複数のゲストＯＳ１２についてのメモリダンプが、以下のように実行される。 Next, in the operation of the information processing system, the host OS 11 performs a memory dump for the plurality of guest OSs 12 as follows.

ゲストドライバ１２１は、ゲストＯＳ１２のクラッシュ発生を通知するため、予めそのゲストＯＳ１２に割り当てられた仮想化ＩＤの仮想バス２１に、割り込みを発生させる。換言すれば、ゲストドライバ１２１は、障害が発生したゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１に割り込みを発生することにより、ホストＯＳ１１に障害の発生を通知する。 The guest driver 121 generates an interrupt to the virtual bus 21 of the virtualization ID assigned in advance to the guest OS 12 in order to notify the occurrence of the crash of the guest OS 12. In other words, the guest driver 121 notifies the host OS 11 of the occurrence of a failure by generating an interrupt on the virtual bus 21 assigned to the virtual machine 124 in which the guest OS 12 in which the failure has occurred operates.

具体的には、ゲストドライバ１２１は、自己が動作している仮想マシン１２４の仮想ＣＰＵ１２５に、実行不可能な命令を実行させる。これにより、ゲストドライバ１２１が動作している仮想マシン１２４に割り当てられた仮想バス２１に割り込みが発生する。 Specifically, the guest driver 121 causes the virtual CPU 125 of the virtual machine 124 in which the guest driver 121 is operating to execute an instruction that cannot be executed. As a result, an interrupt occurs in the virtual bus 21 assigned to the virtual machine 124 in which the guest driver 121 is operating.

ホストドライバ１１３は、割り込みの発生した仮想バス２１から、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１の仮想化ＩＤを判定する。換言すれば、ホストドライバ１１３は、ゲストドライバ１２１により仮想バス２１に発生させられた割り込みを捕捉することにより、ゲストドライバ１２１から障害の発生を通知される。 The host driver 113 determines the virtualization ID of the virtual bus 21 assigned to the virtual machine 124 in which the guest OS 12 in which the crash occurred operates from the interrupted virtual bus 21. In other words, the host driver 113 is notified of the occurrence of a failure from the guest driver 121 by capturing an interrupt generated on the virtual bus 21 by the guest driver 121.

ホストドライバ１１３は、判定した仮想化ＩＤに基づいて仮想化ＩＤ／プロセス一覧１１７を参照して、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１の仮想化ＩＤに基づいて、仮想マシン１２４に対応するプロセスのプロセス識別情報を求める。 The host driver 113 refers to the virtualization ID / process list 117 based on the determined virtualization ID, and based on the virtualization ID of the virtual bus 21 assigned to the virtual machine 124 on which the guest OS 12 in which the crash occurred operates. Thus, the process identification information of the process corresponding to the virtual machine 124 is obtained.

ダンプ採取サービス部１１２は、求めたプロセス識別情報に基づいて仮想マシン／プロセス一覧１１８を参照して、メモリダンプを採取する対象の仮想マシン１２４を識別する。これにより、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４の使用するメモリ１２３の内容を保存状態格納領域１１４に保存し、ダンプファイルの形式に変換して、ダンプ格納領域１１５に格納することができる。 The dump collection service unit 112 refers to the virtual machine / process list 118 based on the obtained process identification information, and identifies the virtual machine 124 that is the target of collecting the memory dump. As a result, the contents of the memory 123 used by the virtual machine 124 on which the guest OS 12 in which the crash has occurred can be stored in the storage state storage area 114, converted into a dump file format, and stored in the dump storage area 115. it can.

以下、ホストＯＳ１１が複数のゲストＯＳ１２についてのメモリダンプを実行するための処理について説明する。 Hereinafter, processing for the host OS 11 to execute a memory dump for a plurality of guest OSs 12 will be described.

ホストドライバ１１３は、ホストＯＳ１１のカーネルドライバであるので仮想バス２１への割り込みにおける仮想化ＩＤを知ることができるが、プロセス識別情報は知ることができない。仮想化ＩＤを知ることができるのは、ホストドライバ１１３のみである。ダンプ採取サービス部１１２は、プロセス識別情報は知ることができるが、仮想マシン１２４ではないので仮想化ＩＤにアクセスすることはできない。仮想マシン１２４は、仮想マシン１２４を表すプロセス、換言すれば、仮想マシン１２４に対応するプロセスとは区別されるので、プロセス識別情報は知ることができない。ゲストＯＳ１２は、プロセス識別情報も仮想化ＩＤも知ることはできない。 Since the host driver 113 is a kernel driver of the host OS 11, it can know the virtualization ID in the interrupt to the virtual bus 21, but cannot know the process identification information. Only the host driver 113 can know the virtualization ID. The dump collection service unit 112 can know the process identification information, but cannot access the virtualization ID because it is not the virtual machine 124. Since the virtual machine 124 is distinguished from a process representing the virtual machine 124, in other words, a process corresponding to the virtual machine 124, the process identification information cannot be known. The guest OS 12 cannot know the process identification information and the virtualization ID.

そこで、ダンプ採取サービス部１１２が、プロセス識別情報を、例えば、仮想マシン１２４の生成時に、仮想マシン１２４の仮想デバイス１２２に書き込む。仮想マシン１２４上で動作するゲストＯＳ１２のゲストドライバ１２１は、仮想デバイス１２２のプロセス識別情報を、仮想デバイス１２２から読み出してホストドライバ１１３に渡す。これに応じて、ホストドライバ１１３は、仮想化ＩＤ／プロセス一覧１１７を作成する。また、ホストドライバ１１３は、プロセス識別情報を、ダンプ採取サービス部１１２に渡す。これに応じて、ダンプ採取サービス部１１２は、仮想マシン／プロセス一覧１１８を作成する。この結果、仮想マシン１２４に１対１に割り当てられかつ一意に定まる仮想化ＩＤを有する仮想バス２１への割り込みが発生すると、ホストドライバ１１３が対応するプロセス識別情報を求めることができ、ダンプ採取サービス部１１２が対応する仮想マシン名、換言すれば、仮想マシン１２４、具体的には、仮想マシン１２４の仮想マシン名を求めることができる。 Therefore, the dump collection service unit 112 writes the process identification information in the virtual device 122 of the virtual machine 124 when, for example, the virtual machine 124 is generated. The guest driver 121 of the guest OS 12 operating on the virtual machine 124 reads the process identification information of the virtual device 122 from the virtual device 122 and passes it to the host driver 113. In response to this, the host driver 113 creates a virtualization ID / process list 117. In addition, the host driver 113 passes the process identification information to the dump collection service unit 112. In response to this, the dump collection service unit 112 creates the virtual machine / process list 118. As a result, when an interrupt to the virtual bus 21 having a virtual ID that is uniquely assigned to the virtual machine 124 and has a unique ID occurs, the host driver 113 can obtain the corresponding process identification information, and the dump collection service The virtual machine name corresponding to the unit 112, in other words, the virtual machine 124, specifically, the virtual machine name of the virtual machine 124 can be obtained.

最初に、仮想マシン／プロセス一覧１１８の作成について説明する。 First, creation of the virtual machine / process list 118 will be described.

ダンプ採取サービス部１１２は、仮想マシン管理サービス部１１６に接続して、仮想マシン／プロセス一覧１１８を作成する。仮想マシン／プロセス一覧１１８は、仮想マシン１２４の情報として、仮想マシン名と、仮想マシン１２４の状態（実行中、停止中、削除）と、仮想マシン１２４に対応するプロセスのプロセス識別情報とを含む。 The dump collection service unit 112 connects to the virtual machine management service unit 116 and creates a virtual machine / process list 118. The virtual machine / process list 118 includes, as information on the virtual machine 124, a virtual machine name, a state of the virtual machine 124 (running, stopped, deleted), and process identification information of a process corresponding to the virtual machine 124. .

仮想マシン名と、仮想マシン１２４の状態は、例えば、仮想マシン／プロセス一覧１１８の作成時又は作成に先立って、ダンプ採取サービス部１１２により、仮想マシン１２４を管理する仮想マシン管理サービス部１１６から取得される。 The virtual machine name and the state of the virtual machine 124 are acquired from the virtual machine management service unit 116 that manages the virtual machine 124 by the dump collection service unit 112 when the virtual machine / process list 118 is created or prior to creation, for example. Is done.

仮想マシン１２４に対応するプロセスのプロセス識別情報は、実行中の仮想マシン１２４について与えられる一意な情報であり、仮想マシン１２４の定義とは独立に、実行中の仮想マシン１２４に対応するプロセスを識別する情報である。従って、プロセス識別情報は、例えば、ダンプ採取サービス部１１２により、仮想マシン管理サービス部１１６から取得した実行中の仮想マシン１２４の各々について定められる。また、プロセス識別情報を仮想マシン定義とは独立に定めることにより、１つの仮想マシン定義から複数の仮想マシン１２４を同時にプロセス化する場合にも対応することができる。 The process identification information of the process corresponding to the virtual machine 124 is unique information given to the running virtual machine 124, and identifies the process corresponding to the running virtual machine 124 independently of the definition of the virtual machine 124. Information. Accordingly, the process identification information is determined for each of the running virtual machines 124 acquired from the virtual machine management service unit 116 by the dump collection service unit 112, for example. Further, by defining the process identification information independently of the virtual machine definition, it is possible to cope with a case where a plurality of virtual machines 124 are simultaneously processed from one virtual machine definition.

ダンプ採取サービス部１１２は、仮想マシン／プロセス一覧１１８を作成すると、仮想マシン状態の監視を開始し、仮想マシン１２４のクラッシュの監視を開始する。ダンプ採取サービス部１１２は、予め定められた時間間隔で、仮想マシン管理サービス部１１６から仮想マシン状態を取得して、仮想マシン状態の変更に応じて、仮想マシン／プロセス一覧１１８を更新する。そして、新たに開始された仮想マシン１２４がある場合には、新たに開始された仮想マシン１２４について、新しいプロセス識別情報を作成し、その仮想マシン１２４に接続されている仮想デバイス１２２に書き込む。 When the dump collection service unit 112 creates the virtual machine / process list 118, the dump collection service unit 112 starts monitoring the virtual machine state and starts monitoring the crash of the virtual machine 124. The dump collection service unit 112 acquires the virtual machine state from the virtual machine management service unit 116 at predetermined time intervals, and updates the virtual machine / process list 118 according to the change of the virtual machine state. If there is a newly started virtual machine 124, new process identification information is created for the newly started virtual machine 124 and written to the virtual device 122 connected to the virtual machine 124.

仮想マシン管理サービス部１１６は、仮想マシン／プロセス一覧１１８を更新すると、ホストドライバ１１３に仮想マシン状態の変更を通知する。この通知は、状態変更の種別（追加、削除、実行中→停止、停止中→実行）と、仮想マシン１２４に対応するプロセスのプロセス識別情報とを含む。 When the virtual machine management service unit 116 updates the virtual machine / process list 118, the virtual machine management service unit 116 notifies the host driver 113 of a change in the virtual machine state. This notification includes the type of state change (addition, deletion, executing → stopped, stopped → executed) and process identification information of the process corresponding to the virtual machine 124.

次に、仮想化ＩＤ／プロセス一覧１１７の作成について説明する。 Next, creation of the virtualization ID / process list 117 will be described.

ホストドライバ１１３は、ゲストＯＳ１２からの割り込みを受付け、また、ダンプ採取サービス部１１２からの要求を受付ける。例えば、ホストドライバ１１３は、ダンプ採取サービス部１１２からの要求の処理において、仮想化ＩＤ／プロセス一覧１１７の作成と更新を行う。前述したように、ホストドライバ１１３は、ハイパーバイザ２を介してのみ仮想マシン１２４にアクセスできるので仮想化ＩＤにはアクセスできるが、一方、仮想マシン１２４に対応するプロセスのプロセス識別情報にはアクセスできない。そこで、ダンプ採取サービス部１１２が仮想マシン１２４の仮想デバイス１２２に書き込んだプロセス識別情報を、ゲストドライバ１２１を介して読み込み、仮想化ＩＤとプロセス識別情報とを対応付ける仮想化ＩＤ／プロセス一覧１１７を作成する。 The host driver 113 accepts an interrupt from the guest OS 12 and accepts a request from the dump collection service unit 112. For example, the host driver 113 creates and updates the virtualization ID / process list 117 in the processing of the request from the dump collection service unit 112. As described above, since the host driver 113 can access the virtual machine 124 only through the hypervisor 2, it can access the virtualization ID, but it cannot access the process identification information of the process corresponding to the virtual machine 124. . Therefore, the process ID information written to the virtual device 122 of the virtual machine 124 by the dump collection service unit 112 is read via the guest driver 121, and a virtualization ID / process list 117 that associates the virtualization ID with the process identification information is created. To do.

仮想化ＩＤは、例えば、仮想化ＩＤ／プロセス一覧１１７の作成時又は作成に先立って、ホストドライバ１１３により、仮想バス２１を管理するハイパーバイザ２から取得される。 For example, the virtualization ID is acquired from the hypervisor 2 that manages the virtual bus 21 by the host driver 113 when or before the creation of the virtualization ID / process list 117.

ホストドライバ１１３は、ダンプ採取サービス部１１２からＩＯ要求を受け付ける。受け付けるＩＯ要求には、クラッシュの監視開始、仮想マシン１２４の状態変更（ＶＭ開始、ＶＭ停止、ＶＭ削除）、クラッシュの監視終了の種別がある。 The host driver 113 receives an IO request from the dump collection service unit 112. The received IO requests include types of crash monitoring start, virtual machine 124 status change (VM start, VM stop, VM deletion), and crash monitoring end.

ホストドライバ１１３は、最初にクラッシュの監視開始の要求を受け取り、仮想化ＩＤ／プロセス一覧１１７の作成を開始する。具体的には、ホストドライバ１１３は、ハイパーバイザ２から仮想化ＩＤを取得し、仮想化ＩＤで識別される仮想マシン１２４がゲストドライバ１２１を持つ場合に、その仮想マシン１２４に対してクラッシュ監視の開始を通知し、応答として、ゲストドライバ１２１からその仮想マシン１２４に対応するプロセスのプロセス識別情報を受け取る。仮想化ＩＤに対応するプロセス識別情報が得られた仮想マシン１２４は、クラッシュの監視対象であるため、仮想化ＩＤ／プロセス一覧１１７に追加される。 The host driver 113 first receives a request to start monitoring for crashes, and starts creating the virtualization ID / process list 117. Specifically, the host driver 113 acquires a virtualization ID from the hypervisor 2, and when the virtual machine 124 identified by the virtualization ID has the guest driver 121, the host driver 113 performs crash monitoring for the virtual machine 124. The start is notified, and the process identification information of the process corresponding to the virtual machine 124 is received from the guest driver 121 as a response. The virtual machine 124 for which the process identification information corresponding to the virtualization ID is obtained is added to the virtualization ID / process list 117 because it is a crash monitoring target.

なお、ゲストＯＳ１２からの割り込みの受付は初期処理の後に直ちに開始されるが、実際にゲストＯＳ１２からの割り込みが行われるのは、そのゲストＯＳ１２のゲストドライバ１２１の確認が完了し、ゲストＯＳ１２の情報が仮想化ＩＤ／プロセス一覧１１７への追加が行われてからである。 The acceptance of the interrupt from the guest OS 12 starts immediately after the initial processing, but the actual interrupt from the guest OS 12 is performed after the confirmation of the guest driver 121 of the guest OS 12 is completed and the information on the guest OS 12 Is added to the virtualization ID / process list 117.

ホストドライバ１１３は、クラッシュの監視開始後、仮想マシン１２４の状態変更の要求を受け取る。ホストドライバ１１３は、仮想マシン１２４の状態変更の内容に応じて、仮想化ＩＤ／プロセス一覧１１７を更新する。この際、変更要求に含まれるプロセス識別情報をキーとして仮想化ＩＤ／プロセス一覧１１７が更新される。 The host driver 113 receives a request to change the state of the virtual machine 124 after the start of crash monitoring. The host driver 113 updates the virtualization ID / process list 117 according to the contents of the status change of the virtual machine 124. At this time, the virtualization ID / process list 117 is updated using the process identification information included in the change request as a key.

ホストドライバ１１３は、ダンプ採取サービス部１１２の停止にともない、クラッシュ監視停止の要求を受け取る。この要求を受け取ったなら、仮想化ＩＤ／プロセス一覧１１７の全ての情報を破棄する。 The host driver 113 receives a request to stop crash monitoring as the dump collection service unit 112 stops. If this request is received, all information in the virtualization ID / process list 117 is discarded.

次に、ゲストドライバ１２１からホストドライバ１１３へのプロセス識別情報の引渡しについて説明する。 Next, delivery of process identification information from the guest driver 121 to the host driver 113 will be described.

ゲストドライバ１２１は、ホストドライバ１１３からのクラッシュの監視開始の通知を受け取って、これを契機として仮想デバイス１２２にアクセスし、自己が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を取得する。ゲストドライバ１２１は、取得したプロセス識別情報をホストドライバ１１３に送る。プロセス識別情報は、仮想マシン１２４の開始に当って、ダンプ採取サービス部１１２が一意に設定する値であるため、この処理はゲストドライバ１２１の初期化後に一回だけ実行されれば十分である。 Upon receiving the notification of the start of crash monitoring from the host driver 113, the guest driver 121 accesses the virtual device 122 as a trigger, and acquires process identification information of the process corresponding to the virtual machine 124 on which the guest driver 121 operates. The guest driver 121 sends the acquired process identification information to the host driver 113. Since the process identification information is a value that is uniquely set by the dump collection service unit 112 at the start of the virtual machine 124, it is sufficient that this process is executed only once after the initialization of the guest driver 121.

この際、例えば、ゲストドライバ１２１は、プロセス識別情報を予め定められたアドレスに設定する。予め定められたアドレスは、例えば、ホストＯＳ１１の使用する物理メモリである。ホストドライバ１１３は、ハイパーバイザ２によりゲストＯＳ１２上のアドレスをホストＯＳ１１の物理メモリ上のアドレスに変換して、直接参照する。これは、アドレス変換による直接参照である。 At this time, for example, the guest driver 121 sets the process identification information to a predetermined address. The predetermined address is, for example, a physical memory used by the host OS 11. The host driver 113 converts the address on the guest OS 12 into an address on the physical memory of the host OS 11 by the hypervisor 2 and directly refers to it. This is a direct reference by address translation.

なお、ゲストドライバ１２１が、取得したプロセス識別情報を、ハイパーバイザ２の持つ仮想マシン１２４の間の通信機能を使用して、ホストドライバ１１３に送るようにしてもよい。これは、ドライバ間の通信による間接参照である。 Note that the guest driver 121 may send the acquired process identification information to the host driver 113 using a communication function between the virtual machines 124 of the hypervisor 2. This is an indirect reference by communication between drivers.

次に、情報処理システムの運用における、ホストＯＳ１１による、複数のゲストＯＳ１２についてのメモリダンプについて説明する。 Next, a memory dump for a plurality of guest OSs 12 by the host OS 11 in the operation of the information processing system will be described.

ゲストドライバ１２１Ｂは、ゲストＯＳ１２Ｂのクラッシュの監視開始の通知をホストドライバ１１３から受け取り、プロセス識別情報をホストドライバ１１３に返す。そして、ゲストドライバ１２１Ｂは、ゲストＯＳ１２Ｂのパニック出口にゲストドライバ１２１Ｂを登録する。 The guest driver 121B receives from the host driver 113 a notification that the guest OS 12B has started to monitor crashes, and returns process identification information to the host driver 113. Then, the guest driver 121B registers the guest driver 121B at the panic exit of the guest OS 12B.

ゲストＯＳ１２ＢがゲストＯＳ１２Ｂにおけるクラッシュの発生を検出すると、ゲストＯＳ１２Ｂのパニック出口として登録されているゲストドライバ１２１Ｂが呼び出される。呼び出されたゲストドライバ１２１Ｂは、ホストドライバ１１３に、クラッシュの通知を送る。具体的には、ゲストドライバ１２１Ｂは、対応するゲストＯＳ１２Ｂが動作している仮想マシン１２４に割り当てられた仮想バス２１に割り込みを発生させる。仮想バス２１には仮想化ＩＤが与えられている。 When the guest OS 12B detects the occurrence of a crash in the guest OS 12B, the guest driver 121B registered as a panic exit of the guest OS 12B is called. The called guest driver 121B sends a crash notification to the host driver 113. Specifically, the guest driver 121B generates an interrupt on the virtual bus 21 assigned to the virtual machine 124 in which the corresponding guest OS 12B is operating. A virtual ID is given to the virtual bus 21.

ホストＯＳ１１のホストドライバ１１３は、ゲストドライバ１２１Ｂからのクラッシュの通知を受け取り、割り込みの発生した仮想バス２１に基づいて、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１の仮想化ＩＤ「ＩＤ２」を特定する。そして、ホストドライバ１１３は、受け取ったクラッシュの通知の仮想化ＩＤ「ＩＤ２」をキーとして用いて、仮想化ＩＤ／プロセス一覧１１７を検索して、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報である「プロセスＹ」を得る。 The host driver 113 of the host OS 11 receives the notification of the crash from the guest driver 121B, and based on the virtual bus 21 in which the interrupt has occurred, the virtual bus 21 assigned to the virtual machine 124 in which the guest OS 12 in which the crash has occurred operates. The virtualization ID “ID2” is specified. Then, the host driver 113 searches the virtualization ID / process list 117 using the virtual ID “ID2” of the received notification of the crash as a key, and sends it to the virtual machine 124 on which the guest OS 12 in which the crash occurred operates. “Process Y” which is the process identification information of the corresponding process is obtained.

ホストドライバ１１３は、ダンプ採取サービス部１１２へクラッシュ発生を通知し、この際、プロセス識別情報「プロセスＹ」も通知する。 The host driver 113 notifies the dump collection service unit 112 of the occurrence of the crash, and at this time, also notifies the process identification information “process Y”.

ダンプ採取サービス部１１２は、ホストドライバ１１３に、仮想マシン１２４のクラッシュの監視の開始を指示する。この際、応答として、ホストドライバ１１３から、ダンプ採取サービス部１１２に、クラッシュ発生を通知する際に使用するインタフェース情報が送られる。 The dump collection service unit 112 instructs the host driver 113 to start monitoring the crash of the virtual machine 124. At this time, as a response, interface information used when notifying the occurrence of the crash is sent from the host driver 113 to the dump collection service unit 112.

ダンプ採取サービス部１１２は、ホストドライバ１１３から前述のインタフェース情報に従うクラッシュの発生通知を受け取り、受け取ったクラッシュの発生通知に含まれるプロセス識別情報「プロセスＹ」を取り出す。そして、ダンプ採取サービス部１１２は、取り出したプロセス識別情報「プロセスＹ」をキーとして用いて、仮想マシン／プロセス一覧１１８を検索して、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４の仮想マシン名である「仮想マシンＢ」を得る。これにより、クラッシュが発生した仮想マシンＢと、クラッシュが発生した仮想マシンＢが動作するプロセスＹが特定される。 The dump collection service unit 112 receives a crash occurrence notification according to the above-described interface information from the host driver 113, and extracts process identification information “process Y” included in the received crash occurrence notification. Then, the dump collection service unit 112 searches the virtual machine / process list 118 using the extracted process identification information “process Y” as a key, and the virtual machine 124 of the virtual machine 124 on which the guest OS 12 in which the crash occurred operates. The name “virtual machine B” is obtained. Thereby, the virtual machine B in which the crash has occurred and the process Y in which the virtual machine B in which the crash has occurred operate are specified.

ダンプ採取サービス部１１２は、仮想マシンＢを特定すると、特定した仮想マシンＢに対するダンプ採取処理を実行する。例えば、ダンプ採取サービス部１１２は、クラッシュが発生した仮想マシンＢのプロセスＹが使用するメモリ１２３の障害発生時におけるメモリ１２３の内容をコピーして、保存状態格納領域１１４に保存する。そして、ダンプ採取サービス部１１２は、保存状態格納領域１１４に保存したメモリ１２３の内容を読みだしてダンプファイルに変換し、ダンプ格納領域１１５に格納する。 When the dump collection service unit 112 identifies the virtual machine B, the dump collection service unit 112 executes dump collection processing for the identified virtual machine B. For example, the dump collection service unit 112 copies the contents of the memory 123 at the time of failure of the memory 123 used by the process Y of the virtual machine B in which the crash has occurred, and saves it in the save state storage area 114. Then, the dump collection service unit 112 reads the contents of the memory 123 stored in the storage state storage area 114, converts the contents into a dump file, and stores the dump file in the dump storage area 115.

また、ダンプ採取サービス部１１２は、仮想マシン管理サービス部１１６に、仮想マシンＢの停止と再起動を依頼する。 Further, the dump collection service unit 112 requests the virtual machine management service unit 116 to stop and restart the virtual machine B.

なお、仮想マシン１２４の停止又は再開に応じて仮想化ＩＤ／プロセス一覧１１７を更新するために、仮想マシン１２４の状態を監視する監視処理が実行される。この監視処理は、例えば、仮想マシン管理サービス部１１６により実行され、仮想マシン管理サービス部１１６から、仮想マシン１２４の状態が切り替わる度に、ホストドライバ１１３に仮想マシン１２４の状態の切り替わりを通知されることにより、仮想化ＩＤ／プロセス一覧１１７を更新する。 In order to update the virtualization ID / process list 117 according to the stop or restart of the virtual machine 124, a monitoring process for monitoring the state of the virtual machine 124 is executed. This monitoring process is executed by, for example, the virtual machine management service unit 116, and the virtual machine management service unit 116 notifies the host driver 113 of the switching of the state of the virtual machine 124 every time the state of the virtual machine 124 is switched. As a result, the virtualization ID / process list 117 is updated.

以上のように、本発明によれば、ホストＯＳ１１がゲストＯＳ１２のメモリ１２３のメモリダンプを採取することができる。これにより、採取したダンプファイルの保存先をホストＯＳ１１側に集約しているので、複数のゲストＯＳ１２の間でディスク領域を共用することができる。また、ゲストＯＳ１２が備えるダンプ採取機能を使用しないので、ゲストＯＳ１２が動作する仮想マシン１２４の各々に、ダンプ採取のためのリソースを割り当てる必要を無くすことができる。 As described above, according to the present invention, the host OS 11 can collect the memory dump of the memory 123 of the guest OS 12. Thereby, since the storage destinations of the collected dump files are collected on the host OS 11 side, the disk area can be shared among the plurality of guest OSs 12. In addition, since the dump collection function of the guest OS 12 is not used, it is possible to eliminate the need to allocate resources for dump collection to each of the virtual machines 124 on which the guest OS 12 operates.

例えば、前述したように、２ＧＢの容量を持つメモリ１２３を使用するゲストＯＳ１１が１０個存在する場合において、１０個の仮想マシン１２４の全てについてメモリダンプを採取するとしても、２０ＧＢのディスク領域は必要ではなく、予備のディスク領域を考慮したとしても、例えば４ＧＢのディスク領域を用意するだけでよい。なお、採取したメモリダンプは、次のメモリダンプの採取に障害のない時間で、例えば、ファイルとして出力すればよい。 For example, as described above, when there are ten guest OSs 11 using the memory 123 having a capacity of 2 GB, even if a memory dump is collected for all of the ten virtual machines 124, a disk area of 20 GB is required. Instead, even if a spare disk area is considered, for example, a 4 GB disk area need only be prepared. The collected memory dump may be output as a file, for example, at a time when there is no failure in collecting the next memory dump.

従って、情報処理システムで使用される高価な記憶装置を、発生が予測できないゲストＯＳのクラッシュに備えて大量に獲得しておく必要をなくして、メモリダンプ採取の負担（コスト）を低減することができる。また、メモリダンプの採取のためのディスク領域を減らしても、実際のクラッシュの発生時に確実にメモリダンプが採取することができ、結果的に原因調査の期間を短縮化することができる。 Accordingly, it is not necessary to acquire a large amount of expensive storage devices used in the information processing system in preparation for a guest OS crash that cannot be predicted, and the burden (cost) of collecting a memory dump can be reduced. it can. Even if the disk area for collecting the memory dump is reduced, the memory dump can be reliably collected when an actual crash occurs, and the cause investigation period can be shortened as a result.

一方、採取したダンプファイルの保存先をホストＯＳ１１側に集約しているので、メモリダンプに関して、情報処理システムの構成を単純化することができる。これにより、１台のハードウェア上に複数の仮想マシン１２４をより容易に実装することができる。 On the other hand, since the storage destinations of the collected dump files are collected on the host OS 11 side, the configuration of the information processing system can be simplified with respect to the memory dump. Thereby, a plurality of virtual machines 124 can be more easily mounted on one piece of hardware.

また、本発明によれば、ホストドライバ１１３及びゲストドライバ１２１を設ける必要はあるが、メモリダンプ採取のための種々の設定を個々のゲストＯＳ１２に対して行う必要を無くすことができる。 Further, according to the present invention, it is necessary to provide the host driver 113 and the guest driver 121, but it is possible to eliminate the need to perform various settings for collecting the memory dump for each guest OS 12.

また、本発明によれば、ゲストＯＳ１２のメモリ１２３の内容の保存処理と、保存したメモリ１２３の内容のダンプファイルへの変換処理とは、異なるタイミングで、換言すれば、独立したタイミングで行うことができる。また、メモリ１２３の内容の保存処理は、メモリ１２３のコピーであるので高速で終了することができる。従って、ゲストＯＳ１２のクラッシュ発生時に、メモリ１２３の内容の保存処理のみを終了した時点でメモリダンプ採取が完了する前に、ゲストＯＳ１２の停止処理及び再開処理を実行することができる。これにより、通常のメモリダンプ採取によるよりも、仮想マシン１２４のダウンタイムを大幅に縮小することができる。 Further, according to the present invention, the process of saving the contents of the memory 123 of the guest OS 12 and the process of converting the saved contents of the memory 123 into a dump file are performed at different timings, in other words, at independent timings. Can do. In addition, the storage process of the contents of the memory 123 can be completed at high speed because it is a copy of the memory 123. Therefore, when the guest OS 12 crashes, the stop processing and the restart processing of the guest OS 12 can be executed before the memory dump collection is completed when only the storage processing of the contents of the memory 123 is completed. As a result, the downtime of the virtual machine 124 can be significantly reduced as compared with the normal memory dump collection.

また、本発明によれば、ゲストＯＳ１２のパニック出口や仮想バス２１等の通常のＯＳやハイパーバイザ２が備える機能を用いて実現することができる。これにより、ＯＳレベルでは、ホストドライバ１１３及びゲストドライバ１２１を追加するのみでよく、ゲストＯＳ１１やハイパーバイザ２を大きく変更する必要を無くすことができる。また、これにより、各々のゲストＯＳ１２のメモリダンプ採取の機能が適切に構成又は設定されていないため、クラッシュの発生時にダンプが採取されなかったり、十分な情報を取得できなかったりすることを防止することができる。 Further, according to the present invention, it can be realized by using a function provided in a normal OS or hypervisor 2 such as a panic exit of the guest OS 12 or the virtual bus 21. Thus, at the OS level, it is only necessary to add the host driver 113 and the guest driver 121, and it is possible to eliminate the need to greatly change the guest OS 11 and the hypervisor 2. In addition, this prevents the memory dump collection function of each guest OS 12 from being appropriately configured or set, so that a dump is not collected or sufficient information cannot be acquired when a crash occurs. be able to.

図５及び図６は、一体となって識別情報作成処理フローであり、ホストＯＳのダンプ採取サービス部における処理を示す。 5 and 6 are integrated identification information creation processing flows, and show processing in the dump collection service unit of the host OS.

ホストＯＳ１１のダンプ採取サービス部１１２は、ダンプ採取サービス部１１２の初期化処理を実行する（ステップＳ１１）。これにより、ダンプ採取サービス部１１２は、予め定められた初期状態に復帰する。 The dump collection service unit 112 of the host OS 11 executes initialization processing of the dump collection service unit 112 (step S11). Thereby, the dump collection service unit 112 returns to a predetermined initial state.

初期化処理の後、ダンプ採取サービス部１１２は、仮想マシン１２４を識別するために用いられる、仮想マシン１２４に対応するプロセス識別情報の作成処理を開始し、仮想マシン１２４を予め定められた順に、例えば仮想マシン名の順に処理対象として、処理対象とした仮想マシン１２４の状態を判定する（ステップＳ１２）。 After the initialization process, the dump collection service unit 112 starts creating process identification information corresponding to the virtual machine 124 used to identify the virtual machine 124, and sets the virtual machines 124 in a predetermined order. For example, the status of the virtual machine 124 to be processed is determined as the processing target in the order of the virtual machine name (step S12).

仮想マシン１２４の状態が実行中である場合に、ダンプ採取サービス部１１２は、実行中の仮想マシン１２４に対応するプロセスに対して、プロセス識別情報を作成して割り当てる（ステップＳ１３）。そして、ダンプ採取サービス部１１２は、割り当てたプロセス識別情報を、プロセスに対応する仮想マシン１２４の仮想デバイス１２２に書き込む（ステップＳ１４）。そして、ダンプ採取サービス部１１２は、実行中の仮想マシン１２４について、仮想マシン一覧である仮想マシン／プロセス一覧１１８に追加する（ステップＳ１５）。 When the state of the virtual machine 124 is being executed, the dump collection service unit 112 creates and assigns process identification information to the process corresponding to the virtual machine 124 being executed (step S13). Then, the dump collection service unit 112 writes the assigned process identification information to the virtual device 122 of the virtual machine 124 corresponding to the process (step S14). Then, the dump collection service unit 112 adds the virtual machine 124 being executed to the virtual machine / process list 118, which is a virtual machine list (step S15).

この後、ダンプ採取サービス部１１２は、全ての仮想マシン１２４について、仮想マシン／プロセス一覧１１８への追加を終了したか否かを調べる（ステップＳ１６）。全ての仮想マシン１２４について、仮想マシン／プロセス一覧１１８への追加を終了していない場合に、ダンプ採取サービス部１１２は、次の処理対象についてステップＳ１２を繰り返す。 Thereafter, the dump collection service unit 112 checks whether or not all the virtual machines 124 have been added to the virtual machine / process list 118 (step S16). If the addition to the virtual machine / process list 118 has not been completed for all virtual machines 124, the dump collection service unit 112 repeats step S12 for the next processing target.

全ての仮想マシン１２４について、仮想マシン／プロセス一覧１１８への追加を終了した場合に、ダンプ採取サービス部１１２は、仮想マシン状態の監視を開始し（ステップＳ１７）、また、ゲストＯＳ１２のクラッシュの監視を開始する（ステップＳ１８）。ステップＳ１７において、ダンプ採取サービス部１１２は、仮想マシン状態の監視の開始を、ホストドライバ１１３に通知する。 When all the virtual machines 124 have been added to the virtual machine / process list 118, the dump collection service unit 112 starts monitoring the virtual machine state (step S17), and also monitors the guest OS 12 for crashes. Is started (step S18). In step S17, the dump collection service unit 112 notifies the host driver 113 of the start of monitoring of the virtual machine state.

この後、ダンプ採取サービス部１１２は、ゲストＯＳ１２のクラッシュの通知を待つクラッシュ待ちの処理に入る（ステップＳ１９）。 Thereafter, the dump collection service unit 112 enters a process of waiting for a crash waiting for a notification of a crash of the guest OS 12 (step S19).

一方、ダンプ採取サービス部１１２は、ステップＳ１８の後、仮想マシン状態監視処理を開始する。仮想マシン状態監視処理は、仮想マシン１２４の状態を監視する状態監視スレッドにおいて実行される。状態監視スレッドは、例えば、ダンプ採取サービス部１１２により生成され、状態監視スレッド実行部により実行される。 On the other hand, the dump collection service unit 112 starts virtual machine state monitoring processing after step S18. The virtual machine state monitoring process is executed in a state monitoring thread that monitors the state of the virtual machine 124. For example, the state monitoring thread is generated by the dump collection service unit 112 and executed by the state monitoring thread execution unit.

ステップＳ１８の後、状態監視スレッド実行部は、予め定められた時間間隔で、仮想マシン管理サービス部１１６から仮想マシン１２４の状態を示す情報である仮想マシン状態を取得する（ステップＳ１１０）。 After step S18, the state monitoring thread execution unit acquires a virtual machine state that is information indicating the state of the virtual machine 124 from the virtual machine management service unit 116 at predetermined time intervals (step S110).

状態監視スレッド実行部は、仮想マシン状態を取得した場合に、取得した仮想マシン状態において仮想マシン状態に変更があるか否かを調べる（ステップＳ１１１）。そして、状態監視スレッド実行部は、仮想マシン状態に変更がある場合に、仮想マシン状態の変更の内容を判定する（ステップＳ１１２）。 When the virtual machine state is acquired, the state monitoring thread execution unit checks whether there is a change in the virtual machine state in the acquired virtual machine state (step S111). Then, when there is a change in the virtual machine state, the state monitoring thread execution unit determines the content of the change in the virtual machine state (step S112).

マシン状態の変更の内容が新たな仮想マシン１２４の開始である場合に、状態監視スレッド実行部は、新たに開始される仮想マシン１２４に対応するプロセスに対して、プロセス識別情報を作成して割り当てる（ステップＳ１１３）。そして、状態監視スレッド実行部は、割り当てたプロセス識別情報を、割り当てられたプロセスに対応する仮想マシン１２４の仮想デバイス１２２に書き込む（ステップＳ１１４）。そして、状態監視スレッド実行部は、新たに開始される仮想マシン１２４について、仮想マシン一覧である仮想マシン／プロセス一覧１１８に追加する（ステップＳ１１５）。これにより、仮想マシン１２４に生じた仮想マシン状態の変更に応じて、仮想マシン／プロセス一覧１１８が更新される。 When the content of the machine state change is the start of a new virtual machine 124, the state monitoring thread execution unit creates and assigns process identification information to the process corresponding to the newly started virtual machine 124. (Step S113). Then, the state monitoring thread execution unit writes the assigned process identification information to the virtual device 122 of the virtual machine 124 corresponding to the assigned process (step S114). Then, the state monitoring thread execution unit adds the newly started virtual machine 124 to the virtual machine / process list 118, which is a virtual machine list (step S115). As a result, the virtual machine / process list 118 is updated in accordance with the change in the virtual machine state that has occurred in the virtual machine 124.

この後、状態監視スレッド実行部は、ダンプ採取サービス部１１２に、仮想マシン状態を変更したことを通知する（ステップＳ１１６）。 Thereafter, the state monitoring thread execution unit notifies the dump collection service unit 112 that the virtual machine state has been changed (step S116).

ステップＳ１１２において、マシン状態の変更の内容が仮想マシン１２４の停止である場合に、状態監視スレッド実行部は、停止される仮想マシン１２４について、仮想マシン／プロセス一覧１１８において、マシン状態を更新する（ステップＳ１１７）。これにより、仮想マシン１２４に生じた仮想マシン状態の変更に応じて、仮想マシン／プロセス一覧１１８が更新される。この後、状態監視スレッド実行部は、ステップＳ１１６を実行する。 In step S112, when the content of the change in the machine state is the stop of the virtual machine 124, the state monitoring thread execution unit updates the machine state in the virtual machine / process list 118 for the virtual machine 124 to be stopped ( Step S117). As a result, the virtual machine / process list 118 is updated in accordance with the change in the virtual machine state that has occurred in the virtual machine 124. Thereafter, the state monitoring thread execution unit executes Step S116.

ステップＳ１１２において、マシン状態の変更の内容が仮想マシン１２４の削除である場合に、状態監視スレッド実行部は、削除される仮想マシン１２４について、仮想マシン／プロセス一覧１１８から削除する（ステップＳ１１８）。これにより、仮想マシン１２４に生じた仮想マシン状態の変更に応じて、仮想マシン／プロセス一覧１１８が更新される。この後、状態監視スレッド実行部は、ステップＳ１１６を実行する。 In step S112, when the content of the change in the machine state is deletion of the virtual machine 124, the state monitoring thread execution unit deletes the virtual machine 124 to be deleted from the virtual machine / process list 118 (step S118). As a result, the virtual machine / process list 118 is updated in accordance with the change in the virtual machine state that has occurred in the virtual machine 124. Thereafter, the state monitoring thread execution unit executes Step S116.

この後、状態監視スレッド実行部は、予め定められた時間が経過するまで待機し（ステップＳ１１９）、その後、ステップＳ１１０を繰り返す。 Thereafter, the state monitoring thread execution unit waits until a predetermined time elapses (step S119), and then repeats step S110.

一方、ステップＳ１１１において、仮想マシン状態に変更がない場合に、状態監視スレッド実行部は、ステップＳ１１９を実行する。 On the other hand, when there is no change in the virtual machine state in step S111, the state monitoring thread execution unit executes step S119.

図７は、識別情報作成処理フローであり、ホストＯＳ１１のホストドライバ１１３における処理を示す。 FIG. 7 is an identification information creation processing flow and shows processing in the host driver 113 of the host OS 11.

ホストＯＳ１１のホストドライバ１１３は、ホストドライバ１１３の初期化処理を実行する（ステップＳ２１）。これにより、ホストドライバ１１３は、予め定められた初期状態に復帰する。 The host driver 113 of the host OS 11 executes initialization processing for the host driver 113 (step S21). As a result, the host driver 113 returns to a predetermined initial state.

初期化処理の後、ホストドライバ１１３は、ゲストＯＳ１２からの仮想バス２１を介しての割り込みの受付を開始する（ステップＳ２２）。ゲストＯＳ１２からの割り込みの受付は、初期化処理の後、直ちに開始される。しかし、実際にゲストＯＳ１２からの割り込みが発生するのは、ゲストＯＳ１２が、下記のように、仮想化ＩＤ／プロセス一覧１１７へ追加された後である。 After the initialization process, the host driver 113 starts accepting an interrupt from the guest OS 12 via the virtual bus 21 (step S22). Acceptance of an interrupt from the guest OS 12 is started immediately after the initialization process. However, the interruption from the guest OS 12 actually occurs after the guest OS 12 is added to the virtualization ID / process list 117 as described below.

この後、ホストドライバ１１３は、ダンプ採取サービス部１１２からの種々の要求の受付を開始する（ステップＳ２３）。ダンプ採取サービス部１１２からの要求の受付の処理において、以下のように、仮想化ＩＤ／プロセス一覧１１７の作成及び更新が実行される。 Thereafter, the host driver 113 starts accepting various requests from the dump collection service unit 112 (step S23). In the process of receiving a request from the dump collection service unit 112, the creation and update of the virtualization ID / process list 117 are executed as follows.

ステップＳ２２の後、ホストドライバ１１３は、ゲストＯＳ１２のクラッシュの通知である仮想バス２１への割り込みを待つ処理、換言すれば、クラッシュ発生待ちの処理に入る（ステップＳ２４）。 After step S22, the host driver 113 enters a process of waiting for an interrupt to the virtual bus 21 that is a notification of a crash of the guest OS 12, in other words, a process of waiting for a crash to occur (step S24).

ステップＳ２３の後、ホストドライバ１１３は、サービス要求の処理を開始し、ダンプ採取サービス部１１２からのＩＯ要求を受け付ける（ステップＳ２５）。そして、ホストドライバ１１３は、ダンプ採取サービス部１１２からのＩＯ要求を受け付けると、受け付けたＩＯ要求の種別や変更内容を判定する（ステップＳ２６）。 After step S23, the host driver 113 starts service request processing and accepts an IO request from the dump collection service unit 112 (step S25). When the host driver 113 receives the IO request from the dump collection service unit 112, the host driver 113 determines the type of the received IO request and the change content (step S26).

ステップＳ２６において受け付けたＩＯ要求が仮想マシン１２４の停止、仮想マシン１２４の削除である場合には、ホストドライバ１１３は、停止や削除の対象である仮想マシン１２４に対応するプロセスのプロセス識別情報を用いて仮想化ＩＤ／プロセス一覧１１７を参照して、停止や削除の対象である仮想マシン１２４が仮想マシン一覧の仮想化ＩＤ／プロセス一覧１１７に追加されているか否かを調べる（ステップＳ２７）。そして、ホストドライバ１１３は、停止や削除の対象である仮想マシン１２４が仮想化ＩＤ／プロセス一覧１１７に追加されている場合には（ステップＳ２７、Ｙｅｓ）、停止や削除の対象である仮想マシン１２４を、仮想化ＩＤ／プロセス一覧１１７から削除する（ステップＳ２８）。 When the IO request received in step S26 is the stop of the virtual machine 124 or the deletion of the virtual machine 124, the host driver 113 uses the process identification information of the process corresponding to the virtual machine 124 to be stopped or deleted. Then, it is checked whether or not the virtual machine 124 to be stopped or deleted has been added to the virtualization ID / process list 117 of the virtual machine list with reference to the virtualization ID / process list 117 (step S27). When the virtual machine 124 to be stopped or deleted has been added to the virtualization ID / process list 117 (Yes in step S27), the host driver 113 determines that the virtual machine 124 to be stopped or deleted. Is deleted from the virtualization ID / process list 117 (step S28).

なお、ホストドライバ１１３は、停止や削除の対象である仮想マシン１２４が仮想化ＩＤ／プロセス一覧１１７に追加されていない場合には（ステップＳ２７、Ｎｏ）、停止や削除の対象である仮想マシン１２４についての処理を行わない。 Note that the host driver 113 determines that the virtual machine 124 to be stopped or deleted is not added to the virtualization ID / process list 117 (No in step S27), and the virtual machine 124 to be stopped or deleted. Do not process for.

ステップＳ２６において受け付けたＩＯ要求が仮想マシン１２４のクラッシュの監視開始、仮想マシン１２４の開始である場合には、ホストドライバ１１３は、仮想化ＩＤ／プロセス一覧１１７の作成更新処理を開始する。最初に、ホストドライバ１１３は、ハイパーバイザ２に接続して、全てのゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１の仮想化ＩＤを取得して列挙する（ステップＳ２９）。 If the IO request received in step S26 is the start of monitoring the crash of the virtual machine 124 or the start of the virtual machine 124, the host driver 113 starts the process of creating and updating the virtualization ID / process list 117. First, the host driver 113 connects to the hypervisor 2 and acquires and enumerates the virtualization IDs of the virtual bus 21 assigned to the virtual machines 124 on which all the guest OSs 12 operate (step S29).

この後、ホストドライバ１１３は、予め定められた順に、例えば仮想化ＩＤの昇順に、１個の仮想化ＩＤを処理対象として、仮想化ＩＤで識別される仮想マシン１２４がゲストドライバ１２１を含んでおり仮想化ＩＤ／プロセス一覧１１７に追加されているか否かを調べる（ステップＳ２１０）。処理対象である仮想マシン１２４が仮想化ＩＤ／プロセス一覧１１７に追加されている場合には（ステップＳ２１０、Ｙｅｓ）、ホストドライバ１１３は、次の処理対象である仮想マシン１２４についてステップＳ２９を繰り返す。 Thereafter, the host driver 113 includes the guest driver 121 in the virtual machine 124 identified by the virtualization ID, with one virtualization ID as a processing target, in the ascending order of the virtualization ID, for example. It is checked whether it has been added to the cage virtualization ID / process list 117 (step S210). When the processing target virtual machine 124 has been added to the virtualization ID / process list 117 (step S210, Yes), the host driver 113 repeats step S29 for the next processing target virtual machine 124.

処理対象である仮想マシン１２４が仮想化ＩＤ／プロセス一覧１１７に追加されていない場合には（ステップＳ２１０、Ｎｏ）、ホストドライバ１１３は、処理対象である仮想マシン１２４に対してクラッシュ監視の開始を通知し（ステップＳ２１１）、この通知に対する応答として、処理対象である仮想マシン１２４からそのプロセス識別情報を取得する（ステップＳ２１２）。そして、ホストドライバ１１３は、処理対象である仮想マシン１２４について、取得した仮想化ＩＤと、これに対応するプロセス識別情報とを、仮想化ＩＤ／プロセス一覧１１７に追加し（ステップＳ２１３）、全ての仮想化ＩＤについての処理を終了するまでステップＳ２９を繰り返す。これにより、仮想化ＩＤとこれに対応するプロセス識別情報とが取得できた仮想マシン１２４が、仮想化ＩＤ／プロセス一覧１１７に登録され、クラッシュの監視対象とされる。 If the processing target virtual machine 124 has not been added to the virtualization ID / process list 117 (No in step S210), the host driver 113 starts the crash monitoring for the processing target virtual machine 124. Notification is made (step S211), and as a response to this notification, the process identification information is acquired from the virtual machine 124 to be processed (step S212). Then, the host driver 113 adds the acquired virtualization ID and process identification information corresponding to the acquired virtual machine 124 to the virtualization ID / process list 117 (step S213). Step S29 is repeated until the process for the virtualization ID is completed. As a result, the virtual machine 124 for which the virtualization ID and the process identification information corresponding to the virtualization ID can be acquired is registered in the virtualization ID / process list 117 and is subject to crash monitoring.

ステップＳ２６において受け付けたＩＯ要求が仮想マシン１２４のクラッシュの監視停止である場合には、ホストドライバ１１３は、仮想化ＩＤ／プロセス一覧１１７の全ての仮想マシン１２４についての情報を破棄する（ステップＳ２１４）。 If the IO request received in step S26 is to stop monitoring the crash of the virtual machine 124, the host driver 113 discards information on all virtual machines 124 in the virtualization ID / process list 117 (step S214). .

図８は、識別情報作成処理フローであり、ゲストＯＳ１２のゲストドライバ１２１における処理を示す。 FIG. 8 is an identification information creation processing flow, and shows processing in the guest driver 121 of the guest OS 12.

ゲストドライバ１２１は、ゲストドライバ１２１の初期化処理を実行する（ステップＳ３１）。これにより、ゲストドライバ１２１は、予め定められた初期状態に復帰する。 The guest driver 121 executes initialization processing for the guest driver 121 (step S31). As a result, the guest driver 121 returns to a predetermined initial state.

初期化処理の後、ゲストドライバ１２１は、ホストドライバ１１３からのクラッシュの監視開始の通知の待ち状態となる（ステップＳ３２）。 After the initialization process, the guest driver 121 waits for a crash monitoring start notification from the host driver 113 (step S32).

この後、待ち状態のゲストドライバ１２１は、ホストドライバ１１３からのクラッシュの監視開始の通知を受信する（ステップＳ３３）。そして、クラッシュの監視開始の通知の受信を契機として、ゲストドライバ１２１は、対応するゲストＯＳ１２が使用する仮想デバイス１２２にアクセスして、仮想デバイス１２２から自己の動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を読み出して取得する（ステップＳ３４）。 Thereafter, the waiting guest driver 121 receives a notification of the start of crash monitoring from the host driver 113 (step S33). Upon receiving the notification of the start of monitoring the crash, the guest driver 121 accesses the virtual device 122 used by the corresponding guest OS 12, and the process of the process corresponding to the virtual machine 124 that operates itself from the virtual device 122. Process identification information is read and acquired (step S34).

この後、ゲストドライバ１２１は、仮想デバイス１２２から取得したプロセス識別情報を、ホストドライバ１１３に通知する（ステップＳ３５）。そして、ゲストドライバ１２１は、ゲストＯＳ１２のパニック出口にゲストドライバ１２１を登録する（ステップＳ３６）。ゲストドライバ１２１の登録は、例えば、クラッシュ発生時にゲストＯＳ１２のパニック出口から呼び出される関数を登録することにより行われる。この後、ゲストドライバ１２１は、ゲストＯＳ１２の終了待ち又はクラッシュの発生待ち処理を実行する（ステップＳ３７）。 Thereafter, the guest driver 121 notifies the host driver 113 of the process identification information acquired from the virtual device 122 (step S35). Then, the guest driver 121 registers the guest driver 121 at the panic exit of the guest OS 12 (step S36). For example, the guest driver 121 is registered by registering a function called from a panic exit of the guest OS 12 when a crash occurs. Thereafter, the guest driver 121 executes a waiting process for waiting for the guest OS 12 to end or a crash occurrence (step S37).

図９は、ダンプ採取処理フローであり、ゲストＯＳ１２のゲストドライバ１２１における処理を示す。 FIG. 9 is a dump collection processing flow and shows processing in the guest driver 121 of the guest OS 12.

ゲストドライバ１２１は、ゲストドライバ１２１の初期化処理及び仮想マシン識別情報作成処理の後（ステップＳ４１）、ＯＳ状態監視スレッドとクラッシュ待ちスレッドとを生成する。ＯＳ状態監視スレッドは、ＯＳ状態監視スレッド実行部により実行される。クライアント待ちスレッドは、クライアント待ちスレッド実行部により実行される。ＯＳ状態監視スレッドとクライアント待ちスレッドとは、並行して実行される。なお、ステップＳ４１の処理は、図８に示す処理である。 The guest driver 121 generates an OS state monitoring thread and a crash waiting thread after the initialization process and the virtual machine identification information creation process of the guest driver 121 (step S41). The OS state monitoring thread is executed by the OS state monitoring thread execution unit. The client waiting thread is executed by the client waiting thread execution unit. The OS state monitoring thread and the client waiting thread are executed in parallel. The process in step S41 is the process shown in FIG.

ゲストドライバ１２１、換言すれば、ＯＳ状態監視スレッド実行部は、ステップＳ４１の後、情報処理システムにおけるイベント待ちの状態となる（ステップＳ４２）。即ち、ゲストドライバ１２１は、クラッシュの監視開始の通知をホストドライバ１１３から受け取って、プロセス識別情報をホストドライバ１１３に返信し、ゲストＯＳ１２のパニック出口にゲストドライバ１２１を登録した後に、イベント待ちの状態になる。 The guest driver 121, in other words, the OS state monitoring thread execution unit enters an event waiting state in the information processing system after step S41 (step S42). That is, the guest driver 121 receives a notification of the start of crash monitoring from the host driver 113, returns process identification information to the host driver 113, registers the guest driver 121 in the panic exit of the guest OS 12, and then waits for an event. become.

この後、ＯＳ状態監視スレッド実行部は、イベントが発生すると、発生したイベントがゲストＯＳ１２の終了というイベントであるか否かを調べる（ステップＳ４３）。発生したイベントがゲストＯＳ１２の終了というイベントでない場合には（ステップＳ４３、Ｎｏ）、ＯＳ状態監視スレッド実行部は、ステップＳ４２を繰り返す。 Thereafter, when an event occurs, the OS state monitoring thread execution unit checks whether the generated event is an event of termination of the guest OS 12 (step S43). When the generated event is not an event of termination of the guest OS 12 (No at Step S43), the OS state monitoring thread execution unit repeats Step S42.

発生したイベントがゲストＯＳ１２の終了というイベントである場合には（ステップＳ４３、Ｙｅｓ）、ＯＳ状態監視スレッド実行部は、ゲストＯＳ１２のパニック出口へのゲストドライバ１２１の登録を解除して、ゲストＯＳ１２自体を終了させる（ステップＳ４４）。 When the generated event is an event of termination of the guest OS 12 (step S43, Yes), the OS state monitoring thread execution unit cancels the registration of the guest driver 121 at the panic exit of the guest OS 12, and the guest OS 12 itself Is terminated (step S44).

一方、ゲストドライバ１２１、換言すれば、クラッシュ待ちスレッド実行部は、ゲストＯＳ１２が自己におけるクラッシュの発生を検出すると、ゲストＯＳ１２のパニック出口に登録されているゲストドライバ１２１を呼び出す（ステップＳ４５）。 On the other hand, the guest driver 121, in other words, the crash waiting thread execution unit, when the guest OS 12 detects the occurrence of a crash in itself, calls the guest driver 121 registered in the panic exit of the guest OS 12 (step S45).

呼び出されたゲストドライバ１２１は、仮想バス２１を介してホストＯＳ１１に割り込むことにより、ホストドライバ１１３へ、ゲストＯＳ１２におけるクラッシュの発生を通知する（ステップＳ４６）。 The called guest driver 121 interrupts the host OS 11 via the virtual bus 21, thereby notifying the host driver 113 of the occurrence of a crash in the guest OS 12 (step S46).

そして、ゲストドライバ１２１は、ステップＳ４６に続いて、又は、ステップＳ４６と並行して、ゲストＯＳ１２によるメモリダンプの出力を抑止する処理ルーチンを呼び出して、メモリダンプの出力を抑止させる（ステップＳ４７）。この処理ルーチンは、仮想マシン１２４の外部からゲストＯＳ１２の使用するメモリ１２３の内容を保存するまで、仮想マシン１２４のメモリ１２３の内容が変更されないようにする無限ループである。 Then, the guest driver 121 calls a processing routine for suppressing the output of the memory dump by the guest OS 12 subsequent to step S46 or in parallel with step S46, and suppresses the output of the memory dump (step S47). This processing routine is an infinite loop that prevents the contents of the memory 123 of the virtual machine 124 from being changed until the contents of the memory 123 used by the guest OS 12 are stored from outside the virtual machine 124.

図１０は、ダンプ採取処理フローであり、ホストＯＳ１１のホストドライバ１１３における処理を示す。 FIG. 10 is a dump collection processing flow and shows processing in the host driver 113 of the host OS 11.

ホストドライバ１１３は、ホストドライバ１１３の初期化処理及び仮想マシン識別情報作成処理の後（ステップＳ５１）、ＯＳ状態監視スレッドと割り込み待ちスレッドとを生成する。ＯＳ状態監視スレッドは、ＯＳ状態監視スレッド実行部により実行される。割り込み待ちスレッドは、割り込み待ちスレッド実行部により実行される。ＯＳ状態監視スレッドと割り込み待ちスレッドとは、並行して実行される。なお、ステップＳ５１の処理は、図７に示す処理である。 The host driver 113 generates an OS state monitoring thread and an interrupt waiting thread after the initialization process of the host driver 113 and the virtual machine identification information creation process (step S51). The OS state monitoring thread is executed by the OS state monitoring thread execution unit. The interrupt waiting thread is executed by the interrupt waiting thread execution unit. The OS state monitoring thread and the interrupt waiting thread are executed in parallel. Note that the processing in step S51 is the processing shown in FIG.

割り込み待ちスレッドの生成により、ホストドライバ１１３は、ゲストＯＳ１２のゲストドライバ１２１からの割り込みの受け付けが可能となる。ホストドライバ１１３は、ダンプ採取サービス部１１２からクラッシュの監視開始の要求を受け取って、クラッシュ監視対象の仮想化ＩＤ／プロセス一覧１１７を作成した後に、実際のクラッシュの監視を開始する。換言すれば、ホストドライバ１１３は、情報処理システムにおけるイベント待ち、及び、割り込み待ちの状態となる。 By generating the interrupt waiting thread, the host driver 113 can accept an interrupt from the guest driver 121 of the guest OS 12. The host driver 113 receives a crash monitoring start request from the dump collection service unit 112 and creates a crash monitoring target virtualization ID / process list 117, and then starts actual crash monitoring. In other words, the host driver 113 enters an event waiting state and an interrupt waiting state in the information processing system.

ホストドライバ１１３、換言すれば、ＯＳ状態監視スレッド実行部は、ステップＳ５１の後、情報処理システムにおけるイベント待ちの状態となる（ステップＳ５２）。 The host driver 113, in other words, the OS state monitoring thread execution unit enters a state of waiting for an event in the information processing system after step S51 (step S52).

この後、ＯＳ状態監視スレッド実行部は、イベントが発生すると、発生したイベントがホストＯＳ１１の終了というイベントであるか否かを調べる（ステップＳ５３）。発生したイベントがホストＯＳ１１の終了というイベントでない場合には（ステップＳ５３、Ｎｏ）、ＯＳ状態監視スレッド実行部は、ステップＳ５２を繰り返す。 Thereafter, when an event occurs, the OS state monitoring thread execution unit checks whether the generated event is an event of termination of the host OS 11 (step S53). When the generated event is not an event of termination of the host OS 11 (No at Step S53), the OS state monitoring thread execution unit repeats Step S52.

発生したイベントがホストＯＳ１１の終了というイベントである場合には（ステップＳ５３、Ｙｅｓ）、ＯＳ状態監視スレッド実行部は、割り込み待ちスレッドを解除し（ステップＳ５４）、仮想マシン一覧の仮想化ＩＤ／プロセス一覧１１７を破棄して（ステップＳ５５）、処理を終了する。 When the generated event is an event of termination of the host OS 11 (step S53, Yes), the OS state monitoring thread execution unit releases the interrupt waiting thread (step S54), and the virtual machine list virtualization ID / process The list 117 is discarded (step S55), and the process is terminated.

一方、ホストドライバ１１３、換言すれば、割り込み待ちスレッド実行部は、情報処理システムからの割り込みの待ち状態となる（ステップＳ５６）。 On the other hand, the host driver 113, in other words, the interrupt waiting thread execution unit waits for an interrupt from the information processing system (step S56).

割り込み待ちスレッド実行部は、ゲストＯＳ１２におけるクラッシュの発生に応じてゲストドライバ１２１により仮想バス２１を介してホストＯＳ１１に割り込みが発生すると、発生した割り込みを捕捉する（ステップＳ５７）。そして、割り込み待ちスレッドは、仮想化ＩＤのような割り込みに付随する情報を参照することにより、割り込みがゲストＯＳ１２のクラッシュを契機として発生したものか否かを判定する（ステップＳ５８）。 When an interrupt occurs in the host OS 11 via the virtual bus 21 by the guest driver 121 in response to the occurrence of a crash in the guest OS 12, the interrupt waiting thread execution unit captures the generated interrupt (step S57). Then, the interrupt waiting thread determines whether or not the interrupt is generated when the guest OS 12 crashes by referring to information accompanying the interrupt such as the virtualization ID (step S58).

割り込みがゲストＯＳ１２のクラッシュを契機として発生したものでない場合には、ホストＯＳ１１で発生したイベントを契機として発生した割り込みであるので、割り込み待ちスレッド実行部は、捕捉した割り込みを、ホストＯＳ１１用の割り込み待ちスレッドに転送した後（ステップＳ５９）、ステップＳ５６を繰り返す。割り込み待ちスレッドがゲストＯＳ１２のクラッシュに起因する割り込みを処理するスレッドであるのに対して、ホストＯＳ１１用の割り込み待ちスレッド実行部はホストＯＳ１１に起因する割り込みを処理するスレッドである。 If the interrupt is not triggered by the crash of the guest OS 12, it is an interrupt that was triggered by an event that occurred in the host OS 11, so the interrupt-waiting thread execution unit sends the captured interrupt to the interrupt for the host OS 11. After transferring to the waiting thread (step S59), step S56 is repeated. The interrupt waiting thread is a thread that processes an interrupt caused by the crash of the guest OS 12, whereas the interrupt waiting thread execution unit for the host OS 11 is a thread that processes an interrupt caused by the host OS 11.

割り込みがゲストＯＳ１２のクラッシュを契機として発生した場合には、割り込み待ちスレッド実行部は、仮想化ＩＤに基づいて、プロセス識別情報を解決する（ステップＳ５１０）。具体的には、割り込み待ちスレッド実行部は、割り込みの発生した仮想バス２１に基づいて、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に割り当てられた仮想バス２１の仮想化ＩＤを特定する。そして、特定した仮想化ＩＤをキーとして用いて仮想化ＩＤ／プロセス一覧１１７を参照することにより、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を得る。これにより、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に対応する、ホストＯＳ１１のプロセスを特定することができる。 When an interrupt occurs when the guest OS 12 crashes, the interrupt waiting thread execution unit resolves the process identification information based on the virtualization ID (step S510). Specifically, the interrupt waiting thread execution unit identifies the virtualization ID of the virtual bus 21 assigned to the virtual machine 124 in which the guest OS 12 in which the crash occurred operates based on the virtual bus 21 in which the interrupt has occurred. Then, by referring to the virtualization ID / process list 117 using the specified virtualization ID as a key, process identification information of a process corresponding to the virtual machine 124 on which the guest OS 12 in which the crash has occurred is obtained. Thereby, the process of the host OS 11 corresponding to the virtual machine 124 in which the guest OS 12 in which the crash occurred operates can be specified.

この後、割り込み待ちスレッド実行部は、特定した仮想マシン１２４をダンプ採取サービス部１１２に通知して（ステップＳ５１１）、ステップＳ５６を繰り返す。この通知は、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４に対応するプロセスのプロセス識別情報を含む。 Thereafter, the interrupt waiting thread execution unit notifies the identified virtual machine 124 to the dump collection service unit 112 (step S511), and repeats step S56. This notification includes process identification information of a process corresponding to the virtual machine 124 in which the guest OS 12 in which the crash has occurred operates.

図１１は、ダンプ採取処理フローであり、ホストＯＳ１１のダンプ採取サービス部１１２における処理を示す。 FIG. 11 is a dump collection processing flow and shows processing in the dump collection service unit 112 of the host OS 11.

ダンプ採取サービス部１１２は、ダンプ採取サービス部１１２の初期化処理及び仮想マシン識別情報作成処理の後（ステップＳ６１）、ＯＳ状態監視スレッドとクラッシュイベント待ちスレッドとを生成する。ＯＳ状態監視スレッドは、ＯＳ状態監視スレッド実行部により実行される。クラッシュイベント待ちスレッドは、クラッシュイベント待ちスレッド実行部により実行される。ＯＳ状態監視スレッドとクラッシュイベント待ちスレッドとは、並行して実行される。なお、ステップＳ６１の処理は、図５及び図６に示す処理である。 The dump collection service unit 112 generates an OS state monitoring thread and a crash event waiting thread after the initialization process and the virtual machine identification information creation process of the dump collection service unit 112 (step S61). The OS state monitoring thread is executed by the OS state monitoring thread execution unit. The crash event waiting thread is executed by the crash event waiting thread execution unit. The OS state monitoring thread and the crash event waiting thread are executed in parallel. The process of step S61 is the process shown in FIGS.

ダンプ採取サービス部１１２、換言すれば、ＯＳ状態監視スレッド実行部は、ステップＳ６１の後、情報処理システムにおけるイベント待ちの状態となる（ステップＳ６２）。 After step S61, the dump collection service unit 112, in other words, the OS state monitoring thread execution unit enters an event waiting state in the information processing system (step S62).

この後、ＯＳ状態監視スレッド実行部は、イベントが発生すると、発生したイベントがホストＯＳ１１の終了というイベントであるか否かを調べる（ステップＳ６３）。発生したイベントがホストＯＳ１１の終了というイベントでない場合には（ステップＳ６３、Ｎｏ）、ＯＳ状態監視スレッド実行部は、ステップＳ６２を繰り返す。 Thereafter, when an event occurs, the OS state monitoring thread execution unit checks whether the generated event is an event of termination of the host OS 11 (step S63). When the event that has occurred is not an event of termination of the host OS 11 (No in step S63), the OS state monitoring thread execution unit repeats step S62.

発生したイベントがホストＯＳ１１の終了というイベントである場合には（ステップＳ６３、Ｙｅｓ）、ＯＳ状態監視スレッド実行部は、クラッシュイベント待ちスレッドによるクラッシュイベントの受信を停止し（ステップＳ６４）、ホストドライバ１１３にクラッシュ監視の停止を指示し（ステップＳ６５）、仮想マシン一覧の仮想マシン／プロセス一覧１１８を破棄して（ステップＳ６６）、処理を終了する。 If the event that has occurred is an event that the host OS 11 has ended (step S63, Yes), the OS state monitoring thread execution unit stops receiving the crash event by the crash event waiting thread (step S64), and the host driver 113 Is instructed to stop crash monitoring (step S65), the virtual machine / process list 118 in the virtual machine list is discarded (step S66), and the process is terminated.

一方、ダンプ採取サービス部１１２、換言すれば、クラッシュイベント待ちスレッド実行部は、情報処理システムのイベント待ち状態となる（ステップＳ６７）。 On the other hand, the dump collection service unit 112, in other words, the crash event waiting thread execution unit enters an event waiting state of the information processing system (step S67).

クラッシュイベント待ちスレッド実行部は、ホストドライバ１１３からのクラッシュの発生通知を受信すると、クラッシュの発生通知に基づいて、クラッシュが発生したゲストＯＳ１２が動作する仮想マシン１２４を特定する。クラッシュの発生通知に基づくのは、仮想マシン１２４及びゲストＯＳ１２は、その仮想マシン１２４がどのプロセスとして実行されているか知ることができないためである。そして、割り込み待ちスレッド実行部は、仮想マシン１２４を特定すると、特定した仮想マシン１２４に対応して、ダンプ採取スレッドを起動する（ステップＳ６８）。ダンプ採取スレッドは、仮想マシン１２４毎に起動される。従って、複数のダンプ採取スレッドが起動される場合がある。ダンプ採取スレッドは、ダンプ採取スレッド実行部により実行される。メモリダンプ処理はある程度時間のかかる処理であるため、ダンプ採取サービス部１１２の処理と、クラッシュイベント待ちスレッドの処理とは、非同期に実行される。 When the crash event waiting thread execution unit receives a crash occurrence notification from the host driver 113, the crash event waiting thread execution unit identifies the virtual machine 124 in which the guest OS 12 in which the crash occurred operates based on the crash occurrence notification. The reason for the occurrence of the crash is that the virtual machine 124 and the guest OS 12 cannot know as which process the virtual machine 124 is executed. Then, when specifying the virtual machine 124, the interrupt waiting thread execution unit activates a dump collection thread corresponding to the specified virtual machine 124 (step S68). The dump collection thread is activated for each virtual machine 124. Therefore, a plurality of dump collection threads may be activated. The dump collection thread is executed by the dump collection thread execution unit. Since the memory dump process takes a certain amount of time, the process of the dump collection service unit 112 and the process of the crash event waiting thread are executed asynchronously.

クラッシュイベント待ちスレッド実行部は、ダンプ採取スレッドを起動した後、ステップＳ６７のイベント待ち状態に戻る。 After starting the dump collection thread, the crash event waiting thread execution unit returns to the event waiting state in step S67.

起動されたダンプ採取スレッド実行部は、クラッシュが発生したゲストＯＳ１２が使用するメモリ１２３の内容を、一時ファイルとして保存状態格納領域１１４に保存する（ステップＳ６９）。そして、ダンプ採取スレッド実行部は、ダンプ変換処理を実行するダンプ変換スレッドを起動する（ステップＳ６１０）。 The activated dump collection thread execution unit stores the contents of the memory 123 used by the guest OS 12 in which the crash has occurred in the storage state storage area 114 as a temporary file (step S69). Then, the dump collection thread execution unit activates a dump conversion thread that executes dump conversion processing (step S610).

この後、ダンプ採取スレッド実行部は、仮想マシン管理サービス部１１６に、仮想マシン１２４の停止と再起動を依頼する（ステップＳ６１１）。 Thereafter, the dump collection thread execution unit requests the virtual machine management service unit 116 to stop and restart the virtual machine 124 (step S611).

一方、起動されたダンプ変換スレッド実行部は、保存状態格納領域１１４に保存されたメモリ１２３の内容の一時ファイルを、ダンプファイルの形式に変換して、ダンプ格納領域１１５に格納する（ステップＳ６１２）。 On the other hand, the activated dump conversion thread execution unit converts the temporary file having the contents of the memory 123 stored in the storage state storage area 114 into a dump file format and stores it in the dump storage area 115 (step S612). .

以上、本発明を実施例に基づいて説明したが、本発明は、その主旨の範囲内において種々の変形が可能である。 As mentioned above, although this invention was demonstrated based on the Example, this invention can be variously deformed within the range of the main point.

例えば、ゲストＯＳ１２からホストＯＳ１１へのクラッシュの通知に、仮想バス２１以外を用いるようにしても良い。この場合、クラッシュの通知が、ハードウェア割り込みと同等の優先度で捕捉されるようにすればよい。また、クラッシュの通知において、仮想化ＩＤ以外の識別情報を用いるようにしても良い。 For example, a notification other than the virtual bus 21 may be used for notification of a crash from the guest OS 12 to the host OS 11. In this case, the crash notification may be captured with the same priority as the hardware interrupt. Further, identification information other than the virtualization ID may be used in the crash notification.

また、ゲストＯＳ１２のクラッシュのみでなく、ホストＯＳ１１のクラッシュを検出して、同様の手段により、仮想マシン制御部であるホストＯＳ１１が使用するメモリのダンプファイルを取得するようにしてもよい。 Further, not only the crash of the guest OS 12 but also the crash of the host OS 11 may be detected, and a dump file of the memory used by the host OS 11 as the virtual machine control unit may be acquired by the same means.

また、ゲストＯＳ１２が動作する仮想マシン１２４が、仮想デバイス１２２以外の手段により、プロセス識別情報を保持するようにしてもよい。 Further, the virtual machine 124 on which the guest OS 12 operates may hold the process identification information by means other than the virtual device 122.

また、ホストドライバ１１３とダンプ採取サービス部１１２とを一体に形成するようにしてもよい。この場合、仮想化ＩＤ／プロセス一覧１１７と仮想マシン／プロセス一覧１１８とを一体に形成するようにしてもよい。 Further, the host driver 113 and the dump collection service unit 112 may be integrally formed. In this case, the virtualization ID / process list 117 and the virtual machine / process list 118 may be integrally formed.

また、ホストＯＳ１１が、仮想化ＩＤ／プロセス一覧１１７と仮想マシン／プロセス一覧１１８と備えるようにしてもよい。また、ホストＯＳ１１が、仮想化ＩＤ／プロセス一覧１１７と仮想マシン／プロセス一覧１１８とを一体に形成した一覧を備え、ホストドライバ１１３及びダンプ採取サービス部１１２からの要求に応じて必要な情報を提供するようにしてもよい。 Further, the host OS 11 may include a virtualization ID / process list 117 and a virtual machine / process list 118. The host OS 11 includes a list in which the virtualization ID / process list 117 and the virtual machine / process list 118 are integrally formed, and provides necessary information in response to requests from the host driver 113 and the dump collection service unit 112. You may make it do.

１仮想マシン
２ハイパーバイザ
３ハードウェア
１１ホストＯＳ
１２ゲストＯＳ
２１仮想バス
１１１ダンプ採取ツール
１１２ダンプ採取サービス部
１１３ホストドライバ
１１４保存状態格納領域
１１５ダンプ格納領域
１１６仮想マシン管理サービス部
１１７仮想化ＩＤ／プロセス一覧
１１８仮想マシン／プロセス一覧
１２１ゲストドライバ
１２２仮想デバイス
１２３メモリ
１２４仮想マシン
１２５仮想ＣＰＵ 1 Virtual machine 2 Hypervisor 3 Hardware 11 Host OS
12 Guest OS
21 Virtual Bus 111 Dump Collection Tool 112 Dump Collection Service Unit 113 Host Driver 114 Storage State Storage Area 115 Dump Storage Area 116 Virtual Machine Management Service Section 117 Virtualization ID / Process List 118 Virtual Machine / Process List 121 Guest Driver 122 Virtual Device 123 Memory 124 Virtual machine 125 Virtual CPU

Claims

An information processing apparatus that operates a plurality of virtual machines,
A plurality of virtual bus identification information for identifying a plurality of virtual buses connecting the virtual machine control unit for controlling the plurality of virtual machines and the plurality of virtual machines, and the virtual machine control corresponding to the plurality of virtual machines, respectively. First storage means for storing first correspondence information that is a correspondence relationship with a plurality of process identification information for identifying a process of a part;
Second storage means for storing second correspondence information which is a correspondence relationship between the plurality of process identification information and the plurality of virtual machine identification information for identifying the plurality of virtual machines;
There was an interrupt corresponding to the failure of the first OS on the first virtual bus corresponding to the first virtual machine from the first OS operating on the first virtual machine included in the plurality of virtual machines. A first virtual machine is identified based on the first correspondence information and the second correspondence information, and the contents of the memory area used by the first OS operating on the first virtual machine are stored. An information processing apparatus comprising: storage means.

The plurality of virtual buses are included in a hypervisor that configures the plurality of virtual machines by virtualizing a physical device of the information processing apparatus, and are assigned to each of the plurality of virtual machines by the hypervisor, The information processing apparatus according to claim 1, wherein bus identification information is given.

The storage means is notified of the occurrence of a failure by causing an interrupt to a virtual bus assigned to the first virtual machine when a failure occurs in the first guest OS from the first virtual machine. The process identification of the process of the virtual machine control unit corresponding to the virtual machine on which the first guest OS operates from the first storage unit based on the virtualization identification information of the virtual bus in which the interrupt occurred A virtual machine having the machine identification information obtained by obtaining the machine identification information of the virtual machine on which the first guest OS operates from the second storage unit based on the obtained process identification information; The content of a memory area used by the operating first OS is stored in a storage device different from the memory area. The information processing apparatus described.

The storage means includes a host driver provided in a second OS that is the virtual machine control unit, and a dump collection service unit that is a service program of the second OS,
When the host driver is notified of the occurrence of a failure from the first virtual machine, the first OS stores the first OS from the first storage unit based on virtual bus identification information of the virtual bus in which the interrupt has occurred. Obtaining the process identification information of the process of the virtual machine control unit corresponding to the operating virtual machine;
The dump collection service unit obtains the machine identification information of the virtual machine on which the first OS operates from the second storage unit based on the obtained process identification information, and includes the obtained machine identification information. The information processing apparatus according to claim 3, wherein contents of the memory area of the machine are stored in the storage device.

The information processing apparatus includes:
A virtual device for storing process identification information for identifying a plurality of processes of the virtual machine control unit corresponding to the plurality of virtual machines;
Each of the plurality of virtual machines sends the process identification information stored in the virtual device to the virtual machine control unit,
The storage unit uses the process identification information received from each of the plurality of virtual machines, and stores the correspondence between the plurality of virtual bus identification information and the plurality of process identification information in the first storage unit. The first correspondence information is stored, and the second correspondence information that is a correspondence relationship between the plurality of process identification information and the plurality of machine identification information is stored in the second storage unit. The information processing apparatus according to claim 1.

The storage means sends the process identification information for each of the processes of the virtual machine control unit corresponding to each of the plurality of virtual machines to each of the plurality of virtual machines,
The information processing apparatus according to claim 5, wherein each of the plurality of virtual machines stores the process identification information received from the virtual machine control unit in the virtual device.

A method for controlling an information processing apparatus in which a plurality of virtual machines operate,
A plurality of virtual bus identification information for identifying a plurality of virtual buses connecting the virtual machine control unit and each of the plurality of virtual machines; Storing first correspondence information, which is a correspondence relationship with a plurality of process identification information for identifying processes of the corresponding virtual machine control units, in a first storage unit;
The virtual machine control unit stores second correspondence information, which is a correspondence relationship between the plurality of process identification information and the plurality of virtual machine identification information for identifying the plurality of virtual machines, in a second storage unit;
The virtual machine control unit causes a failure of the first OS to a first virtual bus corresponding to the first virtual machine from a first OS operating on a first virtual machine included in the plurality of virtual machines. Is used, the first OS that operates on the first virtual machine is used by specifying the first virtual machine based on the first correspondence information and the second correspondence information. A method for controlling an information processing apparatus, characterized by storing the contents of a memory area.

A control program for an information processing apparatus that operates a plurality of virtual machines,
The control program is stored in a computer.
A plurality of virtual bus identification information for identifying a plurality of virtual buses connecting the virtual machine control unit and each of the plurality of virtual machines; A process of storing first correspondence information, which is a correspondence relationship with a plurality of process identification information for identifying processes of the corresponding virtual machine control units, in a first storage unit;
Processing in which the virtual machine control unit stores second correspondence information, which is a correspondence relationship between the plurality of process identification information and the plurality of virtual machine identification information for identifying the plurality of virtual machines, in a second storage unit. When,
The virtual machine control unit causes a failure of the first OS to a first virtual bus corresponding to the first virtual machine from a first OS operating on a first virtual machine included in the plurality of virtual machines. Is used, the first OS that operates on the first virtual machine is used by specifying the first virtual machine based on the first correspondence information and the second correspondence information. A control program for executing a process for saving the contents of a memory area.

A computer-readable recording medium for recording a control program of an information processing apparatus on which a plurality of virtual machines operate,
The control program is stored in a computer.
A plurality of virtual bus identification information for identifying a plurality of virtual buses connecting the virtual machine control unit and each of the plurality of virtual machines; A process of storing first correspondence information, which is a correspondence relationship with a plurality of process identification information for identifying processes of the corresponding virtual machine control units, in a first storage unit;
Processing in which the virtual machine control unit stores second correspondence information, which is a correspondence relationship between the plurality of process identification information and the plurality of virtual machine identification information for identifying the plurality of virtual machines, in a second storage unit. When,
The virtual machine control unit causes a failure of the first OS to a first virtual bus corresponding to the first virtual machine from a first OS operating on a first virtual machine included in the plurality of virtual machines. Is used, the first OS that operates on the first virtual machine is used by specifying the first virtual machine based on the first correspondence information and the second correspondence information. A recording medium that executes processing for saving the contents of a memory area.