JP6152786B2

JP6152786B2 - Communication control apparatus, information processing apparatus, parallel computer system, control program, and parallel computer system control method

Info

Publication number: JP6152786B2
Application number: JP2013248579A
Authority: JP
Inventors: 英樹三輪; 郁夫三吉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-11-29
Filing date: 2013-11-29
Publication date: 2017-06-28
Anticipated expiration: 2033-11-29
Also published as: EP2879054A2; EP2879054A3; US20150154058A1; US9465675B2; JP2015106311A

Description

The present invention relates to a communication control device, an information processing device, a parallel computer system, a control program, and a control method for a parallel computer system.

When a parallel application program runs on a parallel computer system, typically using the Message Passing Interface (MPI), each of a plurality of processes proceeds in parallel while alternating between arithmetic processing and inter-process communication processing. Inter-process communication is performed between processes within a node of the parallel computer system as well as between processes on different nodes. Because the arithmetic processing time may differ between processes, the start time of communication processing may also differ between processes.

For example, as shown in FIG. 1, when processes P0, P1, and P2 execute a parallel application program, processes P0 and P1, which complete their arithmetic processing early, each attempt communication processing with process P2. However, because process P2 is still performing arithmetic processing and cannot start communication processing, processes P0 and P1 each wait for process P2 to complete its arithmetic processing. During this time, processes P0 and P1 perform neither arithmetic processing nor communication processing, so the utilization efficiency of the parallel computer system drops and the scaling benefit of adding processors deteriorates.

One conceivable way to solve this problem is for the developer of the parallel application program to rewrite code, tune parameters, and so on to equalize the arithmetic processing time of each process, thereby aligning the start times of communication processing across processes.

As a way of checking whether the start times of communication processing are aligned across processes, a method is known that examines the magnitude of a value called the synchronization waiting time. The synchronization waiting time is obtained, for example, as follows.

1. The start time is acquired for each communication process of each process. This start time can be acquired, for example, as the elapsed time from the point at which each process began executing.
2. For each communication process, the maximum of the start times of the plurality of processes is obtained.
3. The difference between the maximum value and the start time of the communication process in each process is obtained, the differences for the plurality of communication processes are accumulated for each process, and the total is recorded as the synchronization waiting time.

The synchronization waiting time of each process is 0 when the start times of all communication processes coincide across processes, and it approaches the elapsed time required to execute the parallel application program as the difference in start times between processes grows. Therefore, the closer the synchronization waiting time is to 0, the more desirable the state.

In the first communication process of FIG. 1, the start times of processes P0, P1, and P2 are 20, 10, and 30, respectively, and the maximum start time is 30, indicated by arrow 101. The differences between the maximum value 30 and the start times of processes P0, P1, and P2 are 10, 20, and 0, respectively.

In the second communication process, the start times of processes P0, P1, and P2 are 60, 70, and 50, respectively, and the maximum start time is 70, indicated by arrow 102. The differences between the maximum value 70 and the start times of processes P0, P1, and P2 are 10, 0, and 20, respectively.

Therefore, accumulating the differences for the first and second communication processes gives synchronization waiting times of 20, 20, and 20 for processes P0, P1, and P2, respectively. In this case, the elapsed time required to execute the parallel application program is 80, of which 20 can be interpreted as time wasted waiting for other processes to complete their arithmetic processing.
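The three-step procedure and the FIG. 1 figures can be reproduced with a short calculation. The following is a minimal sketch, not part of the patent; the function name and the nested-list layout of the input are assumptions chosen for illustration.

```python
from typing import List


def synchronization_waiting_times(start_times: List[List[int]]) -> List[int]:
    """Accumulate per-process synchronization waiting times.

    start_times[k][p] is the start time of the k-th communication
    process in process p (hypothetical layout for this sketch).
    """
    num_processes = len(start_times[0])
    waits = [0] * num_processes
    for per_communication in start_times:
        latest = max(per_communication)        # step 2: maximum start time
        for p, start in enumerate(per_communication):
            waits[p] += latest - start         # step 3: accumulate the difference
    return waits


# Start times of P0, P1, P2 for the two communication processes in FIG. 1.
fig1_start_times = [[20, 10, 30], [60, 70, 50]]
print(synchronization_waiting_times(fig1_start_times))  # [20, 20, 20]
```

Each process ends up with a waiting time of 20 out of an elapsed time of 80, matching the values stated above.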

The parallel application performance profiling tool from Cray Inc. of the United States hooks the MPI collective communication functions in order to obtain the synchronization waiting time, and automatically calls the inter-process synchronization interface (the MPI_Barrier function) before communication processing starts. The tool then obtains, for each process, the total elapsed time spent in the MPI_Barrier function.

A reduction operation device is also known that performs reduction operations, such as obtaining the sum, maximum value, or minimum value, on data held by a plurality of processes (see, for example, Patent Document 1).

JP 2010-122848 A

In the following description, a parallel application program may be referred to simply as a parallel application.

Conventional parallel computer systems have the following problems.
When an inter-process synchronization interface is called to obtain the synchronization waiting time, as the parallel application performance profiling tool does, synchronization processing is added to the processing of each process, which can greatly change the behavior of the parallel application. For this reason, it is desirable to obtain the maximum start time for each communication process without performing synchronization processing.

To obtain the maximum start time for each communication process without synchronization processing (that is, asynchronously), one conceivable method is for each process to acquire the start time of each communication process and write it to the main storage area, and then to calculate the maximum value either during or after execution of the parallel application.

When the maximum value is calculated after the parallel application has finished, the start times of all communication processes must be recorded, so a long-running parallel application may put pressure on the main storage area available to it. In such cases, it is desirable to calculate the maximum value while the parallel application is executing.

When the maximum value is calculated while the parallel application is executing, there is no need to record the start times of all communication processes; it is enough to record the accumulated differences for past communication processes as the synchronization waiting time. Therefore, even a long-running parallel application does not put pressure on the main storage area. In this case, one conceivable method is to send and receive the start time of each process's communication process during arithmetic processing, using a software or hardware asynchronous communication interface.
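The constant-memory bookkeeping described here can be sketched as a running total kept by each process. This is an illustrative sketch only; the class name `SyncWaitAccumulator` and its method are hypothetical, and how the maximum start time reaches the process is left out.

```python
class SyncWaitAccumulator:
    """Keeps only the accumulated difference, not the full start-time history."""

    def __init__(self) -> None:
        self.total = 0  # synchronization waiting time accumulated so far

    def add(self, own_start: int, max_start: int) -> int:
        """Fold in one communication process and return the new total."""
        self.total += max_start - own_start
        return self.total


# Process P0 in FIG. 1: own start times 20 and 60, maxima 30 and 70.
p0 = SyncWaitAccumulator()
p0.add(own_start=20, max_start=30)
p0.add(own_start=60, max_start=70)
print(p0.total)  # 20
```

The storage needed is a single integer per process regardless of how many communication processes occur.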

However, in the method using a software asynchronous communication interface, the maximum value is calculated in software by the Central Processing Unit (CPU), so the CPU time consumed may change the behavior of the parallel application.

In the method using a hardware asynchronous communication interface, each process can transfer its start time data to a network reduction mechanism implemented in the network interface and instruct it to perform a maximum value operation.

Here, if the arithmetic processing time between one communication process and the next is very short, the maximum value operation by the network reduction mechanism may, in rare cases, not complete in time. If the network reduction mechanism can execute only a single maximum value operation at a time, each process cannot instruct a subsequent maximum value operation until the preceding one completes, and the behavior of the parallel application changes.

In one aspect, an object of the present invention is to perform an operation using the start times of inter-process communication during execution of a parallel application, without changing the behavior of the parallel application.

In one proposal, a communication control device connected to an arithmetic processing device and a main storage device includes an arithmetic unit.

First sequence information is sequence information that the arithmetic processing device assigns to a first start time, at which a first process among a plurality of processes included in a program executed by the arithmetic processing device starts first inter-process communication, and writes to the main storage device. When second sequence information, assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, is newer than the first sequence information, the arithmetic unit does not perform an operation using the first start time.

When the second sequence information corresponds to the first sequence information, the arithmetic unit performs an operation using the first start time and the second start time and outputs the operation result.

According to the embodiment, an operation using the start times of inter-process communication can be performed during execution of a parallel application without changing the behavior of the parallel application.

FIG. 1 is a diagram showing inter-process communication processing in a parallel computer system.
FIG. 2 is a diagram showing inter-process communication processing when an inter-process synchronization interface is used.
FIG. 3 is a diagram showing a network reduction mechanism.
FIG. 4 is a configuration diagram of a first communication control device.
FIG. 5 is a configuration diagram of a parallel computer system.
FIG. 6 is a configuration diagram of a node.
FIG. 7 is a configuration diagram of a second communication control device.
FIG. 8 is a flowchart of arithmetic processing using the start times of inter-process communication.
FIG. 9 is a flowchart showing a first specific example of the arithmetic processing.
FIG. 10 is a flowchart showing a second specific example of the arithmetic processing.

Hereinafter, embodiments will be described in detail with reference to the drawings.
When an inter-process synchronization interface is called to obtain the synchronization waiting time, as the parallel application performance profiling tool does, synchronization processing is added to the processing of each process, which can greatly change the behavior of the parallel application.

FIG. 2 shows an example in which the parallel application performance profiling tool calls the inter-process synchronization interface in the parallel application of FIG. 1. In this case, synchronization processing for obtaining the synchronization waiting time is performed after the communication process of each process completes, so the elapsed time required to execute the parallel application increases by 25%, from 80 to 100. Although it depends on the parallel application, cases in which the elapsed time increases by 10% or more, as in FIG. 2, are not uncommon. For this reason, it is desirable to obtain the maximum start time for each communication process without performing synchronization processing.

First, when the maximum value is calculated after the parallel application has finished, a long-running parallel application may put pressure on the main storage area available to it. For example, assume that communication processing occurs once every 1 ms in a parallel application in which 100,000 processes run for one day. If 8 bytes of start time data are recorded for each communication process, then just before the parallel application ends, about 691 MB of main storage per process is filled with start time data.

In a parallel computer system with 32 GB of main memory per node, if 16 processes run in parallel within each node, about 11 GB of main storage is filled with start time data, which may hinder execution of the parallel application. In such cases, it is desirable to calculate the maximum value while the parallel application is executing.
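The storage estimates above follow directly from the stated assumptions (8 bytes per record, one communication process per millisecond, a one-day run, 16 processes per node). The arithmetic can be checked as follows; the constant names are illustrative only.

```python
BYTES_PER_START_TIME = 8            # size of one recorded start time
COMMUNICATIONS_PER_SECOND = 1_000   # one communication process per 1 ms
SECONDS_PER_DAY = 24 * 60 * 60      # the application runs for one day
PROCESSES_PER_NODE = 16

bytes_per_process = (
    BYTES_PER_START_TIME * COMMUNICATIONS_PER_SECOND * SECONDS_PER_DAY
)
bytes_per_node = bytes_per_process * PROCESSES_PER_NODE

print(bytes_per_process)  # 691200000, i.e. about 691 MB per process
print(bytes_per_node)     # 11059200000, i.e. about 11 GB per node
```

With 32 GB of main memory per node, the recorded start times alone would occupy roughly a third of the node's memory.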

In the method using a software asynchronous communication interface, start time data can be sent and received between processes through the MPI non-blocking communication interface (the MPI_Isend function, MPI_Irecv function, MPI_Wait function, and so on). However, because the maximum value is calculated in software by the Central Processing Unit (CPU), the CPU time consumed may change the behavior of the parallel application.

In the method using a hardware asynchronous communication interface, each process can transfer its start time data to a network reduction mechanism implemented in the network interface and instruct it to perform a maximum value operation. A network reduction mechanism is a hardware mechanism that receives data from each process on each node connected by a communication network, performs a single operation over all of the data, and then holds the operation result in each node. Examples of operations performed by the network reduction mechanism include obtaining the sum, maximum value, or minimum value of the data.

For example, when a communication process completes, each process can transfer the start time of that communication process to the network reduction mechanism and instruct it to perform the maximum value operation, and then read the maximum value from the network reduction mechanism when the next communication process starts. This allows the network reduction mechanism to calculate the maximum value while each process is performing arithmetic processing.

FIG. 3 shows an example in which the network reduction mechanism 301 calculates the maximum value in the parallel application of FIG. 1. In this case, the network reduction mechanism 301 calculates the maximum value 30 of the start times of the first communication process during the second arithmetic processing. Then, at the start of the second communication process, processes P0, P1, and P2 obtain the differences 10, 20, and 0, respectively, between the maximum value 30 and the start times 20, 10, and 30 of the first communication process.

Several methods are conceivable for preventing the behavior of the parallel application from changing when a network reduction mechanism is used. One example is for the network reduction mechanism to skip instructions for subsequent maximum value operations until the preceding maximum value operation completes. However, because an operation by the network reduction mechanism does not necessarily complete at the same time in every process, the number of skipped instructions can differ from process to process. If a subsequent maximum value operation is performed in this situation, the maximum over the start times of several different communication processes is calculated instead of the maximum start time of a single communication process.
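The mismatch caused by unequal skip counts can be illustrated with a toy model. The sketch below is not the patent's mechanism: the start times for the third communication process, the choice of which instruction is skipped, and all names are hypothetical, and the reduction is modeled simply as pairing the n-th accepted value from each process.

```python
from typing import List, Set, Tuple


def accepted(start_times: List[int], skipped: Set[int]) -> List[Tuple[int, int]]:
    """Return (communication index, start time) pairs that were not skipped."""
    return [
        (k, t) for k, t in enumerate(start_times, start=1) if k not in skipped
    ]


# Hypothetical data: P0's instruction for communication 2 is skipped because
# its previous operation is still running; P1 skips nothing.
p0 = accepted([20, 60, 90], skipped={2})
p1 = accepted([10, 70, 95], skipped=set())

# A naive reduction pairs the n-th accepted value from each process.
mixed = [(a[0], b[0], max(a[1], b[1])) for a, b in zip(p0, p1)]
print(mixed)  # [(1, 1, 20), (3, 2, 90)]
```

The second result combines communication 3 of P0 with communication 2 of P1, so the value 90 is not the maximum start time of any single communication process.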

Another method is to implement a plurality of network reduction mechanisms in the network interface and use them in turn. However, this greatly increases the amount of hardware. Because failure of the maximum value operation to complete is a rare problem, incurring a large hardware cost to solve it is not realistic.

Note that this problem arises not only when calculating the maximum of the times at which inter-process communication starts in a parallel application, but also when performing other operations that use those times.

FIG. 4 shows a configuration example of the communication control device of the embodiment. The communication control device 401 in FIG. 4 is provided in an information processing device (computer) corresponding to each node of the parallel computer system, and includes an arithmetic unit 411.

The arithmetic processing device in the information processing device executes a program and assigns first sequence information to a first start time, at which a first process among a plurality of processes included in the program starts first inter-process communication. It then writes the first start time and the first sequence information to the main storage device in the information processing device.

When second sequence information, assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, is newer than the first sequence information, the arithmetic unit 411 in the communication control device 401 does not perform an operation using the first start time. On the other hand, when the second sequence information corresponds to the first sequence information, the arithmetic unit 411 performs an operation using the first start time and the second start time and outputs the operation result.

Sequence information indicates the order of the communication processes performed by each process, and the same sequence information is assigned to communication processes of the same order across the plurality of processes. For example, the sequence information assigned to the first communication process performed by the second process is the same as the sequence information assigned to the first communication process performed by the first process. Likewise, the sequence information assigned to the second communication process performed by the second process is the same as the sequence information assigned to the second communication process performed by the first process.

By comparing the sequence information assigned to the respective start times of the first and second processes, it is possible to determine whether the two start times are the start times of communication processes of the same order.
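One simple way to picture sequence information is as a per-process counter that increases with each communication process. The sketch below assumes a 1-based integer counter; the patent text here does not fix the encoding, and the function names are hypothetical.

```python
from typing import List, Tuple

Tagged = Tuple[int, int]  # (sequence information, start time)


def tag_with_sequence(start_times: List[int]) -> List[Tagged]:
    """Assign sequence information 1, 2, 3, ... in communication order."""
    return list(enumerate(start_times, start=1))


def same_communication(a: Tagged, b: Tagged) -> bool:
    """Two start times belong to the same-order communication process
    exactly when their sequence information matches."""
    return a[0] == b[0]


p0 = tag_with_sequence([20, 60])  # start times of P0 in FIG. 1
p1 = tag_with_sequence([10, 70])  # start times of P1 in FIG. 1

print(same_communication(p0[0], p1[0]))  # True: both are the first communication
print(same_communication(p0[0], p1[1]))  # False: first versus second
```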

When the second sequence information assigned to the second start time is newer than the first sequence information assigned to the first start time, the arithmetic unit 411 does not perform an operation using the first start time. The first process can therefore instruct the arithmetic unit 411 to perform operations using its latest start time one after another, without waiting for an operation using an old start time to complete. Consequently, an operation using the start times of inter-process communication can be performed during execution of the parallel application without changing the behavior of the parallel application.
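The decision rule applied by the arithmetic unit can be sketched as a small function. This is an illustration under stated assumptions, not the patented implementation: sequence information is modeled as an increasing integer ("newer" means larger), the operation is taken to be the maximum, and the names `StartRecord` and `combine` are hypothetical. The case in which the second sequence information is older is not covered by the passage above, so the sketch simply produces no result for it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class StartRecord:
    sequence: int    # sequence information (assumed: increasing integer)
    start_time: int  # start time of the inter-process communication


def combine(first: StartRecord, second: StartRecord) -> Optional[int]:
    """Return the operation result, or None when no operation is performed."""
    if second.sequence > first.sequence:
        # The first start time is stale; no operation uses it.
        return None
    if second.sequence == first.sequence:
        # Same-order communication: operate on both start times.
        return max(first.start_time, second.start_time)
    # Second is older: not specified in this passage; assumed to yield nothing.
    return None


print(combine(StartRecord(1, 20), StartRecord(1, 10)))  # 20
print(combine(StartRecord(1, 20), StartRecord(2, 70)))  # None
```

Because a stale start time is dropped rather than waited on, the submitting process never blocks on an unfinished earlier operation.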

FIG. 5 shows a configuration example of the parallel computer system of the embodiment. The parallel computer system 500 in FIG. 5 includes n computation nodes, nodes 501-1 to 501-n (n is an integer of 1 or more), and a disk node 502. The nodes 501-1 to 501-n and the disk node 502 are connected to one another by a communication network 503.

FIG. 6 shows a configuration example of an information processing device corresponding to a node 501-i (i = 1 to n) in FIG. 5. The node 501-i in FIG. 6 includes a CPU 601, a memory 602, a medium driving device 603, and the communication control device 401. The CPU 601, the memory 602, the medium driving device 603, and the communication control device 401 are connected to one another by a bus 604.

The memory 602 corresponds to the main storage device. The memory 602 is, for example, a semiconductor memory such as a Read Only Memory (ROM) or a Random Access Memory (RAM), and stores the parallel application program and the data used in its processing.

The CPU 601 corresponds to the arithmetic processing device (processor) and can include a memory controller and a network interface controller. The CPU 601 executes the parallel application program using, for example, the memory 602. The CPU 601 can also execute programs such as an operating system (OS) and a network interface driver.

The medium driving device 603 drives a portable recording medium 605 and accesses its recorded contents. The portable recording medium 605 is a memory device, a flexible disk, an optical disk, a magneto-optical disk, or the like. The portable recording medium 605 may also be a Compact Disk Read Only Memory (CD-ROM), a Digital Versatile Disk (DVD), a Universal Serial Bus (USB) memory, or the like. A user or operator can store the parallel application program and data on the portable recording medium 605 and load them into the memory 602 for use.

The communication control device 401 is a network interface that is connected to the communication network 503 in FIG. 5 and communicates with other nodes. The node 501-i can receive the parallel application program and data from a device outside the parallel computer system via the communication control device 401 and load them into the memory 602 for use.

Note that the node 501-i need not include all of the components in FIG. 6, and some components can be omitted depending on the application or conditions. For example, if the portable recording medium 605 is not used, the medium driving device 603 may be omitted.

The disk node 502 in FIG. 5 has, for example, a configuration in which a disk device is added to the configuration of the node 501-i in FIG. 6. The disk device is, for example, a magnetic disk device, an optical disk device, or a magneto-optical disk device, and may be a hard disk drive. The disk device can store data used in the processing of the parallel application program.

The nodes 501-1 to 501-n execute the parallel application using, for example, data stored in the disk node 502. At this time, one or more processes are created in each node, and the plurality of processes in the nodes 501-1 to 501-n are parallelized by MPI and repeatedly perform collective communication involving all of the processes.

Note that if a disk device is provided in each of the nodes 501-1 to 501-n, the disk node 502 may be omitted.

FIG. 7 shows a configuration example of the arithmetic unit 411 in the communication control device 401 of FIG. 6. The arithmetic unit 411 in FIG. 7 includes a CPU 701, a memory 702, an interface 703, and an interface 704. The CPU 701, the memory 702, the interface 703, and the interface 704 are connected to one another by a bus 705.

The CPU 701 corresponds to an arithmetic processing device (processor), and the memory 702 corresponds to a main storage device. The memory 702 is, for example, a semiconductor memory such as a ROM or a RAM, and stores a control program and data. The data stored in the memory 702 includes data transferred from the CPU 601 in FIG. 6, data received from other nodes, and so on. The CPU 701 performs the operations of the network reduction mechanism by, for example, executing the control program using the memory 702, and writes the operation results to the memory 702.

A user or operator can store the control program and data in the portable recording medium 605 of FIG. 6 and load them into the memory 702 for use. Thus, the computer-readable recording medium that stores the control program and data is a physical (non-transitory) recording medium such as the memory 702 or the portable recording medium 605.

The interface 703 is connected to the bus 604 in FIG. 6 and communicates with the CPU 601, and the interface 704 is connected to the communication network 503 in FIG. 5 and communicates with other nodes. Exchanging partial operation results of the network reduction mechanism between nodes via the interface 704 makes the operation more efficient. The arithmetic unit 411 can also receive the control program and data from a device outside the parallel computer system via the interface 703 or the interface 704 and load them into the memory 702 for use.

Note that the configuration of the arithmetic unit 411 is not limited to that of FIG. 7, and part or all of the processing may be implemented in hard-wired logic.

FIG. 8 is a flowchart showing an example of arithmetic processing, performed in the node 501-1, that uses the start times of inter-process communication. In this example, the first process is a process generated in the node 501-1 among the plurality of processes of the parallel application. The second process is a process generated either in the node 501-1 or in any of the nodes 501-2 to 501-n among those processes.

The CPU 601 in the node 501-1 assigns first sequence information to a first start time at which the first process started first inter-process communication. The CPU 601 then writes the first start time and the first sequence information to the memory 602 in the node 501-1 and transfers them to the memory 702 in the arithmetic unit 411. The communication control device 401 in the node 501-1 transmits the first start time and the first sequence information to the nodes 501-2 to 501-n.

When the second process is generated in the node 501-1, the CPU 601 in the node 501-1 assigns second sequence information to a second start time at which the second process started second inter-process communication. The CPU 601 then writes the second start time and the second sequence information to the memory 602 in the node 501-1 and transfers them to the memory 702 in the arithmetic unit 411. The communication control device 401 in the node 501-1 transmits the second start time and the second sequence information to the nodes 501-2 to 501-n.

When the second process is generated in the node 501-2, the CPU 601 in the node 501-2 assigns second sequence information to a second start time at which the second process started second inter-process communication. The CPU 601 then writes the second start time and the second sequence information to the memory 602 in the node 501-2 and transfers them to the memory 702 in the arithmetic unit 411. The communication control device 401 in the node 501-2 transmits the second start time and the second sequence information to the node 501-1 and the nodes 501-3 to 501-n.

Likewise, the CPU 601 of each of the nodes 501-3 to 501-n assigns sequence information to the start time at which each process started inter-process communication. The CPU 601 then writes the start time and the sequence information to the memory 602 and transfers them to the memory 702 in the arithmetic unit 411. The communication control devices 401 of the nodes 501-3 to 501-n transmit the start time and the sequence information to the other nodes.

In the node 501-1, the CPU 701 in the arithmetic unit 411 checks whether the second sequence information assigned to the second start time is newer than the first sequence information (step 801).

If the second sequence information is newer than the first sequence information (step 801, YES), the CPU 701 does not perform an operation using the first start time. If the check result in step 801 is NO, the CPU 701 checks whether the second sequence information corresponds to the first sequence information (step 802).

If the second sequence information corresponds to the first sequence information (step 802, YES), the CPU 701 performs an operation using at least the first start time and the second start time and writes the operation result to the memory 702 (step 803).

If the check result in step 802 is NO, the CPU 701 does not perform an operation using the second start time, and instead waits until it receives a third start time, at which the second process started third inter-process communication, together with third sequence information (step 804). If the third sequence information corresponds to the first sequence information, the CPU 701 performs an operation using at least the first start time and the third start time and writes the operation result to the memory 702.
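The three-way branch of steps 801 to 804 can be sketched as a small decision function. This is only an illustrative sketch: the names `Action` and `decide` are hypothetical, and it assumes that sequence information is an integer for which a larger value means "newer" and equality means "corresponds", which the text introduces only as one example.

```python
from enum import Enum


class Action(Enum):
    ABANDON_FIRST = "abandon"   # step 801 YES: the first start time is not used
    COMPUTE = "compute"         # step 803: operate on both start times
    WAIT_FOR_NEXT = "wait"      # step 804: skip the second start time and wait


def decide(first_seq: int, second_seq: int) -> Action:
    """Classify a received (second) sequence number against the local (first) one.

    Assumption: larger integer == newer sequence information.
    """
    if second_seq > first_seq:      # step 801
        return Action.ABANDON_FIRST
    if second_seq == first_seq:     # step 802
        return Action.COMPUTE
    return Action.WAIT_FOR_NEXT     # step 804
```

In the `WAIT_FOR_NEXT` case the caller would call `decide` again once the third start time and third sequence information arrive from the second process.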

The arithmetic unit 411 of each of the nodes 501-2 to 501-n can perform the same arithmetic processing as the arithmetic unit 411 of the node 501-1.

FIG. 9 is a flowchart showing a specific example of the arithmetic processing of FIG. 8, performed when one process is running in each of the nodes 501-1 to 501-n in FIG. 5.

As the sequence information, for example, a sequence number that is a non-negative integer assigned in ascending order is used. The sequence number is managed by the network interface driver and is reset to 0 when the parallel application starts. The memory 602 in each node 501-i (i = 1 to n) stores the start time T(i) at which the process started inter-process communication and the sequence number S(i) of that process.

The CPU 601 in each node 501-i runs the process by executing the parallel application, and runs the network interface driver by executing the network interface driver program.

When the process in the node 501-i starts inter-process communication, the process instructs the network interface driver to perform arithmetic processing (step 901). Next, the CPU 601, operating as the network interface driver, reads the start time T(i) and the sequence number S(i) from the memory 602 (step 902). The CPU 601 then writes the read start time T(i) and sequence number S(i) to the memory 702 in the arithmetic unit 411 and instructs the CPU 701 in the arithmetic unit 411 to perform arithmetic processing. Finally, the CPU 601 increments the sequence number S(i) in the memory 602 by 1.
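The host-side hand-off in steps 901 and 902 can be modeled as follows. This is a minimal sketch under assumptions: `HostMemory` stands in for the memory 602, `NicMemory` for the memory 702, and `submit` for the driver routine; none of these names come from the source, and the instruction to the CPU 701 is omitted.

```python
from dataclasses import dataclass, field


@dataclass
class HostMemory:
    """Hypothetical model of memory 602: holds T(i) and S(i) for one process."""
    start_time: float = 0.0
    seq: int = 0  # reset to 0 when the parallel application starts


@dataclass
class NicMemory:
    """Hypothetical model of memory 702: node id -> (start_time, seq)."""
    entries: dict = field(default_factory=dict)


def submit(node_id: int, host: HostMemory, nic: NicMemory) -> tuple:
    # Step 902: read T(i) and S(i) from the host memory.
    t, s = host.start_time, host.seq
    # Copy the pair into the arithmetic unit's memory.
    nic.entries[node_id] = (t, s)
    # Increment S(i) in the host memory for the next communication.
    host.seq += 1
    return t, s
```

Note that the value copied to the arithmetic unit is the sequence number before the increment, so the pair in `NicMemory` identifies the communication that has just started.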

Next, the CPU 701 transmits the start time T(i) and the sequence number S(i) written to the memory 702 to the other nodes 501-j (1 ≤ j ≤ n, j ≠ i) (step 903). The CPU 701 then checks whether the start times and sequence numbers of all processes of the parallel application have been written to the memory 702 (step 904).

If the start time and sequence number of any process have not been written (step 904, NO), the CPU 701 checks whether a start time and a sequence number have been received from another node (step 908).

If no start time and sequence number have been received from another node (step 908, NO), the CPU 701 repeats the processing from step 904. If a start time T(k) and a sequence number S(k) have been received from another node 501-k (1 ≤ k ≤ n, k ≠ i) (step 908, YES), the CPU 701 compares the received sequence number S(k) with the sequence number S(i) (step 909).

If the received sequence number S(k) is smaller than the sequence number S(i) (step 909, NO), the CPU 701 repeats the processing from step 904. If the received sequence number S(k) is greater than or equal to the sequence number S(i) (step 909, YES), the CPU 701 writes the received start time T(k) and sequence number S(k) to the memory 702 (step 910). The CPU 701 then compares the received sequence number S(k) with the sequence number S(i) once more (step 911).

If the received sequence number S(k) matches the sequence number S(i) (step 911, NO), the CPU 701 repeats the processing from step 904. Once the start times and sequence numbers of all processes have been written to the memory 702 in this way (step 904, YES), the CPU 701 obtains the maximum of the start times of all processes and writes it to the memory 702 (step 905).

Next, the CPU 701 interrupts the CPU 601 (step 906), and the CPU 601 runs the network interface driver. The CPU 601 then reads the maximum start time from the memory 702, writes it to the memory 602, and initializes the memory 702 (step 907). This erases the start times and sequence numbers of all processes from the memory 702.

If the received sequence number S(k) is larger than the sequence number S(i) (step 911, YES), the CPU 701 interrupts the CPU 601 and notifies it that the instructed arithmetic processing is to be aborted (step 912). In response to this notification, the CPU 601 aborts the arithmetic processing that the process instructed the network interface driver to perform, without initializing the memory 702. Thereafter, when the process instructs the network interface driver to perform the next arithmetic processing, the CPU 601 starts the processing from step 901 again.
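The receive path of FIG. 9 (steps 904, 905, and 908 to 912) can be sketched as a single handler. This is an illustrative sketch, not the patented implementation: `on_receive` and the dictionary layout are hypothetical, and the completeness check counts only entries whose sequence number equals the local one, which is one interpretation of step 904 rather than something the text spells out.

```python
def on_receive(entries, own_id, n_nodes, sender, start_time, seq):
    """Handle one (T(k), S(k)) pair received from node `sender`.

    `entries` models memory 702 as {node_id: (start_time, seq)} and must
    already contain the local node's own pair (written in step 902).
    Returns ("pending", None), ("abort", None), or ("done", max_start_time).
    """
    own_seq = entries[own_id][1]

    if seq < own_seq:                       # step 909, NO: stale pair, ignore it
        return ("pending", None)

    entries[sender] = (start_time, seq)     # step 910

    if seq > own_seq:                       # step 911, YES
        return ("abort", None)              # step 912: memory is NOT initialized

    # Step 904 (assumed interpretation): all nodes have reported this sequence number.
    current = [t for (t, s) in entries.values() if s == own_seq]
    if len(current) == n_nodes:
        return ("done", max(current))       # step 905
    return ("pending", None)
```

On an `"abort"` result the newer pair stays in `entries`, so it is still available when the local process later submits a communication with the same sequence number.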

FIG. 10 is a flowchart showing a specific example of the arithmetic processing of FIG. 8, performed when a plurality of processes are running in each of the nodes 501-1 to 501-n in FIG. 5.

As the sequence information, for example, a sequence number that is a non-negative integer assigned in ascending order is used. The sequence number is managed by the network interface driver and is reset to 0 when the parallel application starts. Let p be the total number of processes running in the nodes 501-1 to 501-n, and let P(m) be the identification information of the m-th process (m = 1 to p). The memory 602 in each node 501-i (i = 1 to n) stores, for each process in the node 501-i, the start time T(m) at which the process started inter-process communication and the sequence number S(m) of that process.

When a process P(r) among the plurality of processes in the node 501-i starts inter-process communication, the process P(r) instructs the network interface driver to perform arithmetic processing (step 1001). Next, the CPU 601, operating as the network interface driver, reads the start time T(r) and the sequence number S(r) of the process P(r) from the memory 602 (step 1002). The CPU 601 then writes the read start time T(r) and sequence number S(r) to the memory 702 in the arithmetic unit 411 and instructs the CPU 701 in the arithmetic unit 411 to perform arithmetic processing. Finally, the CPU 601 increments the sequence number S(r) in the memory 602 by 1.

Next, the CPU 701 transmits the start time T(r) and the sequence number S(r) written to the memory 702 to the other nodes 501-j (1 ≤ j ≤ n, j ≠ i) (step 1003).

Thereafter, when another process P(x) in the node 501-i starts inter-process communication, the process P(x) instructs the network interface driver to perform arithmetic processing, as in step 1001. As in step 1002, the CPU 601 writes the start time T(x) and the sequence number S(x) of the process P(x) to the memory 702 and increments the sequence number S(x) in the memory 602 by 1. As in step 1003, the CPU 701 then transmits the start time T(x) and the sequence number S(x) written to the memory 702 to the other nodes 501-j (1 ≤ j ≤ n, j ≠ i).

Next, the CPU 701 checks whether the start times and sequence numbers of all processes of the parallel application have been written to the memory 702 (step 1004). If the start time and sequence number of any process have not been written (step 1004, NO), the CPU 701 checks whether a start time and a sequence number have been received from another node (step 1008).

If no start time and sequence number have been received from another node (step 1008, NO), the CPU 701 repeats the processing from step 1004.

If, in the meantime, another process P(y) in the node 501-i starts inter-process communication, the process P(y) instructs the network interface driver to perform arithmetic processing, as in step 1001. As in step 1002, the CPU 601 writes the start time T(y) and the sequence number S(y) of the process P(y) to the memory 702 and increments the sequence number S(y) in the memory 602 by 1. As in step 1003, the CPU 701 then transmits the start time T(y) and the sequence number S(y) written to the memory 702 to the other nodes 501-j (1 ≤ j ≤ n, j ≠ i).

If a start time T(q) and a sequence number S(q) have been received from another node 501-k (1 ≤ k ≤ n, k ≠ i) (step 1008, YES), the CPU 701 performs the processing of step 1009.

In step 1009, the CPU 701 obtains, from among the sequence numbers written to the memory 702, the maximum of the sequence numbers corresponding to the one or more processes in the node 501-i. The CPU 701 then compares the received sequence number S(q) with this maximum sequence number. If the received sequence number S(q) is smaller than the maximum sequence number (step 1009, NO), the CPU 701 repeats the processing from step 1004.

If the received sequence number S(q) is greater than or equal to the maximum sequence number (step 1009, YES), the CPU 701 writes the received start time T(q) and sequence number S(q) to the memory 702 (step 1010). The CPU 701 then compares the received sequence number S(q) with the maximum sequence number once more (step 1011).

If the received sequence number S(q) matches the maximum sequence number (step 1011, NO), the CPU 701 repeats the processing from step 1004. When the start times and sequence numbers of all processes in the node 501-i have been written to the memory 702 and those sequence numbers match the sequence number S(r), the maximum sequence number obtained in step 1009 is S(r).

When the start times and sequence numbers of all processes of the parallel application have been written to the memory 702 (step 1004, YES), the CPU 701 obtains the maximum of the start times of all processes and writes it to the memory 702 (step 1005).

Next, the CPU 701 interrupts the CPU 601 (step 1006), and the CPU 601 runs the network interface driver. The CPU 601 then reads the maximum start time from the memory 702, writes it to the memory 602, and initializes the memory 702 (step 1007). This erases the start times and sequence numbers of all processes from the memory 702.

If the received sequence number S(q) is larger than the maximum sequence number (step 1011, YES), the CPU 701 interrupts the CPU 601 and notifies it that the instructed arithmetic processing is to be aborted (step 1012). In response to this notification, the CPU 601 aborts the arithmetic processing that the process P(r) instructed the network interface driver to perform, without initializing the memory 702. Thereafter, when the process P(r) or another process instructs the network interface driver to perform the next arithmetic processing, the CPU 601 starts the processing from step 1001 again.
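The difference from FIG. 9 is that the comparison threshold in steps 1009 and 1011 is the maximum sequence number among the local processes already recorded in the memory 702. A minimal sketch, assuming memory 702 is modeled as a dictionary keyed by process identifier (the function names and return labels are hypothetical):

```python
def local_max_seq(entries, local_pids):
    """Step 1009: largest sequence number among local processes found in memory 702.

    Precondition: at least one local process has already written its pair
    (this holds in the flow because step 1002 precedes step 1008).
    """
    return max(entries[p][1] for p in local_pids if p in entries)


def classify_remote(entries, local_pids, sender_pid, start_time, seq):
    """Handle a (T(q), S(q)) pair from a remote process.

    Returns "ignored", "stored", or "abort".
    """
    threshold = local_max_seq(entries, local_pids)
    if seq < threshold:                        # step 1009, NO
        return "ignored"
    entries[sender_pid] = (start_time, seq)    # step 1010
    if seq > threshold:                        # step 1011, YES
        return "abort"                         # step 1012
    return "stored"                            # step 1011, NO: back to step 1004
```

Because the threshold is recomputed on every call, it rises automatically as further local processes submit newer sequence numbers.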

According to the arithmetic processing of FIG. 9 or FIG. 10, even when instructed to perform arithmetic processing, the arithmetic unit 411 does not perform an operation using start times until the start times of all processes having the latest sequence number have been written to the memory 702. Then, when the sequence number of a process in the node 501-i has been incremented and becomes equal to the received sequence number, arithmetic processing using the start times of all processes having that sequence number is performed.

In this way, arithmetic processing is always performed on the start times having the latest sequence number, and each process can instruct the arithmetic unit 411 to perform arithmetic processing using the latest start time, one instruction after another, without waiting for arithmetic processing that uses older start times to complete. Therefore, operations using the start times of inter-process communication can be performed while the parallel application is running, without changing the behavior of the parallel application.

In addition, it suffices to provide a single arithmetic unit 411 in the communication control device 401, and there is no need to provide a plurality of network reduction mechanisms, so the increase in hardware can be kept small. The arithmetic processing of FIG. 9 or FIG. 10 can be expected to be particularly effective in large-scale parallel computer systems.

The configurations of the communication control device 401 in FIGS. 4 and 7, the parallel computer system 500 in FIG. 5, and the node 501-i in FIG. 6 are merely examples, and some components may be omitted or changed depending on the use and conditions of the parallel computer system.

The flowcharts of FIGS. 8 to 10 are merely examples, and some of the processing may be omitted or changed depending on the configuration and conditions of the parallel computer system. For example, in the arithmetic processing of FIG. 9 or FIG. 10, information other than a sequence number that indicates the order of communication processing may be used as the sequence information. Also, in step 905 of FIG. 9 or step 1005 of FIG. 10, the CPU 701 may obtain another value, such as the sum or the minimum of the start times, instead of the maximum of the start times.
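The interchangeable reductions mentioned for steps 905 and 1005 can be illustrated with a small dispatch table. This is only a sketch: the `REDUCTIONS` table and `reduce_start_times` are hypothetical names, and the start times are assumed to be plain numbers.

```python
# Hypothetical table of reduction operators named in the text: max, min, sum.
REDUCTIONS = {"max": max, "min": min, "sum": sum}


def reduce_start_times(start_times, op="max"):
    """Reduce the collected start times with the selected operator.

    The default "max" mirrors steps 905 and 1005; "min" and "sum" are the
    alternatives the text mentions.
    """
    return REDUCTIONS[op](start_times)
```

For example, `reduce_start_times([3.0, 1.0, 2.0])` yields the latest start time, while passing `op="min"` yields the earliest.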

Although the disclosed embodiment and its advantages have been described in detail, those skilled in the art will be able to make various changes, additions, and omissions without departing from the scope of the present invention as explicitly set forth in the claims.

Regarding the embodiment described with reference to FIGS. 4 to 10, the following appendices are further disclosed.
(Appendix 1)
A communication control device connected to an arithmetic processing device and a main storage device, the communication control device comprising:
an arithmetic unit that, when second sequence information assigned to a second start time at which a second process among a plurality of processes included in a program executed by the arithmetic processing device started second inter-process communication is newer than first sequence information that the arithmetic processing device assigned to a first start time at which a first process among the plurality of processes started first inter-process communication and wrote to the main storage device, does not perform an operation using the first start time, and, when the second sequence information corresponds to the first sequence information, performs an operation using the first start time and the second start time and outputs an operation result.
(Appendix 2)
The communication control device according to Appendix 1, wherein, when the first sequence information is newer than the second sequence information and third sequence information assigned to a third start time at which the second process started third inter-process communication corresponds to the first sequence information, the arithmetic unit does not perform an operation using the second start time, and performs an operation using the first start time and the third start time and outputs an operation result.
(Appendix 3)
The communication control device according to Appendix 1 or 2, wherein the arithmetic processing device executes the first process, and an arithmetic processing device of an information processing device connected via the communication control device executes the second process.
(Appendix 4)
The communication control device according to Appendix 3, wherein the first sequence information corresponds to, or is newer than, fourth sequence information assigned to a fourth start time at which a fourth process, executed among the plurality of processes by the arithmetic processing device that executes the first process, started fourth inter-process communication.
(Appendix 5)
The communication control device according to Appendix 1 or 2, wherein the arithmetic processing device executes the first process and the second process.
(Appendix 6)
An information processing device comprising:
an arithmetic processing device that executes a program and assigns first sequence information to a first start time at which a first process among a plurality of processes included in the program started first inter-process communication;
a main storage device that stores the first start time and the first sequence information; and
a communication control device that, when second sequence information assigned to a second start time at which a second process among the plurality of processes started second inter-process communication is newer than the first sequence information, does not perform an operation using the first start time, and, when the second sequence information corresponds to the first sequence information, performs an operation using the first start time and the second start time and outputs an operation result.
(Appendix 7)
A parallel computer system having a plurality of information processing devices, wherein at least one information processing device of the plurality of information processing devices comprises:
an arithmetic processing device that executes a program and assigns first sequence information to a first start time at which a first process among a plurality of processes included in the program started first inter-process communication;
a main storage device that stores the first start time and the first sequence information; and
a communication control device that, when second sequence information assigned to a second start time at which a second process among the plurality of processes started second inter-process communication is newer than the first sequence information, does not perform an operation using the first start time, and, when the second sequence information corresponds to the first sequence information, performs an operation using the first start time and the second start time and outputs an operation result.
(Appendix 8)
A control program for an information processing apparatus having an arithmetic processing unit, a communication control device, and a main storage device, the control program causing an arithmetic processing unit in the communication control device to execute processing comprising:
not performing a calculation using a first start time when second sequence information is newer than first sequence information, the first sequence information having been assigned by the arithmetic processing unit of the information processing apparatus to the first start time, at which a first process among a plurality of processes included in a program executed by that arithmetic processing unit starts first inter-process communication, and written to the main storage device, and the second sequence information having been assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication; and
performing a calculation using the first start time and the second start time and outputting a calculation result when the second sequence information corresponds to the first sequence information.
(Appendix 9)
A method for controlling a parallel computer system having a plurality of information processing apparatuses, wherein
at least one information processing apparatus among the plurality of information processing apparatuses:
executes a program;
assigns first sequence information to a first start time at which a first process among a plurality of processes included in the program starts first inter-process communication;
does not perform a calculation using the first start time when second sequence information, assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, is newer than the first sequence information; and
performs a calculation using the first start time and the second start time and outputs a calculation result when the second sequence information corresponds to the first sequence information.
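The sequence-information comparison recited in the appendices above can be illustrated with a minimal sketch. It is not the claimed implementation. The names `StampedStart` and `reduce_start_times` are hypothetical, sequence information is assumed to be an integer counter in which a larger value is newer and equality means "corresponds", and the calculation itself (here `max` by default) is an assumption, since these appendices do not state which calculation is performed.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class StampedStart:
    """A communication start time tagged with sequence information."""
    start_time: float
    sequence: int  # assumption: a larger value means newer


def reduce_start_times(
    first: StampedStart,
    second: StampedStart,
    op: Callable[[float, float], float] = max,  # assumed calculation
) -> Optional[float]:
    """Return the calculation result, or None when no calculation is performed."""
    if second.sequence > first.sequence:
        # Second sequence information is newer: the first start time
        # is not used in any calculation.
        return None
    if second.sequence == first.sequence:
        # Sequence information corresponds: calculate with both start times.
        return op(first.start_time, second.start_time)
    # First sequence information is newer: the second start time is not
    # used; a later start time of the second process would be needed.
    return None
```

A caller that receives `None` would simply hold the newer entry and retry when a start time with matching sequence information arrives, for example `reduce_start_times(StampedStart(1.0, 3), StampedStart(2.5, 3))` once both entries carry sequence value 3.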

101, 102 Arrow
301 Network reduction mechanism
401 Communication control device
411 Arithmetic unit
500 Parallel computer system
501-1 to 501-n, 501-i Node
502 Disk node
503 Communication network
601, 701 CPU
602, 702 Memory
603 Medium drive device
604, 705 Bus
605 Portable recording medium
703, 704 Interface

Claims

A communication control device connected to an arithmetic processing unit and a main storage device, the communication control device comprising:
a calculation unit that does not perform a calculation using a first start time when second sequence information is newer than first sequence information, the first sequence information having been assigned by the arithmetic processing unit to the first start time, at which a first process among a plurality of processes included in a program executed by the arithmetic processing unit starts first inter-process communication, and written to the main storage device, and the second sequence information having been assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, and that performs a calculation using the first start time and the second start time and outputs a calculation result when the second sequence information corresponds to the first sequence information.

The communication control device according to claim 1, wherein, when the first sequence information is newer than the second sequence information and third sequence information, assigned to a third start time at which the second process starts third inter-process communication, corresponds to the first sequence information, the calculation unit does not perform a calculation using the second start time, performs a calculation using the first start time and the third start time, and outputs a calculation result.

The communication control device according to claim 1 or 2, wherein the arithmetic processing unit executes the first process, and an arithmetic processing unit included in an information processing apparatus connected via the communication control device executes the second process.

An information processing apparatus comprising:
an arithmetic processing unit that executes a program and assigns first sequence information to a first start time at which a first process among a plurality of processes included in the program starts first inter-process communication;
a main storage device that stores the first start time and the first sequence information; and
a communication control device that does not perform a calculation using the first start time when second sequence information, assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, is newer than the first sequence information, and that performs a calculation using the first start time and the second start time and outputs a calculation result when the second sequence information corresponds to the first sequence information.

A parallel computer system having a plurality of information processing apparatuses, wherein
at least one information processing apparatus among the plurality of information processing apparatuses comprises:
an arithmetic processing unit that executes a program and assigns first sequence information to a first start time at which a first process among a plurality of processes included in the program starts first inter-process communication;
a main storage device that stores the first start time and the first sequence information; and
a communication control device that does not perform a calculation using the first start time when second sequence information, assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, is newer than the first sequence information, and that performs a calculation using the first start time and the second start time and outputs a calculation result when the second sequence information corresponds to the first sequence information.

A control program for an information processing apparatus having an arithmetic processing unit, a communication control device, and a main storage device, the control program causing an arithmetic processing unit in the communication control device to execute processing comprising:
not performing a calculation using a first start time when second sequence information is newer than first sequence information, the first sequence information having been assigned by the arithmetic processing unit of the information processing apparatus to the first start time, at which a first process among a plurality of processes included in a program executed by that arithmetic processing unit starts first inter-process communication, and written to the main storage device, and the second sequence information having been assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication; and
performing a calculation using the first start time and the second start time and outputting a calculation result when the second sequence information corresponds to the first sequence information.

A method for controlling a parallel computer system having a plurality of information processing apparatuses, wherein
at least one information processing apparatus among the plurality of information processing apparatuses:
executes a program;
assigns first sequence information to a first start time at which a first process among a plurality of processes included in the program starts first inter-process communication;
does not perform a calculation using the first start time when second sequence information, assigned to a second start time at which a second process among the plurality of processes starts second inter-process communication, is newer than the first sequence information; and
performs a calculation using the first start time and the second start time and outputs a calculation result when the second sequence information corresponds to the first sequence information.