JP2010067033A

JP2010067033A - Data processor, data processing method, and program

Info

Publication number: JP2010067033A
Application number: JP2008233126A
Authority: JP
Inventors: Kuniaki Noda; 邦昭野田; Masato Ito; 真人伊藤; Kazumi Aoyama; 一美青山
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2008-09-11
Filing date: 2008-09-11
Publication date: 2010-03-25

Abstract

[Problem] To learn complex, long time-series data easily and to generate smooth time-series data.
[Solution] A teacher data division unit 12 divides teacher data, which is time-series data, into a plurality of partially overlapping pieces of model learning data. A learning unit 14 assigns one piece of model learning data to one learning model and trains each learning model using the model learning data assigned to it. A connectivity calculation unit 16 calculates the error between the last part of the time-series data generated by one learning model and the first part of the time-series data generated by another learning model as the connectivity, which represents how appropriate it is to connect the other learning model after the one learning model. The present invention can be applied, for example, to learning and generating time-series data.
[Selected drawing] Figure 1

Description

The present invention relates to a data processing device, a data processing method, and a program, and in particular to a data processing device, a data processing method, and a program that make it possible, for example, to learn complex, long time-series data easily and to generate smooth time-series data accurately based on the learning result.

Learning models that approximate a time-series pattern as a function in the form of a time difference equation and learn (store) it as dynamics include, for example, the RNN (Recurrent Neural Network) and SVR (Support Vector Regression) (see, for example, Patent Documents 1 and 2).

For example, when an RNN is trained using time-series data, the RNN acquires (stores, learns) the time-series pattern of the time-series data used for training.

An RNN that has learned a time-series pattern can generate time-series data of that pattern.

In addition, a data generation technique has previously been proposed in which RNNs serve as the nodes of a SOM (Self-Organizing Map) and time-series data is generated from the RNN of the winner node (see, for example, Patent Documents 3 and 4).

Patent Document 1: JP 2007-265345 A
Patent Document 2: JP 2006-318319 A
Patent Document 3: JP 2007-280066 A
Patent Document 4: JP 2007-280067 A

In an RNN or similar model, however, the mechanism that stores time-series patterns has a limited storage capacity. It is therefore difficult for an RNN of moderate scale to store a time-series pattern that is complex (strongly nonlinear, multidimensional) and long, and to reconstruct (generate) time-series data of such a pattern accurately.

That is, storing a complex, long time-series pattern and generating its time-series data accurately requires a large-scale RNN. Training a large-scale RNN, however, requires an enormous amount of computation and time.

With the data generation technique described above, on the other hand, long time-series data can be output by successively determining the winner node, generating time-series data from the RNN of that winner node, and outputting the data in sequence.

In this case, however, whenever the winner node switches from one node to another, the time-series data becomes discontinuous across the switch.

Methods for outputting smooth time-series data therefore include suppressing the switching of the winner node (Patent Document 3) and smoothing the portions where the time-series data is discontinuous (discontinuous portions) (Patent Document 4).

Suppressing the switching of the winner node, however, only reduces the number of discontinuous portions; a discontinuous portion still arises whenever the winner node switches.

Smoothing the discontinuous portions, meanwhile, may yield time-series data of a pattern that differs from the pattern stored in the RNN of the winner node (that is, a time-series pattern that the RNN has not stored).

The present invention has been made in view of this situation, and makes it possible to learn complex, long time-series data easily and to generate smooth time-series data accurately based on the learning result.

A data processing device or program according to a first aspect of the present invention is a data processing device, or a program causing a computer to function as a data processing device, that operates on a plurality of trained learning models. The trained learning models are obtained by dividing time-series data into a plurality of partially overlapping pieces of data, outputting the pieces as model learning data used to train learning models that learn time-series patterns and have internal states, assigning the plurality of pieces of model learning data to a plurality of the learning models so that one piece of model learning data is assigned to one learning model, and training each learning model on its time-series pattern using the model learning data assigned to it. The device includes: start point model selection means for selecting one of the trained learning models as a start point model, which is the start point of a generation model sequence, that is, a sequence of learning models used to generate time-series data; end point model selection means for selecting another one of the learning models as an end point model, which is the end point of the generation model sequence; generation model sequence calculation means for taking a value corresponding to the connectivity as the connection cost of connecting one learning model after another, and obtaining, as the generation model sequence, the ordering of learning models from the start point model to the end point model that minimizes the cumulative connection cost; and time-series data generation means for determining, for each learning model in the generation model sequence, an initial value of the internal state of that learning model so as to reduce the error between the data sequence at the end of the time-series data it generates and the data sequence at the beginning of the time-series data generated by the learning model connected after it, and for generating time-series data by giving that initial value to the learning model. The connectivity is obtained by calculating, for all of the learning models, a value representing the error between the data sequence at the end of the time-series data generated by one learning model and the data sequence at the beginning of the time-series data generated by another learning model; it represents how appropriate it is for the time-series pattern learned by the other learning model to follow the time-series pattern learned by the one learning model.

A data processing method according to the first aspect of the present invention includes steps in which a data processing device operates on a plurality of trained learning models. The trained learning models are obtained by dividing time-series data into a plurality of partially overlapping pieces of data, outputting the pieces as model learning data used to train learning models that learn time-series patterns and have internal states, assigning the plurality of pieces of model learning data to a plurality of the learning models so that one piece of model learning data is assigned to one learning model, and training each learning model on its time-series pattern using the model learning data assigned to it. In the method, the data processing device selects one of the trained learning models as a start point model, which is the start point of a generation model sequence, that is, a sequence of learning models used to generate time-series data; selects another one of the learning models as an end point model, which is the end point of the generation model sequence; takes a value corresponding to the connectivity as the connection cost of connecting one learning model after another, and obtains, as the generation model sequence, the ordering of learning models from the start point model to the end point model that minimizes the cumulative connection cost; and, for each learning model in the generation model sequence, determines an initial value of the internal state of that learning model so as to reduce the error between the data sequence at the end of the time-series data it generates and the data sequence at the beginning of the time-series data generated by the learning model connected after it, and generates time-series data by giving that initial value to the learning model. The connectivity is obtained by calculating, for all of the learning models, a value representing the error between the data sequence at the end of the time-series data generated by one learning model and the data sequence at the beginning of the time-series data generated by another learning model; it represents how appropriate it is for the time-series pattern learned by the other learning model to follow the time-series pattern learned by the one learning model.

In the first aspect described above, one of the trained learning models is selected as the start point model, and another one of the learning models is selected as the end point model. A value corresponding to the connectivity is then taken as the connection cost of connecting one learning model after another, and the ordering of learning models from the start point model to the end point model that minimizes the cumulative connection cost is obtained as the generation model sequence. For each learning model in the generation model sequence, an initial value of the internal state of that learning model is determined so as to reduce the error between the data sequence at the end of the time-series data it generates and the data sequence at the beginning of the time-series data generated by the learning model connected after it, and time-series data is generated by giving that initial value to the learning model.
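The search for a generation model sequence with minimum cumulative connection cost can be sketched as a shortest-path problem. The following is a minimal sketch only: this passage does not name a search algorithm, so the use of Dijkstra's algorithm, the function name `generation_model_sequence`, and the example cost values are all assumptions made for illustration.

```python
import heapq
from typing import List


def generation_model_sequence(cost: List[List[float]], start: int, end: int) -> List[int]:
    """Return model indices from `start` to `end` with minimum summed connection cost.

    cost[i][j] is the cost of connecting model j after model i. Dijkstra's
    algorithm is an assumption of this sketch; it needs non-negative costs,
    which holds when the costs are error values.
    """
    n = len(cost)
    best = [float("inf")] * n   # best known cumulative cost to reach each model
    prev = [-1] * n             # predecessor of each model on the best path
    best[start] = 0.0
    queue = [(0.0, start)]
    while queue:
        dist, i = heapq.heappop(queue)
        if dist > best[i]:
            continue            # stale queue entry
        if i == end:
            break
        for j in range(n):
            if j == i:
                continue
            cand = dist + cost[i][j]
            if cand < best[j]:
                best[j] = cand
                prev[j] = i
                heapq.heappush(queue, (cand, j))
    if best[end] == float("inf"):
        raise ValueError("end model is not reachable from start model")
    path = [end]
    while path[-1] != start:
        path.append(prev[path[-1]])
    return path[::-1]


# Made-up connection costs for four models; low values mean a good connection.
example_cost = [
    [0.0, 1.0, 9.0, 9.0],
    [9.0, 0.0, 1.0, 9.0],
    [9.0, 9.0, 0.0, 1.0],
    [1.0, 9.0, 9.0, 0.0],
]
sequence = generation_model_sequence(example_cost, start=0, end=3)
print(sequence)  # [0, 1, 2, 3]
```

In this example the direct connection from model 0 to model 3 costs 9, while passing through models 1 and 2 costs 3 in total, so the longer sequence is chosen.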

A data processing device or program according to a second aspect of the present invention is a data processing device, or a program causing a computer to function as a data processing device, that includes: division means for dividing time-series data into a plurality of partially overlapping pieces of data and outputting the pieces as model learning data used to train learning models that learn time-series patterns and have internal states; learning means for assigning the plurality of pieces of model learning data obtained by dividing the time-series data to a plurality of the learning models so that one piece of model learning data is assigned to one learning model, and for training each learning model on its time-series pattern using the model learning data assigned to it; and connectivity calculation means for calculating, for all of the learning models, the error between the data sequence at the end of the time-series data generated by one learning model and the data sequence at the beginning of the time-series data generated by another learning model as the connectivity, which represents how appropriate it is for the time-series pattern learned by the other learning model to follow the time-series pattern learned by the one learning model.

A data processing method according to the second aspect of the present invention includes steps in which a data processing device divides time-series data into a plurality of partially overlapping pieces of data and outputs the pieces as model learning data used to train learning models that learn time-series patterns and have internal states; assigns the plurality of pieces of model learning data obtained by dividing the time-series data to a plurality of the learning models so that one piece of model learning data is assigned to one learning model, and trains each learning model on its time-series pattern using the model learning data assigned to it; and calculates, for all of the learning models, the error between the data sequence at the end of the time-series data generated by one learning model and the data sequence at the beginning of the time-series data generated by another learning model as the connectivity, which represents how appropriate it is for the time-series pattern learned by the other learning model to follow the time-series pattern learned by the one learning model.

In the second aspect described above, time-series data is divided into a plurality of partially overlapping pieces of data, which are output as model learning data used to train learning models that learn time-series patterns and have internal states. The plurality of pieces of model learning data obtained by dividing the time-series data are assigned to a plurality of the learning models so that one piece of model learning data is assigned to one learning model, and each learning model is trained on its time-series pattern using the model learning data assigned to it. In addition, for all of the learning models, the error between the data sequence at the end of the time-series data generated by one learning model and the data sequence at the beginning of the time-series data generated by another learning model is calculated as the connectivity, which represents how appropriate it is for the time-series pattern learned by the other learning model to follow the time-series pattern learned by the one learning model.

Note that the data processing device may be an independent device or an internal block of a single device.

The program can be provided by transmitting it over a transmission medium or by recording it on a recording medium.

According to the first and second aspects of the present invention, complex, long time-series data can be learned easily, and smooth time-series data can be generated accurately based on the learning result.

[Overall configuration of the data processing device to which the present invention is applied]
FIG. 1 is a block diagram showing a configuration example of an embodiment of a data processing device to which the present invention is applied.

The data processing device learns, for example, time-series data for making a real robot or the like act (for example, data for driving actuators) or time-series data for making a virtual character or the like shown on a display act. Based on the learning result, the data processing device then generates time-series data for making the real robot or virtual character act autonomously and supplies it to the robot or the like, thereby controlling (the behavior of) that robot or the like.

That is, in FIG. 1, the data processing device includes a learning device 10, a data generation device 20, and so on.

The data processing device can also be configured with only the learning device 10 or only the data generation device 20.

Note that the data generation device 20 performs the data generation processing described later using information (data) that the learning device 10 obtains by performing the learning processing described later. Therefore, when the data processing device consists only of the data generation device 20, the information needed for the data generation processing must be supplied to the data generation device 20 from outside or stored inside the data generation device 20 in advance.

The learning device 10 performs learning processing. Using time-series data prepared for learning time-series patterns (hereinafter also called teacher data), it trains a plurality of learning models, each of which learns a time-series pattern and has an internal state. It then obtains, for all of the trained learning models, the connectivity, which represents how appropriate it is to connect the time-series patterns learned (stored) by any two of the learning models.

That is, as learning processing, the learning device 10 divides the work of learning the teacher data, which is for example complex, long time-series data, among a plurality of learning models, so that each learning model acquires (stores) a time-series pattern as dynamics.

As further learning processing, the learning device 10 obtains the connectivity, which represents how appropriate (natural) it is to connect the time-series patterns that the individual learning models have acquired as dynamics.

Here, dynamics describes a dynamical system that changes over time and can be expressed, for example, by a concrete function. A learning model stores the characteristics of how time-series data changes over time, that is, the time-series pattern, as dynamics.

The learning device 10 includes a teacher data storage unit 11, a teacher data division unit 12, a model learning data storage unit 13, a learning unit 14, a model parameter storage unit 15, a connectivity calculation unit 16, a connectivity storage unit 17, and so on.

Teacher data is supplied to the teacher data storage unit 11 from outside. The teacher data storage unit 11 stores the teacher data supplied to it.

Complex, long time-series data can be used as the teacher data. The teacher data may also be, for example, simple, short time-series data, or time-series data that is complex but not especially long.

When the data processing device generates time-series data for making a real robot act autonomously in some environment, for example, the teacher data is the time-series data obtained when a user who teaches the behavior actually moves the robot in the environment where it is to act.

That is, the teacher data is a time series of vectors whose components include data on physical quantities that the robot can sense while the user moves it (hereinafter also called sensor data) and the data (signals) given to the robot's actuators to produce the movement (hereinafter also called action data).

A time series of vectors whose components are such sensor data and action data is hereinafter also called sensor motor data.
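As a rough illustration of sensor motor data, the sketch below concatenates a sensor vector and an action vector at each time step. The function name `to_sensor_motor`, the vector sizes, and the numeric values are hypothetical and chosen only for the example.

```python
from typing import List, Sequence


def to_sensor_motor(sensor: Sequence[Sequence[float]],
                    action: Sequence[Sequence[float]]) -> List[List[float]]:
    """Concatenate sensor and action vectors sample by sample."""
    if len(sensor) != len(action):
        raise ValueError("sensor and action series must have the same number of samples")
    return [list(s) + list(a) for s, a in zip(sensor, action)]


sensor_series = [[0.1, 0.2], [0.3, 0.4]]  # two sensed quantities per step (made-up values)
action_series = [[1.0], [0.5]]            # one actuator signal per step (made-up values)
sensor_motor = to_sensor_motor(sensor_series, action_series)
print(sensor_motor)  # [[0.1, 0.2, 1.0], [0.3, 0.4, 0.5]]
```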

The teacher data division unit 12 divides the time-series data stored as teacher data in the teacher data storage unit 11 into a plurality of partially overlapping pieces of data (each of which is itself time-series data) and supplies them to the model learning data storage unit 13 as model learning data used to train the learning models.

The pieces of model learning data that the teacher data division unit 12 obtains by dividing the teacher data may have the same length (number of samples) or different lengths. The same applies to the length of the overlap.

To simplify the explanation, however, the following assumes that all pieces of model learning data obtained by dividing the teacher data have the same fixed length and that the overlap also has a fixed length.
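The fixed-length, fixed-overlap division can be sketched as a sliding window. This is a minimal sketch under stated assumptions: the names `split_with_overlap`, `length`, and `overlap` are hypothetical, and dropping a trailing remainder that is too short to form a full segment is a choice of this sketch, not something the text specifies.

```python
from typing import List, Sequence


def split_with_overlap(teacher: Sequence, length: int, overlap: int) -> List[list]:
    """Split a time series into fixed-length segments sharing `overlap` samples.

    A trailing remainder shorter than `length` is dropped (an assumption).
    """
    if not 0 < overlap < length:
        raise ValueError("overlap must satisfy 0 < overlap < length")
    step = length - overlap  # distance between consecutive segment starts
    segments = []
    for start in range(0, len(teacher) - length + 1, step):
        segments.append(list(teacher[start:start + length]))
    return segments


teacher_data = list(range(10))  # stand-in for a series of sample vectors
segments = split_with_overlap(teacher_data, length=4, overlap=2)
print(segments)  # [[0, 1, 2, 3], [2, 3, 4, 5], [4, 5, 6, 7], [6, 7, 8, 9]]
```

The last two samples of each segment reappear as the first two samples of the next one, which is the overlap that later makes it possible to compare the end of one model's output with the start of another's.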

The model learning data storage unit 13 stores the plurality of pieces of model learning data from the teacher data division unit 12.

The learning unit 14 assigns the plurality of pieces of model learning data stored in the model learning data storage unit 13 to a plurality of learning models so that one piece of model learning data is assigned to one learning model. The learning unit 14 then trains each learning model on its time-series pattern using the model learning data assigned to it, thereby obtaining the model parameters that define that learning model. Finally, the learning unit 14 supplies the model parameters of each learning model to the model parameter storage unit 15.

Here, the number N of learning models that the learning unit 14 trains equals the number N of pieces of model learning data obtained by the teacher data division unit 12.

Therefore, for example, the teacher data division unit 12 divides the teacher data into a number of pieces of model learning data no greater than the number of learning models prepared in advance. Alternatively, the learning unit 14 creates as many learning models as there are pieces of model learning data obtained by the teacher data division unit 12. Note that a learning model is, in substance, a storage area such as memory (for example, an instance in object-oriented programming).

The model parameter storage unit 15 stores the model parameters supplied from the learning unit 14.

The connectivity calculation unit 16 uses the model learning data stored in the model learning data storage unit 13 and the model parameters stored in the model parameter storage unit 15 to calculate the connectivity for all of the learning models trained by the learning unit 14, and supplies the connectivity to the connectivity storage unit 17.

Connectivity represents how appropriate it is for the time-series pattern learned by one learning model to be followed by the time-series pattern learned by another learning model. The connectivity calculation unit 16 calculates, as the connectivity, the error between the data sequence at the end of the time-series data generated by one learning model and the data sequence at the beginning of the time-series data generated by another learning model.
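The tail-to-head error can be sketched as follows. This passage does not specify the error measure or how each model's generated series is obtained, so the mean squared error, the function names `tail_head_error` and `connectivity_matrix`, the `overlap` parameter, and the example series are all assumptions of this sketch.

```python
from typing import List, Sequence

Series = Sequence[Sequence[float]]  # a time series of sample vectors


def tail_head_error(prev: Series, nxt: Series, overlap: int) -> float:
    """Mean squared error between the last `overlap` samples of `prev`
    and the first `overlap` samples of `nxt` (the error measure is an assumption)."""
    tail, head = prev[-overlap:], nxt[:overlap]
    total, count = 0.0, 0
    for a, b in zip(tail, head):
        for x, y in zip(a, b):
            total += (x - y) ** 2
            count += 1
    return total / count


def connectivity_matrix(generated: List[Series], overlap: int) -> List[List[float]]:
    """c[i][j] is the error of connecting model j after model i (smaller is better)."""
    n = len(generated)
    return [[tail_head_error(generated[i], generated[j], overlap) for j in range(n)]
            for i in range(n)]


# Made-up one-dimensional series standing in for data generated by three models.
generated = [
    [[0.0], [1.0], [2.0], [3.0]],
    [[2.0], [3.0], [4.0], [5.0]],
    [[9.0], [9.0], [0.0], [1.0]],
]
c = connectivity_matrix(generated, overlap=2)
print(c[0][1], c[1][0])  # 0.0 16.0
```

Here model 1 continues exactly where model 0 ends, so `c[0][1]` is zero, while connecting model 0 after model 1 gives a large error. The matrix is generally not symmetric, since the order of connection matters.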

The connectivity storage unit 17 stores the connectivity supplied from the connectivity calculation unit 16.

The data generation device 20 performs data generation processing: based on the plurality of trained learning models (model parameters) obtained by the learning device 10 and the connectivity calculated for those learning models, it generates complex, long, smooth time-series data comparable to the teacher data.

That is, as data generation processing, the data generation device 20 selects one of the trained learning models as a start point model, which is the start point of a generation model sequence, that is, a sequence of learning models used to generate time-series data. It also selects another one of the learning models as an end point model, which is the end point of the generation model sequence.

As further data generation processing, the data generation device 20 obtains, based on the connectivity, an ordering of learning models from the start point model to the end point model as the generation model sequence.

Finally, as data generation processing, the data generation device 20 generates complex, long, smooth time-series data comparable to the teacher data based on the generation model sequence.

The data generation device 20 includes a current data supply unit 21, a target data supply unit 22, a start point model selection unit 23, an end point model selection unit 24, a generation model sequence calculation unit 25, a time-series data generation unit 26, a time-series data output unit 27, and the like.

The current data supply unit 21 supplies current data, which is time-series data, to the start point model selection unit 23 and the time-series data generation unit 26.

Here, a robot or the like controlled by the data processing device provides the data processing device with a time series of vectors of the same kind as those constituting the teacher data, as observable data. The current data is, for example, a plurality of consecutive samples, including the sample (vector) at the current time, taken from the observable sensor-motor data provided by the robot or the like controlled by the data processing device.

Note that the number of samples constituting the current data is assumed to be, for example, smaller than the number of samples constituting the model learning data.

The current data supply unit 21 extracts the current data from, for example, the observable sensor-motor data provided by the robot or the like controlled by the data processing device, and supplies it to the start point model selection unit 23 and the time-series data generation unit 26.

The target data supply unit 22 supplies target data, which is time-series data, to the end point model selection unit 24.

Here, the target data is data of the same kind (same dimension) as the current data, and is provided to the target data supply unit 22 from outside, for example, by a user.

For example, suppose the data generation device 20 generates sensor-motor data, as time-series data, for moving a robot controlled by the data processing device from its current position to a position designated from outside, for example by a user (hereinafter also referred to as the target position). In that case, the sensor-motor data that the robot obtains at the current position (several samples of sensor-motor data) is the current data, and the sensor-motor data that would be obtained at the target position is the target data.

The start point model selection unit 23 selects, based on the current data from the current data supply unit 21, one of the plurality of learning models whose model parameters are stored in the model parameter storage unit 15, that is, one of the plurality of trained learning models, as the start point model. The start point model selection unit 23 then supplies a start point model ID (identification) that identifies the start point model to the generation model sequence calculation unit 25.

The end point model selection unit 24 selects, based on the target data from the target data supply unit 22, one of the plurality of learning models whose model parameters are stored in the model parameter storage unit 15, that is, one of the plurality of trained learning models, as the end point model. The end point model selection unit 24 then supplies an end point model ID that identifies the end point model to the generation model sequence calculation unit 25.

Here, the start point model is the learning model at the start point of the generation model sequence, which is the sequence of learning models used for generating time-series data, and the end point model is the learning model at the end point of the generation model sequence.

The start point model is used to generate the first part of the (long) time-series data generated by the time-series data generation unit 26 (hereinafter also referred to as generated time-series data), and the end point model is used to generate the last part of the generated time-series data.

The generation model sequence calculation unit 25 obtains, as the generation model sequence, a certain sequence of learning models from the start point model identified by the start point model ID from the start point model selection unit 23 to the end point model identified by the end point model ID from the end point model selection unit 24.

That is, the generation model sequence calculation unit 25 uses a value corresponding to the connectivity stored in the connectivity storage unit 17 as the connection cost of connecting another learning model after one learning model, and obtains, as the generation model sequence, the sequence of learning models from the start point model to the end point model that minimizes the cumulative connection cost.
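The search for a sequence with minimum cumulative connection cost can be illustrated as a shortest-path problem. The following is a minimal sketch, assuming non-negative costs and using Dijkstra's algorithm; the text does not name a specific search algorithm, and the function name `min_cost_model_sequence`, the 0-based model indices, and the example cost table are hypothetical.

```python
import heapq

def min_cost_model_sequence(cost, start, end):
    """Return (total_cost, sequence) minimizing the cumulative connection cost.

    cost[i][j] is a hypothetical non-negative cost of connecting model j
    after model i (None where no value is stored, e.g. for i == j).
    """
    n = len(cost)
    best = [float("inf")] * n      # best known cumulative cost to each model
    prev = [None] * n              # predecessor on the best known sequence
    best[start] = 0.0
    heap = [(0.0, start)]
    while heap:
        d, i = heapq.heappop(heap)
        if d > best[i]:
            continue               # stale heap entry
        if i == end:
            break
        for j in range(n):
            c = cost[i][j]
            if j == i or c is None:
                continue
            nd = d + c
            if nd < best[j]:
                best[j] = nd
                prev[j] = i
                heapq.heappush(heap, (nd, j))
    if best[end] == float("inf"):
        return float("inf"), []    # end model is unreachable
    seq = [end]
    while seq[-1] != start:
        seq.append(prev[seq[-1]])
    seq.reverse()
    return best[end], seq

# Illustrative cost table for four models (values are made up).
cost = [
    [None, 0.1, 5.0, 9.0],
    [4.0, None, 0.2, 6.0],
    [7.0, 3.0, None, 0.3],
    [8.0, 2.0, 1.0, None],
]
total, sequence = min_cost_model_sequence(cost, start=0, end=3)
```

In this example the direct connection from model 0 to model 3 is expensive, so the sketch returns the longer sequence through models 1 and 2, whose connection costs are small.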

The generation model sequence calculation unit 25 supplies the generation model sequence to the time-series data generation unit 26.

The time-series data generation unit 26 gives the current data from the current data supply unit 21 to a learning model (the start point model) constituting the generation model sequence from the generation model sequence calculation unit 25, thereby causing each learning model constituting the generation model sequence to generate time-series data.

The time-series data generation unit 26 then connects the time-series data generated by the learning models constituting the generation model sequence (hereinafter also referred to as model generation data) in the order of the learning models in the generation model sequence, and supplies the resulting generated time-series data to the time-series data output unit 27.

Note that, before giving the current data from the current data supply unit 21 to the learning models constituting the generation model sequence to generate model generation data, the time-series data generation unit 26 determines, for each learning model constituting the generation model sequence, the initial value of the internal state of the learning model so as to reduce the error between the data sequence of the last part of the time-series data (model generation data) generated by that learning model and the data sequence of the first part of the time-series data generated by the learning model connected immediately after it.

The time-series data generation unit 26 then gives that initial value to the learning model and generates time-series data (model generation data). As a result, the generated time-series data, obtained by connecting the model generation data generated by the learning models constituting the generation model sequence in the order of the learning models in that sequence, is smooth time-series data.

[Detailed Configuration Example of Learning Device 10]
FIG. 2 shows a more detailed configuration example of the learning device 10 of FIG. 1.

In FIG. 2, the teacher data dividing unit 12 is assumed to divide the teacher data into a plurality of pieces of model learning data, N pieces in all.

Here, the n-th of the N pieces of model learning data in time-series order is hereinafter also written as model learning data #n.

When the teacher data dividing unit 12 divides the teacher data into N pieces of model learning data #1, #2, ..., #N as described above, the model learning data storage unit 13 stores those N pieces of model learning data #1 to #N.

The learning unit 14 includes a data allocation unit 41, N computation units 42_1, 42_2, ..., 42_N, equal in number to the model learning data #1 to #N, and the like.

The data allocation unit 41 reads the N pieces of model learning data #1 to #N stored in the model learning data storage unit 13. The data allocation unit 41 then allocates the N pieces of model learning data #1 to #N to the N learning models #1 to #N so that one piece of model learning data #n is allocated to one learning model #n.

The data allocation unit 41 then supplies the model learning data #n to the computation unit 42_n, which computes the model parameters of the learning model #n.

The computation unit 42_n learns the time-series pattern with the learning model #n using the model learning data #n allocated to that learning model, thereby obtaining (computing) the model parameters #n that define the learning model #n, and supplies them to the model parameter storage unit 15.

The model parameter storage unit 15 stores the model parameters #1 to #N, which define the learning models #1 to #N respectively, supplied from the computation units 42_1 to 42_N of the learning unit 14.

The connectivity calculation unit 16 includes a model pair selection unit 51, a model parameter supply unit 52, two recognition generation units 53 and 54, a connectivity computation unit 55, and the like.

The model pair selection unit 51 selects an ordered arrangement (permutation) of any two of the N learning models #1 to #N as a model pair and supplies it to the model parameter supply unit 52.

That is, the model pair selection unit 51 sequentially selects each of the N learning models #1 to #N as the model of interest. For the model of interest, the model pair selection unit 51 further selects one of the N learning models #1 to #N other than the model of interest as the rear model, which is to be connected after the model of interest. The model pair selection unit 51 then supplies the ordered arrangement (permutation) of the model of interest and the rear model to the model parameter supply unit 52 as a model pair.

The model parameter supply unit 52 reads, from the model parameter storage unit 15, the model parameters of the two learning models constituting the model pair from the model pair selection unit 51. Of the model parameters read from the model parameter storage unit 15, the model parameter supply unit 52 supplies those of the first learning model in the ordered pair (hereinafter also referred to as the front model) to the recognition generation unit 53.

The model parameter supply unit 52 also supplies, of the model parameters read from the model parameter storage unit 15, those of the rear model (the second learning model in the ordered pair) to the recognition generation unit 54.

The recognition generation unit 53 generates the front model by setting the model parameters of the front model, supplied from the model parameter supply unit 52, in a learning model (in object-oriented programming terms, for example, it creates an instance of the learning model as the front model).

The recognition generation unit 53 also reads the model learning data allocated to the front model from the model learning data storage unit 13 and gives it to the front model, thereby generating model generation data, which is time-series data, from the front model.

Here, in the present embodiment, the learning model has an internal state, and an initial value of the internal state is given to the learning model when time-series data (model generation data) is generated. The model generation data generated from the learning model depends on the initial value of the internal state. The recognition generation unit 53 determines (updates) the initial value of the internal state given to the front model so as to reduce the error (hereinafter also referred to as the connection error) between the data sequence (a plurality of samples) of the last part of the model generation data generated by the front model and the data sequence of the first part of the model generation data that the recognition generation unit 54 generates from the rear model.

The recognition generation unit 53 then gives the front model the initial value of the internal state obtained once the connection error has become small, generates model generation data from the front model, and supplies it to the connectivity computation unit 55.

The recognition generation unit 54 generates the rear model by setting the model parameters of the rear model, supplied from the model parameter supply unit 52, in a learning model (in object-oriented programming terms, for example, it creates an instance of the learning model as the rear model).

The recognition generation unit 54 also reads the model learning data allocated to the rear model from the model learning data storage unit 13 and gives it to the rear model, thereby generating model generation data, which is time-series data, from the rear model.

Here, like the recognition generation unit 53, the recognition generation unit 54 determines (updates) the initial value of the internal state given to the rear model so as to reduce the error (connection error) between the data sequence of the first part of the model generation data generated by the rear model and the data sequence of the last part of the model generation data that the recognition generation unit 53 generates from the front model.

The recognition generation unit 54 then gives the rear model the initial value of the internal state obtained once the connection error has become small, generates model generation data from the rear model, and supplies it to the connectivity computation unit 55.

The connectivity computation unit 55 obtains the connection error between the data sequence of the last part of the model generation data generated from the front model, supplied by the recognition generation unit 53, and the data sequence of the first part of the model generation data generated from the rear model, supplied by the recognition generation unit 54. The connectivity computation unit 55 then supplies that connection error to the connectivity storage unit 17 as the connectivity of the rear model with respect to the front model.

Here, the connectivity of learning model #j with respect to learning model #i is written c_ij (i = 1, 2, ..., N; j = 1, 2, ..., N; i ≠ j).

The connectivity storage unit 17 stores the N × N − N connectivities c_ij for the N learning models, supplied from the connectivity calculation unit 16 (specifically, its connectivity computation unit 55).
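The count N × N − N follows from enumerating every ordered pair of distinct learning models. The sketch below illustrates this enumeration with the standard library; the function name `model_pairs` and the 1-based indices are illustrative choices, not taken from the text.

```python
from itertools import permutations

def model_pairs(n_models):
    """Enumerate ordered (first, second) pairs of model indices 1..N with first != second."""
    return list(permutations(range(1, n_models + 1), 2))

# With N = 4 learning models there are 4 * 4 - 4 = 12 ordered pairs.
pairs = model_pairs(4)
```

Because the pairs are ordered, both (1, 2) and (2, 1) appear, which matches the fact that c_ij and c_ji are stored as separate values.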

[Description of Learning Model]
Next, the learning model used for learning in the learning device 10 of FIG. 1 will be described.

As the learning model, a dynamical system approximation model that has an internal state can be adopted from among models capable of approximating a dynamical system (dynamical system approximation models).

An example of a dynamical system approximation model having an internal state is an RNN.

FIG. 3 shows a configuration example of an RNN.

Here, when data is input to a system, the data input to the system is called input data, and the data that the system outputs in response is called output data.

In FIG. 3, the RNN consists of three layers: an input layer, a hidden layer (intermediate layer), and an output layer. Each of the input layer, the hidden layer, and the output layer consists of an arbitrary number of units, each corresponding to a neuron.

In the RNN, input data x_t is input (supplied) from outside to the input units, which are some of the units in the input layer. Here, the input data x_t represents the sample (value) at time t.

The remaining units of the input layer, other than the input units to which the input data x_t is input, are context units. The outputs of some of the units in the output layer are fed back to the context units as context representing the internal state.

Here, the context at time t, which is input to the context units of the input layer when the input data x_t at time t is input to the input units of the input layer, is written c_t.

The units of the hidden layer take a weighted sum, using predetermined weights, of the input data x_t and the context c_t input to the input layer, apply a nonlinear function to the result of the weighted sum, and output the result to the units of the output layer.

In the units of the output layer, the same processing as in the units of the hidden layer is performed on the data output by the units of the hidden layer. Then, as described above, some of the units in the output layer output the context c_{t+1} for the next time t+1, which is fed back to the input layer. The remaining units of the output layer output, for example, the output data for the input data x_t.

The RNN is trained by, for example, giving the RNN the sample at time t of certain time-series data as input data, giving the sample at the next time t+1 of that time-series data as the true value of the output data, and reducing the error of the output data with respect to the true value.

In an RNN trained in this way, the predicted value x*_{t+1} of the input data x_{t+1} at time t+1, the time following that of the input data x_t, is output as the output data for the input data x_t.
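One time step of such a network can be sketched as follows. This is a minimal illustration, assuming tanh as the nonlinear function, no bias terms, and randomly chosen weights; the function name `rnn_step`, the weight names, and the layer sizes are all hypothetical and are not specified in the text.

```python
import numpy as np

def rnn_step(x_t, c_t, W_xh, W_ch, W_hy):
    """One time step: returns (predicted x_{t+1}, context c_{t+1})."""
    # Hidden layer: weighted sum of input data and context, then a nonlinearity.
    h = np.tanh(W_xh @ x_t + W_ch @ c_t)
    # Output layer: the same kind of processing applied to the hidden outputs.
    out = np.tanh(W_hy @ h)
    n_x = x_t.shape[0]
    # Some output units carry the prediction, the rest carry the next context.
    return out[:n_x], out[n_x:]

rng = np.random.default_rng(0)
n_x, n_c, n_h = 2, 3, 5                     # input, context, hidden unit counts (assumed)
W_xh = rng.normal(size=(n_h, n_x))          # input units -> hidden units
W_ch = rng.normal(size=(n_h, n_c))          # context units -> hidden units
W_hy = rng.normal(size=(n_x + n_c, n_h))    # hidden units -> output units
x_pred, c_next = rnn_step(np.array([0.1, -0.2]), np.zeros(n_c), W_xh, W_ch, W_hy)
```

Feeding `c_next` back in as `c_t` on the following call corresponds to the feedback from the output layer to the context units.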

As described above, in the RNN the inputs to a unit are combined by weighted addition, and the weights used for this weighted addition are the model parameters of the RNN. The weights serving as the model parameters of the RNN include the weights from the input units to the hidden-layer units, the weights from the context units to the hidden-layer units, and the weights from the hidden-layer units to the output-layer units.

When such an RNN is adopted as the learning model, model learning data, which is time-series data (the model learning data allocated to the learning model), is given as the input data and as the true values of the output data when the RNN is trained.

In training the RNN, the weights that reduce the prediction error of the predicted value of the sample at time t+1, which the RNN outputs as output data when given the sample at time t of the model learning data #n (the t-th sample from the beginning) as input data, are obtained by, for example, the BPTT (Back-Propagation Through Time) method.
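As one way to make the prediction error concrete, the quantity reduced during training could be written as a sum of squared errors over the S samples of the model learning data. The squared Euclidean norm and the symbol E(w) for the error as a function of the weights w are assumptions made here for illustration; the text only states that the prediction error is made small.

```latex
E(w) = \sum_{t=1}^{S-1} \left\lVert x^{*}_{t+1}(w) - x_{t+1} \right\rVert^{2}
```

Here x_{t+1} is the (t+1)-th sample of the model learning data, used as the true value, and x^{*}_{t+1}(w) is the value the RNN predicts from the sample x_t.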

Also, when the RNN is trained, the initial value of the context (hereinafter also referred to as the initial context) is determined (updated) in a self-organizing manner so that, for example, the error of the output data for the input data (the output data that the RNN outputs for the input data) with respect to the true value of the output data becomes small.

Here, being determined in a self-organizing manner means being determined spontaneously, so to speak, without external control.

Note that time-series data (model generation data) is generated from the RNN by giving the RNN externally supplied data as input data, or by giving the RNN its own output data as input data.
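The two ways of driving generation, with external data or with the model's own output, can be sketched with a generic loop. This is a minimal sketch under assumptions: `step` stands for any hypothetical one-step model that maps an input and a context to a prediction and a next context, and the toy `step` used in the example is not an RNN.

```python
import numpy as np

def generate(step, x0, c0, n_steps, external=None):
    """Generate n_steps samples from a one-step model with internal state.

    step(x, c) -> (x_next_pred, c_next) is a hypothetical one-step model.
    While samples remain in `external` they are used as input data;
    afterwards the model's own output is fed back as the next input.
    """
    external = [] if external is None else list(external)
    x = np.asarray(x0, dtype=float)
    c = np.asarray(c0, dtype=float)
    outputs = []
    for t in range(n_steps):
        if t < len(external):
            x = np.asarray(external[t], dtype=float)  # externally supplied input
        x_pred, c = step(x, c)
        outputs.append(x_pred)
        x = x_pred                                    # feed the output back as input
    return np.stack(outputs)

# Toy one-step model (illustrative only): halves the input, keeps the context.
toy_step = lambda x, c: (0.5 * x + c, c)
closed_loop = generate(toy_step, x0=[1.0], c0=[0.0], n_steps=3)
driven = generate(toy_step, x0=[1.0], c0=[0.0], n_steps=3, external=[[2.0]])
```

In the second call the first step is driven by the external sample 2.0, and the remaining steps run on the model's own output.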

In the following, the learning model is assumed to be an RNN.

[Description of Teacher Data Division and Learning Model Learning]
With reference to FIG. 4, the division of the teacher data by the teacher data dividing unit 12 (FIG. 1) and the learning of the learning models using the model learning data obtained by that division will be described.

FIG. 4 shows teacher data and the allocation, to learning models, of the model learning data obtained by dividing the teacher data.

In FIG. 4, the teacher data is a time series of vectors having two components.

To have a plurality of learning models share the learning of the teacher data, the teacher data dividing unit 12 (FIG. 1) divides the teacher data into pieces of model learning data of S samples each, with adjacent pieces overlapping by L samples (S > L).

In FIG. 4, the teacher data is divided into four pieces of model learning data #1 to #4.

Here, the L samples of a piece of model learning data that overlap with an adjacent piece of model learning data are hereinafter also referred to as the overlap portion of the model learning data.

In model learning data, which is a time series of S samples, the first L samples and the last L samples are overlap portions. (Strictly speaking, in the first piece of model learning data divided from the teacher data only the last L samples are an overlap portion, and in the last piece only the first L samples are an overlap portion.)
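The division into S-sample pieces that overlap by L samples can be sketched as a sliding window with stride S − L. This is a minimal sketch; the function name `split_with_overlap` is hypothetical, and dropping any trailing samples that do not fill a complete piece is an assumption, since the text does not say how a remainder is handled.

```python
def split_with_overlap(teacher, S, L):
    """Split a sequence into S-sample pieces whose neighbours share L samples."""
    if not 0 < L < S:
        raise ValueError("require 0 < L < S")
    stride = S - L                      # each piece starts S - L samples after the previous one
    pieces = []
    start = 0
    while start + S <= len(teacher):    # assumption: an incomplete trailing piece is dropped
        pieces.append(teacher[start:start + S])
        start += stride
    return pieces

# 13 samples, S = 4, L = 1 gives four pieces, as in the four-piece example.
teacher = list(range(13))
pieces = split_with_overlap(teacher, S=4, L=1)
```

With these values the last sample of each piece equals the first sample of the next, which is the overlap portion described above.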

The learning unit 14 (FIG. 1) allocates model learning data #1 to learning model #1, model learning data #2 to learning model #2, model learning data #3 to learning model #3, and model learning data #4 to learning model #4.

The learning unit 14 then learns the time-series pattern with the learning model #n using the model learning data #n allocated to that learning model, thereby acquiring the time-series pattern, as the dynamics of the model learning data #n, in the form of a function approximation model of a time evolution equation in accordance with the learning rule of the learning model #n.

That is, when the learning model #n is an RNN, the learning unit 14 uses the model learning data #n to obtain the weights that are the model parameters of the RNN by, for example, the BPTT method (for example, the weights that reduce the prediction error of the predicted value of the sample at time t+1, which the RNN outputs as output data when given the sample at time t of the model learning data #n as input data).

Therefore, looking at two learning models #n and #n+1 to which adjacent (consecutive) pieces of model learning data #n and #n+1 are respectively allocated, the learning unit 14 trains learning model #n+1 using model learning data #n+1, whose first overlap portion of L samples matches the last overlap portion of L samples of the model learning data #n used to train learning model #n.

[Method of Calculating Connectivity]
With reference to FIG. 5, the method by which the connectivity calculation unit 16 (FIG. 1) calculates the connectivity will be described.

The connectivity calculation unit 16 obtains the connectivity, which represents how well (how appropriately) the time-series patterns, stored as dynamics in each of the learning models #1 to #N through learning by the learning unit 14, connect to one another.

That is, the connectivity calculation unit 16 selects, from the plurality of learning models #1 to #N, an ordered arrangement (permutation) of two learning models #i and #j (i ≠ j) as a model pair.

The connectivity calculation unit 16 then repeats what may be called forward propagation and back propagation of the overlap portions, which are partial data sequences (a plurality of samples) of the model generation data #i and #j generated by the learning models #i and #j constituting the model pair. In this way, the connectivity calculation unit 16 obtains the initial contexts of the learning models #i and #j (hereinafter also referred to as optimal initial contexts) that make the model generation data #i and #j generated by the learning models #i and #j connect to each other as readily as possible.
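The idea of adjusting the two initial contexts so that the generated sequences join well can be illustrated with a toy optimization. Note that this sketch does not reproduce the forward and back propagation procedure described above: it swaps in finite-difference gradient descent that only accepts improving steps, and it uses a toy state-update model in place of a trained RNN. All names (`rollout`, `overlap_error`, `refine_initial_contexts`), the mean squared error, and the numeric settings are assumptions.

```python
import numpy as np

def rollout(W, c0, n_steps):
    """Toy closed-loop model: s_{t+1} = tanh(W s_t); the first component is the output."""
    s = np.asarray(c0, dtype=float)
    out = []
    for _ in range(n_steps):
        s = np.tanh(W @ s)
        out.append(s[0])
    return np.array(out)

def overlap_error(W_i, W_j, c_i, c_j, S, L):
    """Mean squared error between the last L outputs of model i and the first L of model j."""
    tail = rollout(W_i, c_i, S)[-L:]
    head = rollout(W_j, c_j, S)[:L]
    return float(np.mean((tail - head) ** 2))

def refine_initial_contexts(W_i, W_j, c_i, c_j, S, L, iters=50, lr=0.5, eps=1e-5):
    """Lower the overlap error by finite-difference descent on both initial contexts."""
    n_i = len(c_i)
    z = np.concatenate([c_i, c_j]).astype(float)
    loss = lambda v: overlap_error(W_i, W_j, v[:n_i], v[n_i:], S, L)
    best = loss(z)
    for _ in range(iters):
        grad = np.zeros_like(z)
        for k in range(z.size):             # central-difference gradient estimate
            d = np.zeros_like(z)
            d[k] = eps
            grad[k] = (loss(z + d) - loss(z - d)) / (2 * eps)
        cand = z - lr * grad
        cand_loss = loss(cand)
        if cand_loss < best:                # accept only steps that reduce the error
            z, best = cand, cand_loss
        else:
            lr *= 0.5                       # otherwise shrink the step size
    return z[:n_i], z[n_i:], best

rng = np.random.default_rng(0)
W_i = rng.normal(scale=0.8, size=(3, 3))
W_j = rng.normal(scale=0.8, size=(3, 3))
c_i0, c_j0 = rng.normal(size=3), rng.normal(size=3)
S, L = 8, 2
before = overlap_error(W_i, W_j, c_i0, c_j0, S, L)
c_i_opt, c_j_opt, after = refine_initial_contexts(W_i, W_j, c_i0, c_j0, S, L)
```

Because a step is kept only when it lowers the error, the refined error is never larger than the starting error, although the result is in general only a local improvement.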

ここで、モデル生成データのオーバラップ部分とは、学習モデルの学習に用いられたモデル学習用データのオーバラップ部分に相当する部分である。 Here, the overlap part of the model generation data is a part corresponding to the overlap part of the model learning data used for learning the learning model.

すなわち、図４で説明したように、学習モデルの学習は、オーバラップ部分を有するＳサンプルのモデル学習用データを用いて行われる。したがって、学習モデルから、Sサンプルの時系列を、モデル生成データとして生成させた場合、そのモデル生成データは、学習に用いられたモデル学習用データのオーバラップ部分に相当する部分を有する。この、モデル生成データが有する、モデル学習用データのオーバラップ部分に相当する部分が、モデル生成データのオーバラップ部分である。 That is, as described with reference to FIG. 4, learning of the learning model is performed using model learning data of S samples having overlapping portions. Therefore, when a time series of S samples is generated as model generation data from a learning model, the model generation data has a portion corresponding to an overlap portion of model learning data used for learning. The part corresponding to the overlap part of the model learning data included in the model generation data is the overlap part of the model generation data.

コネクティビティ算出部１６は、最適初期コンテキストを求めた後、学習モデル#iと#jに、それぞれの最適初期コンテキストを与えて、モデル生成データ#iと#jを生成する。 After calculating the optimal initial context, the connectivity calculation unit 16 assigns the respective optimal initial contexts to the learning models #i and #j to generate model generation data #i and #j.

そして、コネクティビティ算出部１６は、モデルペアを構成する前モデル、つまり、モデルペアの１番目の学習モデル#iが生成したモデル生成データ#iの最後のオーバラップ部分（最後のLサンプル）と、後モデル、つまり、モデルペアの２番目の学習モデル#jが生成したモデル生成データ#jの最初のオーバラップ部分（最初のLサンプル）との誤差（接続誤差）を、前モデルとしての学習モデル#iに対する、後モデルとしての学習モデル#jのコネクティビティc_ijとして求める。 Then, the connectivity calculation unit 16 obtains the error (connection error) between the last overlap portion (the last L samples) of the model generation data #i generated by the previous model, that is, the first learning model #i of the model pair, and the first overlap portion (the first L samples) of the model generation data #j generated by the subsequent model, that is, the second learning model #j of the model pair. This error is taken as the connectivity c_ij of the learning model #j as the subsequent model with respect to the learning model #i as the previous model.

ここで、学習モデルの学習では、図４で説明したように、２つの学習モデル#n及び#n+1に注目した場合、学習モデル#n+1の学習は、最初のオーバラップ部分としてのLサンプルが、学習モデル#nの学習に用いられるモデル学習用データ#nの最後のオーバラップ部分としてのLサンプルに一致しているモデル学習用データ#n+1を用いて行われる。 Here, in the learning of the learning models, as described with reference to FIG. 4, when attention is paid to two learning models #n and #n+1, the learning model #n+1 is trained using model learning data #n+1 whose first overlap portion of L samples matches the last overlap portion of L samples of the model learning data #n used for learning the learning model #n.

この場合、学習モデル#nが生成するモデル生成データ#nの最後のオーバラップ部分としてのLサンプルと、学習モデル#n+1が生成するモデル生成データ#n+1の最初のオーバラップ部分としてのLサンプルとは、理想的には、一致し、誤差（接続誤差）は0となる。 In this case, the L sample as the last overlap part of model generation data #n generated by learning model #n and the first overlap part of model generation data # n + 1 generated by learning model # n + 1 Ideally, the L samples match, and the error (connection error) is zero.

一方、最初のオーバラップ部分が、学習モデル#nの学習に用いられるモデル学習用データ#nの最後のオーバラップ部分に一致していないモデル学習用データ#n'（n'≠n,n'≠n-1,n'≠n+1）を用いて学習が行われた学習モデル#n'が生成するモデル生成データ#n'の最初のオーバラップ部分は、学習モデル#nが生成するモデル生成データ#nの最後のオーバラップ部分とは一致しない。そして、その一致しない度合い（程度）は、学習モデル#n'の学習に用いられたモデル学習用データ#n'の最初のオーバラップ部分と、学習モデル#nの学習に用いられるモデル学習用データ#nの最後のオーバラップ部分とが一致しない程度に相当する。 On the other hand, consider a learning model #n' trained using model learning data #n' (n'≠n, n'≠n-1, n'≠n+1) whose first overlap portion does not match the last overlap portion of the model learning data #n used for learning the learning model #n. The first overlap portion of the model generation data #n' generated by that learning model #n' does not match the last overlap portion of the model generation data #n generated by the learning model #n. The degree of this mismatch corresponds to the degree to which the first overlap portion of the model learning data #n' used for learning the learning model #n' does not match the last overlap portion of the model learning data #n used for learning the learning model #n.

以上から、モデルペアを構成する学習モデル#i及び#jについては、学習モデル#iが生成したモデル生成データ#iの最後のオーバラップ部分と、学習モデル#jが生成したモデル生成データ#jの最初のオーバラップ部分とが一致する（一致しない）度合いは、学習モデル#iの学習に用いられたモデル学習用データ#iの最後のオーバラップ部分と、学習モデル#jの学習に用いられたモデル学習用データ#jの最初のオーバラップ部分とが一致する（一致しない）程度に相当する。 From the above, for the learning models #i and #j constituting a model pair, the degree to which the last overlap portion of the model generation data #i generated by the learning model #i matches (or does not match) the first overlap portion of the model generation data #j generated by the learning model #j corresponds to the degree to which the last overlap portion of the model learning data #i used for learning the learning model #i matches (or does not match) the first overlap portion of the model learning data #j used for learning the learning model #j.

ここで、学習モデル#iが生成したモデル生成データ#iの最後のオーバラップ部分と、学習モデル#jが生成したモデル生成データ#jの最初のオーバラップ部分とが一致しない度合いとしての誤差（接続誤差）を、学習モデル#iが学習した時系列パターンの時系列データの直後に、学習モデル#jが学習した時系列パターンの時系列データが接続する接続性を表すコネクティビティとして採用するために、教師データ分割部１２（図１）では、教師データが、オーバラップ部分を有するモデル学習用データに分割される。 Here, the teacher data dividing unit 12 (FIG. 1) divides the teacher data into model learning data having overlap portions so that the error (connection error), that is, the degree to which the last overlap portion of the model generation data #i generated by the learning model #i does not match the first overlap portion of the model generation data #j generated by the learning model #j, can be adopted as the connectivity. The connectivity represents how well time-series data of the time-series pattern learned by the learning model #j connects immediately after time-series data of the time-series pattern learned by the learning model #i.

すなわち、教師データ分割部１２において、教師データを、オーバラップ部分を有するモデル学習用データに分割するのは、コネクティビティを算出するためである。 That is, the reason why the teacher data dividing unit 12 divides the teacher data into model learning data having an overlap portion is to calculate connectivity.

図５を参照して、コネクティビティ算出部１６（図１）によるコネクティビティの算出について、さらに説明する。 With reference to FIG. 5, the calculation of connectivity by the connectivity calculation unit 16 (FIG. 1) will be further described.

コネクティビティ算出部１６は、N個の学習モデル#1ないし#Nのうちの１つの学習モデルから、前モデルとなる学習モデル#iを選択するとともに、その学習モデル#i以外の学習モデル#jを、後モデルとして選択する。 The connectivity calculation unit 16 selects one of the N learning models #1 to #N as the learning model #i to serve as the previous model, and selects a learning model #j other than the learning model #i as the subsequent model.

そして、コネクティビティ算出部１６は、前モデルである学習モデル#iの入力データの最初の１サンプルとして、学習モデル#iに割り当てられたモデル学習用データ#iの最初の１サンプルを設定する。 Then, the connectivity calculation unit 16 sets the first one sample of the model learning data #i assigned to the learning model #i as the first one sample of the input data of the learning model #i that is the previous model.

さらに、コネクティビティ算出部１６は、後モデルである学習モデル#jの出力データの最後の１サンプルの真値として、学習モデル#jに割り当てられたモデル学習用データ#jの最後の１サンプルを設定する。 Further, the connectivity calculation unit 16 sets the last one sample of the model learning data #j assigned to the learning model #j as the true value of the last one sample of the output data of the learning model #j, which is the subsequent model.

また、コネクティビティ算出部１６は、前モデルである学習モデル#iと、後モデルである学習モデル#jのそれぞれの初期コンテキストとして、ランダムな値を設定する。 In addition, the connectivity calculation unit 16 sets random values as initial contexts of the learning model #i that is the previous model and the learning model #j that is the subsequent model.

そして、コネクティビティ算出部１６は、前モデルである学習モデル#iに、入力データと初期コンテキストを与えて、例えば、モデル学習用データ#iと同一の長さのSサンプルのモデル生成データ#iを生成する。 Then, the connectivity calculation unit 16 gives input data and initial context to the learning model #i, which is the previous model, and, for example, generates model generation data #i of S samples having the same length as the model learning data #i. Generate.

前モデルである学習モデル#iから、Sサンプルのモデル生成データ#iを生成した後、コネクティビティ算出部１６は、そのモデル生成データ#iの最後のオーバラップ部分であるLサンプルを、後モデルである学習モデル#jの入力データの最初のLサンプルとして設定する。 After generating the model generation data #i of S samples from the learning model #i, which is the previous model, the connectivity calculation unit 16 sets the L samples that form the last overlap portion of the model generation data #i as the first L samples of the input data of the learning model #j, which is the subsequent model.

そして、コネクティビティ算出部１６は、後モデルである学習モデル#jに、入力データと初期コンテキストを与えて、例えば、モデル学習用データ#jと同一の長さのSサンプルのモデル生成データ#jを生成する。 Then, the connectivity calculation unit 16 gives the input data and the initial context to the learning model #j, which is the subsequent model, and, for example, generates the model generation data #j of S samples having the same length as the model learning data #j. Generate.

ここで、以上のように、前モデルである学習モデル#iから生成されたモデル生成データ#iの最後のオーバラップ部分であるLサンプルを、後モデルである学習モデル#jの入力データの最初のLサンプルとして設定し、後モデルである学習モデル#jから、モデル生成データ#jを生成することが、上述した、オーバラップ部分の順伝播である。 Here, as described above, the L sample that is the last overlap part of the model generation data #i generated from the learning model #i that is the previous model is used as the first input data of the learning model #j that is the subsequent model. The generation of the model generation data #j from the learning model #j, which is the subsequent model, is the above-described forward propagation of the overlap portion.

後モデルである学習モデル#jから、Sサンプルのモデル生成データ#jを生成した後、コネクティビティ算出部１６は、そのモデル生成データ#jの最後のサンプルの、後モデルの出力データの最後の１サンプルの真値（上述したように、学習モデル#jに割り当てられたモデル学習用データ#jの最後の１サンプル）に対する予測誤差を求める。 After generating the model generation data #j of S samples from the learning model #j, which is the subsequent model, the connectivity calculation unit 16 obtains the prediction error of the last sample of the model generation data #j with respect to the true value of the last one sample of the output data of the subsequent model (as described above, the last one sample of the model learning data #j assigned to the learning model #j).

そして、コネクティビティ算出部１６は、モデル生成データ#jの最後の１サンプルの予測誤差を、例えば、BPTT法に基づき、モデル生成データ#jの最初の１サンプルまで逆伝播（誤差の逆伝播）することで、その予測誤差を小さくするように、後モデルである学習モデル#jの初期コンテキストを更新する。 Then, the connectivity calculation unit 16 back propagates the prediction error of the last one sample of the model generation data #j to the first one sample of the model generation data #j based on the BPTT method, for example. Thus, the initial context of the learning model #j, which is the subsequent model, is updated so as to reduce the prediction error.

学習モデル#jの初期コンテキストの更新後、コネクティビティ算出部１６は、学習モデル#jに、入力データ（上述したように、前モデルである学習モデル#iから生成されたモデル生成データ#iの最後のオーバラップ部分であるLサンプル）と、更新後の初期コンテキストを与えて、Sサンプルのモデル生成データ#jを生成する。 After updating the initial context of the learning model #j, the connectivity calculation unit 16 gives the learning model #j the input data (as described above, the L samples that form the last overlap portion of the model generation data #i generated from the learning model #i, which is the previous model) and the updated initial context, and generates the model generation data #j of S samples.

さらに、コネクティビティ算出部１６は、後モデルである学習モデル#jから生成されたモデル生成データ#jの最初のオーバラップ部分であるLサンプルを、前モデルである学習モデル#iの最後のLサンプルの真値として設定する。 Furthermore, the connectivity calculation unit 16 uses the L sample that is the first overlap portion of the model generation data #j generated from the learning model #j that is the subsequent model as the last L sample of the learning model #i that is the previous model. Set as the true value of.

その後、コネクティビティ算出部１６は、前モデルである学習モデル#iから生成されたモデル生成データ#iの最後のLサンプルの、前モデルの出力データの最後のLサンプルの真値（上述したように、初期コンテキストの更新後の学習モデル#jから生成されたモデル生成データ#jの最初のオーバラップ部分であるLサンプル）に対する予測誤差を求める。 Thereafter, the connectivity calculation unit 16 calculates the true value of the last L sample of the output data of the previous model (as described above) of the last L sample of the model generation data #i generated from the learning model #i that is the previous model. Then, a prediction error is obtained for the L sample that is the first overlap portion of the model generation data #j generated from the learning model #j after the initial context update.

そして、コネクティビティ算出部１６は、モデル生成データ#iの最後のLサンプルの予測誤差を、例えば、BPTT法に基づき、モデル生成データ#iの最初の１サンプルまで逆伝播（誤差の逆伝播）することで、その予測誤差を小さくするように、前モデルである学習モデル#iの初期コンテキストを更新する。 Then, the connectivity calculation unit 16 back-propagates the prediction error of the last L sample of the model generation data #i to the first sample of the model generation data #i based on the BPTT method, for example. Thus, the initial context of the learning model #i that is the previous model is updated so as to reduce the prediction error.

学習モデル#iの初期コンテキストの更新後、コネクティビティ算出部１６は、学習モデル#iに、入力データ（上述したように、前モデルである学習モデル#iに割り当てられたモデル学習用データ#iの最初の１サンプル）と、更新後の初期コンテキストを与えて、Sサンプルのモデル生成データ#iを生成する。 After updating the initial context of the learning model #i, the connectivity calculation unit 16 gives the learning model #i the input data (as described above, the first one sample of the model learning data #i assigned to the learning model #i, which is the previous model) and the updated initial context, and generates the model generation data #i of S samples.

ここで、以上のように、後モデルである学習モデル#jから生成されたモデル生成データ#jの最初のオーバラップ部分であるLサンプルを、前モデルである学習モデル#iの出力データの最後のLサンプルの真値として設定し、その真値に対する、モデル生成データ#iの最後のLサンプルの予測誤差が小さくなるように、学習モデル#iの初期コンテキストを更新して、モデル生成データ#iを生成することが、上述した、オーバラップ部分の逆伝播である。 Here, the back propagation of the overlap portion mentioned above refers to the following: setting the L samples that form the first overlap portion of the model generation data #j generated from the learning model #j, which is the subsequent model, as the true value of the last L samples of the output data of the learning model #i, which is the previous model; updating the initial context of the learning model #i so that the prediction error of the last L samples of the model generation data #i with respect to that true value becomes small; and generating the model generation data #i.

コネクティビティ算出部１６は、前モデルである学習モデル#iから、Sサンプルのモデル生成データ#iを生成した後、そのモデル生成データ#iの最後のオーバラップ部分であるLサンプルを、後モデルである学習モデル#jの入力データの最初のLサンプルとして設定し、以下、同様の処理を繰り返す。 The connectivity calculation unit 16 generates the S sample model generation data #i from the learning model #i that is the previous model, and then uses the L model that is the last overlap portion of the model generation data #i as the subsequent model. This is set as the first L sample of input data of a learning model #j, and the same processing is repeated thereafter.

そして、コネクティビティ算出部１６は、学習モデル#iから生成されるモデル生成データ#iの最後のLサンプルの予測誤差と、学習モデル#jから生成されるモデル生成データ#jの最後の１サンプルの予測誤差とが、例えば、収束すると、そのとき得られている初期コンテキストを、最適初期コンテキストとする。 The connectivity calculating unit 16 then predicts the prediction error of the last L sample of the model generation data #i generated from the learning model #i and the last one sample of the model generation data #j generated from the learning model #j. When the prediction error converges, for example, the initial context obtained at that time is set as the optimum initial context.

さらに、コネクティビティ算出部１６は、学習モデル#iと#jのそれぞれに最適初期コンテキストを与えて、モデル生成データ#iと#jを生成する。 Furthermore, the connectivity calculation unit 16 gives the optimal initial context to each of the learning models #i and #j, and generates model generation data #i and #j.

そして、コネクティビティ算出部１６は、モデル生成データ#iの最後のLサンプルと、モデル生成データ#jの最初のLサンプルとの誤差（接続誤差）を、学習モデル#iに対する学習モデル#jのコネクティビティc_ijとして求める。 Then, the connectivity calculation unit 16 obtains the error (connection error) between the last L samples of the model generation data #i and the first L samples of the model generation data #j as the connectivity c_ij of the learning model #j with respect to the learning model #i.

以上のように、コネクティビティ算出部１６では、モデル生成データ#iの最後のLサンプルと、モデル生成データ#jの最初のLサンプルとの接続誤差を小さくするように、学習モデル#iと#jそれぞれの初期コンテキストが決定される。そして、その初期コンテキスト（最適初期コンテキスト）を、学習モデル#iと#jに与えて生成されるモデル生成データ#iと#jとの接続誤差が、学習モデル#iに対する学習モデル#jのコネクティビティc_ijとして求められる。 As described above, the connectivity calculation unit 16 determines the initial contexts of the learning models #i and #j so as to reduce the connection error between the last L samples of the model generation data #i and the first L samples of the model generation data #j. The connection error between the model generation data #i and #j generated by giving those initial contexts (the optimal initial contexts) to the learning models #i and #j is then obtained as the connectivity c_ij of the learning model #j with respect to the learning model #i.
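The alternating update of the two initial contexts described above can be sketched roughly as follows. This is only an illustration under several assumptions that are not in the specification: the learning model is a tiny closed-loop RNN with weights `(W_c, W_x, W_y)`, both models share the same context size, the prediction error is a squared error, and a finite-difference gradient stands in for the BPTT method. The names `generate`, `numeric_grad`, and `optimise_pair` are hypothetical.

```python
import numpy as np

def generate(params, forced, c0, S):
    """Generate S samples; the first len(forced) inputs are given, the rest are fed back outputs."""
    W_c, W_x, W_y = params
    c = np.asarray(c0, dtype=float)
    outputs = []
    for t in range(S):
        x = forced[t] if t < len(forced) else outputs[-1]
        c = np.tanh(W_c @ c + W_x @ x)   # context update (assumed model form)
        outputs.append(W_y @ c)          # generated sample at time t
    return np.array(outputs)

def numeric_grad(loss, c0, eps=1e-5):
    """Central finite-difference gradient w.r.t. the initial context (stand-in for BPTT)."""
    grad = np.zeros_like(c0)
    for k in range(c0.size):
        step = np.zeros_like(c0)
        step[k] = eps
        grad[k] = (loss(c0 + step) - loss(c0 - step)) / (2 * eps)
    return grad

def optimise_pair(params_i, params_j, first_i, last_j, S, L, n_iter=50, lr=0.05, seed=0):
    """Alternately update the initial contexts of previous model #i and subsequent model #j."""
    rng = np.random.default_rng(seed)
    H = params_i[0].shape[0]
    c_i, c_j = rng.normal(size=H), rng.normal(size=H)      # random initial contexts
    for _ in range(n_iter):                                # fixed count as the end condition
        y_i = generate(params_i, first_i[None], c_i, S)    # previous model generates
        forced_j = y_i[-L:]                                # forward propagation of the overlap
        loss_j = lambda c: ((generate(params_j, forced_j, c, S)[-1] - last_j) ** 2).sum()
        c_j = c_j - lr * numeric_grad(loss_j, c_j)         # update context of #j
        y_j = generate(params_j, forced_j, c_j, S)
        target_i = y_j[:L]                                 # back propagation of the overlap
        loss_i = lambda c: ((generate(params_i, first_i[None], c, S)[-L:] - target_i) ** 2).sum()
        c_i = c_i - lr * numeric_grad(loss_i, c_i)         # update context of #i
    y_i = generate(params_i, first_i[None], c_i, S)
    y_j = generate(params_j, y_i[-L:], c_j, S)
    c_ij = float(np.abs(y_j[:L] - y_i[-L:]).sum())         # connection error
    return c_i, c_j, c_ij
```

A plain gradient step with a fixed learning rate is not guaranteed to reduce the error at every iteration, so a practical implementation would likely monitor the two prediction errors and stop once they settle.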

なお、コネクティビティc_ij、すなわち、学習モデル#iが生成したモデル生成データ#iの最後のオーバラップ部分のLサンプルと、学習モデル#jが生成したモデル生成データ#jの最初のオーバラップ部分のLサンプルとの接続誤差は、例えば、式（１）に従って計算される。 Note that the connectivity c_ij, that is, the connection error between the L samples of the last overlap portion of the model generation data #i generated by the learning model #i and the L samples of the first overlap portion of the model generation data #j generated by the learning model #j, is calculated, for example, according to Equation (1).

$$c_{ij} = \sum_{t=1}^{L} \left| y_j(t) - y_i(t+T) \right| \tag{1}$$

ここで、式（１）において、Σは、変数tを、1ないしLの整数に変えての総和を表す。また、y_j(t)は、学習モデル#jから生成されるモデル生成データ#jの時刻tのサンプル（モデル生成データ#jの先頭からtサンプル目）を表し、y_i(t+T)は、学習モデル#iから生成されるモデル生成データ#iの時刻t+Tのサンプルを表す。 Here, in Equation (1), Σ represents the summation over the variable t taking integer values from 1 to L. Further, y_j(t) represents the sample at time t of the model generation data #j generated from the learning model #j (the t-th sample from the beginning of the model generation data #j), and y_i(t+T) represents the sample at time t+T of the model generation data #i generated from the learning model #i.

なお、式（１）において、Tは、式S-T=Lを満たす値である。この場合、モデル生成データ#iの最後のオーバラップ部分のLサンプルは、y_i(1+T),y_i(2+T),・・・,y_i(L+T)(=y_i(S))で表される。 In Equation (1), T is a value that satisfies the expression S-T=L. In this case, the L samples of the last overlap portion of the model generation data #i are represented by y_i(1+T), y_i(2+T), ..., y_i(L+T) (=y_i(S)).

また、式（１）のコネクティビティc_ijは、値が小さいほど、学習モデル#iから生成されるモデル生成データ#iの直後に、学習モデル#jから生成されたモデル生成データ#jが続く（接続する）ことが、より適切である（自然である）ことを表す。 Further, the smaller the value of the connectivity c _ij in the equation (1), the model generation data #j generated from the learning model #j immediately follows the model generation data #i generated from the learning model #i ( Connecting) is more appropriate (natural).
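As a minimal sketch of Equation (1), the connection error can be computed by comparing the last L samples of one generated sequence with the first L samples of another. The function name `connectivity` is hypothetical, and summing absolute differences over all components of multi-dimensional samples is an assumption; the specification only writes |·| per sample.

```python
import numpy as np

def connectivity(y_i, y_j, L):
    """Connection error c_ij between the tail of y_i and the head of y_j (Equation (1))."""
    y_i = np.asarray(y_i, dtype=float)
    y_j = np.asarray(y_j, dtype=float)
    tail = y_i[-L:]   # y_i(1+T), ..., y_i(S), where T = S - L
    head = y_j[:L]    # y_j(1), ..., y_j(L)
    return float(np.abs(head - tail).sum())

# Toy sequences with S = 5 and L = 2 (illustrative values only).
y_a = [0, 1, 2, 3, 4]
y_b = [3, 4, 5, 6, 7]   # starts where y_a ends, so it connects with zero error
y_c = [0, 0, 0, 0, 0]   # does not continue y_a
print(connectivity(y_a, y_b, L=2))  # 0.0
print(connectivity(y_a, y_c, L=2))  # 7.0
```

A smaller value indicates that the second sequence follows the first more naturally, which matches the interpretation of c_ij given above.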

［学習装置１０の動作］ [Operation of Learning Device 10]
図６を参照して、学習装置１０（図１）の処理(学習処理）について説明する。 The process (learning process) of the learning device 10 (FIG. 1) will be described with reference to FIG. 6.

学習処理では、ステップＳ１１において、教師データ保存部１１が、教師データを、教師データ分割部１２に供給して、処理は、ステップＳ１２に進む。 In the learning process, in step S11, the teacher data storage unit 11 supplies the teacher data to the teacher data dividing unit 12, and the process proceeds to step S12.

ステップＳ１２では、教師データ分割部１２は、教師データ保存部１１からの時系列データを、例えば、図４で説明したように、Lサンプルのオーバラップ部分を有するN個のモデル学習用データ#1ないし#Nに分割する。さらに、教師データ分割部１２は、N個のモデル学習用データ#1ないし#Nを、モデル学習用データ保存部１３に供給して記憶させ、処理は、ステップＳ１２からステップＳ１３に進む。 In step S12, the teacher data dividing unit 12 divides the time-series data from the teacher data storage unit 11 into N pieces of model learning data #1 to #N having overlap portions of L samples, for example, as described with reference to FIG. 4. Further, the teacher data dividing unit 12 supplies the N pieces of model learning data #1 to #N to the model learning data storage unit 13 for storage, and the process proceeds from step S12 to step S13.
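The division in step S12 might look like the following sketch, assuming fixed-length segments of S samples in which each segment shares its first L samples with the last L samples of the preceding segment. The function name `split_with_overlap` and the choice to drop trailing samples that do not fill a whole segment are assumptions made for illustration.

```python
import numpy as np

def split_with_overlap(teacher, S, L):
    """Split a (length, dim) sequence into S-sample segments overlapping by L samples."""
    if not 0 < L < S:
        raise ValueError("the overlap L must satisfy 0 < L < S")
    stride = S - L  # each segment starts S - L samples after the previous one
    return [teacher[start:start + S]
            for start in range(0, len(teacher) - S + 1, stride)]

# Toy teacher data: 20 one-dimensional samples (illustrative values only).
teacher = np.arange(20, dtype=float).reshape(-1, 1)
segments = split_with_overlap(teacher, S=8, L=2)
print(len(segments))               # 3
print(segments[0][-2:].ravel())    # [6. 7.]
print(segments[1][:2].ravel())     # [6. 7.]
```

Here the last two samples of segment #1 coincide with the first two samples of segment #2, which is the property the connectivity calculation relies on.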

ステップＳ１３では、学習部１４が、モデル学習用データを用いて、学習モデルの学習を行う。 In step S13, the learning unit 14 learns a learning model using the model learning data.

すなわち、学習部１４は、１つのモデル学習用データ#nを、１つの学習モデル#nに割り当てるように、モデル学習用データ保存部１３に記憶されたモデル学習用データ#1ないし#Nを、学習モデル#1ないし#Nに割り当てる。さらに、学習部１４は、学習モデル#nによる時系列パターンの学習を、その学習モデル#nに割り当てられたモデル学習用データ#nを用いて行うことで、学習モデル#nを定義するモデルパラメータ#nを求める。そして、学習部１４は、学習モデル#1ないし#Nそれぞれのモデルパラメータ#1ないし#Nを、モデルパラメータ保存部１５に供給する。 That is, the learning unit 14 assigns the model learning data #1 to #N stored in the model learning data storage unit 13 to the learning models #1 to #N so that one piece of model learning data #n is assigned to one learning model #n. Further, the learning unit 14 has the learning model #n learn a time-series pattern using the model learning data #n assigned to it, thereby obtaining the model parameters #n that define the learning model #n. Then, the learning unit 14 supplies the model parameters #1 to #N of the learning models #1 to #N to the model parameter storage unit 15.

その後、処理は、ステップＳ１３からステップＳ１４に進み、モデルパラメータ保存部１５は、学習部１４から供給されるモデルパラメータ#1ないし#Nを記憶して、処理は、ステップＳ１５に進む。 Thereafter, the process proceeds from step S13 to step S14, the model parameter storage unit 15 stores the model parameters # 1 to #N supplied from the learning unit 14, and the process proceeds to step S15.

ステップＳ１５では、コネクティビティ算出部１６が、モデル学習用データ保存部１３に記憶されたモデル学習用データ#1ないし#Nと、モデルパラメータ保存部１５に記憶されたモデルパラメータ#1ないし#Nとを用い、学習部１４で学習が行われた学習モデル#1ないし#Nすべてについて、コネクティビティc_ijを算出するコネクティビティ算出処理を行い、学習処理は、終了する。 In step S15, the connectivity calculation unit 16 performs a connectivity calculation process that calculates the connectivity c_ij for all of the learning models #1 to #N trained by the learning unit 14, using the model learning data #1 to #N stored in the model learning data storage unit 13 and the model parameters #1 to #N stored in the model parameter storage unit 15, and the learning process ends.

［コネクティビティ算出処理の説明］ [Description of Connectivity Calculation Process]
図７ないし図９を参照して、図６のステップＳ１５で行われるコネクティビティ算出処理について説明する。 The connectivity calculation process performed in step S15 of FIG. 6 will be described with reference to FIGS. 7 to 9.

図７は、コネクティビティ算出処理を説明するフローチャートである。 FIG. 7 is a flowchart for explaining connectivity calculation processing.

コネクティビティ算出処理では、ステップＳ２１において、コネクティビティ算出部１６（図１）が、N個の学習モデル#1ないし#Nから、まだ、モデルペアとして選択していない順列となる２つの学習モデル#iと#jの並びを選択して、処理は、ステップＳ２２に進む。 In the connectivity calculation process, in step S21, the connectivity calculation unit 16 (FIG. 1) selects two learning models #i that are permutations not yet selected as model pairs from the N learning models # 1 to #N. The sequence of #j is selected, and the process proceeds to step S22.

すなわち、コネクティビティ算出部１６は、N個の学習モデル#1ないし#Nのうちの１つの学習モデルから、モデルペアの前モデルとなる学習モデル#iを選択するとともに、その学習モデル#i以外の学習モデル#jを、モデルペアの後モデルとして選択する。 That is, the connectivity calculation unit 16 selects one of the N learning models #1 to #N as the learning model #i to serve as the previous model of the model pair, and selects a learning model #j other than the learning model #i as the subsequent model of the model pair.

ステップＳ２２では、コネクティビティ算出部１６は、モデル学習用データ保存部１３（図１）から、モデルペアを構成する２つの学習モデルである前モデルと後モデルのそれぞれに割り当てられたモデル学習用データを読み込み、処理は、ステップＳ２３に進む。 In step S22, the connectivity calculation unit 16 reads, from the model learning data storage unit 13 (FIG. 1), the model learning data assigned to each of the previous model and the subsequent model, which are the two learning models constituting the model pair, and the process proceeds to step S23.

ステップＳ２３では、コネクティビティ算出部１６は、モデルペアを構成する前モデルと後モデルそれぞれのモデルパラメータを、モデルパラメータ保存部１５（図１）から読み出し、処理は、ステップＳ２４に進む。 In step S23, the connectivity calculation unit 16 reads out the model parameters of the previous model and the rear model constituting the model pair from the model parameter storage unit 15 (FIG. 1), and the process proceeds to step S24.

ステップＳ２４では、コネクティビティ算出部１６は、前モデルの入力データの最初の１サンプルとして、前モデルに割り当てられたモデル学習用データの最初の１サンプルを設定して、処理は、ステップＳ２５に進む。 In step S24, the connectivity calculation unit 16 sets the first one sample of the model learning data assigned to the previous model as the first one sample of the input data of the previous model, and the process proceeds to step S25.

ステップＳ２５では、コネクティビティ算出部１６は、前モデルのモデルパラメータを、学習モデルに設定することで、前モデルを生成し（例えば、オブジェクト指向プログラミングにおける、前モデルとしての学習モデルのインスタンスを生成し）、処理は、ステップＳ２６に進む。 In step S25, the connectivity calculation unit 16 generates the previous model by setting the model parameters of the previous model in the learning model (for example, generates an instance of the learning model as the previous model in object-oriented programming). The process proceeds to step S26.

ステップＳ２６では、コネクティビティ算出部１６は、後モデルの出力データの最後の１サンプルの真値として、後モデルに割り当てられたモデル学習用データの最後の１サンプルを設定し、処理は、ステップＳ２７に進む。 In step S26, the connectivity calculation unit 16 sets the last one sample of the model learning data assigned to the subsequent model as the true value of the last one sample of the output data of the subsequent model, and the process proceeds to step S27. move on.

ステップＳ２７では、コネクティビティ算出部１６は、後モデルのモデルパラメータを、学習モデルに設定することで、後モデルを生成し（例えば、オブジェクト指向プログラミングにおける、後モデルとしての学習モデルのインスタンスを生成し）、処理は、ステップＳ２８に進む。 In step S27, the connectivity calculation unit 16 sets the model parameters of the post model in the learning model, thereby generating the post model (for example, generating an instance of the learning model as the post model in object-oriented programming). The process proceeds to step S28.

ステップＳ２８では、コネクティビティ算出部１６は、前モデルと後モデルのそれぞれの初期コンテキストとして、ランダムな値を設定して、処理は、図８のステップＳ３１に進む。 In step S28, the connectivity calculation unit 16 sets a random value as the initial context of each of the previous model and the subsequent model, and the process proceeds to step S31 in FIG.

すなわち、図８は、図７に続くフローチャートである。 That is, FIG. 8 is a flowchart continuing from FIG. 7.

ステップＳ３１では、コネクティビティ算出部１６は、前モデルに、ステップＳ２４（図７）で設定された入力データと、初期コンテキスト（いまの場合、ステップＳ２８で設定された初期コンテキスト）を与えて、モデル生成データを生成し、処理は、ステップＳ３２に進む。 In step S31, the connectivity calculation unit 16 gives the input data set in step S24 (FIG. 7) and the initial context (in this case, the initial context set in step S28) to the previous model to generate a model. Data is generated, and the process proceeds to step S32.

ステップＳ３２では、コネクティビティ算出部１６は、前モデルから生成されたモデル生成データの最後のオーバラップ部分であるLサンプルを、後モデルの入力データの最初のLサンプルとして設定し、処理は、ステップＳ３３に進む。 In step S32, the connectivity calculation unit 16 sets the L sample that is the last overlap part of the model generation data generated from the previous model as the first L sample of the input data of the subsequent model, and the processing is performed in step S33. Proceed to

ステップＳ３３では、コネクティビティ算出部１６は、後モデルに、ステップＳ３２で設定された入力データ（前モデルから生成されたモデル生成データの最後のオーバラップ部分であるLサンプル）と、初期コンテキストを与えて、モデル生成データを生成し、処理は、ステップＳ３４に進む。 In step S33, the connectivity calculation unit 16 gives the subsequent model the input data set in step S32 (L sample which is the last overlap portion of the model generation data generated from the previous model) and the initial context. Then, model generation data is generated, and the process proceeds to step S34.

なお、ステップＳ３３において、後モデルに与えられる初期コンテキストは、モデルペアについて、後述するステップＳ３５の処理が既に行われている場合には、そのステップＳ３５での更新後の初期コンテキストであり、ステップＳ３５の処理が、まだ行われていない場合には、ステップＳ２８（図７）で設定された初期コンテキストである。 In step S33, the initial context given to the subsequent model is the updated initial context in step S35 when the process of step S35 described later has already been performed for the model pair. If the above process has not been performed yet, it is the initial context set in step S28 (FIG. 7).

ステップＳ３４では、コネクティビティ算出部１６は、後モデルから生成されたモデル生成データの最後の１サンプルの、ステップＳ２６（図７）で設定された真値に対する予測誤差を求め、処理は、ステップＳ３５に進む。 In step S34, the connectivity calculation unit 16 obtains a prediction error for the true value set in step S26 (FIG. 7) of the last sample of the model generation data generated from the subsequent model, and the process proceeds to step S35. move on.

ステップＳ３５では、コネクティビティ算出部１６は、ステップＳ３４で求められた予測誤差を、BPTT法に基づき、後モデルから生成されたモデル生成データの最初の１サンプルまで逆伝播することで、その予測誤差を小さくするように、後モデルの初期コンテキストを更新し、処理は、ステップＳ３６に進む。 In step S35, the connectivity calculation unit 16 back-propagates the prediction error obtained in step S34 to the first one sample of the model generation data generated from the subsequent model based on the BPTT method. The initial context of the subsequent model is updated so as to decrease, and the process proceeds to step S36.

ステップＳ３６では、コネクティビティ算出部１６は、後モデルに、ステップＳ３２で設定された入力データと、ステップＳ３５での更新後の初期コンテキストを与えて、モデル生成データを生成して、処理は、ステップＳ３７に進む。 In step S36, the connectivity calculating unit 16 generates the model generation data by giving the input data set in step S32 and the initial context updated in step S35 to the subsequent model, and the processing is performed in step S37. Proceed to

ステップＳ３７では、コネクティビティ算出部１６は、後モデルから生成されたモデル生成データの最初のオーバラップ部分のLサンプルを、前モデルの最後のLサンプルの真値として設定し、処理は、ステップＳ３８に進む。 In step S37, the connectivity calculation unit 16 sets the L sample of the first overlap portion of the model generation data generated from the subsequent model as the true value of the last L sample of the previous model, and the process proceeds to step S38. move on.

ステップＳ３８では、コネクティビティ算出部１６は、前モデルから生成されたモデル生成データの最後のLサンプルの、ステップＳ３７で設定された真値（初期コンテキストの更新後の後モデルから生成されたモデル生成データの最初のオーバラップ部分のLサンプル）に対する予測誤差を求め、処理は、ステップＳ３９に進む。 In step S38, the connectivity calculation unit 16 obtains the prediction error of the last L samples of the model generation data generated from the previous model with respect to the true value set in step S37 (the L samples of the first overlap portion of the model generation data generated from the subsequent model after its initial context was updated), and the process proceeds to step S39.

ステップＳ３９では、コネクティビティ算出部１６は、ステップＳ３８で求められた予測誤差を、例えば、BPTT法に基づき、前モデルから生成されたモデル生成データの最初の１サンプルまで逆伝播することで、その予測誤差を小さくするように、前モデルの初期コンテキストを更新し、処理は、図９のステップＳ４１に進む。 In step S39, the connectivity calculation unit 16 propagates the prediction error obtained in step S38 by back-propagating to the first sample of model generation data generated from the previous model based on, for example, the BPTT method. The initial context of the previous model is updated so as to reduce the error, and the process proceeds to step S41 in FIG.

すなわち、図９は、図８に続くフローチャートである。 That is, FIG. 9 is a flowchart continuing from FIG. 8.

ステップＳ４１では、コネクティビティ算出部１６は、前モデル及び後モデルの初期コンテキストの更新を終了する条件（以下、更新終了条件ともいう）が満たされているかどうかを判定する。 In step S41, the connectivity calculation unit 16 determines whether a condition for ending the update of the initial context of the previous model and the subsequent model (hereinafter also referred to as an update end condition) is satisfied.

ここで、更新終了条件としては、ステップＳ３４及びＳ３８（図８）で求められる予測誤差が、ある程度収束している状態にあることを採用することができる。具体的には、更新終了条件としては、所定の繰り返し回数だけ、モデルペアを構成する前モデル及び後モデルの初期コンテキストの更新が行われたことや、ステップＳ３４及びＳ３８で求められる予測誤差が、前回と今回とで、ほとんど変化しないこと、等を採用することができる。 Here, as the update end condition, it is possible to adopt that the prediction error obtained in steps S34 and S38 (FIG. 8) is in a state of being converged to some extent. Specifically, as the update end condition, the update of the initial context of the previous model and the subsequent model constituting the model pair is performed a predetermined number of times, and the prediction error obtained in steps S34 and S38 is: It can be adopted that there is almost no change between the previous time and this time.
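The two update end conditions mentioned above (a predetermined number of repetitions, or prediction errors that hardly change between the previous and current iteration) could be checked as in the sketch below. The function name `update_finished` and the default values of `max_iterations` and `tol` are hypothetical; the specification does not give concrete thresholds.

```python
def update_finished(iteration, errors, prev_errors, max_iterations=100, tol=1e-6):
    """Return True when updating of the initial contexts should stop.

    errors / prev_errors hold the prediction errors of steps S34 and S38
    for the current and the previous iteration (prev_errors is None at first).
    """
    if iteration >= max_iterations:   # predetermined number of repetitions reached
        return True
    if prev_errors is None:           # nothing to compare against yet
        return False
    # Both prediction errors changed by less than tol since the last iteration.
    return all(abs(e - p) < tol for e, p in zip(errors, prev_errors))
```

Checking both errors together reflects that the contexts of the previous model and the subsequent model are updated in the same loop.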

ステップＳ４１において、更新終了条件が満たされていないと判定された場合、処理は、図８のステップＳ３１に戻り、コネクティビティ算出部１６は、前モデルに、ステップＳ２４（図７）で設定された入力データと、初期コンテキスト（いまの場合、ステップＳ３９（図８）での更新後の初期コンテキスト）を与えて、モデル生成データを生成し、以下、同様の処理が繰り返される。 If it is determined in step S41 that the update end condition is not satisfied, the process returns to step S31 in FIG. 8, where the connectivity calculation unit 16 gives the previous model the input data set in step S24 (FIG. 7) and the initial context (in this case, the initial context updated in step S39 (FIG. 8)) to generate model generation data, and the same processing is repeated thereafter.

また、ステップＳ４１において、更新終了条件が満たされていると判定された場合、コネクティビティ算出部１６は、前モデルの現在の初期コンテキストを、前モデルの最適初期コンテキストとするとともに、後モデルの現在の初期コンテキストを、後モデルの最適初期コンテキストとして、処理は、ステップＳ４２に進む。 If it is determined in step S41 that the update end condition is satisfied, the connectivity calculation unit 16 takes the current initial context of the previous model as the optimal initial context of the previous model and the current initial context of the subsequent model as the optimal initial context of the subsequent model, and the process proceeds to step S42.

ステップＳ４２では、コネクティビティ算出部１６は、前モデルに、ステップＳ２４（図７）で設定された入力データと、前モデルの最適初期コンテキスト（最後に行われたステップＳ３９（図８）で更新された初期コンテキスト）を与えて、モデル生成データを生成し、処理は、ステップＳ４３に進む。 In step S42, the connectivity calculation unit 16 updates the previous model with the input data set in step S24 (FIG. 7) and the optimal initial context of the previous model (last step S39 (FIG. 8)). (Initial context) is given to generate model generation data, and the process proceeds to step S43.

ステップＳ４３では、コネクティビティ算出部１６は、前モデルから生成されたモデル生成データの最後のオーバラップ部分であるLサンプルを、後モデルの入力データの最初のLサンプルとして設定し、処理は、ステップＳ４４に進む。 In step S43, the connectivity calculation unit 16 sets the L sample that is the last overlap portion of the model generation data generated from the previous model as the first L sample of the input data of the subsequent model, and the processing is performed in step S44. Proceed to

ステップＳ４４では、コネクティビティ算出部１６は、後モデルに、ステップＳ４３で設定された入力データ（前モデルから生成されたモデル生成データの最後のオーバラップ部分であるLサンプル）と、後モデルの最適初期コンテキスト（最後に行われたステップＳ３５で更新された初期コンテキスト）を与えて、モデル生成データを生成し、処理は、ステップＳ４５に進む。 In step S44, the connectivity calculation unit 16 gives the subsequent model the input data set in step S43 (the L samples that form the last overlap portion of the model generation data generated from the previous model) and the optimal initial context of the subsequent model (the initial context updated in the most recent execution of step S35) to generate model generation data, and the process proceeds to step S45.

ステップＳ４５では、コネクティビティ算出部１６は、ステップＳ４２で前モデルから生成されたモデル生成データの最後のLサンプルと、ステップＳ４４で後モデルから生成されたモデル生成データの最初のLサンプルとの接続誤差を、式（１）に従って求める。 In step S45, the connectivity calculation unit 16 connects the last L sample of the model generation data generated from the previous model in step S42 and the first L sample of the model generation data generated from the subsequent model in step S44. Is determined according to equation (1).

そして、コネクティビティ算出部１６は、その接続誤差を、前モデルに対する後モデルのコネクティビティc_ijとして、処理は、ステップＳ４５からステップＳ４６に進む。 Then, the connectivity calculation unit 16 sets the connection error as the connectivity c _ij of the subsequent model with respect to the previous model, and the process proceeds from step S45 to step S46.

ステップＳ４６では、コネクティビティ算出部１６は、ステップＳ４５で求めたコネクティビティc_ijを、コネクティビティ保存部１７に供給して記憶させ、処理は、ステップＳ４７に進む。 In step S46, the connectivity calculation unit 16 supplies the connectivity c _ij obtained in step S45 to the connectivity storage unit 17 for storage, and the process proceeds to step S47.

ステップＳ４７では、コネクティビティ算出部１６は、N個の学習モデル#1ないし#Nが取り得る、２つの学習モデルの順列のすべてを、モデルペアとして、コネクティビティを求めたかどうかがを判定する。 In step S47, the connectivity calculation unit 16 determines whether connectivity has been obtained by using all the permutations of the two learning models that can be taken by the N learning models # 1 to #N as model pairs.

ステップＳ４７において、まだ、モデルペアとしていない２つの学習モデルの順列があると判定された場合、処理は、図７のステップＳ２１に戻り、以下、同様の処理が繰り返される。 If it is determined in step S47 that there is still a permutation of two learning models that are not model pairs, the process returns to step S21 in FIG. 7, and the same process is repeated thereafter.

また、ステップＳ４７において、モデルペアとしていない２つの学習モデルの順列がないと判定された場合、処理はリターンする。 If it is determined in step S47 that there is no permutation of two learning models that are not model pairs, the process returns.
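The loop over model pairs in steps S45 to S47 can be sketched as follows. This is a minimal illustration, not the method of the specification: equation (1) is not reproduced in this passage, so the mean squared error used in `connection_error` is an assumption, and the per-pair optimization of the initial contexts (the steps up to S41) is omitted. The names `connection_error`, `connectivity_table`, `tails`, and `heads` are hypothetical.

```python
from itertools import permutations


def connection_error(tail, head):
    """Mean squared error between two overlap segments of L samples each.

    Assumption: stands in for equation (1), which is defined elsewhere.
    """
    if len(tail) != len(head):
        raise ValueError("overlap segments must have the same length")
    return sum((a - b) ** 2 for a, b in zip(tail, head)) / len(tail)


def connectivity_table(tails, heads):
    """Return {(i, j): c_ij} for every ordered pair of distinct models.

    tails[i]: last L samples generated by model i acting as the previous model.
    heads[j]: first L samples generated by model j acting as the subsequent model.
    """
    table = {}
    # permutations(..., 2) enumerates all ordered pairs (i, j) with i != j,
    # which matches "all permutations of two learning models".
    for i, j in permutations(range(len(tails)), 2):
        table[(i, j)] = connection_error(tails[i], heads[j])
    return table
```

For N models the table holds N × (N − 1) entries, one per ordered pair, which is what the check in step S47 iterates over.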

［データ生成装置２０の詳細構成例］ [Detailed Configuration Example of Data Generation Device 20]
図１０は、図１のデータ生成装置２０のより詳細な構成例を示している。 FIG. 10 shows a more detailed configuration example of the data generation device 20 of FIG. 1.

なお、図１０では、教師データが、複数であるN個のモデル学習用データ#1ないし#Nに分割され、そのN個のモデル学習用データ#1ないし#Nを用いての、N個の学習モデル#1ないし#Nの学習が、コネクティビティの算出も含めて、既に済んでいることとする。 In FIG. 10, the teacher data is divided into a plurality of N pieces of model learning data # 1 to #N, and N pieces of model learning data # 1 to #N are used. It is assumed that learning of learning models # 1 to #N has already been completed, including calculation of connectivity.

始点モデル選択部２３は、現在データ分配部６１、モデルパラメータ供給部６２、N個の認識生成部６３₁ないし６３_N、及び、始点モデル決定部６４等から構成される。 The start point model selection unit 23 includes a current data distribution unit 61, a model parameter supply unit 62, N recognition generation units 63 ₁ to 63 _N , a start point model determination unit 64, and the like.

現在データ分配部６１は、現在データ供給部２１から始点モデル選択部２３に供給される現在データを、N個の認識生成部６３₁ないし６３_Nすべてに供給(分配）する。 The current data distribution unit 61 supplies (distributes) the current data, which is supplied from the current data supply unit 21 to the start point model selection unit 23, to all of the N recognition generation units 63₁ to 63_N.

モデルパラメータ供給部６２は、N個の学習モデル#1ないし#Nのモデルパラメータ#1ないし#Nを、モデルパラメータ保存部１５から読み出す。さらに、モデルパラメータ供給部６２は、モデルパラメータ保存部１５から読み出したモデルパラメータ#nを、認識生成部６３_nに供給する。 The model parameter supply unit 62 reads the model parameters # 1 to #N of the N learning models # 1 to #N from the model parameter storage unit 15. Further, the model parameter supply unit 62 supplies the model parameter #n read from the model parameter storage unit 15 to the recognition generation unit 63 _n .

認識生成部６３_nは、モデルパラメータ供給部６２からのモデルパラメータ#nを、学習モデルに設定することで、学習モデル#nを生成する（例えば、モデル学習用データ#nを用いた学習が済んだ学習モデル#nの、オブジェクト指向プログラミングにおけるインスタンスを生成する）。 The recognition generation unit 63 _n generates a learning model #n by setting the model parameter #n from the model parameter supply unit 62 in the learning model (for example, learning using the model learning data #n has been completed). Create an instance of learning model #n in object-oriented programming).

そして、認識生成部６３_nは、現在データ分配部６１から供給される現在データを、学習モデル#nに与えることで、学習モデル#nから、現在データの予測値#nを生成する。 Then, the recognition generation unit 63_n gives the current data supplied from the current data distribution unit 61 to the learning model #n, thereby generating a predicted value #n of the current data from the learning model #n.

なお、学習モデル#nからの、現在データの予測値#nの生成において、学習モデル#nに与える初期コンテキストとしては、例えば、ランダムな値を採用することができる。また、学習モデル#nに与える初期コンテキストとしては、その他、例えば、現在データの予測値#nを小さくする初期コンテキスト（最適初期コンテキスト）を求め、その最適初期コンテキストを採用することができる。 For example, a random value can be adopted as the initial context given to the learning model #n in the generation of the predicted value #n of the current data from the learning model #n. In addition, as the initial context to be given to the learning model #n, for example, an initial context (optimum initial context) for reducing the predicted value #n of the current data can be obtained and the optimal initial context can be adopted.

認識生成部６３_nは、学習モデル#nから、現在データの予測値#nを生成すると、その予測値#nの予測誤差を求め、始点モデル決定部６４に供給する。 When the recognition generation unit 63 _n generates a prediction value #n of the current data from the learning model #n, the recognition generation unit 63 _n obtains a prediction error of the prediction value #n and supplies it to the start point model determination unit 64.

始点モデル決定部６４は、認識生成部６３₁ないし６３_Nからそれぞれ供給される、現在データの予測値#1ないし#Nの予測誤差が小さい上位１個以上の学習モデルを、始点モデルとして選択し、その始点モデルの始点モデルIDを、生成用モデルシーケンス算出部２５に供給する。 The start point model determination unit 64 selects, as start point models, the top one or more learning models with small prediction errors of the prediction values # 1 to #N of the current data supplied from the recognition generation units 63 ₁ to 63 _N , respectively. The start point model ID of the start point model is supplied to the generation model sequence calculation unit 25.

終点モデル選択部２４は、目標データ分配部７１、モデルパラメータ供給部７２、N個の認識生成部７３₁ないし７３_N、及び、終点モデル決定部７４等から構成される。 The end point model selection unit 24 includes a target data distribution unit 71, a model parameter supply unit 72, N recognition generation units 73 ₁ to 73 _N , an end point model determination unit 74, and the like.

目標データ分配部７１は、目標データ供給部２２から終点モデル選択部２４に供給される目標データを、N個の認識生成部７３₁ないし７３_Nすべてに供給(分配）する。 The target data distribution unit 71 supplies (distributes) the target data, which is supplied from the target data supply unit 22 to the end point model selection unit 24, to all of the N recognition generation units 73₁ to 73_N.

モデルパラメータ供給部７２は、N個の学習モデル#1ないし#Nのモデルパラメータ#1ないし#Nを、モデルパラメータ保存部１５から読み出す。さらに、モデルパラメータ供給部７２は、モデルパラメータ保存部１５から読み出したモデルパラメータ#nを、認識生成部７３_nに供給する。 The model parameter supply unit 72 reads the model parameters # 1 to #N of the N learning models # 1 to #N from the model parameter storage unit 15. Further, the model parameter supply unit 72 supplies the model parameter #n read from the model parameter storage unit 15 to the recognition generation unit 73 _n .

認識生成部７３_nは、モデルパラメータ供給部７２からのモデルパラメータ#nを、学習モデルに設定することで、学習モデル#nを生成する。 The recognition generation unit 73 _n generates the learning model #n by setting the model parameter #n from the model parameter supply unit 72 in the learning model.

そして、認識生成部７３_nは、目標データ分配部７１から供給される目標データを、学習モデル#nに与えることで、学習モデル#nから、目標データの予測値#nを生成する。 Then, the recognition generation unit 73 _n generates the predicted value #n of the target data from the learning model #n by giving the target data supplied from the target data distribution unit 71 to the learning model #n.

なお、学習モデル#nからの、目標データの予測値#nの生成において、学習モデル#nに与える初期コンテキストとしては、現在データの予測値#nの生成の場合と同様に、ランダムな値や、最適初期コンテキストを採用することができる。 In generating the predicted value #n of the target data from the learning model #n, a random value or the optimal initial context can be adopted as the initial context given to the learning model #n, as in the case of generating the predicted value #n of the current data.

認識生成部７３_nは、学習モデル#nから、目標データの予測値#nを生成すると、その予測値#nの予測誤差を求め、終点モデル決定部７４に供給する。 When the recognition generation unit 73 _n generates the prediction value #n of the target data from the learning model #n, the recognition generation unit 73 _n obtains a prediction error of the prediction value #n and supplies it to the end point model determination unit 74.

終点モデル決定部７４は、認識生成部７３₁ないし７３_Nからそれぞれ供給される、目標データの予測値#1ないし#Nの予測誤差が小さい上位１個以上の学習モデルを、終点モデルとして選択し、その終点モデルの終点モデルIDを、生成用モデルシーケンス算出部２５に供給する。 The end point model determination unit 74 selects, as end point models, the top one or more learning models with the smallest prediction errors of the predicted values #1 to #N of the target data supplied from the recognition generation units 73₁ to 73_N, respectively, and supplies the end point model IDs of those end point models to the generation model sequence calculation unit 25.
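The selection performed by the start point model determination unit 64 and the end point model determination unit 74 amounts to ranking the N learning models by prediction error and keeping the best few. The sketch below assumes the N prediction errors have already been computed; the function name `select_models` and the parameter `top_k` are hypothetical, and model IDs are taken to be zero-based list indices for illustration.

```python
def select_models(prediction_errors, top_k=1):
    """Return the IDs of the top_k models with the smallest prediction error.

    prediction_errors[n] is the prediction error reported for learning model n.
    """
    if top_k < 1:
        raise ValueError("at least one model must be selected")
    # Sort model IDs by ascending error; ties keep the lower ID first
    # because sorted() is stable.
    ranked = sorted(range(len(prediction_errors)),
                    key=lambda n: prediction_errors[n])
    return ranked[:top_k]
```

The same function would serve both units: called with the errors on the current data it yields start point model IDs, and called with the errors on the target data it yields end point model IDs.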

生成用モデルシーケンス算出部２５は、始点モデルID供給部８１、終点モデルID供給部８２、及び、シーケンス算出部８３等から構成される。 The generation model sequence calculation unit 25 includes a start point model ID supply unit 81, an end point model ID supply unit 82, a sequence calculation unit 83, and the like.

始点モデルID供給部８１は、始点モデル選択部２３（の始点モデル決定部６４）から生成用モデルシーケンス算出部２５に供給される始点モデルIDを受信し、シーケンス算出部８３に供給する。 The start point model ID supply unit 81 receives the start point model ID supplied to the generation model sequence calculation unit 25 from the start point model selection unit 23 (the start point model determination unit 64), and supplies it to the sequence calculation unit 83.

終点モデルID供給部８２は、終点モデル選択部２４（の終点モデル決定部７４）から生成用モデルシーケンス算出部２５に供給される終点モデルIDを受信し、シーケンス算出部８３に供給する。 The end point model ID supply unit 82 receives the end point model ID supplied from the end point model selection unit 24 (the end point model determination unit 74) to the generation model sequence calculation unit 25 and supplies it to the sequence calculation unit 83.

シーケンス算出部８３は、始点モデルID供給部８１からの始点モデルIDによって特定される始点モデルから、終点モデルID供給部８２からの終点モデルIDによって特定される終点モデルまでの、複数の学習モデルの、ある並びを、生成用モデルシーケンスとして求める。 The sequence calculation unit 83 includes a plurality of learning models from the start point model specified by the start point model ID from the start point model ID supply unit 81 to the end point model specified by the end point model ID from the end point model ID supply unit 82. A certain sequence is obtained as a model sequence for generation.

すなわち、シーケンス算出部８３は、コネクティビティ保存部１７に記憶されたコネクティビティc_ijに対応する値を、学習モデル#iの後に、学習モデル#jを接続するのに要するコスト（以下、接続コストともいう）として、接続コストの累積値を最小にする、始点モデルから終点モデルまでの学習モデルの並びを、生成用モデルシーケンスとして求める。 That is, the sequence calculation unit 83 treats a value corresponding to the connectivity c_ij stored in the connectivity storage unit 17 as the cost required to connect the learning model #j after the learning model #i (hereinafter also referred to as the connection cost), and obtains, as the generation model sequence, the sequence of learning models from the start point model to the end point model that minimizes the cumulative value of the connection cost.

そして、シーケンス算出部８３は、生成用モデルシーケンスを、時系列データ生成部２６に供給する。 Then, the sequence calculation unit 83 supplies the generation model sequence to the time series data generation unit 26.

ここで、シーケンス算出部８３では、上述のように、学習モデル#iの後に、学習モデル#jを接続するのに要する接続コストとして、コネクティビティc_ijに対応する値を採用し、その接続コストを、ノード（学習モデル）どうしの距離とみなすことで、接続コストの累積値を最小にする、始点モデルから終点モデルまでの学習モデルの並びである生成用モデルシーケンスを、一般的な経路探索アルゴリズムによって求める。 Here, as described above, the sequence calculation unit 83 adopts a value corresponding to the connectivity c_ij as the connection cost required to connect the learning model #j after the learning model #i, and regards that connection cost as the distance between nodes (learning models). The generation model sequence, that is, the sequence of learning models from the start point model to the end point model that minimizes the cumulative value of the connection cost, is thereby obtained with a general route search algorithm.

生成用モデルシーケンスを求めるための経路探索アルゴリズムとしては、例えば、ダイクストラ法や、ビタビアルゴリズムを採用することができる。 As a route search algorithm for obtaining the generation model sequence, for example, the Dijkstra method or the Viterbi algorithm can be employed.
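As an illustration of the route search view, the following is a minimal sketch of Dijkstra's method applied to a dense connection cost matrix. It assumes non-negative costs and zero-based model IDs; the function name `dijkstra_sequence` and the convention of returning `None` when no route exists are hypothetical choices, not taken from the specification.

```python
import heapq


def dijkstra_sequence(cost, start, goal):
    """Return (sequence of model IDs, cumulative cost) from start to goal.

    cost[i][j] is the (non-negative) connection cost of placing model j
    immediately after model i.
    """
    n = len(cost)
    best = [float("inf")] * n   # smallest cumulative cost found so far
    prev = [None] * n           # predecessor on the cheapest known route
    best[start] = 0.0
    queue = [(0.0, start)]
    while queue:
        d, i = heapq.heappop(queue)
        if d > best[i]:
            continue            # stale queue entry
        if i == goal:
            break
        for j in range(n):
            if j == i:
                continue
            nd = d + cost[i][j]
            if nd < best[j]:
                best[j] = nd
                prev[j] = i
                heapq.heappush(queue, (nd, j))
    if best[goal] == float("inf"):
        return None, float("inf")
    # Walk the predecessor links back from the goal to recover the sequence.
    path = [goal]
    while path[-1] != start:
        path.append(prev[path[-1]])
    path.reverse()
    return path, best[goal]
```

Because every node of this graph is a learning model, the returned list is directly a candidate generation model sequence.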

なお、生成用モデルシーケンス算出部２５は、始点モデル選択部２３から、複数の始点モデルIDが供給される場合や、終点モデル選択部２４から、複数の終点モデルIDが供給される場合、つまり、複数の学習モデルが、始点モデルや終点モデルとして選択された場合、その複数の始点と終点の組み合わせすべてについて、生成用モデルシーケンスを算出する。 Note that when a plurality of start point model IDs are supplied from the start point model selection unit 23, or when a plurality of end point model IDs are supplied from the end point model selection unit 24, that is, when a plurality of learning models are selected as start point models or end point models, the generation model sequence calculation unit 25 calculates a generation model sequence for every combination of those start points and end points.

すなわち、始点モデルとして選択された学習モデルの数をAと表すとともに、終点モデルとして選択された学習モデルの数をBと表すこととすると、生成用モデルシーケンス算出部２５は、A×B個の生成用モデルシーケンスを算出する。 That is, when the number of learning models selected as the start point model is represented as A and the number of learning models selected as the end point model is represented as B, the generation model sequence calculation unit 25 generates A × B pieces. A generation model sequence is calculated.

そして、生成用モデルシーケンス算出部２５は、A×B個の生成用モデルシーケンスのうちの、接続コストの累積値が最小の生成用モデルシーケンスを、時系列データの生成に用いる生成用モデルシーケンスに決定し、時系列データ生成部２６に供給する。 Then, the generation model sequence calculation unit 25 uses the generation model sequence having the minimum cumulative connection cost among the A × B generation model sequences as a generation model sequence used for generating time-series data. It is determined and supplied to the time-series data generation unit 26.
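The choice among the A × B candidates can be sketched as below. The route search itself is passed in as `find_sequence`, a hypothetical callable that returns a pair (sequence, cumulative connection cost), or (`None`, infinity) when no sequence exists; that interface is an assumption made for this illustration.

```python
from itertools import product


def best_generation_sequence(start_ids, goal_ids, find_sequence):
    """Evaluate all A x B (start, goal) pairs and keep the cheapest sequence."""
    candidates = []
    for s, g in product(start_ids, goal_ids):
        sequence, total = find_sequence(s, g)
        if sequence is not None:
            candidates.append((total, sequence))
    if not candidates:
        return None, float("inf")
    # Compare on the cumulative connection cost only.
    total, sequence = min(candidates, key=lambda c: c[0])
    return sequence, total
```

With A = 2 start point models and B = 2 end point models, `find_sequence` is called four times and the sequence with the smallest cumulative connection cost is returned.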

時系列データ生成部２６は、シーケンス供給部９１、モデルパラメータ供給部９２、N個の認識生成部９３₁ないし９３_N、及び、統合生成部９４等から構成される。 The time series data generation unit 26 includes a sequence supply unit 91, a model parameter supply unit 92, N recognition generation units 93 ₁ to 93 _N , an integrated generation unit 94, and the like.

シーケンス供給部９１は、生成用モデルシーケンス算出部２５（のシーケンス算出部８３）から供給される生成用モデルシーケンスを受信し、モデルパラメータ供給部９２に供給する。 The sequence supply unit 91 receives the generation model sequence supplied from the generation model sequence calculation unit 25 (the sequence calculation unit 83) and supplies the generated model sequence to the model parameter supply unit 92.

モデルパラメータ供給部９２は、シーケンス供給部９１からの生成用モデルシーケンスを構成する学習モデル（以下、構成モデルともいう）のモデルパラメータを、モデルパラメータ保存部１５から読み出し、認識生成部９３₁ないし９３_Nのうちの必要なブロックに供給する。 The model parameter supply unit 92 reads, from the model parameter storage unit 15, the model parameters of the learning models (hereinafter also referred to as configuration models) that constitute the generation model sequence from the sequence supply unit 91, and supplies them to the necessary blocks among the recognition generation units 93₁ to 93_N.

すなわち、生成用モデルシーケンスが、K（≦N）個の構成モデル#1ないし#Kの並びで構成されることとすると、モデルパラメータ供給部９２は、構成モデル#1ないし#Kのモデルパラメータ#1ないし#Kを、モデルパラメータ保存部１５から読み出す。 That is, assuming that the generation model sequence consists of a sequence of K (≦N) configuration models #1 to #K, the model parameter supply unit 92 reads the model parameters #1 to #K of the configuration models #1 to #K from the model parameter storage unit 15.

さらに、モデルパラメータ供給部９２は、構成モデル#k（k=1,2,・・・,K）のモデルパラメータ#kを、認識生成部９３₁ないし９３_Nのうちの認識生成部９３_kに供給する。 Further, the model parameter supply unit 92 supplies the model parameter #k of the configuration model #k (k = 1, 2, ..., K) to the recognition generation unit 93_k among the recognition generation units 93₁ to 93_N.

認識生成部９３_kは、モデルパラメータ供給部９２からのモデルパラメータ#kを、学習モデルに設定することで、構成モデル#kを生成する（例えば、モデル学習用データ#kを用いた学習が済んだ学習モデル#kの、オブジェクト指向プログラミングにおけるインスタンスを生成する）。 The recognition generation unit 93 _k generates the configuration model #k by setting the model parameter #k from the model parameter supply unit 92 in the learning model (for example, learning using the model learning data #k has been completed). Create an instance of learning model #k in object-oriented programming).

さらに、認識生成部９３_kは、構成モデル#kから、モデル生成データ#kを生成し、そのモデル生成データ#kの最後のオーバラップ部分と、認識生成部９３_k+1が構成モデル#k+1から生成するモデル生成データ#k+1、すなわち、モデル生成データ#kに接続されるモデル生成データ#k+1の最初のオーバラップ部分との誤差を小さくするように、構成モデル#kの初期コンテキストを更新することで、コネクティビティ算出処理（図７ないし図９）の場合と同様にして、最適初期コンテキストを求める。 Furthermore, the recognition generation unit 93_k generates model generation data #k from the configuration model #k, and updates the initial context of the configuration model #k so as to reduce the error between the last overlap portion of the model generation data #k and the first overlap portion of the model generation data #k+1 that the recognition generation unit 93_k+1 generates from the configuration model #k+1, that is, the model generation data #k+1 connected to the model generation data #k. The optimal initial context is thereby obtained in the same manner as in the connectivity calculation process (FIGS. 7 to 9).

そして、認識生成部９３₁は、構成モデル#1に、その構成モデル#1の最適初期コンテキストを与えるとともに、現在データ供給部２１から供給される現在データを入力データとして与えることで、モデル生成データ#1を生成して、統合生成部９４に供給する。 Then, the recognition generation unit 93₁ gives the configuration model #1 its optimal initial context and also gives it, as input data, the current data supplied from the current data supply unit 21, thereby generating model generation data #1, which it supplies to the integrated generation unit 94.

認識生成部９３₁ないし９３_Kのうちの、認識生成部９３₁以外の認識生成部９３_kは、構成モデル#kに、その構成モデル#kの最適初期コンテキストを与えるとともに、前段の認識生成部９３_k-1が構成モデル#k-1から生成したモデル生成データ#k-1の最後のオーバラップ部分を入力データの最初のLサンプルとして与えることで、モデル生成データ#kを生成して、統合生成部９４に供給する。 Among the recognition generation units 93₁ to 93_K, each recognition generation unit 93_k other than the recognition generation unit 93₁ gives the configuration model #k its optimal initial context and also gives it, as the first L samples of the input data, the last overlap portion of the model generation data #k-1 that the preceding recognition generation unit 93_k-1 generated from the configuration model #k-1, thereby generating model generation data #k, which it supplies to the integrated generation unit 94.

統合生成部９４は、認識生成部９３₁ないし９３_Kから供給されるモデル生成データ#1ないし#Kを、オーバラップ部分を考慮して接続することにより、滑らかな生成時系列データを構成(生成）し、時系列データ出力部２７に供給する。 The integrated generation unit 94 connects the model generation data #1 to #K supplied from the recognition generation units 93₁ to 93_K, taking the overlap portions into account, to construct (generate) smooth generated time-series data, and supplies it to the time-series data output unit 27.
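This passage does not specify how the integrated generation unit 94 takes the overlap portions into account. One plausible reading, shown here purely as an assumption, is a linear cross-fade over the L overlapping samples of consecutive segments. The function name `stitch` and the scalar-valued samples are hypothetical simplifications.

```python
def stitch(segments, overlap):
    """Join consecutive segments whose last/first `overlap` samples coincide.

    Assumption: the overlapping samples are blended with linearly changing
    weights; the specification only says the overlap is taken into account.
    """
    if not segments:
        return []
    out = list(segments[0])
    for seg in segments[1:]:
        if overlap > len(out) or overlap > len(seg):
            raise ValueError("segment shorter than the overlap length")
        for m in range(overlap):
            # Weight of the following segment grows across the overlap.
            w = (m + 1) / (overlap + 1)
            idx = len(out) - overlap + m
            out[idx] = (1 - w) * out[idx] + w * seg[m]
        # Append only the part of the segment that follows the overlap.
        out.extend(seg[overlap:])
    return out
```

With K segments of length T and overlap L, the stitched series has K × T − (K − 1) × L samples, since each junction contributes the shared samples only once.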

［生成用モデルシーケンスの算出］ [Calculation of Generation Model Sequence]
次に、生成用モデルシーケンス算出部２５（図１）において、生成用モデルシーケンスを、例えば、ビタビアルゴリズムに基づいて求める方法について説明する。 Next, a method by which the generation model sequence calculation unit 25 (FIG. 1) obtains the generation model sequence based on, for example, the Viterbi algorithm will be described.

ここで、ビタビアルゴリズムは、観測結果について、１つの最も尤もらしい説明を与える動的計画法のアルゴリズムであり、ビタビアルゴリズムで扱う事象（状態）の系列について、時刻tでの事象の計算は、直前の時刻t-1での事象の系列のみに依存していることを前提とする。すなわち、ビタビアルゴリズムで扱う事象は、未来の挙動が現在の値だけで決定され、過去の挙動と無関係であるという性質を持つマルコフ性を前提とする確率過程である。 Here, the Viterbi algorithm is a dynamic programming algorithm that gives one most plausible explanation for the observation result. For the sequence of events (states) handled by the Viterbi algorithm, the calculation of the event at time t It is assumed that it depends only on the sequence of events at time t-1. In other words, the event handled by the Viterbi algorithm is a stochastic process based on Markov property that has the property that the future behavior is determined only by the current value and is independent of the past behavior.

また、ビタビアルゴリズムは状態機械を仮定して動作する。すなわち、モデルとしたシステムは任意の時刻で何らかの状態を持つ。状態数は膨大であっても有限であり、リストアップ可能である。各状態はノードとして表される。与えられた状態に対応する状態の複数の系列（経路）が複数考えられるとしても、最も尤もらしい状態経路が１つある。ビタビアルゴリズムでは、ある状態に到達するあらゆる経路を調べ、最も尤もらしい経路を選ぶ。これを状態の並びに対して順次適用するため、あらゆる経路を保持しておく必要はなく、１つの状態につき１つの経路だけを保持すれば足りる。 The Viterbi algorithm operates assuming a state machine. That is, the modeled system has some state at an arbitrary time. Even if the number of states is enormous, it is finite and can be listed. Each state is represented as a node. Even if a plurality of sequences (routes) of states corresponding to a given state can be considered, there is one state route that is most likely. The Viterbi algorithm examines every route that reaches a certain state and selects the most likely route. Since this is applied sequentially to the sequence of states, it is not necessary to hold every route, and it is sufficient to hold only one route per state.

さらに、ビタビアルゴリズムでは、ある状態から別の状態への遷移について増分（通常、数）を付与する。この遷移は事象から求められる。また、ビタビアルゴリズムでは、事象は一般に加算的な意味で経路上で累積するとされる。ビタビアルゴリズムでは、各状態についての数を保持するとともに、ある事象が起きたとき、これまでの状態経路の持つ値と新たな遷移における増分を考慮し、最も良い状態を選択する。事象に対応した増分は、ある状態から別の状態への遷移確率に依存して決定される。 Further, in the Viterbi algorithm, an increment (usually a number) is given for a transition from one state to another. This transition is determined from the event. In the Viterbi algorithm, events are generally accumulated on the route in an additive sense. In the Viterbi algorithm, the number for each state is held, and when a certain event occurs, the best state is selected in consideration of the value of the state path so far and the increment in the new transition. The increment corresponding to the event is determined depending on the transition probability from one state to another.

生成用モデルシーケンスを、ビタビアルゴリズムに基づいて求める場合、学習後の学習モデル#1ないし#Nのそれぞれが、ビタビアルゴリズムにおける状態機械の状態（ノード）に相当する。したがって、学習後の学習モデル#1ないし#Nの数Nが、ビタビアルゴリズムの全状態数になる。 When the generation model sequence is obtained based on the Viterbi algorithm, each of the learned models # 1 to #N after learning corresponds to the state (node) of the state machine in the Viterbi algorithm. Therefore, the number N of learning models # 1 to #N after learning is the total number of states of the Viterbi algorithm.

また、ある状態から別の状態に遷移する際の事象に対応した増分、すなわち、ビタビアルゴリズムにおける遷移確率としては、接続コスト、すなわち、コネクティビティc_ijを用いることができる。但し、遷移確率と接続コスト（コネクティビティc_ij）とは、値の増減が逆の関係にある。すなわち、遷移確率は、値が大きいほど、状態遷移が生じやすいが、接続コストは、値が小さいほど、状態遷移に相当する、学習モデル#iと#jの接続が生じやすい（力学的接続可能性が高い）。 In addition, the connection cost, that is, the connectivity c_ij, can be used as the increment corresponding to the event at the time of transition from one state to another, that is, as the transition probability in the Viterbi algorithm. However, the transition probability and the connection cost (connectivity c_ij) vary in opposite directions. That is, the larger the transition probability, the more likely the state transition is to occur, whereas the smaller the connection cost, the more likely the connection between learning models #i and #j, which corresponds to a state transition, is to occur (the higher the dynamical connectability).

ビタビアルゴリズムでは、ある始点となる状態から目標とする状態への全経路のうちの、遷移確率の総和が最大となる経路を最も尤もらしい経路（ビタビパス(Viterbi path)）として採用する。これと同様に、生成用モデルシーケンスの算出では、接続コストの累積値、つまり、コネクティビティc_ijの総和が最小となる経路をコストが最小の経路として採用し、その経路上の状態に相当する学習モデルの並びを、生成用モデルシーケンスとする。 In the Viterbi algorithm, of all the paths from a given starting state to a target state, the path with the maximum sum of transition probabilities is adopted as the most likely path (Viterbi path). Similarly, in calculating the generation model sequence, the path with the minimum cumulative connection cost, that is, the minimum sum of the connectivity c_ij, is adopted as the minimum-cost path, and the sequence of learning models corresponding to the states on that path is taken as the generation model sequence.

すなわち、生成用モデルシーケンス算出部２５は、始点モデルから終点モデルまでの接続コストの累積値が最小になる、学習モデルの並びを、生成用モデルシーケンスとして求める。 In other words, the generation model sequence calculation unit 25 obtains, as a generation model sequence, a sequence of learning models that minimizes the cumulative value of the connection costs from the start point model to the end point model.

いま、最初の時刻t=1（始点モデルに相当する状態の時刻）から、ある時刻t=τまでの、状態#nごとの接続コストの累積値δ_n(τ)をコンポーネントとするベクトルを、累積値ベクトルd(τ)＝（δ₁(τ)，δ₂(τ)，・・・，δ_N(τ)）ということとする。生成用モデルシーケンス算出部２５は、累積値ベクトルd(τ)＝（δ₁(τ)，δ₂(τ)，・・・，δ_N(τ)）を保持する。 Now, a vector whose components are the cumulative values δ_n(τ) of the connection cost for each state #n from the first time t=1 (the time of the state corresponding to the start point model) to a certain time t=τ is referred to as the cumulative value vector d(τ) = (δ₁(τ), δ₂(τ), ..., δ_N(τ)). The generation model sequence calculation unit 25 holds the cumulative value vector d(τ) = (δ₁(τ), δ₂(τ), ..., δ_N(τ)).

また、状態#iから状態#jへの状態遷移のコスト、すなわち、状態#iに相当する学習モデル#i（が生成するモデル生成データ#i）の直後に、状態#jに相当する学習モデル#j（が生成するモデル生成データ#j）が接続する接続コストを、b_ijで表す。接続コストb_ijの集合は、接続コストb_ijを、第i行第j列のコンポーネントとするマトリクスで表すことができる。 Further, the cost of the state transition from state #i to state #j, that is, the connection cost of connecting the learning model #j corresponding to state #j (the model generation data #j that it generates) immediately after the learning model #i corresponding to state #i (the model generation data #i that it generates), is denoted by b_ij. The set of connection costs b_ij can be represented by a matrix whose component in the i-th row and j-th column is the connection cost b_ij.

ここで、接続コストb_ijを、第i行第j列のコンポーネントとするマトリクスを、接続コストマトリクスともいう。 Here, a matrix having the connection cost b _ij as a component in the i-th row and j-th column is also referred to as a connection cost matrix.

いま、学習モデル#iの直後に学習モデル#jが接続するのが不自然でないとみなすことができるコネクティビティc_ijの最大値を、c_maxと表し、その最大値c_maxを、コネクティビティc_ijの閾値とする。コネクティビティc_ijが、閾値c_max以下である場合には、接続コストb_ijとして、コネクティビティc_ijが採用される。また、コネクティビティc_ijが、閾値c_maxを超える場合には、接続コストb_ijとして、閾値c_maxより十分大きな値である接続不可能値c_infが採用される。 Now, the maximum value of the connectivity c_ij at which it can be regarded as not unnatural for the learning model #j to be connected immediately after the learning model #i is denoted by c_max, and this maximum value c_max is used as the threshold for the connectivity c_ij. When the connectivity c_ij is equal to or less than the threshold c_max, the connectivity c_ij is adopted as the connection cost b_ij. When the connectivity c_ij exceeds the threshold c_max, the inaccessible value c_inf, which is sufficiently larger than the threshold c_max, is adopted as the connection cost b_ij.

ここで、閾値c_maxや接続不可能値c_infは、シミュレーション等によって求められる。 Here, the threshold value c _max and the inaccessible value c _inf are obtained by simulation or the like.

すなわち、例えば、多数の教師データを用いて、学習モデル#iから生成されるモデル生成データの最後のオーバラップ部分と、学習モデル#jから生成されるモデル生成データの最初のオーバラップ部分とが似ていない場合（学習モデル#iから生成されるモデル生成データの直後に、学習モデル#jから生成されるモデル生成データが繋がることが不自然である場合）のコネクティビティc_ijの平均値等が、シミュレーションによって求められ、閾値c_maxとして採用される。 That is, for example, using a large number of teacher data, the average value or the like of the connectivity c_ij in the case where the last overlap portion of the model generation data generated from the learning model #i and the first overlap portion of the model generation data generated from the learning model #j are not similar (the case where it is unnatural for the model generation data generated from the learning model #j to follow immediately after the model generation data generated from the learning model #i) is obtained by simulation and adopted as the threshold c_max.

また、例えば、多数の教師データを用いて、複数の学習モデルのすべてを接続した場合のコネクティビティの総和の最大値が求められ、その最大値よりも大きい値（生成用モデルシーケンスを構成する学習モデルのコネクティビティc_ijの総和として取り得ない大きな値）が、接続不可能値c_infとして採用される。 In addition, for example, the maximum value of the sum of the connectivity when all of the plurality of learning models are connected using a large number of teacher data is obtained, and a value larger than the maximum value (the learning model constituting the generation model sequence) (A large value that cannot be taken as the sum of the connectivity c _ij ) is adopted as the inaccessible value c _inf .

以上のように、コネクティビティc_ijが、閾値c_max以下である場合には、接続コストb_ijとして、コネクティビティc_ijを採用し、コネクティビティc_ijが、閾値c_maxを超える場合には、接続コストb_ijとして、接続不可能値c_infを採用することで、生成用モデルシーケンスにおいて、ある学習モデルの直後に接続され得る学習モデルと、接続されることがない学習モデルとを明確に区別することができる。 As described above, by adopting the connectivity c_ij as the connection cost b_ij when the connectivity c_ij is equal to or less than the threshold c_max, and adopting the inaccessible value c_inf as the connection cost b_ij when the connectivity c_ij exceeds the threshold c_max, learning models that can be connected immediately after a given learning model and learning models that are never connected there can be clearly distinguished in the generation model sequence.
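The thresholding rule for the connection cost matrix is simple enough to state directly in code. The sketch below follows the rule as described (b_ij = c_ij if c_ij ≦ c_max, otherwise c_inf); the function name `connection_cost_matrix` and the nested-list representation of the matrix are hypothetical.

```python
def connection_cost_matrix(connectivity, c_max, c_inf):
    """Build the matrix of connection costs b_ij from the connectivities c_ij.

    connectivity[i][j] holds c_ij; values above c_max are replaced by c_inf.
    """
    if c_inf <= c_max:
        raise ValueError("c_inf must be larger than c_max")
    return [[c if c <= c_max else c_inf for c in row]
            for row in connectivity]
```

A connectivity exactly equal to c_max is kept, because the rule uses "equal to or less than" for the connectable case.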

生成用モデルシーケンス算出部２５は、生成用モデルシーケンスを求めるにあたり、まず、上述したような接続コストマトリクスを生成するとともに、累積値ベクトルd(t)を初期化する。 In determining the generation model sequence, the generation model sequence calculation unit 25 first generates the connection cost matrix as described above and initializes the accumulated value vector d (t).

ここで、累積値ベクトルd(t)の初期化とは、時刻t=1のときの、累積値ベクトルd(1)のコンポーネントδ₁(1)，δ₂(1)，・・・，δ_N(1)の値を設定（セット）することである。累積値ベクトルd(t)の初期化では、コンポーネントδ₁(1)ないしδ_N(1)のうちの、始点モデルとなっている学習モデルに対応するコンポーネントが、0とされ、その他のコンポーネントは、接続不可能値c_infとされる。 Here, initialization of the accumulated value vector d(t) means setting the values of the components δ₁(1), δ₂(1), ..., δ_N(1) of the accumulated value vector d(1) at time t=1. In the initialization of the accumulated value vector d(t), of the components δ₁(1) to δ_N(1), the component corresponding to the learning model serving as the start point model is set to 0, and the other components are set to the inaccessible value c_inf.

接続コストマトリクスの生成と、累積値ベクトルd(t)の初期化が終了すると、生成用モデルシーケンス算出部２５は、前向き計算（前向き方向（未来方向）の計算）を行うことで、各時刻tの累積値ベクトルd(t)を求める。 When the generation of the connection cost matrix and the initialization of the accumulated value vector d (t) are completed, the generation model sequence calculation unit 25 performs forward calculation (calculation of the forward direction (future direction)), so that each time t The accumulated value vector d (t) is obtained.

図１１は、生成用モデルシーケンス算出部２５による前向き計算を説明する図である。 FIG. 11 is a diagram for explaining the forward calculation by the generation model sequence calculation unit 25.

図１１において、横軸は、時刻tを表し、縦軸は、状態に相当する学習モデルを表す。 In FIG. 11, the horizontal axis represents time t, and the vertical axis represents a learning model corresponding to the state.

生成用モデルシーケンス算出部２５は、累積値ベクトルd(t)のコンポーネントδ_j(t)を、式（２）に従って、コンポーネントδ_j(t+1)に更新することで、時刻tの累積値ベクトルd(t)を、時刻t+1の累積値ベクトルd(t+1)に更新する。 The generation model sequence calculation unit 25 updates the component δ_j(t) of the accumulated value vector d(t) to the component δ_j(t+1) according to equation (2), thereby updating the accumulated value vector d(t) at time t to the accumulated value vector d(t+1) at time t+1.

δ_j(t+1) = min_i(δ_i(t) + b_ij)   ・・・（２）

ここで、式（２）において、min_i()は、変数iを、1ないしNの整数に変えたときのかっこ内の値の最小値を表す。 Here, in equation (2), min_i() represents the minimum of the value in parentheses as the variable i ranges over the integers from 1 to N.

式（２）によれば、時刻t+1の、学習モデル#jまでの接続コストの累積値δ_j(t+1)は、時刻tの、学習モデル#iまでの接続コストの累積値δ_i(t)と、学習モデル#iに対する学習モデル#jの接続コストb_ijとを用いて求められる。 According to equation (2), the cumulative value δ_j(t+1) of the connection cost up to the learning model #j at time t+1 is obtained using the cumulative value δ_i(t) of the connection cost up to the learning model #i at time t and the connection cost b_ij of the learning model #j with respect to the learning model #i.

すなわち、式（２）によれば、時刻t+1において、学習モデル#jに至る、時刻tのすべての学習モデル#1ないし#Nからの接続（状態遷移に相当する）のうちの、時刻t+1の、学習モデル#jまでの接続コストの累積値が最小になる接続（以下、最小接続ともいう）が選択される。そして、その最小接続を介して、時刻t+1に、学習モデル#jに至るまでの接続コストの累積値が、時刻t+1の、学習モデル#jまでの接続コストの累積値δ_j(t+1)として用いられる。 That is, according to Equation (2), at time t + 1, the time of all connections (corresponding to state transitions) from all learning models # 1 to #N at time t that reach learning model #j. A connection (hereinafter also referred to as the minimum connection) that minimizes the cumulative value of the connection cost of t + 1 up to the learning model #j is selected. Then, via the minimum connection, at time t + 1, the cumulative value of the connection cost up to learning model #j is the cumulative value of connection cost δ _j (up to learning model #j at time t + 1. t + 1).

これにより、生成用モデルシーケンス算出部２５では、時刻t+1に、学習モデル#jに至る全経路を保持することなく、最小接続だけを選択することによって、時刻t+1の、学習モデル#jまでの接続コストの累積値δ_j(t+1)を求めることができる。 As a result, the generation model sequence calculation unit 25 selects only the minimum connection at time t + 1 without holding all the routes to the learning model #j, so that the learning model # at time t + 1 is selected. A cumulative value δ _j (t + 1) of connection costs up to j can be obtained.

なお、生成用モデルシーケンス算出部２５は、式（２）によって、接続コストの累積値δ_j(t+1)を求めた学習モデル#1ないし#Nそれぞれに至るまでの学習モデルの系列（並び）の情報（以下、系列情報ともいう）を保持する。 Note that the generation model sequence calculation unit 25 holds information on the series (sequence) of learning models leading up to each of the learning models #1 to #N for which the cumulative value δ_j(t+1) of the connection cost was obtained by equation (2) (hereinafter also referred to as sequence information).

すなわち、生成用モデルシーケンス算出部２５は、学習モデル#1ないし#Nそれぞれについて、時刻t+1の学習モデル#jへの最小接続となる、時刻tの学習モデル#iの情報（以下、最小接続元情報ともいう）を、時刻ごとに記憶する。 That is, the generation model sequence calculation unit 25 obtains information about the learning model #i at time t (hereinafter referred to as the minimum) for each of the learning models # 1 to #N, which is the minimum connection to the learning model #j at time t + 1. (Also referred to as connection source information) is stored for each time.
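One update of equation (2), together with the minimum connection source information recorded for each model, can be sketched as follows. The function name `forward_step` and the zero-based model indices are hypothetical; `delta` plays the role of d(t) and `cost[i][j]` of b_ij.

```python
def forward_step(delta, cost):
    """Apply equation (2) once: delta_j(t+1) = min_i(delta_i(t) + b_ij).

    Returns the updated vector and, for each model j, the model i that
    gives the minimum connection (the minimum connection source).
    """
    n = len(delta)
    new_delta = []
    source = []
    for j in range(n):
        # Pick the predecessor i with the smallest cumulative cost into j.
        i_best = min(range(n), key=lambda i: delta[i] + cost[i][j])
        new_delta.append(delta[i_best] + cost[i_best][j])
        source.append(i_best)
    return new_delta, source
```

Only one predecessor per model is stored at each time step, so memory grows with N per step rather than with the number of possible routes.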

以上のような前向き計算の開始後、生成用モデルシーケンス算出部２５は、前向き計算を終了するための条件（以下、計算終了条件ともいう）の判定を開始し、計算終了条件が満たされたときに、前向き計算を終了する。 After the start of the forward calculation as described above, the generation model sequence calculation unit 25 starts evaluating a condition for ending the forward calculation (hereinafter also referred to as a calculation end condition), and ends the forward calculation when the calculation end condition is satisfied.

ここで、生成用モデルシーケンス算出部２５では、始点モデルから終点モデルまでの学習モデルの並びが、生成用モデルシーケンスとして求められるが、始点モデルから、何時刻後に、終点モデルに到達するかは、未知である。したがって、前向き計算を行うべき回数を、あらかじめ知ることは困難であり、そのため、前向き計算を終了するのに、計算終了条件が必要となる。 Here, the generation model sequence calculation unit 25 obtains the sequence of learning models from the start point model to the end point model as the generation model sequence, but the number of time steps needed to reach the end point model from the start point model is unknown. It is therefore difficult to know in advance how many times the forward calculation should be performed, and a calculation end condition is needed to end it.

計算終了条件としては、学習モデル#1ないし#Nのうちの、終点モデルに至るまでの接続コストの累積値δ_goal(t)が、閾値δ_th以下になったこと（式δ_goal(t)≦δ_thが満たされること）が採用される。 The calculation end condition adopted is that the cumulative value δ_goal(t) of the connection cost up to the end point model, among the learning models #1 to #N, has become equal to or less than a threshold δ_th (that is, the expression δ_goal(t) ≦ δ_th is satisfied).

ここで、累積値ベクトルd(t)の初期化では、接続コストの累積値δ₁(1)ないしδ_N(1)のうちの、始点モデルの接続コストの累積値が、0とされ、始点モデル以外の学習モデルの接続コストの累積値は、接続不可能値c_infとされる。 Here, in the initialization of the cumulative value vector d(t), among the cumulative values δ₁(1) to δ_N(1) of the connection cost, the cumulative value for the start point model is set to 0, and the cumulative values for the learning models other than the start point model are set to the connection impossible value c_inf.
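
As a non-limiting illustration, the initialization of the cumulative value vector might be sketched as follows. The function name `init_cumulative`, the 0-based model index, and the concrete value of `C_INF` are hypothetical choices made only for this sketch.

```python
# Hypothetical finite stand-in for the connection impossible value c_inf.
C_INF = 1e9


def init_cumulative(n_models, start):
    """Return the cumulative value vector at time t = 1.

    The start point model (0-based index `start`) gets 0; every other
    learning model gets the connection impossible value.
    """
    return [0.0 if j == start else C_INF for j in range(n_models)]
```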

したがって、例えば、終点モデルに至るまでの系列情報が表す学習モデルの系列（並び）のうちの最初の学習モデル（時刻t=1の状態に対応する学習モデル）が、始点モデルになっていない場合には、終点モデルに至るまでの接続コストの累積値は、接続不可能値c_inf以上の値となる。 Therefore, for example, when the first learning model (the learning model corresponding to the state at time t=1) in the sequence (arrangement) of learning models represented by the sequence information up to the end point model is not the start point model, the cumulative value of the connection cost up to the end point model is equal to or greater than the connection impossible value c_inf.

一方、終点モデルに至るまでの系列情報が表す学習モデルの系列のうちの最初の学習モデルが、始点モデルになった場合、すなわち、始点モデルから終点モデルまでの学習モデルの並びとして、接続コストの累積値を小にする適切な学習モデルの並びが得られた場合、終点モデルに至るまでの接続コストの累積値は、接続不可能値c_infよりも十分小さいコネクティビティc_ijの累積値となって、接続不可能値c_infより小さな値となる。 On the other hand, when the first learning model in the sequence of learning models represented by the sequence information up to the end point model is the start point model, that is, when an appropriate sequence of learning models from the start point model to the end point model that keeps the cumulative value of the connection cost small has been obtained, the cumulative value of the connection cost up to the end point model is an accumulation of connectivities c_ij that are each sufficiently smaller than the connection impossible value c_inf, and is therefore smaller than the connection impossible value c_inf.

したがって、終点モデルに至るまでの系列情報が表す学習モデルの系列のうちの最初の学習モデルが、始点モデルになる場合の、終点モデルに至るまでの接続コストの累積値の一般的な値（例えば、平均値等）より大で、かつ、接続不可能値c_infより小さい値を、閾値δ_thとして採用し、式δ_goal(t)≦δ_thで表される計算終了条件を判定することにより、始点モデルから終点モデルまでの、適切な学習モデルの並び、すなわち、生成用モデルシーケンスを得ることができる。 Therefore, a value that is larger than a typical value (for example, the average) of the cumulative connection cost up to the end point model in the case where the first learning model in the sequence represented by the sequence information up to the end point model is the start point model, and that is smaller than the connection impossible value c_inf, is adopted as the threshold δ_th. By evaluating the calculation end condition expressed by δ_goal(t) ≦ δ_th, an appropriate sequence of learning models from the start point model to the end point model, that is, the generation model sequence, can be obtained.

なお、閾値δ_thは、シミュレーション等によって求められる。また、閾値δ_thとしては、固定の値を採用することもできるし、可変の値を採用することもできる。可変な値の閾値δ_thとしては、接続コストの累積回数（式（２）による前向き計算時の時刻t）に応じて増加する値等を採用することができる。 The threshold value δ _th is obtained by simulation or the like. Further, as the threshold value δ _th , a fixed value can be adopted, or a variable value can be adopted. As the variable value threshold δ _th , a value that increases according to the cumulative number of connection costs (time t during forward calculation according to equation (2)) can be employed.
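
As a non-limiting illustration, the calculation end condition with a variable threshold might be sketched as follows. The linear form of the threshold and the constants `base`, `slope`, and `C_INF` are assumptions introduced for this sketch; the embodiment states only that the threshold may increase with the time t and is obtained by simulation or the like.

```python
# Hypothetical finite stand-in for the connection impossible value c_inf.
C_INF = 1e9


def threshold(t, base=10.0, slope=2.0):
    """Variable threshold that increases with time t (assumed linear form)."""
    return base + slope * t


def should_stop(delta_goal, t):
    """Return True when delta_goal(t) <= delta_th(t) holds.

    The threshold must stay below the connection impossible value so that
    a route that does not start at the start point model never passes.
    """
    delta_th = threshold(t)
    if delta_th >= C_INF:
        raise ValueError("threshold must be smaller than C_INF")
    return delta_goal <= delta_th
```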

生成用モデルシーケンス算出部２５は、前向き計算の開始後、計算終了条件が満たされると、前向き計算を終了し、バックトラック処理を行うことで、生成用モデルシーケンスを求める。 When the calculation end condition is satisfied after the start of the forward calculation, the generation model sequence calculation unit 25 ends the forward calculation and performs backtrack processing to obtain the generation model sequence.

すなわち、生成用モデルシーケンス算出部２５は、上述したように、前向き計算において、学習モデル#1ないし#Nそれぞれについて、時刻ごとに最小接続元情報を記憶する。 That is, as described above, the generation model sequence calculation unit 25 stores the minimum connection source information for each learning model # 1 to #N in the forward calculation for each time.

バックトラック処理では、生成用モデルシーケンス算出部２５は、終点モデルから、時刻を遡る方向に、最小接続元情報を、１時刻ずつ、始点モデルまで辿っていく。そして、生成用モデルシーケンス算出部２５は、最小接続元情報を、辿った順の逆の順番に並び替えることで、時刻順の並びとし、その時刻順の並びの最小接続元情報が表す、始点モデルから終点モデルまでの学習モデルの並びを、生成用モデルシーケンスとして求める。 In the backtrack process, the generation model sequence calculation unit 25 traces the minimum connection source information from the end point model to the start point model one time at a time in the direction of going back in time. Then, the generation model sequence calculation unit 25 rearranges the minimum connection source information in the reverse order of the traced order so that the order is in the order of time, and the start point represented by the minimum connection source information in the order of the time order The sequence of learning models from the model to the end point model is obtained as a generating model sequence.
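
As a non-limiting illustration, the backtrack process might be sketched as follows. The data layout `sources[t][j]` (the minimum connection source stored at each forward step, with 0-based model indices) and the function name `backtrack` are hypothetical.

```python
def backtrack(sources, goal):
    """Recover the generation model sequence from stored source information.

    sources[t][j] : model at step t chosen as the minimum connection source
                    for model j at step t + 1
    goal          : index of the end point model at the final step
    """
    path = [goal]
    # Follow the minimum connection source information backwards in time.
    for src in reversed(sources):
        path.append(src[path[-1]])
    # Reverse the traced order to obtain the time-ordered sequence.
    path.reverse()
    return path
```

With two stored steps `[[0, 0, 0], [2, 0, 1]]` and end point model 2, the traced order is 2, 1, 0, so the time-ordered sequence is 0, 1, 2.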

なお、生成用モデルシーケンスは、時系列データ生成部２６で時系列データの生成に用いられる学習モデルの順番を表す。したがって、生成用モデルシーケンスは、時系列データの生成に用いる学習モデルの順番のプランということができる。 The generation model sequence represents the order of learning models used for generating time-series data by the time-series data generating unit 26. Therefore, it can be said that the generation model sequence is a plan of the order of learning models used for generating time-series data.

［生成用モデルシーケンスを用いた時系列データの生成］ [Generation of time-series data using the generation model sequence]
図１２を参照して、時系列データ生成部２６による、生成用モデルシーケンスを用いた時系列データ（生成時系列データ）の生成について説明する。 With reference to FIG. 12, generation of time-series data (generation time-series data) using the generation model sequence by the time-series data generation unit 26 will be described.

図１２は、生成用モデルシーケンスが、４つの構成モデル（学習モデル）#1ないし#4の並びである場合に、その生成用モデルシーケンスを用いて生成される生成時系列データを示している。 FIG. 12 shows generation time-series data generated using the generation model sequence when the generation model sequence is an arrangement of four constituent models (learning models) # 1 to # 4.

時系列データ生成部２６は、生成用モデルシーケンスを構成する構成モデル#1ないし#4について、コネクティビティを算出する場合と同様の、モデル生成データのオーバラップ部分の順伝播と逆伝播を繰り返す。これにより、時系列データ生成部２６は、隣接する構成モデル#kと#k+1それぞれが生成するモデル生成データ#kと#k+1どうしを、なるべく繋がりやすくする、構成モデル#1ないし#4それぞれの初期コンテキスト（最適初期コンテキスト）を求める。 For the configuration models #1 to #4 constituting the generation model sequence, the time-series data generation unit 26 repeats forward propagation and back propagation of the overlap portions of the model generation data, as in the calculation of the connectivity. In this way, the time-series data generation unit 26 obtains, for each of the configuration models #1 to #4, an initial context (optimal initial context) that makes the model generation data #k and #k+1 generated by the adjacent configuration models #k and #k+1 connect to each other as easily as possible.

そして、時系列データ生成部２６は、最適初期コンテキストを、構成モデル#1ないし#4に与えて、構成モデル#1ないし#4からモデル生成データ#1ないし#4を生成し、そのモデル生成データ#1ないし#4を接続することで、生成時系列データを生成する。 Then, the time-series data generation unit 26 gives the optimal initial contexts to the configuration models #1 to #4, generates model generation data #1 to #4 from the configuration models #1 to #4, and connects the model generation data #1 to #4 to generate the generation time-series data.

すなわち、時系列データ生成部２６は、まず、生成用モデルシーケンスを構成する始点モデルである構成モデル#1の入力データの最初の１サンプルとして、構成モデル#1に割り当てられたモデル学習用データ#1の最初の１サンプルを設定する。 That is, the time-series data generation unit 26 first sets the first sample of the model learning data #1 assigned to the configuration model #1 as the first sample of the input data of the configuration model #1, which is the start point model of the generation model sequence.

さらに、時系列データ生成部２６は、生成用モデルシーケンスを構成する終点モデルである構成モデル#4の出力データの最後の１サンプルの真値として、構成モデル#4に割り当てられたモデル学習用データ#4の最後の１サンプルを設定する。 Further, the time series data generation unit 26 uses the model learning data assigned to the configuration model # 4 as the true value of the last one sample of the output data of the configuration model # 4 that is the end point model constituting the generation model sequence. Set the last sample of # 4.

また、時系列データ生成部２６は、生成用モデルシーケンスを構成する構成モデル#1ないし#4のそれぞれの初期コンテキストとして、ランダムな値を設定する。 In addition, the time-series data generation unit 26 sets a random value as the initial context of each of the configuration models # 1 to # 4 constituting the generation model sequence.

そして、時系列データ生成部２６は、始点モデルである構成モデル#1に、入力データと初期コンテキストを与えて、Sサンプルのモデル生成データ#1を生成する。 Then, the time-series data generation unit 26 gives the input data and the initial context to the configuration model # 1 that is the start point model, and generates model generation data # 1 of S samples.

始点モデルである構成モデル#1から、Sサンプルのモデル生成データ#1を生成した後、時系列データ生成部２６は、そのモデル生成データ#1の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#2の入力データの最初のLサンプルとして設定する。 After generating S-sample model generation data # 1 from the configuration model # 1 that is the starting point model, the time-series data generation unit 26 immediately follows the L sample that is the last overlap portion of the model generation data # 1. Set as the first L sample of the input data of configuration model # 2.

そして、時系列データ生成部２６は、構成モデル#2に、入力データと初期コンテキストを与えて、Sサンプルのモデル生成データ#2を生成する。 Then, the time-series data generation unit 26 gives input data and an initial context to the configuration model # 2, and generates model generation data # 2 of S samples.

その後、時系列データ生成部２６は、構成モデル#2から生成されたモデル生成データ#2の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#3の入力データの最初のLサンプルとして設定する。 Thereafter, the time-series data generation unit 26 uses the L sample that is the last overlap part of the model generation data # 2 generated from the configuration model # 2 as the first L sample of the input data of the immediately subsequent configuration model # 3. Set.

そして、時系列データ生成部２６は、構成モデル#3に、入力データと初期コンテキストを与えて、Sサンプルのモデル生成データ#3を生成する。 Then, the time-series data generation unit 26 gives input data and initial context to the configuration model # 3, and generates model generation data # 3 of S samples.

さらに、構成モデル#3から生成されたモデル生成データ#3の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#4の入力データの最初のLサンプルとして設定する。 Further, the L sample which is the last overlap part of the model generation data # 3 generated from the configuration model # 3 is set as the first L sample of the input data of the immediately subsequent configuration model # 4.

そして、時系列データ生成部２６は、構成モデル#4に、入力データと初期コンテキストを与えて、Sサンプルのモデル生成データ#4を生成する。 Then, the time-series data generation unit 26 gives input data and initial context to the configuration model # 4, and generates model generation data # 4 of S samples.
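
As a non-limiting illustration, the forward hand-off of overlap portions described above might be sketched as follows. Each configuration model is represented here by a hypothetical callable `model(seed, context)` returning S scalar samples; `toy_model` is a stand-in used only to make the sketch runnable and is not the learning model of the embodiment.

```python
def forward_pass(models, contexts, first_sample, L):
    """Generate model generation data from the start point model onward.

    The last L samples (overlap portion) of each model's output become the
    first L samples of the input data of the immediately following model.
    """
    outputs = []
    seed = [first_sample]
    for model, context in zip(models, contexts):
        data = model(seed, context)
        outputs.append(data)
        seed = data[-L:]  # overlap portion handed to the next model
    return outputs


def toy_model(seed, context, S=5):
    """Hypothetical stand-in model: extends the seed by adding `context`."""
    out = list(seed)
    while len(out) < S:
        out.append(out[-1] + context)
    return out[:S]
```

For example, with two toy models, contexts 1 and 2, first sample 0, and L = 2, the first output is `[0, 1, 2, 3, 4]` and the second output starts from the handed-over samples `[3, 4]`.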

以上のように、時系列データ生成部２６は、終点モデルである構成モデル#4から、モデル生成データ#4を生成すると、そのモデル生成データ#4の最後のサンプルの、構成モデル#4の出力データの最後の１サンプルの真値（上述したように、構成モデル#4に割り当てられたモデル学習用データ#4の最後の１サンプル）に対する予測誤差を求める。 As described above, when generating the model generation data # 4 from the configuration model # 4 that is the end point model, the time series data generation unit 26 outputs the configuration model # 4 of the last sample of the model generation data # 4. A prediction error for the true value of the last one sample of data (the last one sample of model learning data # 4 assigned to the constituent model # 4 as described above) is obtained.

そして、時系列データ生成部２６は、モデル生成データ#4の最後の１サンプルの予測誤差を、例えば、BPTT法に基づき、モデル生成データ#4の最初の１サンプルまで逆伝播（誤差の逆伝播）することで、その予測誤差を小さくするように、終点モデルである構成モデル#4の初期コンテキストを更新する。 Then, the time-series data generation unit 26 back-propagates the prediction error of the last sample of the model generation data #4 to the first sample of the model generation data #4 (error back propagation), based on, for example, the BPTT method, and thereby updates the initial context of the configuration model #4, which is the end point model, so as to reduce the prediction error.

構成モデル#4の初期コンテキストの更新後、時系列データ生成部２６は、構成モデル#4に、入力データ（上述したように、直前の構成モデル#3から生成されたモデル生成データ#3の最後のオーバラップ部分であるLサンプル）と、更新後の初期コンテキストを与えて、Sサンプルのモデル生成データ#4を生成する。 After updating the initial context of the configuration model #4, the time-series data generation unit 26 gives the configuration model #4 its input data (as described above, the L samples that form the last overlap portion of the model generation data #3 generated from the immediately preceding configuration model #3) and the updated initial context, and generates model generation data #4 of S samples.

さらに、時系列データ生成部２６は、構成モデル#4から生成されたモデル生成データ#4の最初のオーバラップ部分であるLサンプルを、直前の構成モデル#3の最後のLサンプルの真値として設定する。 Further, the time-series data generation unit 26 sets the L sample that is the first overlap portion of the model generation data # 4 generated from the configuration model # 4 as the true value of the last L sample of the immediately previous configuration model # 3. Set.

その後、時系列データ生成部２６は、構成モデル#3から生成されたモデル生成データ#3の最後のLサンプルの、構成モデル#3の出力データの最後のLサンプルの真値（上述したように、初期コンテキストの更新後の学習モデル#4から生成されたモデル生成データ#4の最初のオーバラップ部分であるLサンプル）に対する予測誤差を求める。 Thereafter, the time-series data generation unit 26 obtains the prediction error of the last L samples of the model generation data #3 generated from the configuration model #3 with respect to the true value of the last L samples of the output data of the configuration model #3 (as described above, the L samples that form the first overlap portion of the model generation data #4 generated from the learning model #4 after its initial context was updated).

そして、時系列データ生成部２６は、モデル生成データ#3の最後のLサンプルの予測誤差を、例えば、BPTT法に基づき、モデル生成データ#3の最初の１サンプルまで逆伝播（誤差の逆伝播）することで、その予測誤差を小さくするように、構成モデル#3の初期コンテキストを更新する。 Then, the time-series data generation unit 26 back-propagates the prediction error of the last L samples of the model generation data #3 to the first sample of the model generation data #3 (error back propagation), based on, for example, the BPTT method, and thereby updates the initial context of the configuration model #3 so as to reduce the prediction error.

その後、時系列データ生成部２６は、構成モデル#3から生成されたモデル生成データ#3の最初のオーバラップ部分であるLサンプルを、直前の構成モデル#2の最後のLサンプルの真値として設定する。 Thereafter, the time-series data generation unit 26 uses the L sample that is the first overlap portion of the model generation data # 3 generated from the configuration model # 3 as the true value of the last L sample of the immediately previous configuration model # 2. Set.

さらに、時系列データ生成部２６は、構成モデル#2から生成されたモデル生成データ#2の最後のLサンプルの、構成モデル#2の出力データの最後のLサンプルの真値（上述したように、初期コンテキストの更新後の学習モデル#3から生成されたモデル生成データ#3の最初のオーバラップ部分であるLサンプル）に対する予測誤差を求める。 Further, the time-series data generation unit 26 obtains the prediction error of the last L samples of the model generation data #2 generated from the configuration model #2 with respect to the true value of the last L samples of the output data of the configuration model #2 (as described above, the L samples that form the first overlap portion of the model generation data #3 generated from the learning model #3 after its initial context was updated).

そして、時系列データ生成部２６は、モデル生成データ#2の最後のLサンプルの予測誤差を、例えば、BPTT法に基づき、モデル生成データ#2の最初の１サンプルまで逆伝播することで、その予測誤差を小さくするように、構成モデル#2の初期コンテキストを更新する。 Then, the time series data generation unit 26 propagates the prediction error of the last L sample of the model generation data # 2 back to the first one sample of the model generation data # 2 based on, for example, the BPTT method. The initial context of the configuration model # 2 is updated so as to reduce the prediction error.

その後、時系列データ生成部２６は、構成モデル#2から生成されたモデル生成データ#2の最初のオーバラップ部分であるLサンプルを、直前の構成モデル#1の最後のLサンプルの真値として設定する。 Thereafter, the time-series data generation unit 26 sets the L sample that is the first overlap portion of the model generation data # 2 generated from the configuration model # 2 as the true value of the last L sample of the immediately previous configuration model # 1. Set.

さらに、時系列データ生成部２６は、構成モデル#1から生成されたモデル生成データ#1の最後のLサンプルの、構成モデル#1の出力データの最後のLサンプルの真値（上述したように、初期コンテキストの更新後の学習モデル#2から生成されたモデル生成データ#2の最初のオーバラップ部分であるLサンプル）に対する予測誤差を求める。 Further, the time-series data generation unit 26 obtains the prediction error of the last L samples of the model generation data #1 generated from the configuration model #1 with respect to the true value of the last L samples of the output data of the configuration model #1 (as described above, the L samples that form the first overlap portion of the model generation data #2 generated from the learning model #2 after its initial context was updated).

そして、時系列データ生成部２６は、モデル生成データ#1の最後のLサンプルの予測誤差を、例えば、BPTT法に基づき、モデル生成データ#1の最初の１サンプルまで逆伝播することで、その予測誤差を小さくするように、構成モデル#1の初期コンテキストを更新する。 Then, the time series data generation unit 26 propagates the prediction error of the last L sample of the model generation data # 1 back to the first one sample of the model generation data # 1 based on, for example, the BPTT method. The initial context of the configuration model # 1 is updated so as to reduce the prediction error.
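
As a non-limiting illustration, the prediction errors used in the backward sweep from the end point model to the start point model might be computed as in the following sketch. It assumes scalar samples and a squared error, and it omits the BPTT-based update of the initial contexts and the regeneration of data after each update; the function name `overlap_errors` is hypothetical.

```python
def overlap_errors(outputs, final_target, L):
    """Compute the prediction error for each configuration model.

    outputs      : list of model generation data, one list per model
    final_target : true value of the last sample of the end point model
    L            : number of samples in an overlap portion
    """
    errors = [0.0] * len(outputs)
    # End point model: error of its last sample against the true value.
    errors[-1] = (outputs[-1][-1] - final_target) ** 2
    # Earlier models: the first L samples of the following model's data act
    # as the true value of the last L samples of this model's data.
    for k in range(len(outputs) - 2, -1, -1):
        tail = outputs[k][-L:]
        head = outputs[k + 1][:L]
        errors[k] = sum((a - b) ** 2 for a, b in zip(tail, head))
    return errors
```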

以上のように、終点モデルである構成モデル#4から、始点モデルである構成モデル#1までの初期コンテキストの更新が終了すると、時系列データ生成部２６は、構成モデル#1に、入力データ（上述したように、始点モデルである構成モデル#1に割り当てられたモデル学習用データ#1の最初の１サンプル）と、更新後の初期コンテキストを与えて、Sサンプルのモデル生成データ#1を生成する。 As described above, when the update of the initial contexts from the configuration model #4, which is the end point model, back to the configuration model #1, which is the start point model, is completed, the time-series data generation unit 26 gives the configuration model #1 its input data (as described above, the first sample of the model learning data #1 assigned to the configuration model #1, which is the start point model) and the updated initial context, and generates model generation data #1 of S samples.

さらに、時系列データ生成部２６は、始点モデルである構成モデル#1から生成されたモデル生成データ#1の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#2の入力データの最初のLサンプルとして設定し、以下、同様の処理を繰り返す。 Further, the time-series data generation unit 26 sets the L samples that form the last overlap portion of the model generation data #1 generated from the configuration model #1, which is the start point model, as the first L samples of the input data of the immediately following configuration model #2, and thereafter repeats the same processing.

そして、時系列データ生成部２６は、例えば、生成用モデルシーケンスを構成する構成モデル#1ないし#4それぞれで得られる予測誤差が収束すると、そのとき得られている初期コンテキストを、構成モデル#1ないし#4それぞれの最適初期コンテキストとする。 Then, for example, when the prediction error obtained in each of the configuration models # 1 to # 4 constituting the generation model sequence converges, the time series data generation unit 26 uses the initial context obtained at that time as the configuration model # 1. Or # 4 as the optimal initial context for each.

その後、時系列データ生成部２６は、始点モデルである構成モデル#1に、入力データとして、現在データを与えるとともに、最適初期コンテキストを与えて、モデル生成データ#1を生成する。 Thereafter, the time-series data generation unit 26 gives the current data as input data, together with the optimal initial context, to the configuration model #1, which is the start point model, and generates model generation data #1.

そして、時系列データ生成部２６は、構成モデル#1から生成されたモデル生成データ#1の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#2の入力データの最初のLサンプルとして設定する。 Then, the time-series data generation unit 26 uses the L sample which is the last overlap part of the model generation data # 1 generated from the configuration model # 1 as the first L sample of the input data of the immediately subsequent configuration model # 2. Set.

さらに、時系列データ生成部２６は、構成モデル#2に、入力データと、最適初期コンテキストとを与えて、モデル生成データ#2を生成する。 Furthermore, the time-series data generation unit 26 generates model generation data # 2 by giving input data and an optimal initial context to the configuration model # 2.

そして、時系列データ生成部２６は、構成モデル#2から生成されたモデル生成データ#2の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#3の入力データの最初のLサンプルとして設定する。 Then, the time-series data generation unit 26 uses the L sample that is the last overlap part of the model generation data # 2 generated from the configuration model # 2 as the first L sample of the input data of the immediately subsequent configuration model # 3. Set.

さらに、時系列データ生成部２６は、構成モデル#3に、入力データと、最適初期コンテキストとを与えて、モデル生成データ#3を生成する。 Further, the time-series data generation unit 26 generates model generation data # 3 by giving input data and an optimal initial context to the configuration model # 3.

そして、時系列データ生成部２６は、構成モデル#3から生成されたモデル生成データ#3の最後のオーバラップ部分であるLサンプルを、直後の構成モデル#4の入力データの最初のLサンプルとして設定する。 Then, the time-series data generation unit 26 uses the L sample that is the last overlap part of the model generation data # 3 generated from the configuration model # 3 as the first L sample of the input data of the immediately subsequent configuration model # 4. Set.

さらに、時系列データ生成部２６は、構成モデル#4に、入力データと、最適初期コンテキストとを与えて、モデル生成データ#4を生成する。 Furthermore, the time-series data generation unit 26 generates model generation data # 4 by giving input data and an optimal initial context to the configuration model # 4.

以上のように、生成用モデルシーケンスを構成する構成モデル#1ないし#4のそれぞれに、最適初期コンテキストを与えて、モデル生成データ#1ないし#4が生成されると、時系列データ生成部２６は、そのモデル生成データ#1ないし#4を接続して、生成時系列データを生成する。 As described above, when the model generation data # 1 to # 4 is generated by giving the optimum initial context to each of the configuration models # 1 to # 4 constituting the generation model sequence, the time-series data generation unit 26 Connects the model generation data # 1 to # 4 to generate generation time-series data.

すなわち、時系列データ生成部２６は、例えば、構成モデル#kから生成されたモデル生成データ#kの後に、直後の構成モデル#k+1から生成されたモデル生成データ#k+1の最初のオーバラップ部分より後のサンプル（モデル生成データ#k+1の先頭からL+1サンプル以降のサンプル）を接続することで、生成時系列データを生成する。 That is, the time-series data generation unit 26 generates the generation time-series data by, for example, connecting, after the model generation data #k generated from the configuration model #k, the samples of the model generation data #k+1 generated from the immediately following configuration model #k+1 that come after its first overlap portion (the samples from the (L+1)-th sample onward, counted from the head of the model generation data #k+1).
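
As a non-limiting illustration, the connection of the model generation data might be sketched as follows. The function name `connect` and the representation of each model generation data as a Python list are hypothetical.

```python
def connect(outputs, L):
    """Connect model generation data into one generation time series.

    The first model's data is used in full; for every following model, the
    first L samples (the overlap portion) are dropped before appending.
    """
    series = list(outputs[0])
    for data in outputs[1:]:
        series.extend(data[L:])  # samples from the (L+1)-th sample onward
    return series
```

For example, with L = 2, connecting `[0, 1, 2, 3, 4]` and `[3, 4, 6, 8, 10]` yields `[0, 1, 2, 3, 4, 6, 8, 10]`.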

［データ生成装置２０の動作］ [Operation of Data Generation Device 20]
図１３を参照して、データ生成装置２０の処理（データ生成処理）について説明する。 With reference to FIG. 13, the process (data generation process) of the data generation device 20 will be described.

データ生成装置２０では、ステップＳ６１において、現在データ供給部２１、目標データ供給部２２、始点モデル選択部２３、終点モデル選択部２４、及び、生成用モデルシーケンス算出部２５が、生成用モデルシーケンスを算出する算出処理を行う。 In the data generation device 20, in step S61, the current data supply unit 21, the target data supply unit 22, the start point model selection unit 23, the end point model selection unit 24, and the generation model sequence calculation unit 25 generate the generation model sequence. A calculation process for calculating is performed.

さらに、ステップＳ６１では、生成用モデルシーケンス算出部２５が、生成用モデルシーケンスの算出処理において得られる生成用モデルシーケンスを、時系列データ生成部２６に供給して、処理は、ステップＳ６２に進む。 Further, in step S61, the generation model sequence calculation unit 25 supplies the generation model sequence obtained in the generation model sequence calculation process to the time-series data generation unit 26, and the process proceeds to step S62.

ステップＳ６２では、時系列データ生成部２６が、生成用モデルシーケンス算出部２５からの生成用モデルシーケンスを用いて、生成時系列データを生成し、時系列データ出力部２７に供給する時系列データ生成処理を行い、処理は、ステップＳ６３に進む。 In step S 62, the time series data generation unit 26 generates generation time series data using the generation model sequence from the generation model sequence calculation unit 25 and supplies the time series data output unit 27 to the time series data output unit 27. A process is performed, and the process proceeds to step S63.

ステップＳ６３では、時系列データ出力部２７が、時系列データ生成部２６からの生成時系列データを、図１のデータ処理装置が制御するロボットに出力して、データ生成処理は終了する。 In step S63, the time-series data output unit 27 outputs the generated time-series data from the time-series data generation unit 26 to the robot controlled by the data processing apparatus in FIG. 1, and the data generation process ends.

図１のデータ処理装置が制御するロボットは、時系列データ出力部２７からの生成時系列データ（センサモータデータ）のコンポーネントのうちのアクションデータに従って駆動する。これにより、ロボットは、所定の行動、すなわち、ロボットでセンシングされるセンサデータとして、現在データが得られている状態から、目標データが得られる状態となるのに適切な行動をとる。 The robot controlled by the data processing apparatus of FIG. 1 is driven according to the action data among the components of the generation time-series data (sensor motor data) from the time-series data output unit 27. As a result, the robot takes a predetermined action, that is, an action appropriate for moving from the state in which the current data is obtained as the sensor data sensed by the robot to a state in which the target data is obtained.

［生成用モデルシーケンスの算出処理］ [Generation model sequence calculation process]
図１４を参照して、図１３のステップＳ６１で行われる、生成用モデルシーケンスの算出処理について説明する。 With reference to FIG. 14, the generation model sequence calculation process performed in step S61 of FIG. 13 will be described.

ステップＳ７１において、現在データ供給部２１は、現在データを、始点モデル選択部２３、及び、時系列データ生成部２６に供給して、処理は、ステップＳ７２に進む。 In step S71, the current data supply unit 21 supplies the current data to the start point model selection unit 23 and the time series data generation unit 26, and the process proceeds to step S72.

ステップＳ７２では、始点モデル選択部２３は、現在データ供給部２１からの現在データを入力データとし、モデルパラメータ保存部１５にモデルパラメータが記憶されたN個の学習モデル#1ないし#Nのそれぞれから、現在データの予測値であるモデル生成データ#1ないし#Nを生成（認識生成）する。 In step S72, the start point model selection unit 23 uses the current data from the current data supply unit 21 as input data, and from each of the N learning models # 1 to #N whose model parameters are stored in the model parameter storage unit 15. Then, model generation data # 1 to #N, which are predicted values of the current data, are generated (recognized and generated).

そして、処理は、ステップＳ７２からステップＳ７３に進み、始点モデル選択部２３は、モデル生成データ#1ないし#Nそれぞれの、現在データの予測値の予測誤差を求める。さらに、始点モデル選択部２３は、N個の学習モデル#1ないし#Nのうちの、予測誤差が小さい、例えば、上位１個の学習モデルを始点モデルとして選択し、処理は、ステップＳ７３からステップＳ７４に進む。 Then, the process proceeds from step S72 to step S73, and the start point model selection unit 23 obtains a prediction error of the prediction value of the current data for each of the model generation data # 1 to #N. Furthermore, the start point model selection unit 23 selects, for example, the top one learning model having a small prediction error from among the N learning models # 1 to #N as the start point model, and the processing is performed from step S73 to step S73. Proceed to S74.
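
As a non-limiting illustration, the selection of the start point model in steps S72 and S73 might be sketched as follows. The sketch assumes scalar samples and a mean squared error as the prediction error; the function name `select_model` and the 0-based model index are hypothetical. The same selection could be applied to the target data to choose the end point model.

```python
def select_model(observed, predictions):
    """Select the learning model with the smallest prediction error.

    observed    : the current data (list of scalar samples)
    predictions : predictions[n] is the model generation data of model n,
                  that is, its predicted values of the current data
    Returns (best_index, errors).
    """
    def mse(pred):
        return sum((p - o) ** 2 for p, o in zip(pred, observed)) / len(observed)

    errors = [mse(pred) for pred in predictions]
    best = min(range(len(errors)), key=errors.__getitem__)
    return best, errors
```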

ステップＳ７４では、目標データ供給部２２は、目標データを、終点モデル選択部２４に供給して、処理は、ステップＳ７５に進む。 In step S74, the target data supply unit 22 supplies the target data to the end point model selection unit 24, and the process proceeds to step S75.

ステップＳ７５では、終点モデル選択部２４は、目標データ供給部２２からの目標データを入力データとし、モデルパラメータ保存部１５にモデルパラメータが記憶されたN個の学習モデル#1ないし#Nのそれぞれから、目標データの予測値であるモデル生成データ#1ないし#Nを生成（認識生成）する。 In step S75, the end point model selection unit 24 uses the target data from the target data supply unit 22 as input data, and from each of the N learning models # 1 to #N whose model parameters are stored in the model parameter storage unit 15. Then, model generation data # 1 to #N, which are predicted values of the target data, are generated (recognized and generated).

そして、処理は、ステップＳ７５からステップＳ７６に進み、終点モデル選択部２４は、モデル生成データ#1ないし#Nそれぞれの、目標データの予測値の予測誤差を求める。さらに、終点モデル選択部２４は、N個の学習モデル#1ないし#Nのうちの、予測誤差が小さい、例えば、上位１個の学習モデルを終点モデルとして選択し、処理は、ステップＳ７６からステップＳ７７に進む。 Then, the process proceeds from step S75 to step S76, and the end point model selection unit 24 obtains the prediction error of the predicted value of the target data for each of the model generation data # 1 to #N. Further, the end point model selection unit 24 selects, for example, the top one learning model having a small prediction error among the N learning models # 1 to #N as the end point model, and the processing is performed from step S76 to step S76. Proceed to S77.

ステップＳ７７では、始点モデル選択部２３が、始点モデルの始点モデルIDを、生成用モデルシーケンス算出部２５に供給する。さらに、ステップＳ７７では、終点モデル選択部２４が、終点モデルの終点モデルIDを、生成用モデルシーケンス算出部２５に供給して、処理は、ステップＳ７７からステップＳ７８に進む。 In step S 77, the start point model selection unit 23 supplies the start point model ID of the start point model to the generation model sequence calculation unit 25. Furthermore, in step S77, the end point model selection unit 24 supplies the end point model ID of the end point model to the generation model sequence calculation unit 25, and the process proceeds from step S77 to step S78.

ステップＳ７８では、生成用モデルシーケンス算出部２５が、始点モデル選択部２３からの始点モデルIDによって始点モデルを特定するとともに、終点モデル選択部２４からの終点モデルIDによって終点モデルを特定する。 In step S 78, the generation model sequence calculation unit 25 specifies the start point model based on the start point model ID from the start point model selection unit 23 and specifies the end point model based on the end point model ID from the end point model selection unit 24.

さらに、生成用モデルシーケンス算出部２５は、始点モデルから終点モデルまでの、複数の学習モデルの、ある並びを、生成用モデルシーケンスとして求める。 Further, the generation model sequence calculation unit 25 obtains a certain arrangement of a plurality of learning models from the start point model to the end point model as a generation model sequence.

すなわち、生成用モデルシーケンス算出部２５は、上述したように、コネクティビティ保存部１７に記憶されたコネクティビティに対応する値を、１つの学習モデルの後に、他の１つの学習モデルを接続する接続コストとして、接続コストの累積値を最小にする、始点モデルから終点モデルまでの学習モデルの並びを、生成用モデルシーケンスとして求める。 That is, as described above, the generation model sequence calculation unit 25 uses the value corresponding to the connectivity stored in the connectivity storage unit 17 as the connection cost for connecting one learning model to another learning model. The sequence of learning models from the start point model to the end point model that minimizes the cumulative value of connection costs is obtained as a generation model sequence.

そして、生成用モデルシーケンス算出部２５は、生成用モデルシーケンスを、時系列データ生成部２６に供給して、処理はリターンする。 Then, the generation model sequence calculation unit 25 supplies the generation model sequence to the time series data generation unit 26, and the process returns.

［時系列データ生成処理］ [Time-series data generation process]
図１５ないし図１８を参照して、図１３のステップＳ６２で行われる時系列データ生成処理について説明する。 With reference to FIGS. 15 to 18, the time-series data generation process performed in step S62 of FIG. 13 will be described.

図１５は、時系列データ生成処理を説明するフローチャートである。 FIG. 15 is a flowchart for explaining time-series data generation processing.

時系列データ生成処理では、ステップＳ８１において、時系列データ生成部２６が、生成用モデルシーケンス算出部２５から供給される生成用モデルシーケンスを受信し、処理は、ステップＳ８２に進む。 In the time-series data generation process, in step S81, the time-series data generation unit 26 receives the generation model sequence supplied from the generation model sequence calculation unit 25, and the process proceeds to step S82.

ステップＳ８２では、時系列データ生成部２６は、生成用モデルシーケンスを構成する構成モデルのうちの、始点モデルと終点モデルのそれぞれに割り当てられたモデル学習用データを、モデル学習用データ保存部１３（図１）から読み込み、処理は、ステップＳ８３に進む。 In step S82, the time-series data generation unit 26 reads the model learning data assigned to each of the start point model and the end point model, among the configuration models constituting the generation model sequence, from the model learning data storage unit 13 (FIG. 1), and the process proceeds to step S83.

ステップＳ８３では、時系列データ生成部２６は、生成用モデルシーケンスを構成する構成モデルそれぞれのモデルパラメータを、モデルパラメータ保存部１５（図１）から読み出し、処理は、ステップＳ８４に進む。 In step S83, the time-series data generation unit 26 reads out the model parameters of each of the constituent models constituting the generation model sequence from the model parameter storage unit 15 (FIG. 1), and the process proceeds to step S84.

ステップＳ８４では、時系列データ生成部２６は、始点モデルの入力データの最初の１サンプルとして、始点モデルに割り当てられたモデル学習用データの最初の１サンプルを設定して、処理は、ステップＳ８５に進む。 In step S84, the time-series data generation unit 26 sets the first one sample of the model learning data assigned to the start point model as the first one sample of the input data of the start point model, and the process proceeds to step S85. move on.

ステップＳ８５では、時系列データ生成部２６は、終点モデルの出力データの最後の１サンプルの真値として、終点モデルに割り当てられたモデル学習用データの最後の１サンプルを設定して、処理は、ステップＳ８６に進む。 In step S85, the time-series data generation unit 26 sets the last one sample of the model learning data assigned to the end point model as the true value of the last one sample of the output data of the end point model. Proceed to step S86.

In step S86, the time-series data generation unit 26 generates the constituent models of the generation model sequence by setting their model parameters in learning models (for example, in object-oriented programming terms, by creating instances of the learning model as the constituent models), and the process proceeds to step S87.

In step S87, the time-series data generation unit 26 sets a random value as the initial context of each constituent model of the generation model sequence, and the process proceeds to step S91 of FIG. 16.

That is, FIG. 16 is a flowchart continuing from FIG. 15.

In step S91, the time-series data generation unit 26 selects the start point model, among the constituent models of the generation model sequence, as the model of interest. Also in step S91, it gives the start point model the input data set in step S84 (FIG. 15) and the initial context (at this point, the initial context set in step S87), generates model generation data, and the process proceeds to step S92.

In step S92, the time-series data generation unit 26 newly selects, as the model of interest, the constituent model immediately after the current model of interest in the generation model sequence (hereinafter also called the immediately following model).

The time-series data generation unit 26 then sets, as the first L samples of the input data of the model of interest, the L samples forming the last overlap portion of the model generation data generated from the constituent model immediately before the current model of interest (hereinafter also called the immediately preceding model), and the process proceeds from step S92 to step S93.

In step S93, the time-series data generation unit 26 gives the model of interest the input data set in step S92 (the L samples forming the last overlap portion of the model generation data generated from the immediately preceding model) and the initial context, generates model generation data, and the process proceeds to step S94.

Note that the initial context given to the model of interest in steps S91 and S93 is the initial context as updated in steps S102 and S106 (FIG. 17), described later, if those steps have already been performed, and is the initial context set in step S87 (FIG. 15) if they have not yet been performed.

In step S94, the time-series data generation unit 26 determines whether the model of interest is the end point model. If it is determined in step S94 that the model of interest is not the end point model, the process returns to step S92, and the same processing is repeated.

If it is determined in step S94 that the model of interest is the end point model, that is, if model generation data has been generated from all constituent models of the generation model sequence, the process proceeds to step S101 of FIG. 17.
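
The forward pass of steps S91 to S94 can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `forward_pass` and `make_toy_model` are hypothetical, and the toy model (which copies its inputs and then extrapolates linearly) merely stands in for a trained learning model such as an RNN.

```python
def make_toy_model(step, S):
    """Toy stand-in for a trained learning model (an assumption for illustration).

    It copies the given input samples and then extrapolates linearly until
    S samples exist. The context shifts the slope.
    """
    def model(inputs, context):
        out = list(inputs)
        while len(out) < S:
            out.append(out[-1] + step + context)
        return out[:S]
    return model


def forward_pass(models, first_input, contexts, L):
    """Generate data model by model, from the start point model to the end point model.

    The last L samples (the last overlap portion) generated by one model
    become the first L input samples of the immediately following model.
    """
    outputs = []
    inputs = list(first_input)
    for model, context in zip(models, contexts):
        generated = model(inputs, context)
        outputs.append(generated)
        inputs = generated[-L:]  # hand over the last overlap portion
    return outputs


models = [make_toy_model(1.0, 4), make_toy_model(2.0, 4)]
print(forward_pass(models, [0.0], [0.0, 0.0], L=2))
```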

That is, FIG. 17 is a flowchart continuing from FIG. 16.

In step S101, the time-series data generation unit 26 obtains the prediction error of the last sample of the model generation data generated from the end point model with respect to the true value set in step S85 (FIG. 15), and the process proceeds to step S102.

In step S102, the time-series data generation unit 26 back-propagates the prediction error obtained in step S101, based on the BPTT (back-propagation through time) method, to the first sample of the model generation data generated from the end point model, thereby updating the initial context of the end point model so as to reduce the prediction error, and the process proceeds to step S103.

In step S103, the time-series data generation unit 26 selects the end point model as the model of interest. Also in step S103, it gives the end point model, as the model of interest, the input data set in step S92 (FIG. 16) and the initial context as updated in step S102, and generates model generation data.

The process then proceeds from step S103 to step S104, where the time-series data generation unit 26 newly selects, as the model of interest, the model immediately preceding the current model of interest. Also in step S104, it sets the L samples of the first overlap portion of the model generation data generated from the immediately following model as the true value of the last L samples of the model of interest, and the process proceeds to step S105.

In step S105, the time-series data generation unit 26 obtains the prediction error of the last L samples of the model generation data generated from the model of interest with respect to the true value set in step S104 (the L samples of the first overlap portion of the model generation data generated from the immediately following model after its initial context was updated), and the process proceeds to step S106.

In step S106, the time-series data generation unit 26 back-propagates the prediction error obtained in step S105, based on, for example, the BPTT method, to the first sample of the model generation data generated from the model of interest, thereby updating the initial context of the model of interest so as to reduce the prediction error, and the process proceeds to step S107.

In step S107, the time-series data generation unit 26 determines whether the model of interest is the start point model. If it is determined in step S107 that the model of interest is not the start point model, the process returns to step S104, and the same processing is repeated.
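
The idea of adjusting an initial context so that the tail of a model's output approaches given true values can be sketched as follows. This is only an illustration under stated assumptions: the text uses the BPTT method to obtain the gradient, whereas this sketch swaps in a finite-difference gradient with a backtracking step size, and the linear recurrent model `generate` and all names here are hypothetical.

```python
def generate(x0, c0, S=5):
    """Toy linear recurrent model (an assumption): S samples from input x0 and context c0."""
    xs, x, c = [], x0, c0
    for _ in range(S):
        xs.append(x)
        c = 0.5 * c + 0.1 * x   # context update
        x = 0.9 * x + c         # next output sample
    return xs


def tail_error(x0, c0, target_tail):
    """Squared error between the last samples of the output and their true values."""
    tail = generate(x0, c0)[-len(target_tail):]
    return sum((a - b) ** 2 for a, b in zip(tail, target_tail))


def update_context(x0, c0, target_tail, eps=1e-6):
    """One update of the initial context that never increases the tail error."""
    base = tail_error(x0, c0, target_tail)
    # Finite-difference gradient, used here in place of BPTT.
    grad = (tail_error(x0, c0 + eps, target_tail)
            - tail_error(x0, c0 - eps, target_tail)) / (2 * eps)
    step = 1.0
    for _ in range(40):  # backtracking: halve the step until the error drops
        candidate = c0 - step * grad
        if tail_error(x0, candidate, target_tail) < base:
            return candidate
        step *= 0.5
    return c0  # no improving step found; keep the current context


target = generate(1.0, 0.3)[-2:]           # true values for the last L = 2 samples
context = update_context(1.0, 0.0, target)
print(tail_error(1.0, 0.0, target), tail_error(1.0, context, target))
```

In the described procedure the same kind of update is applied to each constituent model in turn, from the end point model back to the start point model.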

If it is determined in step S107 that the model of interest is the start point model, that is, if the initial contexts of all constituent models of the generation model sequence have been updated in steps S101 to S106, working from the end point model toward the start point model, the process proceeds to step S108, where the time-series data generation unit 26 determines whether the condition for ending the update of the initial contexts of the constituent models (the update end condition) is satisfied.

Here, the update end condition in step S108 can be that the prediction errors obtained in steps S101 and S105 have converged to some extent. Specifically, the update end condition can be, for example, that the initial contexts of the constituent models of the generation model sequence have been updated a predetermined number of times, or that the prediction errors obtained in steps S101 and S105 have hardly changed between the previous iteration and the current one.
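
The two example conditions above can be sketched as a small predicate. The function name `update_finished`, the default iteration limit, and the tolerance are illustrative assumptions; the text does not specify concrete values.

```python
def update_finished(errors, max_iterations=100, tol=1e-6):
    """Decide whether to stop updating the initial contexts.

    errors: total prediction error recorded after each backward pass,
            oldest first (a hypothetical bookkeeping list).
    """
    # Condition 1: the update has been repeated a predetermined number of times.
    if len(errors) >= max_iterations:
        return True
    # Condition 2: the error hardly changed between the previous and current pass.
    if len(errors) >= 2 and abs(errors[-1] - errors[-2]) < tol:
        return True
    return False
```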

If it is determined in step S108 that the update end condition is not satisfied, the process returns to step S91 of FIG. 16, where the time-series data generation unit 26 gives the start point model the input data set in step S84 (FIG. 15) and the initial context (at this point, the initial context as updated in step S106), generates model generation data, and the same processing is repeated thereafter.

If it is determined in step S108 that the update end condition is satisfied, the time-series data generation unit 26 takes the current initial context of each constituent model as the optimal initial context of that constituent model, and the process proceeds to step S111 of FIG. 18.

That is, FIG. 18 continues from FIG. 17.

In step S111, the time-series data generation unit 26 sets the current data supplied from the current data supply unit 21 (FIG. 1) as the first several samples of the input data of the start point model (as many samples as the current data contains), and the process proceeds to step S112.

In step S112, the time-series data generation unit 26 selects the start point model as the model of interest.

Also in step S112, the time-series data generation unit 26 gives the start point model, as the model of interest, the input data set in step S111 and the optimal initial context of the start point model, generates S samples of model generation data, and the process proceeds to step S113.

In step S113, the time-series data generation unit 26 outputs the S samples of model generation data generated in step S112 to the time-series data output unit 27 (FIG. 1) as (part of) the generated time-series data, and the process proceeds to step S114.

In step S114, the time-series data generation unit 26 newly selects, as the model of interest, the model immediately following the current model of interest.

Also in step S114, the time-series data generation unit 26 sets the L samples forming the last overlap portion of the model generation data generated from the model immediately preceding the model of interest as the first L samples of the input data of the model of interest, and the process proceeds to step S115.

In step S115, the time-series data generation unit 26 gives the model of interest the input data set in step S114 (the L samples forming the last overlap portion of the model generation data generated from the immediately preceding model) and the optimal initial context of the model of interest, generates model generation data, and the process proceeds to step S116.

In step S116, the time-series data generation unit 26 outputs the samples from the (L+1)-th sample onward of the model generation data generated from the model of interest in step S115 to the time-series data output unit 27 (FIG. 1), as generated time-series data that follows the generated time-series data output immediately before, and the process proceeds to step S117.

In step S117, the time-series data generation unit 26 determines whether the model of interest is the end point model. If it is determined in step S117 that the model of interest is not the end point model, the process returns to step S114, and the same processing is repeated.

If it is determined in step S117 that the model of interest is the end point model, that is, if model generation data has been generated from all constituent models of the generation model sequence, the process returns.
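
The way the output is assembled in steps S113 and S116 can be sketched as follows: all S samples of the start point model are output, and for each later model only the samples from the (L+1)-th onward are appended. The function name `stitch` and the sample values are illustrative assumptions.

```python
def stitch(segments, L):
    """Concatenate per-model outputs into one generated time series.

    segments: model generation data of each constituent model, in sequence order.
    L: number of overlapping samples between consecutive models.
    """
    series = list(segments[0])      # all samples of the start point model
    for segment in segments[1:]:
        series.extend(segment[L:])  # skip the first L (overlapping) samples
    return series


print(stitch([[1, 2, 3, 4], [3, 4, 5, 6], [5, 6, 7, 8]], L=2))
```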

As described above, in the learning device 10 (FIG. 1), the teacher data dividing unit 12 divides teacher data, which is time-series data, into a plurality of partially overlapping pieces of data and outputs them as model learning data used to train learning models having an internal state. The learning unit 14 assigns the plurality of pieces of model learning data to a plurality of learning models so that one piece of model learning data is assigned to one learning model, and each learning model learns a time-series pattern using the model learning data assigned to it. Then, for all of the plurality of learning models, the connectivity calculation unit 16 calculates the error between the overlap portion that is the data sequence at the end of the time-series data generated by one learning model and the overlap portion at the beginning of the time-series data generated by another learning model, as the connectivity representing how appropriate it is for the time-series pattern learned by the other learning model to follow the time-series pattern learned by the one learning model.

In the data generation device 20 (FIG. 1), on the other hand, the start point model selection unit 23 selects one of the trained learning models as the start point model, and the end point model selection unit 24 selects another learning model as the end point model. The generation model sequence calculation unit 25 treats a value corresponding to the connectivity as the connection cost of connecting another learning model after one learning model, and obtains, as the generation model sequence, the sequence of learning models from the start point model to the end point model that minimizes the cumulative connection cost. Then, for each learning model constituting the generation model sequence (each constituent model), the time-series data generation unit 26 determines the initial value of the internal state of the learning model so as to reduce the error between the last overlap portion of the time-series data generated by that learning model and the first overlap portion of the time-series data generated by the learning model connected after it, gives that initial value to the learning model, and generates time-series data.

Accordingly, complex, long time-series data can be learned easily, and smooth time-series data can be generated accurately on the basis of the learning result.

That is, the learning device 10 performs learning in which complex (nonlinear, multidimensional) and long dynamics that a single learning model cannot store are divided along the time direction and stored by a plurality of learning models. The data generation device 20 calculates a generation model sequence, which is a sequence of such trained learning models, and generates generated time-series data using the learning models constituting that generation model sequence.

In calculating the generation model sequence, a sequence of learning models is obtained on the basis of the connectivity, which is in effect an evaluation value of how well learning models connect to one another, so that the dynamics stored in the individual learning models are connected as smoothly as possible and along a shorter path from the start point to the end point. The sequence obtained may also be an unexperienced plan, that is, a sequence of learning models other than one that generates generated time-series data corresponding to all or part of the teacher data.

Furthermore, in generating the generated time-series data, model generation data is generated in the forward direction by forward propagation, in which the last overlap portion of the model generation data generated from the immediately preceding model is carried over as the first portion of the input data of the model of interest. Meanwhile, on the basis of the model generation data generated in the forward direction, the prediction error calculated at the end point model is propagated in the reverse direction, that is, toward the learning models on the start point model side, so that the initial contexts of the learning models constituting the generation model sequence are corrected (updated). By repeating this forward and reverse propagation, the initial contexts are corrected so that the model generation data generated from the learning models constituting the generation model sequence connects smoothly, and smooth generated time-series data is generated (reconstructed), even when the generation model sequence is a sequence of learning models other than one that generates generated time-series data corresponding to all or part of the teacher data.

More specifically, in the learning device 10, teacher data, which is time-series data, is divided into a plurality of partially overlapping pieces of model learning data. The plurality of pieces of model learning data are assigned to a plurality of learning models so that one piece of model learning data is assigned to one learning model, and each learning model learns a time-series pattern using the model learning data assigned to it.
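
Dividing teacher data into partially overlapping pieces can be sketched as follows. The function name `split_with_overlap` is hypothetical, and the fixed window length S with overlap L, as well as dropping trailing samples that do not fill a whole window, are assumptions of this sketch.

```python
def split_with_overlap(teacher, S, L):
    """Split a time series into windows of S samples that overlap by L samples.

    Consecutive windows share L samples: the last L samples of one window
    are the first L samples of the next.
    """
    if not 0 <= L < S:
        raise ValueError("the overlap L must satisfy 0 <= L < S")
    stride = S - L
    return [teacher[i:i + S] for i in range(0, len(teacher) - S + 1, stride)]


# Each resulting piece would be assigned to one learning model.
print(split_with_overlap(list(range(8)), S=4, L=2))
```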

Accordingly, because the time-series data is learned (by function approximation learning) by a plurality of learning models that in effect share the work, the limit on the storage capacity for time-series patterns is removed, and a complex, long time-series pattern can be stored as separate short time-series patterns. Furthermore, using learning models that store such short time-series patterns, time-series data of a complex, long time-series pattern can be generated (reconstructed) accurately.

That is, because the length of the time-series pattern that one learning model is responsible for learning is limited, the time-series pattern can be learned (stored) accurately even if the learning model is a small-scale RNN or the like. Furthermore, because the total storage capacity of the plurality of learning models can be increased by adding learning models, a complex, long time-series pattern can be stored regardless of the storage capacity of any single learning model.

In addition, because the learning device 10 obtains the connectivity and the data generation device 20 calculates the generation model sequence on the basis of the connectivity, the generation model sequence, which is the sequence of learning models used to generate time-series data, can be calculated without depending on where in the teacher data the model learning data learned by each learning model is located.

For example, for a mobile robot moving in some environment to carry out a task of moving from its current position to a goal position (a navigation task), it must make a plan of a route from the current position to the goal position based on the experience given as teacher data.

For example, suppose that learning is performed by giving, as teacher data, the sensor-motor data that the mobile robot can acquire at each position along its route when it moves between two arbitrary points in the environment in which it moves (hereinafter also called the movement environment). The mobile robot can then move autonomously along its experience at the time of learning, that is, along a route on which the sensor-motor data used as teacher data can be observed.

That is, when a position on the route traveled during learning is given as the start position at which movement begins, and a position on that route that lies later than the start position in the direction of travel is given as the goal position at which movement ends, the mobile robot can make a plan of a route from the start position to the goal position.

In the movement environment, however, positions on the route traveled during learning are not necessarily given as the start position and the goal position, nor is the goal position necessarily a position that lies after the start position on that route.

That is, when the mobile robot moves autonomously, its current position becomes the start position, but the current position is not necessarily a position on the route traveled during learning.

Furthermore, even if the start position and the goal position are both positions on the route traveled during learning, a position that was passed before the start position during learning may be given as the goal position.

In addition, when the route from the start position to the goal position along the route traveled during learning is redundant and involves an unnecessary detour, it is desirable to make a plan of a route that avoids such a detour.

One conventional method of making a route plan is, for example, to find the traversable regions on a map of the movement environment, generate a graph whose arcs are line segments passing through those regions, and reduce the problem to a route search on that graph.

One method of searching for a route on the graph is to set a cost for each arc and find, among the routes from the start position to the goal position, the route for which the sum of the costs of its arcs is smallest. The distance on the map corresponding to the arc (the distance between the two ends of the arc) is used as the arc cost.

However, obtaining the distance on the map corresponding to an arc requires a map of the movement environment (and hence the coordinates on that map of the two ends of the arc), and when no map is given it is difficult to obtain the distance on the map.

Therefore, to handle the case where no map is given, it is desirable to adopt as the arc cost an index that replaces the distance on the map corresponding to the arc.

The learning device 10 therefore obtains the connectivity, which represents how appropriate it is for the time-series pattern learned by another learning model to follow the time-series pattern learned by one learning model.

In the data generation device 20, the connectivity is adopted as the arc cost, and a graph route search algorithm such as the Viterbi algorithm or Dijkstra's algorithm is used to search for the generation model sequence as the route (shortest route) that minimizes the cumulative cost (connection cost).

That is, in the data generation device 20, a value corresponding to the connectivity is used as the connection cost, and the sequence of learning models from the start point model to the end point model that minimizes the cumulative connection cost is calculated as the generation model sequence.
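
A search of this kind can be sketched with Dijkstra's algorithm, one of the two algorithms the text names. The function name `generation_model_sequence`, the cost-matrix layout, and the example costs are illustrative assumptions; the sketch also assumes non-negative connection costs, as Dijkstra's algorithm requires.

```python
import heapq


def generation_model_sequence(cost, start, goal):
    """Find the model sequence from start to goal with the smallest cumulative cost.

    cost[i][j] is the connection cost of placing model j after model i.
    Returns (sequence of model indices, cumulative cost), or None if the
    goal cannot be reached.
    """
    best = {start: 0.0}
    previous = {}
    heap = [(0.0, start)]
    while heap:
        total, i = heapq.heappop(heap)
        if i == goal:
            break
        if total > best.get(i, float("inf")):
            continue  # stale heap entry
        for j in range(len(cost)):
            if j == i:
                continue
            candidate = total + cost[i][j]
            if candidate < best.get(j, float("inf")):
                best[j] = candidate
                previous[j] = i
                heapq.heappush(heap, (candidate, j))
    if goal not in best:
        return None
    sequence = [goal]
    while sequence[-1] != start:
        sequence.append(previous[sequence[-1]])
    return sequence[::-1], best[goal]


# Hypothetical costs: following the teacher order 0 -> 1 -> 2 -> 3 is cheap.
cost = [[0, 1, 10, 10],
        [10, 0, 1, 10],
        [10, 10, 0, 1],
        [1, 10, 10, 0]]
print(generation_model_sequence(cost, 0, 3))
```

If the cost of placing model 2 directly after model 0 were small, the same search would return a sequence that skips model 1, which corresponds to a plan not experienced during learning.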

The connectivity used to calculate the generation model sequence is the error between the last overlap portion of the time-series data generated by one of two learning models, each trained with model learning data obtained by dividing the teacher data into partially overlapping pieces, and the first overlap portion of the time-series data generated by the other learning model. It does not depend on whether the model learning data used to train the one learning model and the model learning data used to train the other were contiguous in the teacher data. (However, if in the teacher data the model learning data used to train the other learning model directly follows the model learning data used to train the one learning model, the connectivity of the model pair having the one learning model as the front model and the other as the rear model is small.)

That is, even if in the teacher data the model learning data used to train the other learning model does not follow the model learning data used to train the one learning model, as long as the last overlap portion of the time-series data (or its time-series pattern) generated by the one learning model is similar to the first overlap portion of the time-series data generated by the other learning model, the connectivity of the model pair having the one learning model as the front model and the other as the rear model takes a small value, indicating that it is appropriate to connect the rear model after the front model.
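
The connectivity of a model pair can be sketched as follows. The text only says that it is an error between two overlap portions; the use of the mean squared error, the function name `connectivity`, and the sample values are assumptions of this sketch.

```python
def connectivity(front_output, rear_output, L):
    """Error between the last L samples of the front model's output and
    the first L samples of the rear model's output (mean squared error here).

    A small value means the rear model connects well after the front model.
    """
    tail = front_output[-L:]
    head = rear_output[:L]
    return sum((a - b) ** 2 for a, b in zip(tail, head)) / L


front = [0.0, 1.0, 2.0, 3.0]
print(connectivity(front, [2.0, 3.0, 4.0, 5.0], L=2))  # overlap portions agree
print(connectivity(front, [0.0, 1.0, 2.0, 3.0], L=2))  # overlap portions differ
```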

As a result, the sequence of learning models calculated as the generation model sequence on the basis of the connectivity does not depend on the order, within the teacher data, of the model learning data used to train the learning models.

What each learning model stores is the time-series pattern of its model learning data, which is in effect a fragment of the time-series pattern of the teacher data. The data generation device 20 can reuse these fragments to calculate a generation model sequence with a small cumulative connection cost.

すなわち、学習時には経験していない、例えば、スタート位置からゴール位置まで移動するのに、不必要に遠回りをしない経路に相当する生成用モデルシーケンスを算出することができる。また、例えば、学習時に経験した経路とは逆方向に移動する経路が、接続コストの累積値を小さくする経路であるのであれば、そのような経路に相当する生成用モデルシーケンスを算出することができる。 That is, it is possible to calculate a generation model sequence that is not experienced during learning, for example, corresponding to a route that does not travel unnecessarily when moving from the start position to the goal position. In addition, for example, if a route moving in the opposite direction to the route experienced during learning is a route that reduces the cumulative value of connection costs, a generation model sequence corresponding to such a route can be calculated. it can.

Furthermore, in the data processing device 20, the arrangement of learning models forming the generation model sequence is computed from connectivity. This guarantees that it is appropriate to connect the model generation data generated by the immediately following constituent model #k+1 after the model generation data generated by a constituent model #k of the sequence (that is, the waveforms at the junction are similar).

However, computing the generation model sequence from connectivity does not by itself guarantee that the junction is smooth when the model generation data generated by constituent model #k+1 is connected after the model generation data generated by constituent model #k.

That is, when the order of the constituent models in the generation model sequence computed from connectivity matches the order, in the teacher data, of the model learning data used to train them, the junction is smooth when the model generation data generated by constituent model #k+1 is connected after that generated by constituent model #k.

However, when the order of the constituent models (learning models) in the generation model sequence computed from connectivity does not match the order, in the teacher data, of the model learning data used to train them, the junction is not necessarily smooth.

Here, when a learning model stores the model learning data as a template as is, or stores it by function approximation without an adjustable internal state, it can generate only the time-series data (model generation data) exactly as stored.

For this reason, when model generation data generated from a plurality of such learning models is connected, the junction is not necessarily smooth.

On the other hand, the data generation device 20 employs, as the learning model, an RNN, which can store time-evolving dynamics in the form of a function approximation and has a context as its internal state. Furthermore, the data generation device 20 determines the initial context of each RNN serving as a constituent model so as to reduce the error between the last overlap portion of the model generation data generated by constituent model #k and the first overlap portion of the model generation data generated by constituent model #k+1, which is connected after it. It then gives that initial context (the optimum initial context) to the constituent model and generates the time-series data.

Therefore, when the model generation data generated by constituent model #k+1 is connected after that generated by constituent model #k, the junction can be made smooth, and as a result smooth generated time-series data can be produced.
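
The idea of choosing an initial context that reduces the overlap error can be illustrated with a toy example. Everything here is an assumption made for illustration: the one-unit RNN with fixed weights (`w_c`, `b`, `w_y`), the function names, and the use of a gradient-free ternary search in place of the forward and back propagation that the device itself uses.

```python
import math


def generate(a, steps, w_c=1.5, b=0.1, w_y=2.0):
    """Toy one-unit RNN: context c_0 = a, c_{t+1} = tanh(w_c*c_t + b),
    output y_t = w_y * c_t. The weights are arbitrary illustrative values."""
    c, out = a, []
    for _ in range(steps):
        out.append(w_y * c)
        c = math.tanh(w_c * c + b)
    return out


def overlap_error(a, target_tail):
    """Squared error between the first samples generated from initial
    context `a` and the last overlap portion of the preceding model."""
    head = generate(a, len(target_tail))
    return sum((h - t) ** 2 for h, t in zip(head, target_tail))


def best_initial_context(target_tail, lo=-1.0, hi=1.0, iters=100):
    """Ternary search over a scalar initial context. It relies on the error
    having a single minimum in [lo, hi], which holds for this toy model
    because every output is monotonically increasing in `a`."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if overlap_error(m1, target_tail) < overlap_error(m2, target_tail):
            hi = m2
        else:
            lo = m1
    return (lo + hi) / 2


# Pretend this is the last overlap portion produced by constituent model #k.
target_tail = generate(0.3, 10)
a_opt = best_initial_context(target_tail)
```

A model that can only replay a stored template has no counterpart to `a` to adjust, which is why its junctions cannot be smoothed in this way.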

[Generated time-series data produced by the data generation device 20]
FIG. 19 shows time-series data serving as teacher data and generated time-series data produced using learning models trained on that time-series data.

図１９Ａは、教師データとしての経路（以下、教示経路ともいう）を模式的に示している。 FIG. 19A schematically shows a route (hereinafter also referred to as a teaching route) as teacher data.

The teaching route is one of the routes from position P₁ to position P₂ and, in FIG. 19A, is divided into model learning data consisting of seven routes Q₁, Q₂, Q₃, Q₄, Q₅, Q₆, and Q₇. During learning, route Q_n is learned by learning model #n.

なお、図１９では、オーバラップ部分の図示は省略してある。 In FIG. 19, the overlap portion is not shown.

Learning model #n, which is an RNN, can be regarded as a function approximator that approximates a time evolution equation F(x,a) with a parameter a. Accordingly, the learning model #n that has learned route Q_n is hereinafter also written F_n(x,a_n).

Here, the argument x of the time evolution equation F(x,a) represents the input data, and the parameter a represents the initial value of the internal state (the initial context).

In FIG. 19A, the parameter a_n of learning model F_n(x,a_n) represents, for example, the initial value of the internal state with which F_n(x,a_n) can generate model generation data that matches the learned route Q_n as closely as possible.

FIG. 19B schematically shows a route (hereinafter also referred to as a generated route) as the generated time-series data produced by the data generation processing of the data generation device 20 using learning models F₁(x,a) through F₇(x,a).

In FIG. 19B, the generated route runs from position P₁ to position P₂ but differs from the teaching route of FIG. 19A.

That is, the generated route is formed by connecting the model generation data for five routes Q'₁, Q'₂, Q'₃, Q'₆, and Q'₇ in that order.

In FIG. 19B, the data generation device 20 obtains, as the generation model sequence, the arrangement of learning models F₁(x,a), F₂(x,a), F₃(x,a), F₆(x,a), and F₇(x,a), that is, the seven learning models F₁(x,a) through F₇(x,a) with F₄(x,a) and F₅(x,a), which generate a redundant route, excluded.

Furthermore, in producing the generated route, the data generation device 20 repeats the forward propagation and back propagation of the overlap portions of the model generation data described with reference to FIG. 12 and elsewhere. This yields the parameter a that smoothly connects the overlap portions of the model generation data generated from each of the learning models F₁(x,a), F₂(x,a), F₃(x,a), F₆(x,a), and F₇(x,a) constituting the generation model sequence.

In FIG. 19B, the parameter a that smoothly connects the overlap portions is obtained as the value a₁ for learning model F₁(x,a), a₂ for F₂(x,a), a'₃ for F₃(x,a), a'₆ for F₆(x,a), and a₇ for F₇(x,a).

Then route Q'₁ is generated from learning model F₁(x,a₁), route Q'₂ from F₂(x,a₂), route Q'₃ from F₃(x,a'₃), route Q'₆ from F₆(x,a'₆), and route Q'₇ from F₇(x,a₇), each as model generation data.

In FIG. 19B, the routes Q'₁, Q'₂, and Q'₇ generated from learning models F₁(x,a₁), F₂(x,a₂), and F₇(x,a₇), whose parameter a is the same as in FIG. 19A, coincide with the corresponding routes Q₁, Q₂, and Q₇ of FIG. 19A, respectively.

On the other hand, in FIG. 19B, the route Q'₃ generated from learning model F₃(x,a'₃), whose parameter a differs from that in FIG. 19A, differs from the corresponding route Q₃ of FIG. 19A.

That is, route Q₃ of FIG. 19A connects smoothly to route Q₂ on its start side (the side closer to position P₁) and to route Q₄ on its end side (the side closer to position P₂).

In contrast, route Q'₃ of FIG. 19B agrees with route Q₃ in that its start side connects smoothly to Q'₂, which is identical to route Q₂, but differs from route Q₃ in that its end side connects smoothly to route Q'₆.

Further, in FIG. 19B, the route Q'₆ generated from learning model F₆(x,a'₆), whose parameter a differs from that in FIG. 19A, differs from the corresponding route Q₆ of FIG. 19A.

That is, route Q₆ of FIG. 19A connects smoothly to route Q₅ on its start side and to route Q₇ on its end side.

In contrast, route Q'₆ of FIG. 19B agrees with route Q₆ in that its end side connects smoothly to Q'₇, which is identical to route Q₇, but differs from route Q₆ in that its start side connects smoothly to route Q'₃.

以上のようにして、データ生成装置２０では、冗長な経路が除外され、かつ滑らかに接続する生成経路が生成される。 As described above, in the data generation device 20, redundant paths are excluded, and generation paths that are smoothly connected are generated.

[Simulation results]
Next, a simulation that the present inventor performed on the data processing apparatus of FIG. 1 will be described.

シミュレーションでは、移動ロボットに、ナビゲーションタスクを行わせた。 In the simulation, the mobile robot performed navigation tasks.

図２０は、移動ロボットがナビゲーションタスクを行う移動環境の概要を示している。 FIG. 20 shows an outline of a mobile environment in which a mobile robot performs a navigation task.

移動環境としては、光源が設置され、四方が壁で囲まれた２次元平面を採用した。移動ロボットは、移動環境を自由に移動することができるが、壁をすり抜けて移動することはできない。なお、移動環境には、四方を囲む壁の他にも、障害物となる壁が存在する。 As a moving environment, a two-dimensional plane in which light sources were installed and four sides were surrounded by walls was adopted. A mobile robot can move freely in a moving environment, but cannot move through a wall. In the mobile environment, there are walls that become obstacles in addition to the walls that surround the four sides.

The mobile robot was equipped with a distance sensor that senses, for each of the eight directions around the robot, the distance to a wall (including both the walls enclosing the moving environment and the walls serving as obstacles within it), an optical sensor that senses light intensity, and an energy sensor that senses energy. Here, the energy is a physical quantity proportional to the maximum of the light intensities output by the optical sensor for the eight directions.

When given, as motor data, a movement vector (m_x,m_y) representing a movement amount m_x in the horizontal direction (x direction) and a movement amount m_y in the vertical direction (y direction), the mobile robot moves by that movement vector (m_x,m_y).

The simulation used the mobile robot described above. The sensor motor data serving as teacher data, current data, and target data was the 19-dimensional vector (m_x,m_y,d₁,d₂,d₃,d₄,d₅,d₆,d₇,d₈,l₁,l₂,l₃,l₄,l₅,l₆,l₇,l₈,E). Its components are the movement vector (m_x,m_y) as motor data and, as sensor data, the distances d₁,d₂,d₃,d₄,d₅,d₆,d₇,d₈ output by the distance sensor for the eight directions, the light intensities l₁,l₂,l₃,l₄,l₅,l₆,l₇,l₈ output by the optical sensor for the eight directions, and the energy E output by the energy sensor.
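
As an illustration of how one sample of sensor motor data is laid out, the sketch below assembles the 19 components in the order given above. The function name and the proportionality constant `energy_gain` are hypothetical; the text says only that the energy is proportional to the maximum light intensity.

```python
def sensor_motor_vector(move, distances, lights, energy_gain=1.0):
    """Build the 19-dimensional sample (m_x, m_y, d1..d8, l1..l8, E).

    `energy_gain` is an assumed proportionality constant; E is taken to be
    energy_gain times the largest of the eight light intensities.
    """
    if len(move) != 2 or len(distances) != 8 or len(lights) != 8:
        raise ValueError("expected 2 motor, 8 distance and 8 light values")
    energy = energy_gain * max(lights)
    return list(move) + list(distances) + list(lights) + [energy]


sample = sensor_motor_vector(
    move=(0.5, -0.25),
    distances=[1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0],
    lights=[0.1, 0.9, 0.3, 0.2, 0.0, 0.4, 0.5, 0.6],
)
```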

Sensor motor data is observed from the mobile robot, including when a person moves the robot manually.

図２１は、シミュレーションで採用した移動環境を示す平面図である。 FIG. 21 is a plan view showing a moving environment adopted in the simulation.

移動環境は、四方が壁で囲まれた２次元平面であり、移動環境内には、幾つかの光源と、幾つかの障害物としての壁が存在する。 The moving environment is a two-dimensional plane surrounded by walls on all sides, and there are several light sources and several obstacle walls in the moving environment.

学習処理においては、移動環境内において、移動ロボットを、15010回だけ移動させることで得られる15010サンプルのセンサモータデータの時系列を教師データとした。 In the learning process, the time series of sensor motor data of 15010 samples obtained by moving the mobile robot only 15010 times in the mobile environment was used as teacher data.

The teacher data was divided into pieces of model learning data of 40 samples each, with 10 samples overlapping at the front and at the back, and the resulting 500 pieces of model learning data were learned by 500 RNNs serving as learning models, one piece per RNN.
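
The figures above are consistent with a sliding window of 40 samples advanced by 30 samples each time: (15010 - 40) / 30 + 1 = 500. A minimal sketch of such a split follows; the function name and the use of integer indices in place of real sensor motor samples are assumptions for illustration.

```python
def split_with_overlap(series, length=40, overlap=10):
    """Cut `series` into windows of `length` samples in which consecutive
    windows share `overlap` samples."""
    stride = length - overlap
    last_start = len(series) - length
    return [series[s:s + length] for s in range(0, last_start + 1, stride)]


teacher_data = list(range(15010))  # stand-in for 15010 sensor motor samples
pieces = split_with_overlap(teacher_data)
```

Here `pieces[n - 1]` plays the role of the model learning data assigned to learning model #n.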

図２１において、３桁の数字は、500個の学習モデルを特定するためのモデルIDである。また、モデルIDが付されている線（実線や、点線、太線、細線等）は、そのモデルIDの学習モデルが学習したモデル学習用データが観測されたときの、移動ロボットの移動軌跡を表している。 In FIG. 21, a three-digit number is a model ID for identifying 500 learning models. A line with a model ID (solid line, dotted line, thick line, thin line, etc.) represents the movement trajectory of the mobile robot when the model learning data learned by the learning model with that model ID is observed. ing.

ここで、あるモデル学習用データが観測されたときの、移動ロボットの移動軌跡を、モデル学習用データに対応する移動軌跡ともいう。また、モデルIDが３桁の数字nの学習モデルを、学習モデル#nとも記載する。 Here, the movement trajectory of the mobile robot when certain model learning data is observed is also referred to as a movement trajectory corresponding to the model learning data. Also, a learning model with a model ID of a three-digit number n is also described as a learning model #n.

図２１では、個々の学習モデルが学習したモデル学習用データを分かりやすくするために、モデル学習用データを学習する学習モデルが切り替わるごとに、モデル学習用データに対応する移動軌跡を、その移動軌跡を表す線の種類（実線や、点線、太線、細線等）を変えて図示してある。 In FIG. 21, in order to make the model learning data learned by each learning model easy to understand, each time the learning model for learning the model learning data is switched, the movement locus corresponding to the model learning data is changed to the movement locus. The line types (solid line, dotted line, thick line, thin line, etc.) that represent are changed.

In FIG. 21, to keep the figure from becoming cluttered, only the movement trajectories corresponding to the model learning data learned by the first 80 learning models #001 through #080 of the 500 learning models #001 through #500 are shown.

図２２は、学習処理により得られたコネクティビティを表した移動環境を示している。 FIG. 22 shows a mobile environment representing connectivity obtained by learning processing.

図２２において、丸印（○印）は、学習モデルが学習したモデル学習用データに対応する移動軌跡上のある位置を表し、そのモデル学習用データを学習した学習モデルに対応する。 In FIG. 22, a circle (circle) represents a certain position on the movement trajectory corresponding to the model learning data learned by the learning model, and corresponds to the learning model learned from the model learning data.

In FIG. 22, the connectivity obtained for the 500×499 model pairs formed from the 500 learning models #001 through #500 is thresholded to extract only the connectivity that indicates a good connection (connectivity whose value is at or below the threshold), and the two learning models forming each such model pair are joined by a dotted line.
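
The thresholding step can be sketched as below. The matrix layout `conn[i][j]` (front model i, rear model j) and the function name are assumptions for illustration. Because the pairs are ordered, N models give N×(N-1) pairs, which is 500×499 = 249500 in the simulation.

```python
def high_connectivity_pairs(conn, threshold):
    """Return the ordered model pairs (front, rear) whose connectivity is at
    or below `threshold`. `conn[i][j]` is the connectivity of the pair with
    model i as the front model and model j as the rear model."""
    n = len(conn)
    return [
        (i, j)
        for i in range(n)
        for j in range(n)
        if i != j and conn[i][j] <= threshold
    ]


# Toy 3-model connectivity matrix (diagonal entries are unused).
conn = [
    [0.0, 0.1, 0.9],
    [0.8, 0.0, 0.2],
    [0.05, 0.7, 0.0],
]
edges = high_connectivity_pairs(conn, threshold=0.2)
```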

The mobile robot does not know the state of the moving environment, such as the arrangement of walls, yet no connectivity value gives rise to a connection between learning models that would make the robot move across a wall.

Moreover, the graph in which the two learning models of each well-connected model pair are joined by dotted lines expresses (embeds) the state (structure) of the moving environment.

図２３は、ビタビアルゴリズムに基づき、コネクティビティを用いて算出された生成用モデルシーケンスを表した移動環境を示している。 FIG. 23 shows a mobile environment representing a generation model sequence calculated using connectivity based on the Viterbi algorithm.

図２３において、丸印（○印）は、図２２と同様に、学習モデルが学習したモデル学習用データに対応する移動軌跡上のある位置を表し、そのモデル学習用データを学習した学習モデルに対応する。 In FIG. 23, as in FIG. 22, circles (◯) represent a certain position on the movement trajectory corresponding to the model learning data learned by the learning model, and the learning model obtained by learning the model learning data. Correspond.

In FIG. 23, the learning models (constituent models) constituting the generation model sequence are joined by line segments.

図２４は、生成用モデルシーケンスを構成する学習モデルが学習したモデル学習用データに対応する移動軌跡を表した移動環境を示している。 FIG. 24 shows a movement environment representing a movement locus corresponding to the model learning data learned by the learning model constituting the generation model sequence.

図２４において、１桁の数字c、コロン(:)、及び、３桁の数字nからなる記号列c:nは、生成用モデルシーケンスを構成する学習モデルの順番と、その学習モデルのモデルIDを表す。 In FIG. 24, a symbol string c: n consisting of a one-digit number c, a colon (:), and a three-digit number n indicates the order of learning models constituting the generation model sequence and the model ID of the learning model. Represents.

すなわち、記号列c:nにおいて、１桁の数字cは、学習モデルが、生成用モデルシーケンスの先頭からc番目の学習モデルであることを表し、その後のコロンに続く３桁の数字nは、生成用モデルシーケンスの先頭からc番目の学習モデルが、学習モデル#nであることを表す。 That is, in the symbol string c: n, a one-digit number c indicates that the learning model is the c-th learning model from the beginning of the generation model sequence, and the three-digit number n following the colon after that is This represents that the c-th learning model from the top of the generation model sequence is learning model #n.

また、記号列c:nが付されている線は、学習モデル#nが学習したモデル学習用データに対応する移動軌跡を表している。 Also, the line with the symbol string c: n represents a movement locus corresponding to the model learning data learned by the learning model #n.

なお、図２４でも、図２１と同様に、個々の学習モデルが学習したモデル学習用データを分かりやすくするために、モデル学習用データを学習する学習モデルが切り替わるごとに、モデル学習用データに対応する移動軌跡を、その移動軌跡を表す線の種類を変えて図示してある。 In FIG. 24, as in FIG. 21, in order to make the model learning data learned by each learning model easy to understand, each time the learning model for learning the model learning data is switched, the model learning data is handled. The movement trajectory is shown by changing the type of line representing the movement trajectory.

In the simulation, the generation model sequence was computed with learning model #049 as the start point model and learning model #277 as the end point model, and the arrangement of learning models #049, #236, #209, #274, #275, #276, #277 was obtained as the generation model sequence.

According to the experience given by the teacher data, reaching the end point model, learning model #277, from the start point model, learning model #049, requires passing through the 227 learning models #050 through #276.

However, when the generation model sequence is computed using connectivity, learning models whose overlap portions are similar, and whose connectivity therefore indicates a good connection, are connected so as to reduce the cumulative connection cost. As a result, a generation model sequence can be obtained that contains connections (orderings of two learning models) never experienced in the teacher data.
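
The effect of minimizing the cumulative connection cost can be illustrated with a small shortest-path search. Note the substitution: the simulation uses the Viterbi algorithm, whereas this sketch uses Dijkstra's algorithm as a stand-in that also finds a minimum-cumulative-cost sequence when costs are non-negative. The cost matrix, the function name, and the toy values are assumptions.

```python
import heapq

INF = float("inf")


def min_cost_sequence(cost, start, goal):
    """Find the model sequence from `start` to `goal` with the smallest
    cumulative connection cost. `cost[u][v]` is the non-negative cost of
    connecting model v after model u (INF if they cannot be connected)."""
    n = len(cost)
    best = [INF] * n
    prev = [None] * n
    best[start] = 0.0
    heap = [(0.0, start)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > best[u]:
            continue  # stale queue entry
        if u == goal:
            break
        for v in range(n):
            if v == u or cost[u][v] == INF:
                continue
            nd = d + cost[u][v]
            if nd < best[v]:
                best[v], prev[v] = nd, u
                heapq.heappush(heap, (nd, v))
    if best[goal] == INF:
        return None, INF
    path = [goal]
    while path[-1] != start:
        path.append(prev[path[-1]])
    path.reverse()
    return path, best[goal]


# Five models taught in the order 0,1,2,3,4 (cost 1.0 per taught step),
# plus one well-matching connection 0 -> 3 that was never taught.
cost = [[INF] * 5 for _ in range(5)]
for k in range(4):
    cost[k][k + 1] = 1.0
cost[0][3] = 1.5
sequence, total = min_cost_sequence(cost, start=0, goal=4)
```

The search skips the taught detour through models 1 and 2, in the same way that the simulation's sequence skips most of the models between #049 and #277.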

That is, the mobile robot has not experienced, in the teacher data, the ordering of learning models #049, #236, the ordering #236, #209, or the ordering #209, #274, yet in FIG. 24 a generation model sequence containing these orderings is obtained.

Here, among these orderings never experienced in the teacher data, for the model pairs formed by the ordering #049, #236 and the ordering #209, #274, the positions in the teacher data of the model learning data learned by the front model and the rear model of the pair are in time-series order. In contrast, for the model pair formed by the ordering #236, #209, the positions in the teacher data of the model learning data learned by the front model and the rear model are in the reverse of time-series order.

以上のように、生成用モデルシーケンスの算出では、学習モデルが、接続コストの累積値を小さくするように接続されるので、学習に用いられたモデル学習用データの、教師データ上の位置が、時系列順の逆順になっている学習モデルの並び等を含む、教師データによる経験がない学習モデルの並びで構成される生成用モデルシーケンスが得られる。 As described above, in the calculation of the generation model sequence, since the learning model is connected so as to reduce the cumulative value of the connection cost, the position of the model learning data used for learning on the teacher data is A generation model sequence composed of a sequence of learning models that have no experience with teacher data, including a sequence of learning models that are in reverse order of time series, is obtained.

As a result, the generation model sequence is an arrangement of learning models that yields a route shorter than the one along which the teacher data was observed, as a route that efficiently reaches the position corresponding to the end point model, learning model #277 (where the target data is observed), from the position corresponding to the start point model, learning model #049 (where the current data is observed).

図２５は、生成用モデルシーケンスから生成された生成時系列データが観測されるように、移動ロボットを移動した移動軌跡を表した移動環境を示している。 FIG. 25 shows a movement environment representing a movement locus of the movement of the mobile robot so that generation time series data generated from the generation model sequence is observed.

図２５において、１桁の数字c、コロン(:)、及び、３桁の数字nからなる記号列c:nは、図２４と同様に、生成用モデルシーケンスを構成する学習モデルの順番と、その学習モデルのモデルIDを表す。 In FIG. 25, a symbol string c: n consisting of a single-digit number c, a colon (:), and a three-digit number n is similar to FIG. Indicates the model ID of the learning model.

また、図２５において、○印は、移動ロボットが移動を開始したスタート位置を表す。 Further, in FIG. 25, a circle mark represents a start position where the mobile robot starts moving.

As described with reference to FIG. 24, the orderings of learning models #049, #236; #236, #209; and #209, #274 in the generation model sequence were not experienced in the teacher data. Nevertheless, smoothly connected movement trajectories are drawn at the switch from learning model #049 to #236, from #236 to #209, and from #209 to #274, which confirms that smooth generated time-series data is produced.

[Sharing of model parameters]
In the embodiment described above, the plural pieces of model learning data obtained by dividing the teacher data were each learned by a separate learning model, and independent model parameters were obtained for each learning model. However, model parameters can be shared among the plural learning models.

以下、モデルパラメータの共有について説明する。 Hereinafter, sharing of model parameters will be described.

図２６は、モデルパラメータの共有を行う学習装置の一実施の形態の構成例を示している。 FIG. 26 illustrates a configuration example of an embodiment of a learning apparatus that shares model parameters.

In FIG. 26, the learning apparatus comprises a plurality of (N) learning modules 210₁ through 210_N and a model parameter sharing unit 220.

Learning module 210_i (i=1,2,...,N) comprises a pattern input unit 211_i, a model learning unit 212_i, and a model storage unit 213_i. It trains a learning model using model learning data and obtains the model parameters of the learning model.

That is, the pattern input unit 211_i is supplied with the model learning data used to train the learning model stored in the model storage unit 213_i.

The pattern input unit 211_i supplies the model learning data supplied to it to the model learning unit 212_i.

The model learning unit 212_i uses the model learning data from the pattern input unit 211_i to train the learning model stored in the model storage unit 213_i and obtains the model parameters of that learning model.

The model storage unit 213_i stores a learning model that learns a pattern and is defined by model parameters. That is, the model storage unit 213_i stores the model parameters of the learning model.

Here, the substance of the learning model stored in the model storage unit 213_i is the model parameters of that learning model.

The model parameter sharing unit 220 performs sharing processing that causes two or more of the N learning modules 210₁ through 210_N to share model parameters. As a result of this sharing processing, two or more of the N learning modules 210₁ through 210_N share model parameters.

In the following, to simplify the description, the model parameter sharing unit 220 is assumed to perform sharing processing that causes all of the N learning modules 210₁ through 210_N to share model parameters.

Next, the training of the learning models performed by the learning apparatus of FIG. 26 will be described with reference to the flowchart of FIG. 27.

In step S211, the model learning unit 212_i of learning module 210_i initializes the model parameters stored in the model storage unit 213_i, for example with random numbers, and the processing proceeds to step S212.

In step S212, learning module 210_i waits for the model learning data it is to learn to be supplied (input) and then uses that data to train the learning model, that is, to update the model parameters of the learning model, taking the model parameters stored in the model storage unit 213_i as initial values.

That is, in step S212, in learning module 210_i, the pattern input unit 211_i supplies the model learning data supplied to learning module 210_i to the model learning unit 212_i.

Further, in step S212, the model learning unit 212_i uses the model learning data from the pattern input unit 211_i to update the model parameters of the learning model stored in the model storage unit 213_i, and updates (overwrites) the contents of the model storage unit 213_i with the new model parameters obtained by that update.

ここで、ステップＳ２１１及びＳ２１２の処理は、N個の学習モジュール２１０₁ないし２１０_Nのすべてで行われる。 Here, the processes of steps S211 and S212 are performed by all of the _N learning modules 210 ₁ to 210 _N.

After step S212, the process proceeds to step S213, where the model parameter sharing unit 220 performs a sharing process that causes all of the N learning modules 210_1 to 210_N to share model parameters.

That is, the learning module 210_i has a plurality of model parameters (is defined by a plurality of model parameters). Focusing on, for example, the m-th of these model parameters, the model parameter sharing unit 220 corrects the m-th model parameter of the learning module 210_1 based on the m-th model parameters of each of the N learning modules 210_1 to 210_N.

Further, the model parameter sharing unit 220 corrects the m-th model parameter of the learning module 210_2 based on the m-th model parameters of each of the N learning modules 210_1 to 210_N, and likewise corrects the m-th model parameter of each of the learning modules 210_3 to 210_N.

As described above, the model parameter sharing unit 220 corrects the m-th model parameter of the learning module 210_i based on the m-th model parameters of each of the N learning modules 210_1 to 210_N. As a result, each of the m-th model parameters of the N learning modules 210_1 to 210_N is influenced by all of the m-th model parameters of those N learning modules (that is, all of the m-th model parameters are made to influence each of them).

Making all of the model parameters of a plurality of learning modules influence each of those model parameters in this way (each model parameter being influenced by all of them) is what is meant by the sharing of model parameters by a plurality of learning modules.

In step S213, the model parameter sharing unit 220 performs the sharing process on all of the plurality of model parameters stored in the model storage unit 213_i of the learning module 210_i, and updates the stored contents of the model storage units 213_1 to 213_N with the model parameters obtained by the sharing process.

After step S213, the process proceeds to step S214, where the learning device of FIG. 26 determines whether the learning end condition is satisfied.

Here, the learning end condition in step S214 may be, for example, that the number of learning iterations, that is, the number of times steps S212 and S213 have been repeated, has reached a predetermined number. Alternatively, when the true value of the output data that should be output for the model learning data (the output data for the input data when the model learning data is used as input data) is known, the end condition may be that the error of the output data produced by the learning model for the model learning data, relative to the true value, is at or below a predetermined value.

If it is determined in step S214 that the learning end condition is not satisfied, the process returns to step S212, and the same processing is repeated.

If it is determined in step S214 that the learning end condition is satisfied, the process ends.
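
The flow of steps S211 to S214 can be sketched as follows. This is only an illustrative sketch under stated assumptions: the names train_with_sharing, toy_update, and toy_share, the learning rate, and the simple averaging rule used for sharing are hypothetical and do not come from this specification. The end condition used is the fixed repetition count mentioned above.

```python
import random


def train_with_sharing(datasets, update_fn, share_fn, n_params,
                       max_iters=10, seed=0):
    """Run steps S211 to S214 for len(datasets) learning modules."""
    rng = random.Random(seed)
    # S211: initialize every module's model parameters with random numbers.
    params = [[rng.uniform(-1.0, 1.0) for _ in range(n_params)]
              for _ in datasets]
    iters = 0
    while True:
        # S212: each module updates its own parameters from its own data.
        params = [update_fn(p, d) for p, d in zip(params, datasets)]
        # S213: the sharing process lets every module influence every other.
        params = share_fn(params)
        iters += 1
        # S214: end condition (assumed here: a fixed number of repetitions).
        if iters >= max_iters:
            return params, iters


def toy_update(p, data, lr=0.5):
    """Hypothetical stand-in for model learning: move toward the data mean."""
    target = sum(data) / len(data)
    return [w + lr * (target - w) for w in p]


def toy_share(params, alpha=0.5):
    """Hypothetical sharing rule: pull each parameter toward the module mean."""
    n = len(params)
    means = [sum(p[m] for p in params) / n for m in range(len(params[0]))]
    return [[w + alpha * (means[m] - w) for m, w in enumerate(p)]
            for p in params]


params, iters = train_with_sharing([[1.0, 1.0], [3.0, 3.0]],
                                   toy_update, toy_share,
                                   n_params=2, max_iters=5)
print(iters, [[round(w, 3) for w in p] for p in params])
```

In this sketch the per-module update and the sharing step are passed in as functions, so the loop itself only expresses the order of steps S212, S213, and S214.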

Next, FIG. 28 shows a configuration example of the learning device of FIG. 26 when an RNNPB (RNN with Parametric Bias) is adopted as the learning model.

Note that FIG. 28 omits the pattern input unit 211_i and the model learning unit 212_i of the learning module 210_i.

The model storage unit 213_i stores an RNNPB (the plurality of model parameters that define it). Hereinafter, the RNNPB stored in the model storage unit 213_i is also referred to as RNNPB#i where appropriate.

In FIG. 28, the RNNPB consists of three layers: an input layer, a hidden layer (intermediate layer), and an output layer. Each of these layers consists of an arbitrary number of units corresponding to neurons.

In the RNNPB, the model learning data x_t is input (supplied) as input data to the input units, which are some of the units of the input layer.

A PB (Parametric Bias) is input to the PB units, which are some of the units of the input layer other than the input units that receive the model learning data x_t. With the PB, even when the same model learning data x_t is input to an RNNPB in the same state, different output data x*_{t+1} can be obtained by changing the PB.

The context units are the remaining units of the input layer (those that are neither input units nor PB units). Output data from some of the units of the output layer is fed back to the context units as the context representing the internal state.

Here, the PB and the context that are input to the PB units and the context units of the input layer when the model learning data x_t of time t is input to the input units are denoted PB_t and c_t, respectively.

The units of the hidden layer perform a weighted addition, using predetermined weights, of the model learning data x_t, the PB_t, and the context c_t input to the input layer, compute a nonlinear function that takes the result of the weighted addition as its argument, and output the result to the units of the output layer.

As described above, some of the units of the output layer output the output data that becomes the context c_{t+1} of the next time t+1, which is fed back to the input layer. The remaining units of the output layer output, for example, the predicted value x*_{t+1} of the model learning data x_{t+1} of the time t+1 following the model learning data x_t, as the output data for the model learning data x_t given as input data.
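
A single forward step of such a network can be sketched as follows. The sketch is illustrative only and rests on assumptions not stated in this specification: the function name rnnpb_step, the use of tanh as the nonlinear function of the hidden layer, a linear output layer, and the concrete weight values are all hypothetical.

```python
import math


def rnnpb_step(x_t, pb_t, c_t, w_hidden, w_output, n_pred):
    """Return (predicted x_{t+1}, context c_{t+1}) for one time step.

    w_hidden: one weight row per hidden unit over [x_t, PB_t, c_t].
    w_output: one weight row per output unit over the hidden units;
              the first n_pred rows give the prediction, the rest the context.
    """
    inputs = list(x_t) + list(pb_t) + list(c_t)
    # Hidden layer: weighted addition followed by a nonlinear function
    # (tanh is an assumption; the text only says "nonlinear function").
    hidden = [math.tanh(sum(w * v for w, v in zip(row, inputs)))
              for row in w_hidden]
    # Output layer: weighted addition of the hidden outputs (assumed linear).
    outputs = [sum(w * h for w, h in zip(row, hidden)) for row in w_output]
    return outputs[:n_pred], outputs[n_pred:]


# Hypothetical weights: 1 input unit, 1 PB unit, 1 context unit, 2 hidden units.
W_HIDDEN = [[0.5, 1.0, -0.3], [0.2, -0.7, 0.4]]
W_OUTPUT = [[1.0, 0.5], [0.3, -0.2]]

pred_a, ctx_a = rnnpb_step([0.1], [0.0], [0.0], W_HIDDEN, W_OUTPUT, n_pred=1)
pred_b, ctx_b = rnnpb_step([0.1], [1.0], [0.0], W_HIDDEN, W_OUTPUT, n_pred=1)
print(pred_a, pred_b)  # same x_t and context, different PB
```

The two calls use the same input and context and differ only in the PB value, which illustrates how changing the PB changes the predicted output.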

Here, in the RNNPB, the inputs to a unit are subjected to weighted addition, and the weights used for this weighted addition are the model parameters of the RNNPB. These weights include, for example, weights from the input units to the hidden layer units, weights from the PB units to the hidden layer units, weights from the context units to the hidden layer units, weights from the hidden layer units to the output layer units, and weights from the hidden layer units to the context units.

When the RNNPB described above is adopted as the learning model, the model parameter sharing unit 220 is provided with a weight matrix sharing unit 221 that causes the learning modules 210_1 to 210_N to share the weights serving as the model parameters of the RNNPB.

Here, an RNNPB has a plurality of weights as model parameters, and the matrix whose components are those weights is called a weight matrix.

The weight matrix sharing unit 221 causes each of the learning modules 210_1 to 210_N to share all of the weight matrices, that is, the pluralities of model parameters of RNNPB#1 to RNNPB#N stored in the model storage units 213_1 to 213_N.

That is, denoting the weight matrix of RNNPB#i as w_i, the weight matrix sharing unit 221 performs a sharing process in which the weight matrix w_i is corrected based on all of the weight matrices w_1 to w_N of the N learning modules 210_1 to 210_N, so that all of the weight matrices w_1 to w_N influence the weight matrix w_i.

Specifically, the weight matrix sharing unit 221 corrects the weight matrix w_i of RNNPB#i according to, for example, the following equation (3).

w_i = w_i + \Delta w_i \qquad \cdots (3)

Here, in equation (3), Δw_i is a correction component for correcting the weight matrix w_i, and is obtained according to, for example, equation (4).

\Delta w_i = \alpha_i \sum_{j=1}^{N} \beta_{ij} (w_j - w_i) \qquad \cdots (4)

In equation (4), Σ denotes the sum taken as the variable j runs over the integers from 1 to N.

Also, in equation (4), β_ij is a coefficient representing the degree to which the weight matrix w_j of RNNPB#j (j = 1, 2, ..., N) is made to influence the weight matrix w_i of RNNPB#i.

Therefore, the summation Σβ_ij(w_j - w_i) on the right side of equation (4) represents the weighted average, with the coefficients β_ij as weights, of the deviations (differences) of the weight matrices w_1 to w_N of RNNPB#1 to RNNPB#N from the weight matrix w_i of RNNPB#i, and α_i is a coefficient representing the degree to which this weighted average Σβ_ij(w_j - w_i) is made to influence the weight matrix w_i.

As the coefficients α_i and β_ij, for example, values greater than 0.0 and less than 1.0 can be adopted.

According to equation (4), the smaller the coefficient α_i, the weaker the sharing, so to speak (the smaller the influence of the weighted average Σβ_ij(w_j - w_i) on the weight matrix w_i), and the larger the coefficient α_i, the stronger the sharing.
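
The correction of equations (3) and (4) can be sketched as follows. This is a minimal sketch under assumptions: the function name share_weights is hypothetical, each weight matrix is represented as a flat list of floats, and all N corrections are computed from the weight matrices as they were before the sharing step (the text does not say whether the corrections are applied one after another or simultaneously).

```python
def share_weights(ws, alpha, beta):
    """Apply w_i = w_i + alpha_i * sum_j beta_ij * (w_j - w_i) to every i.

    ws:    list of N weight matrices, each flattened to a list of floats.
    alpha: list of N coefficients alpha_i.
    beta:  N x N list of coefficients beta_ij.
    """
    n = len(ws)
    shared = []
    for i in range(n):
        # Equation (4): correction component for module i.
        delta = [alpha[i] * sum(beta[i][j] * (ws[j][k] - ws[i][k])
                                for j in range(n))
                 for k in range(len(ws[i]))]
        # Equation (3): add the correction to the current weights.
        shared.append([w + d for w, d in zip(ws[i], delta)])
    return shared


# Two modules with one weight each; coefficients chosen between 0.0 and 1.0.
ws = [[0.0], [2.0]]
alpha = [0.5, 0.5]
beta = [[0.5, 0.5], [0.5, 0.5]]
print(share_weights(ws, alpha, beta))
```

With these values the two weights move toward each other, and a larger α_i would move them further in one step, which corresponds to stronger sharing.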

Note that the method of correcting the weight matrix w_i is not limited to equation (3); the correction can also be performed according to, for example, equation (5).

w_i = \alpha'_i w_i + (1 - \alpha'_i) \sum_{j=1}^{N} \beta'_{ij} w_j \qquad \cdots (5)

In equation (5), Σ denotes the sum taken as the variable j runs over the integers from 1 to N.

Also, in equation (5), β'_ij is a coefficient representing the degree to which the weight matrix w_j of RNNPB#j (j = 1, 2, ..., N) is made to influence the weight matrix w_i of RNNPB#i.

Therefore, the summation Σβ'_ij w_j in the second term on the right side of equation (5) represents the weighted average, with the coefficients β'_ij as weights, of the weight matrices w_1 to w_N of RNNPB#1 to RNNPB#N, and α'_i is a coefficient representing the degree to which this weighted average Σβ'_ij w_j is made to influence the weight matrix w_i.

As the coefficients α'_i and β'_ij, for example, values greater than 0.0 and less than 1.0 can be adopted.

According to equation (5), the larger the coefficient α'_i, the weaker the sharing (the smaller the influence of the weighted average Σβ'_ij w_j on the weight matrix w_i), and the smaller the coefficient α'_i, the stronger the sharing.
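
Equation (5) can be sketched in the same spirit. The function name share_weights_blend and the flat-list representation of a weight matrix are assumptions made for illustration. As a side observation, if the coefficients β'_ij for a given i happen to sum to 1 (which the text does not require), equation (5) gives the same result as equations (3) and (4) with α_i = 1 - α'_i and β_ij = β'_ij.

```python
def share_weights_blend(ws, alpha_p, beta_p):
    """Apply w_i = alpha'_i * w_i + (1 - alpha'_i) * sum_j beta'_ij * w_j.

    ws:      list of N weight matrices, each flattened to a list of floats.
    alpha_p: list of N coefficients alpha'_i.
    beta_p:  N x N list of coefficients beta'_ij.
    """
    n = len(ws)
    return [
        [alpha_p[i] * ws[i][k]
         + (1.0 - alpha_p[i]) * sum(beta_p[i][j] * ws[j][k] for j in range(n))
         for k in range(len(ws[i]))]
        for i in range(n)
    ]


# A large alpha' keeps each module close to its own weights (weak sharing).
ws = [[0.0], [2.0]]
print(share_weights_blend(ws, [0.75, 0.75], [[0.5, 0.5], [0.5, 0.5]]))
```

Here α'_i plays the opposite role to α_i in equation (4): it weights the module's own matrix, so lowering it strengthens the sharing.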

The plurality of learning modules 210_1 to 210_N, taken as a whole, offers excellent scalability in that new patterns can be learned easily.

By updating the model parameters of each of the learning modules 210_1 to 210_N (learning of the learning models) while sharing the model parameters among these modules, which as a whole are highly scalable, the generalization characteristics that would be obtained by learning performed in a single learning module can be obtained across the whole of the learning modules 210_1 to 210_N. As a result, a learning model that is scalable and at the same time has generalization characteristics is obtained as a whole.

The series of processes described above can be performed by hardware or by software. When the series of processes is performed by software, a program constituting the software is installed in a general-purpose computer or the like.

FIG. 29 shows a configuration example of an embodiment of a computer in which a program that executes the series of processes described above is installed.

The program can be recorded in advance on a hard disk 105 or a ROM 103 serving as a recording medium built into the computer.

Alternatively, the program can be stored (recorded) temporarily or permanently on a removable recording medium 111 such as a flexible disk, a CD-ROM (Compact Disc Read Only Memory), an MO (Magneto Optical) disk, a DVD (Digital Versatile Disc), a magnetic disk, or a semiconductor memory. Such a removable recording medium 111 can be provided as so-called packaged software.

Besides being installed in the computer from the removable recording medium 111 described above, the program can be transferred to the computer wirelessly from a download site via an artificial satellite for digital satellite broadcasting, or transferred to the computer by wire via a network such as a LAN (Local Area Network) or the Internet. The computer can receive the program transferred in this way with the communication unit 108 and install it on the built-in hard disk 105.

The computer has a built-in CPU (Central Processing Unit) 102. An input/output interface 110 is connected to the CPU 102 via a bus 101. When a command is input via the input/output interface 110 by the user operating an input unit 107 composed of a keyboard, a mouse, a microphone, and the like, the CPU 102 executes a program stored in the ROM (Read Only Memory) 103 accordingly. Alternatively, the CPU 102 loads into a RAM (Random Access Memory) 104 and executes a program stored on the hard disk 105, a program transferred from a satellite or a network, received by the communication unit 108, and installed on the hard disk 105, or a program read from the removable recording medium 111 mounted in a drive 109 and installed on the hard disk 105. The CPU 102 thereby performs the processing according to the flowcharts described above or the processing performed by the configurations of the block diagrams described above. Then, as necessary, the CPU 102 outputs the processing result from an output unit 106 composed of an LCD (Liquid Crystal Display), a speaker, and the like via, for example, the input/output interface 110, transmits it from the communication unit 108, or records it on the hard disk 105.

In this specification, the processing steps describing the program that causes the computer to perform the various processes need not necessarily be processed in time series in the order shown in the flowcharts; they also include processes executed in parallel or individually (for example, parallel processing or object-based processing).

The program may be processed by a single computer or processed in a distributed manner by a plurality of computers. Furthermore, the program may be transferred to a remote computer and executed there.

Note that embodiments of the present invention are not limited to the embodiment described above, and various modifications can be made without departing from the gist of the present invention.

For example, in calculating the connectivity and in generating the generated time-series data, the overlap portion of the model generation data may consist of L samples, equal to the overlap portion of the model learning data, or of a plurality of samples fewer than L.
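
The role of the overlap length can be illustrated with a small sketch of an error between the last part of the data generated by one learning model and the first part of the data generated by another. The function name connectivity_error, the use of a mean squared error, and scalar-valued samples are assumptions made for illustration; the overlap argument may be set to L or to a smaller number of samples.

```python
def connectivity_error(generated_a, generated_b, overlap):
    """Mean squared error between the tail of A and the head of B.

    generated_a, generated_b: sequences of scalar samples produced by two
    learning models; overlap: number of samples compared.
    """
    if not 1 <= overlap <= min(len(generated_a), len(generated_b)):
        raise ValueError("overlap must fit inside both sequences")
    tail = generated_a[-overlap:]   # last `overlap` samples of model A
    head = generated_b[:overlap]    # first `overlap` samples of model B
    return sum((a - b) ** 2 for a, b in zip(tail, head)) / overlap


seq_a = [0.0, 1.0, 2.0, 3.0]
seq_b = [2.0, 4.0, 9.0]
print(connectivity_error(seq_a, seq_b, overlap=2))
```

A smaller error indicates that the data generated by the second model continues more naturally from the data generated by the first.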

FIG. 1 is a block diagram showing a configuration example of an embodiment of a data processing device to which the present invention is applied.
FIG. 2 is a block diagram showing a more detailed configuration example of the learning device 10.
FIG. 3 is a diagram showing a configuration example of an RNN as a learning model.
FIG. 4 is a diagram explaining the division of teacher data and the learning of learning models using the model learning data obtained by the division.
FIG. 5 is a diagram explaining a method of calculating connectivity.
FIG. 6 is a flowchart explaining the learning process.
FIG. 7 is a flowchart explaining the connectivity calculation process.
FIG. 8 is a flowchart explaining the connectivity calculation process.
FIG. 9 is a flowchart explaining the connectivity calculation process.
FIG. 10 is a block diagram showing a more detailed configuration example of the data generation device 20.
FIG. 11 is a diagram explaining the forward calculation performed to calculate the generation model sequence.
FIG. 12 is a diagram explaining the generation of generated time-series data using the generation model sequence.
FIG. 13 is a flowchart explaining the data generation process.
FIG. 14 is a flowchart explaining the calculation process of the generation model sequence.
FIG. 15 is a flowchart explaining the time-series data generation process.
FIG. 16 is a flowchart explaining the time-series data generation process.
FIG. 17 is a flowchart explaining the time-series data generation process.
FIG. 18 is a flowchart explaining the time-series data generation process.
FIG. 19 is a diagram showing time-series data as teacher data and the generated time-series data generated using learning models that have learned from that time-series data.
FIG. 20 is a diagram showing an overview of a mobile environment in which a mobile robot performs a navigation task.
FIG. 21 is a plan view showing the mobile environment adopted in the simulation.
FIG. 22 is a diagram showing the mobile environment with the connectivity obtained by the learning process.
FIG. 23 is a diagram showing the mobile environment with the generation model sequence calculated using the connectivity.
FIG. 24 is a diagram showing the mobile environment with the movement trajectories corresponding to the model learning data learned by the learning models constituting the generation model sequence.
FIG. 25 is a diagram showing the mobile environment with the movement trajectory along which the mobile robot was moved so that the generated time-series data generated from the generation model sequence is observed.
FIG. 26 is a block diagram showing a configuration example of an embodiment of a learning device that shares model parameters.
FIG. 27 is a flowchart explaining learning by the learning device.
FIG. 28 is a block diagram showing a configuration example of the learning device when an RNNPB is adopted as the learning model.
FIG. 29 is a block diagram showing a configuration example of an embodiment of a computer to which the present invention is applied.

Explanation of symbols

10 learning device, 11 teacher data storage unit, 12 teacher data dividing unit, 13 model learning data storage unit, 14 learning unit, 15 model parameter storage unit, 16 connectivity calculation unit, 17 connectivity storage unit, 20 data generation device, 21 current data supply unit, 22 target data supply unit, 23 start point model selection unit, 24 end point model selection unit, 25 generation model sequence calculation unit, 26 time-series data generation unit, 27 time-series data output unit, 41 data allocation unit, 42_1 to 42_N calculation units, 51 model pair selection unit, 52 model parameter supply unit, 53 and 54 recognition generation units, 55 connectivity calculation unit, 61 current data distribution unit, 62 model parameter supply unit, 63_1 to 63_N recognition generation units, 64 start point model determination unit, 71 target data distribution unit, 72 model parameter supply unit, 73_1 to 73_N recognition generation units, 74 end point model determination unit, 81 start point model ID supply unit, 82 end point model ID supply unit, 83 sequence calculation unit, 91 sequence supply unit, 92 model parameter supply unit, 93_1 to 93_N recognition generation units, 94 integrated generation unit, 101 bus, 102 CPU, 103 ROM, 104 RAM, 105 hard disk, 106 output unit, 107 input unit, 108 communication unit, 109 drive, 110 input/output interface, 111 removable recording medium, 210_1 to 210_N learning modules, 211_1 to 211_N pattern input units, 212_1 to 212_N model learning units, 213_1 to 213_N model storage units, 220 model parameter sharing unit, 221 weight matrix sharing unit

Claims

A data processing device comprising:
start point model selecting means for selecting one of a plurality of learning models after learning as a start point model that is the start point of a generation model sequence, which is a sequence of the learning models used to generate time-series data, the plurality of learning models after learning being obtained by dividing time-series data into a plurality of partially overlapping pieces of data, outputting the pieces as model learning data used to learn a learning model that learns a time-series pattern and has an internal state, assigning the plurality of pieces of model learning data to the plurality of learning models so that one piece of model learning data is assigned to one learning model, and having each learning model learn the time-series pattern using the model learning data assigned to it;
end point model selecting means for selecting one of the plurality of learning models as an end point model that is the end point of the generation model sequence;
generation model sequence calculating means for obtaining, as the generation model sequence, the sequence of the learning models from the start point model to the end point model that minimizes the cumulative value of a connection cost, the connection cost for connecting another learning model after one learning model being a value corresponding to a connectivity obtained, for all of the plurality of learning models, by calculating a value representing the error between the data sequence of the last part of the time-series data generated by the one learning model and the data sequence of the first part of the time-series data generated by the other learning model, as the connectivity representing the appropriateness of connecting the time-series pattern learned by the other learning model after the time-series pattern learned by the one learning model; and
time-series data generating means for generating time-series data by determining, for each learning model constituting the generation model sequence, an initial value of the internal state of the learning model so as to reduce the error between the data sequence of the last part of the time-series data generated by that learning model and the data sequence of the first part of the time-series data generated by the learning model connected after it, and giving the initial value to the learning model.

The data processing apparatus according to claim 1, wherein the learning model is an RNN (Recurrent Neural Network).

A data processing method comprising the steps, performed by a data processing apparatus, of:
selecting, as a start point model serving as the start point of a generation model sequence, which is a sequence of learning models used to generate time-series data, one of a plurality of trained learning models, the trained learning models being obtained by dividing time-series data into a plurality of partially overlapping pieces of data, outputting the pieces as model learning data used to train learning models that learn a time-series pattern and have an internal state, assigning the plurality of pieces of model learning data to a plurality of learning models such that one piece of model learning data is assigned to one learning model, and training each learning model on the time-series pattern using the model learning data assigned to that learning model;
selecting another one of the plurality of learning models as an end point model serving as the end point of the generation model sequence;
calculating, for all of the plurality of learning models, a connectivity, which is a value representing the error between the data string of the last part of the time-series data generated by one learning model and the data string of the first part of the time-series data generated by another learning model, and which represents the appropriateness of connecting the time-series pattern learned by the other learning model after the time-series pattern learned by the one learning model; setting a value corresponding to the connectivity as a connection cost for connecting the other learning model after the one learning model; and obtaining, as the generation model sequence, the sequence of learning models from the start point model to the end point model that minimizes the cumulative value of the connection cost; and
determining, for each learning model constituting the generation model sequence, an initial value of the internal state of the learning model so as to reduce the error between the data string of the last part of the time-series data generated by that learning model and the data string of the first part of the time-series data generated by the learning model connected after it, and giving the initial value to the learning model to generate time-series data.
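
The final step of the method, choosing an initial internal state so that adjacent models join smoothly, can be sketched as follows. This is a simplified illustration under stated assumptions and not the claimed procedure: the "learning model" is a toy one-dimensional recurrence standing in for a trained model with an internal state, the preceding model's output is held fixed, and the initial state of the following model is picked by a plain search over candidate values. The claim text does not specify how the initial value is found, and all names (`run_model`, `pick_initial_state`) and parameters are hypothetical.

```python
def run_model(a, b, c0, steps):
    """Toy stand-in for a model with a scalar internal state.

    The state is updated as c <- a * c + b, and the output at each
    step is the current state. a and b play the role of learned
    parameters; c0 is the initial internal state.
    """
    out, c = [], c0
    for _ in range(steps):
        out.append(c)
        c = a * c + b
    return out


def squared_error(xs, ys):
    """Sum of squared differences between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(xs, ys))


def pick_initial_state(model, target_head, candidates):
    """Choose the candidate initial state whose generated head is
    closest to target_head (the tail of the preceding model's output)."""
    a, b = model
    n = len(target_head)
    return min(
        candidates,
        key=lambda c0: squared_error(run_model(a, b, c0, n), target_head),
    )


# Preceding model generates six samples; its last two form the overlap.
preceding_output = run_model(0.5, 1.0, 0.0, 6)
overlap_tail = preceding_output[-2:]

# Search initial states 0.0, 0.1, ..., 3.0 for the following model.
candidates = [k / 10 for k in range(31)]
c0 = pick_initial_state((0.9, 0.2), overlap_tail, candidates)
following_output = run_model(0.9, 0.2, c0, 6)
```

For an actual recurrent network the search would more plausibly be gradient-based over a vector-valued context; the candidate search here only keeps the example self-contained.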

A program for causing a computer to function as:
start point model selecting means for selecting, as a start point model serving as the start point of a generation model sequence, which is a sequence of learning models used to generate time-series data, one of a plurality of trained learning models, the trained learning models being obtained by dividing time-series data into a plurality of partially overlapping pieces of data, outputting the pieces as model learning data used to train learning models that learn a time-series pattern and have an internal state, assigning the plurality of pieces of model learning data to a plurality of learning models such that one piece of model learning data is assigned to one learning model, and training each learning model on the time-series pattern using the model learning data assigned to that learning model;
end point model selecting means for selecting another one of the plurality of learning models as an end point model serving as the end point of the generation model sequence;
generation model sequence calculating means for calculating, for all of the plurality of learning models, a connectivity, which is a value representing the error between the data string of the last part of the time-series data generated by one learning model and the data string of the first part of the time-series data generated by another learning model, and which represents the appropriateness of connecting the time-series pattern learned by the other learning model after the time-series pattern learned by the one learning model; for setting a value corresponding to the connectivity as a connection cost for connecting the other learning model after the one learning model; and for obtaining, as the generation model sequence, the sequence of learning models from the start point model to the end point model that minimizes the cumulative value of the connection cost; and
time-series data generating means for determining, for each learning model constituting the generation model sequence, an initial value of the internal state of the learning model so as to reduce the error between the data string of the last part of the time-series data generated by that learning model and the data string of the first part of the time-series data generated by the learning model connected after it, and for giving the initial value to the learning model to generate time-series data.

A data processing apparatus comprising:
dividing means for dividing time-series data into a plurality of partially overlapping pieces of data and outputting the pieces as model learning data used to train learning models that learn a time-series pattern and have an internal state;
learning means for assigning the plurality of pieces of model learning data obtained by dividing the time-series data to a plurality of learning models such that one piece of model learning data is assigned to one learning model, and for training each learning model on the time-series pattern using the model learning data assigned to that learning model; and
connectivity calculation means for calculating, for all of the plurality of learning models, the error between the data string of the last part of the time-series data generated by one learning model and the data string of the first part of the time-series data generated by another learning model, as a connectivity representing the appropriateness of connecting the time-series pattern learned by the other learning model after the time-series pattern learned by the one learning model.
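
The division into partially overlapping pieces and the connectivity calculation recited above can be sketched as follows. This is a minimal illustration, assuming fixed-length windows, a fixed overlap length, and mean squared error as the "error"; the claim fixes none of these choices. The generated sequences are stood in for by the windows themselves, as if each model reproduced its training data perfectly. All function names are hypothetical.

```python
def split_with_overlap(series, length, overlap):
    """Divide series into windows of `length` samples in which
    consecutive windows share `overlap` samples.

    Trailing samples that do not fill a whole window are dropped.
    """
    if not 0 < overlap < length:
        raise ValueError("overlap must satisfy 0 < overlap < length")
    step = length - overlap
    return [
        series[s:s + length]
        for s in range(0, len(series) - length + 1, step)
    ]


def connectivity(generated_i, generated_j, overlap):
    """Mean squared error between the last `overlap` samples generated
    by model i and the first `overlap` samples generated by model j.
    A smaller value means j connects more suitably after i."""
    tail = generated_i[-overlap:]
    head = generated_j[:overlap]
    return sum((a - b) ** 2 for a, b in zip(tail, head)) / overlap


def connectivity_matrix(generated, overlap):
    """Connectivity for every ordered pair of distinct models.
    Diagonal entries are None because a model is only compared
    with another model."""
    n = len(generated)
    return [
        [
            None if i == j
            else connectivity(generated[i], generated[j], overlap)
            for j in range(n)
        ]
        for i in range(n)
    ]


series = list(range(10))
windows = split_with_overlap(series, length=4, overlap=2)
matrix = connectivity_matrix(windows, overlap=2)
```

In this toy case each window's tail equals the head of the next window, so the connectivity from one window to its true successor is zero, while the reverse direction yields a large error.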

The data processing apparatus according to claim 5, wherein the learning model is an RNN (Recurrent Neural Network).

A data processing method comprising the steps, performed by a data processing apparatus, of:
dividing time-series data into a plurality of partially overlapping pieces of data and outputting the pieces as model learning data used to train learning models that learn a time-series pattern and have an internal state;
assigning the plurality of pieces of model learning data obtained by dividing the time-series data to a plurality of learning models such that one piece of model learning data is assigned to one learning model, and training each learning model on the time-series pattern using the model learning data assigned to that learning model; and
calculating, for all of the plurality of learning models, the error between the data string of the last part of the time-series data generated by one learning model and the data string of the first part of the time-series data generated by another learning model, as a connectivity representing the appropriateness of connecting the time-series pattern learned by the other learning model after the time-series pattern learned by the one learning model.

A program for causing a computer to function as:
dividing means for dividing time-series data into a plurality of partially overlapping pieces of data and outputting the pieces as model learning data used to train learning models that learn a time-series pattern and have an internal state;
learning means for assigning the plurality of pieces of model learning data obtained by dividing the time-series data to a plurality of learning models such that one piece of model learning data is assigned to one learning model, and for training each learning model on the time-series pattern using the model learning data assigned to that learning model; and
connectivity calculation means for calculating, for all of the plurality of learning models, the error between the data string of the last part of the time-series data generated by one learning model and the data string of the first part of the time-series data generated by another learning model, as a connectivity representing the appropriateness of connecting the time-series pattern learned by the other learning model after the time-series pattern learned by the one learning model.