
JP2011154554A - Missing value prediction device, missing value prediction method, and missing value prediction program

Info

Publication number: JP2011154554A
Application number: JP2010015910A
Authority: JP
Inventors: Yuki Kosaka; 勇気小阪; Takayuki Nakada; 貴之中田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2010-01-27
Filing date: 2010-01-27
Publication date: 2011-08-11

Abstract

PROBLEM TO BE SOLVED: To provide a missing value prediction device that improves the accuracy of predicting missing values in matrix-form data.

SOLUTION: A factor matrix is a matrix that is defined for each factor element of one factor in matrix-form data having two factors and that represents the features of the factor elements of the other factor. Among the parameters of a function that transforms the factor matrix, a parameter estimating means 82 estimates the parameter that maximizes the likelihood that the data obtained by transforming the factor matrix with the function is the matrix-form data. A missing value predicting means 83 predicts the missing values of matrix elements in the matrix-form data using the parameter estimated by the parameter estimating means 82 and the values of the known matrix elements in the matrix-form data.

COPYRIGHT: (C)2011, JPO&INPIT

Description

The present invention relates to a missing value prediction system, a missing value prediction method, and a missing value prediction program, and in particular to a missing value prediction system, method, and program for predicting the values of unknown matrix elements in matrix-form data that includes two factors.

When handling matrix-form data that includes unknown matrix elements, the values of the unknown matrix elements (hereinafter also referred to as missing values) must be predicted accurately from the observed, known part. For example, a collaborative filtering system that recommends products based on matrix data collecting product ratings from multiple users predicts the unknown ratings of products that a user has not rated, and recommends products with high predicted ratings to that user.
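The recommendation step of such a collaborative filtering system can be sketched as follows. This is a minimal illustration only: the function name `recommend`, the dictionaries, and all scores are hypothetical and do not come from the patent.

```python
def recommend(known, predicted, top_k=2):
    """Return the unrated items with the highest predicted scores.

    known:     dict item -> score given by the user (rated items only)
    predicted: dict item -> predicted score for every item
    """
    unrated = [item for item in predicted if item not in known]
    return sorted(unrated, key=lambda item: predicted[item], reverse=True)[:top_k]

# Made-up scores for one user
known = {"A": 5, "C": 2}
predicted = {"A": 4.8, "B": 3.1, "C": 2.2, "D": 4.4, "E": 1.9}
print(recommend(known, predicted))  # ['D', 'B']
```

Items the user has already rated are excluded, so only predicted missing values drive the recommendation.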

In such a system, a common way to predict the missing values of matrix data is to use a probabilistic model to decompose matrix-form data whose matrix elements include missing values into several smaller matrices, and then to predict the missing values of the original matrix data from the decomposed matrices. Methods of this kind are described in Non-Patent Document 1 and Non-Patent Document 2.

The method described in Non-Patent Document 1 uses a probabilistic model to extract the linear structure of matrix-form data whose matrix elements include missing values, and decomposes the data into a smaller matrix for each factor constituting the matrix. It then predicts matrix data that approximates the original matrix-form data using a linear combination of the decomposed matrices.

The method described in Non-Patent Document 2 uses a probabilistic model to extract the nonlinear structure of matrix-form data whose matrix elements include missing values, and decomposes the data into a smaller matrix for each factor constituting the matrix. It then predicts matrix data that approximates the original matrix-form data using the decomposed matrices. In other words, the method of Non-Patent Document 2 can be regarded as an extension of the method of Non-Patent Document 1.

Non-Patent Document 1: Ruslan Salakhutdinov and Andriy Mnih, "Probabilistic Matrix Factorization", Neural Information Processing Systems (NIPS), 2007.
Non-Patent Document 2: Neil D. Lawrence and Raquel Urtasun, "Non-linear Matrix Factorization with Gaussian Processes", Proceedings of the 26th International Conference on Machine Learning (ICML), pp. 601-608, Montreal, Canada, 2009.

In the methods described in Non-Patent Documents 1 and 2, the original matrix data is conditionally independent given the factor matrix. As a result, their prediction accuracy is lower than it would be without the conditional independence constraint on the original matrix data.

For example, in the method of Non-Patent Document 1, the matrix elements of the original matrix-form data are conditionally independent given the factor matrix of each factor contained in that data. Moreover, because the method of Non-Patent Document 1 extracts a linear structure, it cannot extract nonlinear, complex structure.

The method described in Non-Patent Document 2, by contrast, can extract nonlinear structure, so its prediction accuracy is higher than that of the missing value prediction system of Non-Patent Document 1. Specifically, the method of Non-Patent Document 2 assumes a prior distribution for one factor and integrates it out (marginalizes it), removing the influence of that factor on the original data in order to extract nonlinear structure. Even in this method, however, given the factor matrix of the factor that was not removed, the factor elements of the removed factor in the original matrix-form data are conditionally independent.

As described above, the methods of Non-Patent Documents 1 and 2 share the property that the original matrix data is conditionally independent given the factor matrix. Their accuracy is therefore lower than that of a model that places no conditional independence constraint on the original matrix-form data and that incorporates the correlations between its matrix elements.

An object of the present invention is therefore to provide a missing value prediction device, a missing value prediction method, and a missing value prediction program that can improve the accuracy of predicting missing values in matrix-form data.

The missing value prediction device according to the present invention comprises: parameter estimating means for estimating, among the parameters of a function that transforms a factor matrix, the parameter that maximizes the likelihood that the data obtained by transforming the factor matrix with the function is the matrix-form data, the factor matrix being a matrix that is defined for each factor element of one factor in matrix-form data including two factors and that represents the features of the factor elements of the other factor in the matrix-form data; and missing value predicting means for predicting the missing values of matrix elements in the matrix-form data using the parameter estimated by the parameter estimating means and the values of the known matrix elements in the matrix-form data.

The missing value prediction method according to the present invention comprises: estimating, among the functions that transform a factor matrix, the function that maximizes the likelihood that the data obtained by transforming the factor matrix is the matrix-form data, the factor matrix being a matrix that is defined for each factor element of one factor in matrix-form data including two factors and that represents the features of the factor elements of the other factor in the matrix-form data; and predicting the missing values of matrix elements in the matrix-form data using the parameters used in estimating the function and the values of the known matrix elements in the matrix-form data.

The missing value prediction program according to the present invention causes a computer to execute: a parameter estimation process of estimating, among the parameters of a function that transforms a factor matrix, the parameter that maximizes the likelihood that the data obtained by transforming the factor matrix with the function is the matrix-form data, the factor matrix being a matrix that is defined for each factor element of one factor in matrix-form data including two factors and that represents the features of the factor elements of the other factor in the matrix-form data; and a missing value prediction process of predicting the missing values of matrix elements in the matrix-form data using the parameter estimated in the parameter estimation process and the values of the known matrix elements in the matrix-form data.

According to the present invention, the accuracy of predicting missing values in matrix-form data can be improved.

Block diagram showing an embodiment of the missing value prediction device according to the present invention.
Flowchart showing an example of operation in the embodiment of the present invention.
Explanatory diagram showing an example of matrix-form data.
Explanatory diagram showing an example of a failure prediction system in a working example of the present invention.
Sequence diagram showing an example of operation in the working example of the present invention.
Explanatory diagram showing an example of transmitted matrix-form data.
Explanatory diagram showing an example of a prediction result in which missing values are filled with predicted values.
Block diagram showing an example of the minimum configuration of the missing value prediction device according to the present invention.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 is a block diagram showing an embodiment of the missing value prediction device according to the present invention. The missing value prediction device 101 of this embodiment includes input means 102, approximation means 103, prediction means 104, and output means 105.

The input means 102 passes matrix-form data, for example data entered by a user, to the approximation means 103. Here, the matrix-form data is a matrix with two factors, for example data in which the vertical axis and the horizontal axis of the matrix each correspond to a factor. In the following description, the matrix-form data input to the input means 102 is denoted Y.

For each factor element of one factor in the input matrix-form data Y, the approximation means 103 defines matrix-form data representing the features of the factor elements of the other factor in Y (hereinafter referred to as factor matrix data). Among the parameters of a function that transforms the factor matrix data, the approximation means 103 then estimates the parameter that maximizes the likelihood that the data obtained by transforming the factor matrix data with the function is the input matrix-form data Y. In the following description, the factor matrix data representing the features of the factor elements of one factor is denoted X, and the function that transforms X is denoted F.

Specifically, in order to approximate the matrix-form data Y input to the input means 102, the approximation means 103 estimates the parameters of a function whose argument is matrix-form data that represents, for each factor element of one factor contained in Y, the features of the factor elements of the other factor contained in Y.

The prediction means 104 predicts the values of the unknown matrix elements (that is, the missing values) from the result estimated by the approximation means 103 and the values of the known matrix elements of the matrix-form data. For example, the prediction means 104 predicts the missing values using the parameters estimated by the approximation means 103 and the values of the known matrix elements.

The output means 105 outputs the missing value prediction result produced by the prediction means 104. For example, the output means 105 may show the prediction result on a display device (not shown). Alternatively, the output means 105 may store the prediction result in a storage device (not shown) that is provided in, or connected to, the missing value prediction device 101. The output means 105 may also send the prediction result to another system (not shown).

The approximation means 103 and the prediction means 104 are realized by the CPU of a computer that operates according to a program (the missing value prediction program). For example, the program may be stored in a storage unit (not shown) of the missing value prediction device 101, and the CPU may read the program and operate as the approximation means 103 and the prediction means 104 according to it. Alternatively, the approximation means 103 and the prediction means 104 may each be realized by dedicated hardware.

Next, the operation is described. FIG. 2 is a flowchart showing an example of operation in this embodiment, and FIG. 3 is an explanatory diagram showing an example of matrix-form data. In the matrix-form data of FIG. 3, each axis represents a factor: the vertical axis represents the factor "movie" and the horizontal axis represents the factor "user". That is, the matrix-form data Y is data with two factors, given by the meanings of the vertical and horizontal axes. The matrix elements in FIG. 3 are the scores, in the range 1 to 5, that users gave to each movie, and blank cells (blank matrix elements) are the missing values of the matrix-form data.

The following description assumes that bivariate, single-sequence matrix-form data is input. The input matrix-form data is denoted Y and is M×N-dimensional. In the matrix-form data of FIG. 3, the number of movies (the factor elements of the factor "movie") is M (specifically, the five movies A to E) and the number of users (the factor elements of the factor "user") is N (specifically, the 30 users 1 to 30).
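As a rough illustration of such matrix-form data, the sketch below stores a small rating matrix with blanks and separates known from missing matrix elements. The ratings are invented (they are not the values of FIG. 3), and the names `ratings` and `split_known_missing` are hypothetical.

```python
# Toy stand-in for matrix-form data (values are made up):
# rows = movies, columns = users, None = missing value (blank cell).
ratings = [
    [5, None, 3, 1],     # movie A
    [None, 4, None, 2],  # movie B
    [1, 2, None, None],  # movie C
]

def split_known_missing(matrix):
    """Return (row, column) index lists of known and missing matrix elements."""
    known, missing = [], []
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            (missing if value is None else known).append((i, j))
    return known, missing

known, missing = split_known_missing(ratings)
print(len(known), "known,", len(missing), "missing")  # 7 known, 5 missing
```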

First, when the matrix-form data Y is input, the input means 102 passes it to the approximation means 103 (step S201). In order to approximate the input matrix-form data Y, the approximation means 103 estimates the parameters of a function whose argument is matrix-form data that represents, for each factor element of one factor contained in Y, the features of the factor elements of the other factor contained in Y (step S202).

Specifically, the approximation means 103 first reshapes the matrix-form data Y, whose matrix elements include missing values, into the following (M×N)×1-dimensional vector data (hereinafter referred to as the vector Y). M and N are the numbers of factor elements.

$$Y = [Y_1 \; Y_2 \; \cdots \; Y_N] = [y_{11}, y_{12}, \ldots, y_{1M}, y_{21}, y_{22}, \ldots, y_{2M}, \ldots, y_{N1}, y_{N2}, \ldots, y_{NM}]^{T}$$

Here, $Y_N$ denotes a row vector of the vector Y, and $y_{NM}$ denotes a matrix element of the vector Y. Next, the approximation means 103 defines the vector Y as in Equation 1 below.

$$Y = F(X) + \varepsilon \quad \text{(Equation 1)}$$

X is the factor matrix, and the function F is an (M×N)×1-dimensional vector. Specifically, F is defined as follows.

$$F = [F_1 \; F_2 \; \cdots \; F_N] = [f_{11}, f_{12}, \ldots, f_{1M}, f_{21}, f_{22}, \ldots, f_{2M}, \ldots, f_{N1}, f_{N2}, \ldots, f_{NM}]^{T}$$

Here, $F_N$ denotes a row vector of the function F. The term ε in Equation 1 is defined as follows.

$$\varepsilon \sim N(0, \sigma^{2} I)$$

Here, N(·) denotes a Gaussian distribution, $\sigma^{2}$ the variance, and I the identity matrix. Next, the factor matrix $X'$ illustrated below is defined as a factor matrix representing the features of each factor element.

As described above, $X'$ is an MN×q matrix representing the features of the factor on the horizontal axis of the original matrix-form data Y. MN is the number of features in the factor matrix $X'$. Here, the features of each factor element of the vertical-axis factor of Y are expressed by the factor elements of the horizontal axis, and the dimension of those features is M. That is, the features of the first row of Y (the first factor element of the vertical-axis factor) are expressed by the matrix elements (row 1, column 1), (row 1, column 2), ..., (row 1, column M). Since the vertical axis has N factor elements, the factor matrix $X'$ can be said to be matrix-form data that represents, for each factor element of the vertical-axis factor, the features of the factor elements of the horizontal-axis factor.

q is a parameter representing the dimension of the features of the factor matrix $X'$. Its value is given in advance, for example by a user.

Here, $X'_1$ denotes the row vector in the first row of the factor matrix $X'$. Accordingly, the row vector $Y_1$ in the first row of the matrix-form data Y, for example, can be expressed as follows, where ε is the same as in Equation 1.

$$Y_1 = F_1(X'_1) + \varepsilon$$

Next, the approximation means 103 defines the probability distribution of the vector Y as in Equation 2 below.

$$P(Y \mid F(X')) = N(Y \mid F(X'), \sigma^{2} I) \quad \text{(Equation 2)}$$
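The reshaping into the vector Y and the Gaussian likelihood of Equation 2 can be sketched in NumPy as follows. Several points are assumptions made for illustration: the data is held as an array `Y_nm` of shape (N, M) so that row-major flattening reproduces the ordering of the vector Y, missing values are stored as NaN, the likelihood is evaluated only over the known elements, and the values of `f` are arbitrary placeholders for $F(X')$.

```python
import numpy as np

def gaussian_log_likelihood(y, f, sigma2):
    """log N(y | f, sigma2 * I), evaluated only where y is known (not NaN)."""
    known = ~np.isnan(y)
    resid = y[known] - f[known]
    n = resid.size
    return -0.5 * (n * np.log(2.0 * np.pi * sigma2) + resid @ resid / sigma2)

# Assumed layout: Y_nm has shape (N, M), so row-major flattening gives
# [y_11, ..., y_1M, y_21, ..., y_NM]^T as in the vector Y.
Y_nm = np.array([[5.0, np.nan, 3.0],
                 [np.nan, 4.0, 2.0]])
y = Y_nm.reshape(-1)            # (M*N,) vector Y
f = np.full_like(y, 3.5)        # hypothetical values of F(X')
log_lik = gaussian_log_likelihood(y, f, sigma2=1.0)
print(log_lik)
```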

The following describes the case where the probability distribution of the function F is defined using a Gaussian process, as in Equation 3 below.

$$P(F \mid X) = GP(F \mid 0, K^{WX}) = N(F \mid 0, K^{WX}) \quad \text{(Equation 3)}$$

A Gaussian process is a stochastic process that treats a nonlinear input-output relationship as arising from a normal stochastic process, and it is characterized in that the stochastic process can be described using matrix operations alone. $K^{WX}$ denotes the covariance matrix of the Gaussian distribution. Each element $k^{WX}_{ij}$ of the covariance matrix can be expressed as follows.

$$k^{WX}_{ij} = \langle X'_i, X'_j \rangle$$

Here, $\langle \cdot, \cdot \rangle$ denotes the inner product; that is, $k^{WX}_{ij}$ is the inner product of $X'_i$ and $X'_j$.
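A minimal NumPy sketch of this covariance and of one draw from the Gaussian prior of Equation 3 is shown below. The sizes and the random `X_prime` are hypothetical stand-ins for the factor matrix $X'$, and the small diagonal jitter is a numerical convenience that the patent text does not mention.

```python
import numpy as np

rng = np.random.default_rng(0)
MN, q = 6, 2                        # hypothetical sizes
X_prime = rng.normal(size=(MN, q))  # stand-in for the factor matrix X'

# k_ij = <X'_i, X'_j>: Gram matrix of the rows of X'
K_WX = X_prime @ X_prime.T

# One draw of F from the prior N(0, K_WX); the jitter keeps Cholesky stable
# because the Gram matrix has rank at most q.
L = np.linalg.cholesky(K_WX + 1e-8 * np.eye(MN))
F_sample = L @ rng.normal(size=MN)
```

Because the covariance is a Gram matrix, it is symmetric and positive semidefinite by construction, which is what makes it a valid Gaussian covariance.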

The covariance matrix $K^{WX}$ is a parameter that must be estimated, but it is a large NM×NM matrix. Estimating it requires estimating all NM×NM matrix elements, which makes the computational cost high. The approximation means 103 therefore modifies Equation 3 and defines the probability distribution of the function F as in Equation 4 below.

Here, X is M×q matrix-form data and W is N×r matrix-form data representing a function. r is a parameter representing the dimension of the features of W, and its value is given in advance, for example by a user. In other words, Equation 4 is obtained by replacing the covariance matrix $K^{WX}$ in Equation 3 with the following quantity (hereinafter referred to as the estimation target parameter).

$$K^{W} \otimes K^{X}$$

Here, the symbol $\otimes$ denotes the Kronecker product.

$K^{W}$ is an N×N matrix, called a kernel, that satisfies Mercer's theorem. Likewise, $K^{X}$ is an M×M matrix that satisfies Mercer's theorem (that is, a kernel).

The matrix elements $k^{W}_{ij}$ of $K^{W}$ and $k^{X}_{ij}$ of $K^{X}$ can be expressed as follows.

$$k^{W}_{ij} = k^{W}(W_i, W_j) = \langle W_i, W_j \rangle$$
$$k^{X}_{ij} = k^{X}(X_i, X_j) = \langle X_i, X_j \rangle$$

Here, $\langle \cdot, \cdot \rangle$ denotes the inner product; that is, $k^{W}_{ij}$ is the inner product of $W_i$ and $W_j$, and $k^{X}_{ij}$ is the inner product of $X_i$ and $X_j$. Specifically, the matrix element $k^{W}(W_i, W_j)$ of $K^{W}$ represents the distance between $W_i$ and $W_j$ in the high-dimensional vector space into which $W_i$ and $W_j$ are nonlinearly mapped. The same applies to the matrix element $k^{X}(X_i, X_j)$ of $K^{X}$.
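The two kernel matrices can be built from W and X as in the following sketch. The linear kernel matches the inner-product form given above. The RBF kernel is included only as one assumed example of a kernel corresponding to a nonlinear mapping; the patent text does not name a specific kernel function, and the sizes and random data are hypothetical.

```python
import numpy as np

def linear_kernel(A):
    """k_ij = <A_i, A_j> for the rows A_i of A."""
    return A @ A.T

def rbf_kernel(A, length_scale=1.0):
    """k_ij = exp(-||A_i - A_j||^2 / (2 * length_scale^2)); an assumed example."""
    sq = np.sum(A**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (A @ A.T), 0.0)
    return np.exp(-d2 / (2.0 * length_scale**2))

rng = np.random.default_rng(1)
N, M, r, q = 4, 3, 2, 2            # hypothetical sizes
W = rng.normal(size=(N, r))
X = rng.normal(size=(M, q))
K_W = linear_kernel(W)             # N x N
K_X = linear_kernel(X)             # M x M
K_W_rbf = rbf_kernel(W)            # nonlinear alternative for K_W
```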

Like the covariance matrix $K^{WX}$, the estimation target parameter must be estimated. To estimate it, the approximation means 103 estimates N×N+M×M matrix elements. That is, estimating $K^{WX}$ requires estimating all NM×NM matrix elements, whereas estimating the estimation target parameter requires only N×N+M×M matrix elements. The computational cost is therefore lower than when estimating $K^{WX}$.
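The saving can be checked numerically with `numpy.kron`. The sketch below uses N = 30 and M = 5 from the FIG. 3 example; the kernel matrices are random stand-ins, and the index convention `n * M + m` is an assumption that matches the ordering of the vector Y.

```python
import numpy as np

N, M = 30, 5                       # users and movies, as in the FIG. 3 example
rng = np.random.default_rng(2)
A = rng.normal(size=(N, 2))
B = rng.normal(size=(M, 2))
K_W = A @ A.T                      # stand-in N x N kernel
K_X = B @ B.T                      # stand-in M x M kernel

K_full = np.kron(K_W, K_X)         # (N*M) x (N*M) covariance of F

# Entry for the pairs (n, m) and (n2, m2) under the ordering index = n*M + m
n, m, n2, m2 = 3, 1, 7, 4
assert np.isclose(K_full[n * M + m, n2 * M + m2], K_W[n, n2] * K_X[m, m2])

print("full covariance entries:", (N * M) ** 2)     # 22500
print("Kronecker factor entries:", N * N + M * M)   # 925
```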

Next, a method for reducing the computational order of parameter estimation is described. As noted above, the model of Equation 4 has fewer parameters to estimate than the model of Equation 3. To reduce the computational order further relative to the model of Equation 3, Equation 4 is first rewritten as Equation 5 below.

$F^{X}$ is a function whose argument is X, and $F^{W}$ is a function whose argument is W. The computational order of the models of Equations 4 and 5 is $O(M^{3}N^{3})$. Equation 5 is therefore approximated to define Equation 6 below. The computational order of the model of Equation 6 is $O(MNN^{2})$.

$F^{X}_{n}$ is a function defined under the assumption that it is independent for each n. In other words, defining $F^{X}_{n}$ amounts to assuming in Equation 4 that the N factor elements are independent of one another. This independence, however, differs from the independence used in, for example, the method of Non-Patent Document 2. In Equation 6, the part that can be said to assume independence is the following.

In the method of Non-Patent Document 2, by contrast, the part that assumes independence is expressed as in Equation 7 below.

Comparing the two, the part of Equation 6 that can be said to assume independence differs from Equation 7 in that it contains $F^{W}$. Thus, unlike the independence assumption in Equation 7, Equation 6 uses $F^{W}$ to compensate for the information lost through the independence assumption.

In real problems, data representing the features of each factor may additionally be available. For example, features of the factor "movie" may include information on the actors who appear in a movie, its release date, its production company, and its distributor. Features of the factor "user" may include gender, age group, and region. Data given in this way is called metadata. Using such metadata, the model of Equation 6 may be defined as in Equation 8 below, and further as in Equation 9 using Equation 8.

Here, R denotes the metadata of the factor on the vertical axis of the matrix-form data Y (the factor with M factor elements), and S denotes the metadata of the factor on the horizontal axis of Y (the factor with N factor elements). Equation 9, extended by introducing metadata, is called the hierarchical model of Equation 6.

Next, a specific method for estimating the parameters is described. The parameters that must be estimated are $K^{W}$, $K^{X}$, and σ. The approximation means 103 estimates the parameters that maximize the likelihood that the matrix obtained by transforming the factor matrix X with the function F is the matrix-form data Y. For example, the approximation means 103 may use a gradient method to estimate the parameters $K^{W}$, $K^{X}$, and σ that maximize the marginal likelihood $\log P(Y \mid X)$ given in Equation 10 below.
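The idea of maximizing a marginal likelihood with a gradient method can be sketched as follows. This is not the patent's Equation 10: the sketch assumes the standard Gaussian-process marginal likelihood $\log N(y \mid 0, K + \sigma^{2} I)$ over the known elements, keeps $K^{W}$ and $K^{X}$ fixed, and tunes only the noise variance for brevity, using a finite-difference gradient with a clipped, backtracking step. All names and data are hypothetical.

```python
import numpy as np

def log_marginal_likelihood(y_obs, K_obs, sigma2):
    """log N(y_obs | 0, K_obs + sigma2 * I) via a Cholesky factorization."""
    n = y_obs.size
    L = np.linalg.cholesky(K_obs + sigma2 * np.eye(n))
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_obs))
    return (-0.5 * y_obs @ alpha
            - np.sum(np.log(np.diag(L)))
            - 0.5 * n * np.log(2.0 * np.pi))

def fit_noise(y_obs, K_obs, log_s2=0.0, steps=50, lr=0.5, eps=1e-5):
    """Gradient ascent on log(sigma2) with a finite-difference gradient."""
    best = log_marginal_likelihood(y_obs, K_obs, np.exp(log_s2))
    for _ in range(steps):
        up = log_marginal_likelihood(y_obs, K_obs, np.exp(log_s2 + eps))
        down = log_marginal_likelihood(y_obs, K_obs, np.exp(log_s2 - eps))
        step = np.clip(lr * (up - down) / (2.0 * eps), -2.0, 2.0)
        value = log_marginal_likelihood(y_obs, K_obs, np.exp(log_s2 + step))
        if value > best:            # accept only improving steps
            log_s2, best = log_s2 + step, value
        else:
            lr *= 0.5               # otherwise shrink the step size
    return np.exp(log_s2), best

rng = np.random.default_rng(3)
N, M = 4, 3
W = rng.normal(size=(N, 2))
X = rng.normal(size=(M, 2))
K = np.kron(W @ W.T, X @ X.T)
y = rng.multivariate_normal(np.zeros(N * M), K + 0.1 * np.eye(N * M))
obs = np.array([0, 1, 3, 4, 6, 8, 9, 11])      # indices of known elements
K_obs = K[np.ix_(obs, obs)]
start = log_marginal_likelihood(y[obs], K_obs, 1.0)
sigma2_hat, final = fit_noise(y[obs], K_obs)
```

A fuller implementation would also differentiate with respect to the entries of W and X (and hence $K^{W}$ and $K^{X}$), typically with analytic gradients rather than finite differences.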

The model derived in this way (the model for predicting missing values) has the following characteristics. The matrix-form data Y, whose matrix elements include missing values, is modeled probabilistically by the function F and the factor matrix X, and the prior probability of the function F (for example, Equation 3 above) is defined by a Gaussian process. The model can therefore take nonlinear correlation structure into account.

Furthermore, because the original matrix data Y is expressed using the function F and the factor matrix X, this model imposes no conditional independence constraint on Y. Specifically, the model incorporates the correlations between the matrix elements of Y, which suppresses degradation of prediction accuracy.

Furthermore, this model focuses on the structure of the covariance matrix of the function F and expresses the large covariance matrix as the Kronecker product of two small covariance matrices. This reduces the number of parameters to be estimated and hence the amount of computation. Specifically, for the factor matrix X, which represents the features of each element of one factor in the original matrix data Y, a number of parameters equal to the total number of its matrix elements (feature dimension × number of elements) is estimated; for the other factor, roughly the square of its number of elements in the matrix data is estimated.
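The saving from the Kronecker-product structure can be illustrated with a short Python sketch. The sizes `n_w` and `n_x` and the randomly generated covariance matrices are hypothetical and serve only to compare the number of entries in the large covariance matrix with the number in its two small factors.

```python
import numpy as np

def random_covariance(size, rng):
    """Return a random symmetric positive semi-definite matrix (illustrative only)."""
    a = rng.standard_normal((size, size))
    return a @ a.T

rng = np.random.default_rng(0)
n_w, n_x = 4, 3                     # hypothetical numbers of factor elements

K_W = random_covariance(n_w, rng)   # small covariance for one factor
K_X = random_covariance(n_x, rng)   # small covariance for the other factor

# Large covariance over all n_w * n_x matrix elements, built from the two small ones.
K_full = np.kron(K_W, K_X)

full_entries = K_full.size               # (n_w * n_x)^2 entries
factored_entries = K_W.size + K_X.size   # n_w^2 + n_x^2 entries
print(K_full.shape, full_entries, factored_entries)   # (12, 12) 144 25
```

Even at this small size the two factors hold 25 entries against 144 for the full matrix, and the gap widens quickly as the number of factor elements grows.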

Next, the prediction means 104 predicts the values of the unknown matrix elements from the result estimated by the approximation means 103 and the values of the known matrix elements of the matrix data (step S203). For example, the prediction means 104 predicts an unknown matrix element (missing value) by Equation 11 below, using the parameters K^W and K^X estimated by the approximation means 103 (that is, W and X) and the values of the known matrix elements of the matrix data.

Here, D is an NM × NM matrix whose diagonal entries are σ², and Y is the vector obtained by reshaping the original matrix data Y. k^W_i: denotes the N-dimensional column vector formed from the i-th row of K^W, and k^X_i: denotes the column vector formed from the i-th row of K^X. For example, when the value of the element y_1M of the vector Y is missing, the prediction means 104 may predict the unknown matrix element (missing value) by Equation 12 below, which is obtained from Equation 11.
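Equations 11 and 12 are not reproduced in this text. As a rough illustration of this kind of prediction, the Python sketch below fills each missing entry with a standard Gaussian-process posterior mean computed from the observed entries, using the covariance K^W ⊗ K^X plus σ² on the diagonal. This is an assumed form and may differ from the exact expression in Equation 11. The function name `predict_missing`, the use of NaN to mark missing entries, and the example values are all hypothetical.

```python
import numpy as np

def predict_missing(Y, K_W, K_X, sigma):
    """Fill NaN entries of Y with Gaussian-process posterior means.

    Rows of Y are paired with K_W and columns with K_X (assumed convention).
    """
    y = Y.reshape(-1)                       # row-major vectorization of Y
    observed = ~np.isnan(y)
    K = np.kron(K_W, K_X)                   # covariance between all matrix elements
    # Covariance among observed entries, with noise variance on the diagonal.
    C_oo = K[np.ix_(observed, observed)] + sigma**2 * np.eye(observed.sum())
    weights = np.linalg.solve(C_oo, y[observed])
    y_filled = y.copy()
    # Posterior mean of each missing entry given the observed entries.
    y_filled[~observed] = K[np.ix_(~observed, observed)] @ weights
    return y_filled.reshape(Y.shape)

# Hypothetical 2 x 2 example with one missing entry.
Y = np.array([[1.0, 1.0],
              [1.0, np.nan]])
K_W = np.array([[1.0, 0.8], [0.8, 1.0]])
K_X = np.array([[1.0, 0.5], [0.5, 1.0]])
print(predict_missing(Y, K_W, K_X, sigma=0.1))
```

The observed entries are returned unchanged, so the result corresponds to the original matrix data with only its missing entries replaced by predictions.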

Hereinafter, Y′ denotes the data in which every missing entry of the matrix data Y has been predicted (that is, the matrix data with its missing values filled in by the predicted values). Finally, the output means 105 outputs the matrix data Y′, in which the missing values are filled with the predicted values (step S204).

As described above, according to this embodiment, a factor matrix X is first defined for each factor element of the vertical-axis factor of the matrix data Y containing two factors; X represents the features of each factor element of the horizontal-axis factor of Y. Next, the approximation means 103 estimates, among the parameters of the function F that transforms X, the parameters that maximize the likelihood (marginal likelihood) that the data obtained when the function transforms X is the matrix data Y. The prediction means 104 then predicts the missing values of the matrix elements using the estimated parameters K^W and K^X (that is, W and X) and the values of the known matrix elements in Y. This improves the accuracy of predicting missing values in matrix data.

For example, the method described in Non-Patent Document 1 decomposes the original matrix data into smaller matrix data for every factor and must estimate all elements of the decomposed factor matrices, so its computational cost is high. The method described in Non-Patent Document 2 needs to estimate only the elements of one factor matrix, but this merely halves the computational cost of Non-Patent Document 1, and the original matrix data is still assumed to be conditionally independent.

In contrast, this embodiment expresses the large covariance matrix as the Kronecker product of two small covariance matrices, which reduces the number of parameters to be estimated and therefore keeps the computational cost down.

Hereinafter, the present invention will be described using a specific example, but the scope of the invention is not limited to what is described below. The example applies the invention to a system that receives product defect information from companies and factories and predicts, from that information, defects that will occur in the future (hereinafter, a defect prediction system).

FIG. 4 is an explanatory diagram showing an example of the defect prediction system in this example. Components that are the same as in the above embodiment are given the same reference numerals as in FIG. 1, and their description is omitted. The defect prediction system in this example includes a client system 401 and a server system 403, which are connected to each other via a communication network 402.

First, an outline of this example is given. Client systems operated at companies that manufacture home appliances such as televisions and refrigerators or component products such as semiconductors, or client systems operated at the factories that manufacture those products, send product defect information to the server system. From the past defect information it has received (known defect information), the server system predicts defects that have not yet occurred but will occur in the future, and sends each client system a prediction of whether such defects will occur. By using this prediction information, the client side can prevent defects before they occur and can prepare for them in advance.

Each component is described below. The client system 401 manages product failure information and the like. Specifically, the client system 401 stores information on product failures that occurred in the past and transmits it to the server system 403 via the communication network 402.

The server system 403 includes the missing value prediction apparatus 101 and a prediction result storage unit 404, and manages information from the client system 401. Specifically, on receiving product failure information from the client system 401, the server system 403 predicts defects that will occur in the future based on that information and transmits the prediction information to the client system 401.

The missing value prediction apparatus 101 is the same as the one described in the above embodiment. Here, it predicts, from the input matrix data, information on defect locations that have not yet occurred in the products. The prediction result storage unit 404 stores the matrix data output by the missing value prediction apparatus 101, in which the missing values have been predicted.

Next, the operation will be described. FIG. 5 is a sequence diagram showing an example of the operation in this example.

First, each client system 401 transmits defect information for each product to the server system 403 via the communication network 402 (step S501). FIG. 6 is an explanatory diagram showing an example of the matrix data transmitted from each client system 401 in this example. Each axis of the matrix data illustrated in FIG. 6 represents a factor of the matrix data. Because matrix data has a vertical axis and a horizontal axis, the number of factors is two. In FIG. 6, the vertical axis represents the type of "defect" and the horizontal axis represents the "product" in which a defect can occur.

The following description assumes that each matrix element is the number of defect occurrences for a product. That is, the matrix elements illustrated in FIG. 6 indicate defect counts, and blank matrix elements indicate missing values (that is, defects that have not occurred). The matrix elements are not limited to defect counts, however: they may be, for example, the probability that a defect occurs, or a binary value indicating whether a defect has occurred. When the matrix elements are defect counts, the predicted value for a missing entry can be regarded as the number of defects that may occur in the future.

For example, each client system 401 transmits the matrix data illustrated in FIG. 6 to the server system 403. When a missing entry is written in the form "(product, defect)", the missing entries of the matrix data illustrated in FIG. 6 are (1, D), (1, E), (2, D), (3, E), (4, A), (4, C), (5, E), (30, A), and (30, B).
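The "(product, defect)" notation for missing entries can be derived mechanically from a matrix whose blank cells are marked as missing. The following Python sketch uses a small made-up table, not the data of FIG. 6, with defects on the vertical axis and products on the horizontal axis. The function name `missing_cells` and the use of NaN for blank cells are assumptions made for illustration.

```python
import numpy as np

# Hypothetical defect-count table: rows are defects, columns are products.
defects = ["A", "B", "C"]
products = ["1", "2", "3"]
Y = np.array([[3.0, np.nan, 1.0],
              [np.nan, 2.0, 4.0],
              [5.0, 1.0, np.nan]])

def missing_cells(Y, defects, products):
    """List the missing entries of Y as sorted (product, defect) pairs."""
    rows, cols = np.where(np.isnan(Y))
    return sorted((products[c], defects[r]) for r, c in zip(rows, cols))

print(missing_cells(Y, defects, products))
# [('1', 'B'), ('2', 'A'), ('3', 'C')]
```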

On receiving the product defect information, the server system 403 inputs it to the missing value prediction apparatus 101 as matrix data (step S502). Hereinafter, this input matrix data is denoted Y. The missing value prediction apparatus 101 (more specifically, the approximation means 103 and the prediction means 104 in FIG. 1) predicts the missing values based on the input matrix data Y (step S503).

Specifically, as shown in the above embodiment, a factor matrix representing the features of the information indicating product defects is first defined for each piece of information indicating a product. The approximation means 103 estimates, among the parameters of the function that transforms the factor matrix, the parameters that maximize the likelihood that the data obtained when the function transforms the factor matrix is the input matrix data. The prediction means 104 then predicts information on defect locations that have not yet occurred in the products, using the estimated parameters and the information on known defect locations in the matrix data.

The missing value prediction apparatus 101 (more specifically, the output means 105 in FIG. 1) then outputs the matrix data in which the missing values are filled with the predicted values (hereinafter, Y′). Specifically, the missing value prediction apparatus 101 stores the prediction result in the prediction result storage unit 404 (step S504).

FIG. 7 is an explanatory diagram showing an example of the prediction result (that is, the matrix data Y′) in which the missing values are filled with the values predicted in this example. When the entries that were missing in the matrix data Y are written in the form "(product, defect, predicted value)", the filled entries of the matrix data Y′ illustrated in FIG. 7 are (1, D, 1.01), (1, E, 0.3), (2, D, 2), (3, E, 2.0), (4, A, 0.5), (4, C, 2.5), (5, E, 5), (30, A, 2.0), and (30, B, 2.0).
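The "(product, defect, predicted value)" triples can be read off by comparing the input matrix with the filled matrix. The Python sketch below does this for a small made-up pair of matrices, not the data of FIG. 6 and FIG. 7. The function name `filled_cells` and the NaN convention for missing entries are hypothetical.

```python
import numpy as np

def filled_cells(Y, Y_filled, defects, products):
    """List (product, defect, predicted value) for every entry missing in Y."""
    rows, cols = np.where(np.isnan(Y))
    return [(products[c], defects[r], float(Y_filled[r, c]))
            for r, c in zip(rows, cols)]

# Hypothetical input matrix Y and its filled counterpart Y'.
defects = ["A", "B"]
products = ["1", "2"]
Y = np.array([[1.0, np.nan],
              [np.nan, 4.0]])
Y_filled = np.array([[1.0, 2.5],
                     [0.5, 4.0]])

print(filled_cells(Y, Y_filled, defects, products))
# [('2', 'A', 2.5), ('1', 'B', 0.5)]
```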

Next, the missing value prediction apparatus 101 (more specifically, the output means 105 in FIG. 1) transmits the prediction result (that is, the matrix data Y′) to the client system 401 (step S505), and the client system 401 receives it (step S506).

As a result, the client systems 401 operated at, for example, companies that manufacture home appliances such as televisions and refrigerators or component products such as semiconductors, and at the factories that manufacture those products, can use the prediction information on defects that may occur in the future to prevent those defects before they occur. These companies and factories can also prepare for defects before they occur.

Next, an example of the minimum configuration of the missing value prediction apparatus according to the present invention will be described. FIG. 8 is a block diagram showing an example of that minimum configuration. The missing value prediction apparatus 81 according to the present invention includes parameter estimation means 82 and missing value prediction means 83. The parameter estimation means 82 estimates (for example, using Equation 5), among the parameters of a function (for example, the function F) that transforms a factor matrix (for example, the factor matrix X), a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data. Here, the factor matrix is a matrix that is defined for each factor element of one factor (for example, each factor element of the vertical-axis factor) in matrix data (for example, the matrix data Y) including two factors (for example, the vertical-axis and horizontal-axis factors) and that represents the features of each factor element of the other factor (for example, each factor element of the horizontal-axis factor) in the matrix data. The missing value prediction means 83 predicts (for example, by Equation 11) a missing value of a matrix element (for example, y_ij in Equation 11) in the matrix data, using the parameters estimated by the parameter estimation means 82 (for example, W and X) and the values of known matrix elements in the matrix data.

Such a configuration improves the accuracy of predicting missing values in matrix data. Specifically, because the original matrix data is expressed using the function and the factor matrix, no conditional independence constraint is imposed on the original matrix data, and prediction accuracy can therefore be improved.

At least the following missing value prediction apparatuses are also disclosed in the embodiments described above.

(1) A missing value prediction apparatus including: parameter estimation means that estimates (for example, using Equation 10), among the parameters of a function (for example, the function F) that transforms a factor matrix (for example, the factor matrix X), a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, the factor matrix being a matrix that is defined for each factor element of one factor (for example, each factor element of the vertical-axis factor) in matrix data (for example, the matrix data Y) including two factors (for example, the vertical-axis and horizontal-axis factors) and that represents the features of each factor element of the other factor (for example, each factor element of the horizontal-axis factor) in the matrix data; and missing value prediction means that predicts (for example, by Equation 11) a missing value of a matrix element (for example, y_ij in Equation 11) in the matrix data, using the parameters estimated by the parameter estimation means (for example, W and X) and the values of known matrix elements in the matrix data.

(2) A missing value prediction apparatus in which the parameter estimation means estimates the parameters of the function that maximize the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, based on the probability distribution of the function with respect to the factor matrix (for example, Equation 3), which is defined using the covariance matrix of a Gaussian distribution (for example, K^WX).

(3) A missing value prediction apparatus in which the parameter estimation means estimates the parameters of the function that maximize the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, based on a probability distribution (for example, Equation 4) in which the covariance matrix of the Gaussian distribution is expressed as the Kronecker product of two matrices representing covariance (for example, the parameters to be estimated).

(4) A missing value prediction apparatus in which the parameter estimation means estimates the parameters of the function that maximize the value of the marginal likelihood determined by the probability distribution of the function with respect to the factor matrix and the probability distribution of the matrix data with respect to the function and the factor matrix.

(5) A missing value prediction apparatus in which, when the matrix data includes, as the two factors, information indicating products and information indicating defects of those products, the parameter estimation means estimates, among the parameters of a function that transforms a factor matrix, a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, the factor matrix being a matrix that is defined for each piece of information indicating a product, which is one factor in the matrix data, and that represents the features of the information indicating defects of the product, which is the other factor in the matrix data; and the missing value prediction means predicts information on defect locations that have not yet occurred in the product, using the estimated parameter and the information on known defect locations in the matrix data.

The present invention is suitably applied to a missing value prediction system that predicts the values of unknown matrix elements in matrix data including two factors.

Description of symbols
101 Missing value prediction apparatus
102 Input means
103 Approximation means
104 Prediction means
105 Output means
401 Client system
402 Communication network
403 Server system
404 Prediction result storage unit

Claims

A missing value prediction apparatus comprising: parameter estimation means for estimating, among the parameters of a function that transforms a factor matrix, a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, the factor matrix being a matrix that is defined for each factor element of one factor in matrix data including two factors and that represents the features of each factor element of the other factor in the matrix data; and
missing value prediction means for predicting a missing value of a matrix element in the matrix data using the parameter estimated by the parameter estimation means and the values of known matrix elements in the matrix data.

The missing value prediction apparatus according to claim 1, wherein the parameter estimation means estimates the parameter of the function that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, based on a probability distribution of the function with respect to the factor matrix that is defined using a covariance matrix of a Gaussian distribution.

The missing value prediction apparatus according to claim 2, wherein the parameter estimation means estimates the parameter of the function that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, based on a probability distribution in which the covariance matrix of the Gaussian distribution is expressed as the Kronecker product of two matrices representing covariance.

The missing value prediction apparatus according to claim 3, wherein the parameter estimation means estimates the parameter of the function that maximizes the value of the marginal likelihood determined by the probability distribution of the function with respect to the factor matrix and the probability distribution of the matrix data with respect to the function and the factor matrix.

The missing value prediction apparatus according to claim 1, wherein, when the matrix data includes, as the two factors, information indicating products and information indicating defects of the products, the parameter estimation means estimates, among the parameters of a function that transforms a factor matrix, a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, the factor matrix being a matrix that is defined for each piece of information indicating a product, which is one factor in the matrix data, and that represents the features of the information indicating defects of the product, which is the other factor in the matrix data, and
the missing value prediction means predicts information on defect locations that have not yet occurred in the product, using the estimated parameter and information on known defect locations in the matrix data.

A missing value prediction method comprising: estimating, among the parameters of a function that transforms a factor matrix, a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, the factor matrix being a matrix that is defined for each factor element of one factor in matrix data including two factors and that represents the features of each factor element of the other factor in the matrix data; and
predicting a missing value of a matrix element in the matrix data using the estimated parameter and the values of known matrix elements in the matrix data.

The missing value prediction method according to claim 6, wherein the parameter of the function that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data is estimated based on a probability distribution of the function with respect to the factor matrix that is defined using a covariance matrix of a Gaussian distribution.

A missing value prediction program for causing a computer to execute:
a parameter estimation process of estimating, among the parameters of a function that transforms a factor matrix, a parameter that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, the factor matrix being a matrix that is defined for each factor element of one factor in matrix data including two factors and that represents the features of each factor element of the other factor in the matrix data; and
a missing value prediction process of predicting a missing value of a matrix element in the matrix data using the parameter estimated in the parameter estimation process and the values of known matrix elements in the matrix data.

The missing value prediction program according to claim 8, which causes the computer,
in the parameter estimation process, to estimate the parameter of the function that maximizes the likelihood that the data obtained when the function transforms the factor matrix is the matrix data, based on a probability distribution of the function with respect to the factor matrix that is defined using a covariance matrix of a Gaussian distribution.