JP7502808B2

JP7502808B2 - Learning device, learning method, and learning program

Info

Publication number: JP7502808B2
Application number: JP2022101590A
Authority: JP
Inventors: 弘揮出野; 龍一高橋; 嗣加藤
Original assignee: Tokyo Weld Co Ltd
Current assignee: Tokyo Weld Co Ltd
Priority date: 2022-06-24
Filing date: 2022-06-24
Publication date: 2024-06-19
Anticipated expiration: 2042-06-24
Also published as: CN119487530A; TWI849972B; JP2024002431A; TW202405702A; KR20250020696A; WO2023248948A1

Description

本発明は、学習装置、学習方法及び学習プログラムに関する。 The present invention relates to a learning device, a learning method, and a learning program.

従来から、画像処理の技術を用いて製品の良否の判定等を行うために、ニューラルネットワークを用いたディープラーニングによる学習結果を利用することが行われている。また、ディープラーニングによる学習を効率化するために、特許文献１において、モデルの学習を適切に行うための学習支援装置、及び学習装置等が提案されている。 Conventionally, the results of learning through deep learning using neural networks have been used to determine the quality of products using image processing technology. In addition, in order to make learning through deep learning more efficient, Patent Document 1 proposes a learning support device and a learning device for appropriately learning a model.

特許文献１の学習支援装置は、対象データを第１ラベル及び第２ラベルの何れかに分類するように教師データを用いて学習されたモデルと、第１ラベルが付与された第１データ及び第２ラベルが付与された第２データを有する教師データとに基づいて、教師データの特徴量を教師データごとに導出している。 The learning support device of Patent Document 1 derives features of the teacher data for each piece of teacher data based on a model trained using teacher data to classify target data into either a first label or a second label, and the teacher data having first data assigned with a first label and second data assigned with a second label.

また、同装置では、第１ラベル及び第２ラベルの何れかがそれぞれに付与された少なくとも１つの教師候補データとモデルとに基づいて教師候補データの特徴量を教師候補データごとに導出する導出部と、教師候補データと第１データとの距離、及び、教師候補データと第２データとの距離の少なくとも一方を教師候補データごとに算出する算出部と、距離に基づいて教師候補データの中から教師データとして追加するデータを選択する選択部と、を備える。 The device also includes a derivation unit that derives features of the teacher candidate data for each teacher candidate data based on at least one of teacher candidate data to which either a first label or a second label has been assigned and a model, a calculation unit that calculates at least one of the distance between the teacher candidate data and the first data and the distance between the teacher candidate data and the second data for each teacher candidate data, and a selection unit that selects data to be added as teacher data from the teacher candidate data based on the distance.

特許文献１の学習支援装置では、第１ラベルに分類された複数の教師候補データの中から、前記距離が第２ラベルの教師データに近いものを選択し、第２ラベルに分類された複数の教師候補データの中から、前記距離が第１ラベルの教師データに近いものを選択する。 In the learning support device of Patent Document 1, from among multiple teacher candidate data classified into a first label, data whose distance is close to the teacher data of a second label is selected, and from among multiple teacher candidate data classified into a second label, data whose distance is close to the teacher data of the first label is selected.

当該処理によって、第１ラベルである教師候補データのうち第２ラベルに近いもの、即ち、第１ラベルか第２ラベルかの識別が困難な教師候補データを選択することになる。第２ラベルの教師候補データについても同様である。特許文献１の学習支援装置では、このような識別が困難な教師候補データを教師データとすることにより、教師データの質を向上させ、モデルを効率的に学習させることが可能となった。 This process selects teacher candidate data with the first label that is close to the second label, i.e., teacher candidate data that is difficult to distinguish between the first label and the second label. The same applies to teacher candidate data with the second label. In the learning support device of Patent Document 1, by using such teacher candidate data that is difficult to distinguish as teacher data, it is possible to improve the quality of the teacher data and efficiently train the model.

特開２０２１－１０３３４４号公報JP 2021-103344 A

特許文献１に記載の学習支援装置によって、効率のよい学習を行うことができるようになった。ところで、一般的な学習方法においては、教師データを所定の手法によって増加させる増加処理を行うと、学習効果を高められることが一般に知られている。このため、特許文献１に記載の学習支援装置においても、教師データを増加させることにより、学習効果が高まることが期待される。 The learning support device described in Patent Document 1 makes it possible to carry out efficient learning. However, in general learning methods, it is generally known that the learning effect can be improved by performing an increase process in which the teacher data is increased using a specified method. For this reason, it is expected that the learning effect will also be improved by increasing the teacher data in the learning support device described in Patent Document 1.

しかしながら、特許文献１に記載の学習支援装置において、選択された教師データに対して増加処理を行ったところ、最大エポック数に達しても学習の終了条件を満たすことができないケースや、学習時間が過大となるケースが発生することが判明した。 However, when the learning support device described in Patent Document 1 performed an increase process on selected teacher data, it was found that there were cases where the learning termination condition could not be met even when the maximum number of epochs was reached, and cases where the learning time became excessive.

本発明は、上記課題に鑑み、モデルの学習を適切に行うことができる学習装置、学習方法及び学習プログラムを提供することを目的とする。 In view of the above problems, the present invention aims to provide a learning device, a learning method, and a learning program that can appropriately learn a model.

上記目的を達成するために、本発明の学習装置は、対象データについて評価値を算出し、前記評価値と所定の閾値を比較して少なくとも第１ラベル又は第２ラベルの何れかのラベルに分類するように教師データを用いて学習するモデルに、学習を行わせる学習装置であって、少なくとも前記第１ラベル又は前記第２ラベルの何れかに分類された教師候補データを取得するデータ取得部と、前記教師候補データの内容の一部を変更して一又は複数の変更教師候補データを生成する増加処理部と、前記教師候補データ及び前記変更教師候補データを母集団として、前記母集団について前記モデルを用いて評価値を算出し、前記評価値が前記閾値を含む所定の抽出範囲にあるデータを抽出する抽出処理を行う抽出部と、所定の学習率で前記モデルに学習処理を行わせる学習部と、前記抽出部及び前記学習部を制御する制御部を備え、前記制御部は、前記抽出部により前記抽出処理を行った結果抽出された抽出データの数が所定の停止数を超えるときは、前記抽出データを前記教師データに追加して前記学習処理を行うと共に、前記抽出処理を行い、新たな抽出データの数が前記停止数となるまで前記学習処理と前記抽出処理を実行することを特徴とする。 In order to achieve the above object, the learning device of the present invention is a learning device that performs learning on a model that uses teacher data to calculate an evaluation value for target data, compares the evaluation value with a predetermined threshold, and classifies the target data into at least one of a first label or a second label. The learning device includes a data acquisition unit that acquires teacher candidate data classified into at least one of the first label or the second label, an increase processing unit that changes part of the content of the teacher candidate data to generate one or more modified teacher candidate data, an extraction unit that performs an extraction process that uses the teacher candidate data and the modified teacher candidate data as a population, calculates an evaluation value for the population using the model, and extracts data whose evaluation value is in a predetermined extraction range including the threshold, a learning unit that causes the model to perform a learning process at a predetermined learning rate, and a control unit that controls the extraction unit and the learning unit. When the number of extracted data extracted as a result of the extraction unit performing the extraction process exceeds a predetermined stop number, the control unit adds the extracted data to the teacher data and performs the learning process, and performs the extraction process, and performs the learning process and the extraction process until the number of new extracted data reaches the stop number.

本発明の学習装置は、教師候補データの内容の一部を変更して一又は複数の変更教師候補データを生成する増加処理部を備えており、教師候補データのみならず変更教師候補データを含めて教師候補データを抽出するための母集団としている。この増加処理部によるデータの増加は、学習処理と抽出処理を行う前に行っているため、学習処理と抽出処理を繰り返すことにより最終的には抽出データの数が所定の停止数となり、学習が終了する。従って、本発明の学習装置は、従来の学習装置と比べて、モデルに対して多くの母集団を用いて効率よく学習させることができる。 The learning device of the present invention is equipped with an augmentation processing unit that modifies part of the content of the teacher candidate data to generate one or more modified teacher candidate data, and uses not only the teacher candidate data but also the modified teacher candidate data as a population for extracting teacher candidate data. Since the data augmentation by this augmentation processing unit is performed before the learning process and the extraction process are performed, the number of extracted data eventually reaches a predetermined stopping number by repeating the learning process and the extraction process, and learning ends. Therefore, compared to conventional learning devices, the learning device of the present invention can efficiently train a model using a larger population.

本発明の学習装置において、前記制御部は、学習済みのモデルを用いて抽出処理を行う際に、前記母集団から前記抽出データを除いた新たな母集団を作成し、前記新たな母集団に対して前記抽出部によって前記抽出処理を行うようにしてもよい。当該構成によれば、学習処理と抽出処理を繰り返す度に母集団のデータ数が減少していくので、制御部における処理の負担を軽減することができる。 In the learning device of the present invention, when performing the extraction process using a trained model, the control unit may create a new population by removing the extracted data from the population, and perform the extraction process on the new population using the extraction unit. With this configuration, the amount of data in the population decreases each time the learning process and the extraction process are repeated, thereby reducing the processing load on the control unit.

本発明の学習装置は、前記抽出処理において、変更の基礎となった前記教師候補データが共通する前記変更教師候補データが複数あるときは、一度の前記抽出処理において所定の限度抽出数のみの前記教師候補データ又は前記変更教師候補データを抽出する限定処理を行うようにしてもよい。当該構成によれば、学習処理を行う際の教師候補データ又は変更教師候補データの数が抑えられるので、学習処理を迅速に行うことができる。 In the extraction process, when there are multiple pieces of modified teacher candidate data that share the teacher candidate data that was the basis for the change, the learning device of the present invention may perform a limiting process to extract only a predetermined limited number of teacher candidate data or modified teacher candidate data in one extraction process. With this configuration, the number of teacher candidate data or modified teacher candidate data when performing the learning process is reduced, so that the learning process can be performed quickly.

また、本発明の学習装置において、前記抽出データを表示させる表示部と、前記表示部に表示された前記抽出データについて前記ラベルの変更が可能なラベル変更部をさらに備え、前記抽出データを前記教師データに追加する前に、前記ラベル変更部により前記ラベルの変更を可能としてもよい。当該構成によれば、ユーザはラベル変更部と入力部により、抽出部におけるラベルの判定結果を確認でき、判定結果に誤りがあるときはラベル変更部により変更ができるので、抽出部における判定の精度を向上させることができる。 The learning device of the present invention may further include a display unit that displays the extracted data, and a label change unit that can change the label of the extracted data displayed on the display unit, and the label change unit may change the label before the extracted data is added to the teacher data. With this configuration, a user can check the label determination result in the extraction unit using the label change unit and the input unit, and if there is an error in the determination result, it can be changed by the label change unit, thereby improving the accuracy of the determination in the extraction unit.

また、本発明の学習装置において、前記制御部が、前記抽出データの数が前記停止数を超え、所定の基準データ数未満の場合、前記抽出範囲の幅を広げる拡張処理を行って前記抽出処理を行い、前記抽出データの数が前記基準データ数以上となったときに前記学習処理を行うようにしてもよい。 In addition, in the learning device of the present invention, the control unit may perform the extraction process by performing an extension process to widen the width of the extraction range when the number of extracted data exceeds the stop number and is less than a predetermined reference number of data, and perform the learning process when the number of extracted data becomes equal to or greater than the reference number of data.

抽出データが基準データ数より少ない状態で、ユーザがラベル変更部によりラベルの変更を行うと、ラベルの変更の頻度が多くなりユーザに負担が生じる。本発明では、抽出データの数が基準データ数以上となってから学習処理及び抽出処理を行うので、ユーザが選択部によりラベルの選択を行う頻度を減少させることができる。 When the number of extracted data is less than the reference number of data, if the user changes the labels using the label change unit, the frequency of label changes increases, placing a burden on the user. In the present invention, the learning process and extraction process are performed after the number of extracted data becomes equal to or greater than the reference number of data, so that the frequency with which the user selects labels using the selection unit can be reduced.

また、本発明の学習装置において、前記制御部が、前記拡張処理として前記抽出範囲の幅を広げると共に前記学習率を低下させるようにしてもよい。このように、抽出範囲を広げることで、抽出データの数を増加させることができる。また、学習率は、1回の学習でニューラルネットワーク内の重みやバイアスを更新する量の調整値である。本発明では、拡張処理において、抽出範囲を広げた際に学習率を低下させることで、抽出データの数の微調整が行われる。 In addition, in the learning device of the present invention, the control unit may widen the width of the extraction range and decrease the learning rate as the expansion process. In this way, by widening the extraction range, the number of extracted data can be increased. Furthermore, the learning rate is an adjustment value for the amount by which weights and biases in the neural network are updated in one learning session. In the present invention, in the expansion process, the learning rate is decreased when the extraction range is widened, thereby fine-tuning the number of extracted data.

また、本発明の学習装置は、対象データについて評価値を算出し、前記評価値と所定の閾値を比較して少なくとも第１ラベル又は第２ラベルの何れかのラベルに分類するように教師データを用いて学習するモデルに、学習を行わせる学習装置であって、少なくとも前記第１ラベル又は前記第２ラベルの何れかに分類された教師候補データを取得するデータ取得部と、前記教師候補データを母集団として、前記母集団について前記モデルを用いて評価値を算出し、前記評価値が前記閾値を含む所定の抽出範囲にあるデータを抽出する抽出処理を行う抽出部と、所定の学習率で前記モデルに学習処理を行わせる学習部と、前記抽出部及び前記学習部を制御する制御部を備え、前記制御部は、前記抽出部により抽出された抽出データの数が所定の停止数を超え、所定の基準データ数未満の場合、前記抽出範囲の幅を広げる拡張処理を行って前記抽出処理を行い、前記抽出データの数が前記基準データ数以上となったときに、前記抽出データを前記教師データに追加して前記学習処理を行うと共に、前記抽出処理を行い、新たな抽出データの数が前記停止数となるまで前記学習処理と前記抽出処理を実行することを特徴とする。 The learning device of the present invention is a learning device that performs learning on a model that uses teacher data to calculate an evaluation value for target data, compares the evaluation value with a predetermined threshold, and classifies the target data into at least one of the first label or the second label. The learning device includes a data acquisition unit that acquires teacher candidate data classified into at least one of the first label or the second label, an extraction unit that performs an extraction process that uses the teacher candidate data as a population, calculates an evaluation value for the population using the model, and extracts data whose evaluation value is in a predetermined extraction range including the threshold, a learning unit that causes the model to perform a learning process at a predetermined learning rate, and a control unit that controls the extraction unit and the learning unit. When the number of extracted data extracted by the extraction unit exceeds a predetermined stop number and is less than a predetermined reference data number, the control unit performs an extension process that widens the width of the extraction range and performs the extraction process, and when the number of extracted data becomes equal to or greater than the reference data number, the control unit adds the extracted data to the teacher data and performs the learning process, and performs the extraction process, and performs the learning process and the extraction process until the number of new extracted data becomes the stop number.

上記構成の学習装置によれば、抽出手段による抽出数が所定の基準データ数より少ない場合は、抽出数が多くなるように抽出範囲の幅を広げるため、抽出数が多くなるので、効率よく学習処理を行うことができる。 According to the learning device configured as above, if the number of data extracted by the extraction means is less than a predetermined reference number, the width of the extraction range is widened so that the number of data extracted increases, and the number of data extracted increases, allowing the learning process to be performed efficiently.

本発明の学習方法は、対象データについて評価値を算出し、前記評価値と所定の閾値を比較して少なくとも第１ラベル又は第２ラベルの何れかのラベルに分類するように教師データを用いて学習するモデルに、学習を行わせる学習方法であって、少なくとも前記第１ラベル又は前記第２ラベルの何れかに分類された教師候補データを取得するデータ取得工程と、前記教師候補データの内容の一部を変更して一又は複数の変更教師候補データを生成する増加処理工程と、前記教師候補データ及び前記変更教師候補データを母集団として、前記母集団について前記モデルを用いて評価値を算出し、前記評価値が前記閾値を含む所定の抽出範囲にあるデータを抽出する抽出工程と、所定の学習率で前記モデルに学習を行わせる学習工程とを含み、前記抽出工程により抽出された抽出データの数が所定の停止数を超えるときは、前記抽出データを前記教師データに追加して前記学習工程を実行すると共に、前記母集団から前記抽出データを除いた新たな母集団を作成し、前記新たな母集団に対して前記抽出工程を実行し、新たな抽出データの数が前記停止数となるまで前記学習工程と前記抽出工程を実行することを特徴とする。 The learning method of the present invention is a learning method for causing a model that uses teacher data to learn so as to calculate an evaluation value for target data, compare the evaluation value with a predetermined threshold value, and classify the target data into at least one of the first label or the second label, and includes a data acquisition step of acquiring teacher candidate data classified into at least one of the first label or the second label, an increase processing step of generating one or more modified teacher candidate data by modifying a part of the content of the teacher candidate data, an extraction step of calculating an evaluation value for the teacher candidate data and the modified teacher candidate data as a population using the model, and extracting data whose evaluation value is within a predetermined extraction range including the threshold value, and a learning step of causing the model to learn at a predetermined learning rate, and when the number of extracted data extracted by the extraction step exceeds a predetermined stopping number, the extracted data is added to the teacher data and the learning step is executed, and a new population is created by removing the extracted data from the population, and the extraction step is executed for the new population, and the learning step and the extraction step are executed until the number of new extracted data reaches the stopping number.

本発明の学習プログラムは、コンピュータを上記各学習装置として機能させるためのプログラムである。 The learning program of the present invention is a program for causing a computer to function as each of the above learning devices.

本発明の実施形態の一例である学習装置の機能的構成を示す説明図。FIG. 1 is an explanatory diagram showing a functional configuration of a learning device according to an embodiment of the present invention. 本実施形態の学習装置における学習方法を示すフローチャート。4 is a flowchart showing a learning method in the learning device of the present embodiment. 本実施形態の学習装置の抽出部による抽出処理のイメージを示す説明図。FIG. 4 is an explanatory diagram showing an image of an extraction process performed by an extraction unit of the learning device of this embodiment. 本実施形態の学習装置の表示部によって表示された表示画面を示す説明図であり、（Ａ）は良品の場合、（Ｂ）は不良品の場合を示す。1A and 1B are explanatory diagrams showing a display screen displayed by the display unit of the learning device of this embodiment, in which (A) shows the case of a non-defective product and (B) shows the case of a defective product. 本実施形態の学習装置の増加処理部による増加処理を示す説明図であり、（Ａ）はフリップ処理、（Ｂ）はシフト処理、（Ｃ）は微小回転処理、（Ｄ）はフィルタ処理を示す。13A to 13D are explanatory diagrams showing the increase processing by the increase processing unit of the learning device of this embodiment, where (A) shows flip processing, (B) shows shift processing, (C) shows minute rotation processing, and (D) shows filter processing. 本実施形態の学習装置の学習時間とデータ数との関係を示すグラフ。11 is a graph showing the relationship between the learning time and the number of pieces of data in the learning device of the present embodiment.

次に、図１～図６を参照して、本発明の実施形態である学習装置、学習方法及び学習プログラムについて説明する。図１は、本実施形態の学習装置１の機能的構成を示す説明図である。 Next, a learning device, a learning method, and a learning program according to an embodiment of the present invention will be described with reference to Figures 1 to 6. Figure 1 is an explanatory diagram showing the functional configuration of the learning device 1 according to this embodiment.

本実施形態の学習装置１は、対象データについて評価値を算出して、少なくとも第１ラベルか第２ラベルかを判定して分類するモデルＭについて、モデルＭが適切な判定を行うことができるように学習をさせる装置である。ここで、モデルとは、コンピュータが判別可能な何らかの入力値を受け取り、何らかの評価・判定をして出力値を出す仕組みをいう。 The learning device 1 of this embodiment is a device that performs learning on a model M that calculates an evaluation value for target data and classifies the data by determining at least whether it is a first label or a second label, so that the model M can make appropriate judgments. Here, a model refers to a mechanism that receives some input value that a computer can distinguish, performs some evaluation or judgment, and outputs an output value.

モデルＭは、ニューラルネットワークとパラメータとを含む構造を有する。ニューラルネットワークは、複数のニューロンを結合させた構造を有する。一例として、ニューラルネットワークは、複数のニューロンがグループ化された層を連ねた階層型の多層ニューラルネットワークとすることができる。 The model M has a structure including a neural network and parameters. The neural network has a structure in which multiple neurons are connected. As an example, the neural network can be a hierarchical multi-layer neural network in which layers are connected in which multiple neurons are grouped.

ニューラルネットワークは、ニューロンの個数及び結合関係で定義される。ニューロン間又は層間の結合強度は、パラメータ（重み係数など）を用いて定義される。ニューラルネットワークでは、対象データが入力され、複数のニューロンの演算結果及びパラメータに基づいて、対象データの評価及びラベルの付与が行われる。 A neural network is defined by the number of neurons and their connection relationships. The strength of connections between neurons or layers is defined using parameters (such as weighting coefficients). Target data is input into a neural network, and the target data is evaluated and labeled based on the calculation results and parameters of multiple neurons.

モデルＭは、対象データの内容を認識し、少なくとも第１ラベルか第２ラベルかの判定を行う。例えば、対象データが画像データである場合、ラベルとしては、被写体の種類（人物、乗り物、動物等）、又は被写体の品質（良品、不良品等）とすることができる。このラベルは、対象データに紐付けて記憶される。なお、モデルＭの構成は、特許文献１と同様であるので、詳細な説明は省略する。 Model M recognizes the contents of the target data and determines whether it is at least the first label or the second label. For example, if the target data is image data, the label can be the type of subject (person, vehicle, animal, etc.) or the quality of the subject (good, defective, etc.). This label is linked to the target data and stored. Note that the configuration of model M is the same as that of Patent Document 1, so a detailed description will be omitted.

次に、図１を参照して、本実施形態の学習装置１の機能的構成について説明する。本実施形態の学習装置１は、機能的構成として、データ取得部２、増加処理部３、制御部４、抽出部５、学習部６、表示部７及びラベル変更部８を備えている。 Next, the functional configuration of the learning device 1 of this embodiment will be described with reference to FIG. 1. The learning device 1 of this embodiment includes, as its functional configuration, a data acquisition unit 2, an increase processing unit 3, a control unit 4, an extraction unit 5, a learning unit 6, a display unit 7, and a label change unit 8.

本実施形態の学習装置１で取り扱う対象データは、第１ラベル又は第２ラベルの何れかが付与された教師データ１１と、教師データ１１の候補となる教師候補データ１２と、増加処理部３によって増加処理がなされた変更教師候補データ１３である。教師候補データ１２及び変更教師候補データ１３についても、教師データ１１と同様に、第１ラベル又は第２ラベルの何れかが付与されている。 The target data handled by the learning device 1 of this embodiment are teacher data 11 to which either a first label or a second label has been assigned, teacher candidate data 12 which is a candidate for the teacher data 11, and modified teacher candidate data 13 which has been augmented by the augmentation processing unit 3. Like the teacher data 11, the teacher candidate data 12 and the modified teacher candidate data 13 are also assigned either a first label or a second label.

ここで、教師データとは、モデルＭに与えられる例題と解答を示すデータをいう。また、教師候補データとは、モデルＭに与える新たな教師データの候補となりうるデータをいう。 Here, teacher data refers to data that shows example questions and answers to be given to model M. Also, candidate teacher data refers to data that can be candidates for new teacher data to be given to model M.

データ取得部２は、学習装置１によってモデルＭの学習を行うために、教師候補データ１２を取得するデータ取得工程を行う機能部である。 The data acquisition unit 2 is a functional unit that performs a data acquisition process to acquire teacher candidate data 12 in order to learn the model M using the learning device 1.

増加処理部３は、教師候補データ１２の内容の一部を変更して一又は複数の変更教師候補データ１３を生成する機能部である。増加処理としては、例えば、教師候補データ１２が画像データの場合、以下のような処理を挙げることができる。 The increase processing unit 3 is a functional unit that changes part of the content of the teacher candidate data 12 to generate one or more modified teacher candidate data 13. For example, when the teacher candidate data 12 is image data, the increase processing can be the following processing.

例えば、元画像データを左右反転したデータ、上下反転したデータ、及び１８０°回転したデータを作成するフリップ処理、元画像データをＸ－Ｙ方向に少しずつ座標をずらしたデータを作成するシフト処理、元画像データを右方向及び左方向に微小角度回転させたデータを作成する微小回転処理、元画像データに中央値フィルタやガウシアンフィルタ等のフィルタをかけるフィルタ処理等である。 For example, there is flip processing, which creates data that is mirror-inverted, mirror-inverted, or rotated 180 degrees from the original image data; shift processing, which creates data in which the coordinates of the original image data are shifted slightly in the X-Y direction; micro-rotation processing, which creates data in which the original image data is rotated a small angle to the right or left; and filter processing, which applies a filter such as a median filter or Gaussian filter to the original image data.

制御部４は、抽出部５と学習部６を制御してモデルＭに効果的な学習処理を行わせる機能部である。抽出部５は、教師候補データ１２及び変更教師候補データ１３を母集団１４として、この母集団１４についてモデルＭを用いて評価値を算出し、評価値が第１ラベルと第２ラベルとの閾値を含む所定の抽出範囲にあるデータを抽出する抽出処理を行う機能部である。学習部６は、所定の学習率でモデルＭに学習処理を行わせる機能部である。学習処理とは、モデルＭのパラメータを最適値に近づけるように調整する処理である。 The control unit 4 is a functional unit that controls the extraction unit 5 and the learning unit 6 to make the model M perform an effective learning process. The extraction unit 5 is a functional unit that performs an extraction process in which, using the teacher candidate data 12 and the modified teacher candidate data 13 as a population 14, an evaluation value is calculated for this population 14 using the model M, and data whose evaluation value is within a predetermined extraction range that includes the thresholds of the first label and the second label is extracted. The learning unit 6 is a functional unit that causes the model M to perform a learning process at a predetermined learning rate. The learning process is a process of adjusting the parameters of the model M so that they approach optimal values.

表示部７は、対象データの判定を行うユーザが、抽出部５によって抽出された抽出データ１５を確認できるように、ディスプレイ等の表示機器にデータを表示させる機能部である。ラベル変更部８は、抽出データ１５に付与された第１ラベル又は第２ラベルのラベルについて、ユーザが確認してラベルを変更する必要がある場合に、抽出データ１５に付されたラベルを変更することができる機能部である。 The display unit 7 is a functional unit that displays data on a display device such as a display so that a user who determines the target data can check the extracted data 15 extracted by the extraction unit 5. The label change unit 8 is a functional unit that can change the label attached to the extracted data 15 when the user checks and needs to change the first label or second label attached to the extracted data 15.

本実施形態の学習装置１は、主要なハードウェアとしてコンピュータ（図示省略）を備えている。コンピュータは、ＣＰＵ、ＧＰＵ等のプロセッサ、ＲＡＭ、ＲＯＭ、ハードディスク又はＳＳＤ（ソリッドステートドライブ）等の記憶装置、インターネット等のネットワークへの接続を行う通信部等を備えている。また、コンピュータの記憶装置には、コンピュータを本実施形態の学習装置１として作動させるための学習プログラムが記憶されている。なお、コンピュータには、クラウドコンピューティングシステムが含まれる。前記各機能部は、ハードウェアとしてコンピュータと、ソフトウェアである学習プログラムによって実現される。 The learning device 1 of this embodiment includes a computer (not shown) as its main hardware. The computer includes a processor such as a CPU or GPU, a storage device such as a RAM, a ROM, a hard disk or an SSD (solid state drive), a communication unit for connecting to a network such as the Internet, and the like. The storage device of the computer also stores a learning program for operating the computer as the learning device 1 of this embodiment. The computer includes a cloud computing system. Each of the functional units is realized by the computer as hardware and the learning program as software.

次に、図２を参照して、本実施形態の学習装置１の作動である学習方法について、対象データが画像である場合を例にして説明する。まず、データ取得部２が教師候補データ１２を取得する取得工程を行う（ＳＴＥＰ１）。この教師候補データ１２は、予め、第１ラベル又は第２ラベルの何れかのラベルが付されている。教師候補データ１２の数は、モデルＭの学習に必要な数を予め準備しておく。 Next, referring to FIG. 2, a learning method which is the operation of the learning device 1 of this embodiment will be described using an example in which the target data is an image. First, an acquisition step is performed in which the data acquisition unit 2 acquires teacher candidate data 12 (STEP 1). This teacher candidate data 12 is previously labeled with either a first label or a second label. The number of teacher candidate data 12 required for learning the model M is prepared in advance.

次に、増加処理部３が、教師候補データ１２に対して増加処理を行って変更教師候補データ１３を生成する増加処理工程を行う（ＳＴＥＰ２）。増加処理としては、前述のフリップ処理、シフト処理、微小回転処理及びフィルタ処理の何れか、又はこれらの処理の組み合わせることにより行う。いずれの処理を行うかは、対象データの性質、モデルＭの学習の目的、或いはモデルＭによる判定の内容によって適宜選択することができる。 Next, the augmentation processing unit 3 performs an augmentation process step (STEP 2) in which the teacher candidate data 12 is augmented to generate modified teacher candidate data 13. The augmentation process is performed by any one of the above-mentioned flip process, shift process, small rotation process, and filter process, or a combination of these processes. Which process to perform can be appropriately selected depending on the properties of the target data, the purpose of learning by the model M, or the contents of the judgment by the model M.

次に、制御部４が、初回の処理か否かの確認を行う（ＳＴＥＰ３）。初回の処理の場合は（ＳＴＥＰ３でＹＥＳ）、学習済みで初期状態のモデルＭについて、学習率を初期値に設定する（ＳＴＥＰ５）。この学習率の初期値は、例えば０．００１に設定することができる。また、抽出部５における抽出範囲を初期値に設定する（ＳＴＥＰ５）。この抽出範囲の初期値は、例えば３に設定することができる。なお、抽出範囲の内容については後述する。 Next, the control unit 4 checks whether or not this is the first processing (STEP 3). If this is the first processing (YES in STEP 3), the learning rate is set to an initial value for the model M that has been trained and is in an initial state (STEP 5). The initial value of this learning rate can be set to 0.001, for example. In addition, the extraction range in the extraction unit 5 is set to an initial value (STEP 5). The initial value of this extraction range can be set to 3, for example. The contents of the extraction range will be described later.

次に、抽出部５により抽出処理（抽出工程）を行う（ＳＴＥＰ６）。抽出処理においては、教師候補データ１２と変更教師候補データ１３を母集団１４として、この母集団１４のデータをモデルＭに入力し、モデルＭにおいて評価値を算出し、この評価値が抽出範囲内であれば抽出データ１５として抽出を行い、評価値が抽出範囲外であれば抽出を行わない。 Next, the extraction unit 5 performs an extraction process (extraction step) (STEP 6). In the extraction process, the teacher candidate data 12 and the modified teacher candidate data 13 are treated as a population 14, the data of this population 14 is input to the model M, an evaluation value is calculated in the model M, and if this evaluation value is within the extraction range, it is extracted as extracted data 15, and if the evaluation value is outside the extraction range, extraction is not performed.

図３は、本実施形態の学習装置１における抽出処理をイメージ的に表現したものである。この抽出部５における抽出処理では、母集団１４の各データについて、モデルＭを用いて評価値を算出する。図３においては、評価値は図の右側に行くほど高く、左側に行くほど低くなる。本実施形態においては、評価が高い方が第１ラベルとなり、評価が低い方が第２ラベルとなる。 Figure 3 is an image of the extraction process in the learning device 1 of this embodiment. In the extraction process in the extraction unit 5, an evaluation value is calculated for each data of the population 14 using the model M. In Figure 3, the evaluation value is higher toward the right side of the figure and lower toward the left side. In this embodiment, the higher the evaluation, the first label, and the lower the evaluation, the second label.

図３においては、第１ラベルと第２ラベルとの境界線Ｂが両ラベルを分ける閾値であり、この閾値を含む所定の範囲を抽出範囲としている。ここで、図３に示すように、評価値が閾値に近い領域にあるものは、第１ラベルであっても第２ラベルに近いものとなり、良品と不良品の差が少ないものとなる。本実施形態においては、図３における抽出範囲の値は３に設定されているが、１～２０の値で設定することができる。この値は、抽出の条件や学習の対象に応じて適宜変更が可能である。 In FIG. 3, the boundary line B between the first label and the second label is the threshold value that separates the two labels, and a predetermined range including this threshold value is the extraction range. Here, as shown in FIG. 3, if an evaluation value is in a region close to the threshold value, it will be close to the second label even if it is the first label, and there will be little difference between good and bad products. In this embodiment, the value of the extraction range in FIG. 3 is set to 3, but it can be set to a value between 1 and 20. This value can be changed as appropriate depending on the extraction conditions and the target of learning.

このように、第１ラベルと第２ラベルの所定の領域にあるデータを用いてモデルＭの学習を行えば、第１ラベルか第２ラベルかについて見分けが付きにくいデータで学習を行うことができる。このようなデータで学習を行ったモデルＭは、些細な差異についての判断を正確に行うことができるようになる。即ち、このようなデータは、効率よくモデルＭの学習を行うことができる良質なデータとなる。 In this way, by training model M using data in a specified area of the first label and the second label, it is possible to train with data that is difficult to distinguish as being the first label or the second label. Model M trained with such data will be able to accurately judge minor differences. In other words, such data is good quality data that can be used to train model M efficiently.

抽出部５における抽出処理の内容は、特許文献１に記載された処理と同様である。具体的には、教師候補データ１２と変更教師候補データ１３をモデルＭに入力し、モデルＭと教師データ１１に基づいて、予め定められた次元の特徴空間で表現される特徴量（ベクトル）をデータごとに算出する。また、特徴空間における教師データ１１と教師候補データ１２及び変更教師候補データ１３との距離をそれぞれ算出する。この距離が抽出範囲内にあればそのデータは抽出され、抽出範囲外であれば当該データは抽出しないという処理を行う。なお、特徴空間における距離の算出の手法については、特許文献１と同様であるので、詳細な説明は省略する。 The content of the extraction process in the extraction unit 5 is the same as the process described in Patent Document 1. Specifically, the teacher candidate data 12 and the modified teacher candidate data 13 are input to the model M, and a feature quantity (vector) expressed in a feature space of a predetermined dimension is calculated for each data based on the model M and the teacher data 11. In addition, the distance between the teacher data 11 and the teacher candidate data 12 and the modified teacher candidate data 13 in the feature space is calculated. If this distance is within the extraction range, the data is extracted, and if it is outside the extraction range, the data is not extracted. Note that the method for calculating the distance in the feature space is the same as in Patent Document 1, so a detailed explanation will be omitted.

本実施形態の学習装置１においては、抽出部５による抽出処理を行う際に、母集団１４において、変更教師候補データ１３の変更の基礎となった教師候補データ１２が共通するデータが複数あるときは、一度の抽出処理において所定の限度抽出数のデータのみを抽出する限定処理を行っている（ＳＴＥＰ６）。 In the learning device 1 of this embodiment, when the extraction unit 5 performs the extraction process, if there are multiple pieces of data in the population 14 that share the same teacher candidate data 12 that was the basis for changing the changed teacher candidate data 13, a limiting process is performed to extract only a predetermined limited number of pieces of data in one extraction process (STEP 6).

この限定処理を行うことにより、後述するＳＴＥＰ５において新たな教師データ１１によりモデルＭを学習させる際に、急激に教師データ１１が増加することがないので、学習処理を迅速に行うことができる。本実施形態では、限度抽出数を１としている。このため、本実施形態では、同じ教師候補データ１２から複数の変更教師候補データ１３が生成されている場合であっても、１回の抽出処理において１個の教師候補データ１２又は変更教師候補データ１３のみが抽出される。この限度抽出数は、モデルＭの状態やハードウェア等の状態により適宜変更することができる。 By performing this limiting process, when learning the model M with new teacher data 11 in STEP 5 described below, the teacher data 11 does not increase suddenly, so that the learning process can be performed quickly. In this embodiment, the limit number of extractions is set to 1. Therefore, in this embodiment, even if multiple pieces of modified teacher candidate data 13 are generated from the same teacher candidate data 12, only one piece of teacher candidate data 12 or modified teacher candidate data 13 is extracted in one extraction process. This limit number of extractions can be changed as appropriate depending on the state of the model M, the state of the hardware, etc.

次に、制御部４が、抽出部５により抽出された抽出データ１５の数を確認する（ＳＴＥＰ７）。具体的には、（１）として、抽出データ１５の数ｘが、所定の停止数を超えているか、及び所定の基準データ数以上であるか、或いは抽出範囲が拡張範囲であるか否かの確認を行う（図２のＳＴＥＰ７においては、「基準データ数」を「基準数」と表記している）。又は、ＳＴＥＰ７では、（２）として、抽出データ１５が停止数を超えているが、基準データ数未満であるか否かの確認を行う。 Next, the control unit 4 checks the number of extracted data 15 extracted by the extraction unit 5 (STEP 7). Specifically, (1) it is checked whether the number x of extracted data 15 exceeds a predetermined stop number and is equal to or greater than a predetermined reference data number, or whether the extraction range is an extended range (in STEP 7 of FIG. 2, the "reference data number" is written as the "reference number"). Or, in STEP 7, (2) it is checked whether the extracted data 15 exceeds the stop number but is less than the reference data number.

ここで、停止数は、学習装置１による学習を停止させるか否かの基準となる数字であり、例えば０とすることができる。この場合、抽出部５により抽出されるデータが０になった場合に学習を停止させることになる。 The stop count is a number that is a criterion for whether or not to stop learning by the learning device 1, and can be set to 0, for example. In this case, learning will be stopped when the data extracted by the extraction unit 5 becomes 0.

また、基準データ数は、後述するユーザの確認作業（ＳＴＥＰ８～９）の頻度を低減させるための数字であり、例えば第１ラベルが２０、第２ラベルが２０とすることができる。この基準データ数は、データの種類や学習環境等の要因により適宜変更することが可能である。 The standard number of pieces of data is a number for reducing the frequency of the user's confirmation work (STEPs 8-9) described below, and can be, for example, 20 for the first label and 20 for the second label. This standard number of pieces of data can be changed as appropriate depending on factors such as the type of data and the learning environment.

この場合、抽出部５により抽出される抽出データ１５の数が、第１ラベルが２０未満、又は第２ラベルが２０未満の場合、ユーザの確認作業を行わずに、再度抽出処理を行って、抽出データ１５の数が４０以上となった場合にユーザの確認作業を行う。当該処理により、ユーザの確認作業の頻度が減少するので、ユーザの負担軽減を行うことができる。 In this case, if the number of extracted data 15 extracted by the extraction unit 5 is less than 20 for the first label or less than 20 for the second label, the extraction process is performed again without the user's confirmation, and if the number of extracted data 15 becomes 40 or more, the user's confirmation is performed. This process reduces the frequency of the user's confirmation work, thereby reducing the burden on the user.

抽出データ１５の数が、停止数を超えると共に、基準データ数以上である場合は（ＳＴＥＰ７で１）、表示部７に抽出データ１５を表示させる（ＳＴＥＰ８）。図４は、表示部７に抽出データ１５が表示された状態である。表示画面１６には、ラベル変更部８によって抽出データ１５に付されたラベル１７が適切か否かを確認し、ラベル１７の変更を行うための変更表示１８が表示される。 If the number of extracted data 15 exceeds the number of stops and is equal to or greater than the reference number of data (1 in STEP 7), the extracted data 15 is displayed on the display unit 7 (STEP 8). Figure 4 shows the state in which the extracted data 15 is displayed on the display unit 7. The display screen 16 displays a change display 18 for checking whether the label 17 attached to the extracted data 15 by the label change unit 8 is appropriate and for changing the label 17.

図４（Ａ）において、表示画面１６の上方には、「これは本当に良品ですか？」との記載が表示され、表示画面１６の中央には抽出された抽出データ１５が表示され、抽出データ１５の左側にはラベル１７が「良品」として表示され、表示画面１６の下方には変更表示１８として「はい」「いいえ」「わからない」の３個のボタンが表示される。ユーザがこの変更表示１８のボタンをタップする等の操作を行うことにより、抽出データに付されたラベル１７（この場合は「良品」）が適切であるか否かを確認し、適切でない場合はラベル１７を変更することが可能となる。 In FIG. 4(A), the words "Is this really a good product?" are displayed at the top of the display screen 16, the extracted data 15 is displayed in the center of the display screen 16, a label 17 is displayed as "good product" to the left of the extracted data 15, and three buttons "Yes," "No," and "Don't know" are displayed at the bottom of the display screen 16 as change indications 18. By performing an operation such as tapping the button of this change indication 18, the user can check whether the label 17 (in this case, "good product") attached to the extracted data is appropriate, and if it is not appropriate, the label 17 can be changed.

図４（Ｂ）の場合は、ラベル１７が「不良品」であり、抽出データ１５には不良品と判定された画像が表示される。なお、変更表示１８においてユーザが「わからない」を選択した際は、ユーザによる判定を保留して、事後的に判定を行うことができるようになっている。 In the case of FIG. 4(B), the label 17 is "Defective" and the extracted data 15 displays an image determined to be defective. Note that when the user selects "Don't know" in the change display 18, the user's judgment is withheld and the judgment can be made later.

ユーザによる抽出データ１５の確認が行われた後、制御部４は、当該抽出データ１５を教師データ１１として格納し、新たな教師データ１１が次回のラベル１７の学習に用いられる。また、制御部４は、母集団１４から抽出データ１５を取り除き、新たな母集団１４を作成する（ＳＴＥＰ１０）。 After the user has confirmed the extracted data 15, the control unit 4 stores the extracted data 15 as training data 11, and the new training data 11 is used for the next learning of the label 17. The control unit 4 also removes the extracted data 15 from the population 14, and creates a new population 14 (STEP 10).

次に、再度初回の処理か否かが確認されるが（ＳＴＥＰ３）、この場合は既に初回の処理が終了して２回目以降の処理となるため（ＳＴＥＰ３でＮＯ）、制御部４は、新たな教師データ１１を加えたモデルＭを用いて、学習処理（学習工程）を行う（ＳＴＥＰ４）。新たに加えられた教師データ１１は、抽出部５によって抽出された学習効果の高い抽出データ１５であるので、モデルＭによる評価の精度が向上する。その後、学習率を初期値に設定すると共に、抽出部５における抽出範囲を初期値に設定する（ＳＴＥＰ５）。 Next, it is checked again whether this is the first processing or not (STEP 3), but in this case, the first processing has already ended and it is the second or subsequent processing (NO in STEP 3), so the control unit 4 performs a learning process (learning step) using the model M to which new teacher data 11 has been added (STEP 4). The newly added teacher data 11 is extracted data 15 with a high learning effect extracted by the extraction unit 5, so the accuracy of the evaluation by the model M is improved. After that, the learning rate is set to the initial value, and the extraction range in the extraction unit 5 is also set to the initial value (STEP 5).

制御部４は、以上の抽出処理と学習処理を抽出データ１５の数が停止数である０になるまで繰り返す。抽出データ１５が停止数である０になったときは（ＳＴＥＰ７で３）、処理が終了となる。 The control unit 4 repeats the above extraction process and learning process until the number of extracted data 15 becomes 0, which is the number of stops. When the number of extracted data 15 becomes 0, which is the number of stops (STEP 7: 3), the process ends.

一方で、抽出データ１５が停止数を超えているが、基準データ数未満である場合は（ＳＴＥＰ７で２）、抽出範囲の幅を広げて拡張範囲（例えば３．７）とすると共に、学習率を低下（例えば０．０００２）させる拡張処理を行う（ＳＴＥＰ１１）。この状態で抽出処理を行うと（ＳＴＥＰ６）、抽出範囲の幅が広がっており、抽出部５によって抽出されるデータ数が増加する。また、この抽出処理により、学習率の微調整が行われる。 On the other hand, if the extracted data 15 exceeds the stop count but is less than the reference data count (2 in STEP 7), an extension process is performed to widen the extraction range to an extended range (e.g., 3.7) and lower the learning rate (e.g., 0.0002) (STEP 11). When the extraction process is performed in this state (STEP 6), the width of the extraction range is widened and the number of data extracted by the extraction unit 5 increases. This extraction process also fine-tunes the learning rate.

次に、制御部４は、抽出データ数を確認するが（ＳＴＥＰ７）、この場合はＳＴＥＰ１１によって抽出範囲が拡張範囲となっているため（ＳＴＥＰ７で１）、抽出処理によって抽出された抽出データの表示及び確認を行う（ＳＴＥＰ８～９）。抽出範囲が拡張範囲である場合は、ＳＴＥＰ１１の拡張処理がなされている状態であり、仮に抽出データ１５の数が基準データ数未満であっても、当該基準データ数に近い数のデータが存在することが予想されるためである。 Next, the control unit 4 checks the number of extracted data (STEP 7), but in this case, because the extraction range has been set to an extended range in STEP 11 (1 in STEP 7), the extracted data extracted by the extraction process is displayed and confirmed (STEPs 8-9). If the extraction range is an extended range, the extension process of STEP 11 has been performed, and even if the number of extracted data 15 is less than the reference number of data, it is expected that there will be a number of data close to the reference number of data.

次に、図５を参照して、増加処理部３によって行われる増加処理の具体例について説明する。図５（Ａ）は、フリップ処理を行った状態を示す説明図であり、オリジナル画像である教師候補データ１２から、フリップ処理により、上下反転した画像データ、左右反転した画像データ、及び１８０°回転させた画像データが生成される。このフリップ処理を行うことで、教師候補データ１２に加えて、３個の変更教師候補データ１３を得ることができる。 Next, referring to Fig. 5, a specific example of the increase processing performed by the increase processing unit 3 will be described. Fig. 5 (A) is an explanatory diagram showing the state after flip processing, in which vertically inverted image data, horizontally inverted image data, and image data rotated 180° are generated from the original image, which is the teacher candidate data 12, by the flip processing. By performing this flip processing, in addition to the teacher candidate data 12, three modified teacher candidate data 13 can be obtained.

図５（Ｂ）は、シフト処理を行った状態を示す説明図であり、元画像データである教師候補データ１２と、フリップ処理で生成された３個の変更教師候補データ１３の合計４個のデータについて、Ｘ－Ｙ方向にそれぞれ（－１，－１）、（－１，１）・・・という形で画像を微小移動させる。このシフト処理により、３２個の画像データを得ることができる。このシフトの単位は、画像であれば画素（ピクセル）としてもよく、ｍｍ、或いはμｍとしてもよい。また、シフトの範囲は学習の目的等によって適宜変更することができる。 Figure 5 (B) is an explanatory diagram showing the state after shift processing, in which the original image data, teacher candidate data 12, and the three modified teacher candidate data 13 generated by the flip processing, a total of four pieces of data, are shifted slightly in the X-Y direction in the form of (-1, -1), (-1, 1), etc. This shift processing makes it possible to obtain 32 pieces of image data. The unit of this shift may be pixels for images, or mm or μm. The range of the shift may be changed as appropriate depending on the purpose of learning, etc.

図５（Ｃ）は、微小回転処理を行った状態を示す説明図であり、回転無しの画像データに対して、左に１°回転した画像データと、右に１°回転した画像データを生成する。この微小回転の角度は、学習の目的等によって適宜変更することができる。 Figure 5 (C) is an explanatory diagram showing the state after micro-rotation processing, in which image data rotated 1° to the left and image data rotated 1° to the right are generated in comparison with unrotated image data. The angle of this micro-rotation can be changed as appropriate depending on the purpose of learning, etc.

図５（Ｄ）は、フィルタ処理を行った状態を示す説明図であり、フィルタ無しの画像データに対して、中央値フィルタを施したデータと、ガウシアンフィルタを施したデータを生成する。中央値フィルタは、メディアンフィルタとも呼ばれており、画像のノイズを除去する際に用いられるフィルタである。ガウシアンフィルタは、平滑化フィルタの一種であり、ガウス分布に従って画像をぼかしてなめらかにするフィルタである。なお、フィルタ処理として、公知の他のフィルタを用いてもよい。 Figure 5 (D) is an explanatory diagram showing the state after filter processing, in which data that has been subjected to a median filter and data that has been subjected to a Gaussian filter are generated from unfiltered image data. A median filter is also called a median filter, and is a filter used to remove noise from an image. A Gaussian filter is a type of smoothing filter that blurs and smoothes an image according to a Gaussian distribution. Note that other well-known filters may also be used for filter processing.

増加処理部３において行われる増加処理は、図５（Ａ）のフリップ処理のみであれば、データ数は４倍となる。さらに、図５（Ｂ）のシフト処理を行うと３２倍のデータとなり、シフト処理をしない４個のデータを加えると３６倍のデータとなる。さらに、図５（Ｃ）の微小回転処理を行えば、シフト処理を行った３２倍のデータがさらに３倍されて９６個のデータとなり、最初の４個のデータを加えると１００倍のデータとなる。さらに、図５（Ｄ）のフィルタ処理を行えば、１００倍のデータが３倍の３００倍となる。 If the increase processing performed by the increase processing unit 3 is only the flip processing of FIG. 5(A), the amount of data will be four times as much. Furthermore, if the shift processing of FIG. 5(B) is performed, the amount of data will be 32 times as much, and if the four data pieces that are not shifted are added, the amount of data will be 36 times as much. Furthermore, if the micro-rotation processing of FIG. 5(C) is performed, the 32 times the amount of data that has been shifted is further multiplied by three to become 96 pieces of data, and if the first four pieces of data are added, the amount of data will be 100 times as much. Furthermore, if the filter processing of FIG. 5(D) is performed, the 100 times the amount of data will be tripled to become 300 times as much.

このように、本実施形態の学習装置１では、増加処理によって教師候補データ１２に近似する変更教師候補データ１３を多数生成することができるので、抽出処理によって学習効果の高いデータを多数抽出することができる。 In this way, in the learning device 1 of this embodiment, a large amount of modified teacher candidate data 13 that is similar to the teacher candidate data 12 can be generated by the augmentation process, and a large amount of data with high learning effectiveness can be extracted by the extraction process.

次に、本実施形態の学習装置１の作用効果について、図６を参照して説明する。図６は、対象データ数と学習時間との関係を示すグラフである。グラフにおいて実線で示すデータは本実施形態の学習装置１であり、点線で示すデータは従来の学習装置（比較例）を示している。図６のグラフに示すように、本実施形態の学習装置１は、対象データ数が３００倍である場合も、学習を終了させることができた。一方で、比較例においては、対象データ数の増加倍数が８倍を超えると、学習を終了させることができなかった。 Next, the effect of the learning device 1 of this embodiment will be described with reference to FIG. 6. FIG. 6 is a graph showing the relationship between the number of target data and learning time. In the graph, the data shown by the solid line is the learning device 1 of this embodiment, and the data shown by the dotted line is a conventional learning device (comparative example). As shown in the graph in FIG. 6, the learning device 1 of this embodiment was able to complete learning even when the number of target data was 300 times. On the other hand, in the comparative example, learning could not be completed when the increase in the number of target data exceeded 8 times.

以上の通り、本実施形態の学習装置１は、従来の学習装置に比べて対象データを増加させることにより、学習の質を向上させることができると共に、対象データが増加された場合であっても学習を終了させることができる。 As described above, the learning device 1 of this embodiment can improve the quality of learning by increasing the amount of target data compared to conventional learning devices, and can also terminate learning even when the amount of target data is increased.

なお、上記実施形態においては、抽出データ１５の数が、停止数を超えると共に、基準データ数以上である場合は（ＳＴＥＰ７で１）、抽出データ１５を表示部７に表示させてユーザによる確認作業を行っているが、表示部７にデータを表示させずに、抽出データ１５を教師データ１１に加えるようにしてもよい。例えば、増加処理において、すでに教師候補データ１２をもとに作成された変更教師候補データ１３が表示されて確認作業を受けていれば、表示部７での表示を省略することができる。 In the above embodiment, when the number of extracted data 15 exceeds the stop number and is equal to or greater than the reference number of data (1 in STEP 7), the extracted data 15 is displayed on the display unit 7 and the user confirms it, but the extracted data 15 may be added to the teacher data 11 without displaying the data on the display unit 7. For example, in the increase process, if the modified teacher candidate data 13 created based on the teacher candidate data 12 has already been displayed and confirmed, the display on the display unit 7 can be omitted.

また、上記実施形態においては、対象データが画像データである場合について説明したが、対象データは、音声データ、グラフデータ、又は動画データ等のデータであってもよい。また、上記実施形態では、ＳＴＥＰ７において確認する停止数を０に設定しているが、これに限らず、他の数字（例えば１或いは１０等の整数）としてもよい。 In the above embodiment, the target data is image data, but the target data may be audio data, graph data, video data, or other data. In the above embodiment, the number of stops to be confirmed in STEP 7 is set to 0, but this is not limited to this and may be another number (e.g., an integer such as 1 or 10).

また、上記実施形態においては、モデルＭは、被写体の良品又は不良品を判定するために、第１ラベル又は第２ラベルの判定を行う例について説明しているが、被写体の種類（人物、乗り物、動物等）の判定を行うモデルＭについても、上記実施形態と同様に適用が可能である。 In addition, in the above embodiment, an example is described in which model M judges the first label or the second label to determine whether the subject is good or bad, but the above embodiment can also be applied to model M that judges the type of subject (person, vehicle, animal, etc.).

例えば、被写体の種類をラベルＡ，ラベルＢ及びラベルＣに分類するモデルの場合、ＳＴＥＰ７において、ラベルＡ，ラベルＢ及びラベルＣについて、それぞれラベルについての抽出数が全て基準データ数を超えた場合にＳＴＥＰ８～ＳＴＥＰ１０の処理を行い、それぞれラベルについての抽出数が停止数を超え、且つ基準データ数未満の場合にＳＴＥＰ１１の処理を行うようにすればよい。 For example, in the case of a model that classifies the type of subject into labels A, B, and C, in STEP 7, if the number of extractions for each of labels A, B, and C exceeds the reference number of data, the processing of STEP 8 to STEP 10 is performed, and if the number of extractions for each of the labels exceeds the stop number and is less than the reference number of data, the processing of STEP 11 is performed.

また、上記実施形態においては、増加処理として、フリップ処理、シフト処理、微小回転処理及びフィルタ処理を例にしているが、これに限らず、インパルズノイズの付加、コントラスト調整、明度調整、拡大・縮小、部分マスク、トリミング、変形、或いは変色等の各処理を行ってもよい。従って、増加するデータの数は上記実施形態の３００倍に限られず、適宜変更することができる。 In addition, in the above embodiment, flip processing, shift processing, micro-rotation processing, and filter processing are given as examples of increasing processing, but the present invention is not limited to these, and various processing such as adding impulse noise, adjusting contrast, adjusting brightness, enlarging/reducing, partial masking, trimming, deformation, or discoloring may also be performed. Therefore, the amount of data to be increased is not limited to 300 times that of the above embodiment, and can be changed as appropriate.

また、上記実施形態においては、学習済みのモデルＭを用いた学習の例を示したが、これに限らず、未学習のモデルＭを用いて上記処理を行ってもよい。その際、図２のＳＴＥＰ１の前処理として、未学習のモデルＭに学習用の教師データを用いて学習を行い、その後にＳＴＥＰ１以降の処理を行えばよい。 In addition, in the above embodiment, an example of learning using a trained model M is shown, but this is not limiting, and the above processing may be performed using an untrained model M. In this case, as a preprocessing step of STEP 1 in FIG. 2, the untrained model M is trained using training teacher data, and then the processing from STEP 1 onward is performed.

また、上記実施形態においては、ＳＴＥＰ７のにおいて、抽出範囲が拡張範囲である場合に、（１）側に移動して、抽出処理によって抽出された抽出データの表示及び確認を行っているが（ＳＴＥＰ８～９）、これに限らず、（２）側に移動して再度抽出範囲を拡張範囲として（ＳＴＥＰ１１）、抽出処理（ＳＴＥＰ６）を行ってもよい。その際、拡張範囲の値を変更してもよく、抽出処理（ＳＴＥＰ６）の回数を制限してもよい。 In the above embodiment, if the extraction range is an extended range in STEP 7, the process moves to side (1) and the extracted data extracted by the extraction process is displayed and confirmed (STEPs 8-9). However, this is not limiting, and the process may move to side (2) and the extraction range may be set to an extended range again (STEP 11), and the extraction process (STEP 6) may be performed. In this case, the value of the extended range may be changed, and the number of times the extraction process (STEP 6) is performed may be limited.

Ｍ…モデル
１…学習装置
２…データ取得部
３…増加処理部
４…制御部
５…抽出部
６…学習部
７…表示部
８…ラベル変更部
１１…教師データ
１２…教師候補データ
１３…変更教師候補データ
１４…母集団
１５…抽出データ
１６…表示画面
１７…ラベル
１８…変更表示 M... Model 1... Learning device 2... Data acquisition unit 3... Increase processing unit 4... Control unit 5... Extraction unit 6... Learning unit 7... Display unit 8... Label change unit 11... Teacher data 12... Teacher candidate data 13... Changed teacher candidate data 14... Population 15... Extracted data 16... Display screen 17... Label 18... Changed display

Claims

A learning device that performs learning on a model that uses training data to calculate an evaluation value for target data, compares the evaluation value with a predetermined threshold, and classifies the target data into at least one of a first label and a second label, the learning device comprising:
a data acquisition unit that acquires teacher candidate data classified into at least one of the first label and the second label;
an augmentation processing unit that generates one or more pieces of modified teacher candidate data by modifying a part of the content of the teacher candidate data;
an extraction unit that performs an extraction process for calculating an evaluation value for the teacher candidate data and the modified teacher candidate data as a population using the model, and extracting data in which the evaluation value is within a predetermined extraction range including the threshold value;
A learning unit that causes the model to undergo a learning process at a predetermined learning rate;
a control unit that controls the extraction unit and the learning unit,
The control unit is
When the number of extracted data extracted as a result of the extraction process performed by the extraction unit exceeds a predetermined stop number, the extracted data is added to the teacher data and the learning process is performed while the extraction process is performed, and the learning process and the extraction process are executed until the number of new extracted data reaches the stop number.

The learning device according to claim 1 ,
The control unit, when performing an extraction process using a trained model, creates a new population by removing the extracted data from the population, and performs the extraction process on the new population using the extraction unit.

The learning device according to claim 1 ,
In the extraction process, when there are a plurality of changed teacher candidate data which have the same teacher candidate data that was the basis of the change, a limiting process is performed to extract only a predetermined limited number of the teacher candidate data or the changed teacher candidate data in one extraction process.

The learning device according to claim 1 ,
The apparatus further includes a display unit that displays the extracted data, and a label change unit that can change the label of the extracted data displayed on the display unit,
A learning device that enables the label change unit to change the label of the extracted data before the data is added to the training data.

The learning device according to claim 4,
When the number of the extracted data exceeds the stop number and is less than a predetermined reference number of data, the control unit performs an extension process to widen the width of the extraction range and performs the extraction process;
A learning device that performs the learning process when the number of the extracted data becomes equal to or greater than the reference number of data.

The learning device according to claim 5,
The control unit is a learning device that performs the extension process by widening the width of the extraction range and decreasing the learning rate.

A learning device that performs learning on a model that uses training data to calculate an evaluation value for target data, compares the evaluation value with a predetermined threshold, and classifies the target data into at least one of a first label and a second label, the learning device comprising:
a data acquisition unit that acquires teacher candidate data classified into at least one of the first label and the second label;
an extraction unit that performs an extraction process of calculating an evaluation value for the teacher candidate data as a population using the model and extracting data in which the evaluation value is within a predetermined extraction range including the threshold value;
A learning unit that causes the model to undergo a learning process at a predetermined learning rate;
a control unit that controls the extraction unit and the learning unit,
The control unit is
When the number of extracted data extracted by the extraction unit exceeds a predetermined number of stops and is less than a predetermined reference number of data, an extension process is performed to widen the width of the extraction range, and the extraction process is performed.
A learning device characterized in that, when the number of extracted data becomes equal to or greater than the reference number of data, the extracted data is added to the teacher data, the learning process is performed, and the extraction process is performed, and the learning process and the extraction process are performed until the number of newly extracted data becomes the stopping number.

A learning method for learning a model that uses training data to calculate an evaluation value for target data, compares the evaluation value with a predetermined threshold, and classifies the target data into at least one of a first label and a second label, the learning method comprising the steps of:
a data acquisition step of acquiring teacher candidate data classified into at least either the first label or the second label;
an augmentation process for generating one or more modified teacher candidate data by modifying a part of the content of the teacher candidate data;
the teacher candidate data and the modified teacher candidate data are treated as a population, and an extraction step of calculating an evaluation value for the population at a predetermined learning rate using the model, and extracting data in which the evaluation value is within a predetermined extraction range including the threshold value; and a learning step of causing the model to learn,
A learning method characterized in that, when the number of extracted data extracted by the extraction process exceeds a predetermined stopping number, the extracted data is added to the teacher data and the learning process is executed, while a new population is created by removing the extracted data from the population, and the extraction process is executed on the new population, and the learning process and the extraction process are executed until the number of new extracted data reaches the stopping number.

A learning program for causing a computer to function as a learning device according to any one of claims 1 to 7.