JP7607934B2

JP7607934B2 - Trained model, control device, friction stir welding system, neural network system, and trained model generating method

Info

Publication number: JP7607934B2
Application number: JP2021507153A
Authority: JP
Inventors: 英俊藤井; 好昭森貞
Original assignee: Osaka University NUC
Current assignee: Osaka University NUC
Priority date: 2019-03-20
Filing date: 2020-03-02
Publication date: 2025-01-06
Anticipated expiration: 2040-03-02
Also published as: JPWO2020189235A1; WO2020189235A1

Description

本発明は、摩擦攪拌接合における継手特性を予測するための学習済みモデル、その学習済みを含む制御装置、その制御装置を含む摩擦攪拌接合システム、学習済みモデルを生成するためのニューラルネットワークシステム、及び学習済みモデルの生成方法に関する。 The present invention relates to a trained model for predicting joint characteristics in friction stir welding, a control device including the trained model, a friction stir welding system including the control device, a neural network system for generating the trained model, and a method for generating the trained model.

摩擦攪拌接合は、棒状のツールを高速で回転させながら材料と接触させ、材料との摩擦熱を利用して接合する手法である（例えば特許文献１）。かかる摩擦攪拌接合は、接合中の最高到達温度が母材の融点に到達しない固相接合であり、従来の溶融溶接と比べて継手効率が高く、場合によっては接合部が母材よりも高強度化されるという画期的な接合法である（例えば、特許文献１参照）。Friction stir welding is a method of joining materials by contacting them with a rod-shaped tool while rotating at high speed, utilizing the heat of friction between the materials (see, for example, Patent Document 1). Friction stir welding is a solid-phase joining method in which the maximum temperature reached during joining does not reach the melting point of the base material, and it is an innovative joining method that has higher joint efficiency than conventional fusion welding and, in some cases, the joint can be stronger than the base material (see, for example, Patent Document 1).

特許第２７１２８３８号公報Patent No. 2712838

摩擦攪拌接合では、接合速度やツール回転速度等の接合条件を設定すれば、基本的に同じ継手を再現性良く得ることができる。しかし、適切な接合条件は、被接合材の材質、サイズ、形状等によって異なり、接合条件を決定するためには接合部ごとに多数の予備実験が必要となる。換言すれば、接合条件から継手特性を精度よく予測することは困難である。 In friction stir welding, if the welding conditions such as welding speed and tool rotation speed are set, essentially the same joint can be obtained with good reproducibility. However, the appropriate welding conditions vary depending on the material, size, shape, etc. of the materials to be welded, and determining the welding conditions requires numerous preliminary experiments for each joint. In other words, it is difficult to accurately predict the joint characteristics from the welding conditions.

そこで、本発明は、摩擦攪拌接合において、接合条件から継手特性を精度よく予測することを目的とする。なお、本発明において、摩擦攪拌接合は、突合せ接合、重ね接合、線接合、点接合及びこれらの組合せを全て含み、表面改質技術である摩擦攪拌プロセスも含むものである。Therefore, the present invention aims to accurately predict joint characteristics from the joining conditions in friction stir welding. Note that in the present invention, friction stir welding includes butt joining, lap joining, line joining, spot joining, and combinations of these, as well as the friction stir process, which is a surface modification technology.

上述した課題を解決すべく、本発明は、摩擦攪拌接合の接合条件を示す入力データに基づいて継手特性の予測値を出力するよう、コンピュータを機能させるための学習済みモデルであって、入力層、出力層、及び、前記入力層と前記出力層との間に介在し、非線形の活性化関数で表される複数の隠れユニットを有する隠れ層、を含むニューラルネットワークから構成され、前記ニューラルネットワークが、前記継手特性を示す実験データのバラツキを用いて、前記実験データと前記予測値との誤差で表される関数を最小化するように重み係数を学習し、前記入力層に入力された、摩擦攪拌接合の接合条件を示す入力データに対し、前記学習済みモデルに基づく演算を行い、継手特性の予測値を前記出力層から出力するとともに当該予測値の信頼性を示す情報を出力するよう、コンピュータを機能させるための学習済みモデルを提供する。
ここで、学習済みモデルは接合条件を出力してもよい。例えば、学習済みモデルは、継手特性の予測値を用いて、より良い継手特性を得るための接合条件を算出し、出力してもよい。 In order to solve the above-mentioned problems, the present invention provides a trained model for causing a computer to function so as to output a predicted value of joint characteristics based on input data indicating joining conditions of friction stir welding, the trained model being composed of a neural network including an input layer, an output layer, and a hidden layer having a plurality of hidden units interposed between the input layer and the output layer and represented by a nonlinear activation function, the neural network uses the variation in experimental data indicating the joint characteristics to learn weighting coefficients so as to minimize a function represented by the error between the experimental data and the predicted value, and performs a calculation based on the trained model on input data indicating the joining conditions of friction stir welding input to the input layer, and outputs a predicted value of the joint characteristics from the output layer and information indicating the reliability of the predicted value. A trained model for causing a computer to function is provided.
Here, the trained model may output the joining conditions. For example, the trained model may use the predicted values of the joint characteristics to calculate and output joining conditions for obtaining better joint characteristics.

上記のような構成を有する本発明の学習済みモデルでは、前記ニューラルネットワークが、更に前記重み係数のバラツキを用いて、前記重み係数を学習し、前記予測値に対する前記接合条件の影響度合いを示す情報を更に出力するよう、コンピュータを機能させること、が好ましい。In the trained model of the present invention having the above-mentioned configuration, it is preferable to cause the computer to function so that the neural network further learns the weighting coefficients using the variation in the weighting coefficients and further outputs information indicating the degree of influence of the joining conditions on the predicted value.

また、上記のような構成を有する本発明の学習済みモデルでは、前記ニューラルネットワークは、異なる数の隠れユニットを有する複数のモデルのそれぞれについて前記重み係数を学習し、前記複数のモデルのうち、予測値と実験データとの誤差が最小となるモデルを予測モデルとすること、が好ましい。 Furthermore, in the trained model of the present invention having the above-mentioned configuration, it is preferable that the neural network learns the weighting coefficients for each of a plurality of models having different numbers of hidden units, and selects, as the prediction model, a model among the plurality of models that minimizes the error between the predicted value and the experimental data.

また、上記のような構成を有する本発明の学習済みモデルでは、前記ニューラルネットワークは、異なる数の隠れユニットを有する複数のモデルのそれぞれについて前記重み係数を学習し、前記複数のモデルを、予測値と実験データとの誤差に基づいて順位付け、高順位のモデルから順次組み合わせて生成される新たな複数のモデルのうち、予測値と実験データとの誤差が最小となるモデルを予測モデルとすること、が好ましい。 In addition, in the trained model of the present invention having the above-mentioned configuration, it is preferable that the neural network learns the weighting coefficients for each of a plurality of models having different numbers of hidden units, ranks the plurality of models based on the error between the predicted value and experimental data, and among the plurality of new models generated by sequentially combining the highest-ranked models, the model that has the smallest error between the predicted value and experimental data is used as the prediction model.

また、本発明は、摩擦攪拌接合における接合条件を示す観測データを取得する入力部と、上記の学習済みモデルを記憶する記憶部と、前記観測データの前記学習済みモデルへの適用結果に基づいて摩擦攪拌接合装置の操作量を決定する演算装置と、決定された前記操作量を前記摩擦攪拌接合装置に指示する出力部と、を含む制御装置をも提供する。The present invention also provides a control device including an input unit that acquires observation data indicating the joining conditions in friction stir welding, a memory unit that stores the above-mentioned trained model, a calculation unit that determines an operation amount of a friction stir welding device based on a result of applying the observation data to the trained model, and an output unit that instructs the friction stir welding device on the determined operation amount.

また、本発明は、摩擦攪拌接合における接合条件及び継手特性を示す観測データを取得する入力部と、所定の学習アルゴリズムを記憶する記憶部と、前記観測データの前記所定の学習アルゴリズムへの適用結果に基づいて摩擦攪拌接合装置の操作量を決定する演算部と、決定された前記操作量を前記摩擦攪拌接合装置に指示する出力部と、を含む制御装置であって、前記所定の学習アルゴリズムは、前記摩擦攪拌接合装置の接合条件及び前記継手特性を示す入力データに基づいて前記摩擦攪拌接合装置の適切な操作量を出力するように前記制御装置を機能させるべく、入力層、出力層、及び、前記入力層と前記出力層との間に介在し、非線形の活性化関数で表される複数の隠れユニットを有する隠れ層、を含むニューラルネットワークから構成され、前記ニューラルネットワークが、前記接合条件と前記継手特性との関係性を含む価値関数で表される誤差関数を最小化するように重み係数を学習し、前記入力層に入力された、摩擦攪拌接合の接合条件を示す入力データに対し、前記学習済みモデルに基づく演算を行い、前記摩擦攪拌接合装置の操作量を前記出力層から出力する、制御装置をも提供する。The present invention also provides a control device including an input unit that acquires observation data indicating the welding conditions and joint characteristics in friction stir welding, a memory unit that stores a predetermined learning algorithm, a calculation unit that determines an operation amount of a friction stir welding device based on a result of applying the observation data to the predetermined learning algorithm, and an output unit that instructs the friction stir welding device on the determined operation amount, wherein the predetermined learning algorithm is composed of a neural network including an input layer, an output layer, and a hidden layer interposed between the input layer and the output layer and having a plurality of hidden units represented by a nonlinear activation function, so as to cause the control device to function to output an appropriate operation amount of the friction stir welding device based on input data indicating the welding conditions of the friction stir welding device and the joint characteristics, the neural network learns weighting coefficients so as to minimize an error function represented by a value function including a relationship between the welding conditions and the joint characteristics, performs calculations based on the learned model for input data indicating the welding conditions of friction stir welding input to the input layer, and outputs the operation amount of the friction stir welding device from the output layer.

更に、本発明は、摩擦攪拌接合を実行する摩擦攪拌接合装置と、前記摩擦攪拌接合における接合条件を示す観測データを計測する計測器と、上記の制御装置と、を含む摩擦攪拌接合システムをも提供する。 Furthermore, the present invention also provides a friction stir welding system including a friction stir welding apparatus for performing friction stir welding, a measuring instrument for measuring observation data indicating the welding conditions in the friction stir welding, and the above-mentioned control device.

また、本発明は、摩擦攪拌接合の接合条件を示す入力データに基づいて継手特性の予測値を出力するようにコンピュータを機能させるための学習済みモデルを生成するニューラルネットワークシステムであって、入力層、出力層、及び、前記入力層と前記出力層との間に介在し、非線形の活性化関数で表される複数の隠れユニットを有する隠れ層、を含むニューラルネットワークを含み、前記ニューラルネットワークは、前記継手特性を示す実験データのバラツキを用いて、前記実験データと前記予測値との誤差で表される関数を最小化するように重み係数を学習し、前記入力層に入力された、摩擦攪拌接合の接合条件を示す入力データに対し、前記学習済みモデルに基づく演算を行い、継手特性の予測値を前記出力層から出力するとともに当該予測値の信頼性を示す情報を出力する学習済みモデルを生成するように構成されているニューラルネットワークシステムをも提供する。The present invention also provides a neural network system that generates a trained model for causing a computer to function to output a predicted value of joint characteristics based on input data indicating the joining conditions of friction stir welding, the neural network including an input layer, an output layer, and a hidden layer interposed between the input layer and the output layer and having a plurality of hidden units represented by a nonlinear activation function, the neural network using the variation in experimental data indicating the joint characteristics to learn weighting coefficients so as to minimize a function represented by the error between the experimental data and the predicted value, and performing calculations based on the trained model on input data indicating the joining conditions of friction stir welding input to the input layer, to generate a trained model that outputs a predicted value of the joint characteristics from the output layer and outputs information indicating the reliability of the predicted value.

更に、本発明は、摩擦攪拌接合の接合条件を示す入力データに基づいて継手特性の予測値を出力するようにコンピュータを機能させるための学習用モデルの生成方法であって、前記学習済みモデルを生成するための学習用モデルは、入力層、出力層、及び、前記入力層と前記出力層との間に介在し、非線形の活性化関数で表される複数の隠れユニットを有する隠れ層、を含むニューラルネットワークを含み、前記ニューラルネットワークにおいて、前記継手特性を示す実験データのバラツキを用いて、前記実験データと前記予測値との誤差で表される関数を最小化するように重み係数を学習するステップと、前記入力層に入力された、摩擦攪拌接合の接合条件を示す入力データに対し、前記学習済みモデルに基づく演算を行い、継手特性の予測値を前記出力層から出力するとともに当該予測値の信頼性を示す情報を出力する学習済みモデルを生成するステップと、を含む、学習用モデルの生成方法をも提供する。The present invention further provides a method for generating a learning model for causing a computer to function to output a predicted value of joint characteristics based on input data indicating the joining conditions of friction stir welding, the learning model for generating the trained model includes a neural network including an input layer, an output layer, and a hidden layer interposed between the input layer and the output layer and having a plurality of hidden units represented by a nonlinear activation function, the method including: learning weighting coefficients in the neural network using the variance in experimental data indicating the joint characteristics to minimize a function represented by the error between the experimental data and the predicted value; and performing a calculation based on the trained model on input data indicating the joining conditions of friction stir welding input to the input layer, to generate a trained model that outputs a predicted value of the joint characteristics from the output layer and outputs information indicating the reliability of the predicted value.

本発明によれば、摩擦攪拌接合において、接合条件から継手特性を精度よく予測することができる。したがって、本発明をＦＳＷ装置の制御に適用することで、多数の予備実験を必要とせず、また作業者の技術レベルによらずに、最適な接合条件を用いて各種継手を簡単に得ることができる。ＦＳＷで得られる継手が種々の優れた特性を有していることは広く知られており、適切な接合条件を自動的に把握することができれば、摩擦攪拌接合の利用者は飛躍的に増加すると見込まれる。According to the present invention, in friction stir welding, joint characteristics can be accurately predicted from the welding conditions. Therefore, by applying the present invention to the control of an FSW device, various joints can be easily obtained using optimal welding conditions without the need for numerous preliminary experiments and regardless of the skill level of the worker. It is widely known that joints obtained by FSW have various excellent characteristics, and if it were possible to automatically grasp the appropriate welding conditions, it is expected that the number of users of friction stir welding will increase dramatically.

本発明の実施形態に係る摩擦攪拌接合システム１の概略図である。1 is a schematic diagram of a friction stir welding system 1 according to an embodiment of the present invention. 摩擦攪拌接合において使用される回転ツール１１の一例を示す図である。FIG. 1 is a diagram showing an example of a rotating tool 11 used in friction stir welding. 材料４１，４２の接合部の一例を示す断面図である。2 is a cross-sectional view showing an example of a joint between materials 41 and 42. FIG. ニューラルネットワークの一例を示す概念図である。FIG. 1 is a conceptual diagram illustrating an example of a neural network. 学習済みモデルの生成手順の一例を示すフローチャートである。1 is a flowchart illustrating an example of a procedure for generating a trained model. ニューラルネットワークの層数の決定手順例を示すフローチャートである。13 is a flowchart illustrating an example of a procedure for determining the number of layers of a neural network. 複数の予測モデルを組み合わせてより適切なモデルを生成する手順例を示すフローチャートである。11 is a flowchart illustrating an example of a procedure for combining a plurality of prediction models to generate a more appropriate model. 予測モデルから出力される予測値及びエラーバーの一例を示すグラフである。1 is a graph showing an example of predicted values and error bars output from a prediction model. 予測モデルから出力される接合条件の重要度の一例を示すグラフである。13 is a graph showing an example of the importance of bonding conditions output from a prediction model. 制御装置２０の機能構成例を示すブロック図である。2 is a block diagram showing an example of a functional configuration of a control device 20. FIG. 制御装置２０の動作例を示すフローチャートである。4 is a flowchart showing an example of the operation of the control device 20. 制御装置１２０の機能構成例を示すブロック図である。FIG. 2 is a block diagram showing an example of the functional configuration of a control device 120. 制御装置１２０の動作例を示すフローチャートである。4 is a flowchart showing an example of the operation of the control device 120.

以下、本発明の代表的な実施形態を、図面を参照しつつ詳細に説明する。ただし、本発明はこれら図面に限定されるものではない。また、図面は、本発明を概念的に説明するためのものであるから、理解容易のために、必要に応じて寸法、比又は数を誇張又は簡略化して表している場合もある。Representative embodiments of the present invention will be described in detail below with reference to the drawings. However, the present invention is not limited to these drawings. In addition, since the drawings are intended to conceptually explain the present invention, dimensions, ratios, or numbers may be exaggerated or simplified as necessary for ease of understanding.

ここでは、摩擦攪拌接合システムの全体像を説明したうえで、継手特性の予測モデル（学習済みモデル）及びその学習法、並びに当該モデルの制御への適用について説明することとする。Here, we will explain the overall picture of the friction stir welding system, and then explain the predictive model (trained model) of joint characteristics and its learning method, as well as the application of the model to control.

１．摩擦攪拌接合システムの全体構成
本実施形態に係る摩擦攪拌接合（ＦＳＷ）システム１は、種々の材料４０を摩擦攪拌により接合するためのものである。ここで、接合対象となる材料４０は、典型的には、アルミニウム、マグネシウム、銅、チタン、亜鉛、鉛、鉄鋼、ニッケル、コバルト及びこれら材料の合金や複合材料などの金属材料であるが、例えばプラスチックやＣＦＲＰなどの樹脂材料でもよい。 1. Overall Configuration of the Friction Stir Welding System The friction stir welding (FSW) system 1 according to this embodiment is for welding various materials 40 by friction stirring. Here, the material 40 to be welded is typically a metal material such as aluminum, magnesium, copper, titanium, zinc, lead, steel, nickel, cobalt, or an alloy or composite material of these materials, but may also be a resin material such as plastic or CFRP.

図１に示されるように、ＦＳＷシステム１は、摩擦攪拌接合を実行する摩擦攪拌接合（ＦＳＷ）装置１０、ＦＳＷ装置１０を制御する制御装置２０、及び計測器３０を含んでいる。計測器３０は、ＦＳＷの接合条件及び継手特性を計測する１種又は複数種類の計測機器の総称であり、例えば、ＦＳＷ装置１０及び材料４０の状態（例えば回転ツール１１の回転速度・押圧荷重・角度・移動速度や、接合部の温度・幅等）を計測する。計測器３０は、接合部を撮影する撮像デバイスを含むものである。なお、計測器３０はＦＳＷ装置１０及び制御装置２０に含まれていてもよい。As shown in FIG. 1, the FSW system 1 includes a friction stir welding (FSW) apparatus 10 that performs friction stir welding, a control device 20 that controls the FSW apparatus 10, and a measuring device 30. The measuring device 30 is a collective term for one or more types of measuring equipment that measure the joining conditions and joint characteristics of FSW, and measures, for example, the state of the FSW apparatus 10 and the material 40 (e.g., the rotation speed, pressing load, angle, and moving speed of the rotating tool 11, and the temperature and width of the joint). The measuring device 30 includes an imaging device that photographs the joint. The measuring device 30 may be included in the FSW apparatus 10 and the control device 20.

１－１．摩擦攪拌接合装置の概要
ＦＳＷ装置１０は、所望の回転速度で回転する回転ツール１１を取り付けて摩擦攪拌接合を実施するものである。
図２に示すように、回転ツール１１は、柱状のショルダ１２と、ショルダ１２の先端に配置されてショルダ１２よりも小さい径を持つプローブ１３と、を含む。回転ツール１１としては、プローブ１３とショルダ１２とが一体的に駆動されるタイプと、プローブ１３とショルダ１２とが各々独立して駆動されるタイプとの、いずれのタイプの回転ツール１１を使用することができる。 1-1. Overview of Friction Stir Welding Apparatus An FSW apparatus 10 is equipped with a rotary tool 11 that rotates at a desired rotation speed to perform friction stir welding.
2, the rotary tool 11 includes a columnar shoulder 12 and a probe 13 that is disposed at the tip of the shoulder 12 and has a smaller diameter than the shoulder 12. As the rotary tool 11, either a type in which the probe 13 and the shoulder 12 are driven integrally, or a type in which the probe 13 and the shoulder 12 are driven independently can be used.

プローブ１３は、図１に示すように、材料４０中に押し入れられ、接合すべき突合せ面に沿って移動する。このとき、材料は裏当て板に拘束され、固体の状態を維持しながら、回転ツール１１による塑性流動によって接合されることになる。なお、材料４０中に圧入した回転ツール１１は必ずしも横移動させる必要はなく、そのまま引き抜くことで点接合を達成することができる。As shown in Figure 1, the probe 13 is pressed into the material 40 and moves along the butt surfaces to be joined. At this time, the material is restrained by the backing plate and is joined by the plastic flow caused by the rotating tool 11 while maintaining a solid state. Note that the rotating tool 11 pressed into the material 40 does not necessarily need to be moved laterally; spot joining can be achieved by simply pulling it out.

材料４０がアルミニウム合金やマグネシウム合金等の摩擦攪拌が容易な材料の場合、プローブ１３にはネジ１４が形成されていることが好ましい。プローブ１３の長さは、材料４０の板厚とほぼ等しいが、裏当ての板と接触しないように僅かに（例えば０．０５ｍｍ～０．２ｍｍ程度）短いことが好ましい。プローブ１３の断面形状としては、円柱状が一般的であるが、楕円状等の他の形状でもよいものとする。 When the material 40 is an aluminum alloy, magnesium alloy, or other material that is easy to stir by friction, it is preferable that the probe 13 is formed with a screw 14. The length of the probe 13 is approximately equal to the plate thickness of the material 40, but it is preferable that it is slightly shorter (for example, about 0.05 mm to 0.2 mm) so as not to come into contact with the backing plate. The cross-sectional shape of the probe 13 is generally cylindrical, but other shapes such as elliptical may also be used.

ショルダ１２は、軟化した材料４０が飛び出すのを防止するとともに、摩擦熱を発生及び維持する役割を担う。ショルダ１２は、材料４０に接触しながら回転するとともに、接合方向Ｘに向かって所望の移動速度（接合速度）で移動する。The shoulder 12 prevents the softened material 40 from popping out and generates and maintains frictional heat. The shoulder 12 rotates while in contact with the material 40 and moves at a desired moving speed (joining speed) in the joining direction X.

ＦＳＷ装置１０はまた、回転ツール１１を保持するツールホルダ、回転ツール１１を駆動するモータ、回転ツール１１を材料４０に向かって押圧する押圧機構、回転ツール１１を接合方向（Ｘ方向）及び接合方向と直交する方向（Ｙ方向）に移動させる移動機構、並びに、材料４０を保持する材料ホルダ（いずれも図示せず）を含んでいる。なお、ツールホルダは、回転ツール１１を材料に対して一定の角度だけ傾けた状態で回転ツール１１を保持することができる。したがって、接合時には、回転ツール１１は、プローブ１３の先端がショルダ１２よりも先行するように、所定の角度（例えば１～５度程度）だけ傾けられる。The FSW device 10 also includes a tool holder that holds the rotating tool 11, a motor that drives the rotating tool 11, a pressing mechanism that presses the rotating tool 11 against the material 40, a moving mechanism that moves the rotating tool 11 in the joining direction (X direction) and in a direction perpendicular to the joining direction (Y direction), and a material holder that holds the material 40 (all not shown). The tool holder can hold the rotating tool 11 in a state in which the rotating tool 11 is tilted at a certain angle relative to the material. Therefore, during joining, the rotating tool 11 is tilted at a predetermined angle (for example, about 1 to 5 degrees) so that the tip of the probe 13 leads the shoulder 12.

図３に示すように、ＦＳＷにより接合された材料４０（４１，４２）の断面組織は、幾つかの領域に分けられる。つまり、接合部位の中央には、通常は数μｍの等軸粒からなる再結晶組織を有する攪拌部（接合部）４３が存在する。攪拌部４３の外側には、塑性変形により結晶粒が伸びた形状を持つ熱加工影響部４４が存在し、更にその外側には、塑性変形は受けていないが熱の影響を受けた熱影響部（ＨＡＺ）４５が存在する。なお、攪拌部４３、熱加工影響部４４及びＨＡＺ４５の状態（例えば幅、軟化の程度など）は継手特性に影響を及ぼす。As shown in Figure 3, the cross-sectional structure of the material 40 (41, 42) joined by FSW is divided into several regions. That is, in the center of the joint, there is a stir zone (joint) 43, which usually has a recrystallized structure consisting of equiaxed grains of several μm. Outside the stir zone 43, there is a thermal processing affected zone 44, which has a shape in which the crystal grains have been elongated due to plastic deformation, and further outside that, there is a heat affected zone (HAZ) 45, which has not been plastically deformed but has been affected by heat. The state of the stir zone 43, the thermal processing affected zone 44, and the HAZ 45 (e.g., width, degree of softening, etc.) affects the joint characteristics.

したがって、ＦＳＷでは、接合対象となる材料４０の確実な保持のほか、回転ツール１１の形状、押圧荷重、角度、回転速度、接合速度等の接合条件（ＦＳＷ条件）を適切に調節することが、良い継手を得るために重要となる。ただし、接合条件は、材料の種類、板厚及び接合形状に応じて異なる。また、接合の良し悪し、つまり継手特性は、例えば、継手の機械的性質（引張強度、曲げ強度、疲労強度、耐食性など）、バリの発生量、エネルギー投入量、接合速度等で評価することができる。Therefore, in FSW, in addition to securely holding the materials 40 to be joined, it is important to properly adjust the joining conditions (FSW conditions) such as the shape of the rotating tool 11, pressing load, angle, rotation speed, and joining speed in order to obtain a good joint. However, the joining conditions differ depending on the type of material, plate thickness, and joining shape. In addition, the quality of the joint, that is, the joint characteristics, can be evaluated, for example, by the mechanical properties of the joint (tensile strength, bending strength, fatigue strength, corrosion resistance, etc.), the amount of burrs generated, the amount of energy input, the joining speed, etc.

１－２．制御装置の概要
制御装置２０は、計測器３０から計測データを取得し、この計測データに基づいてＦＳＷ装置１０を制御する。かかる制御装置２０は、演算装置（ＣＰＵ、ＧＰＵなど）及び記憶装置（ＲＡＭ、ＲＯＭなど）を含むコンピュータとして構成することができる。制御装置２０は、１台のコンピュータでもよいし、複数台のコンピュータで構成されてもよい。 1-2. Overview of the Control Device The control device 20 acquires measurement data from the measuring device 30 and controls the FSW device 10 based on this measurement data. The control device 20 can be configured as a computer including an arithmetic unit (CPU, GPU, etc.) and a storage device (RAM, ROM, etc.). The control device 20 may be a single computer or may be configured with multiple computers.

制御装置２０によるＦＳＷ装置１０の制御は、学習済みの予測モデルを用いて行うことができる。予測モデルは、制御装置２０において生成されてもよいし、例えばクラウドコンピューティングのように外部コンピュータにおいて生成されてもよい。
以下、制御装置２０の詳細を説明するに先立ち、本実施形態における機会学習について説明することとする。 The control device 20 can control the FSW device 10 using a trained prediction model. The prediction model may be generated in the control device 20, or may be generated in an external computer, for example, by cloud computing.
Before describing the details of the control device 20, machine learning in this embodiment will be described below.

２．予測モデルの構築
２－１．ニューラルネットワークによる学習
ニューラルネットワークは、非線形関数を組み合わせることで複雑な関係性を表現することができる。本実施形態では、このようなニューラルネットワークの柔軟性に注目し、ニューラルネットワークを、多くの要素が複雑に絡み合う継手特性のデータ処理に適用することとしている。 2. Building a predictive model
2-1. Learning by Neural Network A neural network can express complex relationships by combining nonlinear functions. In this embodiment, focusing on the flexibility of such a neural network, the neural network is applied to data processing of joint characteristics in which many elements are intricately intertwined.

ただし、ニューラルネットワークを継手特性の予測に直ちに適用することは容易ではなく、データのバラツキ及びフィッティングによる誤差を適切に処理する必要がある。そこで、発明者らは、ニューラルネットワークにベイズ推定の考え方を導入することとした。これにより、統計的なエラーバーを予測することができる。エラーバーの大きさはその時の入力条件に依存し、データのバラツキが大きく信頼性の低い場合には、エラーバーは大きくなる。However, it is not easy to immediately apply neural networks to predicting joint characteristics, and it is necessary to appropriately handle errors due to data variance and fitting. Therefore, the inventors decided to introduce the idea of Bayesian estimation into neural networks. This makes it possible to predict statistical error bars. The size of the error bar depends on the input conditions at the time, and if the data has a large variance and is unreliable, the error bar will be large.

詳細に説明すると、図４に、ニューラルネットワークの構造の概略を示す。図示のとおり、ニューラルネットワークは、入力層及び出力層を含んでいる。入力ｘ_ｉとして各種の接合条件（ＦＳＷ条件）を与えると、出力として継手特性の予測値ｙを得る。出力ｙは１つでも複数でもよいが、ここでは説明の便宜上、出力ｙが１つであるものとしている。 More specifically, the structure of the neural network is shown in FIG. 4. As shown in the figure, the neural network includes an input layer and an output layer. When various joining conditions (FSW conditions) are given as input x _i , a predicted value y of the joint characteristics is obtained as output. The output y may be one or more, but for convenience of explanation, it is assumed here that the output y is one.

入力される接合条件としては、例えば、試料の化学組成、回転ツール１１の回転速度、移動速度、印加荷重（又はプローブ１３の挿入量）、前進角、試料の形状、試料の厚さ、押え冶具形状、ツール形状等の「接合前パラメータ」と、接合部の温度、接合中のツールトルク、接合中のツール荷重、接合中のツール位置等の「接合中パラメータ」がある。また、出力される継手特性は、接合部の特性を定量的に表した数値であり、例えば継手強度、攪拌部４３の幅、バリ発生量、ＨＡＺ軟化、硬度分布、最高硬度、耐食性などがある。The input joining conditions include, for example, "pre-joining parameters" such as the chemical composition of the sample, the rotation speed of the rotating tool 11, the movement speed, the applied load (or the insertion amount of the probe 13), the advance angle, the shape of the sample, the thickness of the sample, the shape of the clamping jig, and the tool shape, as well as "joining parameters" such as the temperature of the joint, the tool torque during joining, the tool load during joining, and the tool position during joining. The output joint characteristics are numerical values that quantitatively represent the characteristics of the joint, and include, for example, the joint strength, the width of the stirring portion 43, the amount of burr generation, HAZ softening, hardness distribution, maximum hardness, and corrosion resistance.

入力層と出力層との間には、複数の隠れユニットを有する隠れ層が配置され、複雑な関数の表現を可能にしている。ここでは、説明の便宜上、隠れユニットは１層の隠れ層を構成するものとしているが、当然のことながら、隠れユニットは複数層の隠れ層を構成することができる。なお、全ての隠れユニットが非線形関数で表される必要はなく、線形関数で表されるユニットを含んでいてもよい。 Between the input layer and the output layer, a hidden layer having multiple hidden units is arranged, making it possible to express complex functions. Here, for the sake of convenience, the hidden units are assumed to constitute one hidden layer, but it goes without saying that the hidden units can constitute multiple hidden layers. Note that it is not necessary for all hidden units to be expressed by nonlinear functions, and they may include units expressed by linear functions.

このようなニューラルネットワークにおいて、入力ｘ_ｊとｉ番目の隠れユニットｈ_ｉとの間の関係式は、非線形な活性化関数を用いて次のように表せる。
ただし、ｗ_ｉｊ ^（１）はｘ_ｊとｈ_ｉとの間の重みであり、θ_ｉ ^（１）はしきい値である。双曲線正接関数ｔａｎｈは非線形関数の一例であり、例えばシグモイド関数、正規化線形関数（ＲｅＬＵ）などの他の非線形関数を用いてもよい。 In such a neural network, the relationship between the input x _j and the i-th hidden unit h _i can be expressed as follows using a nonlinear activation function:
where w _ij ⁽¹⁾ is the weight between x _j and h _i , and θ _i ⁽¹⁾ is a threshold value. The hyperbolic tangent function tanh is an example of a nonlinear function, and other nonlinear functions such as a sigmoid function or a rectified linear function (ReLU) may be used.

また、隠れユニットｈ_ｉと出力ｙの間の関係は次のように線形関数で表せる。
ただし、ｗ_ｉ ^（２）はｈ_ｉとｙとの間の重みであり、θ^（２）はしきい値である。 Moreover, the relationship between the hidden unit h _i and the output y can be expressed as a linear function as follows:
where w _i ⁽²⁾ is the weight between h _i and y, and θ ⁽²⁾ is a threshold value.

上記式１のように入力ｘと隠れユニットｈとの関係を非線形関数で表現することにより、複雑な関係を表現できるネットワークを構成することができる。以下、式１，２における重みｗ_ｉｊ ^（１）及びｗ_ｉ ^（２）を単に重み係数ｗと言い、しきい値θ_ｉ ^（１）及びθ^（２）を単にしきい値θと言うがある。 A network that can express complex relationships can be constructed by expressing the relationship between the input x and the hidden unit h as a nonlinear function as in the above formula 1. Hereinafter, the weights w _ij ⁽¹⁾ and w _i ⁽²⁾ in formulas 1 and 2 will be simply referred to as weight coefficients w, and the thresholds θ _i ⁽¹⁾ and θ ⁽²⁾ will be simply referred to as thresholds θ.

ここで、入力変数ｘを次式で±０．５の範囲に規格化してもよく、出力変数ｙもまた同じ手法で規格化してもよい。
ただし、ｘ_Ｎは規格化されたｘ、ｘ_ｍａｘは元のデータの最大値、ｘ_ｍｉｎは最小値である。 Here, the input variable x may be normalized to the range of ±0.5 using the following formula, and the output variable y may also be normalized using the same method.
Here, _xN is the normalized x, _xmax is the maximum value of the original data, and _xmin is the minimum value.

２－２．エネルギー関数Ｍ（ｗ）
上述した重み係数ｗ及びしきい値θを決定するために、次式で表されるエネルギー関数Ｍ（ｗ）を最小にすることを考える。
ここで、Ｅ_Ｄは誤差関数、Ｅ_ｗ（ｃ）は適切化項（正則化項）であり、追って詳しく述べる。また、パラメータベクトルｗは、重み係数ｗ及びしきい値θを含むものとする。α_ｃ及びβは、モデルの複雑さを制御するパラメータである。 2-2. Energy function M(w)
In order to determine the above-mentioned weighting coefficient w and threshold value θ, it is considered to minimize the energy function M(w) expressed by the following equation.
Here, E _D is an error function, E _w(c) is a regularization term (term) and will be described in detail later. The parameter vector w includes a weighting coefficient w and a threshold θ. α _c and β are parameters that control the complexity of the model.

エネルギー関数Ｍ（ｗ）の最小化をコンピュータ上で実現するためには、例えば可変計量法を用いることができ、また、Ｍ（ｗ）の勾配の計算には例えば逆誤差伝播法を用いることができるが、ここでは詳細に立ち入らない。 To minimize the energy function M(w) on a computer, for example, the variable metric method can be used, and the gradient of M(w) can be calculated using the backpropagation method, but we will not go into details here.

上式のようにエネルギー関数Ｍ（ｗ）は、誤差関数Ｅ_Ｄと適切化項Ｅ_ｗ（ｃ）とからなる。そして、誤差関数Ｅ_Ｄは、次式のように予測モデルの予測値ｙ（ｘ^ｍ；ｗ）と実験値ｔ^ｍの差の二乗和で表すことができる。
ここで｛ｘ^ｍ、ｔ^ｍ｝はデータセットであり、ｘ^ｍは入力変数を、ｔ^ｍは実験データすなわちターゲットを示す。ｍはデータとターゲットの組合せのラベルである。 As shown in the above equation, the energy function M(w) is composed of an error function E _D and an appropriated term E _w(c) . The error function E _D can be expressed as the sum of squares of the difference between the predicted value y(x ^m ; w) of the prediction model and the experimental value t ^m as shown in the following equation.
Here, {x ^m , t ^m } is a data set, x ^m is an input variable, and t ^m is experimental data, i.e., a target, and m is a label for the combination of data and target.

上式から、予測モデルの予測結果が実験データに良く一致するとき、すなわちｙ（ｘ^ｍ；ｗ）がｔ^ｍに近いとき、誤差関数Ｅ_Ｄは最小となる。 From the above equation, when the prediction results of the prediction model match the experimental data well, ie, when y( ^xm ;w) is close to ^tm , the error function E _D is at a minimum.

次いで、適切化項Ｅ_ｗは、モデル出力ｙ（ｘ；ｗ）を入力ｘの滑らかな関数にする働きがある。換言すれば、適切化項は重み係数ｗを小さくすることを促し、予測モデルがデータセットのばらつきにオーバーフィットすること、つまり過学習を抑制する。 Next, the fit term _Ew serves to make the model output y(x;w) a smooth function of the input x. In other words, the fit term encourages the weight coefficient w to be small, suppressing the overfitting of the predictive model to the variability of the data set, i.e., overfitting.

適切化項Ｅ_ｗは複数の適切化項Ｅ_ｗ（ｃ）の和で表されてもよい。例えば、入力ｘと隠れユニットｈ間の重み係数で一つのクラス、隠れユニットｈと出力ｙとの間の重み係数で一つのクラス、隠れユニットｈのしきい値で一つのクラスをつくり、各適切化項Ｅ_ｗ（ｃ）を計算してもよい。この場合、各適切化項Ｅ_ｗ（ｃ）は、次式で示されるように、各クラスに属する係数ｗ_ｉの二乗和で表される。
The appropriateness term _Ew may be expressed as the sum of a plurality of appropriateness terms _Ew(c) . For example, one class may be created based on the weight coefficient between the input x and the hidden unit h, one class based on the weight coefficient between the hidden unit h and the output y, and one class based on the threshold value of the hidden unit h, and each appropriateness term _Ew(c) may be calculated. In this case, each appropriateness term Ew _(c) is expressed as the sum of squares of the coefficients w _i belonging to each class, as shown in the following equation.

次いで、パラメータα_ｃ及びβは、隠れユニットｈの個数とともに、モデルの複雑さをコントロールしている。例えばデータがσ_νの標準偏差で示されるガウス分布でばらついているとすると、βはデータのバラツキσ_ν ^２＝１／βを規定し、α_ｃは重み係数ｗのバラツキσ_ｗ（ｃ） ^２＝１／α_ｃを規定していることになる。 The parameters _αc and β then control the complexity of the model, along with the number of hidden units h. For example, if the data is distributed Gaussianly with a standard deviation of _σν , then β defines the variance of the data _σν2 = ¹ /β, and _αc defines the variance of the weighting coefficient w, σw _(c) ² = 1/ _αc .

式４から、パラメータα_ｃは重み係数ｗを減少させる効果がある。したがって、大きなσ_ｗは、それに対応する入力が出力に大きな変化を与えることを意味している。このことから、σ_ｗを、各入力の重要性を示す指標とすることができる。また、σ_νはデータのバラツキを示し、後述するようにエラーバーとして使用できる。 From Equation 4, the parameter α _c has the effect of decreasing the weight coefficient w. Therefore, a large σ _w means that the corresponding input has a large change in the output. From this, σ _w can be used as an index showing the importance of each input. Also, σ _ν shows the variance of the data and can be used as an error bar as described later.

本実施形態では、追って詳しく述べるように、パラメータα_ｃおよびβは、ベイズ推定の考え方を用いて算出できる。各パラメータの初期値は、ユーザが適宜決定してもよく、例えば、重み係数ｗに小さなバラツキを持たせて初期値を設定してもよい。 In this embodiment, as will be described in detail later, the parameters _αc and β can be calculated using the concept of Bayesian estimation. The initial values of each parameter may be appropriately determined by the user, and for example, the initial values may be set by allowing small variations in the weighting coefficient w.

２－３．パラメータα _ｃ、βの決定
次いで、パラメータα_ｃ、βについて詳細に説明する。
パラメータβが大きすぎると関数の自由度が大きくなり、オーバーフィッティングが起こりやすくなる。逆に、パラメータα_ｃ（以下、単にαと表記する）が大きくなると関数が滑らかになりすぎ、データにフィットしなくなる。このようなα、βの重要性を考慮し、発明者らは、ベイズ推定の考え方を導入してα、βに統計的な意味を持たせることとした。 2-3 Determination of parameters α _c and β Next, the parameters α _c and β will be described in detail.
If the parameter β is too large, the degree of freedom of the function increases, and overfitting becomes more likely. Conversely, if the parameter α _c (hereinafter simply referred to as α) is large, the function becomes too smooth and does not fit the data. Considering the importance of α and β, the inventors have decided to introduce the idea of Bayesian estimation to give α and β statistical meaning.

一般に、あるデータＤが起こるという条件のもとで、重み係ｗおよびしきい値θに対してある組合せｗが起こりうる条件付き確率ｐ（ｗ｜Ｄ）は、次のように表される。
In general, the conditional probability p(w|D) that a certain combination w occurs for a weighting factor w and a threshold value θ, given that certain data D occurs, is expressed as follows:

最も確からしい重み係数ｗやしきい値θを決定するには、ｐ（ｗ｜Ｄ）が最大になるようにすれば良い。つまり、上式より、
の関係があるので、右辺に含まれる確率ｐ（ｗ｜Ｄ）、ｐ（ｗ）について正規分布に従ったばらつきがあるものとして、各確率を求めることとする。 In order to determine the most likely weighting coefficient w and threshold value θ, it is necessary to maximize p(w|D).
Since the following relationship exists, each probability is calculated assuming that the probabilities p(w|D) and p(w) included in the right-hand side vary according to a normal distribution.

まず確率ｐ（ｗ｜Ｄ）を求める。
一般に、正規分布ｆ（ｘ）は、平均をｍ、標準偏差をσとすると、次式で表される。
したがって、重み係数ｗ及びしきい値θがあるベクトルｗで表される場合のデータのばらつきは、次式で表される。
このとき、ｘ^（ｍ）は入力変数を、ｔ^（ｍ）は実験データすなわちターゲットであり、Ｚ_Ｄは規格化定数、σ_νはデータのバラツキである。 First, the probability p(w|D) is calculated.
In general, a normal distribution f(x) is expressed by the following equation, where m is the mean and σ is the standard deviation.
Therefore, when the weighting coefficient w and the threshold value θ are expressed by a vector w, the variance of the data is expressed by the following equation.
In this case, x ^(m) is an input variable, t ^(m) is experimental data, that is, a target, _ZD is a normalization constant, and _σν is the variance of the data.

ここで、式５で表される誤差関数Ｅ_Ｄ（ｗ）を式１０に代入すると、次式を得る。
Here, by substituting the error function E _D (w) expressed by the formula 5 into the formula 10, the following formula is obtained.

次いで確率ｐ（ｗ）を求めると、ｐ（ｗ）にもバラツキがあり、その確率は、式５を考慮すると、
で表される。ここでＺ_ｗは規格化定数であり、σ_ｗは真のｗの値からのバラツキである。 Next, when the probability p(w) is calculated, p(w) also has variation. Taking into account Equation 5, the probability is as follows:
where Z _w is a normalization constant and σ _w is the deviation from the true w value.

式１１、１２を式８に代入すると、
となる。 Substituting Equations 11 and 12 into Equation 8, we get
It becomes.

したがって、ｐ（ｗ｜Ｄ）を最大にするには、
を最小にすれば良い。 Therefore, to maximize p(w|D),
should be minimized.

そして、式４で表されるエネルギー関数Ｍ（ｗ）を最小にすることと比較すると、パラメータα、βは
という統計的意味を持つことがわかる。これにより的確なトレーニングが可能になる。 In comparison with minimizing the energy function M(w) expressed by Equation 4, the parameters α and β are
This has statistical significance, making it possible to provide precise training.

このとき、エラーバーσは、次式で表される。
In this case, the error bar σ is expressed by the following equation.

２－４．トレーニング手順
以上から、予測モデル生成のためのトレーニングは、次の手順で行われる。トレーニングは、制御装置２０におけるトレーニング用プログラムの実行を介して行われてもよいし、トレーニング用プログラムを実行可能な外部コンピュータで行われてもよい。 2-4. Training Procedure As described above, training for generating a prediction model is performed according to the following procedure. The training may be performed via execution of a training program in the control device 20, or may be performed by an external computer capable of executing the training program.

具体的に説明すると、図５のステップＳ１１において、訓練データ（ｘ^ｍ，ｔ^ｍ）を取得する。ここで、ｘ^ｍは入力変数となる接合条件であり、ｔ^ｍはターゲットとなる継手特性（実験データ）である。併せて、ステップＳ１２において、パラメータベクトルｗを設定する。ここでベクトルｗは、重み係数ｗ及びしきい値θを含んでいてもよい。 More specifically, in step S11 of Fig. 5, training data ( ^xm , ^tm ) is acquired. Here, ^xm is a joining condition that is an input variable, and ^tm is a target joint characteristic (experimental data). In addition, in step S12, a parameter vector w is set. Here, the vector w may include a weighting coefficient w and a threshold value θ.

次いで、ステップＳ１３において、訓練データ（ｘ^ｍ，ｔ^ｍ）のバラツキσ_ν及びパラメータベクトルｗのバラツキσ_ｗを算出する。 Next, in step S13, the variance σ _v of the training data (x ^m , t ^m ) and the variance σ _w of the parameter vector w are calculated.

そして、ステップＳ１４において、エネルギー関数Ｍ（ｗ）を最小化するようなパラメータベクトルｗを算出する。この計算には可変計量法を用いることができ、Ｍ（ｗ）の勾配の計算には例えば逆誤差伝播法を用いることができるThen, in step S14, a parameter vector w that minimizes the energy function M(w) is calculated. The variable metric method can be used for this calculation, and the gradient of M(w) can be calculated using, for example, the backpropagation method.

ステップＳ１５において、前手順の計算結果にしたがってパラメータベクトルｗを更新するとともに、そのバラツキσ_ｗを算出及び更新する。 In step S15, the parameter vector w is updated in accordance with the calculation results of the previous step, and the variation σ _w is calculated and updated.

そして、ステップＳ１４，Ｓ１５を所定の回数だけ繰り返し、一連の手順を終了する。あるいは、パラメータベクトルｗが予め設定した範囲内に収束することを終了条件にしてもよい。このようにして得られたパラメータベクトルｗを、予測モデルのパラメータとして用いる。Then, steps S14 and S15 are repeated a predetermined number of times to end the series of procedures. Alternatively, the end condition may be that the parameter vector w converges within a preset range. The parameter vector w thus obtained is used as a parameter of the prediction model.

この作業を、隠れユニットｈの個数、つまりパラメータベクトルｗの要素数を調整しながら繰り返す。そうすると、隠れユニットの個数が増えるにしたがって、予測モデルの予測値と実験データとの差が単調に減少する。ここで、入力と出力の関係が複雑であるほど、多くの隠れユニットが必要であるが、実験データは常に誤差を含んでいるため、隠れユニットｈの個数を増やし過ぎるとオーバーフィット（過学習）が生じ、予測精度つまり汎化性能が逆に低下する。This process is repeated while adjusting the number of hidden units h, i.e., the number of elements in the parameter vector w. As a result, the difference between the predicted value of the predictive model and the experimental data monotonically decreases as the number of hidden units increases. Here, the more complex the relationship between the input and output, the more hidden units are required, but because experimental data always contains errors, increasing the number of hidden units h too much will result in overfitting, and the prediction accuracy, i.e., generalization performance, will actually decrease.

そこで、例えば、データの半分をランダムに選択し、その半分のデータだけでニューラルネットワークをトレーニングし、残りの半分をモデルの汎化性能を調べるためのテストデータとしてもよい。テストデータを用いて、モデルの予測値とテストデータとの間の誤差、つまりテストエラーを測定するのである。なお、データセットの数が少ない場合には、大部分のデータをトレーニング用とし、残りのデータをテスト用としてよい。例えば、データセット数が３０個の場合、その内の２７個をトレーニング用に、残りの３個をテスト用とすることが好ましい。 For example, half of the data can be randomly selected, the neural network can be trained using only that half of the data, and the remaining half can be used as test data to examine the generalization performance of the model. The test data is used to measure the error between the model's predicted value and the test data, i.e., the test error. If the number of data sets is small, most of the data can be used for training and the remaining data for testing. For example, if there are 30 data sets, it is preferable to use 27 of them for training and the remaining 3 for testing.

隠れユニットｈが、ある個数となるとき、テストエラーは最小となる。この時のモデルを最適と判断し、学習済みモデルとして用いると予測精度が向上する。 When the number of hidden units h reaches a certain number, the test error is minimized. If the model at this point is judged to be optimal and used as a trained model, the prediction accuracy will improve.

したがって、隠れユニットｈの適切な個数ｋ^＊を決定する手順は次のようにすればよい。
まず図６のステップＳ２１において、前述したトレーニングを繰り返すことで、ｋ個（ｋ＝１，２，・・・）の隠れユニットｈ^ｋを持つ複数の予測モデルを準備する。 Therefore, the procedure for determining an appropriate number k ^* of hidden units h can be as follows.
First, in step S21 of FIG. 6, the above-mentioned training is repeated to prepare a plurality of prediction models having k (k=1, 2, . . . ) hidden units h ^k .

次いで、ステップＳ２２において、各予測モデルについてテストエラーを算出し、ステップＳ２３において、テストエラーが最小となる個数ｋ^＊を決定し、それに対応する予測モデルを最適モデルとする。なお、次に述べる予測モデルの組合せのために、テストエラーが小さい順に、最適モデル、第２最適モデル、第３最適モデル・・・のように順序付けしておいてもよい。 Next, in step S22, the test error is calculated for each prediction model, and in step S23, the number k ^* that minimizes the test error is determined, and the corresponding prediction model is set as the optimal model. Note that, for the combination of prediction models described below, the models may be ordered in ascending order of test error, such as the optimal model, the second optimal model, the third optimal model, and so on.

２－５．予測モデルの組合せによる最適化
予測精度を更に向上させるために、複数のモデルを組み合わせてもよい。つまり、ニューラルネットワークでは、誤差の点では最適モデルとあまり変わらないが、全く異なった構造を持つモデルを数多く作ることができる。これらモデルを組み合わせることで、単体モデルの欠点を相互に補うことができ、予測精度を更に向上させることができる。 2-5. Optimization by combining predictive models Multiple models may be combined to further improve the prediction accuracy. In other words, with neural networks, it is possible to create many models that are not very different from the optimal model in terms of error but have completely different structures. By combining these models, the shortcomings of each individual model can be mutually compensated for, and the prediction accuracy can be further improved.

具体的には、図７のステップＳ３１において、誤差関数（又は上述したテストエラーの比較結果）を用いて、複数の予測モデルを最適モデル、第２最適モデル、第３最適モデル・・・のように順位付けする。Specifically, in step S31 of FIG. 7, the error function (or the results of the comparison of the test errors described above) is used to rank the multiple predictive models as an optimal model, a second optimal model, a third optimal model, etc.

ステップＳ３２において、次式に従い、最適モデルの予測値と第２最適モデルの予測値との平均値を新たなモデルの予測値とし、ステップＳ３３において、この予測値と実験データとの誤差を算出する。
In step S32, the average value of the predicted value of the optimum model and the predicted value of the second optimum model is determined as the predicted value of the new model according to the following equation, and in step S33, the error between this predicted value and the experimental data is calculated.

そして、ステップＳ３４において、先に算出した誤差が最小となるかどうかを判定する。最小でないと判定されると、ステップＳ３５において、上式にしたがって次順位のモデルを組み合わせて、再度ステップＳ３３，Ｓ３４を実行する。Then, in step S34, it is determined whether the previously calculated error is a minimum. If it is determined that the error is not a minimum, in step S35, the next model is combined according to the above formula, and steps S33 and S34 are executed again.

誤差が最小と判定されると、ステップＳ３６において、そのときのモデルの組合せを予測モデルとして採用する。例えば、第４最適モデルまでを組み合わせることによって誤差が小さくなるが、第５最適モデル以降を足し合わせると逆に誤差が大きくなるとき、最適モデルから第４最適モデルまでを組み合わせたものを予測モデルとすればよい。If it is determined that the error is minimal, in step S36, the combination of models at that time is adopted as the prediction model. For example, if the error is reduced by combining the first four optimal models, but the error increases when the fifth optimal model and onward are added, the combination of the optimal model through the fourth optimal model can be used as the prediction model.

このとき、新たな予測のエラーバーσは、次式を用いて計算すればよい。
ただし、Ｎ：モデル数、ｙ_ｉ、σ_ｉ：個々のモデルの予測値及びエラーバーである。 In this case, the error bar σ of the new prediction may be calculated using the following formula:
where N is the number of models, and y _i and σ _i are the predicted values and error bars of each model.

ところで、本実施形態の予測モデルでは、ある接合条件に対して継手特性の予測値及びエラーバーを出力することができる。例えば図８の例では、接合速度と接合強度の予測値との関係が曲線Ｌで表されるとともに、曲線Ｌを挟むようにエラーバーの上限及び下限を示す曲線Ｄで表されている。あるいは、エラーバーはバラツキσ_νに対応する長さを持つ線分Ｅで表されてもよい。もちろん、予測値及びエラーバーは数値で表されてもよい。 In the prediction model of this embodiment, it is possible to output predicted values and error bars of joint characteristics for certain joining conditions. For example, in the example of Fig. 8, the relationship between the joining speed and the predicted value of the joining strength is represented by a curve L, and is also represented by curves D indicating the upper and lower limits of the error bar on either side of the curve L. Alternatively, the error bar may be represented by a line segment E having a length corresponding to the variation σ _ν . Of course, the predicted value and the error bar may be represented by numerical values.

このような予測値及びエラーバーを得ることで、ユーザは、継手特性の妥当な予測を得るとともに予測の確かさを知ることができる。エラーバーが大きいことは、実験データの数が不足しているか又は実験の精度が不十分であることを意味しているから、ユーザは、エラーバーの大きい領域において実験データを増やすか又は精度の高い実験を行うとよく（補強実験の実施）、これによりモデルの予測の信頼性を高めたり、当該材料に関する研究を更に深めたりすることができる。あるいは、予測モデルをＦＳＷ装置１０の制御のために利用する場合に、エラーバーの小さい領域にある予測値を優先的に採用することで、高精度かつ安定的な制御を期待できる。By obtaining such predicted values and error bars, the user can obtain a reasonable prediction of the joint characteristics and know the accuracy of the prediction. Since a large error bar means that the number of experimental data is insufficient or the accuracy of the experiment is insufficient, the user should increase the experimental data in the region with a large error bar or conduct a highly accurate experiment (conduct a reinforcement experiment), thereby improving the reliability of the model prediction and further deepening the research on the material. Alternatively, when the prediction model is used to control the FSW device 10, highly accurate and stable control can be expected by preferentially adopting the predicted value in the region with a small error bar.

また、予測モデルは、パラメータベクトルｗのバラツキσ_ｗに基づいて、例えば図９のように、継手特性に対する接合条件ごとの重要度（影響度合い）を示すことができる。これにより、ユーザは、該当する継手特性に対して影響力のある接合条件を把握することができ、これを研究・開発の指針として利用できる。また、ＦＳＷ装置１０の制御においては、例えば、重要度の高い接合条件を優先的に調整することで目標値に素早くかつ効率的に近づけることも可能である。
あるいは、予測モデルは接合条件を出力してもよい。例えば、予測モデルは、継手特性の予測値を用いて、より良い継手特性を得るための接合条件を算出し、出力してもよい。予測モデルは、より良い接合条件の選定に際して、上述したエラーバー及び重要度を参照してもよい。これによりユーザは、より優れた継手を得るための示唆を得ることができる。 Furthermore, the prediction model can indicate the importance (degree of influence) of each welding condition on the joint characteristics based on the variation σ _w of the parameter vector w, for example, as shown in Fig. 9. This allows the user to understand the welding conditions that have an influence on the relevant joint characteristics, and can use this as a guideline for research and development. In addition, in the control of the FSW device 10, it is also possible to quickly and efficiently approach the target value by, for example, preferentially adjusting the welding conditions with high importance.
Alternatively, the prediction model may output the joining conditions. For example, the prediction model may use the predicted values of the joint characteristics to calculate and output joining conditions for obtaining better joint characteristics. The prediction model may refer to the error bars and importance described above when selecting better joining conditions. This allows the user to obtain suggestions for obtaining a better joint.

このようにして得られた予測モデルについて、複数の入力変数（接合条件）のうち特定の入力を変化させ、残りの入力を変化させないと、出力（予測値）は、その特定の入力に対する応答特性を示すことになる。このような出力の応答特性を全ての入力変数について調べると、入力に対する出力のトレンドを把握することができるから、パラメータベクトルｗのバラツキσ_ｗとともに又はこれに代えて、応答特性に優れた接合条件を把握することができる。この知見をＦＳＷ装置１０の制御のために利用することもできるし、当該材料に関する研究に使用することもできる。 For the prediction model obtained in this way, if a specific input among the multiple input variables (joining conditions) is changed while the remaining inputs are not changed, the output (predicted value) will show the response characteristics to that specific input. By examining the response characteristics of such an output for all input variables, it is possible to grasp the trend of the output with respect to the input, and therefore it is possible to grasp the joining conditions with excellent response characteristics together with or instead of the variation σ _w of the parameter vector w. This knowledge can be used for controlling the FSW device 10, and can also be used for research on the material in question.

３．制御手法の例
上述したモデルはＦＳＷ装置の制御にも適用することができる。幾つかの適用形態が考えられ、その中には強化学習を利用した制御があるので、まず強化学習の概略を述べることとする。 3. Examples of control methods The above-mentioned model can also be applied to the control of an FSW device. Several application forms are possible, including control that utilizes reinforcement learning, so we will first provide an overview of reinforcement learning.

３－１．強化学習
強化学習とは、試行錯誤を通じて環境に適応する学習制御の枠組である。
学習主体であるエージェントは、時刻ｔにおいて、制御対象である環境の状態の観測値ｓ（ｔ）に応じて意志決定を行い、行動ａ（ｔ）を出力する。エージェントの行動により、環境はｓ（ｔ＋１）へ状態遷移し、その遷移に応じた報酬ｒ（ｔ）をエージェントへ与える。この一連の流れが繰り返されて、エージェントは、利得の最大化を目的として、状態観測から行動出力への方策πを学習する。 3-1. Reinforcement learning Reinforcement learning is a framework for learning control that adapts to the environment through trial and error.
The agent, which is the learning subject, makes a decision at time t according to the observed value s(t) of the state of the environment to be controlled, and outputs an action a(t). The agent's action causes the environment to transition to a state s(t+1), and a reward r(t) corresponding to the transition is given to the agent. This series of steps is repeated, and the agent learns a policy π from the state observation to the action output with the goal of maximizing the profit.

例えば、強化学習の一種であるＱ学習では、エージェントが、試行錯誤により、ある状態ｓの下で様々な行動ａをとり、そのときの報酬を用いて最適な行動価値Ｑ（ｓ，ａ）を学習する。行動価値関数Ｑ（ｓ，ａ）の更新式は、次式で表される。
ここで、ｓ_ｔ及びａ_ｔは、時刻ｔにおける環境の状態ｓ及び行動ａである。 For example, in Q-learning, which is a type of reinforcement learning, an agent takes various actions a in a certain state s by trial and error, and learns the optimal action value Q(s, a) using the reward at that time. The update formula for the action value function Q(s, a) is expressed as follows:
where s _t and a _t are the state s of the environment and the action a at time t.

行動ａ_ｔにより、環境は状態ｓ_ｔ＋１に遷移し、それに伴って報酬ｒ_ｔ＋１が算出される。上式において、ｍａｘＱ（ｓ_ｔ＋１，ａ）の項は、状態ｓ_ｔ＋１の下で、最もＱ値の高い行動ａに対応するＱ値であり、割引率γ（γは０＜γ≦１）が乗算されている。また、α（０＜α≦１）は学習係数である。 By the action a _t , the environment transitions to state s _t+1 , and the reward r _t+1 is calculated accordingly. In the above formula, the term maxQ(s _t+1 , a) is the Q value corresponding to the action a with the highest Q value in state s _t+1 , multiplied by the discount rate γ (γ is 0<γ≦1). In addition, α (0<α≦1) is the learning coefficient.

この更新式は、状態ｓ_ｔにおける行動ａ_ｔの評価値Ｑ（ｓ_ｔ、ａ_ｔ）よりも、次の状態ｓ_ｔ＋１における最良の行動の評価値ｍａｘ_ａＱ（ｓ_ｔ＋１、ａ）の方が大きければ、Ｑ（ｓ_ｔ、ａ_ｔ）を増加させ、逆に小さければ、Ｑ（ｓ_ｔ、ａ_ｔ）を減少させる。 This update equation increases Q(s _t , a _t ₎ if the evaluation value max _a Q(s t ₊₁ , a) of the best action in the next state s _t+1 _is greater than the evaluation value Q(s _t , a _t ) of action a t in state s t, and decreases Q(s _t , a _t ) if it is smaller.

ところで、Ｑ学習などの強化学習をＦＳＷ装置の制御に適用する場合、行動ａを例えば接合条件の操作量に対応させ、状態ｓを例えば種々の観測データに対応させればよい。また、報酬に関する条件としては、例えば接合の品質、接合速度、エネルギー消費量が挙げられ、これら要素に応じて変化する価値関数を構築するとよい。例えば接合の強度が目標値に近付くか一致すると報酬が増え、目標値から離れるほど報酬は小さくなるか減る。また、バリ発生量が少なければ報酬が増え、多ければ減る。さらに、エネルギー消費量が多ければ報酬は減り、少なければ増える。 When applying reinforcement learning such as Q-learning to the control of an FSW device, action a can be made to correspond, for example, to the manipulated variable of the joining conditions, and state s can be made to correspond, for example, to various observed data. Furthermore, conditions related to reward include, for example, the joining quality, joining speed, and energy consumption, and a value function can be constructed that changes depending on these elements. For example, the reward increases when the joining strength approaches or matches the target value, and the reward decreases or decreases as it deviates from the target value. Furthermore, the reward increases if the amount of burrs generated is small, and decreases if the amount is large. Furthermore, the reward decreases if energy consumption is high, and increases if energy consumption is low.

ＦＳＷが複雑な現象を伴うことを踏まえれば、行動価値関数Ｑ（ｓ，ａ）は、テーブルよりもむしろ、関数近似で表現されることが好ましい。行動価値関数Ｑ（ｓ，ａ）の関数近似を行うためには、まず行動価値関数Ｑ（ｓ，ａ）を、パラメータベクトルｗで表される関数Ｑ’（ｓ，ａ；ｗ）としてモデル化する。そして、学習中はＱ値そのものを更新するのではなく、Ｑ’のパラメータベクトルｗを更新するものとする。Given that FSW involves complex phenomena, it is preferable that the action value function Q(s, a) be expressed by a function approximation rather than a table. To perform a function approximation of the action value function Q(s, a), first, the action value function Q(s, a) is modeled as a function Q'(s, a; w) represented by a parameter vector w. Then, during learning, the parameter vector w of Q' is updated rather than updating the Q value itself.

そして、パラメータｗの学習のために、教師あり学習を応用することができる。摩擦攪拌接合のように複雑な問題に対処するためには、例えばニューラルネットワークを利用することができる。例えば、状態ｓを入力すると、可能な全ての行動ａに対する価値関数Ｑを出力するニューラルネットワークを準備し、これを関数近似Ｑ’（ｓ，ａ；ｗ）として用いればよい（Ｑネットワーク）。 Supervised learning can then be applied to learn the parameter w. To deal with complex problems such as friction stir welding, for example, neural networks can be used. For example, a neural network can be prepared that, when a state s is input, outputs a value function Q for all possible actions a, and this can be used as the function approximation Q'(s, a; w) (Q network).

ここで、関数近似のパラメータｗはニューラルネットワークの重みパラメータに対応している。そして、重みパラメータの調整は、次式の誤差関数を最小化するように行えばよい。
ここで、ｙ_ｔ（ｓ，ａ）は標的値（ターゲット）である。
したがって、先に述べた式４のエネルギー関数Ｍ（ｗ）や式５の誤差関数Ｅ_Ｄを利用することができる。 Here, the parameter w of the function approximation corresponds to the weight parameter of the neural network. The weight parameter can be adjusted so as to minimize the error function of the following equation:
Here, y _t (s, a) is the target value.
Therefore, the energy function M(w) of the above-mentioned equation 4 and the error function E _D of the above-mentioned equation 5 can be used.

もっとも、強化学習では標的値ｙ_ｔ（ｓ，ａ）として用いる教師データは与えられないので、上式の標的値ｙ_ｔを、例えば、環境から実際にサンプリングされた状態及び報酬で置き換えた、次式
を用いてもよい。 However, in reinforcement learning, training data to be used as the target value y _t (s, a) is not given, so the target value y _t in the above equation is replaced with, for example, the state and reward actually sampled from the environment, and the following equation is used:
may also be used.

そうすると、価値関数Ｑ’のパラメータｗの更新は、例えば次式
を用いて行えばよい。 Then, the parameter w of the value function Q' is updated, for example, as follows:
This can be done by using the following:

そして、パラメータｗ及び評価関数Ｑ’の更新は、例えば接合の度に行えばよい。なお、Ｑネットワークの実現方法として様々な手法が提案されており、このような手法を本実施形態に取り入れることができる。その際、上式を適宜修正したうえで実行処理が行われることになる。 The parameter w and the evaluation function Q' can be updated, for example, every time a connection is made. Various methods have been proposed for implementing the Q network, and these methods can be incorporated into this embodiment. In this case, the above formula is modified as appropriate before the execution process is performed.

４．制御装置の詳細
本実施形態では、上述の学習済みモデル又は学習モデルを用いてＦＳＷ装置１０を制御している。例えば、制御装置は、学習済みモデル（予測モデル）が出力した予測値に基づいてＦＳＷ装置の操作量を決定してもよいし（構成例１）、あるいは、強化学習の学習モデルに最適な操作量を判断させてもよい（構成例２）。 4. Details of the Control Device In this embodiment, the above-mentioned trained model or learning model is used to control the FSW device 10. For example, the control device may determine the operation amount of the FSW device based on a predicted value output by the trained model (prediction model) (Configuration Example 1), or may cause a reinforcement learning learning model to determine the optimal operation amount (Configuration Example 2).

４－１．制御装置の機能構成例１
まず、予測モデルの予測値に基づいてＦＳＷ装置の操作量を決定するタイプの制御装置２０について説明する。制御装置２０は、図１０に示すように、入力部２１、演算部２２、記憶部２３及び出力部２４の各機能部を含む。 4-1. Control device functional configuration example 1
First, a description will be given of a control device 20 that determines the manipulated variable of the FSW device based on the predicted value of a prediction model. As shown in FIG. 10 , the control device 20 includes an input unit 21, a calculation unit 22, a storage unit 23, and an output unit 24.

入力部２１は、ＦＳＷに関する観測量（観測データ）を取得する。入力部は、観測データを自ら計測して取得してもよいし、例えばＦＳＷ装置１０や他の計測器から取得してもよいし、更にはユーザ入力によって取得してもよい。また、観測データは、ＦＳＷの実行中に取得されてもよいし、実行後に取得されてもよい。更に、観測データは、例えばエネルギー消費量のように、複数の観測量の演算により得られるものでもよい。The input unit 21 acquires observation quantities (observation data) related to FSW. The input unit may acquire the observation data by measuring it itself, or may acquire it from, for example, the FSW device 10 or another measuring device, or may even acquire it by user input. Furthermore, the observation data may be acquired during the execution of FSW, or may be acquired after execution. Furthermore, the observation data may be obtained by calculating multiple observation quantities, such as energy consumption.

ここで、観測データは、ＦＳＷの接合条件に関するものと、継手特性に関するものを含む。ただし、接合条件に関する観測データと継手特性に関する観測データとは重複していてもよい。Here, the observation data includes data related to the joining conditions of the FSW and data related to the joint characteristics. However, the observation data related to the joining conditions and the observation data related to the joint characteristics may overlap.

具体的には、接合条件に関する観測データとして、例えば、回転ツールの回転速度、移動速度、印加荷重（又はツール挿入量）、前進角がある。なお、各接合条件の調整量は、制御装置２０により決定される。Specifically, the observed data on the joining conditions include, for example, the rotation speed of the rotating tool, the moving speed, the applied load (or the tool insertion amount), and the advance angle. The adjustment amount of each joining condition is determined by the control device 20.

また、継手特性に関する観測データしては、例えば、ＦＳＷ中の接合温度、ツール回転トルク、ツールＹ軸（接合方向）荷重、ツールＺ軸（垂直方向）荷重、攪拌部幅、施工部周辺でのバリ発生量、継手強度、ＨＡＺ軟化、硬度分布、最高硬度、耐食性等がある。これらの物理量の多くは攪拌部に関する情報であり、接合条件に応じて変化する。 Examples of observed data on joint characteristics include the joining temperature during FSW, tool rotational torque, tool Y-axis (joining direction) load, tool Z-axis (vertical) load, stir zone width, amount of burrs generated around the processing zone, joint strength, HAZ softening, hardness distribution, maximum hardness, corrosion resistance, etc. Many of these physical quantities are information related to the stir zone, and change depending on the joining conditions.

次いで演算部２２について説明すると、演算部２２は、記憶部２３に記憶された制御プログラムをメモリに読み込んで実行することで、観測データに基づいて継手特性を予測し、予測結果に基づいて最適な接合条件を決定する。演算部２２は、設定部２５、予測部２６及び更新部２７を含んでいる。Next, the calculation unit 22 will be described. The calculation unit 22 loads a control program stored in the storage unit 23 into memory and executes it to predict joint characteristics based on observed data and to determine optimal joining conditions based on the prediction results. The calculation unit 22 includes a setting unit 25, a prediction unit 26, and an update unit 27.

設定部２５は、ユーザ入力等に基づいて接合条件の初期設定を行う。設定される条件としては、例えば、接合対象となる材料の情報、目標とする継手特性（強度等）・作業時間・エネルギー消費量といった情報がある。また、設定部２５は、学習済みの予測モデルを用いて、目標となる継手特性を満たす接合条件を把握し、この条件を初期設定としてもよい。The setting unit 25 performs initial setting of the joining conditions based on user input, etc. The conditions to be set include, for example, information on the materials to be joined, the target joint characteristics (strength, etc.), the working time, and the amount of energy consumption. The setting unit 25 may also use a trained prediction model to determine the joining conditions that satisfy the target joint characteristics, and set these conditions as the initial settings.

次いで、予測部２６は、予測モデルに観測データを入力し、継手特性に関する予測値を得る。そして、更新部２７は、この予測値を目標値と比較し、比較結果に基づいて接合条件の操作量を更新する。例えば、接合強度が目標より不足している場合、更新部２７は、接合条件に微少量Δを加えた量を予測モデルに入力して予測値の変化を把握し、接合強度が目標値に近付くようにその接合条件を調整すればよい。このとき、更新部２７は、該当する接合条件の影響度合い及び信頼性（バラツキ）を考慮してもよい。Next, the prediction unit 26 inputs the observation data into the prediction model to obtain a predicted value for the joint characteristics. The update unit 27 then compares this predicted value with a target value and updates the manipulated variable of the joining conditions based on the comparison result. For example, if the joint strength is less than the target, the update unit 27 inputs an amount obtained by adding a small amount Δ to the joining condition into the prediction model to grasp the change in the predicted value, and adjusts the joining condition so that the joint strength approaches the target value. At this time, the update unit 27 may take into account the degree of influence and reliability (variation) of the corresponding joining condition.

そして、演算部２２は、予め設定された時間間隔で、観測と接合条件の調整とを繰り返し、決定した調整量を、出力部２４を介してＦＳＷ装置１０に指示する。Then, the calculation unit 22 repeats the observation and adjustment of the joining conditions at a preset time interval, and instructs the FSW device 10 of the determined adjustment amount via the output unit 24.

次いで、記憶部２３は、例えば、学習済みの予測モデル、ＦＳＷ装置１０の制御プログラム及び各種データを記憶している。各種データとしては、例えば、観測データ、接合条件の設定、報酬設定等がある。記憶部２３はまた、例えば継手形状、板厚、母材姿勢、ギャップ量などの施工条件を記憶してもよい。なお、記憶部２３の全部又は一部は、制御装置２０内に組み込まれていてもよいし、例えば外部コンピュータの記憶領域でもよい。Next, the memory unit 23 stores, for example, the learned prediction model, the control program of the FSW device 10, and various data. The various data include, for example, observation data, joining condition settings, and reward settings. The memory unit 23 may also store processing conditions such as joint shape, plate thickness, base material attitude, and gap amount. Note that all or a part of the memory unit 23 may be incorporated in the control device 20, or may be, for example, a memory area of an external computer.

出力部２４は、演算部２２からの指示をＦＳＷ装置１０に出力するほか、例えば、演算部２２の予測結果及びＦＳＷ装置１０への指示内容を、図示しないディスプレイに表示させてもよい。The output unit 24 outputs instructions from the calculation unit 22 to the FSW device 10, and may also, for example, display the prediction results of the calculation unit 22 and the instructions to the FSW device 10 on a display not shown.

次いで、図１１を参照して、制御装置２０の動作例を説明する。
まずステップＳ４１において、設定部２５は、ユーザ入力に応じて接合条件を設定する。次いで、ステップＳ４２において、入力部２１は観測データを取得し、記憶部２３又は外部の記憶装置に記憶する。そして、ステップＳ４３において、予測部２６は、学習済みの予測モデルを用いて継手特性の予測値を算出する。更に、ステップＳ４４において、更新部２７は、予測部２５の予測結果を参照し、継手特性を目標値に近付けるように接合条件を調整及び更新するとともに、調整後の接合条件を出力する。 Next, an example of the operation of the control device 20 will be described with reference to FIG.
First, in step S41, the setting unit 25 sets the joining conditions in response to a user input. Next, in step S42, the input unit 21 acquires observed data and stores it in the storage unit 23 or an external storage device. Then, in step S43, the prediction unit 26 calculates a predicted value of the joint characteristics using the trained prediction model. Furthermore, in step S44, the update unit 27 refers to the prediction result of the prediction unit 25, adjusts and updates the joining conditions so as to bring the joint characteristics closer to the target values, and outputs the adjusted joining conditions.

そして、制御装置２０は、これらステップＳ４２～Ｓ４４を設定された時間間隔で繰り返し実行する。これにより、ユーザの所望する継手特性に応じて適切な接合条件を自動的に決定することができる。The control device 20 then repeatedly executes steps S42 to S44 at set time intervals, allowing the control device 20 to automatically determine appropriate joining conditions according to the joint characteristics desired by the user.

つまり、予測モデルは、非線形関数で表現された隠れユニットを含むニューラルネットワークにより生成されるから、複雑な関係を伴う接合現象を適切に表現できる。これにより継手特性の予測精度が高まると同時に、この予測モデルが制御装置２０に組み込まれることで、ユーザの要望に応じて、より適切な接合条件を設定することが可能となる。そして、予測モデルの予測精度は、学習機会の増加に伴って更に高まり、更に適切な接合条件の設定が可能になる。したがって、多数の予備実験を必要とせず、また作業者の技術レベルによらずに、最適な接合条件を用いて各種継手を簡単に得ることができる。In other words, because the prediction model is generated by a neural network including hidden units expressed by nonlinear functions, it can appropriately represent joining phenomena involving complex relationships. This improves the prediction accuracy of the joint characteristics, and at the same time, by incorporating this prediction model into the control device 20, it becomes possible to set more appropriate joining conditions according to the user's requests. The prediction accuracy of the prediction model further improves as learning opportunities increase, making it possible to set even more appropriate joining conditions. Therefore, various joints can be easily obtained using optimal joining conditions without the need for numerous preliminary experiments and regardless of the skill level of the worker.

加えて、予測モデルは予測値にエラーバーを設けることができる。予測値のバラツキは使用したデータの不足及び／又は信頼性の不足を意味しており、積極的にデータを収集するべき対象を明確にすることができる。また、このエラーバーを踏まえた制御を行うことで、信頼性の高い制御を実現できる。 In addition, the predictive model can provide error bars for the predicted values. Variation in the predicted values indicates a lack of data used and/or a lack of reliability, making it possible to clarify targets for which data should be actively collected. Furthermore, highly reliable control can be achieved by controlling based on these error bars.

４－２．制御装置の機能構成例２
次に、強化学習の学習モデルに最適な操作量を判断させるタイプの制御装置１２０について説明する。 4-2. Control device functional configuration example 2
Next, a control device 120 of a type that determines an optimal operation amount for a learning model of reinforcement learning will be described.

制御装置１２０は、接合条件を決定した結果（環境の状態遷移）に基づいて報酬を計算し、計算された報酬に基づいて評価関数を更新し、かかる評価関数の更新を繰り返すことによって、報酬が最も多く得られるような（より好適な）接合条件を決定するようにプログラムされている。制御装置１２０は、図１２に示すように、入力部１２１、演算部１２２、記憶部１２３及び出力部１２４の各機能部を含む。The control device 120 is programmed to calculate a reward based on the result of determining the connection condition (environment state transition), update an evaluation function based on the calculated reward, and determine a (more suitable) connection condition that will obtain the most reward by repeating the update of the evaluation function. As shown in FIG. 12, the control device 120 includes the functional units of an input unit 121, a calculation unit 122, a memory unit 123, and an output unit 124.

入力部１２１は、先に述べた入力部２１と同様に、ＦＳＷに関する観測データを取得する。記憶部１２３は、例えば、学習モデル、制御プログラム、観測データ、設定情報等を記憶している。また、出力部１２４は、演算部１２２が決定した接合条件の操作量をＦＳＷ装置１０に指示する。The input unit 121 acquires observation data related to FSW, similar to the input unit 21 described above. The memory unit 123 stores, for example, a learning model, a control program, observation data, setting information, etc. The output unit 124 indicates to the FSW device 10 the operation amount of the joining condition determined by the calculation unit 122.

演算部１２２は、観測データに基づいて機械学習を実行し、最適な接合条件を決定する。つまり、演算部１２２は、強化学習におけるエージェントに相当する役割を果たす。かかる演算部１２２は、設定部１２５、関数更新部１２６、報酬計算部１２７及び方策決定部１２８を含んでいる。The calculation unit 122 performs machine learning based on the observation data and determines optimal joining conditions. In other words, the calculation unit 122 plays a role equivalent to an agent in reinforcement learning. The calculation unit 122 includes a setting unit 125, a function update unit 126, a reward calculation unit 127, and a policy determination unit 128.

設定部１２５は、ユーザ入力等に基づいて接合条件の初期設定を行うとともに、報酬に関する設定（例えば報酬及び価値関数の初期値、報酬の付与条件など）を行う。なお、接合条件の初期設定に当たっては、設定部１２５は、記憶部１２３又は外部の記憶装置に記憶された学習済みの予測モデルを用いて、目標となる継手特性を満たす接合条件を把握し、この条件を初期設定としてもよいThe setting unit 125 performs initial setting of the joining conditions based on user input, etc., and performs settings related to rewards (for example, initial values of rewards and value functions, conditions for granting rewards, etc.). In addition, when initially setting the joining conditions, the setting unit 125 may use a learned prediction model stored in the memory unit 123 or an external storage device to grasp joining conditions that satisfy the target joint characteristics, and set these conditions as the initial settings.

関数更新部１２６は、例えば、観測データ及び式２１を用いてパラメータベクトルｗを更新することで、可能な行動ａに対して最大となる価値関数Ｑ^＊を選択し、価値関数Ｑ’を更新する。報酬計算部１２７は、観測された環境の状態ｓに基づき、更新された価値関数Ｑ’を用いて報酬を計算する。方策決定部１２８は、更新された価値関数Ｑ’に対応する方策πを決定する。 Function update unit 126 selects a maximum value function Q ^* for a possible action a by updating parameter vector w using, for example, the observation data and Equation 21, and updates value function Q'. Reward calculation unit 127 calculates a reward using the updated value function Q' based on the observed environmental state s. Policy determination unit 128 determines a policy π corresponding to the updated value function Q'.

そして、演算部２２は、予め設定された時間間隔で、観測と接合条件の調整とを繰り返し、決定した方策つまり接合条件の調整量を、出力部２４を介してＦＳＷ装置１０に指示する。The calculation unit 22 then repeats the observation and adjustment of the joining conditions at a preset time interval, and instructs the FSW device 10 on the determined measure, i.e., the amount of adjustment of the joining conditions, via the output unit 24.

次いで、図１３を参照して、制御装置１２０の動作例を説明する。
まずステップＳ５１において、設定部１２５は、ユーザ入力に応じて接合条件等の初期設定を行う。次いで、ステップＳ５２において、方策決定部１２８は価値関数Ｑ’に基づいて方策πを決定する。併せて、ステップＳ５３において、関数更新部１２６は、学習モデルを用いて価値関数Ｑ’のパラメータベクトルｗを更新し、これにより価値関数Ｑ’を更新する。 Next, an example of the operation of the control device 120 will be described with reference to FIG.
First, in step S51, the setting unit 125 performs initial settings of joining conditions and the like in response to user input. Next, in step S52, the measure determination unit 128 determines the measure π based on the value function Q'. In addition, in step S53, the function update unit 126 updates the parameter vector w of the value function Q' using the learning model, thereby updating the value function Q'.

ステップＳ５４において、制御部１２０は、ＦＳＷ装置１０に方策πを実行させ、計測器３０から環境の新たな状態ｓを取得する。ステップＳ５５において、報酬計算部１２７は、観測された環境の新たな状態ｓ及び更新された価値関数Ｑ’を用いて報酬を計算する。In step S54, the control unit 120 causes the FSW device 10 to execute the strategy π and obtains a new state s of the environment from the measuring instrument 30. In step S55, the reward calculation unit 127 calculates the reward using the observed new state s of the environment and the updated value function Q'.

そして、制御装置２０は、これらステップＳ５２～Ｓ５５を設定された時間間隔で繰り返し実行する。これにより環境の状態に応じて接合条件を最適化することができる。つまり、このような処理が繰返されることで行動価値関数の信頼度が高められ、また、信頼性の高い行動価値関数に基づいてより多くの報酬を得るように方策を決定することで接合条件の設定をより最適に決定することが可能となる。The control device 20 then repeatedly executes steps S52 to S55 at set time intervals. This allows the connection conditions to be optimized according to the state of the environment. In other words, by repeating this process, the reliability of the action value function is increased, and by determining a strategy to obtain more rewards based on a highly reliable action value function, it becomes possible to more optimally determine the setting of the connection conditions.

換言すれば、制御装置２０は、種々の接合条件と当該接合条件の下で得られる接合部との関係を報酬の概念を介して学習し、ユーザの所望する継手特性に応じて適切な接合条件を自動的に決定することができる。したがって、多数の予備実験を必要とせず、また作業者の技術レベルによらずに、最適な接合条件を用いて各種継手を簡単に得ることができる。In other words, the control device 20 can learn the relationship between various joining conditions and the joints obtained under those joining conditions through the concept of reward, and automatically determine appropriate joining conditions according to the joint characteristics desired by the user. Therefore, various joints can be easily obtained using optimal joining conditions without the need for numerous preliminary experiments and regardless of the skill level of the operator.

以上、本発明の代表的な実施形態について説明したが、本発明はこれらに限定されるものではなく、種々の設計変更が可能であり、それらも本発明に含まれる。 The above describes typical embodiments of the present invention, but the present invention is not limited to these, and various design modifications are possible, which are also included in the present invention.

１・・・摩擦攪拌接合（ＦＳＷ）システム、
１０・・・摩擦攪拌接合（ＦＳＷ）装置、
２０・・・制御装置、
３０・・・計測器。

1...Friction stir welding (FSW) system,
10...Friction stir welding (FSW) device,
20...Control device,
30...Measuring instrument.

Claims

A trained model for causing a computer to function so as to output a predicted value of a joint characteristic based on input data indicating a joining condition of friction stir welding,
The neural network includes an input layer, an output layer, and a hidden layer interposed between the input layer and the output layer and having a plurality of hidden units represented by a nonlinear activation function;
the neural network learns weighting coefficients using information indicating a variance in experimental data indicating the joint characteristics so as to minimize a function represented by an error between the experimental data and the predicted value;
A trained model for causing a computer to function in such a way that, for input data indicating joining conditions for friction stir welding input to the input layer, calculations are performed based on the trained model, and a predicted value of joint characteristics is output from the output layer, along with information indicating the reliability of the predicted value.

the neural network further learns the weighting coefficients using information indicating the variation of the weighting coefficients;
The predicted model according to claim 1 , which causes a computer to function to further output information indicating a degree of influence of the joining condition on the predicted value.

The neural network comprises:
learning the weighting factors for each of a plurality of models having different numbers of hidden units;
The trained model according to claim 1 or 2, wherein a model among the plurality of models that has a minimum error between a predicted value and experimental data is set as a prediction model.

The neural network comprises:
learning the weighting factors for each of a plurality of models having different numbers of hidden units;
Ranking the plurality of models based on the error between the predicted values and the experimental data;
3. The trained model according to claim 1 or 2, wherein, among a plurality of new models generated by sequentially combining models from a higher ranking, a model that has a minimum error between a predicted value and experimental data is set as a prediction model.

An input unit for acquiring observation data indicating welding conditions in friction stir welding;
A storage unit that stores the trained model according to any one of claims 1 to 4;
A calculation unit that determines an operation amount of a friction stir welding apparatus based on a result of applying the observation data to the trained model; and
an output unit that instructs the friction stir welding apparatus on the determined operation amount;
A control device including:

An input unit for acquiring observation data indicating welding conditions and joint characteristics in friction stir welding;
A storage unit that stores a predetermined learning algorithm;
a calculation unit that determines an operation amount of the friction stir welding apparatus based on a result of applying the observation data to the predetermined learning algorithm; and
an output unit that instructs the friction stir welding apparatus on the determined operation amount;
A control device comprising:
The predetermined learning algorithm is
The control device is caused to function so as to output an appropriate operation amount of the friction stir welding device based on input data indicating welding conditions of the friction stir welding device and the joint characteristics.
The neural network includes an input layer, an output layer, and a hidden layer that is interposed between the input layer and the output layer and is represented by a nonlinear activation function.
The neural network learns weighting coefficients so as to minimize an error function represented by a value function including a relationship between the joining conditions and the joint characteristics;
A control device that performs calculations based on a learned model on input data indicating welding conditions for friction stir welding input to the input layer, and outputs an operation amount of the friction stir welding device from the output layer.

A friction stir welding apparatus for performing friction stir welding;
A measuring instrument for measuring observation data indicating welding conditions in the friction stir welding;
A control device according to claim 5 or 6;
A friction stir welding system comprising:

A neural network system for generating a trained model for causing a computer to function to output a predicted value of a joint characteristic based on input data indicating a joining condition of friction stir welding,
The neural network includes an input layer, an output layer, and a hidden layer interposed between the input layer and the output layer and having a plurality of hidden units represented by a nonlinear activation function,
The neural network comprises:
learning a weighting factor so as to minimize a function represented by an error between the experimental data and the predicted value, using information indicating a variation in the experimental data indicating the joint characteristics;
A neural network system configured to generate a trained model that performs calculations based on the trained model on input data indicating welding conditions for friction stir welding input to the input layer, and outputs a predicted value of joint characteristics from the output layer as well as information indicating the reliability of the predicted value.

A method for generating a trained model for causing a computer to function to output a predicted value of a joint characteristic based on input data indicating a joining condition of friction stir welding,
The learning model for generating the trained model includes a neural network including an input layer, an output layer, and a hidden layer interposed between the input layer and the output layer and having a plurality of hidden units represented by a nonlinear activation function;
a step of learning weighting coefficients in the neural network by using information indicating a variance in experimental data indicating the joint characteristics so as to minimize a function represented by an error between the experimental data and the predicted value;
A step of generating a trained model that performs a calculation based on the trained model on input data indicating welding conditions of friction stir welding input to the input layer, and outputs a predicted value of joint characteristics from the output layer and information indicating the reliability of the predicted value;
How to generate a trained model, including: