JP6814441B2

JP6814441B2 - Learning control device and learning control method for drive machines

Info

Publication number: JP6814441B2
Application number: JP2017167080A
Authority: JP
Inventors: 尭和田; 友近　信行; 信行友近; 一郎丸田; 俊治杉江
Original assignee: Kobe Steel Ltd; Kyoto University
Current assignee: Kobe Steel Ltd; Kyoto University
Priority date: 2017-08-31
Filing date: 2017-08-31
Publication date: 2021-01-20
Anticipated expiration: 2037-08-31
Also published as: JP2019042842A

Description

本発明は、繰り返し動作を行う駆動機械を制御するための学習制御装置及び学習制御方法に関する。 The present invention relates to a learning control device and a learning control method for controlling a driving machine that repeatedly operates.

一定の動作パターンを繰り返すロボット等の駆動機械が動作する場合に、駆動機械の固有振動に伴う振動が発生し、これによって高精度に動作の制御を行えないという問題がある。従来、かかる問題を解決するために、例えば機械先端点等の対象部位の前回周期における位置又は軌道誤差を求め、次回周期における位置又は軌道誤差を低減するように駆動機械を制御し、これを繰り返すことで位置又は軌道誤差を０付近までに収束させる学習制御方法が提案されている（特許文献１及び２参照）。 When a driving machine such as a robot that repeats a certain operation pattern operates, there is a problem that vibration is generated due to the natural vibration of the driving machine, and the operation cannot be controlled with high accuracy. Conventionally, in order to solve such a problem, for example, the position or trajectory error of a target part such as a machine tip point in the previous cycle is obtained, the drive machine is controlled so as to reduce the position or trajectory error in the next cycle, and this is repeated. Therefore, a learning control method for converging the position or orbital error to near 0 has been proposed (see Patent Documents 1 and 2).

特許文献１に開示された方法は、機械先端点に取り付けられた加速度センサによる検出加速度に基づいて機械先端点の本来の位置からのずれ量Δθを求め、位置指令Ｐｃから位置フィードバックＰ１を減じて第１の位置偏差ε１を求め、第１の位置偏差ε１にずれ量Δθを加算して第２の位置偏差ε２を求め、第２の位置偏差ε２から補正量を求め、これを第１の位置偏差ε１に加算して速度指令Ｖｃを求めるというものである。 In the method disclosed in Patent Document 1, the deviation amount Δθ from the original position of the machine tip point is obtained based on the detected acceleration by the acceleration sensor attached to the machine tip point, and the position feedback P1 is subtracted from the position command Pc. The first position deviation ε1 is obtained, the deviation amount Δθ is added to the first position deviation ε1 to obtain the second position deviation ε2, the correction amount is obtained from the second position deviation ε2, and this is the first position. The speed command Vc is obtained by adding to the deviation ε1.

特許文献２に開示された方法は、多関節ロボットの制御対象部位に設けられたセンサの検出結果から制御対象部位の位置を算出し、位置誤差を補正するための学習補正量を算出し、学習補正量に基づいて、制御対象部位の目標位置に関する位置指令データから算出される位置偏差データを補正し、補正された位置偏差データに基づいて多関節ロボットを所定の動作速度で動作させるというものである。この方法では、学習補正量の算出過程において、最大動作速度に至るまで多関節ロボットの動作速度を増加させながら学習補正量を算出する。 The method disclosed in Patent Document 2 calculates the position of the control target part from the detection result of the sensor provided in the control target part of the articulated robot, calculates the learning correction amount for correcting the position error, and learns. Based on the correction amount, the position deviation data calculated from the position command data related to the target position of the control target part is corrected, and the articulated robot is operated at a predetermined operation speed based on the corrected position deviation data. is there. In this method, in the process of calculating the learning correction amount, the learning correction amount is calculated while increasing the operation speed of the articulated robot up to the maximum operation speed.

特開２００６−１７２１４９号公報Japanese Unexamined Patent Publication No. 2006-172149 特開２０１２−２４０１４２号公報Japanese Unexamined Patent Publication No. 2012-240142

多関節ロボットのような多入出力システムでは、１つの入力が１つの出力に影響するだけでなく、他の出力にも干渉する。したがって、１つの出力の制御に１つの入力を用いただけでは、この出力の制御を正確に行うことはできない。しかしながら、上記の特許文献１及び２には、かかる多入出力システムにおける干渉問題が考慮されておらず、干渉を抑制した正確な制御を行うことはできない。 In a multi-input / output system such as an articulated robot, one input not only affects one output, but also interferes with other outputs. Therefore, it is not possible to accurately control this output by using only one input for controlling one output. However, the above-mentioned Patent Documents 1 and 2 do not consider the interference problem in such a multi-input / output system, and accurate control that suppresses the interference cannot be performed.

本発明は斯かる事情に鑑みてなされたものであり、その主たる目的は、上記課題を解決することができる駆動機械の学習制御装置及び学習制御方法を提供することにある。 The present invention has been made in view of such circumstances, and a main object thereof is to provide a learning control device and a learning control method for a drive machine capable of solving the above problems.

上述した課題を解決するために、本発明の一の態様の駆動機械の学習制御装置は、複数の可動部位を有する駆動機械に一定の繰り返し動作を複数回実行させるように制御する駆動機械の学習制御装置であって、前記駆動機械の前記可動部位である複数の対象部位それぞれにおいて観測された位置に関する物理量である観測物理量を取得する観測物理量取得手段と、Ｎ回目の前記繰り返し動作における前記複数の対象部位それぞれに対する目標の前記物理量である目標物理量のそれぞれと、前記観測物理量取得手段によってＮ回目の前記繰り返し動作において取得された前記複数の対象部位それぞれに対する前記観測物理量のそれぞれとに基づいて、前記複数の対象部位毎に学習出力信号を生成する学習制御手段と、前記学習制御手段によって生成された前記学習出力信号に基づいて、Ｎ＋１回目の前記繰り返し動作を前記駆動機械に実行させるための制御信号を、前記複数の対象部位それぞれに対して生成する制御信号生成手段とを備える。 In order to solve the above-mentioned problems, the learning control device for a driving machine according to one aspect of the present invention is for learning a driving machine that controls a driving machine having a plurality of movable parts to execute a certain repetitive operation a plurality of times. An observation physical quantity acquisition means for acquiring an observation physical quantity which is a physical quantity related to a position observed at each of a plurality of target parts which are the movable parts of the drive machine, and the plurality of the control devices in the Nth repetitive operation. Based on each of the target physical quantities which are the target physical quantities for each target part and each of the observed physical quantities for each of the plurality of target parts acquired in the Nth repetitive operation by the observed physical quantity acquisition means. A control signal for causing the driving machine to perform the N + 1th repetitive operation based on the learning control means that generates a learning output signal for each of a plurality of target parts and the learning output signal generated by the learning control means. Is provided with a control signal generation means for generating each of the plurality of target parts.

この態様において、前記学習制御手段は、前記駆動機械の時間空間におけるＮ回目の前記繰り返し動作における前記目標物理量及び前記観測物理量を、所定の基底信号の射影空間に射影した目標物理量射影成分及び観測物理量射影成分を算出する射影手段と、前記射影手段により算出されたＮ回目の前記繰り返し動作における前記目標物理量射影成分及び前記観測物理量射影成分に基づいて、Ｎ＋１回目の前記繰り返し動作における前記射影空間での学習出力信号射影成分を算出する学習出力射影成分算出手段と、前記学習出力射影成分算出手段により算出された前記学習出力信号射影成分に基づいて、Ｎ＋１回目の前記繰り返し動作における前記時間空間での前記学習出力信号を算出する学習出力信号算出手段とを有してもよい。 In this embodiment, the learning control means projects the target physical quantity and the observed physical quantity in the Nth repetitive operation in the time space of the driving machine onto a projection space of a predetermined base signal, and the target physical quantity projection component and the observed physical quantity. Based on the projection means for calculating the projection component, the target physical quantity projection component in the Nth repetitive operation calculated by the projection means, and the observed physical quantity projection component, in the projection space in the N + 1th repetitive operation. Based on the learning output projection component calculating means for calculating the learning output signal projection component and the learning output signal projection component calculated by the learning output projection component calculating means, the said in the time space in the N + 1th repetitive operation. It may have a learning output signal calculating means for calculating a learning output signal.

また、上記態様において、前記学習出力射影成分算出手段は、前記時間空間における前記観測物理量と前記学習出力信号との関係を示す数理モデルを前記射影空間における数理モデルに変換した射影空間モデルに基づいて構成された制御器により、前記学習出力信号射影成分を算出するように構成されていてもよい。 Further, in the above aspect, the learning output projection component calculation means is based on a projection space model obtained by converting a mathematical model showing the relationship between the observed physical quantity in the time space and the learning output signal into a mathematical model in the projection space. The controller may be configured to calculate the learning output signal projection component.

また、上記態様において、前記学習出力射影成分算出手段は、前記射影空間モデルを定常誤差ゼロで安定化させる制御器として構成されていてもよい。 Further, in the above aspect, the learning output projective component calculation means may be configured as a controller that stabilizes the projective space model with a steady error of zero.

また、上記態様において、前記基底信号は、前記対象部位の追従誤差に関する信号であってもよい。 Further, in the above aspect, the base signal may be a signal relating to the tracking error of the target portion.

また、上記態様において、前記基底信号は、前記繰り返し動作の周波数の整数倍の周波数成分を有する正弦波信号の組合せであってもよい。 Further, in the above aspect, the base signal may be a combination of sinusoidal signals having a frequency component that is an integral multiple of the frequency of the repetitive operation.

また、本発明の他の態様の駆動機械の学習制御方法は、複数の可動部位を有する駆動機械に一定の繰り返し動作を複数回実行させるように制御する駆動機械の学習制御方法であって、前記駆動機械の前記可動部位である複数の対象部位それぞれにおいて観測された位置に関する物理量である観測物理量を取得するステップと、Ｎ回目の前記繰り返し動作における前記複数の対象部位それぞれに対する目標の前記物理量である目標物理量のそれぞれと、Ｎ回目の前記繰り返し動作において取得された前記複数の対象部位それぞれに対する前記観測物理量のそれぞれとに基づいて、前記複数の対象部位毎に学習出力信号を生成するステップと、生成された前記学習出力信号に基づいて、Ｎ＋１回目の前記繰り返し動作を前記駆動機械に実行させるための制御信号を、前記複数の対象部位それぞれに対して生成するステップとを有する。 Further, the learning control method for a driving machine according to another aspect of the present invention is a learning control method for a driving machine that controls a driving machine having a plurality of movable parts to execute a certain repetitive operation a plurality of times. The step of acquiring the observed physical quantity, which is a physical quantity related to the position observed in each of the plurality of target parts, which are the movable parts of the drive machine, and the target physical quantity for each of the plurality of target parts in the Nth repetitive operation. A step of generating a learning output signal for each of the plurality of target parts based on each of the target physical quantities and each of the observed physical quantities for each of the plurality of target parts acquired in the Nth repetitive operation. Based on the learned output signal, the step includes a step of generating a control signal for causing the driving machine to execute the N + 1th repetitive operation for each of the plurality of target parts.

本発明に係る駆動機械の学習制御装置及び学習制御方法によれば、多入出力システムにおける出力に対する入力の干渉を抑制することが可能となる。 According to the learning control device and the learning control method of the drive machine according to the present invention, it is possible to suppress the interference of the input with respect to the output in the multi-input / output system.

実施の形態に係る自動溶接システムの構成を示す模式図。The schematic diagram which shows the structure of the automatic welding system which concerns on embodiment. 実施の形態に係る学習制御装置の構成を示すブロック図。The block diagram which shows the structure of the learning control apparatus which concerns on embodiment. 実施の形態に係る学習制御装置による制御の原理を説明するための機能ブロック図。The functional block diagram for demonstrating the principle of control by the learning control apparatus which concerns on embodiment. 学習制御部の構成を示す機能ブロック図。The functional block diagram which shows the structure of the learning control part. ２リンクロボットの構成を示す模式図。The schematic diagram which shows the structure of the 2 link robot. 学習出力射影成分算出部の構築を説明するための機能ブロック図。A functional block diagram for explaining the construction of the learning output projection component calculation unit. 学習出力射影成分算出部の構築例を示す機能ブロック図。The functional block diagram which shows the construction example of the learning output projection component calculation part. 実施の形態に係る学習制御装置の動作の手順を示すフローチャート。The flowchart which shows the procedure of operation of the learning control device which concerns on embodiment. 学習出力信号生成処理の手順を示すフローチャート。The flowchart which shows the procedure of the learning output signal generation processing. 従来手法の構成を示す機能ブロック図。A functional block diagram showing the configuration of the conventional method. 従来手法における１軸目の角度指令信号及び関節角度の時間変化を示すグラフ。The graph which shows the time change of the angle command signal and the joint angle of the 1st axis in the conventional method. 従来手法における２軸目の角度指令信号及び関節角度の時間変化を示すグラフ。The graph which shows the angle command signal of the 2nd axis and the time change of a joint angle in the conventional method. 従来手法におけるウィービング動作中の２リンクロボットの先端部位の横方向移動及び上下動を示すグラフ。The graph which shows the lateral movement and the vertical movement of the tip part of a 2-link robot during a weaving operation in the conventional method. 従来手法におけるウィービング動作中の２リンクロボットの先端部位の移動を示すグラフ。The graph which shows the movement of the tip part of a 2-link robot during a weaving operation in a conventional method. 本手法の学習初期における１軸目の角度指令信号及び関節角度の時間変化を示すグラフ。The graph which shows the time change of the angle command signal and the joint angle of the 1st axis in the early stage of learning of this method. 本手法の学習初期における２軸目の角度指令信号及び関節角度の時間変化を示すグラフ。The graph which shows the time change of the angle command signal and the joint angle of the 2nd axis in the early stage of learning of this method. 本手法の学習初期におけるウィービング動作中の２リンクロボットの先端部位の横方向移動及び上下動を示すグラフ。The graph which shows the lateral movement and the vertical movement of the tip part of the 2-link robot during the weaving operation in the early stage of learning of this method. 本手法の学習初期におけるウィービング動作中の２リンクロボットの先端部位の移動を示すグラフ。The graph which shows the movement of the tip part of a 2-link robot during the weaving operation in the early stage of learning of this method. 本手法の学習後期における１軸目の角度指令信号及び関節角度の時間変化を示すグラフ。The graph which shows the time change of the angle command signal and the joint angle of the 1st axis in the learning period of this method. 本手法の学習後期における２軸目の角度指令信号及び関節角度の時間変化を示すグラフ。The graph which shows the time change of the angle command signal of the 2nd axis and the joint angle in the latter part of learning of this method. 本手法の学習後期におけるウィービング動作中の２リンクロボットの先端部位の横方向移動及び上下動を示すグラフ。The graph which shows the lateral movement and up-and-down movement of the tip part of a 2-link robot during the weaving operation in the latter part of learning of this method. 本手法の学習後期におけるウィービング動作中の２リンクロボットの先端部位の移動を示すグラフ。The graph which shows the movement of the tip part of the 2-link robot during the weaving operation in the latter part of learning of this method.

以下、本発明の好ましい実施の形態を、図面を参照しながら説明する。なお、以下に示す各実施の形態は、本発明の技術的思想を具体化するための方法及び装置を例示するものであって、本発明の技術的思想は下記のものに限定されるわけではない。本発明の技術的思想は、特許請求の範囲に記載された技術的範囲内において種々の変更を加えることができる。また、以下に示す各実施の形態では多関節マニピュレータの溶接ロボットを例に挙げて説明するが、本発明の適用対象はこれらに限定されるわけではなく、多入出力システムであれば多関節マニピュレータ以外の駆動機械を適用対象とすることも可能である。 Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings. It should be noted that each of the embodiments shown below exemplifies a method and an apparatus for embodying the technical idea of the present invention, and the technical idea of the present invention is not limited to the following. Absent. The technical idea of the present invention can be modified in various ways within the technical scope described in the claims. Further, in each of the following embodiments, a welding robot of an articulated manipulator will be described as an example, but the application target of the present invention is not limited to these, and if it is an articulated input / output system, the articulated manipulator It is also possible to apply other drive machines.

＜自動溶接システムの構成＞
図１は、本実施の形態に係る自動溶接システムの構成を示す模式図である。自動溶接システム１０は、溶接ロボット２０と、学習制御装置３０と、電源装置４０とを備えている。 <Structure of automatic welding system>
FIG. 1 is a schematic view showing a configuration of an automatic welding system according to the present embodiment. The automatic welding system 10 includes a welding robot 20, a learning control device 30, and a power supply device 40.

溶接ロボット２０は、垂直多関節型のマニピュレータから構成され、その先端に溶接トーチ２１を有している。本実施の形態に係る溶接ロボット２０は、ＭＩＧ（Metal Inert Gas）溶接又はＭＡＧ（Metal Active Gas）溶接等の溶極式のアーク溶接を行う。かかる溶接ロボット２０は、学習制御装置３０及び電源装置４０のそれぞれに接続されている。 The welding robot 20 is composed of a vertical articulated manipulator, and has a welding torch 21 at its tip. The welding robot 20 according to the present embodiment performs welding electrode type arc welding such as MIG (Metal Inert Gas) welding or MAG (Metal Active Gas) welding. The welding robot 20 is connected to each of the learning control device 30 and the power supply device 40.

溶接トーチ２１にはワイヤ送給装置２３から溶接ワイヤ２４が送り込まれ、溶接トーチ２１の先端からこれが送り出される。電源装置４０は定電圧電源装置であり、溶接ワイヤ２４に電力を供給する。これにより、溶接ワイヤ２４とワーク（被溶接材）５０との間に溶接電圧が印加され、アークが発生する。また、溶接ロボット２０の先端部位、即ち溶接トーチ２１を支持するアームの先端部分には、加速度センサ２５が設けられている。加速度センサ２５は、学習制御装置３０に接続されている。電源装置４０は、溶接中に生じる溶接電流を検出する電流センサ（図示せず）を備えている。 The welding wire 24 is sent from the wire feeding device 23 to the welding torch 21, and is sent out from the tip of the welding torch 21. The power supply device 40 is a constant voltage power supply device and supplies electric power to the welding wire 24. As a result, a welding voltage is applied between the welding wire 24 and the work (material to be welded) 50, and an arc is generated. Further, an acceleration sensor 25 is provided at the tip portion of the welding robot 20, that is, the tip portion of the arm that supports the welding torch 21. The acceleration sensor 25 is connected to the learning control device 30. The power supply device 40 includes a current sensor (not shown) that detects a welding current generated during welding.

電源装置４０は、ＣＰＵとメモリとを備えており、電源制御用のコンピュータプログラムをＣＰＵが実行することで溶接電力の制御を行う。また、電源装置４０はワイヤ送給装置２３に接続されており、ＣＰＵがワイヤの送給速度を制御する。かかる電源装置４０は、学習制御装置３０との間でデータ通信を行う。 The power supply device 40 includes a CPU and a memory, and controls welding power by executing a computer program for power supply control by the CPU. Further, the power supply device 40 is connected to the wire feeding device 23, and the CPU controls the wire feeding speed. The power supply device 40 performs data communication with the learning control device 30.

次に、学習制御装置３０の構成について説明する。学習制御装置３０は、溶接ロボット２０の動作を制御する。図２は、学習制御装置３０の構成を示すブロック図である。学習制御装置３０は、ＣＰＵ３０１と、メモリ３０２と、複数のスイッチを含む操作パネル３０３と、教示ペンダント３０４と、入出力部３０５と、通信部３０６とを備えている。 Next, the configuration of the learning control device 30 will be described. The learning control device 30 controls the operation of the welding robot 20. FIG. 2 is a block diagram showing the configuration of the learning control device 30. The learning control device 30 includes a CPU 301, a memory 302, an operation panel 303 including a plurality of switches, a teaching pendant 304, an input / output unit 305, and a communication unit 306.

溶接ロボット２０の学習制御用のコンピュータプログラムである学習制御プログラム３１０がメモリ３０２に格納されており、この学習制御プログラム３１０をＣＰＵ３０１が実行することで、溶接ロボット２０による溶接動作の学習制御が行われる。 A learning control program 310, which is a computer program for learning control of the welding robot 20, is stored in the memory 302. When the CPU 301 executes the learning control program 310, the welding robot 20 performs learning control of the welding operation. ..

学習制御装置３０に対する指示の入力には、操作パネル３０３及び教示ペンダント３０４が用いられる。オペレータは、教示ペンダント３０４に教示プログラムを入力することができる。学習制御装置３０は、教示ペンダント３０４から入力された教示プログラムにしたがって、溶接ロボット２０を制御する。また、この教示プログラムは、図示しないコンピュータによって作成することも可能である。この場合、可搬型記録媒体によって受け渡ししたり、データ通信によって伝送したりして、教示プログラムを学習制御装置３０に与えることができる。 The operation panel 303 and the teaching pendant 304 are used for inputting instructions to the learning control device 30. The operator can input the teaching program to the teaching pendant 304. The learning control device 30 controls the welding robot 20 according to the teaching program input from the teaching pendant 304. Further, this teaching program can also be created by a computer (not shown). In this case, the teaching program can be given to the learning control device 30 by being delivered by a portable recording medium or transmitted by data communication.

入出力部３０５には、上述した加速度センサ２５、並びに、電源装置４０に設けられた電流センサ、溶接ロボット２０のアクチュエータであるモータの駆動回路、角度センサ及び角速度センサ（図示せず）が接続されている。加速度センサ２５によって検出された加速度、電流センサによって検出された溶接電流の電流値、溶接ロボット２０の各軸のモータの回転角度及び角速度が入出力部３０５に入力され、ＣＰＵ３０１に与えられる。また、ＣＰＵ３０１は、学習制御プログラム３１０により、後述するような溶接ロボット２０の学習制御を行い、制御信号を溶接ロボット２０のモータの駆動回路それぞれに出力する。 The above-mentioned acceleration sensor 25, a current sensor provided in the power supply device 40, a drive circuit of a motor which is an actuator of the welding robot 20, an angle sensor, and an angular velocity sensor (not shown) are connected to the input / output unit 305. ing. The acceleration detected by the acceleration sensor 25, the current value of the welding current detected by the current sensor, the rotation angle and the angular velocity of the motor of each axis of the welding robot 20 are input to the input / output unit 305 and given to the CPU 301. Further, the CPU 301 performs learning control of the welding robot 20 as described later by the learning control program 310, and outputs a control signal to each of the drive circuits of the motor of the welding robot 20.

通信部３０６は、有線又は無線通信機能を有する。かかる通信部３０６は、所定の通信プロトコルを使用して電源装置４０との間でデータ通信を行う。 The communication unit 306 has a wired or wireless communication function. The communication unit 306 performs data communication with the power supply device 40 using a predetermined communication protocol.

以上のような構成の学習制御装置３０は、溶接ロボット２０の各軸のモータを制御して、溶接トーチ２１の位置及び速度を制御する。かかる学習制御装置３０は、溶接ロボット２０にウィービング動作を実行させる。ウィービング動作は、溶接方向に対して交差する方向に溶接トーチ２１を交互に揺動させる繰り返し動作である。学習制御装置３０は、設定されたウィービング周期、振幅、溶接速度によってウィービング動作を行うように溶接ロボット２０を制御する。 The learning control device 30 having the above configuration controls the motor of each axis of the welding robot 20 to control the position and speed of the welding torch 21. The learning control device 30 causes the welding robot 20 to perform a weaving operation. The weaving operation is a repetitive operation in which the welding torch 21 is alternately swung in a direction intersecting the welding direction. The learning control device 30 controls the welding robot 20 so as to perform the weaving operation according to the set weaving cycle, amplitude, and welding speed.

＜学習制御の原理＞
図３は、本実施の形態に係る学習制御装置による制御の原理を説明するための機能ブロック図である。学習制御装置３０は、溶接ロボット２０に所要の動作を行わせるために各モータの角度指令を算出し、それぞれの角度指令と、各モータの角度及び角速度の測定値とに基づいて、角度指令通りにモータが動作するようにモータに電流を与える。学習制御装置３００は軸毎にモータを制御する１軸目のモータ制御部３１１，２軸目のモータ制御部３２１，…の各機能ブロックを有しており、モータ制御部３１１，３２１，…は各別に角度・角速度制御部３１２，３２２，…と電流制御部３１３，３２３，…とを有している。１軸目の角度・角速度制御部３１２には１軸目のモータへの角度指令信号θ_{ＲＥＦ，１}（ｔ）と、１軸目のモータの角度及び角速度の測定値が入力される。角度・角速度制御部３１２は入力された角度指令信号θ_{ＲＥＦ，１}（ｔ）並びに角度及び角速度の測定値から１軸目のモータへのトルク指令を算出する。２軸目のモータ制御部３２１も同様にして２軸目のモータへのトルク指令を算出する。３軸目以降のモータ制御部も同様である。 <Principle of learning control>
FIG. 3 is a functional block diagram for explaining the principle of control by the learning control device according to the present embodiment. The learning control device 30 calculates an angle command for each motor in order to cause the welding robot 20 to perform a required operation, and based on each angle command and the measured values of the angle and the angular velocity of each motor, as per the angle command. Apply current to the motor so that it operates. The learning control device 300 has each functional block of the first-axis motor control unit 311, 2nd-axis motor control unit 321, ... That controls the motor for each axis, and the motor control units 311, 321, ... Each of them has an angle / angular velocity control unit 312, 322, ... And a current control unit 313, 323, .... The angle command signal θ _{REF, 1} (t) to the motor of the first axis and the measured values of the angle and the angular velocity of the motor of the first axis are input to the angle / angular velocity control unit 312 of the first axis. The angle / angular velocity control unit 312 calculates a torque command to the motor of the first axis from the input angle command signal θ _{REF, 1} (t) and the measured values of the angle and the angular velocity. The second-axis motor control unit 321 also calculates the torque command to the second-axis motor in the same manner. The same applies to the motor control unit for the third and subsequent axes.

溶接ロボット２０の先端部位に設けられた加速度センサ２５の検出加速度は、データ変換部３１４により位置データ（ロボット先端位置信号）に変換され、座標変換部３１５に入力される。座標変換部３１５は、溶接ロボット２０の運動学モデルを用いて、ロボット先端位置信号（直交座標系での位置）から各軸の関節角度を示す関節角度位置信号を算出する。以下、変換された関節角度位置信号を「対象部位位置信号」という。 The detected acceleration of the acceleration sensor 25 provided at the tip portion of the welding robot 20 is converted into position data (robot tip position signal) by the data conversion unit 314 and input to the coordinate conversion unit 315. The coordinate conversion unit 315 calculates a joint angle position signal indicating the joint angle of each axis from the robot tip position signal (position in the Cartesian coordinate system) using the kinematic model of the welding robot 20. Hereinafter, the converted joint angle position signal is referred to as a “target site position signal”.

１軸目の対象部位位置信号θ_Ｌ，１（ｔ），２軸目の対象部位位置信号θ_Ｌ，２（ｔ），…は、ウィービング動作の各周期において取得され、学習制御部３１６に与えられる。また、学習制御部３１６には、各軸の角度指令信号θ_{ＲＥＦ，１}（ｔ），θ_{ＲＥＦ，２}（ｔ），…も与えられる。学習制御部３１６は、Ｎ回目のウィービング動作における各対象部位位置信号θ_Ｌ，ｉ（ｔ）（つまり、観測物理量）及び各角度指令信号θ_{ＲＥＦ，ｉ}（ｔ）（つまり、目標物理量）とに基づいて、Ｎ＋１回目のウィービング動作用の学習出力信号ｚ_ｉ（ｔ）を軸（つまり、対象部位）毎に出力する。なお、ｉは溶接ロボット２０の関節（軸）を示すインデックスである。 The target site position signals θ _{L, 1} (t) of the first axis, the target site position signals θ _{L, 2} (t), ... Of the second axis are acquired in each cycle of the weaving operation and given to the learning control unit 316. Be done. Further, the learning control unit 316 is also given angle command signals θ _{REF, 1} (t), θ _{REF, 2} (t), ... For each axis. The learning control unit 316 sets each target site position signal θ _{L, i} (t) (that is, observed physical quantity) and each angle command signal θ _{REF, i} (t) (that is, target physical quantity) in the Nth weaving operation. based on, and outputs the N + 1 th learning output signal z _i for weaving operation _(t) axis (i.e., target site) for each. Note that i is an index indicating the joint (axis) of the welding robot 20.

各軸の学習出力信号ｚ_ｉは、対応する軸のＮ＋１回目のウィービング動作用のトルク指令に加算され、そのトルク指令が電流制御部３１３，３２３，…に入力される。つまり、１軸目の学習出力信号ｚ_１は、１軸目の角度・角速度制御部３１２から出力されたトルク指令に加算され、このトルク指令が１軸目の電流制御部３１３に与えられる。２軸目以降の学習出力信号ｚ_２，ｚ_３，…も同様である。電流制御部３１３，３２３，…は、与えられたトルク指令を対応するモータが出力するための電流値を算出し、これを制御信号として出力する。このようにして算出された電流値の電流が各モータに供給され、各モータが駆動される。 Learning output signal z _i of each axis is added to the torque command for the N + 1 th weaving operation of the corresponding axis, the torque command current control unit 313 and 323, are input to .... In other words, learning the output signal z ₁ of the first axis is added to the torque command output from the first axis angular-velocity control section 312, the torque command is given to the first axis of the current control unit 313. Biaxial subsequent learning output signal _{_z} 2, _z _3, ... is the same. The current control units 313, 323, ... Calculate the current value for the corresponding motor to output the given torque command, and output this as a control signal. The current of the current value calculated in this way is supplied to each motor, and each motor is driven.

ここで、学習制御部３１６についてさらに詳しく説明する。図４は、学習制御部３１６の構成を示す機能ブロック図である。学習制御部３１６は、信号記憶部３３１と、信号射影部３３２と、学習出力射影成分算出部３３３と、学習出力信号生成部３３４とを有している。学習制御部３１６に入力された角度指令信号θ_{ＲＥＦ，ｉ}（ｔ）及び対象部位位置信号θ_Ｌ，ｉ（ｔ）は信号記憶部３３１に記憶される。かかる信号記憶部３３１は、Ｎ回目のウィービング動作１周期分の角度指令信号及び対象部位位置信号θ_{ＲＥＦ，ｉ}（ｔ），θ_Ｌ，ｉ（ｔ）（ｔ∈［（Ｎ−１）Ｔ，ＮＴ］、Ｔ：１回のウィービング動作の期間）を記憶する。 Here, the learning control unit 316 will be described in more detail. FIG. 4 is a functional block diagram showing the configuration of the learning control unit 316. The learning control unit 316 has a signal storage unit 331, a signal projection unit 332, a learning output projection component calculation unit 333, and a learning output signal generation unit 334. The angle command signals θ _{REF, i} (t) and the target site position signals θ _{L, i} (t) input to the learning control unit 316 are stored in the signal storage unit 331. The signal storage unit 331 includes an angle command signal for one cycle of the Nth weaving operation and a target site position signal θ _{REF, i} (t), θ _{L, i} (t) (t ∈ [(N-1) T,). NT], T: The period of one weaving operation) is stored.

信号記憶部３３１によって記憶されたＮ回目のウィービング動作１周期分の角度指令信号及び対象部位位置信号θ_{ＲＥＦ，ｉ}（ｔ），θ_Ｌ，ｉ（ｔ）（ｔ∈［（Ｎ−１）Ｔ，ＮＴ］）は、信号射影部３３２に与えられる。信号射影部３３２は、角度指令信号及び対象部位位置信号θ_{ＲＥＦ，ｉ}（ｔ），θ_Ｌ，ｉ（ｔ）（ｔ∈［（Ｎ−１）Ｔ，ＮＴ］）を、次式（１）で定義される基底Ｆに射影し、角度指令射影成分及び対象部位位置射影成分を算出する。
ここで、信号θ_{ＲＥＦ，ｉ}（ｔ）を基底Ｆ上に射影することを次式（２）で表現するものとする。
本実施の形態に係る溶接ロボット２０のように複数のアクチュエータを有する多入出力システムの駆動機械においては、各関節の角度指令射影成分を並べるようにして、全体の角度指令射影成分を表現する。なお、対象部位位置射影成分についても同様である。 Angle command signal and target site position signal for one cycle of the Nth weaving operation stored by the signal storage unit 331 θ _{REF, i} (t), θ _{L, i} (t) (t ∈ [(N-1) T) , NT]) is given to the signal projection unit 332. The signal projection unit 332 applies the angle command signal and the target site position signal θ _{REF, i} (t), θ _{L, i} (t) (t ∈ [(N-1) T, NT]) to the following equation (1). Projects onto the basis F defined in, and calculates the angle command projection component and the target site position projection component.
Here, it is assumed that the projection of the signals θ _{REF, i} (t) on the basis F is expressed by the following equation (2).
In a drive machine of a multi-input / output system having a plurality of actuators such as the welding robot 20 according to the present embodiment, the angle command projection component of each joint is arranged so as to express the entire angle command projection component. The same applies to the target site position projection component.

ここで、基底信号Ｆとしては、次式（３）で表されるように、繰り返し動作の周波数１／Ｔの整数倍の周波数成分を有する正弦波信号の組合せとすることができる。
Here, the base signal F can be a combination of sinusoidal signals having a frequency component that is an integral multiple of the frequency 1 / T of the repetitive operation, as represented by the following equation (3).

なお、基底信号は、対象部位の追従誤差（＝角度指令信号−対象部位位置信号）の周波数成分を含むように選定することが好ましい。例えば、ロボットの固有振動による追従誤差が生じる場合には、固有振動周波数成分を有する正弦波信号を含んだ基底信号を選定する。これにより、対象部位位置信号にノイズが含まれていても、追従誤差を精度よく抽出することが可能になる。このため、このような基底信号を選定することで、ローパスフィルタ等のノイズ除去回路を設ける必要がなくなり、ローパスフィルタ等を設けることによる必要な情報の欠落、位相遅れの発生等が発生することがない。 The base signal is preferably selected so as to include the frequency component of the tracking error (= angle command signal-target site position signal) of the target site. For example, when a tracking error occurs due to the natural vibration of the robot, a base signal including a sinusoidal signal having a natural vibration frequency component is selected. This makes it possible to accurately extract the tracking error even if the target site position signal contains noise. Therefore, by selecting such a base signal, it is not necessary to provide a noise removal circuit such as a low-pass filter, and the necessary information may be lost or a phase delay may occur due to the provision of the low-pass filter or the like. Absent.

信号射影部３３２によって生成された角度指令射影成分及び対象部位位置射影成分のそれぞれは、学習出力射影成分算出部３３３に入力される。学習出力射影成分算出部３３３は、与えられたＮ回目のウィービング動作１周期分の角度指令射影成分及び対象部位位置射影成分から、Ｎ＋１回目のウィービング動作用の学習出力射影成分を算出する。この学習出力射影成分算出部３３３は、後述する射影空間モデルに基づいて構成される。 Each of the angle command projection component and the target site position projection component generated by the signal projection unit 332 is input to the learning output projection component calculation unit 333. The learning output projection component calculation unit 333 calculates the learning output projection component for the N + 1th weaving operation from the angle command projection component and the target site position projection component for one cycle of the given Nth weaving operation. The learning output projective component calculation unit 333 is configured based on a projective space model described later.

射影空間モデルの導出について説明する。まず、時間空間において、学習出力信号を入力とし対象部位位置信号を出力とするモデルを考える。溶接ロボット２０のモデルＰを以下の状態空間表現とする。
なお、ここでは簡単のため、電流制御部のモデルを１と仮定した。 Derivation of the projective space model will be described. First, consider a model in which the learning output signal is input and the target site position signal is output in time and space. The model P of the welding robot 20 is represented by the following state space.
For the sake of simplicity, the model of the current control unit is assumed to be 1.

ここで、２つの関節のみを有する２リンクロボットのモデルを例に挙げて説明する。図５は、２リンクロボットの構成を示す模式図である。２リンクロボット２００は、２つのモータ２１１，２１２と、２つのアーム２２１，２２２とを有する。１軸目のモータ２１１の回転角をθ_Ｍ，１、２軸目のモータ２１２の回転角をθ_Ｍ，２と表し、１軸目のアーム２２１（１軸目のモータ２１１により駆動されるアーム）の回転角をθ_Ｌ，１、２軸目のアーム２２１（２軸目のモータ２１２により駆動されるアーム）の回転角をθ_Ｌ，２と表す。このとき、２リンクロボット２００のモデルは、次のように表現される。
Here, a model of a two-link robot having only two joints will be described as an example. FIG. 5 is a schematic view showing the configuration of a two-link robot. The two-link robot 200 has two motors 211 and 212 and two arms 221,222. The angle of rotation of the motor 211 of the first axis is θ _{M, 1} , and the angle of rotation of the motor 212 of the second axis is θ _{M, 2.} The arm 221 of the first axis (arm driven by the motor 211 of the first axis) ) Is represented by θ _{L, 1} , and the rotation angle of the second axis arm 221 (arm driven by the second axis motor 212) is represented by θ _{L, 2} . At this time, the model of the 2-link robot 200 is expressed as follows.

上記の２リンクロボット２００の場合、次式のような状態空間表現となる。
In the case of the above-mentioned two-link robot 200, the state space expression is as follows.

また、制御器側のモデルＣは、次のような状態表現となる。
Further, the model C on the controller side has the following state expression.

上記のモデルＰとＣとを組み合わせると、以下のように対象部位位置信号θ_Ｌ（ｔ）と学習出力信号ｚ（ｔ）との関係を示す数理モデルＧを得ることができる。
但し、上式において角度指令信号θ_ＲＥＦは関係がなくなるので省略している。 By combining the above models P and C, a mathematical model G showing the relationship between the target site position signal θ _L (t) and the learning output signal z (t) can be obtained as follows.
However, in the above equation, the angle command signal θ _REF is omitted because it has no relation.

ここで、上述した時間空間上の数理モデルＧを基底信号の射影空間上の表現に変換した射影空間モデルを導出する。Ｎ回目の繰り返し動作開始時の状態ｘ^Ｎを次式（５）のように定義する。
このとき、学習出力信号射影成分と対象部位位置射影成分との関係を示す射影空間モデルは、次のように表される。
上記の射影空間モデルは、１回の繰り返し動作の試行を１サンプルとする離散時間動的システムとなる。 Here, a projective space model is derived by converting the above-mentioned mathematical model G on time space into a representation on the projective space of the basis signal. The state x ^N at the start of the Nth repeated operation is defined as in the following equation (5).
At this time, the projective space model showing the relationship between the learning output signal projection component and the target site position projection component is expressed as follows.
The above-mentioned projective space model is a discrete-time dynamic system in which one trial of repeated operation is used as one sample.

次に、上記のように導出された射影空間モデルを用いて学習出力射影成分算出部３３３を構築する。図６は、学習出力射影成分算出部３３３の構築を説明するための機能ブロック図である。射影空間モデル３５０を学習対象の離散時間動的システムとして扱い、射影空間モデル３５０の出力である対象部位位置射影成分を角度指令射影成分に一致させるための制御器３５１を設計し、この制御器３５１を学習出力射影成分算出部３３３とする。つまり、学習出力射影成分算出部３３３は、射影空間モデルに基づいて構成された制御器である。 Next, the learning output projection component calculation unit 333 is constructed using the projective space model derived as described above. FIG. 6 is a functional block diagram for explaining the construction of the learning output projection component calculation unit 333. The projective space model 350 is treated as a discrete time dynamic system to be trained, and a controller 351 for matching the target site position projection component, which is the output of the projective space model 350, with the angle command projection component is designed, and this controller 351 is used. Is a learning output projective component calculation unit 333. That is, the learning output projection component calculation unit 333 is a controller configured based on the projection space model.

ここで、繰り返し動作の場合、角度指令射影成分は一定値となる。射影空間上で考えることにより、時間により変化する角度指令信号θ_ＲＥＦ（ｔ）への追従制御問題を、一定値への収束問題とすることができる。これにより、この離散時間動的システムを定常誤差ゼロで安定化させるような制御器（学習出力射影成分算出部）を設計すれば、対象部位位置射影成分を角度指令射影成分と一致させることが可能になり、対象部位位置信号θ_Ｌ（ｔ）＝角度指令信号θ_ＲＥＦ（ｔ）とすることができる。 Here, in the case of repeated operation, the angle command projection component becomes a constant value. By considering in projective space, the tracking control problem for the angle command signal θ _REF (t) that changes with time can be set as the convergence problem to a constant value. By designing a controller (learning output projection component calculation unit) that stabilizes this discrete-time dynamic system with zero steady-state error, it is possible to match the target site position projection component with the angle-command projection component. Therefore, the target site position signal θ _L (t) = angle command signal θ _REF (t).

図７は、学習出力射影成分算出部３３３の構築例を示す機能ブロック図である。図７に示すように、定常誤差ゼロで安定化させる制御器３５１（学習出力射影成分算出部３３３）の設計として、状態推定部３４１、状態フィードバック部３４２、積分サーボ部３４３を備えた構成とすることができる。ここで、状態推定部３４１は、状態推定オブザーバ、カルマンフィルタ等の状態推定方法を用いて、対象部位位置射影成分と学習出力射影成分から射影空間モデル３５０の状態を推定する。状態フィードバック部３４２は、状態推定部３４１で推定された状態に対して状態フィードバックゲインを乗じて出力する。積分サーボ部３４３は、角度指令射影成分と対象部位位置射影成分との誤差を積分し、積分ゲインＫ_Ｉを乗じて出力する。 FIG. 7 is a functional block diagram showing a construction example of the learning output projection component calculation unit 333. As shown in FIG. 7, the controller 351 (learning output projection component calculation unit 333) that stabilizes with zero stationary error is designed to include a state estimation unit 341, a state feedback unit 342, and an integration servo unit 343. be able to. Here, the state estimation unit 341 estimates the state of the projective space model 350 from the target site position projection component and the learning output projection component by using a state estimation method such as a state estimation observer or a Kalman filter. The state feedback unit 342 outputs the state estimated by the state estimation unit 341 by multiplying the state feedback gain. Integrating servo section 343 integrates the error between the angle command projection component and the target portion position projection component, it outputs the result integral gain K _I.

ここで、Ｎ回目の繰り返し動作の終了後に動作が止まる、即ち、振動が収まるのを待つようにした場合は、状態ｘ^Ｎ＝０となり、射影空間モデル３５０は静的システムとなる。本実施の形態では、ｘ^Ｎ≠０を考慮した動的システムに基づいて制御器である学習出力射影成分算出部３３３を構成したため、Ｎ回目の繰り返し動作終了後に待ち時間を入れずにＮ＋１回目の繰り返し動作を開始し、Ｎ＋１回目の繰り返し動作においてＮ回目の繰り返し動作において発生した振動の影響が存在する状態であっても問題なく学習制御を行うことができる。また、上記の動的システムは多入出力システムの駆動機械のモデルにより導出されたものであり、この動的システムに基づいて学習出力射影成分算出部３３３を構成したため、多入出力システムの駆動機械においても出力に対する入力の干渉を抑制した制御を行うことができる。 Here, when the operation is stopped after the end of the Nth repeated operation, that is, when the vibration is settled, the state x ^N = 0, and the projective space model 350 becomes a static system. In the present embodiment, since the learning output projection component calculation unit 333, which is a controller, is configured based on a dynamic system considering x ^N ≠ 0, the N + 1th time is performed without waiting after the Nth time of the repeated operation is completed. The learning control can be performed without any problem even when the repetitive operation is started and the influence of the vibration generated in the Nth repetitive operation exists in the N + 1th repetitive operation. Further, the above dynamic system is derived from the model of the drive machine of the multi-input / output system, and since the learning output projection component calculation unit 333 is configured based on this dynamic system, the drive machine of the multi-input / output system It is also possible to perform control that suppresses input interference with the output.

再び図４を参照する。上記のような学習出力射影成分算出部３３３によって出力された学習出力射影成分は、学習出力信号生成部３３４に入力される。学習出力信号生成部３３４は、入力された学習出力射影成分と、基底信号Ｆとに基づいて、次式（６）のように学習出力信号ｚ_ｉ（ｔ）（ｔ∈［（Ｎ−１）Ｔ，ＮＴ］）を生成する。
See FIG. 4 again. The learning output projection component output by the learning output projection component calculation unit 333 as described above is input to the learning output signal generation unit 334. Learning output signal generation unit 334, and the input learning output projection component, based on the baseband signal F, the learning output signal z _{i (t)} as in the following equation (6) (t∈ [(N -1) T, NT]) is generated.

＜学習制御装置の動作＞
以下、本実施の形態に係る学習制御装置３０の動作について説明する。図８は、本実施の形態に係る学習制御装置の動作の手順を示すフローチャートである。学習制御装置３０のＣＰＵ３０１は、Ｎ回目のウィービング動作を溶接ロボット２０に実行させるよう、各軸のモータの角度指令を算出し、それぞれの角度指令信号を生成する（ステップＳ１０１）。また、溶接ロボット２０の各関節に設けられた角度センサ及び角速度センサから出力される角度及び角速度の測定値が学習制御装置３０に与えられる。ＣＰＵ３０１は、生成された角度指令信号と、入力された角度及び角速度の測定値とに基づいて、各軸のモータへのトルク指令（以下、「補正前トルク指令」という）を算出する（ステップＳ１０２）。ステップＳ１０２の処理は、角度・角速度制御部３１２，３２２，…により実行される（図３参照）。 <Operation of learning control device>
Hereinafter, the operation of the learning control device 30 according to the present embodiment will be described. FIG. 8 is a flowchart showing the operation procedure of the learning control device according to the present embodiment. The CPU 301 of the learning control device 30 calculates the angle command of the motor of each axis so that the welding robot 20 executes the Nth weaving operation, and generates each angle command signal (step S101). Further, the learning control device 30 is given the measured values of the angle and the angular velocity output from the angle sensor and the angular velocity sensor provided in each joint of the welding robot 20. The CPU 301 calculates a torque command (hereinafter, referred to as “pre-correction torque command”) to the motor of each axis based on the generated angle command signal and the input measured values of the angle and the angular velocity (step S102). ). The process of step S102 is executed by the angle / angular velocity control units 312, 322, ... (See FIG. 3).

次に、ＣＰＵ３０１は、Ｎ回目（但し、ここではＮ≧２）のウィービング動作用の学習出力信号を、補正前トルク指令に加算して、Ｎ回目のウィービング動作用のトルク指令を軸毎に算出する（ステップＳ１０３）。この学習出力信号は、上述した学習制御部３１６により、Ｎ−１回目のウィービング動作用の角度指令信号、及びＮ−１回目のウィービング動作において検出された溶接ロボット２０の先端部位の位置データから算出された対象部位位置信号に基づいて生成されたものである。 Next, the CPU 301 adds the learning output signal for the Nth weaving operation (where N ≧ 2 in this case) to the pre-correction torque command, and calculates the torque command for the Nth weaving operation for each axis. (Step S103). This learning output signal is calculated from the angle command signal for the N-1th weaving operation and the position data of the tip portion of the welding robot 20 detected in the N-1th weaving operation by the learning control unit 316 described above. It is generated based on the target site position signal.

ＣＰＵ３０１は、トルク指令に示されるトルクを対応するモータが出力するための電流値を軸毎に算出し（ステップＳ１０４）。算出された電流値を示す制御信号を出力する（ステップＳ１０５）。これにより、電流が各モータに供給され、溶接ロボット２０がＮ回目のウィービング動作を実行する。ステップＳ１０４の処理は、電流制御部３１３，３２３，…により実行される（図３参照）。 The CPU 301 calculates the current value for outputting the torque indicated by the torque command by the corresponding motor for each axis (step S104). A control signal indicating the calculated current value is output (step S105). As a result, an electric current is supplied to each motor, and the welding robot 20 executes the Nth weaving operation. The process of step S104 is executed by the current control units 313, 323, ... (See FIG. 3).

Ｎ回目のウィービング動作における溶接ロボット２０の先端部位の加速度は、加速度センサ２５によって検出される。加速度センサ２５による検出加速度は学習制御装置３０に与えられ、ＣＰＵ３０１は、加速度センサ２５の検出加速度から溶接ロボット２０の先端部位の位置を算出し、この位置から各軸の関節角度を示す対象部位位置信号を算出する（ステップＳ１０６）。ステップＳ１０６の処理は、データ変換部３１４及び座標変換部３１５により実行される（図３参照）。 The acceleration of the tip portion of the welding robot 20 in the Nth weaving operation is detected by the acceleration sensor 25. The detected acceleration by the acceleration sensor 25 is given to the learning control device 30, and the CPU 301 calculates the position of the tip portion of the welding robot 20 from the detected acceleration of the acceleration sensor 25, and from this position, the target portion position indicating the joint angle of each axis. The signal is calculated (step S106). The process of step S106 is executed by the data conversion unit 314 and the coordinate conversion unit 315 (see FIG. 3).

ＣＰＵ３０１は、Ｎ＋１回目のウィービング動作用の学習出力信号を生成する学習出力信号生成処理を実行する（ステップＳ１０７）。ステップＳ１０７の処理は、学習制御部３１６により実行される（図３参照）。 The CPU 301 executes a learning output signal generation process for generating a learning output signal for the N + 1th weaving operation (step S107). The process of step S107 is executed by the learning control unit 316 (see FIG. 3).

図９は、学習出力信号生成処理の手順を示すフローチャートである。学習出力信号生成処理において、まずＣＰＵ３０１は、Ｎ回目のウィービング動作における各軸の対象部位位置信号と、Ｎ回目のウィービング動作のための角度指令信号とを、メモリ３０２内の領域である信号記憶部３３１に記憶させる（ステップＳ２０１）。信号記憶部３３１には、Ｎ回目のウィービング動作１周期分の各軸の対象部位位置信号及び角度指令信号が記憶される。 FIG. 9 is a flowchart showing the procedure of the learning output signal generation processing. In the learning output signal generation process, the CPU 301 first stores the target site position signal of each axis in the Nth weaving operation and the angle command signal for the Nth weaving operation in a signal storage unit which is an area in the memory 302. It is stored in 331 (step S201). The signal storage unit 331 stores the target site position signal and the angle command signal of each axis for one cycle of the Nth weaving operation.

次にＣＰＵ３０１は、信号記憶部３３１に記憶された角度指令信号及び対象部位位置信号を基底信号に射影し、角度指令射影成分及び対象部位位置射影成分を算出する（ステップＳ２０２）。ステップＳ２０２の処理は、信号射影部３３２により実行される（図４参照）。 Next, the CPU 301 projects the angle command signal and the target site position signal stored in the signal storage unit 331 onto the base signal, and calculates the angle command projection component and the target site position projection component (step S202). The process of step S202 is executed by the signal projection unit 332 (see FIG. 4).

ＣＰＵ３０１は、Ｎ回目のウィービング動作１周期分の角度指令射影成分及び対象部位位置射影成分から、Ｎ＋１回目のウィービング動作用の学習出力射影成分を算出する（ステップＳ２０３）。ステップＳ２０３の処理は、学習出力射影成分算出部３３３により実行される（図４参照）。次にＣＰＵ３０１は、学習出力射影成分と基底信号とに基づいて、Ｎ＋１回目のウィービング動作用の学習出力信号を生成する（ステップＳ２０４）。ステップＳ２０４の処理は、学習出力信号生成部３３４により実行される（図４参照）。以上で、学習出力信号生成処理が終了する。 The CPU 301 calculates the learning output projection component for the N + 1th weaving operation from the angle command projection component and the target site position projection component for one cycle of the Nth weaving operation (step S203). The process of step S203 is executed by the learning output projection component calculation unit 333 (see FIG. 4). Next, the CPU 301 generates a learning output signal for the N + 1th weaving operation based on the learning output projection component and the base signal (step S204). The process of step S204 is executed by the learning output signal generation unit 334 (see FIG. 4). This completes the learning output signal generation process.

Ｎ＋１回目のウィービング動作用の学習出力信号を生成すると、ＣＰＵ３０１は次の周期（Ｎ＋１回目）のウィービング動作用の角度指令信号の算出を開始するタイミングまで待機する（ステップＳ１０８）。次の周期のウィービング動作用の角度指令信号の算出タイミングに到達すると、ＣＰＵ３０１はステップＳ１０１に処理を戻し、Ｎ＋１回目のウィービング動作用の角度指令信号を生成し（ステップＳ１０１）、Ｎ＋１回目のウィービング動作用の補正前トルク指令を算出する（ステップＳ１０２）。そして、ＣＰＵ３０１はＮ＋１回目のウィービング動作用の学習出力信号を、Ｎ＋１回目のウィービング動作用の補正前トルク指令に加算して、Ｎ＋１回目のウィービング動作用のトルク指令を軸毎に算出する（ステップＳ１０３）。以降、ＣＰＵ３０１は、ステップＳ１０４乃至Ｓ１０８の処理を実行する。以上のように、ステップＳ１０８のタイミング調整処理を設けることで、前回の周期のウィービング動作に係る角度指令信号及び対象部位位置信号に基づいて生成された学習出力信号を、次回の周期のウィービング動作に係る補正前トルク指令に加算することができる。 When the learning output signal for the N + 1th weaving operation is generated, the CPU 301 waits until the timing to start the calculation of the angle command signal for the weaving operation in the next cycle (N + 1th time) (step S108). When the calculation timing of the angle command signal for the weaving operation in the next cycle is reached, the CPU 301 returns to step S101 to generate the angle command signal for the N + 1th weaving operation (step S101), and the N + 1th weaving operation. The pre-correction torque command for is calculated (step S102). Then, the CPU 301 adds the learning output signal for the N + 1th weaving operation to the pre-correction torque command for the N + 1th weaving operation, and calculates the torque command for the N + 1th weaving operation for each axis (step S103). ). After that, the CPU 301 executes the processes of steps S104 to S108. As described above, by providing the timing adjustment process of step S108, the learning output signal generated based on the angle command signal and the target site position signal related to the weaving operation of the previous cycle can be used for the weaving operation of the next cycle. It can be added to the pre-correction torque command.

以上説明したようなステップＳ１０１乃至Ｓ１０８の処理を繰り返し実行することにより、ウィービング動作の学習制御が繰り返し行われる。 By repeatedly executing the processes of steps S101 to S108 as described above, the learning control of the weaving operation is repeatedly performed.

以上の如く構成したことにより、本実施の形態に係る学習制御装置３０によれば、多入出力システムである溶接ロボット２０のＮ回目のウィービング動作における各軸に対する角度指令信号のそれぞれと、Ｎ回目のウィービング動作において取得された各軸に対する対象部位位置信号のそれぞれとに基づいて、各軸に対するＮ＋１回目の学習出力信号を生成するため、溶接ロボット２０の１つの軸に対する角度指令信号（入力）が他の軸の角度（出力）に干渉すること、即ち出力に対する入力の干渉を抑制することができる。 With the above configuration, according to the learning control device 30 according to the present embodiment, each of the angle command signals for each axis in the Nth weaving operation of the welding robot 20 which is a multi-input / output system and the Nth time. In order to generate the N + 1th learning output signal for each axis based on each of the target site position signals for each axis acquired in the weaving operation of, the angle command signal (input) for one axis of the welding robot 20 is generated. Interfering with the angle (output) of another axis, that is, the interference of the input with respect to the output can be suppressed.

＜評価試験＞
発明者は、２リンクロボットに対して学習制御を伴わないフィードバック制御（以下、「従来手法」という）と本実施の形態に係る学習制御方法（以下、「本手法」という）を実施し、本手法の性能を評価した。図１０は、従来手法の構成を示す機能ブロック図である。図１０に示すように、角度・角速度制御部３１２，３２２，…及び電流制御部３１３，３２３，…を有し、データ変換部３１４，座標変換部３１５，及び学習制御部３１６を有しない制御装置によって従来手法を実施した。また、本評価試験では、従来手法及び本手法によって、ウィービング周波数２Ｈｚ、全振幅４ｍｍの条件でウィービング動作を２リンクロボット２００に実行させた。 <Evaluation test>
The inventor has implemented feedback control without learning control (hereinafter referred to as "conventional method") and learning control method according to the present embodiment (hereinafter referred to as "this method") for the two-link robot. The performance of the method was evaluated. FIG. 10 is a functional block diagram showing the configuration of the conventional method. As shown in FIG. 10, a control device having an angle / angular velocity control unit 312, 322, ..., a current control unit 313, 323, ..., And not having a data conversion unit 314, a coordinate conversion unit 315, and a learning control unit 316. The conventional method was carried out by. Further, in this evaluation test, the two-link robot 200 was made to execute the weaving operation under the conditions of the weaving frequency of 2 Hz and the total amplitude of 4 mm by the conventional method and the present method.

図１１Ａ乃至図１１Ｄに、従来手法を実施した場合の２リンクロボット２００の挙動を示す。図１１Ａは、１軸目の角度指令信号及び関節角度（対象部位位置信号）の時間変化を示すグラフであり、図１１Ｂは、２軸目の角度指令信号及び関節角度（対象部位位置信号）の時間変化を示すグラフであり、図１１Ｃは、ウィービング動作中の２リンクロボット２００の先端部位の横方向（溶接方向に対する水平交差方向）の移動及び上下動を示すグラフであり、図１１Ｄはウィービング動作中の２リンクロボット２００の先端部位の移動を示すグラフである。図１１Ａ及び図１１Ｂにおいて、縦軸は関節角度を、横軸は時間を示している。図１１Ｃにおいて、縦軸は距離を、横軸は時間を示している。また、図１１Ｄにおいて、縦軸は２リンクロボット２００の先端部位の上下動の距離を、横軸はその横方向の移動の距離を示している。 11A to 11D show the behavior of the 2-link robot 200 when the conventional method is carried out. FIG. 11A is a graph showing the time change of the angle command signal and the joint angle (target site position signal) of the first axis, and FIG. 11B is the angle command signal and the joint angle (target site position signal) of the second axis. 11C is a graph showing a time change, FIG. 11C is a graph showing lateral movement and vertical movement of the tip portion of the 2-link robot 200 during the weaving operation (horizontal crossing direction with respect to the welding direction), and FIG. 11D is a weaving operation. It is a graph which shows the movement of the tip part of the 2-link robot 200 inside. In FIGS. 11A and 11B, the vertical axis represents the joint angle and the horizontal axis represents time. In FIG. 11C, the vertical axis represents distance and the horizontal axis represents time. Further, in FIG. 11D, the vertical axis represents the vertical movement distance of the tip portion of the two-link robot 200, and the horizontal axis represents the horizontal movement distance thereof.

図１１Ａ及び図１１Ｂからは、振動により各関節の角度指令信号と関節角度との間にずれが生じていることが分かる。また、図１１Ｃ及び図１１Ｄからは、ウィービング動作の横方向移動において正弦波状の波形を示しておらず、また±約０．７ｍｍの範囲で上下動が発生していることが分かる。これらの挙動は、前回のウィービング周期で生じた振動が、次回のウィービング動作においても残存しており、この振動による影響であると考えられる。 From FIGS. 11A and 11B, it can be seen that the vibration causes a deviation between the angle command signal of each joint and the joint angle. Further, from FIGS. 11C and 11D, it can be seen that the lateral movement of the weaving operation does not show a sinusoidal waveform, and that the vertical movement occurs within a range of ± about 0.7 mm. These behaviors are considered to be the influence of the vibration generated in the previous weaving cycle, which remains in the next weaving operation.

図１２Ａ乃至図１２Ｄに、本手法を実施した場合の２リンクロボット２００の学習初期の挙動を示し、図１３Ａ乃至図１３Ｄに、学習後期の挙動を示す。図１２Ａ及び図１３Ａは、１軸目の角度指令信号及び関節角度の時間変化を示すグラフであり、図１２Ｂ及び図１３Ｂは、２軸目の角度指令信号及び関節角度の時間変化を示すグラフであり、図１２Ｃ及び図１３Ｃは、ウィービング動作中の２リンクロボット２００の先端部位の横方向移動及び上下動の時間変化を示すグラフであり、図１２Ｄ及び図１３Ｄはウィービング動作中の２リンクロボット２００の先端部位の移動を示すグラフである。図１２Ａ、図１２Ｂ、図１３Ａ、及び図１３Ｂにおいて、縦軸は角度を、横軸は時間を示している。図１２Ｃ及び図１３Ｃにおいて、縦軸は距離を、横軸は時間を示している。また、図１２Ｄ及び図１３Ｄにおいて、縦軸は２リンクロボット２００の先端部位の上下動の距離を、横軸はその横方向の移動の距離を示している。 12A to 12D show the behavior of the two-link robot 200 at the initial stage of learning when this method is performed, and FIGS. 13A to 13D show the behavior of the latter stage of learning. 12A and 13A are graphs showing the time change of the angle command signal and the joint angle of the first axis, and FIGS. 12B and 13B are graphs showing the time change of the angle command signal and the joint angle of the second axis. 12C and 13C are graphs showing the time change of lateral movement and vertical movement of the tip portion of the 2-link robot 200 during the weaving operation, and FIGS. 12D and 13D show the 2-link robot 200 during the weaving operation. It is a graph which shows the movement of the tip part of. In FIGS. 12A, 12B, 13A, and 13B, the vertical axis represents an angle and the horizontal axis represents time. In FIGS. 12C and 13C, the vertical axis represents distance and the horizontal axis represents time. Further, in FIGS. 12D and 13D, the vertical axis represents the vertical movement distance of the tip portion of the 2-link robot 200, and the horizontal axis represents the horizontal movement distance thereof.

図１２Ａ及び図１２Ｂからは、従来手法に比べて各関節の角度指令信号と関節角度との間のずれが大幅に減少していることが分かる。また、図１２Ｃ及び図１２Ｄからは、従来手法に比べて、ウィービング動作の横方向移動が正弦波状の波形に近くなっており、また上下動の範囲が減少していることが分かる。このように、学習初期においては、従来手法に比べてウィービング動作の挙動が改善している。また、図１３Ａ及び図１３Ｂからは、各関節の角度指令信号と関節角度との間のずれがほとんどなくなっていることが、図１３Ｃ及び図１３Ｄからは、ウィービング動作の横方向移動が正弦波状の波形を示し、上下動が殆ど発生していないことが分かる。このように、多入出力システムにおける連続した（つまり、繰り返し動作の間に待ち時間がない）一連の動作に本手法を適用した場合に、繰り返し学習制御が進むにしたがって振動が徐々に低減していることが分かる。 From FIGS. 12A and 12B, it can be seen that the deviation between the angle command signal of each joint and the joint angle is significantly reduced as compared with the conventional method. Further, from FIGS. 12C and 12D, it can be seen that the lateral movement of the weaving operation is closer to the sinusoidal waveform and the range of the vertical movement is reduced as compared with the conventional method. As described above, in the initial stage of learning, the behavior of the weaving motion is improved as compared with the conventional method. Further, from FIGS. 13A and 13B, there is almost no deviation between the angle command signal of each joint and the joint angle, and from FIGS. 13C and 13D, the lateral movement of the weaving operation is sinusoidal. It shows a waveform, and it can be seen that almost no vertical movement occurs. In this way, when this method is applied to a series of continuous operations (that is, there is no waiting time between repeated operations) in a multi-input / output system, the vibration gradually decreases as the iterative learning control progresses. You can see that there is.

（その他の実施の形態）
上述した実施の形態においては、繰り返し動作であるウィービング動作の１周期ずつ学習出力信号を生成する構成について述べたが、これに限定されるものではない。連続した複数周期の繰り返し動作についての角度指令信号及び対象部位位置信号をまとめて処理し、当該複数周期分の学習出力信号を生成するように構成してもよい。 (Other embodiments)
In the above-described embodiment, the configuration in which the learning output signal is generated for each cycle of the weaving operation, which is a repetitive operation, has been described, but the present invention is not limited to this. The angle command signal and the target site position signal for the continuous operation of a plurality of cycles may be collectively processed to generate the learning output signal for the plurality of cycles.

また、上述した実施の形態では、単一の学習制御装置３０によって学習制御プログラム３１０のすべての処理が実行される構成について述べたが、本発明はこれに限定されるものではなく、学習制御プログラム３１０と同様の処理を、複数の装置（コンピュータ）により分散して実行する分散システムとすることも可能である。 Further, in the above-described embodiment, the configuration in which all the processes of the learning control program 310 are executed by the single learning control device 30, but the present invention is not limited to this, and the learning control program is not limited to this. It is also possible to make a distributed system in which the same processing as 310 is distributed and executed by a plurality of devices (computers).

本発明の駆動機械の学習制御装置及び学習制御方法は、繰り返し動作を行う駆動機械を制御するための学習制御装置及び学習制御方法等として有用である。 The learning control device and learning control method for a drive machine of the present invention are useful as a learning control device and a learning control method for controlling a drive machine that performs repetitive operations.

１０自動溶接システム
２０溶接ロボット
２５加速度センサ
３０学習制御装置
３０１ＣＰＵ
３０２メモリ
３１０学習制御プログラム
３１６学習制御部
３３１信号記憶部
３３２信号射影部
３３３学習出力射影成分算出部
３３４学習出力信号生成部
２００２リンクロボット
10 Automatic welding system 20 Welding robot 25 Accelerometer 30 Learning control device 301 CPU
302 Memory 310 Learning control program 316 Learning control unit 331 Signal storage unit 332 Signal projection unit 333 Learning output projection component calculation unit 334 Learning output signal generation unit 2002 Link robot

Claims

A learning control device for a drive machine that controls a drive machine having a plurality of movable parts to execute a certain repetitive operation a plurality of times.
An observation physical quantity acquisition means for acquiring an observation physical quantity which is a physical quantity related to a position observed at each of a plurality of target parts which are the movable parts of the drive machine.
The target physical quantity, which is the target physical quantity for each of the plurality of target parts in the Nth repetitive motion, and the plurality of target parts acquired in the Nth repetitive motion by the observation physical quantity acquisition means. A learning control means that generates a learning output signal for each of the plurality of target parts based on each of the observed physical quantities, and
A control signal generating means for generating a control signal for causing the driving machine to execute the N + 1th repetitive operation based on the learning output signal generated by the learning control means for each of the plurality of target parts. equipped with a door,
The learning control means
A projection means for calculating the target physical quantity projection component and the observed physical quantity projection component by projecting the target physical quantity and the observed physical quantity in the Nth repetitive operation in the time space of the driving machine into the projective space of a predetermined base signal.
Learning to calculate the learning output signal projection component in the projection space in the N + 1th repetitive operation based on the target physical quantity projection component and the observed physical quantity projection component in the Nth repetitive operation calculated by the projection means. Output projection component calculation means and
A learning output signal calculating means for calculating the learning output signal in the time space in the N + 1th repetitive operation based on the learning output signal projection component calculated by the learning output projection component calculating means.
Have ,
Learning control device for drive machines.

The learning output projective component calculation means is provided by a controller configured based on a projective space model obtained by converting a mathematical model showing the relationship between the observed physical quantity in the time space and the learning output signal into a mathematical model in the projective space. , It is configured to calculate the learning output signal projection component,
The learning control device for a drive machine according to claim 1.

The learning output projective component calculation means is configured as a controller that stabilizes the projective space model with zero steady-state error.
The learning control device for a drive machine according to claim 2.

The basal signal is a signal relating to the tracking error of the target portion.
The learning control device for a drive machine according to any one of claims 1 to 3.

The basis signal is a combination of sinusoidal signals having a frequency component that is an integral multiple of the frequency of the repetitive operation.
The learning control device for a drive machine according to claim 4.

It is a learning control method of a drive machine that controls a drive machine having a plurality of movable parts to execute a certain repetitive operation a plurality of times.
A step of acquiring an observed physical quantity which is a physical quantity related to a position observed in each of a plurality of target parts which are the movable parts of the driving machine.
The target physical quantity, which is the target physical quantity for each of the plurality of target parts in the Nth repetitive motion, and the observed physical quantity for each of the plurality of target parts acquired in the Nth repetitive motion, respectively. Based on the step of generating the learning output signal for each of the plurality of target parts,
Generated on the basis of the learning output signal, a control signal for executing the N + 1 th of the repetitive operation in the drive machine, possess and generating to said plurality of sites,
In the step of generating the learning output signal,
The target physical quantity and the observed physical quantity in the Nth repetitive operation in the time space of the driving machine are projected onto the projective space of a predetermined base signal, and the target physical quantity projection component and the observed physical quantity projection component are calculated.
Based on the calculated target physical quantity projection component and the observed physical quantity projection component in the Nth repetitive operation, the learning output signal projection component in the projection space in the N + 1th repetitive operation is calculated.
Based on the calculated learning output signal projection component, the learning output signal in the time space in the N + 1th repetitive operation is calculated .
Learning control method for drive machines.