JP2024167393A

JP2024167393A - Information processing system, information processing method, computer program and vehicle

Info

Publication number: JP2024167393A
Application number: JP2024152249A
Authority: JP
Inventors: 辰美黒田; Tatsumi Kuroda; 博司前川; Hiroshi Maekawa; 勉足立; Tsutomu Adachi; 寛隆福田; Hirotaka Fukuda; 健司水野; Kenji Mizuno; 健純近藤; Takeyoshi Kondo; 茂林; Shigeru Hayashi; 丈誠横井; Takemasa Yokoi; 謙史竹中; Kenji Takenaka
Original assignee: Case Patent Inc]
Current assignee: Case Patent Inc]
Priority date: 2014-12-25
Filing date: 2024-09-04
Publication date: 2024-12-03
Also published as: WO2016103881A1; JP7122038B2; JP6744529B2; JP6544693B2; JP2023065621A; JP7555620B2; JP2020173453A; JP2023072025A; JP7251833B2; JP2019095795A; JP2021185410A; JP2022043116A; JPWO2016103881A1; JP2024170597A; JP6994781B2

Abstract

PROBLEM TO BE SOLVED: To provide a robot capable of reducing burden of a user regarding learning of the robot.

SOLUTION: A robot includes: a voice recognition unit for recognizing voice; a learning unit for learning a recognition result of the voice recognition unit; a processing unit for performing corresponding processing of the recognition result of the voice recognition unit using a learning result of the learning unit; a storage unit for storing the learning result in an external storage device; and a learning result acquisition unit for acquiring the learning result from the storage device. A standard program storage unit 23, an artificial intelligence unit 25 and an operation unit 27 are examples of the voice recognition unit, the processing unit and a learning stop unit.

SELECTED DRAWING: Figure 1

Description

REFERENCE TO RELATED APPLICATIONS

本国際出願は、２０１４年１２月２５日に日本国特許庁に出願された日本国特許出願第２０１４－２６２９０７号に基づく優先権を主張するものであり、日本国特許出願第２０１４－２６２９０７号の全内容を本国際出願に参照により援用する。 This international application claims priority based on Japanese Patent Application No. 2014-262907, filed with the Japan Patent Office on December 25, 2014, the entire contents of which are incorporated herein by reference.

本開示はロボットに関する。 This disclosure relates to robots.

近年、人と会話できるロボットが注目されている。このロボットは、人が発音する音声をマイクで取得し、音声認識により音声の意味内容を推定する。そして、推定した意味内容に対し、予め関連付けられた回答を行う（特許文献１参照）。 In recent years, robots that can converse with people have been attracting attention. These robots use a microphone to pick up speech produced by people and infer the meaning of the speech through speech recognition. They then respond to the inferred meaning with a response that has been previously associated with the speech (see Patent Document 1).

特許第４０１５４２４号公報Patent No. 4015424

ロボットの会話能力を、人工知能等を用いた学習により高めることが考えられる。しかしながら、ある特定のロボットの会話能力を学習により高めたとしても、例えば、異なる場所にある別のロボットを使用する場合、初歩から学習を行わなければならない。本開示は、ロボットの学習に関するユーザの負担を軽減できるロボットを提供することを一側面とする。 It is conceivable that a robot's conversational ability could be improved by learning using artificial intelligence, etc. However, even if the conversational ability of a particular robot is improved through learning, when using another robot in a different location, for example, learning must be carried out from the very basics. One aspect of the present disclosure is to provide a robot that can reduce the burden on users in terms of learning to use a robot.

本開示のロボットは、音声認識を行う音声認識ユニットと、前記音声認識ユニットの認識結果について学習する学習ユニットと、前記音声認識ユニットの認識結果に対し、前記学習ユニットの学習結果を用いて、対応する処理を行う処理ユニットと、前記学習結果を外部の記憶装置に記憶する記憶ユニットと、前記学習結果を前記記憶装置から取得する学習結果取得ユニットと、を備える。 The robot disclosed herein includes a voice recognition unit that performs voice recognition, a learning unit that learns from the recognition results of the voice recognition unit, a processing unit that performs corresponding processing on the recognition results of the voice recognition unit using the learning results of the learning unit, a storage unit that stores the learning results in an external storage device, and a learning result acquisition unit that acquires the learning results from the storage device.

本開示のロボットは、音声認識ユニットの認識結果について学習することができる。また、本開示のロボットは、学習結果を外部の記憶装置に記憶するとともに、その記憶装置から、学習結果を取得することができる。 The robot disclosed herein can learn about the recognition results of the voice recognition unit. The robot disclosed herein can also store the learning results in an external storage device and retrieve the learning results from the storage device.

そのため、ユーザは、例えば、第１のロボットの使用により生じた学習結果を外部の記憶装置に記憶しておき、第２のロボットを使用するとき、記憶しておいた学習結果を第２のロボットに導入することができる。その結果、ユーザは、第１のロボットにおける学習結果を利用することができ、第２のロボットを必ずしも初歩から学習させなくてもよい。 Therefore, for example, a user can store the learning results resulting from use of a first robot in an external storage device, and when using a second robot, can introduce the stored learning results into the second robot. As a result, the user can utilize the learning results of the first robot, and does not necessarily have to train the second robot from the very basics.

一実施形態のロボットの電気的構成を表すブロック図である。FIG. 2 is a block diagram showing an electrical configuration of the robot according to the embodiment. ロボットの構成を表す正面図である。FIG. 2 is a front view showing the configuration of the robot. ロボットが実行するプログラム設定処理を表すフローチャートである。11 is a flowchart showing a program setting process executed by a robot. ロボットが実行する会話処理を表すフローチャートである。1 is a flowchart showing a conversation process executed by a robot. ロボットが実行する学習停止判断処理を表すフローチャートである。11 is a flowchart showing a learning stop determination process executed by a robot. 図６Ａは学習停止を指示する動作の一例を表す説明図であり、図６Ｂは、学習再開を指示する動作の一例を表す説明図である。FIG. 6A is an explanatory diagram showing an example of an operation for instructing to stop learning, and FIG. 6B is an explanatory diagram showing an example of an operation for instructing to restart learning. コンピュータの電気的構成を表すブロック図である。FIG. 2 is a block diagram showing the electrical configuration of a computer. コンピュータの外観を表す斜視図である。FIG. 1 is a perspective view illustrating an external appearance of a computer. 車載機の電気的構成を表すブロック図である。FIG. 2 is a block diagram showing an electrical configuration of an in-vehicle device. 車載機が実行する車両制御処理を表すフローチャートである。4 is a flowchart showing a vehicle control process executed by an in-vehicle device. 別の実施形態のロボットの電気的構成を表すブロック図である。FIG. 11 is a block diagram showing an electrical configuration of a robot according to another embodiment. ロボットが実行するプログラムインストール処理を表すフローチャートである。11 is a flowchart showing a program installation process executed by a robot. ロボットが実行する会話処理を表すフローチャートである。1 is a flowchart showing a conversation process executed by a robot. ロボットが実行する第２の人工知能ユニット学習停止判断処理を表すフローチャートである。13 is a flowchart showing the second artificial intelligence unit learning stop determination process executed by the robot. ロボットが実行するプログラムアップロード処理を表すフローチャートである。13 is a flowchart showing a program upload process executed by a robot. 更に他の実施形態のロボットの電気的構成を表すブロック図である。FIG. 13 is a block diagram showing an electrical configuration of a robot according to still another embodiment. ロボットが実行する会話処理を表すフローチャートである。1 is a flowchart showing a conversation process executed by a robot. 通話処理を実行するための構成を表す説明図である。FIG. 2 is an explanatory diagram illustrating a configuration for executing a call process. ホストコンピュータが実行する通話ペア設定処理を表すフローチャートである。10 is a flowchart showing a call pair setting process executed by a host computer. アイコンを表示しているディスプレイを表す説明図である。FIG. 2 is an explanatory diagram showing a display showing icons.

１、３０１…ロボット、３…制御ユニット、５…マイク、７…カメラ、９…タッチパネル、１１…センサ群、１３…ＧＰＳ、１５…スピーカ、１７…モータ群、１９…ディスプレイ、２１…入力ユニット、２３…標準プログラム記憶ユニット、２５…人工知能ユニット、２６…通信ユニット、２７…演算ユニット、２９…出力ユニット、３１…クラウドネットワーク、３３…端末、３５…頭部、３７…胴部、３９…右腕部、４１…左腕部、４３…右脚部、４５…左脚部、４７、４９…車輪、５１…移動用モータ、５３…首関節、５５…肩関節、５７…肘関節、５９…手首関節、６１…関節用モータ、６３…キーボード、６５…マウス、６７…筐体、６９…端子、７１…車両制御部、１０１…コンピュータ、２０１…車載機 1, 301...robot, 3...control unit, 5...microphone, 7...camera, 9...touch panel, 11...sensor group, 13...GPS, 15...speaker, 17...motor group, 19...display, 21...input unit, 23...standard program storage unit, 25...artificial intelligence unit, 26...communication unit, 27...arithmetic unit, 29...output unit, 31...cloud network, 33...terminal, 35...head, 37...torso, 39...right arm, 41...left arm, 43...right leg, 45...left leg, 47, 49...wheels, 51...movement motor, 53...neck joint, 55...shoulder joint, 57...elbow joint, 59...wrist joint, 61...joint motor, 63...keyboard, 65...mouse, 67...casing, 69...terminal, 71...vehicle control unit, 101...computer, 201...vehicle-mounted device

本開示の実施形態を図面に基づき説明する。
＜第１の実施形態＞
１．ロボット１の構成
ロボット１の構成を図１、図２に基づき説明する。ロボット１は、図１に示すように、制御ユニット３、マイク５、カメラ７、タッチパネル９、センサ群１１、ＧＰＳ１３、スピーカ１５、モータ群１７、及びディスプレイ１９を備える。 An embodiment of the present disclosure will be described with reference to the drawings.
First Embodiment
1. Configuration of Robot 1 The configuration of the robot 1 will be described with reference to Figures 1 and 2. As shown in Figure 1, the robot 1 includes a control unit 3, a microphone 5, a camera 7, a touch panel 9, a group of sensors 11, a GPS 13, a speaker 15, a group of motors 17, and a display 19.

制御ユニット３はマイクロコンピュータを備える。制御ユニットは、具体的には、入力ユニット２１、標準プログラム記憶ユニット２３、人工知能ユニット２５、通信ユニット２６、演算ユニット２７、及び出力ユニット２９を備える。 The control unit 3 includes a microcomputer. Specifically, the control unit includes an input unit 21, a standard program storage unit 23, an artificial intelligence unit 25, a communication unit 26, a calculation unit 27, and an output unit 29.

入力ユニット２１は、マイク５、カメラ７、タッチパネル９、センサ群１１、及びＧＰＳ１３から情報を取得し、その情報を演算ユニット２７及び人工知能ユニット２５に出力する。 The input unit 21 acquires information from the microphone 5, camera 7, touch panel 9, sensor group 11, and GPS 13, and outputs the information to the calculation unit 27 and artificial intelligence unit 25.

標準プログラム記憶ユニット２３は、ロボット１の各種動作を実行するための標準プログラムを常時記憶している。標準プログラムは、後述するＡＩ用プログラムとは異なり、
内容が変化しないプログラムである。 The standard program storage unit 23 constantly stores standard programs for executing various operations of the robot 1. The standard programs are different from the AI programs described later.
It is a program whose contents do not change.

人工知能ユニット２５は、ロボット１の各種動作を実行するためのＡＩ（人工知能）用プログラムを記憶可能である。人工知能ユニット２５は、ＡＩ用プログラムを記憶しているとき、それを学習により変化、発展させる。また、人工知能ユニット２５は、ＡＩ用プログラムを新規にインストールすること、及びＡＩ用プログラムを消去することが可能である。 The artificial intelligence unit 25 can store AI (artificial intelligence) programs for executing various operations of the robot 1. When the artificial intelligence unit 25 stores an AI program, it changes and develops it through learning. The artificial intelligence unit 25 can also install new AI programs and erase AI programs.

通信ユニット２６は、外部のクラウドネットワーク３１、端末３３等と通信を行う。通信は無線通信であってもよいし、有線通信であってもよい。
演算ユニット２７は、標準プログラム又はＡＩ用プログラムを用いて、ロボット１の各種動作を実現するたに必要な演算を実行する。 The communication unit 26 communicates with an external cloud network 31, a terminal 33, etc. The communication may be wireless communication or wired communication.
The arithmetic unit 27 uses a standard program or an AI program to execute the calculations necessary to realize various operations of the robot 1.

出力ユニット２９は演算ユニット２７の演算結果をスピーカ１５、モータ群１７、及びディスプレイ１９に出力する。なお、制御ユニット３に属する各ユニットの詳しい機能は後述する。 The output unit 29 outputs the calculation results of the calculation unit 27 to the speaker 15, the motor group 17, and the display 19. The detailed functions of each unit belonging to the control unit 3 will be described later.

図２に示すように、ロボット１は、人型の外形を有しており、頭部３５、胴部３７、右腕部３９、左腕部４１、右脚部４３、及び左脚部４５を備える。
マイク５、カメラ７、及びスピーカ１５は頭部３５に設けられている。また、ディスプレイ１９、及びタッチパネル９は胴部３７の正面側に設けられている。 As shown in FIG. 2, the robot 1 has a humanoid external shape and includes a head 35 , a torso 37 , a right arm 39 , a left arm 41 , a right leg 43 , and a left leg 45 .
The microphone 5, the camera 7, and the speaker 15 are provided in the head 35. The display 19 and the touch panel 9 are provided on the front side of the body 37.

右脚部４３、及び左脚部４５は、それぞれ、移動用の車輪４７、４９を備えている。車輪４７、４９は、それぞれ、前後方向に２つ設けられている。よって、ロボット１は、合計４つの車輪により地面に接する。車輪４７、４９は、モータ群１７に属する移動用モータ５１により駆動される。移動用モータ５１の駆動力によって車輪４７、４９を回転させることにより、ロボット１は前後に移動することができる。また、車輪４７の回転数と、車輪４９の回転数とを異ならせることにより、ロボット１は右旋回又は左旋回をすることができる。 The right leg 43 and the left leg 45 each have wheels 47, 49 for movement. Two wheels 47, 49 are provided in the front-to-rear direction. Therefore, the robot 1 comes into contact with the ground with a total of four wheels. The wheels 47, 49 are driven by a movement motor 51 belonging to the motor group 17. The robot 1 can move forward and backward by rotating the wheels 47, 49 with the driving force of the movement motor 51. In addition, by making the rotation speed of the wheel 47 different from the rotation speed of the wheel 49, the robot 1 can turn right or left.

ロボット１は、首関節５３、肩関節５５、肘関節５７、及び手首関節５９を備える。各関節の自由度は、１～３の中から適宜設定できる。各関節は、モータ群１７に属する関節用モータ６１の駆動力により、動作する。上記の関節のうち、適宜選択したものを関節用モータ６１で動かすことにより、ロボット１は所定の動作を行い、また、所定の姿勢を表現する。 The robot 1 has a neck joint 53, a shoulder joint 55, an elbow joint 57, and a wrist joint 59. The degree of freedom of each joint can be set appropriately from 1 to 3. Each joint is operated by the driving force of a joint motor 61 belonging to the motor group 17. By driving an appropriately selected one of the above joints with the joint motor 61, the robot 1 performs a predetermined operation and expresses a predetermined posture.

センサ群１１に属する複数のセンサは、ロボット１における各部の位置、速度、加速度、傾き、関節の角度等を検出する。その検出結果は、入力ユニット２１を介して演算ユニット２７にフィードバックされ、ロボット１の動作制御に用いられる。 The multiple sensors belonging to the sensor group 11 detect the position, speed, acceleration, inclination, joint angles, etc. of each part of the robot 1. The detection results are fed back to the arithmetic unit 27 via the input unit 21 and are used to control the movement of the robot 1.

なお、標準プログラム記憶ユニット２３、人工知能ユニット２５、及び演算ユニット２７は、音声認識ユニット、処理ユニット、及び学習停止ユニットの一例である。人工知能ユニット２５は、学習ユニットの一例である。通信ユニット２６は、記憶ユニット及び学習結果取得ユニットの一例である。入力ユニット２１及び通信ユニット２６は識別情報取得ユニットの一例である。クラウドネットワーク３１は外部の記憶装置の一例である。 The standard program storage unit 23, the artificial intelligence unit 25, and the calculation unit 27 are examples of a voice recognition unit, a processing unit, and a learning stop unit. The artificial intelligence unit 25 is an example of a learning unit. The communication unit 26 is an example of a storage unit and a learning result acquisition unit. The input unit 21 and the communication unit 26 are examples of an identification information acquisition unit. The cloud network 31 is an example of an external storage device.

２．ロボット１が実行する処理
（２－１）プログラム設定処理
制御ユニット３は、標準プログラムと、ＡＩ用プログラムとのうち、使用するプログラムを決めるために、図３に示すプログラム設定処理を実行する。このプログラム設定処理
は、ロボット１の電源がオンであるとき、所定時間ごとに繰り返し実行される。 2. Processing Executed by Robot 1 (2-1) Program Setting Processing In order to determine which of the standard program and the AI program to use, the control unit 3 executes the program setting processing shown in Fig. 3. This program setting processing is executed repeatedly at predetermined time intervals when the power supply of the robot 1 is on.

ステップ１では、その時点でＡＩ用プログラムを使用しているか否かを判断する。ＡＩ用プログラムを使用していない場合（すなわち、標準プログラムを使用している場合）はステップ２に進み、ＡＩ用プログラムを使用している場合はステップ５に進む。 In step 1, it is determined whether or not an AI program is being used at that time. If an AI program is not being used (i.e., the standard program is being used), proceed to step 2; if an AI program is being used, proceed to step 5.

ステップ２では、ユーザの識別情報が入力されたか否かを判断する。識別情報は、例えば、以下の方法で入力することができる。
・端末３３から通信ユニット２６に識別情報を送信する。 In step 2, it is determined whether or not the user's identification information has been input. The identification information can be input, for example, in the following manner.
- The identification information is transmitted from the terminal 33 to the communication unit 26.

・識別情報を表す電磁波（例えば、電波、赤外線等）を通信ユニット２６に送信する。識別情報を表す電磁波は、端末３３から送信してもよいし、固定式の装置（例えば、ビーコン、無線ＬＡＮのアクセスポイント等）から送信してもよい。識別情報を表す電磁波は、定期的に送信してもよいし、ユーザの指示に応じて送信してもよいし、端末３３や固定式の装置等がロボット１を検出することをきっかけとして送信してもよい。 - Electromagnetic waves (e.g., radio waves, infrared rays, etc.) representing the identification information are transmitted to the communication unit 26. The electromagnetic waves representing the identification information may be transmitted from the terminal 33 or from a fixed device (e.g., a beacon, a wireless LAN access point, etc.). The electromagnetic waves representing the identification information may be transmitted periodically, in response to a user's instruction, or when the terminal 33 or a fixed device detects the robot 1.

・ユーザが識別情報の内容を声に出して言う。マイク５がその音声を取得し、音声認識により、識別情報を特定する。
・識別情報を表す一次元バーコードや二次元バーコードをカメラ７で撮影する。 The user speaks out the content of the identification information. The microphone 5 picks up the voice and identifies the identification information through voice recognition.
A one-dimensional or two-dimensional barcode representing identification information is photographed by the camera 7.

・タッチパネル９を用いて識別情報を入力する。
識別情報は、数字や文字で構成されるものであってもよいし、１次元又は２次元の画像（例えば一次元バーコードや二次元バーコード）であってもよいし、ユーザの生体情報（例えば、指紋、身体のいずれかの部位における静脈のパターン、虹彩、顔等）であってもよいし、音声で構成されるものであってもよい。 Use the touch panel 9 to input identification information.
The identification information may be composed of numbers or letters, a one-dimensional or two-dimensional image (e.g., a one-dimensional or two-dimensional barcode), biometric information of the user (e.g., a fingerprint, a vein pattern in any part of the body, iris, face, etc.), or may be composed of voice.

識別情報が入力された場合はステップ３に進み、識別情報が入力されなかった場合は本処理を終了する。
ステップ３では、前記ステップ２で入力されたと判断された識別情報に対応するＡＩ用プログラム及びデータセットを、クラウドネットワーク３１からインストールする。インストールしたＡＩ用プログラム及びデータセットは、人工知能ユニット２５に記憶する。 If the identification information has been input, the process proceeds to step 3, and if the identification information has not been input, the process ends.
In step 3, the AI program and data set corresponding to the identification information determined to have been input in step 2 are installed from the cloud network 31. The installed AI program and data set are stored in the artificial intelligence unit 25.

なお、クラウドネットワーク３１には、識別情報と、ＡＩ用プログラム及びデータセットとが、対応付けられて記憶されている。データセットは、ＡＩ用プログラムの使用、学習等において用いるデータセットであり、音声認識及び音声合成において用いる辞書データを含む。 In addition, the cloud network 31 stores identification information in association with AI programs and data sets. The data sets are used for using and learning AI programs, and include dictionary data used in voice recognition and voice synthesis.

ステップ４では、ロボット１が使用するプログラムを、標準プログラムから、前記ステップ３でインストールしたＡＩ用プログラムに変更する。この時点以降、ロボット１はＡＩ用プログラムを使用する。 In step 4, the program used by robot 1 is changed from the standard program to the AI program installed in step 3. From this point on, robot 1 uses the AI program.

一方、前記ステップ１で肯定判断された場合はステップ５において、ロボット１の使用終了条件が充足されたか否かを判断する。使用終了条件は、例えば、以下のものとすることができる。 On the other hand, if the result of step 1 is positive, then in step 5 it is determined whether the conditions for ending use of the robot 1 have been met. The conditions for ending use can be, for example, as follows:

・所定時間以上、ユーザがロボット１を操作しないこと。
・カメラ７の画像やマイク５で取得した音声においてユーザを認識できない状態が所定時間以上続くこと。 The user does not operate the robot 1 for a predetermined period of time or longer.
A state in which the user cannot be recognized in images from the camera 7 or in audio picked up by the microphone 5 continues for a predetermined period of time or longer.

・ユーザがロボット１に対し、使用終了に該当する入力を行うこと（例えば、ユーザが
「使用終了」と声に出して言う、または、タッチパネル９に使用終了に該当する内容の入力を行う等）。 The user makes an input to the robot 1 indicating that use has ended (for example, the user says out loud "end of use" or inputs information indicating that use has ended on the touch panel 9).

・ロボット１の電源がオフになる。
・予め設定された時刻になる。
使用終了条件が充足された場合はステップ６に進み、使用終了条件が充足されない場合は本処理を終了する。・Robot 1 is powered off.
・The preset time arrives.
If the usage end condition is satisfied, the process proceeds to step 6, and if the usage end condition is not satisfied, the process ends.

ステップ６では、ＡＩ用プログラム及びデータセットを、クラウドネットワーク３１にアップロードする。このとき、ユーザの識別情報と対応付けてアップロードする。なお、後述する学習が行われた場合、アップロードするＡＩ用プログラム及びデータセットは、学習後のものである。 In step 6, the AI program and dataset are uploaded to the cloud network 31. At this time, they are uploaded in association with the user's identification information. Note that if learning, which will be described later, has been performed, the AI program and dataset uploaded will be those after learning.

ステップ７では、人工知能ユニット２５からＡＩ用プログラム及びデータセットを消去する。
ステップ８では、使用するプログラムを、ＡＩ用プログラムから標準プログラムに変更する。この時点以降、ロボット１は標準プログラムを使用する。 In step 7, the AI programs and data sets are erased from the artificial intelligence unit 25.
In step 8, the program to be used is changed from the AI program to the standard program. From this point on, the robot 1 uses the standard program.

（２－２）会話処理
制御ユニット３は、図４に示す会話処理を実行する。この処理は、マイク５が所定の閾値以上の音量を検出したときに実行される。 (2-2) Conversation Processing The control unit 3 executes the conversation processing shown in Fig. 4. This processing is executed when the microphone 5 detects a volume equal to or greater than a predetermined threshold.

ステップ１１では、マイク５を用いて音声を取得する。
ステップ１２では、周知の音声認識技術により、前記ステップ１１で取得した音声の内容を認識する。このとき、標準プログラムを使用している場合は、ロボット１が予め備えている標準辞書データを用いて音声の内容を認識する。また、ＡＩ用プログラムを使用している場合は、ＡＩ用プログラムとともにクラウドネットワーク３１からインストールされ、過去の学習によって強化された辞書データを用いて音声の内容を認識する。 In step 11, the microphone 5 is used to capture voice.
In step 12, the contents of the voice acquired in step 11 are recognized by a well-known voice recognition technology. At this time, if the standard program is used, the contents of the voice are recognized using standard dictionary data that is pre-installed in the robot 1. In addition, if the AI program is used, the contents of the voice are recognized using dictionary data that is installed from the cloud network 31 together with the AI program and that has been reinforced by past learning.

ステップ１３では、前記ステップ１２で認識した音声の内容に対し、回答する音声のデータ（以下では回答音声データとする）を作成する。このとき、標準プログラムを使用している場合は、ロボット１が予め備えている標準辞書データを用いて回答音声データを作成する。また、ＡＩ用プログラムを使用している場合は、ＡＩ用プログラムとともにクラウドネットワーク３１からインストールされ、過去の学習によって強化された辞書データを用いて回答音声データを作成する。 In step 13, data of the voice that responds to the content of the voice recognized in step 12 (hereinafter referred to as response voice data) is created. At this time, if the standard program is being used, the response voice data is created using standard dictionary data that is pre-installed in the robot 1. In addition, if the AI program is being used, the response voice data is created using dictionary data that is installed from the cloud network 31 together with the AI program and that has been strengthened by past learning.

回答音声データの内容は、例えば、前記ステップ１２で認識した音声の内容からキーワードを検出し、そのキーワードに予め対象付けられた事項を辞書データから探し、作成することができる。また、回答音声データの内容は、前記ステップ１２で認識した音声の内容に対し、人工知能を用いて推論したものであってもよい。 The content of the response voice data can be created, for example, by detecting keywords from the content of the voice recognized in step 12, and searching dictionary data for items that are previously targeted by the keywords. The content of the response voice data can also be inferred from the content of the voice recognized in step 12 using artificial intelligence.

ステップ１４では、前記ステップ１３で作成した回答音声データに基づき、スピーカ１５を用いて発音する。すなわち、前記ステップ１１で取得した音声に対する回答を発音する。 In step 14, the speaker 15 is used to pronounce the answer voice data created in step 13. In other words, the answer to the voice acquired in step 11 is pronounced.

ステップ１５では、その時点でＡＩ用プログラムを使用中であるか否かを判断する。ＡＩ用プログラムを使用中である場合はステップ１６に進み、標準プログラムを使用中である場合は本処理を終了する。 In step 15, it is determined whether or not the AI program is being used at that time. If the AI program is being used, the process proceeds to step 16, and if the standard program is being used, the process ends.

ステップ１６では、その時点で学習停止中であるか否かを判断する。なお、学習停止は
、後述する学習停止判断処理により設定される。学習停止中ではない場合はステップ１７に進み、学習停止中である場合は本処理を終了する。 In step 16, it is determined whether learning is stopped at that time. Note that whether learning is stopped is set by a learning stop determination process described later. If learning is not stopped, the process proceeds to step 17, and if learning is stopped, the process ends.

ステップ１７では、その時点で設定されている学習制限内容を取得する。学習制限内容としては、例えば、ユーザ（前記ステップ２で入力されたと判断された識別情報に対応するユーザ）の家族、知人に関する情報（名前、住所、電話番号、メールアドレス、経歴、顔を含む画像）等である。 In step 17, the learning restriction contents set at that time are obtained. The learning restriction contents include, for example, information about the user's (the user corresponding to the identification information determined to have been entered in step 2) family and acquaintances (names, addresses, telephone numbers, email addresses, career history, images including faces), etc.

ステップ１８では、前記ステップ１２で認識した音声の内容について学習を行う。学習としては、例えば、機械学習が挙げられる。機械学習は、教師付き学習であってもよいし、教師無し学習であってもよい。また、学習は、人工無能による学習であってもよい。この場合、前記ステップ１２で認識した音声からキーワードを抽出し、そのキーワードをデータセット（例えば、音声認識に用いる辞書データ）に追加することができる。このキーワードは、例えば、回答音声データを作成する処理（前記ステップ１３）において利用できる。 In step 18, learning is performed on the content of the voice recognized in step 12. An example of learning is machine learning. Machine learning may be supervised learning or unsupervised learning. Furthermore, learning may be learning by artificial intelligence. In this case, keywords may be extracted from the voice recognized in step 12, and the keywords may be added to a data set (e.g., dictionary data used for voice recognition). The keywords may be used, for example, in the process of creating answer voice data (step 13).

ただし、前記ステップ１２で認識した音声の内容であっても、前記ステップ１７で取得した学習制限内容に該当する事項は、学習しないようにする。
ステップ１９では、前記ステップ１８での学習結果を反映するように、ＡＩ用プログラムとデータセットとを更新する。なお、学習結果を反映するように更新されたＡＩ用プログラム及びデータセットは、学習結果の一例である。 However, even if the contents of the voice recognized in step 12 are subject to the learning restriction contents acquired in step 17, they are not to be learned.
In step 19, the AI program and the dataset are updated to reflect the learning results in step 18. The AI program and the dataset updated to reflect the learning results are an example of the learning results.

（２－３）学習停止判断処理
制御ユニット３は、図５に示す学習停止判断処理を所定時間ごとに繰り返し実行する。図５のステップ２１では、ＡＩ用プログラムを使用中であるか否かを判断する。ＡＩ用プログラムを使用中である場合はステップ２２に進み、標準プログラムを使用中である場合は本処理を終了する。 (2-3) Learning stop determination process The control unit 3 repeatedly executes the learning stop determination process shown in Fig. 5 at predetermined time intervals. In step 21 of Fig. 5, it is determined whether or not the AI program is being used. If the AI program is being used, the process proceeds to step 22, and if the standard program is being used, the process ends.

ステップ２２では、ＧＰＳ１３を用いてロボット１の位置情報を取得する。
ステップ２３では、カメラ７を用いて、ロボット１の周囲を撮像した画像を取得する。
ステップ２４では、マイク５を用いて、音声を取得する。 In step 22 , the position information of the robot 1 is acquired using the GPS 13 .
In step 23, an image of the surroundings of the robot 1 is captured using the camera 7.
In step 24, the microphone 5 is used to capture voice.

ステップ２５では、その時点で学習停止中であるか否かを判断する。なお、学習停止の状態は、後述するステップ２８において開始され、後述するステップ３０において学習を再開したときに終了する。学習停止中ではない場合はステップ２６に進み、学習停止中である場合はステップ２９に進む。 In step 25, it is determined whether learning is currently stopped. The state of learning being stopped begins in step 28, which will be described later, and ends when learning is resumed in step 30, which will be described later. If learning is not currently stopped, proceed to step 26; if learning is currently stopped, proceed to step 29.

ステップ２６では、前記ステップ２３で取得した画像、又は前記ステップ２４で取得した音声に、学習停止のきっかけとなるものがあるか否かを判断する。学習停止のきっかけとしては、例えば、以下のものが挙げられる。 In step 26, it is determined whether there is anything in the image acquired in step 23 or the audio acquired in step 24 that would trigger learning to stop. Examples of learning stop triggers include the following:

・前記ステップ２３で取得した画像において、学習停止を指示する動作として予め設定された動作が認識されること。その動作として、例えば、図６Ａに示すように、人差し指を立てて口の前に置く動作が挙げられる。また、他の動作として、ウインクが挙げられる。 - A preset action is recognized in the image acquired in step 23 as an action instructing the user to stop learning. One example of such an action is placing the index finger upright in front of the mouth, as shown in FIG. 6A. Another example of such an action is a wink.

・前記ステップ２４で取得した音声において、学習停止を指示するキーワードとして予め設定された音声が認識されること。そのキーワードとして、例えば、「秘密」、「オフレコ」、「プライベート」等が挙げられる。また、別のキーワードとして、例えば、ユーザの家族や知人の名前等が挙げられる。ユーザは、家族や知人の名前をキーワードとして
予めロボット１に登録しておくことができる。また、ロボット１が、過去に認識した音声データに基づき、どの言葉がユーザの家族や知人の名前であるかを推論し、ユーザの家族や知人の名前であると推論した言葉をキーワードとして登録してもよい。 In the voice acquired in step 24, a voice that is preset as a keyword instructing the user to stop learning is recognized. Examples of such keywords include "secret,""off the record," and "private." Other examples of such keywords include the names of the user's family members or acquaintances. The user can register the names of family members or acquaintances in the robot 1 as keywords in advance. The robot 1 may also infer which words are the names of the user's family members or acquaintances based on previously recognized voice data, and register the words that it infers are the names of the user's family members or acquaintances as keywords.

・前記ステップ２３で取得した画像において、予め登録された人の顔が認識されること。この人としては、例えば、ユーザの家族、知人等が挙げられる。なお、ユーザは、学習停止のきっかけとする人の顔画像を予め登録しておくことができる。 - The face of a person registered in advance is recognized in the image acquired in step 23. This person may be, for example, the user's family or acquaintances. The user can register in advance a facial image of a person that will trigger the learning to stop.

学習停止のきっかけがある場合はステップ２８に進み、学習停止のきっかけがない場合はステップ２７に進む。
ステップ２７では、前記ステップ２２で取得した位置情報が、学習を停止するべき位置として予め登録された位置に該当するか否かを判断する。学習を停止するべき位置としては、例えば、ユーザの自宅、会議室等が挙げられる。 If there is a trigger to stop learning, the process proceeds to step 28; if there is no trigger to stop learning, the process proceeds to step 27.
In step 27, it is determined whether the location information acquired in step 22 corresponds to a location registered in advance as a location where learning should be stopped. Examples of the location where learning should be stopped include the user's home, a conference room, etc.

ユーザは、学習を停止するべき位置を予め登録しておくことができる。また、ロボット１が、過去のデータに基づき、学習を停止するべき場所を推論し、推論した場所を登録することができる。例えば、ロボット１は、学習停止のきっかけが過去に認識された場所を、学習を停止するべき場所として推論することができる。 The user can register in advance the location where learning should be stopped. Also, the robot 1 can infer the location where learning should be stopped based on past data, and register the inferred location. For example, the robot 1 can infer that the location where learning should be stopped is the location where learning was previously recognized as a trigger for stopping learning.

ステップ２８では、学習停止の状態を開始する。この時点以降、前記会話処理における前記ステップ１６では、肯定判断がなされ、前記ステップ１８での学習が行われない。
一方、前記ステップ２５で肯定判断された場合はステップ２９にて、前記ステップ２３で取得した画像、又は前記ステップ２４で取得した音声に、学習を再開するきっかけとなるものがあるか否かを判断する。学習再開のきっかけとしては、例えば、以下のものが挙げられる。 A learning stop state is initiated in step 28. From this point onwards, a positive determination is made in step 16 in the conversation process, and learning in step 18 is not performed.
On the other hand, if the answer is YES in step 25, then in step 29 it is determined whether or not there is anything in the image acquired in step 23 or the audio acquired in step 24 that may be a trigger for restarting learning. Examples of triggers for restarting learning include the following:

・前記ステップ２３で取得した画像において、学習再開を指示する動作として予め設定された動作が認識されること。その動作として、例えば、図６Ｂに示すように、親指と人差し指とで輪を作る（いわゆるＯＫを示す）動作が挙げられる。 - A preset action is recognized in the image acquired in step 23 as an action instructing the user to resume learning. For example, as shown in FIG. 6B, one such action is making a circle with the thumb and index finger (indicating "OK").

・前記ステップ２４で取得した音声において、学習再開を指示するキーワードとして予め設定された音声が認識されること。そのキーワードとして、例えば、「ＯＫ」、「再開」、「学習」等が挙げられる。 - In the voice acquired in step 24, a voice that has been set in advance as a keyword instructing the user to resume learning is recognized. Examples of such keywords include "OK," "resume," and "learn."

学習再開のきっかけがある場合はステップ３０に進み、学習再開のきっかけがない場合は本処理を終了する。
ステップ３０では、学習を再開する。この時点以降、前記会話処理における前記ステップ１６では否定判断がなされ、前記ステップ１８の学習が行われる。 If there is a trigger to restart learning, the process proceeds to step 30, and if there is no trigger to restart learning, the process ends.
Learning is resumed in step 30. From this point on, a negative determination is made in step 16 in the conversation process, and learning in step 18 is performed.

（２－４）スケジュール管理処理
ロボット１は、以下に示すスケジュール管理処理を実行することができる。ユーザは予め自らのスケジュール情報を、ロボット１に入力しておく。スケジュール情報の入力は、例えば、タッチパネル９を用いて行ってもよいし、音声入力により行ってもよい。また、端末３３からスケジュール情報を通信ユニット２６に送信してもよい。 (2-4) Schedule Management Process The robot 1 can execute the schedule management process shown below. The user inputs his/her own schedule information to the robot 1 in advance. The schedule information may be input, for example, using the touch panel 9 or by voice input. In addition, the schedule information may be transmitted from the terminal 33 to the communication unit 26.

スケジュール情報は、少なくとも、期日と、その期日までに実行すべき事項とを含む。ロボット１は、期日よりも所定時間（例えば、１日、３時間）前の時点で、マイク５により取得した音声、カメラ７により取得した画像、端末３３から取得した情報等に基づき、ユーザがスケジュール情報に含まれる事項を実行済みであるか否かを判断し、未だ実行していない場合は、スピーカ１５の音声、又はディスプレイ１９に表示する画像により、ユ
ーザに警告する。 The schedule information includes at least a due date and items to be performed by that due date. The robot 1 determines whether or not the user has performed the items included in the schedule information based on the voice captured by the microphone 5, the image captured by the camera 7, the information captured from the terminal 33, etc., at a time point a predetermined time (e.g., one day, three hours) before the due date. If the items have not been performed yet, the robot 1 warns the user by a voice from the speaker 15 or an image displayed on the display 19.

３．ロボット１が奏する効果
（１Ａ）ロボット１は、ＡＩ用プログラムを、学習により変化、発展させることができる。また、ロボット１は、データセット（例えば辞書データ）の内容を、学習により強化することができる。そして、ロボット１は、学習後のＡＩ用プログラム及びデータセットを、クラウドネットワーク３１にアップロードすることができる。また、ロボット１は、クラウドネットワーク３１にアップロードされたＡＩ用プログラム及びデータセットを、インストールすることができる。 3. Effects of the Robot 1 (1A) The robot 1 can change and develop the AI program through learning. The robot 1 can also strengthen the contents of a data set (e.g., dictionary data) through learning. The robot 1 can then upload the AI program and data set after learning to the cloud network 31. The robot 1 can also install the AI program and data set uploaded to the cloud network 31.

学習後のＡＩ用プログラム及びデータセットをインストールするロボット１は、過去に学習を行ったロボット１と同じであっても、異なっていてもよい。また、学習後のＡＩ用プログラム及びデータセットをインストールするロボット１は、過去に学習を行った場所にあるものであってもよいし、異なる場所にあるものであってもよい。 The robot 1 on which the learned AI program and data set are installed may be the same as the robot 1 that previously performed learning, or it may be different. Furthermore, the robot 1 on which the learned AI program and data set are installed may be located in the same place where learning was previously performed, or it may be located in a different place.

よって、ユーザは、過去の自らの使用よって学習したＡＩ用プログラム及びデータセットを、ユーザがその時点でいる場所（例えば、職場、店舗、自宅等様々な場所）のロボット１に、クラウドネットワーク３１からインストールし、ロボット１を使用することができる。 Therefore, the user can install the AI program and data set that have been learned from the user's past use from the cloud network 31 onto the robot 1 at the location where the user is at the time (e.g., various locations such as the workplace, a store, or home), and use the robot 1.

（１Ｂ）ロボット１は、ユーザの識別情報が入力されることを条件として、その識別情報に対応するＡＩ用プログラム及びデータセットのインストールを許容する。そのため、あるユーザのＡＩ用プログラム及びデータセットを、他人が勝手に使用してしまうことを抑制できる。 (1B) The robot 1 allows the installation of an AI program and a data set corresponding to a user's identification information, provided that the user's identification information is input. This prevents a user's AI program and data set from being used by another person without permission.

（１Ｃ）ロボット１は、人の音声を認識し、それに対する回答の音声を発音することができる。すなわち、ロボット１は人と会話をすることができる。また、ロボット１は、ＡＩ用プログラムを使用している場合、音声の認識結果に基づき学習を行い、その学習結果を用いて回答の音声を作成するので、学習が進むほど、より高度な会話を行うことができる。 (1C) Robot 1 can recognize human voices and produce a speech response to them. In other words, robot 1 can converse with people. Furthermore, when using an AI program, robot 1 learns based on the results of voice recognition and uses the learning results to create a speech response, so the more it learns, the more advanced the conversations it can have.

（１Ｄ）ロボット１は、その周囲にいる人の動作や、人の識別結果等に応じて、学習を停止する。そのため、ユーザにとって望ましくない事項をロボット１が学習し、後に他人に話してしまうことを抑制できる。 (1D) The robot 1 stops learning depending on the actions of people around it, the results of identifying people, etc. This prevents the robot 1 from learning things that are undesirable for the user and then telling others about them later.

（１Ｅ）ロボット１は、それが存在する場所に応じて、学習を停止する。そのため、学習してほしくない場所で学習した事項を、ロボット１が後に他人に話してしまうことを抑制できる。 (1E) The robot 1 stops learning depending on the location where it is located. This prevents the robot 1 from telling others about things it learned in a location where it is not desired to learn.

（１Ｆ）ロボット１は、その周囲にいる人の動作等に応じて、学習を再開する。そのため、ロボット１の学習を促進することができる。
（１Ｇ）ロボット１は、学習制限内容に該当する事項を学習しない。そのため、ユーザにとって望ましくない事項をロボット１が学習し、後に他人に話してしまうことを抑制できる。 (1F) The robot 1 resumes learning in response to the actions of people around it, etc. This can promote the learning of the robot 1.
(1G) The robot 1 does not learn any matter that falls under the learning restriction contents. This prevents the robot 1 from learning any matter that is undesirable for the user and then telling others about it later.

（１Ｈ）ロボット１は、ユーザのスケジュール管理を行うことができる。
（１Ｉ）ロボット１は、ロボット１の使用終了条件が充足された場合、ＡＩ用プログラム及びデータセットを消去する。そのため、ユーザは、自分のＡＩ用プログラム及びデータセットを後で他人が使用することを抑制できる。
＜第２の実施形態＞
１．コンピュータ１０１の構成
コンピュータ１０１の構成を図７、図８に基づき説明する。コンピュータ１０１の電気的構成は、前記第１の実施形態におけるロボット１と基本的に同じである。ただし、コンピュータ１０１における入力ユニット２１は、外部のマイク５、カメラ７、キーボード６３、マウス６５、タッチパネル９と接続している。また、出力ユニット２９は、外部のスピーカ１５、及びディスプレイ１９と接続している。 (1H) The robot 1 can manage the user's schedule.
(1I) When the conditions for ending the use of the robot 1 are satisfied, the robot 1 erases the AI program and data set. This allows the user to prevent others from using the user's AI program and data set later.
Second Embodiment
1. Configuration of the Computer 101 The configuration of the computer 101 will be described with reference to Figures 7 and 8. The electrical configuration of the computer 101 is basically the same as that of the robot 1 in the first embodiment. However, the input unit 21 in the computer 101 is connected to an external microphone 5, camera 7, keyboard 63, mouse 65, and touch panel 9. In addition, the output unit 29 is connected to an external speaker 15 and display 19.

コンピュータ１０１は、図８に示すように、箱型の筐体６７を備え、その内部に各構成を収容している。また、コンピュータ１０１は、外部の機器（例えば、マイク５、カメラ７、キーボード６３、マウス６５、タッチパネル９、スピーカ１５、ディスプレイ１９等）を接続可能な端子６９を複数備えている。なお、コンピュータ１０１は広義でのロボットである。 As shown in FIG. 8, the computer 101 has a box-shaped housing 67 that houses each component. The computer 101 also has multiple terminals 69 to which external devices (e.g., microphone 5, camera 7, keyboard 63, mouse 65, touch panel 9, speaker 15, display 19, etc.) can be connected. The computer 101 is a robot in the broad sense of the word.

２．コンピュータ１０１が実行する処理
コンピュータ１０１は、前記第１の実施形態のロボット１と同様に、プログラム設定処理、会話処理、学習停止判断処理、及びスケジュール管理処理を実行する。また、コンピュータ１０１は、周知のコンピュータと同様の機能を有する。 2. Processing Executed by the Computer 101 The computer 101 executes a program setting process, a conversation process, a learning stop determination process, and a schedule management process, similar to the robot 1 of the first embodiment. The computer 101 also has the same functions as a well-known computer.

３．コンピュータ１０１が奏する効果
コンピュータ１０１は、前記（１Ａ）～（１Ｉ）の効果を奏する。
＜第３の実施形態＞
１．車載機２０１の構成
車載機２０１の構成を図９に基づき説明する。車載機２０１は車両に搭載される。車載機２０１の電気的構成は、前記第２の実施形態におけるコンピュータ１０１と基本的に同じである。ただし、車載機２０１における入力ユニット２１は、車両に設けられたマイク５、カメラ７、タッチパネル９、センサ群１１、及びＧＰＳ１３と接続している。 3. Effects of the Computer 101 The computer 101 has the effects (1A) to (1I) described above.
Third Embodiment
1. Configuration of the on-vehicle device 201 The configuration of the on-vehicle device 201 will be described with reference to Fig. 9. The on-vehicle device 201 is mounted on a vehicle. The electrical configuration of the on-vehicle device 201 is basically the same as that of the computer 101 in the second embodiment. However, the input unit 21 in the on-vehicle device 201 is connected to the microphone 5, camera 7, touch panel 9, sensor group 11, and GPS 13 provided in the vehicle.

マイク５は車両の車室内に設けられ、車両の乗員（ドライバ、又は他の乗員）の声を検出する。カメラ７は乗員を撮影する。タッチパネル９は車両の車室内に設けられ、乗員により操作される。センサ群１１は、ドライバの運転操作（操舵角、アクセルの踏み込み量、ブレーキの踏み込み量、シフト位置等）と、車両の状態（速度、加速度、ヨーレート、パワーユニット（内燃機関、モータ等）の状態、燃料の残量、バッテリーの残量等）とを検出する。 The microphone 5 is installed inside the vehicle's cabin and detects the voices of the vehicle occupants (driver or other passengers). The camera 7 photographs the occupants. The touch panel 9 is installed inside the vehicle's cabin and is operated by the occupants. The sensor group 11 detects the driver's driving operations (steering angle, accelerator depression amount, brake depression amount, shift position, etc.) and the vehicle's state (speed, acceleration, yaw rate, state of the power unit (internal combustion engine, motor, etc.), remaining fuel, remaining battery, etc.).

また、出力ユニット２９は、スピーカ１５、ディスプレイ１９、及び車両制御部７１と接続している。スピーカ１５及びディスプレイ１９は車両の車室内に設けられている。車両制御部７１は、車両に関する様々な制御（例えば、操舵、加速、減速、シフトチェンジ等）を行う。なお、車載機２０１は広義でのロボットである。 The output unit 29 is also connected to the speaker 15, the display 19, and the vehicle control unit 71. The speaker 15 and the display 19 are provided inside the vehicle cabin. The vehicle control unit 71 performs various controls related to the vehicle (e.g., steering, acceleration, deceleration, shift changes, etc.). The vehicle-mounted device 201 is a robot in the broad sense.

２．車載機２０１が実行する処理
（２－１）ロボット１と同様の処理
車載機２０１は、前記第１の実施形態のロボット１と同様に、プログラム設定処理、会話処理、学習停止判断処理、及びスケジュール管理処理を実行する。 2. Processing Executed by the Vehicle-Mounted Unit 201 (2-1) Processing Similar to That of the Robot 1 The vehicle-mounted unit 201 executes program setting processing, conversation processing, learning stop determination processing, and schedule management processing, similar to the robot 1 of the first embodiment.

（２－２）車両制御処理
車載機２０１の制御ユニット３は、図１０に示す車両制御処理を所定時間ごとに繰り返し実行する。この処理は、マイク５が所定の閾値以上の音量を検出したときに実行される。 (2-2) Vehicle Control Processing The control unit 3 of the vehicle-mounted device 201 repeatedly executes the vehicle control processing shown in Fig. 10 at predetermined time intervals. This processing is executed when the microphone 5 detects a volume equal to or greater than a predetermined threshold.

ステップ３１では、マイク５を用いて音声を取得する。ステップ３２では、周知の音声
認識技術により、前記ステップ３１で取得した音声から、車両に対する指示（例えば、発進、停止、減速、加速、右左折、レーンチェンジ、シフトチェンジ等）を認識する。このとき、標準プログラムを使用している場合は、車載機２０１が予め備えている標準辞書データを用いて上記の指示を認識する。また、ＡＩ用プログラムを使用している場合は、ＡＩ用プログラムとともにクラウドネットワーク３１からインストールされ、過去の学習によって強化された辞書データを用いて上記の指示を認識する。 In step 31, voice is acquired using the microphone 5. In step 32, instructions to the vehicle (e.g., starting, stopping, decelerating, accelerating, turning right or left, lane changing, shifting, etc.) are recognized from the voice acquired in step 31 using a well-known voice recognition technology. At this time, if a standard program is being used, the above instructions are recognized using standard dictionary data that is pre-installed in the in-vehicle device 201. Also, if an AI program is being used, the above instructions are recognized using dictionary data that is installed from the cloud network 31 together with the AI program and that has been strengthened by past learning.

ステップ３３では、前記ステップ３２で認識した、車両対する指示に応じて、車両の制御内容を決定する。例えば、車両に対する指示が発進である場合、ブレーキを解除するタイミング、エンジン回転数の増加量及び増加速度等を決定する。このとき、標準プログラムを使用している場合は、標準プログラムを用いて車両の制御内容を決定する。また、ＡＩ用プログラムを使用している場合は、過去の学習によって進化したＡＩ用プログラムを用いて制御内容を決定する。 In step 33, the vehicle control details are determined according to the instruction for the vehicle recognized in step 32. For example, if the instruction for the vehicle is to start, the timing for releasing the brakes, the amount of increase in engine speed and the rate of increase, etc. are determined. At this time, if a standard program is being used, the vehicle control details are determined using the standard program. Also, if an AI program is being used, the control details are determined using an AI program that has evolved through past learning.

ステップ３４では、前記ステップ３３で決定した制御内容を車両制御部７１に出力する。なお、車両制御部７１は、その制御内容に従って車両を制御する。
ステップ３５では、その時点でＡＩ用プログラムを使用中であるか否かを判断する。ＡＩ用プログラムを使用中である場合はステップ３６に進み、標準プログラムを使用中である場合は本処理を終了する。 In step 34, the control contents determined in step 33 are output to the vehicle control unit 71. The vehicle control unit 71 controls the vehicle in accordance with the control contents.
In step 35, it is determined whether or not the AI program is being used at that time. If the AI program is being used, the process proceeds to step 36, and if the standard program is being used, the process ends.

ステップ３６では、センサ群１１から、車両の状態に関する検出結果を取得する。
ステップ３７では、前記ステップ３４で出力した制御内容と、前記ステップ３６で取得したセンサ出力とに基づき学習を行う。その学習は、前記ステップ３６で取得したセンサ出力が予め設定された最適範囲となるように、制御内容を修正するものである。 In step 36, detection results relating to the state of the vehicle are obtained from the sensor group 11.
In step 37, learning is performed based on the control content output in step 34 and the sensor output obtained in step 36. This learning is to modify the control content so that the sensor output obtained in step 36 falls within a preset optimum range.

例えば、前記ステップ３４で出力した制御内容が発進であった場合、発進の過程におけるセンサ出力（例えば、速度、加速度等のセンサ出力）が最適範囲であったか否かを確認し、最適範囲から外れていたならば、次回以降の発進時におけるセンサ出力が最適範囲に近づくように、発進の制御内容を修正する。 For example, if the control content output in step 34 was starting, it is confirmed whether the sensor output during the starting process (e.g., sensor output of speed, acceleration, etc.) was within the optimal range, and if it was outside the optimal range, the control content for starting is modified so that the sensor output for the next and subsequent starts approaches the optimal range.

ステップ３８では、前記ステップ３７での学習結果を反映するように、ＡＩ用プログラムを更新する。
３．車載機２０１が奏する効果
車載機２０１は、前記（１Ａ）～（１Ｉ）の効果を奏する。さらに、車載機２０１は次の効果も奏する。 In step 38, the AI program is updated to reflect the learning results in step 37.
3. Effects of the Vehicle-Mounted Device 201 The vehicle-mounted device 201 provides the effects (1A) to (1I) described above. In addition, the vehicle-mounted device 201 also provides the following effects.

（３Ａ）車載機２０１は、人の音声を認識し、それに対応する車両制御を行うことができる。また、車載機２０１は、ＡＩ用プログラムを使用している場合、車両制御部７１に出力した車両制御の内容と、車両の状態を表すセンサ出力とに基づき学習を行うので、学習が進むほど、より高度な車両制御を行うことができる。 (3A) The vehicle-mounted device 201 can recognize human voices and perform corresponding vehicle control. In addition, when the vehicle-mounted device 201 is using an AI program, the vehicle-mounted device 201 learns based on the vehicle control content output to the vehicle control unit 71 and the sensor output indicating the vehicle state, so that the more the learning progresses, the more advanced the vehicle control can be performed.

（３Ｂ）車載機２０１は、車両制御処理に関し、ＡＩ用プログラムを、学習により変化、発展させることができる。また、車載機２０１は、クラウドネットワーク３１にアップロードされたＡＩ用プログラムを、インストールすることができる。 (3B) The vehicle-mounted device 201 can change and develop the AI program for vehicle control processing through learning. The vehicle-mounted device 201 can also install the AI program uploaded to the cloud network 31.

学習後のＡＩ用プログラムをインストールする車載機２０１は、過去に学習を行った車載機２０１と同じであっても、異なっていてもよい。
よって、ユーザは、過去の自らの使用よって学習したＡＩ用プログラムを、任意の車両の車載機２０１に、クラウドネットワーク３１からインストールし、使用することができる。
＜第４の実施形態＞
１．ロボット３０１の構成
本実施形態のロボット３０１の構成は、基本的には前記第１の実施形態のロボット１と同様である。以下では、第１の実施形態との相違点を中心に説明する。ロボット３０１は、図１１に示すように、第１の人工知能ユニット７３と、第２の人工知能ユニット７５とを備えている。 The vehicle-mounted device 201 into which the learned AI program is installed may be the same as the vehicle-mounted device 201 that performed learning in the past, or it may be different.
Therefore, a user can install and use an AI program that has been learned through his or her own past use in the onboard device 201 of any vehicle from the cloud network 31.
Fourth Embodiment
1. Configuration of Robot 301 The configuration of the robot 301 of this embodiment is basically the same as that of the robot 1 of the first embodiment. The following mainly describes the differences from the first embodiment. As shown in FIG. 11, the robot 301 includes a first artificial intelligence unit 73 and a second artificial intelligence unit 75.

第１の人工知能ユニット７３は、ロボット３０１の各種動作を実行するためのＡＩ用プログラム及びデータセットを記憶している。以下では、このプログラムを第１のＡＩ用プログラムとし、このデータセットを第１のデータセットとする。 The first artificial intelligence unit 73 stores an AI program and a data set for executing various operations of the robot 301. Hereinafter, this program will be referred to as the first AI program, and this data set will be referred to as the first data set.

第１人工知能ユニット７３は、第１のＡＩ用プログラム及び第１のデータセットを学習により変化、発展させる。第１の人工知能ユニット７３が行う学習は、後述する学習停止、及び学習制限に影響されない。また、第１のＡＩ用プログラム及び第１のデータセットは、クラウドネットワーク３１にアップロードされることはない。 The first artificial intelligence unit 73 changes and develops the first AI program and the first data set through learning. The learning performed by the first artificial intelligence unit 73 is not affected by the learning stop and learning restrictions described below. In addition, the first AI program and the first data set are not uploaded to the cloud network 31.

第２の人工知能ユニット７５は、ＡＩ用プログラム及びデータセットを記憶可能である。以下では、このプログラムを第２のＡＩ用プログラムとし、このデータセットを第２のデータセットとする。 The second artificial intelligence unit 75 is capable of storing an AI program and a dataset. Hereinafter, this program will be referred to as the second AI program, and this dataset will be referred to as the second dataset.

第２のＡＩ用プログラムは、基本的には第１のＡＩ用プログラムと同様であるが、ロボット３０１の各種動作を実行するためには使用されない。第２人工知能ユニット７５は、第２のＡＩ用プログラム及び第２のデータセットを学習により変化、発展させる。第２のＡＩ用プログラム及び第２のデータセットは、学習結果を蓄積し、クラウドネットワーク３１にアップロードされる。そのことにより、学習結果をクラウドネットワーク３１に記憶することができる。 The second AI program is basically the same as the first AI program, but is not used to execute various operations of the robot 301. The second artificial intelligence unit 75 changes and develops the second AI program and the second data set through learning. The second AI program and the second data set accumulate the learning results and are uploaded to the cloud network 31. This allows the learning results to be stored in the cloud network 31.

ただし、第２の人工知能ユニット７５が行う学習は、後述する学習停止、及び学習制限により制限される。そのため、第２のＡＩ用プログラム及び第２のデータセットのアップロードによってクラウドネットワーク３１に記憶される学習結果は制限される。 However, the learning performed by the second artificial intelligence unit 75 is limited by the learning stop and learning limit described below. Therefore, the learning results stored in the cloud network 31 by uploading the second AI program and the second data set are limited.

第２のＡＩ用プログラム及び第２のデータセットは、クラウドネットワーク３１から第２の人工知能ユニット７５にダウンロードすることが可能である。そして、ダウンロードされた第２のＡＩ用プログラム及び第２のデータセットに基づき、第１のＡＩ用プログラム及び第１のデータセットの学習を行うことができる。詳しくは後述する。 The second AI program and the second data set can be downloaded from the cloud network 31 to the second artificial intelligence unit 75. Then, based on the downloaded second AI program and the second data set, learning of the first AI program and the first data set can be performed. Details will be described later.

２．ロボット３０１が実行する処理
（２－１）プログラムインストール処理
制御ユニット３は、ロボット３０１の電源がオンになったとき、図１２に示すプログラムインストール処理を実行する。 2. Processing Executed by the Robot 301 (2-1) Program Installation Processing When the power supply of the robot 301 is turned on, the control unit 3 executes a program installation processing shown in FIG.

ステップ４１では、ユーザの識別情報が入力されたか否かを判断する。識別情報は、前記第１の実施形態と同様のものとすることができる。識別情報が入力された場合はステップ４２に進み、識別情報が入力されなかった場合は本処理を終了する。 In step 41, it is determined whether or not the user's identification information has been input. The identification information may be the same as that in the first embodiment. If the identification information has been input, the process proceeds to step 42; if the identification information has not been input, the process ends.

ステップ４２では、前記ステップ４１で入力されたと判断された識別情報に対応する第２のＡＩ用プログラム及び第２のデータセットを、クラウドネットワーク３１からインストールする。インストールした第２のＡＩ用プログラム及び第２のデータセットは、第２の人工知能ユニット７５に記憶する。第２の人工知能ユニット７５に既に第２のＡＩ用プログラム及び第２のデータセットが記憶されていた場合は上書きする。 In step 42, the second AI program and the second data set corresponding to the identification information determined to have been input in step 41 are installed from the cloud network 31. The installed second AI program and the second data set are stored in the second artificial intelligence unit 75. If the second AI program and the second data set are already stored in the second artificial intelligence unit 75, they are overwritten.

ステップ４３では、前記ステップ４２でインストールした第２のＡＩ用プログラム及び第２のデータセットには含まれているが、第１のＡＩ用プログラム及び第１のデータセットには記憶されていない内容を学習する。 In step 43, the content that is included in the second AI program and the second data set installed in step 42 but is not stored in the first AI program and the first data set is learned.

ステップ４４では、第１のＡＩ用プログラム及び第１のデータセットに、前記ステップ４３で学習した内容を加え、更新する。すなわち、ダウンロードされた第２のＡＩ用プログラム及び第２のデータセットに基づき、第１のＡＩ用プログラム及び第１のデータセットの学習を行う。 In step 44, the first AI program and the first data set are updated by adding the contents learned in step 43. In other words, the first AI program and the first data set are learned based on the downloaded second AI program and second data set.

（２－２）会話処理
制御ユニット３は、図１３に示す会話処理を実行する。この処理は、マイク５が所定の閾値以上の音量を検出したときに実行される。会話処理は、第１の人工知能ユニット７３に記憶されている第１のＡＩ用プログラム及び第１のデータセットを用いて行われる。 (2-2) Conversation Processing The control unit 3 executes the conversation processing shown in Fig. 13. This processing is executed when the microphone 5 detects a volume equal to or greater than a predetermined threshold. The conversation processing is performed using a first AI program and a first data set stored in the first artificial intelligence unit 73.

ステップ５１～５４の処理は、前記第１の実施形態におけるステップ１１～１４の処理と同様である。
ステップ５５では、第１の人工知能ユニット７３が、前記ステップ５２で認識した音声の内容について学習を行う。学習の内容は、前記第１の実施形態と同様である。 The processing in steps 51 to 54 is similar to the processing in steps 11 to 14 in the first embodiment.
In step 55, the first artificial intelligence unit 73 learns about the contents of the voice recognized in step 52. The contents of the learning are the same as those in the first embodiment.

ステップ５６では、前記ステップ５５での学習結果を反映するように、第１のＡＩ用プログラム及び第１のデータセットを更新する。
ステップ５７では、第２の人工知能ユニット７５がその時点で学習停止中であるか否かを判断する。なお、第２の人工知能ユニット７５の学習停止は、後述する、第２の人工知能ユニット学習停止判断処理により設定される。第２の人工知能ユニット７５が学習停止中ではない場合はステップ５８に進み、学習停止中である場合は本処理を終了する。 In step 56, the first AI program and the first data set are updated to reflect the learning results in step 55.
In step 57, it is determined whether or not the second artificial intelligence unit 75 is currently stopping learning. The stopping of learning of the second artificial intelligence unit 75 is set by a second artificial intelligence unit learning stop determination process, which will be described later. If the second artificial intelligence unit 75 is not currently stopping learning, the process proceeds to step 58, and if learning is currently stopping, the process ends.

ステップ５８では、その時点で設定されている学習制限内容を取得する。学習制限内容としては、例えば、ユーザ（前記ステップ４１で入力されたと判断された識別情報に対応するユーザ）の家族、知人に関する情報（名前、住所、電話番号、メールアドレス、経歴、顔を含む画像）等である。 In step 58, the learning restriction contents set at that time are obtained. The learning restriction contents include, for example, information about the family and acquaintances of the user (the user corresponding to the identification information determined to have been entered in step 41) (names, addresses, telephone numbers, email addresses, career history, images including faces), etc.

ステップ５９では、第２の人工知能ユニット７５が、前記ステップ５２で認識した音声の内容について学習を行う。学習の内容は、前記第１の実施形態と同様である。ただし、前記ステップ５２で認識した音声の内容であっても、前記ステップ５８で取得した学習制限内容に該当する事項は、学習しないようにする。 In step 59, the second artificial intelligence unit 75 learns about the contents of the voice recognized in step 52. The contents of the learning are the same as those in the first embodiment. However, even if the contents of the voice recognized in step 52 are subject to the learning restriction contents acquired in step 58, they are not learned.

ステップ６０では、前記ステップ５９での学習結果を反映するように、第２のＡＩ用プログラム及び第２のデータセットを更新する。
（２－３）第２の人工知能ユニット学習停止判断処理
制御ユニット３は、図１４に示す第２の人工知能ユニット学習停止判断処理を所定時間ごとに繰り返し実行する。 In step 60, the second AI program and the second data set are updated to reflect the learning results in step 59.
(2-3) Processing for Determining Whether to Stop Learning of the Second Artificial Intelligence Unit The control unit 3 repeatedly executes the processing for determining whether to stop learning of the second artificial intelligence unit shown in FIG. 14 at predetermined time intervals.

ステップ７１では、ＧＰＳ１３を用いてロボット３０１の位置情報を取得する。
ステップ７２では、カメラ７を用いて、ロボット３０１の周囲を撮像した画像を取得する。 In step 71 , the position information of the robot 301 is acquired using the GPS 13 .
In step 72, the camera 7 is used to capture an image of the surroundings of the robot 301.

ステップ７３では、マイク５を用いて、音声を取得する。
ステップ７４では、第２の人工知能ユニット７５がその時点で学習停止中であるか否かを判断する。なお、学習停止の状態は、後述するステップ７７において開始され、後述す
るステップ７９において学習を再開したときに終了する。学習停止中ではない場合はステップ７５に進み、学習停止中である場合はステップ７８に進む。 In step 73, the microphone 5 is used to obtain voice.
In step 74, it is determined whether or not the second artificial intelligence unit 75 is currently halted from learning. The halted learning state is started in step 77, which will be described later, and ends when learning is resumed in step 79, which will be described later. If learning is not currently halted, the process proceeds to step 75. If learning is currently halted, the process proceeds to step 78.

ステップ７５では、前記ステップ７２で取得した画像、又は前記ステップ７３で取得した音声に、学習停止のきっかけとなるものがあるか否かを判断する。学習停止のきっかけは、前記第１の実施形態と同様である。学習停止のきっかけがある場合はステップ７７に進み、学習停止のきっかけがない場合はステップ７６に進む。 In step 75, it is determined whether or not there is a trigger for stopping learning in the image acquired in step 72 or the audio acquired in step 73. The trigger for stopping learning is the same as in the first embodiment. If there is a trigger for stopping learning, proceed to step 77, and if there is no trigger for stopping learning, proceed to step 76.

ステップ７６では、前記ステップ７１で取得した位置情報が、学習を停止するべき位置として予め登録された位置に該当するか否かを判断する。学習を停止するべき位置は、前記第１の実施形態と同様である。 In step 76, it is determined whether the location information acquired in step 71 corresponds to a location that has been registered in advance as a location where learning should be stopped. The location where learning should be stopped is the same as in the first embodiment.

ステップ７７では、第２の人工知能ユニット７５について学習停止の状態を開始する。この時点以降、前記会話処理における前記ステップ５７では、肯定判断がなされ、前記ステップ５９での学習が行われない。 In step 77, the second artificial intelligence unit 75 enters a learning stop state. From this point onwards, a positive judgment is made in step 57 in the conversation process, and learning is not performed in step 59.

一方、前記ステップ７４で肯定判断された場合はステップ７８にて、前記ステップ７２で取得した画像、又は前記ステップ７３で取得した音声に、学習を再開するきっかけとなるものがあるか否かを判断する。学習再開のきっかけは、前記第１の実施形態と同様である。学習再開のきっかけがある場合はステップ７９に進み、学習再開のきっかけがない場合は本処理を終了する。 On the other hand, if the determination in step 74 is affirmative, then in step 78, it is determined whether or not there is a trigger for restarting learning in the image acquired in step 72 or the audio acquired in step 73. The trigger for restarting learning is the same as in the first embodiment. If there is a trigger for restarting learning, the process proceeds to step 79, and if there is no trigger for restarting learning, the process ends.

ステップ７９では、第２の人工知能ユニット７５における学習を再開する。この時点以降、前記会話処理における前記ステップ５７では否定判断がなされ、前記ステップ５９の学習が行われる。 In step 79, learning in the second artificial intelligence unit 75 is resumed. From this point onwards, a negative judgment is made in step 57 in the conversation process, and learning in step 59 is carried out.

（２－４）プログラムアップロード処理
制御ユニット３は、図１５に示すプログラムアップロード処理を所定時間ごとに繰り返し実行する。 (2-4) Program Upload Processing The control unit 3 repeatedly executes the program upload processing shown in FIG. 15 at predetermined time intervals.

ステップ８１では、ロボット３０１の使用終了条件が充足されたか否かを判断する。使用終了条件は、前記第１の実施形態の前記ステップ５で判断するものと同様である。使用終了条件が充足された場合はステップ８２に進み、使用終了条件が充足されない場合は本処理を終了する。 In step 81, it is determined whether the usage end condition of the robot 301 has been satisfied. The usage end condition is the same as that determined in step 5 of the first embodiment. If the usage end condition has been satisfied, the process proceeds to step 82, and if the usage end condition has not been satisfied, the process ends.

ステップ８２では、第２のＡＩ用プログラム及び第２のデータセットを、クラウドネットワーク３１にアップロードする。このとき、ユーザの識別情報と対応付けてアップロードする。なお、前述した学習が行われた場合、アップロードする第２のＡＩ用プログラム及び第２のデータセットは、学習後のものである。 In step 82, the second AI program and the second data set are uploaded to the cloud network 31. At this time, they are uploaded in association with the user's identification information. Note that if the above-mentioned learning has been performed, the second AI program and the second data set to be uploaded are those after learning.

３．ロボット３０１が奏する効果
ロボット３０１は、前記（１Ｂ）、（１Ｃ）、（１Ｆ）、（１Ｈ）の効果を奏する。さらに、ロボット３０１は、次の効果も奏する。 3. Effects of the Robot 301 The robot 301 has the effects (1B), (1C), (1F), and (1H) described above. In addition, the robot 301 also has the following effects.

（４Ａ）ロボット３０１は、第１のＡＩ用プログラム及び第１のデータセットを、学習により変化、発展させることができる。さらに、第１のＡＩ用プログラム及び第１のデータセットにおける学習は、学習停止の処理、及び学習制限の処理に影響されない。 (4A) The robot 301 can change and develop the first AI program and the first data set through learning. Furthermore, the learning in the first AI program and the first data set is not affected by the learning stop process and the learning limit process.

また、第１のＡＩ用プログラム及び第１のデータセットはクラウドネットワーク３１にアップロードされないので、その内容が他人に知られることを抑制できる。
（４Ｂ）ロボット３０１は、学習後の第２のＡＩ用プログラム及び第２のデータセットを、クラウドネットワーク３１にアップロードすることができる。また、ロボット３０１は、第２のＡＩ用プログラム及び第２のデータセットをクラウドネットワーク３１からインストールすることができる。 Furthermore, since the first AI program and the first data set are not uploaded to the cloud network 31, their contents can be prevented from being known to others.
(4B) The robot 301 can upload the second AI program and the second data set after learning to the cloud network 31. The robot 301 can also install the second AI program and the second data set from the cloud network 31.

学習後の第２のＡＩ用プログラム及び第２のデータセットをインストールするロボット３０１は、過去に学習を行ったロボット３０１と同じであっても、異なっていてもよい。また、学習後の第２のＡＩ用プログラム及び第２のデータセットをインストールするロボット３０１は、過去に学習を行った場所にあるものであってもよいし、異なる場所にあるものであってもよい。 The robot 301 on which the second AI program and the second data set are installed after learning may be the same as the robot 301 that has previously learned, or it may be different. Furthermore, the robot 301 on which the second AI program and the second data set are installed after learning may be located in the same place where learning was previously performed, or it may be located in a different place.

よって、ユーザは、過去の自らの使用よって学習した第２のＡＩ用プログラム及び第２のデータセットを、ユーザがその時点でいる場所（例えば、職場、店舗、自宅等様々な場所）のロボット３０１に、クラウドネットワーク３１からインストールすることができる。そして、第２のＡＩ用プログラム及び第２データセットを用いて学習を行い、第１のＡＩ用プログラム及び第１のデータセットを変化、発展させることができる。 Therefore, the user can install the second AI program and the second data set, which have been learned through the user's own past use, from the cloud network 31 to the robot 301 in the location where the user is at the time (e.g., various locations such as the workplace, store, home, etc.). The second AI program and the second data set can then be used to learn, and the first AI program and the first data set can be changed and developed.

（４Ｃ）ロボット３０１は、クラウドネットワーク３１にアップロードされる第２のＡＩ用プログラム及び第２のデータセットついて、学習を制限する。すなわち、ロボット３０１は、その周囲にいる人の動作や、人の識別結果等に応じて、第２のＡＩ用プログラム及び第２のデータセットについての学習を停止する。また、ロボット３０１は、それが存在する場所に応じて、第２のＡＩ用プログラム及び第２のデータセットにおける学習を停止する。また、ロボット３０１は、学習制限内容に該当する事項を、第２のＡＩ用プログラム及び第２のデータセットに含ませない。 (4C) The robot 301 restricts learning of the second AI program and the second data set uploaded to the cloud network 31. That is, the robot 301 stops learning of the second AI program and the second data set depending on the actions of people around it, the results of identifying people, etc. The robot 301 also stops learning of the second AI program and the second data set depending on the location where it is located. The robot 301 also does not include any items that fall under the learning restriction content in the second AI program and the second data set.

そのため、ユーザが他人に知られたくない事項を含む第２のＡＩ用プログラム及び第２のデータセットが、クラウドネットワーク３１にアップロードされることを抑制できる。＜第５の実施形態＞
１．ロボット４０１の構成
本実施形態のロボット４０１の構成は、基本的には前記第１の実施形態のロボット１と同様である。以下では、第１の実施形態との相違点を中心に説明する。ロボット４０１は、図１６に示すように、通信ユニット２６を用いてインターネット７７と接続することができる。インターネット７７はネットワークの一例である。また、ロボット４０１は車両内に置くことができる。この場合、ロボット４０１は、後述する会話等処理により、車両の乗員と会話することができる。 Therefore, it is possible to prevent the second AI program and the second data set, which include information that the user does not want others to know, from being uploaded to the cloud network 31. <Fifth embodiment>
1. Configuration of the Robot 401 The configuration of the robot 401 of this embodiment is basically the same as that of the robot 1 of the first embodiment. The following mainly describes the differences from the first embodiment. As shown in FIG. 16, the robot 401 can connect to the Internet 77 using the communication unit 26. The Internet 77 is an example of a network. The robot 401 can be placed inside a vehicle. In this case, the robot 401 can converse with a vehicle occupant by a conversation process, which will be described later.

２．ロボット４０１が実行する処理
（２－１）ロボット１と同様の処理
ロボット４０１は、前記第１の実施形態のロボット１と同様に、プログラム設定処理、学習停止判断処理、及びスケジュール管理処理を実行する。 2. Processing Executed by Robot 401 (2-1) Processing Similar to Robot 1 The robot 401 executes a program setting process, a learning stop determination process, and a schedule management process, similar to the robot 1 of the first embodiment.

（２－２）会話等処理
ロボット４０１の制御ユニット３は、図１７に示す会話等処理を所定時間ごとに繰り返し実行する。この処理は、マイク５が所定の閾値以上の音量を検出したときに実行される。 (2-2) Conversation, etc. Processing The control unit 3 of the robot 401 repeatedly executes the conversation, etc. processing shown in Fig. 17 at predetermined time intervals. This processing is executed when the microphone 5 detects a volume equal to or greater than a predetermined threshold.

ステップ９１、９２の処理は、それぞれ、前記第１の実施形態におけるステップ１１、１２の処理と同様である。
ステップ９３では、前記ステップ９２で認識した音声の内容に関連する情報を、通信ユニット２６を用い、インターネット７７において検索し、取得する。例えば、前記ステッ
プ９２で認識した音声の内容からキーワードを抽出し、そのキーワードに予め関連付けられた事項を含む情報を検索する。検索の対象としては、例えば、ＳＮＳ（ソーシャルネットワーキングサービス）、ブログ、電子掲示板等が挙げられる。また、前記ステップ９２で認識した音声の内容が質問である場合、その質問に対する回答を検索する。検索の方法としては、例えば、公知の検索エンジンを用いることができる。 The processes in steps 91 and 92 are similar to the processes in steps 11 and 12 in the first embodiment, respectively.
In step 93, information related to the content of the voice recognized in step 92 is searched and acquired from the Internet 77 using the communication unit 26. For example, a keyword is extracted from the content of the voice recognized in step 92, and information including items previously associated with the keyword is searched for. Examples of search targets include SNS (social networking services), blogs, and electronic bulletin boards. Furthermore, if the content of the voice recognized in step 92 is a question, an answer to the question is searched for. As a search method, for example, a known search engine can be used.

ステップ９４では、前記ステップ９２で認識した音声の内容に対し、基本的には前記第１の実施形態におけるステップ１３と同様に、回答音声データを作成する。ただし、本ステップでは、前記ステップ９３で取得した情報も用いて、回答音声データを作成する。前記ステップ９３で取得した情報を用いるとは、例えば、その情報を音声化したものを、回答音声データに含めることである。 In step 94, response voice data is created for the content of the voice recognized in step 92, essentially in the same manner as in step 13 in the first embodiment. However, in this step, the information acquired in step 93 is also used to create the response voice data. Using the information acquired in step 93 means, for example, including a voiced version of that information in the response voice data.

ステップ９５では、後述するステップ９６で発音するときの音声の種類を、前記ステップ９４で作成した回答の内容に応じて決定する。音声の種類としては、例えば、男性の声、女性の声、大人の声、子供の声等が挙げられる。 In step 95, the type of voice to be pronounced in step 96 (described later) is determined based on the content of the answer created in step 94. Examples of the type of voice include a male voice, a female voice, an adult voice, a child's voice, etc.

音声の種類は、具体的には、以下のようにして決定する。まず、制御ユニット３は、前記ステップ９４で作成した回答の内容から、特徴（例えば、男性に特有の特徴、女性に特有の特徴、大人に特有の特徴、子供に特有の特徴等）を抽出する。 Specifically, the type of voice is determined as follows: First, the control unit 3 extracts features (e.g., features unique to men, features unique to women, features unique to adults, features unique to children, etc.) from the content of the answer created in step 94.

制御ユニット３は、回答の内容における特徴と、音声の種類とを対応付けたマップを予め備えている。制御ユニット３は、上記のように抽出した特徴をそのマップに入力することで、抽出した特徴に対応した音声の種類を決定する。 The control unit 3 is provided with a map that associates features in the content of the answer with the type of voice. The control unit 3 inputs the features extracted as described above into the map, thereby determining the type of voice that corresponds to the extracted features.

例えば、前記ステップ９４で作成した回答の内容から、男性に特有の特徴と大人に特有の特徴とが抽出されれば、制御ユニット３は、大人の男性の音声を決定する。
ステップ９６では、前記ステップ９４で作成した回答音声データに基づき、スピーカ１５を用いて発音する。このとき、前記ステップ９５で設定した種類の音声を用いて発音する。 For example, if features specific to men and features specific to adults are extracted from the content of the answer created in step 94, the control unit 3 determines the voice of an adult male.
In step 96, a voice is generated from the speaker 15 based on the answer voice data created in step 94. At this time, the voice type set in step 95 is used for the voice.

ステップ９７では、まず、前記ステップ９２で認識した音声の内容、又は、前記ステップ９４で作成した回答の内容に対応する感情（例えば、喜び、怒り、悲しみ、平静等）を取得する。この感情の取得は以下のように行う。 In step 97, first, the emotion (e.g., joy, anger, sadness, calm, etc.) corresponding to the content of the voice recognized in step 92 or the content of the answer created in step 94 is obtained. This emotion is obtained as follows.

制御ユニット３は、音声の内容や回答の内容に現れる特徴と、感情とを対応付けたマップを予め備えている。音声の内容に現れる特徴としては、例えば、音量、音声の抑揚、音程の高低等が挙げられる。また、回答の内容に現れる特徴としては、例えば、回答の内容に含まれる、感情を反映した語句、感情を反映した言い回し等が挙げられる。 The control unit 3 is provided with a map that associates emotions with features that appear in the content of the voice or the content of the answer. Examples of features that appear in the content of the voice include volume, intonation, and high and low pitch. Also, examples of features that appear in the content of the answer include words and phrases that reflect emotions that are included in the content of the answer.

制御ユニット３は、前記ステップ９２で認識した音声の内容、又は、前記ステップ９４で作成した回答から抽出した特徴を前記マップに入力することにより、対応する感情を取得する。 The control unit 3 obtains the corresponding emotion by inputting the content of the voice recognized in step 92 or the features extracted from the answer created in step 94 into the map.

次に、制御ユニット３は、取得した感情を表現する人又はキャラクタの顔画像を作成し、ディスプレイ１９に表示する。例えば、取得した感情が喜びである場合、笑顔の人又はキャラクタの顔画像をディスプレイ１９に表示する。また、取得した感情が悲しみである場合、泣き顔の人又はキャラクタの顔画像をディスプレイ１９に表示する。 Next, the control unit 3 creates a facial image of a person or character expressing the acquired emotion and displays it on the display 19. For example, if the acquired emotion is joy, the control unit 3 displays a facial image of a smiling person or character on the display 19. If the acquired emotion is sadness, the control unit 3 displays a facial image of a crying person or character on the display 19.

ステップ９８～１０２の処理は、それぞれ、前記第１の実施形態におけるステップ１５～１９の処理と同様である。
３．ロボット４０１が奏する効果
ロボット４０１は、前記（１Ａ）～（１Ｉ）の効果を奏する。さらに、ロボット４０１は次の効果も奏する。 The processes in steps 98 to 102 are similar to the processes in steps 15 to 19 in the first embodiment, respectively.
3. Effects of the Robot 401 The robot 401 has the effects (1A) to (1I) described above. In addition, the robot 401 also has the following effects.

（５Ａ）ロボット４０１は、音声の内容に関連する情報を、インターネット７７において検索し、取得することができる。そして、ロボット４０１は、そのように取得した情報も用いて、回答音声データを作成する。そのことにより、より適切な回答音声データを作成することができる。 (5A) The robot 401 can search and acquire information related to the content of the voice on the Internet 77. The robot 401 then uses the information acquired in this way to create response voice data. This allows the robot 401 to create more appropriate response voice data.

（５Ｂ）ロボット４０１は、回答の内容に応じて音声の種類を決定することができる。そのため、ユーザは、回答の内容と、それを発音する音声の種類との不調和を感じにくい。 (5B) The robot 401 can determine the type of voice depending on the content of the answer. Therefore, the user is less likely to feel a discord between the content of the answer and the type of voice that pronounces it.

（５Ｃ）ロボット４０１は、認識した音声の内容、又は、回答の内容に対応する感情を取得し、取得した感情を表現する人又はキャラクタの顔画像をディスプレイ１９に表示する。そのことにより、ユーザは、ロボット４０１をあたかも人間のように感じることができる。
＜第６の実施形態＞
１．ロボット５０１の構成
本実施形態のロボット５０１の構成は、基本的には前記第１の実施形態のロボット１と同様である。 (5C) The robot 401 acquires an emotion corresponding to the content of the recognized voice or the content of the reply, and displays a facial image of a person or character expressing the acquired emotion on the display 19. This allows the user to feel as if the robot 401 is a human being.
Sixth Embodiment
1. Configuration of Robot 501 The configuration of the robot 501 of this embodiment is basically the same as that of the robot 1 of the first embodiment.

２．ロボット５０１が実行する処理
（２－１）ロボット１と同様の処理
ロボット５０１は、前記第１の実施形態のロボット１と同様に、プログラム設定処理、会話処理、学習停止判断処理、及びスケジュール管理処理を実行する。 2. Processing Executed by Robot 501 (2-1) Processing Similar to Robot 1 The robot 501 executes program setting processing, conversation processing, learning stop determination processing, and schedule management processing, similar to the robot 1 of the first embodiment.

（２－２）通話処理
図１８に示すように、複数のロボット５０１は、ユーザ７９とユーザ８１との間の通話を可能にする。例えば、ユーザ７９側のロボット５０１（以下ではロボット５０１Ａとする）は、ユーザ７９が発音した音声をマイク５で取得する。また、ロボット５０１Ａは、ユーザ７９による、通話相手のロボット５０１（以下ではロボット５０１Ｂとする）を指定する入力を、タッチパネル９を用いて受け付ける。 18, a plurality of robots 501 enable a call between a user 79 and a user 81. For example, the robot 501 on the user 79's side (hereinafter referred to as robot 501A) acquires the voice uttered by the user 79 through the microphone 5. In addition, the robot 501A receives an input from the user 79 specifying the robot 501 (hereinafter referred to as robot 501B) with which to make a call, through the touch panel 9.

ロボット５０１Ａは、ユーザ７９が発音した音声を変換した信号（以下では音声変換信号とする）と、ロボット５０１Ａの識別信号と、ユーザ７９により指定された通話相手のロボット５０１Ｂの識別信号とを、通信ユニット２６を用いてホストコンピュータ８３に送信する。 The robot 501A transmits to the host computer 83, using the communication unit 26, a signal obtained by converting the voice uttered by the user 79 (hereinafter referred to as a converted voice signal), an identification signal for the robot 501A, and an identification signal for the robot 501B, the other party designated by the user 79.

ホストコンピュータ８３は、受信したロボット５０１Ａの識別信号と、通話相手のロボット５０１Ｂの識別信号とが、通話ペアとして設定されているか否かを判断する。通話ペアとして設定されていれば、音声変換信号を、通話相手のロボット５０１Ｂに転送する。なお、通話ペアの設定については後述する。 The host computer 83 determines whether the identification signal of the received robot 501A and the identification signal of the call partner robot 501B are set as a call pair. If they are set as a call pair, the host computer 83 transfers the voice conversion signal to the call partner robot 501B. The setting of the call pair will be described later.

ロボット５０１Ｂは、通信ユニット２６を用いて、転送された音声変換信号を受信し、その音声変換信号に基づき、スピーカ１５を用いて発音する。以上の処理により、ユーザ７９が発音した音声が、ロボット５０１Ｂ側のユーザ８１に伝えられる。また、ロボット５０１Ｂは、音声を発音することに加えて、その音声を文字に変換し、その文字をディスプレイ１９に表示する。 The robot 501B receives the transferred voice conversion signal using the communication unit 26, and produces a sound using the speaker 15 based on the voice conversion signal. Through the above process, the voice produced by the user 79 is transmitted to the user 81 on the robot 501B side. In addition to producing a sound, the robot 501B also converts the voice into characters and displays the characters on the display 19.

また、ロボット５０１Ｂが上述したロボット５０１Ａの処理を行い、ロボット５０１Ａが上述したロボット５０１Ｂの処理を行うことで、ユーザ８１が発音した音声を、ユーザ７９に伝えることもできる。 In addition, by having robot 501B perform the processing of robot 501A described above, and robot 501A perform the processing of robot 501B described above, it is possible to transmit the voice spoken by user 81 to user 79.

一方、ホストコンピュータ８３は、受信したロボット５０１Ａの識別信号と、通話相手のロボット５０１Ｂの識別信号とが、通話ペアとして設定されてなければ、上記の音声変換信号の転送を行わない。 On the other hand, if the identification signal of the received robot 501A and the identification signal of the other party's robot 501B are not set as a call pair, the host computer 83 will not transfer the above voice conversion signal.

（２－３）通話ペア設定処理
次に、ホストコンピュータ８３が上記の通話ペアを設定する処理を、図１９に基づき説明する。 (2-3) Call Pair Setting Process Next, the process in which the host computer 83 sets up the above call pair will be described with reference to FIG.

ステップ１１１では、通話ペアの設定要求を新たに受信したか否かを判断する。通話ペアの設定要求とは、ロボット５０１Ａがホストコンピュータ８３に送信する要求である。通話ペアの設定要求には、要求の送信元であるロボット５０１Ａの識別信号と、そのロボット５０１Ａ側のユーザ７９の特徴（例えば、趣味、衣食住の好み、関心を持つ事項等）と、ユーザ７９の顔写真の画像データとが含まれる。 In step 111, it is determined whether a new call pair setting request has been received. A call pair setting request is a request that robot 501A sends to host computer 83. The call pair setting request includes an identification signal of robot 501A that sent the request, characteristics of user 79 on the robot 501A side (e.g., hobbies, preferences in food, clothing, and shelter, interests, etc.), and image data of a facial photograph of user 79.

通話ペアの設定要求は、ユーザの指示に応じてロボット５０１Ａが送信してもよいし、ロボット５０１Ａが自動的に送信してもよい。通話ペアの設定要求をホストコンピュータ８３が新たに受信した場合はステップ１１２に進み、受信しなかった場合は本処理を終了する。 The call pair setting request may be sent by the robot 501A in response to a user instruction, or may be sent automatically by the robot 501A. If the host computer 83 receives a new call pair setting request, the process proceeds to step 112; if not, the process ends.

ステップ１１２では、前記ステップ１１１で受信したと判断した通話ペアの設定要求（以下では、新たな設定要求とする）に含まれるユーザ７９の特徴を、設定待ちリストにおいて検索する。ここで、設定待ちリストとは、過去にいずれかのロボット５０１から受信した、通話ペアの設定要求（以下では、過去の設定要求とする）のリストである。 In step 112, the characteristics of user 79 included in the call pair setting request (hereinafter referred to as a new setting request) determined to have been received in step 111 is searched for in the setting waiting list. Here, the setting waiting list is a list of call pair setting requests (hereinafter referred to as past setting requests) previously received from any robot 501.

ステップ１１３では、前記ステップ１１２での検索の結果、新たな設定要求に含まれるユーザ７９の特徴と一致する特徴を有する、過去の設定要求が発見されたか否かを判断する。そのような過去の設定要求が発見された場合はステップ１１４に進み、発見されなかった場合はステップ１１６に進む。 In step 113, it is determined whether or not a past setting request is found that has characteristics matching those of user 79 included in the new setting request as a result of the search in step 112. If such a past setting request is found, the process proceeds to step 114; if not, the process proceeds to step 116.

ステップ１１４では、前記ステップ１１３で発見された、過去の設定要求の送信元であるロボット５０１Ｂと、新たな設定要求の送信元であるロボット５０１Ａとを、通話ペアとして設定する。 In step 114, robot 501B, which was found in step 113 and was the sender of the previous setting request, and robot 501A, which is the sender of the new setting request, are set as a call pair.

ステップ１１５では、通話ペアとして設定されたロボット５０１Ａ、５０１Ｂのそれぞれに、相手のロボット５０１に関する情報を通知する。すなわち、ロボット５０１Ａにはロボット５０１Ｂに関する情報を通知し、ロボット５０１Ｂにはロボット５０１Ａに関する情報を通知する。通知する情報には、相手のロボット５０１の識別信号、対応するユーザの顔写真の画像データ等が含まれる。 In step 115, each of robots 501A and 501B set as a call pair is notified of information about the other robot 501. That is, robot 501A is notified of information about robot 501B, and robot 501B is notified of information about robot 501A. The notified information includes an identification signal of the other robot 501, image data of a face photo of the corresponding user, etc.

一方、前記ステップ１１３で否定判断された場合はステップ１１６にて、新たな設定要求を設定待ちリストに追加する。これ以降、新たな設定要求は、過去の設定要求のリストにおける一部となる。 On the other hand, if the determination in step 113 is negative, the new configuration request is added to the configuration waiting list in step 116. From this point on, the new configuration request becomes part of the list of past configuration requests.

なお、相手側のロボット５０１に関する情報を通知されたロボット５０１は、その情報を用いて、通話ペアの相手を表すアイコン８５を作成し、図２０に示すように、ディスプレイ１９に表示する。アイコン８５は、通話ペアの相手側のユーザの顔写真を含む。ロボ
ット５０１は、複数のアイコン８５をディスプレイ１９に表示することができる。 The robot 501, which has been notified of the information about the other robot 501, uses the information to create an icon 85 representing the other user in the call pair, and displays it on the display 19 as shown in Fig. 20. The icon 85 includes a facial photograph of the other user in the call pair. The robot 501 can display multiple icons 85 on the display 19.

ロボット５０１は、特定のアイコン８５がユーザによってタッチされたとき、そのアイコン８５に対応するユーザを通話相手として認識する。そして、ロボット５０１は、音声変換信号を上記のようにホストコンピュータ８３に送信するとき、タッチしたアイコン８５に対応するロボット５０１の識別信号を、ホストコンピュータ８３に送信する。 When a specific icon 85 is touched by a user, the robot 501 recognizes the user corresponding to that icon 85 as the communication partner. Then, when the robot 501 transmits the voice conversion signal to the host computer 83 as described above, it transmits an identification signal of the robot 501 corresponding to the touched icon 85 to the host computer 83.

（２－４）その他の処理
ホストコンピュータ８３は、前記（２－２）の通話処理のとき、ロボット５０１Ａが送信した音声変換信号の内容を分析する。その分析結果が予め設定された禁止事項（例えば犯罪に関する事項等）に該当する場合、通話処理を開始しないようにしたり、通話処理を途中で終了したりする。そのため、ホストコンピュータ８３は、通話処理が犯罪等に利用されることを抑制できる。 (2-4) Other Processing The host computer 83 analyzes the contents of the converted voice signal sent by the robot 501A during the call processing described in (2-2) above. If the analysis result corresponds to a prohibited matter (e.g., matter related to a crime) that has been set in advance, the host computer 83 will not start the call processing or will end the call processing midway. Therefore, the host computer 83 can prevent the call processing from being used for crimes, etc.

また、前記（２－２）の通話処理のとき、ロボット５０１Ｂは、他のコンピュータ５０１Ａから送信された音声も用いて学習を行う。そのため、学習を一層効率的に行うことができる。 In addition, during the call processing in (2-2) above, the robot 501B also uses the voice transmitted from the other computer 501A to learn. This allows learning to be carried out even more efficiently.

また、前記（２－２）の通話処理において、ユーザ７９の発音が所定時間以上途絶えたとき、又は、ユーザ７９が指示したとき、ロボット５０１Ａは、自らが作成した音声変換信号をホストコンピュータ８３に送信する。この場合、ロボット５０１Ａと、ユーザ８１とが通話することになる。コンピュータ５０１Ａは、過去の通話処理により得られた学習結果を用いて音声変換信号を作成することができる。また、ロボット５０１Ａは、過去の通話においてロボット５０１Ａ又はロボット５０１Ｂが発音した音声（例えば、相槌等）を記憶しておき、その音声に対応する音声変換信号を作成してもよい。 In addition, in the call processing of (2-2) above, when the user 79 stops speaking for a predetermined period of time or when the user 79 gives an instruction, the robot 501A transmits the voice conversion signal that it created to the host computer 83. In this case, the robot 501A and the user 81 will be in conversation. The computer 501A can create the voice conversion signal using the learning results obtained from past call processing. The robot 501A may also store sounds (e.g., backchannels, etc.) that the robot 501A or the robot 501B made in past calls and create a voice conversion signal corresponding to the sounds.

３．ロボット５０１が奏する効果
ロボット５０１は、前記（１Ａ）～（１Ｉ）の効果を奏する。さらに、ロボット５０１は次の効果も奏する。 3. Effects of the Robot 501 The robot 501 has the effects (1A) to (1I) described above. In addition, the robot 501 also has the following effects.

（６Ａ）ロボット５０１Ａ側のユーザ７９と、ロボット５０１Ｂ側のユーザ８１とは、通話を行うことができる。
（６Ｂ）ユーザ７９とユーザ８１との通話は、ロボット５０１Ａとロボット５０１Ｂとが通話ペアとして設定されていることが前提になる。通話ペアは、ユーザ７９とユーザ８１との特徴が一致する場合に設定される。よって、ロボット５０１は、特徴が一致するユーザ同士の通話を選択的に可能にする。 (6A) A user 79 of the robot 501A and a user 81 of the robot 501B can communicate with each other.
(6B) A call between user 79 and user 81 is premised on the premise that robot 501A and robot 501B are set as a call pair. A call pair is set when the characteristics of user 79 and user 81 match. Thus, robot 501 selectively enables a call between users with matching characteristics.

（６Ｃ）ロボット５０１は通話相手のユーザの顔写真を含むアイコン８５を作成し、ディスプレイ１９に表示する。ユーザは、アイコン８５をタッチすることで、容易に通話相手を選択することができる。また、アイコン８５は通話相手の顔写真を含んでいるので、ユーザは、どのアイコン８５がどの通話相手に対応しているのかを容易に理解することができる。
＜その他の実施形態＞
（１）前記第１～第６の実施形態において、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、状況に応じて学習制限内容を増減してもよい。例えば、当初は学習制限内容に属していた事項が、複数の人物によって話されたと認識した場合、その事項を学習制限内容から除外してもよい。 (6C) The robot 501 creates an icon 85 including a facial photograph of the user who is the other party to the call, and displays it on the display 19. The user can easily select the other party to the call by touching the icon 85. In addition, since the icon 85 includes a facial photograph of the other party to the call, the user can easily understand which icon 85 corresponds to which other party to the call.
<Other embodiments>
(1) In the first to sixth embodiments, the robots 1, 301, 401, 501, the computer 101, and the vehicle-mounted device 201 may increase or decrease the learning restriction content depending on the situation. For example, when it is recognized that an item that was originally included in the learning restriction content was spoken by multiple people, the item may be excluded from the learning restriction content.

（２）前記第１～第６の実施形態において、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、マイク５により音声を取得したとき、その音声の声
色を判断し、その判断結果に応じて、音声の内容を学習するか否かを決めてもよい。 (2) In the first to sixth embodiments, when the robot 1, 301, 401, 501, the computer 101, and the in-vehicle unit 201 acquire voice through the microphone 5, they may judge the tone of the voice and decide whether or not to learn the content of the voice depending on the result of the judgment.

（３）前記第１～第６の実施形態において、音声の認識結果に応じて行う処理は、回答音声データの作成（前記ステップ１３）、車両の制御内容決定（前記ステップ３３）以外のものであってもよい。例えば、音声の認識結果に応じて、ロボット１、３０１、４０１、５０１を移動させたり、コンピュータ１０１に接続した外部装置を操作したりしてもよい。また、音声の認識結果に応じて、例えば、所定の動作（例えば、箱の開閉、窓の開閉、鍵の施錠又は開錠、家電製品の操作等）を行ってもよい。 (3) In the first to sixth embodiments, the process performed in response to the voice recognition result may be something other than creating answer voice data (step 13) or determining the vehicle control content (step 33). For example, the robot 1, 301, 401, 501 may be moved or an external device connected to the computer 101 may be operated in response to the voice recognition result. In addition, a predetermined action (e.g. opening and closing a box, opening and closing a window, locking or unlocking a door, operating a home appliance, etc.) may be performed in response to the voice recognition result.

（４）前記第１～第６の実施形態において、ＡＩ用プログラム及びデータセットを記憶するものは、クラウドネットワーク３１以外のものであってもよい。例えば、周知のサーバ、記憶媒体等に記憶してもよい。 (4) In the first to sixth embodiments, the AI program and data set may be stored in something other than the cloud network 31. For example, they may be stored in a well-known server, storage medium, etc.

（５）前記第３の実施形態において、車載機２０１は、車両以外の移動体（例えば、鉄道車両、航空機、船舶等）に搭載され、それらを制御するものであってもよい。
（６）前記第１、第４～６の実施形態において、ロボット１、３０１、４０１、５０１の形態は人型でなくてもよい。例えば、動物、魚、想像上キャラクタ等の形態であってもよい。 (5) In the third embodiment, the on-board device 201 may be mounted on a moving object other than a vehicle (for example, a railroad car, an aircraft, a ship, etc.) and may control the moving object.
(6) In the first, fourth to sixth embodiments, the form of the robot 1, 301, 401, 501 does not have to be a human form. For example, the form may be an animal, a fish, an imaginary character, or the like.

（７）前記第１～第３、第５、第６の実施形態において、ロボット１、４０１、５０１、コンピュータ１０１、車載機２０１は、標準プログラムと、ＡＩ用プログラムとを同時に使用してもよい。この場合、標準プログラムにより基本的な処理を実行するとともに、ＡＩ用プログラムにより、学習の結果得られた付加的な処理を実行することができる。 (7) In the first to third, fifth and sixth embodiments, the robot 1, 401, 501, computer 101 and in-vehicle device 201 may use the standard program and the AI program simultaneously. In this case, basic processing is performed by the standard program, and additional processing obtained as a result of learning can be performed by the AI program.

（８）前記第１～第６の実施形態において、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１の形態は、家電製品（例えば、テレビ、冷蔵庫、エアコン、掃除機、洗濯機等）、携帯端末（例えば、携帯電話（スマートフォンを含む）、メガネ型端末、腕時計型端末）等であってもよい。 (8) In the first to sixth embodiments, the robot 1, 301, 401, 501, computer 101, and in-vehicle device 201 may take the form of a home appliance (e.g., a television, a refrigerator, an air conditioner, a vacuum cleaner, a washing machine, etc.), a mobile terminal (e.g., a mobile phone (including a smartphone), a glasses-type terminal, a wristwatch-type terminal), etc.

（９）前記第１～第６の実施形態において、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、カメラ７を用いて自動的に画像を取得し、取得した画像をネットワーク上に送信する機能を有していてもよい。送信先としては、サーバ、端末、車両、他のロボット等が挙げられる。送信する画像は、カメラ７を用いて取得した画像そのものであってもよいし、カメラ７を用いて取得した画像から抽出した一部（例えば、人の顔、人の全身、車両、車両のナンバープレート等）の画像であってもよい。 (9) In the first to sixth embodiments, the robot 1, 301, 401, 501, computer 101, and in-vehicle device 201 may have a function of automatically acquiring images using the camera 7 and transmitting the acquired images over a network. Examples of destinations include a server, a terminal, a vehicle, another robot, and the like. The image to be transmitted may be the image itself acquired using the camera 7, or may be an image of a portion extracted from the image acquired using the camera 7 (e.g., a person's face, a person's entire body, a vehicle, a vehicle license plate, etc.).

上記の機能を有するロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、防犯の用途、他のロボットの監視の用途等に使用することができる。上記の機能を有するロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、例えば、室内、路上、特定の施設（例えば、住居、マンション、オフィス、駐車場等）の入口、他のロボットの近傍等に設置することができる。 The robots 1, 301, 401, 501, computers 101, and vehicle-mounted devices 201 having the above functions can be used for crime prevention purposes, monitoring other robots, and the like. The robots 1, 301, 401, 501, computers 101, and vehicle-mounted devices 201 having the above functions can be installed, for example, indoors, on the street, at the entrance to a specific facility (e.g., a house, an apartment building, an office, a parking lot, etc.), near other robots, and the like.

また、上記の機能を有するロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、空中を飛行する機能を持つか、飛行物に搭載することができる。その場合、上空の視点から撮影した地上の画像をネットワーク上で送信することができる。この場合、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、撮影した画像において道路の白線を認識し、その白線に沿って移動することができる。また、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、上空の視点から撮影した地上の画像において信号機を認識し、その表示内容（赤信号、青信号等）を、地上の車両に送信することができる。 Furthermore, the robot 1, 301, 401, 501, computer 101, and vehicle-mounted device 201 having the above functions can have the function of flying in the air or can be mounted on an aircraft. In this case, images of the ground taken from a viewpoint in the sky can be transmitted over a network. In this case, the robot 1, 301, 401, 501, computer 101, and vehicle-mounted device 201 can recognize white lines on the road in the captured images and move along the white lines. Furthermore, the robot 1, 301, 401, 501, computer 101, and vehicle-mounted device 201 can recognize traffic lights in images of the ground taken from a viewpoint in the sky and transmit the display contents (red light, green light, etc.) to a vehicle on the ground.

（１０）前記第１～第６の実施形態において、ロボット１、３０１、４０１、５０１、コンピュータ１０１、車載機２０１は、カメラ７を用いて人の行動を認識し、その認識結果に予め関連付けられた音声を出力してもよい。 (10) In the first to sixth embodiments, the robot 1, 301, 401, 501, computer 101, and vehicle-mounted device 201 may recognize human behavior using the camera 7 and output a sound that is pre-associated with the recognition result.

例えば、ガスの火をつけたまま人が台所を離れるという行動を認識した場合、その人に対する警告の音声を出力することができる。また、人が所定の物を探す行動を認識した場合、その物の場所を探し、物のありかを音声で知らせることができる。また、人が住宅の入口から室内に入るという行動を認識したとき、「おかえりなさい」という音声を出力することができる。人の行動と、それに関連付けられた音声とは、学習により増加させることができる。 For example, if the system recognizes that a person leaves the kitchen with the gas still on, it can output a voice warning to that person. Also, if the system recognizes that a person is searching for a specific object, it can search for the location of that object and notify the user of its location by voice. Also, if the system recognizes that a person enters a house through the entrance, it can output a voice saying "Welcome home." Human actions and the voices associated with them can be increased through learning.

上記のように音声を発する場合、その音声の種類は、そのときの状況と、音声の内容とに関連付けられたものとすることができる。例えば、画像において父親を認識した場合、「おかえりなさい」という音声の種類は、その子供の声とすることができる。 When sound is emitted as described above, the type of sound can be associated with the situation at the time and the content of the sound. For example, if a father is recognized in an image, the type of sound "Welcome home" can be determined to be the voice of his child.

（１１）前記第１～第６の実施形態において、制御ユニットは、マイクロコンピュータを備えているが、個別の電子回路の組合せであってもよいし、ＡＩＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｅｄＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）であってもよいし、ＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）などのプログラマブル・ロジック・デバイスあるいはこれらの組合せであってもよい。
（１２）前記第１～第６の実施形態の構成の一部又は全部を適宜組み合わせてもよい。例えば、前記第４～第６の実施形態の構成を、前記第２、第３の実施形態に適用してもよい。 (11) In the first to sixth embodiments, the control unit includes a microcomputer. However, the control unit may be a combination of individual electronic circuits, an AISIC (Application Specified Integrated Circuit), a programmable logic device such as an FPGA (Field Programmable Gate Array), or a combination of these.
(12) A part or all of the configurations of the first to sixth embodiments may be combined as appropriate. For example, the configurations of the fourth to sixth embodiments may be applied to the second and third embodiments.

本開示は、情報処理システム、情報処理方法、及びコンピュータプログラムに関する。 The present disclosure relates to an information processing system, an information processing method, and a computer program .

ロボットの会話能力を、人工知能を用いた学習により高めることが考えられる。本開示は、人工知能を用いた会話に関する新規技術を提供することを一側面とする。It is conceivable that the conversation ability of a robot can be improved by learning using artificial intelligence. One aspect of the present disclosure is to provide a new technology related to conversation using artificial intelligence.

本開示の一側面によれば、情報処理システムが提供される。情報処理システムは、音声取得ユニットと、出力ユニットと、学習制御ユニットと、を備える。According to one aspect of the present disclosure, there is provided an information processing system, the information processing system including a voice acquisition unit, an output unit, and a learning control unit.
音声取得ユニットは、ユーザの音声を取得するように構成される。The voice capturing unit is configured to capture the voice of the user.
出力ユニットは、音声取得ユニットが取得したユーザの音声に対する応答音声を、ユーザとの間の会話を学習する学習機能を有する人工知能を用いて作成し、応答音声を、ユーザに向けて出力するように構成される。The output unit is configured to create a response voice to the user's voice acquired by the voice acquisition unit using artificial intelligence having a learning function of learning the conversation with the user, and to output the response voice to the user.
学習制御ユニットは、ユーザの音声に基づき、学習機能を制限するように構成される。The learning control unit is configured to limit the learning function based on the user's voice.

本開示の別側面によれば、音声取得ユニットと、読出ユニットと、出力ユニットと、学習ユニットと、学習制御ユニットと、を備える情報処理システムが提供されてもよい。According to another aspect of the present disclosure, there may be provided an information processing system including a voice acquisition unit, a read-out unit, an output unit, a learning unit, and a learning control unit.
音声取得ユニットは、ユーザの音声を取得するように構成される。The voice capturing unit is configured to capture the voice of the user.
読出ユニットは、ユーザとの過去の会話により学習された人工知能に関するデータセットを、記憶装置から読み出すように構成される。The reading unit is configured to read from the storage device a dataset relating to the artificial intelligence that has been trained through past conversations with the user.
出力ユニットは、音声取得ユニットが取得した音声に応答する言葉としての応答語を、読み出されたデータセットを用いて、人工知能により作成し、ユーザに向けて出力するように構成される。The output unit is configured to use the retrieved data set to generate a response word as a word responsive to the voice acquired by the voice acquisition unit through artificial intelligence, and to output the response word to a user.
学習ユニットは、ユーザの音声に基づく学習動作を実行し、記憶装置が記憶するデータセットを更新することによって、データセットに学習結果を記録するように構成される。The learning unit is configured to perform learning operations based on the user's voice and record the learning results in the dataset by updating the dataset stored by the storage device.
学習制御ユニットは、ユーザの音声に基づき、学習ユニットによる学習動作を制限するように構成される。The learning control unit is configured to limit a learning operation by the learning unit based on the user's voice.

本開示の別側面によれば、音声取得ユニットと、検索ユニットと、出力ユニットとを備える情報処理システムが提供されてもよい。According to another aspect of the present disclosure, there may be provided an information processing system including a voice acquisition unit, a search unit, and an output unit.
音声取得ユニットは、ユーザの音声を取得するように構成される。The voice capturing unit is configured to capture the voice of the user.
検索ユニットは、音声取得ユニットが取得したユーザの音声に含まれる質問に関連する関連情報を、質問の内容に基づいてインターネット上で検索するように構成される。The search unit is configured to search for relevant information related to the question contained in the user's voice acquired by the voice acquisition unit on the Internet based on the content of the question.
出力ユニットは、検索ユニットがインターネットから取得した関連情報に基づき、質問に対する回答を作成し、回答を少なくとも音声の形態でユーザに向けて出力するように構成される。The output unit is adapted to generate an answer to the question based on the relevant information retrieved from the Internet by the search unit, and to output the answer to the user at least in the form of voice.

Claims

A voice recognition unit (23, 25, 27) for performing voice recognition;
a learning unit (25) for learning about the recognition result of the speech recognition unit;
a processing unit (23, 25, 27) for performing corresponding processing on the recognition result of the speech recognition unit using the learning result of the learning unit;
a storage unit (26) for storing the learning result in an external storage device;
a learning result acquisition unit (26) for acquiring the learning result from the storage device;
A robot (1, 301) comprising:

An identification information acquisition unit (21, 26) for acquiring user identification information,
The robot according to claim 1 , wherein the learning result acquisition unit acquires the learning result associated with the identification information from the external storage device.

The robot according to claim 1 or 2, further comprising a memory limiting unit (23, 25, 27, 75) that limits the learning results stored in the external memory device according to the position of the robot or a user instruction.