JP4453076B2

JP4453076B2 - Humanoid robot

Info

Publication number: JP4453076B2
Application number: JP2004105530A
Authority: JP
Inventors: 敬宏宮下; 浩石黒
Original assignee: ATR Advanced Telecommunications Research Institute International
Current assignee: ATR Advanced Telecommunications Research Institute International
Priority date: 2004-03-31
Filing date: 2004-03-31
Publication date: 2010-04-21
Anticipated expiration: 2024-03-31
Also published as: JP2005288600A

Description

この発明はヒューマノイドロボットおよびマップ作成方法に関し、特にたとえば、環境を認識し、その環境に適した行動をするヒューマノイドロボット、およびそのようなヒューマノイドロボットのための、環境と行動とセンサ出力とを対応付けたマップを作成するマップ作成方法に関する。 The present invention relates to a humanoid robot and a map creation method, and particularly relates to, for example, a humanoid robot that recognizes an environment and performs an action suitable for the environment, and associates an environment, an action, and a sensor output for such a humanoid robot. The present invention relates to a map creation method for creating a map.

たとえば非特許文献１に紹介されるＡＳＩＭＯ（商品名）のような２足歩行型のヒューマノイドロボットは、環境を認識しきれないと転倒して潰れてしまうおそれがある。そのため、たとえば適当に行動を起こして、その後に環境を認識するようなことはできないので、行動の前にロボットの置かれている環境を認識してやる必要があった。従来のヒューマノイドロボットでは、環境のほかにも人間や障害物などを認識するために多くのセンサが設けられるので、あらゆるセンサの出力を活用して、環境を認識してから行動している。
“ASIMO SPECIAL SITE”、本田技研工業株式会社、インタネット＜ＵＲＬ：http://www.honda.co.jp/ASIMO/＞ For example, a biped walking humanoid robot such as ASIMO (trade name) introduced in Non-Patent Document 1 may fall over and collapse if it cannot recognize the environment. For this reason, for example, it is not possible to appropriately take action and then recognize the environment, so it is necessary to recognize the environment where the robot is placed before the action. In conventional humanoid robots, in addition to the environment, many sensors are provided for recognizing humans and obstacles. Therefore, the outputs of all sensors are used to recognize the environment before acting.
“ASIMO SPECIAL SITE”, Honda Motor Co., Ltd., Internet <URL: http://www.honda.co.jp/ASIMO/>

従来技術では、環境の認識のために多くのセンサを必要とするので、コスト、配線および消費電力などの面で問題があった。 In the prior art, many sensors are required for environment recognition, and thus there are problems in terms of cost, wiring, and power consumption.

それゆえに、この発明の主たる目的は、センサの数や種類が少なくても、環境を認識することができる、ヒューマノイドロボット、およびそのようなヒューマノイドロボットのための環境と行動とセンサ出力とを対応付けたマップを作成するマップ作成方法を提供することである。 Therefore, a main object of the present invention is to associate a humanoid robot capable of recognizing the environment even if the number and types of sensors are small, and to associate the environment, behavior, and sensor output for such a humanoid robot. It is to provide a map creation method for creating a map.

請求項１の発明は、環境を認識するヒューマノイドロボットであって、センサ、ヒューマノイドロボットの置かれる環境と、ヒューマノイドロボットの行動と、センサの出力の時系列データとを対応付けたマップを記憶するマップ記憶手段、および環境認識のための行動を実行する第１実行手段と、第１実行手段によって行動を実行しているときのセンサの出力の時系列データを計測する第１計測手段とを含み、第１計測手段によって計測されたセンサの出力の時系列データと、マップに基づいて作成された環境を判別対象とした決定木とに基づいて、環境を認識する認識手段を備える、ヒューマノイドロボットである。 The invention of claim 1 is a humanoid robot that recognizes the environment, and stores a map in which a sensor, the environment in which the humanoid robot is placed, the behavior of the humanoid robot, and time-series data of the output of the sensor are associated with each other Storage means, first execution means for executing an action for environment recognition, and first measurement means for measuring time series data of an output of a sensor when the action is executed by the first execution means, A humanoid robot comprising recognition means for recognizing an environment based on time-series data output from a sensor measured by a first measurement means and a decision tree based on the environment created based on a map. .

請求項１の発明では、ヒューマノイドロボット（以下、単にロボットともいう。）は環境を認識するものであり、センサを含む。マップ記憶手段は、ロボットの置かれる環境と、ロボットの行動と、センサの出力の時系列データとを対応付けたマップを記憶していて、ロボットは、このマップを用いて、実際にロボットの存在している環境がいずれであるかを認識する。環境は認識手段によって認識される。つまり、認識手段の第１実行手段は環境認識のための行動を実行し、第１計測手段は、第１実行手段によって行動を実行しているときのセンサの出力の時系列データを計測する。そして、この認識手段は、第１計測手段によって計測されたセンサの出力の時系列データと、マップに基づいて作成された環境を判別対象とした決定木に基づいて、環境を認識する。決定木は、ロボットの実行した行動とそのときのセンサ出力とによってロボットの存在する環境を判定していく決定手順の適用順序を定めたものである。したがって、認識手段は、決定木において環境が決定されるまで、決定木に従って行動の実行とそのときのセンサ出力による判定とを繰り返していくことで、環境を認識することができる。したがって、請求項１の発明によれば、センサ出力の時系列データとロボットの行動と環境とを対応付けたマップを備えているので、センサが少なくても、マップに基づいて作成された環境を判別対象とした決定木に基づいて、環境を認識することができる。 In the invention of claim 1, the humanoid robot (hereinafter also simply referred to as a robot) recognizes the environment and includes a sensor. The map storage means stores a map in which the environment in which the robot is placed, the behavior of the robot, and the time-series data of the sensor output are associated with each other. Recognize which environment is working. The environment is recognized by the recognition means. That is, the first execution unit of the recognition unit executes an action for environment recognition, and the first measurement unit measures time-series data of the sensor output when the first execution unit executes the action. Then, the recognizing unit recognizes the environment based on the time series data of the sensor output measured by the first measuring unit and the decision tree having the environment created based on the map as a discrimination target. The decision tree defines the order of application of the decision procedure for determining the environment in which the robot exists based on the action performed by the robot and the sensor output at that time. Therefore, the recognition means can recognize the environment by repeating the execution of the action and the determination based on the sensor output at that time according to the decision tree until the environment is determined in the decision tree. Therefore, according to the first aspect of the present invention, since the map in which the time series data of the sensor output is associated with the behavior of the robot and the environment is provided, the environment created based on the map can be obtained even if the number of sensors is small. The environment can be recognized based on the decision tree as the discrimination target.

請求項２の発明は、請求項１の発明に従属し、認識手段は、マップに基づいて環境を判別対象とした決定木を作成する決定木作成手段をさらに含む。 The invention of claim 2 is dependent on the invention of claim 1, and the recognizing means further includes a decision tree creating means for creating a decision tree having the environment as a discrimination target based on the map.

請求項２の発明では、決定木作成手段は、マップに基づいて環境を判別対象とした決定木を作成する。これにより、環境を認識しようとするたびごとにマップに基づいて決定木を作成することができる。したがって、請求項２の発明によれば、環境認識を行う際に、最初の行動を任意に選択することができる。 In the invention according to claim 2, the decision tree creating means creates a decision tree having the environment as a discrimination target based on the map. This makes it possible to create a decision tree based on the map each time an attempt is made to recognize the environment. Therefore, according to the invention of claim 2, when performing environment recognition, the first action can be arbitrarily selected.

請求項３の発明は、請求項１または２の発明に従属し、センサは加速度センサを含み、行動は、歩く、寝ころぶ、起き上がる動作の少なくとも１つを含み、認識の対象となる環境は、少なくとも床の硬さまたは床の滑りによって分類されるものである。 The invention of claim 3 is dependent on the invention of claim 1 or 2, wherein the sensor includes an acceleration sensor, the action includes at least one of walking, sleeping, and getting up, and the environment to be recognized is at least Classified by floor hardness or floor slip.

請求項３の発明では、ロボットに設けられるセンサは加速度センサを含み、ロボットの実行する行動は、歩く、寝ころぶ、起き上がる動作の少なくとも１つを含み、そして、認識の対象となる環境は、少なくとも床の硬さまたは床の滑りによって分類されるものである。したがって、請求項３の発明によれば、歩く、寝ころぶ、起き上がる動作のような床との接触を伴う行動を実行しているときの、加速度センサの出力の時系列データを計測することによって、床の硬さまたは床の滑り（摩擦）などの違いのある環境を認識することができる。 In the invention of claim 3, the sensor provided in the robot includes an acceleration sensor, the action performed by the robot includes at least one of walking, sleeping, and getting up, and the environment to be recognized is at least a floor Are classified according to their hardness or floor slip. Therefore, according to the invention of claim 3, by measuring the time series data of the output of the acceleration sensor when performing an action involving contact with the floor such as walking, lying down, and getting up, the floor is measured. It is possible to recognize an environment having a difference such as hardness or floor slip (friction).

請求項４の発明は、請求項１ないし３のいずれかに従属し、認識手段によって認識された環境に適した行動を実行する第２実行手段をさらに備える。 The invention according to claim 4 is dependent on any one of claims 1 to 3, and further comprises second execution means for executing an action suitable for the environment recognized by the recognition means.

請求項４の発明では、第２実行手段は、認識手段によって認識された環境に適した行動を実行する。したがって、請求項４の発明によれば、環境を認識した後には、当該環境に適した行動を実行することができるので、環境に合わせて、望み通りの動作を行うことができる。 In the invention of claim 4, the second execution means executes an action suitable for the environment recognized by the recognition means. Therefore, according to the fourth aspect of the invention, after recognizing the environment, an action suitable for the environment can be executed, so that a desired operation can be performed according to the environment.

請求項５の発明は、請求項４の発明に従属し、第２実行手段によって行動を実行しているときのセンサの出力の時系列データを計測する第２計測手段、および第２計測手段によって計測されたセンサの出力の時系列データが、認識手段によって認識された環境と第２実行手段によって実行された行動とに対応するものであるか否かをマップに基づいて判断する環境変化判別手段をさらに備え、環境変化判別手段によってセンサの出力の時系列データが環境と行動とに対応するものではないと判断されたとき、認識手段によって環境を認識する。 The invention according to claim 5 is dependent on the invention according to claim 4, and includes a second measuring unit that measures time-series data of the output of the sensor when the action is being executed by the second executing unit, and a second measuring unit. Environment change determination means for determining whether the time-series data of the measured sensor output corresponds to the environment recognized by the recognition means and the action executed by the second execution means based on the map When the environment change determination unit determines that the time series data of the sensor output does not correspond to the environment and the action, the recognition unit recognizes the environment.

請求項５の発明では、第２計測手段は、第２実行手段によって行動を実行しているときのセンサの出力の時系列データを計測する。環境変化判別手段は、第２計測手段によって計測されたセンサの出力の時系列データが、認識手段によって認識された環境と第２実行手段によって実行された行動とに対応するものであるか否かをマップに基づいて判断する。認識している環境に適した行動を実行したときのセンサ出力は、マップで対応付けられているセンサ出力となるはずであるので、この環境変化判別手段によって、センサ出力に基づいて環境の変化を判別している。そして、環境変化判別手段によってセンサの出力の時系列データが環境と行動とに対応するものではないと判断されたときには、環境が変化したことがわかるので、認識手段によって環境を認識する。したがって、請求項５の発明では、一旦環境を認識した後でも環境の変化を把握することができるので、変化後の環境を認識することができる。 In the invention of claim 5, the second measuring means measures the time series data of the output of the sensor when the action is being executed by the second executing means. The environmental change determination means determines whether the time-series data of the sensor output measured by the second measurement means corresponds to the environment recognized by the recognition means and the action executed by the second execution means. Is determined based on the map. Since the sensor output when the action suitable for the recognized environment is executed should be the sensor output associated with the map, the environment change determination unit can change the environment based on the sensor output. Judging. When the environment change determination means determines that the time series data of the sensor output does not correspond to the environment and the action, the environment is recognized by the recognition means because the environment has changed. Therefore, in the invention of claim 5, since the change of the environment can be grasped even after the environment is recognized once, the environment after the change can be recognized.

請求項６の発明は、センサを備えて環境を認識するヒューマノイドロボットのための、ヒューマノイドロボットの置かれる環境と、ヒューマノイドロボットの行動と、センサの出力の時系列データとを対応付けたマップを作成するマップ作成方法であって、(a) ヒューマノイドロボットが認識すべき複数の環境において各環境に適した行動を実行するように複数の行動がそれぞれ調整されているヒューマノイドロボットを準備し、(b) ヒューマノイドロボットを用いて、認識すべき複数の環境で複数の行動をそれぞれ実行することによって、各行動を実行しているときのセンサの出力の時系列データを計測し、(c) 環境と、当該環境で実行した行動と、当該行動を実行しているときに計測されたセンサの出力の時系列データとを対応付ける、マップ作成方法である。 The invention of claim 6 creates a map for a humanoid robot having a sensor and recognizing the environment by associating the environment where the humanoid robot is placed, the behavior of the humanoid robot, and the time-series data of the sensor output. (A) preparing a humanoid robot in which a plurality of behaviors are adjusted so as to execute a behavior suitable for each environment in a plurality of environments to be recognized by the humanoid robot, and (b) By using a humanoid robot to perform multiple actions in multiple environments that should be recognized, the time series data of the sensor output when each action is being executed is measured. A mapping that associates actions performed in the environment with time-series data of sensor outputs measured when the actions are executed. It is the creation method.

請求項６の発明によれば、環境と行動とセンサ出力とを対応付けたマップを作成することができるので、センサを備えたヒューマノイドロボットがこのマップを用いて環境を認識することが可能になる。 According to the invention of claim 6, since a map in which the environment, the action, and the sensor output are associated with each other can be created, the humanoid robot equipped with the sensor can recognize the environment using this map. .

この発明によれば、センサ出力の時系列データとロボットの行動と環境とを対応付けたマップを備えているので、このマップに基づいて作成された環境を判別対象とした決定木と、行動中のセンサ出力とに基づいて、環境を判定することができる。したがって、センサが少なくても環境を認識することができる。さらに、認識した環境に適した行動を実行することができるので、環境内を円滑に動作することができる。 According to the present invention, since the map in which the time series data of the sensor output, the robot action and the environment are associated with each other is provided, the decision tree for which the environment created based on this map is a discrimination target, The environment can be determined based on the sensor output. Therefore, the environment can be recognized even if there are few sensors. Furthermore, since an action suitable for the recognized environment can be executed, the environment can be smoothly operated.

この発明の上述の目的，その他の目的，特徴および利点は、図面を参照して行う以下の実施例の詳細な説明から一層明らかとなろう。 The above object, other objects, features and advantages of the present invention will become more apparent from the following detailed description of embodiments with reference to the drawings.

図１はこの発明の一実施例のヒューマノイドロボット（以下、単に「ロボット」ともいう。）１０の外観を示し、図２はこのロボット１０をモデル表記した図解図である。図２を参照して、この実施例のロボット１０は、垂直方向または縦方向に間隔を隔てて設けられる２つの横杆１２および１４を含み、この上下横杆１２および１４の間には、胴体１６が形成される。 FIG. 1 shows the appearance of a humanoid robot (hereinafter also simply referred to as “robot”) 10 according to one embodiment of the present invention, and FIG. 2 is an illustrative view showing the robot 10 as a model. Referring to FIG. 2, the robot 10 of this embodiment includes two recumbent arms 12 and 14 that are spaced apart in the vertical direction or the longitudinal direction, and between the upper and lower recumbent surfaces 12 and 14, 16 is formed.

胴体１６の上部には、胸の自由度を形成する１つのヨー（Ｙａｗ）軸関節１８が設けられ、その下方には腰の自由度を形成する１つのピッチ（Ｐｉｔｃｈ）軸関節２０が設けられる。なお、よく知られているように、ヨー軸とは鉛直回転軸（Ｚ軸）であり、ピッチ軸とはこのヨー軸に直行する２つの軸の一方、たとえばＸ軸であり、そして、後述のロール（Ｒｏｌｌ）軸がヨー軸に直行する２つの軸の他方、たとえばＹ軸である。Ｘ軸（ピッチ軸）は図２の紙面に平行な方向に延びる回転軸であり、Ｙ軸（ロール軸）は図２の紙面に直行する方向に延びる回転軸である。 One yaw shaft joint 18 that forms the degree of freedom of the chest is provided on the upper portion of the body 16, and one pitch shaft joint 20 that forms the degree of freedom of the waist is provided below the body 16. . As is well known, the yaw axis is a vertical rotation axis (Z axis), the pitch axis is one of two axes perpendicular to the yaw axis, for example, the X axis, and will be described later. The roll axis is the other of the two axes perpendicular to the yaw axis, for example, the Y axis. The X axis (pitch axis) is a rotation axis extending in a direction parallel to the paper surface of FIG. 2, and the Y axis (roll axis) is a rotation axis extending in a direction perpendicular to the paper surface of FIG.

上横杆１２の両端には、腕２２Ｒおよび２２Ｌがそれぞれ取り付けられる。これら左右の腕２２Ｒおよび２２Ｌは同じ構成である。なお、右左を区別する必要があるときは、右を示す「Ｒ」または左を表す「Ｌ」を付し、また区別する必要がないときはこの「Ｒ」や「Ｌ」を付けないで示すようにして、重複する説明はできるだけ省く。このことは、後述の足についても同様である。 Arms 22 </ b> R and 22 </ b> L are attached to both ends of the upper horizontal arm 12, respectively. These left and right arms 22R and 22L have the same configuration. When it is necessary to distinguish between right and left, “R” indicating right or “L” representing left is added, and when it is not necessary to distinguish, “R” or “L” is not added. In this way, duplicate descriptions are omitted as much as possible. The same applies to the feet described later.

腕２２は、肩の自由度を形成するロール軸関節２４を含み、この肩関節２４によって上横杆１２に、ロール軸周りに回転可能に取り付けられる。この肩関節２４の先には、それぞれ１つずつのピッチ軸関節２６、ヨー軸関節２８、およびロール軸関節３０の直列接続が設けられる。したがって、この実施例のロボット１０の場合、腕２２は、４自由度であり、左右で８自由度である。 The arm 22 includes a roll shaft joint 24 that forms a degree of freedom of the shoulder, and is attached to the upper lateral reed 12 by the shoulder joint 24 so as to be rotatable around the roll shaft. In front of the shoulder joint 24, one pitch shaft joint 26, one yaw shaft joint 28, and one roll shaft joint 30 are connected in series. Therefore, in the robot 10 of this embodiment, the arm 22 has 4 degrees of freedom and 8 degrees of freedom on the left and right.

また、下横杆１４の両端には、足３２Ｒおよび３２Ｌがそれぞれ取り付けられる。足３２は、股関節の自由度を形成する１つのロール軸関節３４によって下横杆１４に取り付けられ、この股関節３２から先の下端方向には、連続する３つのピッチ軸関節３６、３８および４０、および１つのロール軸関節４２、さらには１つのヨー軸関節４４の直列接続が取り付けられる。したがって、この実施例のロボット１０の場合、足３２は、６自由度であり、左右で１２自由度である。 In addition, legs 32R and 32L are attached to both ends of the lower lying heel 14, respectively. The foot 32 is attached to the lower recumbent joint 14 by one roll shaft joint 34 that forms a degree of freedom of the hip joint. In the lower end direction from the hip joint 32, three pitch shaft joints 36, 38 and 40, which are continuous, And a series connection of one roll shaft joint 42 and one yaw shaft joint 44 is attached. Therefore, in the case of the robot 10 of this embodiment, the foot 32 has 6 degrees of freedom and 12 degrees of freedom on the left and right.

図１の実施例のロボット１０の電気的構成が図２のブロック図に示される。図２に示すように、このロボット１０は、全体の制御のためにマイクロコンピュータまたはＣＰＵ４６を含み、このＣＰＵ４６には、バス４８を通して、メモリ５０，モータ制御ボード５２，およびセンサ入力／出力ボード５４が接続される。 The electrical configuration of the robot 10 of the embodiment of FIG. 1 is shown in the block diagram of FIG. As shown in FIG. 2, the robot 10 includes a microcomputer or CPU 46 for overall control. The CPU 46 includes a memory 50, a motor control board 52, and a sensor input / output board 54 through a bus 48. Connected.

メモリ５０は、図示しないが、ＲＯＭやＲＡＭを含み、ＲＯＭにはこのロボット１０の制御プログラムおよびデータが予め書き込まれていて、ＲＡＭは、一時記憶メモリとして用いられるとともにワーキングメモリとして利用され得る。なお、このメモリ５０には、図４に示すような環境‐最適行動テーブルや、図５に示すようなセンサ出力時系列‐行動‐環境対応付けマップが予め記憶されているが、これらの説明は後述する。 Although not shown, the memory 50 includes a ROM and a RAM. In the ROM, the control program and data of the robot 10 are written in advance, and the RAM can be used as a temporary storage memory and a working memory. The memory 50 stores in advance an environment-optimal behavior table as shown in FIG. 4 and a sensor output time series-behavior-environment association map as shown in FIG. It will be described later.

センサ入力／出力ボード５４には、図２の胴体１６に設けられる１つの２軸加速度センサ５６と、関節角度センサ５８とが接続される。なお、関節角度センサ５８は１つのブロックで示すが、実際には、上で説明した２２個の関節にそれぞれ個別に設けられ、それぞれの関節角を検出する２２個の関節角度センサを含む。２軸加速度センサ５６は、ロボット１０の前後方向および左右方向の２軸で加速度を検出することができる。この２軸加速度センサ５６の出力を用いて、ＣＰＵ４６は、ロボット１０の２足歩行や反動起き上がり等を制御する。また、ＣＰＵ４６は、この２軸加速度センサ５６の出力時系列を計測して、メモリ５０に記憶された対応付けマップを用いて環境を認識する。 The sensor input / output board 54 is connected with one biaxial acceleration sensor 56 provided on the body 16 of FIG. 2 and a joint angle sensor 58. The joint angle sensor 58 is shown as one block, but actually includes 22 joint angle sensors that are individually provided for the 22 joints described above and detect the respective joint angles. The biaxial acceleration sensor 56 can detect acceleration with two axes in the front-rear direction and the left-right direction of the robot 10. Using the output of the biaxial acceleration sensor 56, the CPU 46 controls the bipedal walking, reaction rise, and the like of the robot 10. Further, the CPU 46 measures the output time series of the biaxial acceleration sensor 56 and recognizes the environment using the association map stored in the memory 50.

モータ制御ボード５２は、たとえばＤＳＰ(Digital Signal Processor) で構成され、各腕、各足、胸、腰の各関節軸モータ６０を制御する。図３では図の簡略化のために、関節軸モータ６０は１つのブロックで示すが、実際には、上で説明した２２個の関節のそれぞれに１つのモータ（サーボモータ）が設けられ、それぞれの関節角度を制御する２２個のモータを含む。 The motor control board 52 is composed of, for example, a DSP (Digital Signal Processor), and controls each joint shaft motor 60 for each arm, each leg, chest, and waist. In FIG. 3, the joint shaft motor 60 is shown as one block for the sake of simplification, but in reality, one motor (servo motor) is provided for each of the 22 joints described above. It includes 22 motors that control the joint angle.

図４には、メモリ５０に予め記憶された、環境と最適行動とを対応付けたテーブルの一例が示される。この実施例のロボット１０では、当該ロボット１０が置かれることになるすなわちロボット１０が認識すべき多数の環境Ｅ１，Ｅ２，…Ｅｉ（ｉは所定の整数）が予め把握されている。環境は、たとえばその床（または地面）の反力および形状等によって分類される。具体的には、床の硬さ（硬＜＞柔）、および床の滑り（摩擦：小＜＞大）などの要素の違いで分類され、この実施例ではｉ個の環境が想定される。この図４の例では、たとえば、環境Ｅ１は、硬さが硬くかつ摩擦が小さいものであり、環境Ｅ２は、硬さが硬くかつ摩擦が中程度のものであり、また、環境Ｅｉは、硬さが柔らかくかつ摩擦が大きいものである。 FIG. 4 shows an example of a table stored in advance in the memory 50 in which the environment and the optimum behavior are associated with each other. In the robot 10 of this embodiment, the robot 10 is placed, that is, a large number of environments E1, E2,... Ei (i is a predetermined integer) that the robot 10 should recognize are known in advance. The environment is classified by, for example, the reaction force and shape of the floor (or the ground). Specifically, it is classified by the difference in factors such as floor hardness (hard <> soft) and floor slip (friction: small <> large). In this embodiment, i environments are assumed. In the example of FIG. 4, for example, the environment E1 is hard and has low friction, the environment E2 is hard and has moderate friction, and the environment Ei is hard. Is soft and has high friction.

また、このロボット１０では、たとえば「歩く」、「寝ころぶ」、「反動起き上がり」等の複数（ｊ個：ｊは所定の整数）の基本行動の全てについて、各環境で最適になるように予め調整された環境ごとの行動制御データが準備されている。そして、この環境‐最適行動テーブルでは、各環境に対応付けられて当該環境に最適な行動（識別子）が記憶されている。したがって、このテーブルを用いて各行動はどの環境に最適な行動であるのかを把握することが可能である。この図４の例では、たとえば、環境Ｅ１に最適な行動は、「歩く」行動に関しては行動Ａ１１であり、「寝ころぶ」行動に関しては行動Ａ１２であり、また、「反動起き上がり」行動に関しては行動Ａ１ｊであることが分かる。 Further, in this robot 10, for example, all of a plurality of basic actions (j: j is a predetermined integer) such as “walking”, “falling down”, “rebound rising”, etc. are adjusted in advance so as to be optimal in each environment. Action control data for each environment is prepared. In this environment-optimal action table, an action (identifier) that is optimal for the environment is stored in association with each environment. Therefore, using this table, it is possible to grasp which environment is the most suitable action for each action. In the example of FIG. 4, for example, the optimal action for the environment E1 is the action A11 for the “walking” action, the action A12 for the “falling down” action, and the action A1j for the “wake-up reaction” action. It turns out that it is.

図５には、メモリ５０に予め記憶された、センサ出力時系列‐行動‐環境対応付けマップの一例が示される。対応付けマップは、２軸加速度センサ５６の出力の時系列データと、ロボット１０の実行した行動と、ロボット１０の置かれた環境とを対応付けたものである。 FIG. 5 shows an example of a sensor output time series-behavior-environment association map stored in advance in the memory 50. The association map associates the time-series data of the output of the biaxial acceleration sensor 56, the action performed by the robot 10, and the environment where the robot 10 is placed.

ここで、この対応付けマップの作成方法について説明する。本発明者等は、外界の情報を検知するセンサの出力は、ロボットの行動と環境の特徴とに密接な関係があると考え、ロボットに環境を認識させ、また環境に適した行動を選択させるために、「ロボットの行動」と「センサ出力の時系列」と「環境」とを対応付けたマップを作成することとした。 Here, a method for creating this association map will be described. The present inventors consider that the output of a sensor that detects information of the outside world is closely related to the behavior of the robot and the characteristics of the environment, and allows the robot to recognize the environment and to select a behavior suitable for the environment. Therefore, a map in which “robot behavior”, “time series of sensor outputs”, and “environment” are associated with each other is created.

対応付けマップを作成する際には、まず、ユーザが、ロボット１０の全て（ｊ個）の基本行動について、認識すべき多数（ｉ個）の環境Ｅ１−Ｅｉのそれぞれに最適なものになるようにチューニングをそれぞれ行って、各環境での最適行動Ａ１１−Ａｉｊとして各制御データを作成し、メモリ５０に格納しておく。たとえば、「歩く」行動については、滑り易い環境と滑り難い環境とでは足の下ろし方を変えたりするなど各環境で動作するのに適するようなチューニングが行われる。こうして、このロボット１０では、認識すべき多数の環境において各環境に適した行動を実行するように全ての行動がそれぞれ調整されている。 When creating a correspondence map, first, the user is optimal for each of a large number (i) of environments E1-Ei to be recognized for all (j) basic actions of the robot 10. Each control data is created as optimum behavior A11-Aij in each environment, and stored in the memory 50. For example, the “walking” behavior is tuned so as to be suitable for operation in each environment, such as changing the way the foot is lowered in a slippery environment and a slippery environment. Thus, in this robot 10, all the actions are adjusted so as to execute actions suitable for each environment in a large number of environments to be recognized.

なお、図３では図示を省略してあるが、このロボット１０には外部インタフェース（たとえばＲＳ２３２Ｃ）が設けられ、図示しない外部のコンピュータとの間でデータを通信することが可能にされており、したがって、外部のコンピュータ上で作成した最適行動の制御データ等をロボット１０のメモリ５０に書き込むことができる。 Although not shown in FIG. 3, the robot 10 is provided with an external interface (for example, RS232C) so that data can be communicated with an external computer (not shown). The control data of the optimum behavior created on the external computer can be written in the memory 50 of the robot 10.

そして、このチューニングされたロボット１０を用いて、認識すべき多数の環境Ｅ１−Ｅｉにおいて、全ての行動Ａ１１−Ａｉｊを実行することによって、各行動を実行しているときの２軸加速度センサ５６の出力の時系列データを計測してメモリ５０に蓄積する。具体的には、この実施例では、（ｊ×ｉ）個の行動をｉ個の環境で実行するので、（ｊ×ｉ×ｉ）個のセンサ出力時系列データが得られる。メモリ５０に蓄積したセンサ出力データは、たとえば、上述の外部インタフェースを介して外部コンピュータに転送する。 Then, by using this tuned robot 10, by executing all the actions A11-Aij in a large number of environments E1-Ei to be recognized, the two-axis acceleration sensor 56 when executing each action Output time-series data is measured and stored in the memory 50. Specifically, in this embodiment, since (j × i) actions are executed in i environments, (j × i × i) sensor output time-series data can be obtained. The sensor output data stored in the memory 50 is transferred to an external computer via the above external interface, for example.

そして、たとえば外部コンピュータ上で、ロボット１０の置かれた環境と、当該環境でロボット１０の実行した行動と、当該行動を実行しているときのセンサ出力の時系列データとを対応付けることによって、図５に示すようなマップを作成する。作成されたマップデータは、外部コンピュータからロボット１０に転送されてメモリ５０に格納される。 For example, on an external computer, the environment in which the robot 10 is placed, the action performed by the robot 10 in the environment, and the time-series data of the sensor output when the action is executed are associated with each other. A map as shown in FIG. The created map data is transferred from the external computer to the robot 10 and stored in the memory 50.

なお、上述では、外部コンピュータ上でマップを作成する場合を説明したが、環境と行動とセンサ出力の対応付けは、ロボット１０上でそのまま行うようにしてもよい。 In the above description, the map is created on the external computer. However, the environment, the action, and the sensor output may be associated on the robot 10 as they are.

図５の例では、たとえば、センサ出力（時系列データ）Ｄ１と、行動Ａ１１と、環境Ｅ１，Ｅ２およびＥ５とが対応付けられていることが分かる。これは、行動Ａ１１を環境Ｅ１，Ｅ２およびＥ５で実行すると、同じセンサ出力Ｄ１が得られることを意味している。なお、行動Ａ１１は環境Ｅ１に最適になるようにチューニングされたものであるので、環境Ｅ１で行動Ａ１１を実行したときに得られたセンサ出力Ｄ１は、最適なセンサ出力と表現できる。 In the example of FIG. 5, for example, it can be seen that the sensor output (time-series data) D1, the action A11, and the environments E1, E2, and E5 are associated with each other. This means that when the action A11 is executed in the environments E1, E2, and E5, the same sensor output D1 is obtained. Since the action A11 is tuned to be optimal for the environment E1, the sensor output D1 obtained when the action A11 is executed in the environment E1 can be expressed as an optimum sensor output.

この実施例のロボット１０は、当該ロボット１０がどのような環境に置かれているのかを、最初に、あるいは環境の変化を検出したときなどに判定して、環境を認識する。環境を認識する際には、上述のようなセンサ出力時系列‐行動‐環境対応付けマップに基づいて、環境を認識するための決定木（decision tree）が作成される。つまり、ロボット１０のメモリ５０には、たとえばＩＤ３やＣ４．５等のような決定木自動生成アルゴリズムによる決定木作成プログラムが備えられていて、ＣＰＵ４６はその決定木作成プログラムを実行して、対応付けマップに基づいて環境を判別対象として決定木を作成する。この実施例では、ロボット１０の内部で決定木を自動作成することができるので、環境認識のために最初に実行する行動をランダムに選択することができる。 The robot 10 of this embodiment recognizes the environment by determining in what environment the robot 10 is placed at the beginning or when a change in the environment is detected. When recognizing the environment, a decision tree for recognizing the environment is created based on the sensor output time series-behavior-environment association map as described above. That is, the memory 50 of the robot 10 is provided with a decision tree creation program based on a decision tree automatic generation algorithm such as ID3 or C4.5, for example, and the CPU 46 executes the decision tree creation program to associate Based on the map, a decision tree is created with the environment as a discrimination target. In this embodiment, since a decision tree can be automatically created inside the robot 10, an action to be executed first for environment recognition can be selected at random.

なお、決定木は、入力パターンのクラスを決定するための多段決定過程（multistage decision process）の一手法である。決定木では、各決定手順の適用順序が予め完全に定まっており、決定手順（分類器）の結果によって、次の段でどの分類器に行くかが定まる（参考文献：鳥脇純一郎、テレビジョン学会教科書シリーズ９「認識工学―パターン認識とその応用―」、コロナ社、１９９３）。 The decision tree is a technique for determining the input pattern class. In the decision tree, the order of application of each decision procedure is completely determined in advance, and the decision step (classifier) determines which classifier to go to in the next stage (Reference: Junichiro Toriwaki, Television Society) Textbook series 9 “Cognitive engineering: pattern recognition and its application”, Corona, 1993).

この実施例の決定木では、ロボット１０の実行した行動とそのときのセンサ出力とによってロボットの存在する環境を判定していく決定手順の適用順序が定められている。したがって、決定木において環境が決定されるまで、決定木に従って行動の実行とそのときのセンサ出力による判定とを繰り返していくことで、環境を認識することができる。 In the decision tree of this embodiment, the application order of the decision procedure for determining the environment in which the robot exists is determined by the action performed by the robot 10 and the sensor output at that time. Therefore, until the environment is determined in the decision tree, the environment can be recognized by repeating the execution of the action and the determination based on the sensor output at that time according to the decision tree.

詳しくは、このロボット１０は、決定木に基づいて、その未認識の環境で行動するとともに、その行動中のセンサ出力の時系列を計測する。そして、センサ出力の計測結果に基づいて、環境の判定を試みる。環境が認識できなかった場合には、決定木に基づいて次の行動を選択し、再び行動、計測および判定を繰り返す。このようにして、このロボット１０は、決定木に基づいて、行動をしてじたばたするうちに、どの環境にいるのかを認識することができる。 Specifically, the robot 10 acts in the unrecognized environment based on the decision tree, and measures a time series of sensor outputs during the behavior. Then, based on the measurement result of the sensor output, an attempt is made to determine the environment. If the environment cannot be recognized, the next action is selected based on the decision tree, and the action, measurement, and determination are repeated again. In this way, the robot 10 can recognize which environment it is in while performing an action based on the decision tree.

図６には、作成される決定木の一例の一部の概略が示される。この図６の例では、最初の段に行動Ａ１１が設定されている。なお、この実施例では決定木の最初の段の行動はランダムで決定される。この図６の決定木による環境判定の流れの一例を説明すると、まず、行動Ａ１１の実行の結果、センサ出力Ｄ１が計測された場合には、次の段では、行動Ａ５５が選択されて実行される。そして、この行動Ａ５５の実行の結果、センサ出力Ｄ５が計測されなかった場合には、環境が決定され、つまり、このロボット１０の置かれている環境は環境Ｅ２であることが認識される。 FIG. 6 shows an outline of a part of an example of a decision tree to be created. In the example of FIG. 6, action A11 is set in the first stage. In this embodiment, the behavior of the first stage of the decision tree is determined at random. An example of the flow of environment determination by the decision tree of FIG. 6 will be described. First, when the sensor output D1 is measured as a result of the execution of the action A11, the action A55 is selected and executed in the next stage. The If the sensor output D5 is not measured as a result of the execution of the action A55, the environment is determined, that is, it is recognized that the environment where the robot 10 is placed is the environment E2.

そして、ロボット１０は、環境を認識すると、その認識した環境に適した行動を、たとえば環境‐最適行動テーブルに基づいて、実行することができる。したがって、ロボット１０は、認識した環境に合わせて、望みどおりに動作することができる。具体的には、床の硬さや滑り具合に合わせて、たとえば２足歩行することができるので、滑って転倒したりバランスを崩したりすることなく円滑に行動することができる。 When the robot 10 recognizes the environment, the robot 10 can execute an action suitable for the recognized environment, for example, based on the environment-optimal action table. Therefore, the robot 10 can operate as desired in accordance with the recognized environment. Specifically, since it is possible to walk, for example, two legs according to the hardness of the floor and the degree of sliding, it is possible to act smoothly without slipping and falling or breaking the balance.

さらに、このロボット１０では、環境に適した行動を実行しているときの、２軸加速度センサ５６の出力の時系列データを計測して、環境の変化を判別するようにしている。つまり、認識している環境に適した行動を実行したときのセンサ出力の時系列データは、マップで対応付けられている最適なセンサ出力となるはずである。そこで、センサ出力の時系列データが、認識している環境と実行した行動とに対応するものであるか否かを、対応付けマップに基づいて判断することによって、環境の変化を把握する。そして、センサ出力の時系列データがその環境と行動とに対応するものではなかった場合には、環境が変化してしまったことがわかるので、再び上述のような環境を認識するための処理を実行する。したがって、この実施例では、一旦環境を認識した後でも環境の変化を把握できるので、変化後の環境を認識でき、さらに変化後の環境に適した行動を実行できる。 Further, the robot 10 measures the time series data of the output of the biaxial acceleration sensor 56 when executing an action suitable for the environment, and discriminates a change in the environment. That is, the time series data of the sensor output when the action suitable for the recognized environment is executed should be the optimum sensor output associated with the map. Therefore, the change in the environment is grasped by determining whether the time series data of the sensor output corresponds to the recognized environment and the executed action based on the association map. If the time series data of the sensor output does not correspond to the environment and action, it can be seen that the environment has changed, so the process for recognizing the environment as described above is performed again. Execute. Therefore, in this embodiment, since the change of the environment can be grasped even after the environment is recognized once, the environment after the change can be recognized, and further, the action suitable for the environment after the change can be executed.

具体的には、ロボット１０のＣＰＵ４６は、図７に示すようなフロー図に従って処理を実行する。図７の最初のステップＳ１では、メモリ５０に予め登録されている行動の中からランダムに１つの行動を選択する。次に、ステップＳ３で、決定木作成プログラムを実行して、選択した行動を初めの段とする決定木を、センサ出力時系列‐行動‐環境対応付けマップに基づいて生成する。 Specifically, the CPU 46 of the robot 10 executes processing according to a flowchart as shown in FIG. In the first step S <b> 1 in FIG. 7, one action is randomly selected from actions registered in advance in the memory 50. Next, in step S3, the decision tree creation program is executed to generate a decision tree having the selected action as the first stage based on the sensor output time series-behavior-environment association map.

続いて、ステップＳ５で、選択した行動の制御データをメモリ５０から読み出して、その行動の実行を開始する。ステップＳ７では、行動中の２軸加速度センサ５６からの出力をセンサ入力／出力ボード５４を介して一定周期で計測して、行動中のセンサ出力時系列データをメモリ５０に記憶する。 Subsequently, in step S5, the control data of the selected action is read from the memory 50, and the execution of the action is started. In step S <b> 7, the output from the biaxial acceleration sensor 56 during action is measured at a constant cycle via the sensor input / output board 54, and the sensor output time series data during action is stored in the memory 50.

そして、ステップＳ９では、決定木に基づく判定を行う。つまり、計測されたセンサ出力時系列データに基づいて、決定木の次の段でどの決定手順に進むかを判定する。たとえば図６の最初の行動Ａ１１を実行した場合で説明すると、計測されたセンサ出力データが、センサ出力Ｄ１であるか、またはその他の出力であるかを判断し、次にどちらの行動へ進むかを決定する。 In step S9, determination based on the decision tree is performed. That is, based on the measured sensor output time-series data, it is determined which decision procedure is to be advanced at the next stage of the decision tree. For example, in the case where the first action A11 in FIG. 6 is executed, it is determined whether the measured sensor output data is the sensor output D1 or another output, and which action is followed next. To decide.

続いて、ステップＳ１１で、環境を認識できたか否かを判断する。つまり、ステップＳ９の判定によって、決定木においてセンサ出力に基づいて進めた次の段が環境を決定するものであったか否かを判断する。 Subsequently, in step S11, it is determined whether or not the environment has been recognized. That is, the determination in step S9 determines whether the next stage advanced based on the sensor output in the decision tree is to determine the environment.

ステップＳ１１で“ＮＯ”であれば、つまり、環境を未だ認識するに至っていない場合には、続くステップＳ１３で、決定木に基づいて次の行動を選択する。つまり、決定木において進めた次の段の決定手順として設定されている行動を選択する。そして、ステップＳ５へ戻って、再び、選択した行動の実行（ステップＳ５）および行動中のセンサ出力計測（ステップＳ７）を処理し、センサ出力結果に基づいて決定木による判定（ステップＳ９）を行って、環境を認識できたか判断する（ステップＳ１１）。このように、ステップＳ５からステップＳ１３の処理を繰り返して、決定木に基づいて環境の認識を行う。 If “NO” in the step S11, that is, if the environment has not yet been recognized, in the subsequent step S13, the next action is selected based on the decision tree. That is, the action set as the decision procedure for the next stage advanced in the decision tree is selected. Then, returning to step S5, the execution of the selected action (step S5) and the sensor output measurement during the action (step S7) are processed again, and the determination by the decision tree (step S9) is performed based on the sensor output result. Then, it is determined whether the environment has been recognized (step S11). In this way, the processing from step S5 to step S13 is repeated to recognize the environment based on the decision tree.

そして、ステップＳ１１で“ＹＥＳ”であれば、つまり、決定木において環境の決定に至った場合には、続くステップＳ１５で、認識した環境がいずれであるかをメモリ５０に記録する。ステップＳ１７では、ロボット１０の内部プログラムあるいは外部プログラムまたはユーザから下される行動指令に従い、その環境に適した行動をたとえば環境‐最適行動テーブルに基づいて選択する。たとえば、認識した環境が環境Ｅ１であった場合には、当該環境Ｅ１に最適な行動Ａ１１−行動Ａ１ｊの中から実行すべき行動を選択する。 If “YES” in the step S11, that is, if the environment is determined in the decision tree, in the subsequent step S15, the recognized environment is recorded in the memory 50. In step S17, an action suitable for the environment is selected based on, for example, the environment-optimal action table in accordance with an internal program or an external program of the robot 10 or an action command given by the user. For example, when the recognized environment is the environment E1, the action to be executed is selected from the action A11-action A1j that is optimal for the environment E1.

続いて、ステップＳ１９で、選択した行動の実行を開始するとともに、ステップＳ２１で、行動中の２軸加速度センサ５６の出力時系列データを計測する。 Subsequently, in step S19, execution of the selected action is started, and in step S21, output time-series data of the biaxial acceleration sensor 56 during action is measured.

そして、ステップＳ２３では、計測したセンサ出力データが、認識した環境と実行した行動に最適なものであったか否かを対応付けマップに基づいて判断する。つまり、このステップＳ２３では、環境が変化したかどうかを判定している。詳しくは、ステップＳ１９では、認識された環境において、当該環境に最適になるように調整された行動が実行されるので、ステップＳ２１では、マップにおいて当該環境と当該行動とに対応付けられた最適なセンサ出力が計測されるはずである。したがって、認識した環境において最適な行動を実行したときに計測されたセンサ出力が、対応付けマップに設定されたセンサ出力ではなかった場合には、認識していた環境が変化してしまったことが分かる。 In step S23, it is determined based on the association map whether the measured sensor output data is optimal for the recognized environment and the executed action. That is, in this step S23, it is determined whether or not the environment has changed. Specifically, in step S19, an action adjusted to be optimal for the environment is executed in the recognized environment. Therefore, in step S21, an optimum associated with the environment and the action in the map is executed. The sensor output should be measured. Therefore, if the sensor output measured when the optimal action is executed in the recognized environment is not the sensor output set in the association map, the recognized environment may have changed. I understand.

そこで、ステップＳ２３で“ＮＯ”であれば、つまり、環境が変化してしまった場合には、その環境の認識を試みるべく、再びステップＳ１に戻って処理を繰り返す。これによって、変化した環境を認識することができるので、以後もその変化した環境に最適な行動を実行していくことができる。 Therefore, if “NO” in the step S23, that is, if the environment has changed, the process returns to the step S1 again to repeat the process in order to try to recognize the environment. As a result, the changed environment can be recognized, and the optimum action for the changed environment can be executed thereafter.

一方、ステップＳ２３で“ＹＥＳ”であれば、つまり、環境が変化していない場合には、ステップＳ２５で、ロボット１０の内部プログラムあるいは外部プログラムまたはユーザから下される次の行動指令に従い、その環境において実行すべき次の行動の選択を処理する。そして、ステップＳ２７で、行動を終了するか否かを判断し、“ＮＯ”であれば、つまり、実行すべき行動がある場合には、ステップＳ１９に戻って、次の行動を実行する。他方、ステップ２７で“ＹＥＳ”であればこの処理を終了する。 On the other hand, if “YES” in the step S23, that is, if the environment has not changed, in the step S25, the environment is determined in accordance with the internal program or the external program of the robot 10 or the next action command issued from the user. Process the selection of the next action to be performed. In step S27, it is determined whether or not to end the action. If “NO”, that is, if there is an action to be executed, the process returns to step S19 to execute the next action. On the other hand, if “YES” in the step 27, the process is ended.

この実施例によれば、行動中のセンサ出力を長い時系列で見ることによって、２軸加速度センサ５６の１種類１個という非常に少量のセンサであっても、通常の使い方では分からなかった情報を抽出でき、つまり、環境を認識することができる。したがって、このロボット１０では、認識した環境に適した行動を実行していくことができる。また、センサが非常に少量でも認識できるので、たとえば省配線および省電力を実現することができる。また、その他のセンサと組み合わせることによって、さらに多くの環境情報を抽出することができ、より複雑な行動選択を行うことも可能となる。 According to this embodiment, by looking at the sensor output during action in a long time series, even a very small amount of sensors of one type of the biaxial acceleration sensor 56 could not be known in normal usage. Can be extracted, that is, the environment can be recognized. Therefore, the robot 10 can execute an action suitable for the recognized environment. Further, since the sensor can be recognized even with a very small amount, for example, wiring saving and power saving can be realized. Further, by combining with other sensors, more environmental information can be extracted, and more complex action selection can be performed.

なお、上述の実施例では、図７のステップＳ２３で環境の変化を検出して、再び環境の認識処理を行なうとき、ステップＳ１に戻ってランダムに行動を選択することとなるが、他の実施例では、環境の変化を認識したときに実行していた行動を選択することによって、行動を継続しつつ環境の認識処理を行うようにしてもよい。 In the above-described embodiment, when an environmental change is detected in step S23 of FIG. 7 and the environment recognition process is performed again, the process returns to step S1 to randomly select an action. In the example, the environment recognition process may be performed while continuing the action by selecting the action that was executed when the change in the environment was recognized.

また、上述の各実施例では、メモリ５０に対応付けマップを記憶しておき、環境の認識のための最初の行動を任意に選択し、環境の認識処理を実行するたびに決定木を作成するようにしていたが、他の実施例では、外部コンピュータなどで対応付けマップから予め作成しておいた決定木を、メモリ５０に記憶しておくようにしてもよい。この場合には、決定木が固定されるので、環境の認識手法が固定される。つまり、ロボット１０は、環境を認識する際には、最初にいつも同じ行動を実行することとなる。 In each of the above-described embodiments, a correspondence map is stored in the memory 50, a first action for environment recognition is arbitrarily selected, and a decision tree is created each time an environment recognition process is executed. However, in another embodiment, a decision tree created in advance from an association map by an external computer or the like may be stored in the memory 50. In this case, since the decision tree is fixed, the environment recognition method is fixed. That is, the robot 10 always performs the same action first when recognizing the environment.

また、上述の各実施例では、ロボット１０には２軸加速度センサ５６を設けるようにしていた。そして、床（あるいは地面）の硬さや床の滑り等の要素の違いによって分類される環境において、歩く、寝ころぶ、反動起き上がりなどの床に接触する行動を実行し、行動中のセンサ出力時系列を計測することで、対応付けマップを作成していた。しかし、ロボット１０に設けるセンサの種類は適宜変更されてよい。ただし、認識ないし判別する環境の特徴と行動とが、センサ出力に反映されるものである必要がある。 In each of the above-described embodiments, the robot 10 is provided with the biaxial acceleration sensor 56. Then, in an environment that is classified according to differences in factors such as the hardness of the floor (or the ground) and the slipping of the floor, actions that touch the floor such as walking, lying down, and recoiling are executed, and the sensor output time series during the action is calculated. By making a measurement, a correspondence map was created. However, the type of sensor provided in the robot 10 may be changed as appropriate. However, the environmental features and behaviors to be recognized or discriminated need to be reflected in the sensor output.

また、上述の各実施例の自由度やそれらの具体的構成は単なる例示であり、必ずしも２２自由度なければならないというものではなく、関節軸モータは任意に変更可能である。なお、上述の実施例の２足歩行型のような自由度の多いロボットであれば、床の硬さや滑り（摩擦）の違いなど、判別可能な環境が多くなり、一方、たとえば車輪移動型のロボットの場合には、平滑か凹凸かといった程度で把握できるだけであり、判別可能な環境が少なくなる。 In addition, the degrees of freedom and specific configurations of the above-described embodiments are merely examples, and do not necessarily have 22 degrees of freedom, and the joint shaft motor can be arbitrarily changed. If the robot has a high degree of freedom, such as the biped walking type of the above-described embodiment, the environment that can be discriminated increases such as the difference in floor hardness and sliding (friction). In the case of a robot, it is only possible to grasp whether it is smooth or uneven, and the environment that can be discriminated is reduced.

この発明の一実施例のロボットを示す外観図である。1 is an external view showing a robot according to an embodiment of the present invention. 図１実施例のロボットを示す図解図である。It is an illustration figure which shows the robot of FIG. 1 Example. 図１実施例のロボットの電気的構成を示すブロック図である。It is a block diagram which shows the electrical structure of the robot of FIG. 1 Example. 図３のメモリに記憶される環境ごとの最適行動に関するテーブルの一例を示す図解図である。It is an illustration figure which shows an example of the table regarding the optimal action for every environment memorize | stored in the memory of FIG. 図３のメモリに記憶されるセンサ出力時系列と行動と環境とを対応付けたマップの一例を概略を示す図解図である。It is an illustration figure which shows an outline of an example of the map which matched the sensor output time series memorize | stored in the memory of FIG. 3, an action, and an environment. マップから生成される決定木の一例の概略を示す図解図である。It is an illustration figure which shows the outline of an example of the decision tree produced | generated from a map. 図１実施例のロボットの動作の一例を示すフロー図である。It is a flowchart which shows an example of operation | movement of the robot of FIG. 1 Example.

Explanation of symbols

１０ …ロボット（ヒューマノイド）
２２Ｒ、２２Ｌ …腕
３２Ｒ、３２Ｌ …足
４６ …ＣＰＵ
５０ …メモリ
５２ …モータ制御ボード
５４ …センサ入力／出力ボード
５６ …２軸加速度センサ
６０ …関節軸モータ 10 ... Robot (Humanoid)
22R, 22L ... arms 32R, 32L ... feet 46 ... CPU
DESCRIPTION OF SYMBOLS 50 ... Memory 52 ... Motor control board 54 ... Sensor input / output board 56 ... Biaxial acceleration sensor 60 ... Joint axis motor

Claims

A humanoid robot that recognizes the environment,
Sensor,
Map storage means for storing a map in which the environment in which the humanoid robot is placed, the action of the humanoid robot, and the time series data of the output of the sensor are associated, and the first execution for executing the action for environment recognition Means and first measurement means for measuring time series data of the sensor output when the action is being executed by the first execution means, and the output of the sensor measured by the first measurement means. A humanoid robot comprising recognition means for recognizing an environment based on time-series data and a decision tree for which the environment created based on the map is a discrimination target.

The humanoid robot according to claim 1, wherein the recognizing unit further includes a decision tree creating unit that creates a decision tree whose environment is a discrimination target based on the map.

The sensor includes an acceleration sensor;
The action includes at least one of walking, falling asleep, and getting up;
The humanoid robot according to claim 1 or 2, wherein the environment to be recognized is classified by at least floor hardness or floor slip.

The humanoid robot according to claim 1, further comprising second execution means for executing an action suitable for an environment recognized by the recognition means.

Second measurement means for measuring time-series data of the output of the sensor when the action is being executed by the second execution means, and time-series data of the output of the sensor measured by the second measurement means, An environment change determination unit that determines whether the environment corresponds to the environment recognized by the recognition unit and the action executed by the second execution unit based on the map;
The humanoid robot according to claim 4, wherein when the environment change determination unit determines that the time series data of the output of the sensor does not correspond to the environment and the action, the recognition unit recognizes the environment.

A map creation method for creating a map that associates an environment in which the humanoid robot is placed, an action of the humanoid robot, and time-series data of the output of the sensor for a humanoid robot that includes a sensor and recognizes the environment Because
(a) preparing the humanoid robot in which a plurality of behaviors are adjusted so as to execute a behavior suitable for each environment in a plurality of environments to be recognized by the humanoid robot;
(b) Using the humanoid robot, by measuring each of the plurality of actions in the plurality of environments to be recognized, to measure time series data of the output of the sensor when executing each action,
(c) A map creation method that associates an environment, an action executed in the environment, and time-series data of the sensor output measured when the action is executed.