JP6671577B2

JP6671577B2 - An autonomous robot that identifies people

Info

Publication number: JP6671577B2
Application number: JP2018549031A
Authority: JP
Inventors: 要林
Original assignee: Groove X Inc
Current assignee: Groove X Inc
Priority date: 2016-11-07
Filing date: 2017-11-01
Publication date: 2020-03-25
Anticipated expiration: 2037-11-01
Also published as: WO2018084170A1; JPWO2018084170A1

Description

The present invention relates to a robot that autonomously selects its actions according to its internal state or external environment.

Humans acquire a variety of information from the external environment through their sensory organs and select their actions accordingly. Some actions are selected consciously, others unconsciously. Repeated actions eventually become unconscious, while new actions remain in the conscious domain.

Humans believe that they have the will to choose their own actions freely, that is, free will. Humans feel emotions such as love and hatred toward others because they believe that others also have free will. A being that has free will, or at least one that can be assumed to have free will, can also ease a person's loneliness.

People keep pets not so much because pets are useful as because pets provide comfort. Pets can be good companions to humans precisely because they give, to a greater or lesser degree, the impression of having free will.

On the other hand, many people give up on keeping pets for various reasons: they cannot secure enough time to care for a pet, their living environment does not allow pets, they have allergies, or they find bereavement too painful. If there were a robot that could play the role of a pet, people who cannot keep pets might also be given the kind of comfort that a pet provides (see Patent Document 1).

Patent Document 1: JP 2000-323219 A

In recent years, robot technology has advanced rapidly, but robots have not yet achieved the presence of a pet-like companion. This is because robots do not appear to have free will. By observing behavior that can only be explained by a pet having free will, a human senses free will in the pet, empathizes with the pet, and is comforted by it.
Therefore, a robot that can express human-like or lifelike behavior, and in particular a robot that changes its behavior depending on whom it is dealing with, is expected to greatly increase the empathy people feel toward it.

To realize these behavioral characteristics, the robot must be given the ability to identify humans. In face authentication technology, a captured image that serves as the reference for a known person A (hereinafter, the “master image”) is compared with a captured image of an unidentified person X (hereinafter, the “inspection image”) to determine whether person A and person X are the same person. When a master image is acquired, the system often instructs the person being photographed on posture and facial expression.
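The comparison of a master image with an inspection image can be sketched as follows. This is a minimal illustration rather than the method of this publication: it assumes each image has already been reduced to a numeric feature vector, and the function names, the use of cosine similarity, and the 0.9 threshold are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector carries no direction to compare
    return dot / (norm_a * norm_b)


def is_same_person(master_vector, inspection_vector, threshold=0.9):
    """Judge person X to be person A when the two vectors are similar enough.

    The threshold is an assumed value chosen only for illustration.
    """
    return cosine_similarity(master_vector, inspection_vector) >= threshold
```

A stricter threshold lowers the chance of mistaking one person for another, at the cost of failing to recognize a known person more often.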

A high-quality master image is needed to improve the accuracy of person identification, but placing an excessive burden on the user in order to acquire the master image is undesirable. In particular, for a robot that is meant to exhibit lifelike behavioral characteristics, forcing the user to act in a certain way may make the user aware that the robot is not a living thing.

The present invention was completed on the basis of the above recognition, and its main object is to provide a technique for improving the identification ability of a robot while limiting the burden on the user.

An autonomously acting robot according to one aspect of the present invention includes an imaging control unit that controls a camera, a recognition unit that identifies a moving object based on a feature vector extracted from a captured image of the moving object, an operation selection unit that selects a motion of the robot according to the identification result, a drive mechanism that executes the motion selected by the operation selection unit, and an operation detection unit that detects that the robot has been lifted and held by the moving object.
The recognition unit sets a captured image taken when the robot is lifted and held by the moving object as a master image, and sets an identification criterion for the moving object based on a feature vector extracted from the master image.
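The flow in this aspect (a lift event triggers master image registration, and later images are identified against the registered vectors) can be sketched as follows. This is an illustrative assumption, not the implementation of this publication: the `Recognizer` class, its method names, the caller-supplied `extract_features` function, and the nearest-vector rule are all hypothetical.

```python
class Recognizer:
    """Hypothetical stand-in for the recognition unit."""

    def __init__(self, extract_features):
        # extract_features: callable mapping an image to a feature vector
        # (assumed to be provided by some image-processing component).
        self.extract_features = extract_features
        self.master_vectors = {}  # user id -> feature vector used as the criterion

    def on_lifted(self, user_id, captured_image):
        """Called when the operation detection unit reports that the robot was lifted.

        The image captured at that moment is treated as the master image.
        """
        self.master_vectors[user_id] = self.extract_features(captured_image)

    def identify(self, captured_image):
        """Return the registered user whose master vector is nearest, or None."""
        if not self.master_vectors:
            return None
        vector = self.extract_features(captured_image)

        def squared_distance(user_id):
            master = self.master_vectors[user_id]
            return sum((m - v) ** 2 for m, v in zip(master, vector))

        return min(self.master_vectors, key=squared_distance)
```

The user is never asked to pose: registration happens as a side effect of an ordinary interaction, which is the point of using the lift event as the trigger.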

An autonomously acting robot according to another aspect of the present invention includes an imaging control unit that controls a camera, a recognition unit that identifies a moving object based on a feature vector extracted from a captured image of the moving object, an operation selection unit that selects a motion of the robot according to the identification result, a drive mechanism that executes the motion selected by the operation selection unit, and an operation detection unit that detects a touch by the moving object.
The recognition unit sets a captured image taken when the touch is detected as a master image, and sets an identification criterion for the moving object based on a feature vector extracted from the master image.

An autonomously acting robot according to yet another aspect of the present invention includes an imaging control unit that controls a camera, a recognition unit that identifies a moving object based on a feature vector extracted from a captured image of the moving object, an operation selection unit that selects a motion of the robot according to the identification result, and a drive mechanism that executes the motion selected by the operation selection unit.
The recognition unit sets, as a master image, an image captured in response to the moving object being located at a predetermined point relative to the robot, and sets an identification criterion for the moving object based on a feature vector extracted from the master image.

A behavior control program according to one aspect of the present invention is a computer program for object recognition by a robot.
This program causes the robot to perform a function of setting, as a master image, a captured image of a moving object taken when the robot is lifted and held by the moving object, a function of setting an identification criterion for the moving object based on a feature vector extracted from the master image, and a function of identifying a moving object based on a feature vector extracted from a captured image of the moving object.

A behavior control program according to another aspect of the present invention is a computer program for object recognition by a robot.
This program causes the robot to perform a function of setting, as a master image, a captured image of a moving object taken when the robot is touched by the moving object, a function of setting an identification criterion for the moving object based on a feature vector extracted from the master image, and a function of identifying a moving object based on a feature vector extracted from a captured image of the moving object.

According to the present invention, it becomes easier to improve the identification ability of a robot while limiting the burden on the user.

A front external view of the robot.
A side external view of the robot.
A cross-sectional view schematically showing the structure of the robot.
A configuration diagram of the robot system.
A conceptual diagram of the emotion map.
A hardware configuration diagram of the robot.
A functional block diagram of the robot system.
An illustration of the robot being held.
A data structure diagram of the master information.
A first schematic diagram for explaining the user identification method.
A second schematic diagram for explaining the user identification method.
A flowchart showing the master vector extraction process.
A schematic diagram showing the method of tracking a user in images.
A schematic diagram for explaining the method of extracting a master vector remotely.

FIG. 1(a) is a front external view of the robot 100. FIG. 1(b) is a side external view of the robot 100.
The robot 100 in this embodiment is an autonomously acting robot that determines its actions and gestures based on the external environment and its internal state. The external environment is recognized by various sensors such as a camera and a thermosensor. The internal state is quantified as various parameters that express the emotions of the robot 100. These are described later.

In principle, the robot 100 treats the interior of its owners' house as its range of action. Hereinafter, a human involved with the robot 100 is called a “user”, and a user who is a member of the household to which the robot 100 belongs is called an “owner”. The “moving objects” that the robot 100 should identify include both humans and pets, but this embodiment is described with humans (users) as the target.

The body 104 of the robot 100 has an overall rounded shape and includes an outer skin formed of a soft, elastic material such as urethane, rubber, resin, or fiber. The robot 100 may be dressed in clothes. With a body 104 that is round, soft, and pleasant to the touch, the robot 100 gives the user a sense of security and a comfortable tactile sensation.

The robot 100 has a total weight of 15 kilograms or less, preferably 10 kilograms or less, and more preferably 5 kilograms or less. By 13 months of age, the majority of babies have started walking on their own. The average weight of a 13-month-old baby is slightly over 9 kilograms for boys and slightly under 9 kilograms for girls. Therefore, if the total weight of the robot 100 is 10 kilograms or less, a user can hold the robot 100 with roughly the same effort as holding a baby who cannot yet walk alone. The average weight of a baby less than 2 months old is under 5 kilograms for both boys and girls. Therefore, if the total weight of the robot 100 is 5 kilograms or less, a user can hold the robot 100 with the same effort as holding an infant.

Attributes such as moderate weight, roundness, softness, and a pleasant feel make the robot 100 easy for a user to hold and make the user want to hold it. For the same reason, the height of the robot 100 is desirably 1.2 meters or less, and preferably 0.7 meters or less. For the robot 100 in this embodiment, being able to be held is an important concept.

The robot 100 has three wheels for three-wheeled travel. As illustrated, these are a pair of front wheels 102 (left wheel 102a and right wheel 102b) and one rear wheel 103. The front wheels 102 are drive wheels, and the rear wheel 103 is a driven wheel. The front wheels 102 have no steering mechanism, but their rotation speed and rotation direction can be controlled individually. The rear wheel 103 is a so-called omni wheel and rotates freely so that the robot 100 can move forward, backward, left, and right. By rotating the right wheel 102b faster than the left wheel 102a, the robot 100 can turn left or rotate counterclockwise. By rotating the left wheel 102a faster than the right wheel 102b, the robot 100 can turn right or rotate clockwise.
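The relationship between the two front wheel speeds and the resulting turn can be illustrated with the standard differential-drive model. This is a sketch under assumptions: the function name, the units, and the `track_width` parameter (distance between the two front wheels) are not given in the source.

```python
def body_motion(left_speed, right_speed, track_width):
    """Return (forward speed, turn rate) for a differential-drive body.

    left_speed and right_speed are wheel rim speeds in m/s (negative means
    reverse rotation); track_width is the distance between the two driven
    wheels in metres. A positive turn rate (rad/s) is a left,
    counterclockwise turn; a negative one is a right, clockwise turn.
    """
    forward = (left_speed + right_speed) / 2.0
    turn_rate = (right_speed - left_speed) / track_width
    return forward, turn_rate
```

Driving the two wheels at equal speed in opposite directions gives zero forward speed and a nonzero turn rate, which corresponds to rotating on the spot.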

The front wheels 102 and the rear wheel 103 can be completely retracted into the body 104 by a drive mechanism (a pivoting mechanism and a link mechanism). Even during travel, most of each wheel is hidden by the body 104, but when the wheels are completely retracted into the body 104, the robot 100 becomes unable to move. That is, the body 104 descends as the wheels retract and sits on the floor surface F. In this seated state, a flat seating surface 108 (ground-contact bottom surface) formed on the bottom of the body 104 rests on the floor surface F.

The robot 100 has two hands 106. The hands 106 have no function for gripping objects. They can perform simple movements such as raising, waving, and vibrating. The two hands 106 can also be controlled individually.

The eyes 110 can display images using liquid crystal elements or organic EL elements. The robot 100 carries various sensors, such as a microphone array capable of identifying the direction of a sound source, an ultrasonic sensor, an odor sensor, a distance measurement sensor, and an acceleration sensor. The robot 100 also has a built-in speaker and can emit simple vocal sounds. A capacitive touch sensor is installed on the body 104 of the robot 100. The touch sensor allows the robot 100 to detect a user's touch.

A horn 112 is attached to the head of the robot 100. Because the robot 100 is lightweight as described above, a user can also lift the robot 100 by grasping the horn 112. An omnidirectional camera is attached to the horn 112 and can image the entire region above the robot 100 at once.

FIG. 2 is a cross-sectional view schematically showing the structure of the robot 100.
As shown in FIG. 2, the body 104 of the robot 100 includes a base frame 308, a main body frame 310, a pair of resin wheel covers 312, and an outer skin 314. The base frame 308 is made of metal, forms the axial core of the body 104, and supports the internal mechanisms. The base frame 308 is formed by connecting an upper plate 332 and a lower plate 334 vertically with a plurality of side plates 336. Sufficient spacing is provided between the side plates 336 to allow ventilation. A battery 118, a control circuit 342, and various actuators are housed inside the base frame 308.

The main body frame 310 is made of a resin material and includes a head frame 316 and a torso frame 318. The head frame 316 has a hollow hemispherical shape and forms the head skeleton of the robot 100. The torso frame 318 has a stepped cylindrical shape and forms the torso skeleton of the robot 100. The torso frame 318 is fixed integrally to the base frame 308. The head frame 316 is attached to the upper end of the torso frame 318 so that it can be displaced relative to it.

The head frame 316 is provided with three axes, namely a yaw axis 320, a pitch axis 322, and a roll axis 324, and with an actuator 326 for rotationally driving each axis. The actuator 326 includes a plurality of servomotors for driving the axes individually. The yaw axis 320 is driven for a head-shaking motion, the pitch axis 322 is driven for a nodding motion, and the roll axis 324 is driven for a head-tilting motion.

A plate 325 that supports the yaw axis 320 is fixed to the upper part of the head frame 316. A plurality of ventilation holes 327 are formed in the plate 325 to ensure ventilation between the spaces above and below it.

A metal base plate 328 is provided to support the head frame 316 and its internal mechanisms from below. The base plate 328 is connected to the plate 325 via a cross-link mechanism 329 (pantograph mechanism), and is connected to the upper plate 332 (base frame 308) via a joint 330.

The torso frame 318 houses the base frame 308 and a wheel drive mechanism 370. The wheel drive mechanism 370 includes a pivot shaft 378 and an actuator 379. The lower half of the torso frame 318 is made narrow so as to form a storage space S for the front wheels 102 between it and the wheel covers 312.

The outer skin 314 is made of urethane rubber and covers the main body frame 310 and the wheel covers 312 from the outside. The hands 106 are molded integrally with the outer skin 314. An opening 390 for introducing outside air is provided at the upper end of the outer skin 314.

FIG. 3 is a configuration diagram of the robot system 300.
The robot system 300 includes the robot 100, a server 200, and a plurality of external sensors 114. The plurality of external sensors 114 (external sensors 114a, 114b, ..., 114n) are installed in the house in advance. An external sensor 114 may be fixed to a wall of the house or placed on the floor. The position coordinates of each external sensor 114 are registered in the server 200. The position coordinates are defined as x, y coordinates within the house that is assumed to be the range of action of the robot 100.

The server 200 is installed in the house. In this embodiment, the server 200 and the robot 100 normally correspond one-to-one. The server 200 determines the basic behavior of the robot 100 based on information obtained from the sensors built into the robot 100 and from the plurality of external sensors 114.
The external sensors 114 reinforce the sensory organs of the robot 100, and the server 200 reinforces the brain of the robot 100.

The external sensor 114 periodically transmits a wireless signal (hereinafter, the “robot search signal”) containing the ID of the external sensor 114 (hereinafter, the “beacon ID”). On receiving the robot search signal, the robot 100 returns a wireless signal (hereinafter, the “robot response signal”) containing the beacon ID. The server 200 measures the time from when the external sensor 114 transmits the robot search signal until it receives the robot response signal, and from this determines the distance from the external sensor 114 to the robot 100. By measuring the distance between the robot 100 and each of the plurality of external sensors 114, the server 200 identifies the position coordinates of the robot 100.
Of course, the robot 100 may instead periodically transmit its own position coordinates to the server 200.
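The time-based ranging and position fix can be sketched as follows. This is an illustration under stated assumptions, not the procedure of this publication: it assumes radio propagation at the speed of light, a known reply delay inside the robot, and three non-collinear sensors with known x, y coordinates. The function names and parameters are hypothetical.

```python
SPEED_OF_LIGHT = 299_792_458.0  # m/s, assuming a radio signal


def distance_from_round_trip(round_trip_s, reply_delay_s=0.0):
    """One-way distance from a measured round-trip time.

    reply_delay_s is the (assumed known) time the robot takes to answer.
    """
    return SPEED_OF_LIGHT * (round_trip_s - reply_delay_s) / 2.0


def trilaterate(sensors, distances):
    """Estimate (x, y) from three sensor positions and three measured ranges."""
    (x1, y1), (x2, y2), (x3, y3) = sensors
    d1, d2, d3 = distances
    # Subtracting the first circle equation from the other two removes the
    # quadratic terms and leaves a 2x2 linear system in x and y.
    a11, a12 = 2 * (x2 - x1), 2 * (y2 - y1)
    a21, a22 = 2 * (x3 - x1), 2 * (y3 - y1)
    b1 = d1**2 - d2**2 + x2**2 - x1**2 + y2**2 - y1**2
    b2 = d1**2 - d3**2 + x3**2 - x1**2 + y3**2 - y1**2
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("sensors must not be collinear")
    # Solve the system with Cramer's rule.
    return ((b1 * a22 - a12 * b2) / det, (a11 * b2 - a21 * b1) / det)
```

With only two sensors the two range circles generally intersect at two points, so a third range or some other prior knowledge is needed to pick one of them.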

FIG. 4 is a conceptual diagram of the emotion map 116.
The emotion map 116 is a data table stored in the server 200. The robot 100 selects its actions according to the emotion map 116. The emotion map 116 shown in FIG. 4 indicates how strongly the robot 100 likes or dislikes each location. The x-axis and y-axis of the emotion map 116 indicate two-dimensional spatial coordinates. The z-axis indicates the strength of the like or dislike. A positive z value indicates that the robot likes the location, and a negative z value indicates that the robot dislikes it.

In the emotion map 116 of FIG. 4, the coordinate P1 is a point with a strongly positive feeling (hereinafter, a “favored point”) within the indoor space that the server 200 manages as the range of action of the robot 100. A favored point may be a “safe place” such as behind a sofa or under a table, or it may be a place where people tend to gather or a lively place, such as a living room. It may also be a place where the robot was gently stroked or touched in the past.
How the places the robot 100 likes are defined is arbitrary, but in general it is desirable to set places that small children or small animals such as dogs and cats like as favored points.

The coordinate P2 is a point with a strongly negative feeling (hereinafter, a “disliked point”). A disliked point may be a place with loud noise such as near a television, a place where the robot is likely to get wet such as a bathroom or washroom, an enclosed space or a dark place, or a place tied to an unpleasant memory of having been handled roughly by a user.
How the places the robot 100 dislikes are defined is also arbitrary, but in general it is desirable to set places that small children or small animals such as dogs and cats fear as disliked points.

The coordinate Q indicates the current position of the robot 100. The server 200 identifies the position coordinates of the robot 100 from the robot search signals periodically transmitted by the plurality of external sensors 114 and the robot response signals returned in reply. For example, when the external sensor 114 with beacon ID = 1 and the external sensor 114 with beacon ID = 2 each detect the robot 100, the distances from the two external sensors 114 to the robot 100 are obtained, and the position coordinates of the robot 100 are obtained from those distances.

Alternatively, the external sensor 114 with beacon ID = 1 may transmit the robot search signal in a plurality of directions, and the robot 100 returns a robot response signal when it receives the robot search signal. In this way, the server 200 may determine in which direction and at what distance from which external sensor 114 the robot 100 is located. In another embodiment, the current position may be identified by calculating the distance traveled by the robot 100 from the number of rotations of the front wheels 102 or the rear wheel 103, or it may be identified based on images obtained from the camera.
When the emotion map 116 shown in FIG. 4 is given, the robot 100 moves in the direction that draws it toward the favored point (coordinate P1) and away from the disliked point (coordinate P2).
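One simple way to picture this movement rule is a grid whose cells hold z values, with the robot stepping to the most attractive neighbouring cell. This is a sketch under assumptions: the source describes the emotion map only as a data table with x, y, and z, so the grid representation, the function name, and the greedy one-step rule are hypothetical.

```python
def next_cell(emotion_map, current):
    """Pick the neighbouring cell (or the current one) with the highest z value.

    emotion_map maps (x, y) grid cells to a z value: positive for liked
    places, negative for disliked ones. current is assumed to be a cell
    that exists in the map.
    """
    x, y = current
    candidates = [(x + dx, y + dy) for dx in (-1, 0, 1) for dy in (-1, 0, 1)]
    candidates = [cell for cell in candidates if cell in emotion_map]
    return max(candidates, key=lambda cell: emotion_map[cell])
```

Repeating this step moves the robot uphill in z, which approaches favored points and retreats from disliked ones, although a purely greedy rule can stop at a local maximum.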

The emotion map 116 changes dynamically. When the robot 100 reaches the coordinate P1, the z value (positive feeling) at the coordinate P1 decreases over time. In this way, the robot 100 can emulate the lifelike behavior of reaching a favored point (coordinate P1), having its “emotion satisfied”, and eventually “getting bored” with the place. Similarly, the negative feeling at the coordinate P2 eases over time. As time passes, new favored points and disliked points arise, and the robot 100 makes new action selections accordingly. The robot 100 takes an “interest” in new favored points and selects actions continually.
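The fading of a z value over time could be modelled, for example, as exponential decay toward zero. The source states only that the value decreases with time; the exponential form, the function name, and the 600-second half-life below are assumptions made for illustration.

```python
def decayed_z(z_at_arrival, seconds_since_arrival, half_life_s=600.0):
    """z value after decaying toward 0 with an assumed half-life.

    Works for both signs: a positive z (liking) fades as the robot
    "gets bored", and a negative z (dislike) eases in the same way.
    """
    return z_at_arrival * 0.5 ** (seconds_since_arrival / half_life_s)
```

Because the value shrinks toward zero rather than flipping sign, a once-favored place becomes neutral instead of disliked, leaving room for other points on the map to become the most attractive.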

The emotion map 116 expresses emotional ups and downs as an internal state of the robot 100. The robot 100 heads for a favored point, avoids disliked points, stays at the favored point for a while, and eventually takes its next action. Such control makes the action selection of the robot 100 human-like and lifelike.

Maps that influence the behavior of the robot 100 (hereinafter collectively called “action maps”) are not limited to the type of emotion map 116 shown in FIG. 4. For example, various action maps can be defined, such as maps for curiosity, for the desire to avoid fear, for the desire for security, and for the desire for physical comfort such as quietness, dimness, coolness, or warmth. The destination of the robot 100 may then be determined by taking a weighted average of the z values of the plurality of action maps.
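The weighted averaging of several action maps can be sketched as follows. The dictionary representation of a map, the function name, and the rule of picking the cell with the highest combined value are assumptions made for illustration; the source specifies only that the z values are combined by weighted average.

```python
def choose_destination(action_maps, weights):
    """Return the cell with the highest weighted-average z value.

    action_maps: list of dicts mapping (x, y) -> z, all covering the same cells.
    weights: one non-negative weight per map.
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    def combined(cell):
        return sum(w * m[cell] for w, m in zip(weights, action_maps)) / total

    return max(action_maps[0], key=combined)
```

Changing only the weights can change the destination even when every map is unchanged, which is how one set of maps can produce different behavior in different moods.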

Separately from the action maps, the robot 100 has parameters that indicate the strength of various emotions and sensations. For example, when the value of the emotion parameter for loneliness is rising, the weighting coefficient of the action map that evaluates places where the robot feels secure is set high, and the value of this emotion parameter is lowered when the robot reaches the target point. Similarly, when the value of the parameter indicating boredom is rising, the weighting coefficient of the action map that evaluates places that satisfy curiosity may be set high.
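The coupling between emotion parameters and action map weights might look like the following. Everything specific here is hypothetical: the parameter names, the 0-to-1 range, the linear rule with `base_weight` and `gain`, and the fixed `relief` amount are assumptions, since the source gives no formula.

```python
def map_weights(emotions, base_weight=1.0, gain=2.0):
    """Derive action map weights from emotion parameters in the range 0 to 1.

    Loneliness raises the weight of the map for places of security, and
    boredom raises the weight of the map for places that satisfy curiosity.
    """
    return {
        "security": base_weight + gain * emotions["loneliness"],
        "curiosity": base_weight + gain * emotions["boredom"],
    }


def on_destination_reached(emotions, parameter, relief=0.5):
    """Lower the emotion parameter that motivated the trip, never below 0."""
    emotions[parameter] = max(0.0, emotions[parameter] - relief)
    return emotions
```

A lonely robot therefore favors secure places until it arrives at one, after which the loneliness value drops and other maps regain influence.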

FIG. 5 is a hardware configuration diagram of the robot 100.
The robot 100 includes an internal sensor 128, a communication device 126, a storage device 124, a processor 122, a drive mechanism 120, and a battery 118. The drive mechanism 120 includes the wheel drive mechanism 370 described above. The processor 122 and the storage device 124 are included in the control circuit 342. The units are connected to one another by a power line 130 and a signal line 132. The battery 118 supplies power to each unit via the power line 130. Each unit sends and receives control signals via the signal line 132. The battery 118 is a lithium-ion secondary battery and is the power source of the robot 100.

The internal sensor 128 is a collection of the various sensors built into the robot 100. Specifically, these include a camera (omnidirectional camera), a microphone array, a distance measurement sensor (infrared sensor), a thermosensor, a touch sensor, an acceleration sensor, and an odor sensor. The touch sensor is installed between the outer skin 314 and the main body frame 310 and detects a user's touch. The odor sensor is a known sensor that applies the principle that electrical resistance changes when the molecules that cause an odor are adsorbed. The odor sensor classifies various odors into a plurality of categories.

The communication device 126 is a communication module that performs wireless communication with various external devices such as the server 200, the external sensors 114, and a portable device owned by a user. The storage device 124 consists of nonvolatile memory and volatile memory and stores computer programs and various setting information. The processor 122 executes the computer programs. The drive mechanism 120 is an actuator that controls the internal mechanisms. In addition, a display, a speaker, and the like are also mounted.

The processor 122 selects the actions of the robot 100 while communicating with the server 200 and the external sensors 114 via the communication device 126. Various kinds of external information obtained by the internal sensor 128 also affect action selection. The drive mechanism 120 mainly controls the wheels (the front wheels 102) and the head (the head frame 316). The drive mechanism 120 changes the moving direction and moving speed of the robot 100 by changing the rotation speed and rotation direction of each of the two front wheels 102. The drive mechanism 120 can also raise and lower the wheels (the front wheels 102 and the rear wheel 103). When the wheels are raised, they are completely housed in the body 104, and the robot 100 rests on the floor surface F at the seating surface 108 and enters a seated state.
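
The steering principle described above, in which the speed and direction of the two front wheels determine how the robot moves, corresponds to ordinary differential-drive kinematics. The following is a minimal sketch under stated assumptions: the function names, the units (m/s and rad/s), and the `track_width` parameter are illustrative and do not appear in the patent.

```python
import math


def body_velocity(v_left, v_right, track_width):
    """Return (forward speed in m/s, turn rate in rad/s) of a two-wheel drive.

    v_left / v_right are the ground speeds of the two front wheels;
    track_width is the (hypothetical) distance between them in metres.
    """
    forward = (v_left + v_right) / 2.0
    turn_rate = (v_right - v_left) / track_width
    return forward, turn_rate


def step_pose(x, y, heading, v_left, v_right, track_width, dt):
    """Advance the pose (x, y, heading) by one small time step dt."""
    forward, turn_rate = body_velocity(v_left, v_right, track_width)
    x += forward * math.cos(heading) * dt
    y += forward * math.sin(heading) * dt
    heading += turn_rate * dt
    return x, y, heading
```

Equal wheel speeds give straight travel, while equal and opposite speeds turn the robot on the spot, which matches the rotation behavior mentioned later for the motion files.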

FIG. 6 is a functional block diagram of the robot system 300.
As described above, the robot system 300 includes the robot 100, the server 200, and the plurality of external sensors 114. Each component of the robot 100 and the server 200 is implemented by hardware, including computing units such as a CPU (Central Processing Unit) and various coprocessors, storage devices such as memory and storage, and the wired or wireless communication lines linking them, and by software that is stored in the storage devices and supplies processing instructions to the computing units. A computer program may consist of device drivers, an operating system, various application programs in the layers above them, and libraries that provide common functions to these programs. Each block described below represents a functional unit rather than a hardware unit.
Some of the functions of the robot 100 may be implemented by the server 200, and some or all of the functions of the server 200 may be implemented by the robot 100.

(Server 200)
The server 200 includes a communication unit 204, a data processing unit 202, and a data storage unit 206.
The communication unit 204 handles communication with the external sensors 114 and the robot 100. The data storage unit 206 stores various data. The data processing unit 202 executes various processes based on the data acquired by the communication unit 204 and the data stored in the data storage unit 206. The data processing unit 202 also functions as an interface between the communication unit 204 and the data storage unit 206.

In the present embodiment, the communication unit 204 of the server 200 is connected to the communication unit 142 of the robot 100 by two types of communication line, a first communication line and a second communication line. The first communication line is a 920 MHz communication line in the ISM (Industrial, Scientific and Medical) band. The second communication line is a 2.4 GHz communication line. Because the first communication line uses a lower frequency than the second, its radio waves bend around obstacles more readily, but its communication speed is lower.

The data storage unit 206 includes a motion storage unit 232, a map storage unit 216, and a personal data storage unit 218.
The robot 100 has a plurality of operation patterns (motions). Various motions are defined, such as trembling the hand 106, approaching the owner while weaving from side to side, and gazing at the owner with its head tilted.

The motion storage unit 232 stores "motion files" that define the control content of the motions. Each motion is identified by a motion ID. The motion files are also downloaded to the motion storage unit 160 of the robot 100. Which motion to execute may be determined by the server 200 or by the robot 100.

Many of the motions of the robot 100 are configured as composite motions made up of a plurality of unit motions. For example, the robot 100 approaching the owner may be expressed as a combination of a unit motion of turning to face the owner, a unit motion of approaching while raising a hand, a unit motion of approaching while shaking its body, and a unit motion of sitting down while raising both hands. Combining these four motions realizes a motion of "approaching the owner, raising a hand partway, and finally shaking its body and then sitting down". In a motion file, the rotation angles, angular velocities, and the like of the actuators provided in the robot 100 are defined in association with a time axis. Various motions are expressed by controlling each actuator over time in accordance with the motion file (actuator control information).

The transition time when changing from one unit motion to the next is called an "interval". An interval may be defined according to the time required to change unit motions and the content of the motions. The length of an interval is adjustable.
Hereinafter, the settings involved in controlling the behavior of the robot 100, such as when to select which motion and how to adjust the output of each actuator in realizing a motion, are collectively referred to as "behavior characteristics". The behavior characteristics of the robot 100 are defined by a motion selection algorithm, motion selection probabilities, motion files, and the like.
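
The relationship between unit motions, composite motions, and intervals described above can be pictured as a small data structure. The following is a minimal sketch, not the actual motion file format, which the text does not specify: the class names, field names, actuator names, and the default interval of 0.5 seconds are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Keyframe:
    time_s: float      # seconds from the start of the unit or composite motion
    actuator: str      # hypothetical actuator name, e.g. "left_hand"
    angle_deg: float   # target rotation angle at that time


@dataclass
class UnitMotion:
    name: str
    keyframes: List[Keyframe]

    def duration(self) -> float:
        """Length of the unit motion: the time of its last keyframe."""
        return max((k.time_s for k in self.keyframes), default=0.0)


@dataclass
class CompositeMotion:
    motion_id: str
    units: List[UnitMotion]
    interval_s: float = 0.5  # assumed transition time between unit motions

    def timeline(self) -> List[Keyframe]:
        """Place every unit motion's keyframes on one shared time axis."""
        frames: List[Keyframe] = []
        offset = 0.0
        for unit in self.units:
            for k in unit.keyframes:
                frames.append(Keyframe(offset + k.time_s, k.actuator, k.angle_deg))
            # The next unit motion starts after this one plus the interval.
            offset += unit.duration() + self.interval_s
        return frames
```

Lengthening `interval_s` stretches the pauses between unit motions without touching the unit motions themselves, which is one way the adjustable interval could be realized.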

In addition to the motion files, the motion storage unit 232 stores a motion selection table that defines the motions to be executed when various events occur. In the motion selection table, one or more motions and their selection probabilities are associated with each event.

In addition to the plurality of action maps, the map storage unit 216 stores a map indicating the arrangement of obstacles such as chairs and tables. The personal data storage unit 218 stores information on users, in particular owners. Specifically, it stores master information indicating the intimacy toward each user and the user's physical and behavioral characteristics. Other attribute information such as age and gender may also be stored. Details of the master information are described later with reference to FIG. 8.

The robot system 300 (the robot 100 and the server 200) identifies a user based on the user's physical and behavioral characteristics. The robot 100 images its surroundings with the omnidirectional camera and extracts the physical and behavioral characteristics of each person appearing in the image. Physical characteristics may be visual characteristics associated with the body, such as the distance between the eyes, the balance of the eyes, mouth, and nose, height, preferred clothing, the presence or absence of glasses, skin color, hair color, and ear size, and may also include other characteristics such as average body temperature, smell, and voice quality. Behavioral characteristics are, specifically, characteristics associated with behavior, such as the places the user prefers, how active the user's movements are, and whether the user smokes. For example, the system extracts behavioral characteristics such as the following: the owner identified as the father is often away from home and, when at home, often stays still on the sofa, whereas the mother is often in the kitchen and has a wide range of activity.
The robot system 300 in the present embodiment extracts a plurality of parameters indicating physical characteristics from a master image, described later, and identifies a user based on this master image. Hereinafter, the process of identifying a user based on the master image is referred to as "user identification processing". Details of the user identification processing are described later.

The robot 100 has an internal parameter called intimacy for each user. When the robot 100 recognizes an action that shows goodwill toward it, such as being picked up or being spoken to, its intimacy toward that user increases. Intimacy is low toward a user who does not engage with the robot 100, a user who treats it roughly, or a user it meets infrequently.

The data processing unit 202 includes a position management unit 208, a map management unit 210, a recognition unit 212, an operation control unit 222, a closeness management unit 220, and an emotion management unit 244.
The position management unit 208 specifies the position coordinates of the robot 100 by the method described with reference to FIG. 3. The position management unit 208 may also track the position coordinates of users in real time.

The emotion management unit 244 manages various emotion parameters indicating the emotions of the robot 100 (loneliness, enjoyment, fear, and so on). These emotion parameters are constantly fluctuating. The importance of each of the action maps changes according to the emotion parameters, the movement target point of the robot 100 changes according to the action maps, and the emotion parameters change as the robot 100 moves and as time passes.

For example, when the emotion parameter indicating loneliness is high, the emotion management unit 244 sets a large weighting coefficient for the action map that evaluates places where the robot feels at ease. When the robot 100 reaches a point in this action map where loneliness can be relieved, the emotion management unit 244 lowers the emotion parameter indicating loneliness. The various emotion parameters also change in response to the response actions described later. For example, the emotion parameter indicating loneliness falls when the robot is hugged by an owner, and rises little by little when the robot has not seen an owner for a long time.
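
The feedback loop described above, in which emotion parameters set the action map weights and events in turn change the emotion parameters, can be sketched as follows. This is only an illustration under stated assumptions: the 0 to 1 value range, the map names, the `gain` factor, and every numeric step size are hypothetical and are not given in the text.

```python
def map_weights(emotions, base_weight=1.0, gain=4.0):
    """Derive a weighting coefficient for each action map from emotion values.

    emotions: dict of emotion name -> value in [0, 1] (assumed range).
    Each emotion raises the weight of the action map that can relieve it.
    """
    links = {"loneliness": "comfort_map", "boredom": "curiosity_map"}
    return {
        map_name: base_weight + gain * emotions.get(emotion, 0.0)
        for emotion, map_name in links.items()
    }


def update_loneliness(value, hugged=False, reached_comfort_spot=False,
                      seconds_without_owner=0.0, drift_per_second=0.001):
    """Lower loneliness on a hug or on reaching a comforting spot;
    raise it slowly while no owner is in view. Step sizes are assumptions."""
    if hugged:
        value -= 0.3
    if reached_comfort_spot:
        value -= 0.2
    value += drift_per_second * seconds_without_owner
    return min(1.0, max(0.0, value))
```

For instance, `map_weights({"loneliness": 0.5})` weights the comfort map three times as heavily as the curiosity map under these assumed constants.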

The map management unit 210 changes the parameter of each coordinate in the plurality of action maps by the method described with reference to FIG. 4. The map management unit 210 may select one of the action maps, or may take a weighted average of the z values of the action maps. For example, suppose that in action map A the z values at coordinates R1 and R2 are 4 and 3, and in action map B the z values at coordinates R1 and R2 are −1 and 3. With simple averaging, the total z value at coordinate R1 is 4 − 1 = 3 and the total z value at coordinate R2 is 3 + 3 = 6, so the robot 100 heads toward coordinate R2 rather than coordinate R1.
When action map A is weighted five times as heavily as action map B, the total z value at coordinate R1 is 4 × 5 − 1 = 19 and the total z value at coordinate R2 is 3 × 5 + 3 = 18, so the robot 100 heads toward coordinate R1.
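
The arithmetic in the example above can be reproduced with a few lines of code. The sketch below uses the z values and weights from the text; the function names and the dictionary layout are assumptions chosen for illustration.

```python
def combined_z(action_maps, weights):
    """Weighted sum of z values per coordinate across all action maps."""
    coords = next(iter(action_maps.values())).keys()
    return {
        c: sum(weights[name] * zmap[c] for name, zmap in action_maps.items())
        for c in coords
    }


def target_coordinate(action_maps, weights):
    """Coordinate with the highest combined z value."""
    totals = combined_z(action_maps, weights)
    return max(totals, key=totals.get)


# z values taken from the example in the text.
ACTION_MAPS = {
    "A": {"R1": 4, "R2": 3},
    "B": {"R1": -1, "R2": 3},
}
```

With equal weights the totals are 3 and 6, so R2 is chosen; weighting map A five times as heavily gives 19 and 18, so R1 is chosen.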

The recognition unit 212 recognizes the external environment. Recognition of the external environment covers many kinds of recognition, such as recognizing the weather and season based on temperature and humidity, and recognizing a sheltered spot (safe zone) based on light level and temperature. The recognition unit 156 of the robot 100 acquires various kinds of environmental information through the internal sensor 128, performs primary processing on it, and then transfers it to the recognition unit 212 of the server 200. The recognition unit 156 of the robot 100 extracts from an image the image region corresponding to a moving object, in particular a person or an animal, and extracts from that image region a "feature vector" indicating the physical and behavioral characteristics of the moving object. The robot 100 transmits the feature vector to the server 200.

The recognition unit 212 of the server 200 further includes a person recognition unit 214 and a response recognition unit 228.
The person recognition unit 214 determines which person an imaged user corresponds to by comparing the feature vector extracted from an image captured by the built-in camera of the robot 100 with the user feature vectors registered in advance in the personal data storage unit 218 (user identification processing). The person recognition unit 214 includes a facial expression recognition unit 230. The facial expression recognition unit 230 estimates a user's emotion by performing image recognition on the user's facial expression.
Note that the person recognition unit 214 also performs user identification processing on moving objects other than people, for example pet cats and dogs.

As described above, in the present embodiment the recognition unit 156 of the robot 100 extracts from a captured image the image region corresponding to a moving object (a person or an animal) and extracts a feature vector from the extracted image region. The feature vectors of a plurality of users (hereinafter referred to as "master vectors") are registered in advance in the personal data storage unit 218 of the server 200. A master vector is a feature vector extracted from a user's master image. The person recognition unit 214 of the server 200 identifies a user by comparing the feature vector sent from the robot 100 with the master vectors.

Hereinafter, a user whose master vector is registered in the personal data storage unit 218 is referred to as a "registered user", and an as-yet-unidentified user who has been recognized by the camera and is the target of user identification processing is referred to as an "unknown user". If the master vector of a registered user A and the feature vector of an unknown user X (hereinafter also referred to as a "test vector") match or are similar, the unknown user X is determined to be the same person as the registered user A.
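
The comparison of a test vector against the registered master vectors can be sketched as a nearest-match search. The text only says the vectors must "match or be similar"; cosine similarity and the threshold of 0.9 used below are assumptions introduced for illustration, as are the function names.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify(test_vector, master_vectors, threshold=0.9):
    """Return the registered user whose master vector is most similar to the
    test vector, or None if no similarity reaches the (assumed) threshold."""
    best_user, best_score = None, threshold
    for user, master in master_vectors.items():
        score = cosine_similarity(test_vector, master)
        if score >= best_score:
            best_user, best_score = user, score
    return best_user
```

Returning `None` corresponds to the case where the unknown user does not resemble any registered user closely enough.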

The response recognition unit 228 recognizes the various response actions performed on the robot 100 and classifies them as pleasant or unpleasant actions. The response recognition unit 228 also recognizes the owner's response to the behavior of the robot 100 and classifies it as a positive or negative reaction.
Pleasant and unpleasant actions are distinguished by whether the user's response action would be comfortable or uncomfortable for a living creature. For example, being hugged is a pleasant action for the robot 100, and being kicked is an unpleasant action for the robot 100. Positive and negative reactions are distinguished by whether the user's response action indicates a pleasant or an unpleasant feeling on the user's part. For example, being hugged is a positive reaction indicating the user's pleasant feeling, and being kicked is a negative reaction indicating the user's unpleasant feeling.

The operation control unit 222 of the server 200 determines the motions of the robot 100 in cooperation with the operation control unit 150 of the robot 100. Based on the action map selection made by the map management unit 210, the operation control unit 222 of the server 200 creates a movement target point for the robot 100 and a movement route to it. The operation control unit 222 may create a plurality of movement routes and then select one of them.

The operation control unit 222 selects a motion for the robot 100 from the plurality of motions in the motion storage unit 232. Each motion is associated with a selection probability for each situation. For example, a selection method is defined such that motion A is executed with a probability of 20% when an owner performs a pleasant action, and motion B is executed with a probability of 5% when the temperature reaches 30 degrees or higher.
The movement target point and movement route are determined based on the action maps, and a motion is selected in response to the various events described later.
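
Probabilistic motion selection of the kind described above can be sketched with a lookup table and a single random draw. The 20% and 5% figures come from the text; the event names, the extra entry `motion_C`, and the convention that the leftover probability means "no motion" are assumptions made for illustration.

```python
import random

# Event -> list of (motion ID, selection probability).
# "motion_C" is a hypothetical extra entry showing several motions per event.
MOTION_SELECTION_TABLE = {
    "pleasant_action_by_owner": [("motion_A", 0.20), ("motion_C", 0.10)],
    "temperature_30c_or_higher": [("motion_B", 0.05)],
}


def select_motion(event, roll=None, table=MOTION_SELECTION_TABLE):
    """Pick a motion for an event, or None if no motion is triggered.

    roll is a number in [0, 1); it is drawn at random when omitted and can be
    passed explicitly to make the choice reproducible.
    """
    if roll is None:
        roll = random.random()
    cumulative = 0.0
    for motion_id, probability in table.get(event, []):
        cumulative += probability
        if roll < cumulative:
            return motion_id
    return None
```

Because the probabilities for an event need not sum to one, most events in this sketch trigger no motion at all, which keeps the behavior from looking mechanical.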

The closeness management unit 220 manages the intimacy for each user. As described above, intimacy is registered in the personal data storage unit 218 as part of the personal data. When a pleasant action is detected, the closeness management unit 220 raises the intimacy toward that owner. When an unpleasant action is detected, the intimacy is lowered. In addition, the intimacy toward an owner who has not been seen for a long period gradually declines.
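
The three update rules above (raise on a pleasant action, lower on an unpleasant action, decay while the owner is not seen) can be sketched as a single function. The 0 to 100 scale and every step size below are hypothetical; the text does not state the range or the amounts.

```python
# All constants are assumptions for illustration only.
PLEASANT_GAIN = 5
UNPLEASANT_PENALTY = 10
DECAY_PER_DAY_UNSEEN = 1


def update_intimacy(intimacy, action=None, days_unseen=0):
    """Return the new intimacy value, clamped to the assumed 0..100 range.

    action: "pleasant", "unpleasant", or None when no action was detected.
    days_unseen: days since the robot last saw this owner.
    """
    if action == "pleasant":
        intimacy += PLEASANT_GAIN
    elif action == "unpleasant":
        intimacy -= UNPLEASANT_PENALTY
    intimacy -= DECAY_PER_DAY_UNSEEN * days_unseen
    return max(0, min(100, intimacy))
```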

(Robot 100)
The robot 100 includes a communication unit 142, a data processing unit 136, a data storage unit 148, the internal sensor 128, and the drive mechanism 120.
The communication unit 142 corresponds to the communication device 126 (see FIG. 5) and handles communication with the external sensors 114, the server 200, and other robots 100. The data storage unit 148 stores various data. The data storage unit 148 corresponds to the storage device 124 (see FIG. 5). The data processing unit 136 executes various processes based on the data acquired by the communication unit 142 and the data stored in the data storage unit 148. The data processing unit 136 corresponds to the processor 122 and the computer programs executed by the processor 122. The data processing unit 136 also functions as an interface for the communication unit 142, the internal sensor 128, the drive mechanism 120, and the data storage unit 148.

The data storage unit 148 includes a motion storage unit 160 that defines the various motions of the robot 100.
Various motion files are downloaded from the motion storage unit 232 of the server 200 to the motion storage unit 160 of the robot 100. Each motion is identified by a motion ID. The operation timing, operation duration, operation direction, and so on of the various actuators (the drive mechanism 120) are defined in time series in the motion files in order to express various motions, such as sitting down by retracting the front wheels 102, raising the hand 106, rotating the robot 100 by turning the two front wheels 102 in opposite directions or by turning only one front wheel 102, trembling by rotating the front wheels 102 while they are retracted, and stopping once and looking back when moving away from a user.

Various data may also be downloaded to the data storage unit 148 from the map storage unit 216 and the personal data storage unit 218.

The internal sensor 128 includes a camera 134. The camera 134 in the present embodiment is an omnidirectional camera attached to the horn 112.

The data processing unit 136 includes a recognition unit 156, an operation control unit 150, an operation detection unit 152, an imaging control unit 154, and a distance measurement unit 158.
The operation control unit 150 of the robot 100 determines the motions of the robot 100 in cooperation with the operation control unit 222 of the server 200. Some motions may be determined by the server 200 and others by the robot 100. Alternatively, the robot 100 may normally determine the motions, with the server 200 determining them when the processing load on the robot 100 is high. The server 200 may determine a base motion, and the robot 100 may determine additional motions. How the motion determination processing is divided between the server 200 and the robot 100 may be designed according to the specifications of the robot system 300.

The operation control unit 150 of the robot 100 determines the moving direction of the robot 100 together with the operation control unit 222 of the server 200. Movement based on the action maps may be determined by the server 200, while immediate movements such as avoiding an obstacle may be determined by the operation control unit 150 of the robot 100. The drive mechanism 120 drives the front wheels 102 in accordance with instructions from the operation control unit 150, thereby moving the robot 100 toward the movement target point.

The operation control unit 150 of the robot 100 instructs the drive mechanism 120 to execute the selected motion. The drive mechanism 120 controls each actuator in accordance with the motion file.

When a user with high intimacy is nearby, the operation control unit 150 can execute a motion of raising both hands 106 as a gesture asking for a hug, and when the robot has tired of the hug, it can express reluctance to be hugged by alternately repeating reverse rotation and stopping of the left and right front wheels 102 while they remain retracted. The drive mechanism 120 drives the front wheels 102, the hands 106, and the neck (the head frame 316) in accordance with instructions from the operation control unit 150, thereby causing the robot 100 to express various motions.

The operation detection unit 152 detects touches by a user as well as "lifting" and "putting down" of the robot 100. "Lifting" is typically the act of a user placing both hands on the body 104 of the robot 100 and raising the robot 100. "Putting down" is typically the act of a user placing both hands on the body 104 of the robot 100 and lowering the robot 100 onto the floor surface F. The operation detection unit 152 detects a user's touch with the touch sensor installed under the outer skin 314 of the robot 100. The operation detection unit 152 determines that "lifting" has occurred on the condition that the acceleration sensor detects upward movement while the robot is being touched. Similarly, the operation detection unit 152 determines that "putting down" has occurred when the acceleration sensor detects downward movement while the robot is being touched, or when a load on the seating surface 108 or the front wheels 102 is detected. "Lifting" and "putting down" may also be determined by capturing video of the surroundings with the camera 134 and recognizing the rise and fall of the robot 100 from changes in the image.
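
The lifting and putting-down conditions above combine a touch state with a vertical acceleration reading. The following is a simplified sketch under stated assumptions: the function name, the gravity-compensated acceleration input, the 0.5 m/s² threshold, and the `load_newly_detected` flag are all hypothetical, and a real detector would also filter sensor noise over time.

```python
def classify_handling(touched, vertical_accel, load_newly_detected=False,
                      threshold=0.5):
    """Classify one sensor sample as "lift", "put_down", or None.

    touched: True while the touch sensor reports contact.
    vertical_accel: vertical acceleration in m/s^2 with gravity removed,
        positive upward (assumed convention).
    load_newly_detected: True when a load has just appeared on the seating
        surface or the front wheels.
    """
    if load_newly_detected:
        return "put_down"
    if not touched:
        return None
    if vertical_accel > threshold:
        return "lift"
    if vertical_accel < -threshold:
        return "put_down"
    return None
```

Requiring a touch for the acceleration-based cases reflects the condition in the text that the robot is being touched when the upward or downward movement is detected.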

The imaging control unit 154 controls the camera 134. The imaging control unit 154 images a subject when lifting, putting down, or a touch is detected, or at the various timings described later.

The distance measurement unit 158 detects the distance to a moving object (a person or a pet) serving as the subject, using the distance measurement sensor (infrared sensor) included in the internal sensor 128. The recognition unit 156 also detects the relative angle between the robot 100 and the subject by performing image recognition on the subject. An image captured when the robot 100 is located at a predetermined point relative to the subject can be used as a candidate for a master image (hereinafter referred to as a "master candidate image"). The method of acquiring a master candidate image based on distance measurement is described later with reference to FIG. 13.

The recognition unit 156 of the robot 100 interprets the external information obtained from the internal sensor 128. The recognition unit 156 is capable of visual recognition (visual unit), odor recognition (olfactory unit), sound recognition (auditory unit), and tactile recognition (tactile unit).
The recognition unit 156 periodically images the surroundings with the built-in omnidirectional camera and detects moving objects such as people and pets. The feature vector that the recognition unit 156 extracts from the captured image of a moving object is transmitted to the server 200, and the person recognition unit 214 of the server 200 identifies the user. The recognition unit 156 of the robot 100 also detects the user's smell and the user's voice. Smells and sounds (voices) are classified into a plurality of types by known methods.

When a strong impact is applied to the robot 100, the recognition unit 156 recognizes it with the built-in acceleration sensor, and the response recognition unit 228 of the server 200 recognizes that a nearby user has performed a "violent act". Grabbing the horn 112 and lifting the robot 100 may also be recognized as a violent act. When a user directly facing the robot 100 speaks within a specific volume range and a specific frequency band, the response recognition unit 228 of the server 200 may recognize that a "speaking-to act" directed at the robot has been performed. When a temperature close to body temperature is detected, it may be recognized that the user has performed a "contact act".
In summary, the robot 100 acquires the user's actions as physical information through the internal sensor 128, the motion detection unit 152 determines actions such as "lifting up" and "putting down", the response recognition unit 228 of the server 200 determines whether an action is pleasant or unpleasant, and the recognition unit 212 of the server 200 executes user identification processing based on the feature vector.

The response recognition unit 228 of the server 200 recognizes the various responses of users to the robot 100. Some typical responses are associated with pleasantness or unpleasantness and with affirmation or negation. In general, most pleasant acts are affirmative reactions and most unpleasant acts are negative reactions. Pleasant and unpleasant acts relate to intimacy, while affirmative and negative reactions influence the action selection of the robot 100.

Of the series of recognition processes comprising detection, analysis, and determination, the recognition unit 156 of the robot 100 selects and extracts the information needed for recognition, and interpretive processing such as determination is executed by the recognition unit 212 of the server 200. The recognition processing may be performed solely by the recognition unit 212 of the server 200, solely by the recognition unit 156 of the robot 100, or by both while sharing roles as described above.

In accordance with the response recognized by the recognition unit 156, the intimacy management unit 220 of the server 200 changes the intimacy toward the user. In principle, intimacy toward a user who performs a pleasant act increases, and intimacy toward a user who performs an unpleasant act decreases.

The recognition unit 212 of the server 200 determines pleasantness or unpleasantness according to the response, and the map management unit 210 may change the z value of the point at which the pleasant or unpleasant act occurred in an action map that expresses "attachment to a place". For example, when a pleasant act is performed in the living room, the map management unit 210 may set a favored point in the living room with high probability. In this case, a positive feedback effect arises: the robot 100 likes the living room, receives pleasant acts there, and so comes to like the living room even more.

The intimacy toward a moving object (user) changes depending on how that user treats the robot 100.

The robot 100 sets a high intimacy for people it meets often, people who touch it often, and people who speak to it often. Conversely, intimacy is low for people it rarely sees, people who seldom touch it, violent people, and people who scold it loudly. The robot 100 changes the intimacy for each user based on various external information detected by its sensors (visual, tactile, and auditory).

In practice, the robot 100 autonomously makes complex action selections according to action maps. The robot 100 acts under the influence of multiple action maps based on various parameters such as loneliness, boredom, and curiosity. When the influence of the action maps is excluded, or when the robot 100 is in an internal state in which that influence is small, the robot 100 in principle tries to approach people with high intimacy and to move away from people with low intimacy.

The behavior of the robot 100 is categorized as follows according to intimacy.
(1) User with very high intimacy
The robot 100 strongly expresses affection by approaching the user (hereinafter, "proximity action") and performing an affection gesture defined in advance as a gesture that shows goodwill toward a person.
(2) User with relatively high intimacy
The robot 100 performs only the proximity action.
(3) User with relatively low intimacy
The robot 100 takes no particular action.
(4) User with particularly low intimacy
The robot 100 performs a withdrawal action.
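
The four categories above can be sketched as a simple threshold lookup. This is only an illustration: the 0 to 100 intimacy scale, the threshold values, and the names `Behavior` and `select_behavior` are assumptions made for the example and do not appear in the specification.

```python
from enum import Enum


class Behavior(Enum):
    APPROACH_WITH_AFFECTION = 1  # (1) proximity action plus affection gesture
    APPROACH_ONLY = 2            # (2) proximity action only
    NO_ACTION = 3                # (3) no particular action
    WITHDRAW = 4                 # (4) withdrawal action


# Hypothetical thresholds on an assumed 0-100 intimacy scale.
VERY_HIGH = 80
RELATIVELY_HIGH = 50
RELATIVELY_LOW = 20


def select_behavior(intimacy: float) -> Behavior:
    """Map an intimacy value to one of the four behavior categories."""
    if intimacy >= VERY_HIGH:
        return Behavior.APPROACH_WITH_AFFECTION
    if intimacy >= RELATIVELY_HIGH:
        return Behavior.APPROACH_ONLY
    if intimacy >= RELATIVELY_LOW:
        return Behavior.NO_ACTION
    return Behavior.WITHDRAW
```

In the embodiment this selection is only one influence among several; the action maps and internal parameters may override it.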

With the control method described above, the robot 100 approaches a user with high intimacy when it finds one and, conversely, moves away from a user with low intimacy. Such a control method can express so-called "shyness toward strangers" as behavior. Furthermore, when a visitor (user A, with low intimacy) appears, the robot 100 may move away from the visitor and head toward a family member (user B, with high intimacy). In this case, user B can sense that the robot 100 is shy and anxious and is relying on user B. Such behavioral expression evokes in user B the joy of being chosen and relied upon, and the attachment that accompanies it.

On the other hand, if the visitor, user A, visits frequently, speaks to the robot 100, and touches it, the intimacy of the robot 100 toward user A gradually rises, and the robot 100 stops behaving shyly (withdrawal action) toward user A. By sensing that the robot 100 has grown accustomed to them, user A can also come to feel attachment to the robot 100.

Note that the action selection described above is not always executed. For example, when the internal parameter indicating the curiosity of the robot 100 is high, the action map for seeking places that satisfy curiosity is weighted heavily, so the robot 100 may not select an action influenced by intimacy. Also, when the external sensor 114 installed at the entrance detects that a user has come home, the robot 100 may execute the action of greeting the user with the highest priority.

FIG. 7 is an image of the robot being held.
The robot 100 has a round, soft, pleasant-to-touch body 104 and a moderate weight, and it recognizes touch as a pleasant act, so it readily makes users feel that they want to hold it. The robot 100 applies this desire for involvement to the user identification processing.

For the robot 100 to identify a user, it needs information that serves as a clue. Examples include physical features such as eyebrow thickness, eye size, eye shape, skin color, skin brightness, wrinkle shape, hair brightness, bang length, the proportion of the face occupied by the eyes and nose, and the distance between the eyes. In the present embodiment, the robot 100 first acquires a master image. The recognition unit 156 of the robot 100 extracts a feature vector (master vector) from the master image. A feature vector has multiple vector components, each of which is a numerical value quantifying one of the physical features above. For example, eye width is expressed as a value in the range 0 to 1, and such values form the feature vector components. The technique for extracting a feature vector from a captured image of a person is an application of known face recognition technology. The master vector of user A is saved as the master information 224 in the personal data storage unit 218.
Hereinafter, the process of extracting a feature vector from a captured image is called "vector extraction processing".

When the robot 100 images an unknown user X, the recognition unit 156 extracts a feature vector (inspection vector) from the captured image (inspection image) of unknown user X. If the inspection vector of unknown user X is similar to the master vector of registered user A, the person recognition unit 214 of the server 200 determines that unknown user X and registered user A are the same person.

To improve identification accuracy, a good-quality master image from which a master vector is easy to extract is needed; more specifically, the user must be imaged at close range. The recognition unit 156 in the present embodiment sets, as the master image, an image captured when the motion detection unit 152 detects that the robot 100 has been lifted up. While the robot 100 is being held, it can image the user with high accuracy using the built-in camera 134, because the distance between the user's face and the camera 134 falls within a certain range when the robot 100 is lifted. Rather than giving the user an "action instruction" in order to capture a master image, the robot 100 waits for the moment the user holds it of their own accord, and can thereby acquire a good-quality master image without burdening the user.

FIG. 8 is a data structure diagram of the master information.
The master information 224 is stored in the response recognition unit 228. In FIG. 8, three master vectors are associated with the user whose user ID = 01 (hereinafter written as "user (01)"). Master images are acquired not only from the front of user (01) but also in profile, from the right and left sides. By imaging the user from multiple angles and distances in this way, multiple master vectors are associated with a single registered user. Each master vector is identified by a master ID. Master vector (01) is extracted from the master image taken of the face of user (01) from the front, and master vector (02) is extracted from the master image taken of the face of user (01) from the right.

To simplify the explanation, the master vectors shown in FIG. 8 are described as five-dimensional vectors with five vector components. The five vector components a to e correspond to arbitrary feature quantities such as the distance between the eyes and skin color. Master vector (01) contains feature quantities a1, b1, and c1 corresponding to the three vector components a to c, whereas no feature quantities are set for vector components d and e. This is because, for example, if vector component d is a feature quantity representing ear size, it may not be possible to extract component d from a frontal master image.

Master vector (02) contains feature quantities a2, c2, and e2 corresponding to the three vector components a, c, and e, but none corresponding to vector components b and d. Master vector (03) contains feature quantities a3, b3, d3, and e3 corresponding to the four vector components a, b, d, and e, but none corresponding to vector component c. If multiple master images are acquired by imaging user (01) from multiple directions, the physical features of user (01) can be grasped three-dimensionally.

The person recognition unit 214 calculates a centroid vector MB by taking the arithmetic mean of the three master vectors of user (01). Vector component a of the centroid vector MB is the average of the a components (a1, a2, a3) of the three master vectors. Because only master vector (03) has vector component d, vector component d of the centroid vector MB is the feature quantity d3 of master vector (03). The person recognition unit 214 executes user identification processing based on the master vectors or the centroid vector MB (described later).
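
A minimal sketch of this averaging follows, assuming that a component missing from a master vector is represented as `None` and that each component of the centroid is the mean of whichever values are present. The text confirms this only for components a and d; the function name `centroid` and the numeric values are hypothetical.

```python
from typing import List, Optional, Sequence

Vector = Sequence[Optional[float]]


def centroid(master_vectors: Sequence[Vector]) -> List[Optional[float]]:
    """Average each component over the master vectors that define it.

    A component that no master vector defines stays None.
    """
    dims = len(master_vectors[0])
    result: List[Optional[float]] = []
    for i in range(dims):
        present = [v[i] for v in master_vectors if v[i] is not None]
        result.append(sum(present) / len(present) if present else None)
    return result


# Components (a, b, c, d, e) laid out as in FIG. 8; values are illustrative.
m01 = [0.2, 0.4, 0.6, None, None]  # a1, b1, c1
m02 = [0.4, None, 0.2, None, 0.8]  # a2, c2, e2
m03 = [0.6, 0.8, None, 0.5, 0.4]   # a3, b3, d3, e3
mb = centroid([m01, m02, m03])
# mb[0] is the mean of a1, a2, a3; mb[3] equals d3 because only m03 defines d.
```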

Assume a situation in which there are no registered users.
The motion detection unit 152 acquires a master image when the robot 100 is held by unknown user A. The person recognition unit 214 associates user ID = 01 with master vector (01), which is extracted from the master image of unknown user A, and records it in the master information 224. At this time, the person recognition unit 214 also records the date and time at which master vector (01) was acquired. Through this processing, unknown user A is registered in the master information 224 as registered user (01). In FIG. 8, master vector (01) was acquired on June 7, 2016.

After user (01) has been registered, the motion detection unit 152 likewise acquires a master image when a new unknown user X holds the robot 100. The person recognition unit 214 compares master vector MX, extracted from the master image of unknown user X, with master vector (01) of registered user (01).

(1) When not yet registered
If the vector distance between master vector MX and master vector (01) is greater than or equal to a predetermined distance, the person recognition unit 214 determines that unknown user X is different from user (01). The distance between feature vectors may be calculated as the Euclidean distance or based on another definition such as the Chebyshev distance. The person recognition unit 214 registers unknown user X in the master information 224 as a new registered user (02), assigning user ID = 02 to user X and master ID = 04 to master vector MX. As a result of this processing, two users, user (01) and user (02), are registered in the master information 224.

(2) When already registered
If the distance between master vector MX and master vector (01) is less than the predetermined distance, the person recognition unit 214 determines that unknown user X and registered user (01) are the same person. The person recognition unit 214 sets master ID = 02 for master vector MX and associates it with user (01). User (01) then has two master vectors, enriching the information available for identifying user (01).
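
The two distance definitions named above, and the comparison against the predetermined distance, can be sketched as follows. The function names and the `threshold` parameter are hypothetical, and the vectors are assumed to have the same components defined.

```python
import math
from typing import Callable, Sequence


def euclidean(u: Sequence[float], v: Sequence[float]) -> float:
    """Square root of the sum of squared component differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def chebyshev(u: Sequence[float], v: Sequence[float]) -> float:
    """Largest absolute component difference."""
    return max(abs(x - y) for x, y in zip(u, v))


def is_same_person(
    mx: Sequence[float],
    registered: Sequence[float],
    threshold: float,
    distance: Callable[[Sequence[float], Sequence[float]], float] = euclidean,
) -> bool:
    """Same person only if the distance is strictly below the threshold.

    A distance equal to or above the threshold means a different person,
    matching cases (1) and (2).
    """
    return distance(mx, registered) < threshold
```

Swapping `distance=chebyshev` changes only how dissimilarity is measured; the threshold would need to be chosen to suit the metric in use.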

Because high-quality master vectors are obtained from master images, comparing master vectors with one another makes it easy to determine whether the user holding the robot 100 is the same person as a registered user.

When there are multiple registered users, the master vectors of each registered user are compared. When a single registered user has two or more master vectors, that registered user's centroid vector is compared with the master vector of the unknown user.

FIG. 9 is a first schematic diagram for explaining the user identification method.
To illustrate the principle of the user identification processing, FIG. 9 and FIG. 10 are described using only two of the vector components included in a feature vector, a and b. The processing is the same when there are three or more vector components.
In FIG. 9, one master vector has been extracted for each of registered user A and registered user B: master vector MA = (a1, b1) and master vector MB = (a2, b2). Suppose that, in this situation, the robot 100 acquires a captured image (inspection image) of unknown user X walking toward it from the front. The recognition unit 156 determines whether unknown user X in the inspection image is registered user A or B. Because the robot 100 is not being held, the feature vector (inspection vector) obtained from the inspection image of unknown user X is usually less accurate than a master vector.

The recognition unit 156 first extracts an inspection vector DX = (ax, bx) from the inspection image of unknown user X. The communication unit 142 of the robot 100 transmits the inspection vector DX to the communication unit 204 of the server 200. The person recognition unit 214 of the server 200 calculates ra, the distance between the inspection vector DX and master vector MA, and rb, the distance between the inspection vector DX and master vector MB.

Given an arbitrary threshold rm, if rb < ra and rb < rm, the person recognition unit 214 determines that unknown user X is registered user B. If instead ra < rb and ra < rm, the person recognition unit 214 determines that unknown user X is registered user A. If ra > rm and rb > rm, unknown user X corresponds to neither registered user A nor B. When unknown user X is found to be registered user A, who has high intimacy, the operation control unit 150 may select an intimate action such as running up to unknown user X. When unknown user X is found to be registered user B, who has low intimacy, the operation control unit 150 may select an avoidance action such as fleeing from unknown user X.
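
The decision rule above generalizes to any number of registered users as a nearest-neighbor search with a rejection threshold. The sketch below assumes Euclidean distance; the function name `identify` and the sample coordinates are hypothetical. The text does not say how a distance exactly equal to rm is handled, so the sketch treats it as unidentified.

```python
import math
from typing import Dict, Optional, Sequence


def identify(
    dx: Sequence[float],
    masters: Dict[str, Sequence[float]],
    rm: float,
) -> Optional[str]:
    """Return the registered user nearest to dx, or None if none is within rm."""
    best_user: Optional[str] = None
    best_r = math.inf
    for user, vector in masters.items():
        r = math.dist(dx, vector)  # Euclidean distance (Python 3.8+)
        if r < best_r:
            best_user, best_r = user, r
    return best_user if best_r < rm else None


# Two registered users with one master vector each, as in FIG. 9.
masters = {"A": (0.2, 0.3), "B": (0.8, 0.7)}
```

A return value of `None` corresponds to the case in which unknown user X matches neither registered user.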

When unknown user X cannot be identified, the person recognition unit 214 notifies the robot 100 that the user is unidentified, and the operation control unit 150 of the robot 100 may select a motion that begs unknown user X for a hug. Specifically, possible motions include approaching unknown user X, raising the hand 106, and sitting down in front of unknown user X.

When unknown user X lifts the robot 100 and the motion detection unit 152 detects the "lifting up", the imaging control unit 154 controls the camera 134 to image unknown user X at close range. The recognition unit 156 extracts master vector MX from the master image of the unknown user obtained at the time of lifting. The person recognition unit 214 of the server 200 may execute the user identification processing again by comparing master vector MX of unknown user X with the existing master vectors MA and MB. Because master vectors are compared with one another, more accurate identification is possible. When the comparison of master vectors also determines that unknown user X is a different person from registered users A and B, the person recognition unit 214 registers unknown user X in the master information 224 as a third registered user, together with master vector MX.

When unknown user X is found to be registered user A, the person recognition unit 214 may register the inspection vector obtained from the inspection image of unknown user X as a new master vector of registered user A.

FIG. 10 is a second schematic diagram for explaining the user identification method.
In FIG. 10, multiple master vectors have been extracted for each of registered user A and registered user B. The person recognition unit 214 calculates the centroid vector MB(A) of registered user A and the centroid vector MB(B) of registered user B. Suppose that, in this situation, the robot 100 acquires a captured image (inspection image) of unknown user X walking toward it from the front. The recognition unit 156 determines whether unknown user X in the inspection image is registered user A or B.

The recognition unit 156 first extracts an inspection vector DX = (ax, bx) from the inspection image of unknown user X. The communication unit 142 of the robot 100 transmits the inspection vector DX to the communication unit 204 of the server 200. The person recognition unit 214 of the server 200 calculates ra, the distance between the inspection vector DX and the centroid vector MB(A), and rb, the distance between the inspection vector DX and the centroid vector MB(B).

Given an arbitrary threshold rm, if rb < ra and rb < rm, the person recognition unit 214 determines that unknown user X is registered user B. If instead ra < rb and ra < rm, the person recognition unit 214 determines that unknown user X is registered user A. If ra > rm and rb > rm, unknown user X corresponds to neither registered user A nor B.

FIG. 11 is a flowchart showing the master vector extraction process.
When the motion detection unit 152 of the robot 100 detects that the robot 100 has been lifted up, the vector extraction processing of FIG. 11 is executed. When lifting is detected, the operation control unit 150 executes a predetermined guidance motion (S10). A guidance motion is a motion defined in advance to draw the user's attention. Specifically, nonverbal motions are envisaged, such as waving the hand 106, rocking the body 104, turning the head frame 316 toward the user, and shaking the head frame 316 up and down or left and right. Guidance motions are not limited to mechanical motions. The operation control unit 150 displays a "pupil" image on the eye 110 using organic EL elements, and may instruct image control such as widening the eye by enlarging the pupil image, making the pupil quiver, or winking.

Drawing the user's attention with a guidance motion makes the user turn their face toward the robot 100. In addition, preparing a variety of guidance motions elicits a variety of facial expressions from the user, making it possible to extract a variety of master vectors corresponding to those expressions. For example, feature quantities characteristic of a smile, such as laugh lines and dimples, can be included as vector components of a master vector.

After the guidance motion has been executed, the imaging control unit 154 controls the camera 134 to image the user (S12). The image captured at this time becomes the "master candidate image". By imaging the user at the moment the guidance motion causes the user to look at the robot 100, a high-quality master candidate image in which the user's face is easy to recognize can be acquired.

The recognition unit 156 determines the quality of the master candidate image (S14). Hereinafter, this quality determination is called the "quality inspection". A master candidate image that passes the quality inspection is set as a master image. If the quality inspection fails (N in S14), the process returns to S10 and a master candidate image is acquired again. In that case, a different type of guidance motion may be executed. For the quality inspection, multiple evaluation items are set in advance, covering the size of the user's face, the amount of light, the facial expression, and so on. For example, the quality inspection fails when the user's eyes are closed, when the master candidate image is too dark or too bright, or when the master candidate image is out of focus. Which evaluation items to set for the quality inspection is arbitrary.
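
One way to picture the quality inspection of S14 is as a set of independent pass/fail checks on measurements taken from the master candidate image. The sketch below is purely illustrative: the `CandidateMetrics` fields, their scales, and every threshold value are assumptions, since the text leaves the evaluation items open.

```python
from dataclasses import dataclass


@dataclass
class CandidateMetrics:
    """Hypothetical measurements taken from a master candidate image."""
    eyes_open: bool
    mean_brightness: float  # assumed scale: 0.0 (black) to 1.0 (white)
    sharpness: float        # assumed scale: 0.0 (blurred) to 1.0 (sharp)
    face_fraction: float    # fraction of the frame occupied by the face


# Illustrative thresholds only; the specification gives no values.
MIN_BRIGHTNESS, MAX_BRIGHTNESS = 0.2, 0.8
MIN_SHARPNESS = 0.5
MIN_FACE_FRACTION = 0.1


def passes_quality_inspection(m: CandidateMetrics) -> bool:
    """Return True only if every evaluation item is satisfied."""
    if not m.eyes_open:
        return False  # eyes closed
    if not (MIN_BRIGHTNESS <= m.mean_brightness <= MAX_BRIGHTNESS):
        return False  # too dark or too bright
    if m.sharpness < MIN_SHARPNESS:
        return False  # out of focus
    return m.face_fraction >= MIN_FACE_FRACTION  # face large enough
```

A failed inspection would send the flow back to S10 for another guidance motion and a fresh capture.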

The recognition unit 156 adopts a master candidate image that has passed the quality inspection as a formal master image (Y in S14). The recognition unit 156 extracts a master vector from the master image (S16). The communication unit 142 transmits the master vector to the server 200 (S18).

The person recognition unit 214 compares the newly obtained master vector with the master vectors already registered in the master information 224 (S20). When the newly obtained master vector is close to an already registered master vector (Y in S20), the new master vector is additionally registered (S22). For example, when a master vector similar to master vector (01) of user (01) is obtained, the new master vector is also associated with user (01). When it is not close to any registered master vector (N in S20), a new user ID and master ID are assigned and the master vector is newly registered (S24).
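
Steps S20 to S24 can be sketched as follows, assuming Euclidean distance, a single distance threshold, and sequentially numbered two-digit IDs. The `MasterInfo` layout and the function name `register_master_vector` are hypothetical stand-ins for the master information 224.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple

# Hypothetical layout: user ID -> list of (master ID, master vector).
MasterInfo = Dict[str, List[Tuple[str, Sequence[float]]]]


def register_master_vector(
    info: MasterInfo, new_vector: Sequence[float], threshold: float
) -> str:
    """Add new_vector to the nearest user (S22) or to a new user (S24).

    Returns the user ID under which the vector was registered.
    """
    # S20: find the registered master vector nearest to the new one.
    best_user: Optional[str] = None
    best_r = math.inf
    for user_id, entries in info.items():
        for _, vector in entries:
            r = math.dist(new_vector, vector)
            if r < best_r:
                best_user, best_r = user_id, r

    master_id = f"{sum(len(e) for e in info.values()) + 1:02d}"
    if best_user is not None and best_r < threshold:
        info[best_user].append((master_id, new_vector))  # S22: same user
        return best_user
    user_id = f"{len(info) + 1:02d}"
    info[user_id] = [(master_id, new_vector)]  # S24: new user
    return user_id
```

Starting from an empty store, the first call registers user "01"; a later nearby vector is appended to that user, and a distant one creates user "02".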

In S20, the newly extracted master vector may be compared with the registered master vectors, or it may be compared with the registered center-of-gravity vector as described with reference to FIG. 10.
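As a non-limiting illustration, the registration flow of S20 to S24 might be sketched as follows. The Euclidean distance, the `threshold` parameter, the integer user IDs, and the dictionary layout of `master_info` are assumptions made for this sketch and are not specified in the embodiment.

```python
import math


def distance(u, v):
    """Euclidean distance between two equal-length feature vectors (assumed metric)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def register_master_vector(master_info, new_vector, threshold):
    """Register new_vector and return the user ID it was stored under.

    master_info maps a user ID (int) to that user's list of master vectors.
    If the closest registered vector is within threshold, the new vector is
    added to that user (S22); otherwise a new user ID is issued (S24).
    """
    best_user, best_dist = None, float("inf")
    for user_id, vectors in master_info.items():
        for vec in vectors:
            d = distance(new_vector, vec)
            if d < best_dist:
                best_user, best_dist = user_id, d

    if best_user is not None and best_dist <= threshold:
        master_info[best_user].append(new_vector)  # S22: additional registration
        return best_user

    new_id = max(master_info, default=0) + 1       # S24: new registration
    master_info[new_id] = [new_vector]
    return new_id
```

Comparing against a center-of-gravity vector instead would only change the inner loop: one distance per user rather than one per registered vector.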

The master vector extraction process is not limited to when the robot 100 is held; it may also be triggered when the user touches the robot 100. When the user touches the robot 100, the user is near the robot 100, so a good master image may be obtained.

The recognition unit 156 also executes the master vector extraction process when the motion detection unit 152 detects that the robot 100 is being set down. When the setting down is detected, the motion detection unit 152 continuously captures images of the user. The recognition unit 156 performs the quality inspection on the resulting master candidate images in sequence and extracts a plurality of master vectors. While the robot 100 is being set down, physical features such as the chin, waist, and feet can be imaged at close range.

The master vector obtained when the user lifts the robot 100 is referred to as a "first master vector", and a master vector obtained while or after the user sets the robot 100 down is referred to as a "second master vector". After obtaining the first master vector of the user (01), the recognition unit 156 also extracts one or more second master vectors from the master images captured when the robot 100 is set down. When a highly accurate first master vector has been obtained in this way, acquiring second master vectors when the robot 100 is set down enriches the master vectors of the user (01). The "second master vector" here also includes master vectors obtained from various distances and angles after the robot 100 has been set down, including views of the user from behind. The first master vector and the second master vectors are associated with each other for a single user, as shown in the master information 224.

FIG. 12 is a schematic diagram showing a method of tracking a user in images.
Even after the robot 100 is lowered onto the floor F, the imaging control unit 154 continues to track the user with the camera 134 (omnidirectional camera). The celestial sphere imaging range 418 shown in FIG. 12 is the imaging range of the omnidirectional camera. The omnidirectional camera can image substantially the entire upper hemisphere of the robot 100 at once. Even after extracting the first master vector, the recognition unit 156 of the robot 100 tracks the user within the celestial sphere imaging range 418 for a predetermined period, for example, about 10 seconds. During tracking, the imaging control unit 154 captures master images of the user from various angles and distances. For example, hair length and waist width are information that cannot be obtained unless the robot is some distance away from the user. The recognition unit 156 enriches the master vectors by extracting various second master vectors from the master images obtained during tracking. These second master vectors are managed in association with the first master vector. In addition to tracking the user on the image within the celestial sphere imaging range 418, the operation control unit 150 may cause the robot 100 to execute a tracking action such as following the user or moving around the user. The imaging control unit 154 may also image the user during the tracking action to enrich the master vectors. The tracking action may be instructed by the operation control unit 150, or the operation control unit 222 of the server 200 may instruct the operation control unit 150.

FIG. 13 is a schematic diagram for explaining a method of extracting a master vector from a distance.
The imaging control unit 154 captures a master candidate image not only when the robot 100 is held or touched but also when the user is located at a predetermined relative point with respect to the robot 100. The relative point here covers both the distance between the user and the robot 100 and the relative angle. The distance measuring unit 158 periodically measures the distance to one or more users recognized in the celestial sphere imaging range 418. In FIG. 13, the robot 100 is located at a relative point defined by a horizontal angle a with respect to the front direction of the user, an elevation angle b with respect to the position of the user's face, and a distance r from the user. The recognition unit 156 determines the orientation of the user's body by image recognition. When the distance, the horizontal angle, and the elevation angle are within a predetermined range (hereinafter referred to as a "master shot range"), the imaging control unit 154 captures a master candidate image. The recognition unit 156 performs the quality inspection on the master candidate image and, if it passes, extracts a master vector.

A plurality of master shot ranges are set in advance in the distance measuring unit 158. The distance measuring unit 158 notifies the imaging control unit 154 each time the user enters a master shot range, and the imaging control unit 154 acquires a master candidate image. For example, when master shot ranges R1 to R3 are defined and a new user C enters the master shot range R1, a master candidate image corresponding to the master shot range R1 is acquired. In this manner, master vectors corresponding to each of the master shot ranges R1 to R3 are extracted. By imaging the user C from a plurality of master shot ranges, in other words, from a plurality of relative points, and obtaining master vectors from multiple directions, the physical characteristics of the user can be grasped three-dimensionally.
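As a non-limiting illustration, the master shot range check might be sketched as follows. The class name `MasterShotRange`, the use of simple minimum and maximum bounds, and the units (meters and degrees) are assumptions for this sketch; the embodiment only states that distance, horizontal angle, and elevation angle fall within a predetermined range.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MasterShotRange:
    """Inclusive (min, max) bounds for distance r, horizontal angle a, elevation angle b."""
    r: tuple
    a: tuple
    b: tuple

    def contains(self, r, a, b):
        """True when the relative point (r, a, b) lies inside all three bounds."""
        return (self.r[0] <= r <= self.r[1]
                and self.a[0] <= a <= self.a[1]
                and self.b[0] <= b <= self.b[1])


def ranges_entered(ranges, r, a, b):
    """Return the names of every master shot range that contains the relative point."""
    return [name for name, rng in ranges.items() if rng.contains(r, a, b)]


# Hypothetical ranges: R1 is a close-up of the face, R2 a more distant view.
ranges = {
    "R1": MasterShotRange(r=(0.2, 0.6), a=(-20, 20), b=(30, 70)),
    "R2": MasterShotRange(r=(1.0, 2.0), a=(-45, 45), b=(10, 40)),
}
```

Each name returned by `ranges_entered` would correspond to one notification to the imaging control unit 154, so that one master candidate image is captured per range entered.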

During contact such as holding or touching, the user can be imaged from very close range, so it is easy to obtain high-quality information about the user's face. Even when the robot 100 is not being held or touched, the imaging control unit 154 may acquire a master candidate image when the distance measuring unit 158 detects a user at close range. For example, even if a small child is reluctant to hold or touch the robot 100, a master vector can be extracted when the child approaches out of interest. Conversely, to obtain information on the user's hair length and body shape, the robot 100 must be some distance away from the user. By setting various master shot ranges, diverse master vectors covering not only the user's face but also the body shape can be obtained.

The first master vector is based on a master image of the user taken at close range and is therefore a useful feature vector for identifying the user. In contrast, the second master vector often does not show the physical characteristics of the user as clearly as the first master vector. For this reason, the imaging control unit 154 enters a tracking mode when the first master vector (A) has been extracted from the master image A captured at the time of holding or touching. The tracking mode may continue for a predetermined time. The imaging control unit 154 acquires the master image B1, for example, when the setting down is detected. A second master vector (B1) is extracted from the master image B1 and is associated with the first master vector (A) extracted earlier. The tracking mode continues after the robot 100 is set down, and when the user enters a master shot range, the master image B2 is also acquired. The second master vector (B2) obtained from the master image B2 is likewise associated with the first master vector (A) that triggered the tracking mode. In this way, the various second master vectors obtained afterwards are associated with the first master vector (A), which identifies the user reliably. Even a second master vector in which features are hard to discern, such as a view from behind, can enrich the group of master vectors for a single user when it is associated with the first master vector that triggered its acquisition.
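As a non-limiting illustration, the association between a first master vector and the second master vectors gathered during the tracking mode might be sketched as follows. The class name `TrackingSession`, the timestamp arguments, and the fixed-duration expiry rule are assumptions for this sketch.

```python
class TrackingSession:
    """Groups second master vectors under the first master vector that started tracking."""

    def __init__(self, first_vector, started_at, duration):
        self.first_vector = first_vector
        self.second_vectors = []
        # Tracking mode is assumed to end a fixed time after it starts.
        self.expires_at = started_at + duration

    def add_second_vector(self, vector, now):
        """Attach vector if the tracking mode is still active; return whether it was attached."""
        if now > self.expires_at:
            return False
        self.second_vectors.append(vector)
        return True
```

Because every vector added through a session is attached to the same first master vector, a view from behind never has to identify the user by itself; it inherits the identity established at close range.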

The robot 100 and the robot system 300 including the robot 100 have been described above based on the embodiment.
In face recognition technology, a master image is often acquired after giving the user a verbal instruction such as "please face forward" or "please look at the camera". Such verbal instructions, whether spoken or written, tend to burden the user. Verbal instructions for acquiring a master image are also undesirable in that they make the user aware that the robot 100 is not a living thing. The robot 100 according to the present embodiment can acquire a master image unobtrusively at the moment the user holds the robot 100. The robot 100 is small, soft, light, and round, a form that makes people want to touch it. Instead of forcing the user to take some action, the robot 100 captures the moment when the user naturally holds it and can thereby acquire a high-quality master image. By making use of this characteristic of the robot 100, namely that it stimulates the desire to hold and touch it, the master image can be acquired unobtrusively.

The robot 100 further draws the user's attention with a non-verbal guided motion such as flapping its hands 106. Because this approach attracts the user's attention through non-verbal communication, the user is unlikely to feel forced.

The robot 100 images the user when it is held and acquires a first master vector. The robot 100 can also acquire a plurality of second master vectors by capturing master images while and after it is set down. Accumulating second master vectors on the occasion when the first master vector is extracted makes it easier to grasp the physical characteristics of the user from multiple perspectives.

According to the present embodiment, it becomes easier to establish precise determination criteria for the user identification process based on a large number of high-quality master images. The user identification process is a prerequisite for recognizing responsive actions and calculating familiarity. By identifying the user with high accuracy based on the master vectors, the robot 100 can change its behavioral characteristics according to the user.

なお、本発明は上記実施形態や変形例に限定されるものではなく、要旨を逸脱しない範囲で構成要素を変形して具体化することができる。上記実施形態や変形例に開示されている複数の構成要素を適宜組み合わせることにより種々の発明を形成してもよい。また、上記実施形態や変形例に示される全構成要素からいくつかの構成要素を削除してもよい。 Note that the present invention is not limited to the above embodiments and modified examples, and can be embodied by modifying the constituent elements without departing from the gist. Various inventions may be formed by appropriately combining a plurality of components disclosed in the above embodiments and modifications. In addition, some components may be deleted from all the components shown in the above-described embodiment and the modified examples.

The robot system 300 has been described as comprising one robot 100, one server 200, and a plurality of external sensors 114, but some of the functions of the robot 100 may be realized by the server 200, and some or all of the functions of the server 200 may be assigned to the robot 100. One server 200 may control a plurality of robots 100, or a plurality of servers 200 may cooperate to control one or more robots 100.

A third device other than the robot 100 and the server 200 may perform some of the functions. The aggregate of the functions of the robot 100 and the functions of the server 200 described with reference to FIG. 6 can also be regarded, in broad terms, as a single "robot". How the plurality of functions necessary to implement the present invention are distributed over one or more pieces of hardware may be determined in view of the processing capacity of each piece of hardware, the specifications required of the robot system 300, and the like.

上述したように、「狭義におけるロボット」とはサーバ２００を含まないロボット１００のことであるが、「広義におけるロボット」はロボットシステム３００のことである。サーバ２００の機能の多くは、将来的にはロボット１００に統合されていく可能性も考えられる。 As described above, the “robot in a narrow sense” refers to the robot 100 that does not include the server 200, whereas the “robot in a broad sense” refers to the robot system 300. Many of the functions of the server 200 may be integrated into the robot 100 in the future.

The master vector may include, as vector components, feature amounts other than those extracted from the master image. For example, it may include an odor detected by an odor sensor, a voice quality detected by a microphone, or a body temperature detected by a temperature sensor. In particular, the user's odor, body temperature, and the like are easy to detect with high accuracy while the robot 100 is held. The master image may be a moving image (hereinafter referred to as a "master moving image") instead of a still image. The recognition unit 156 may extract habits such as the user's gait or leg jiggling from the master moving image and include this characteristic information in the master vector components.

本実施形態におけるカメラ１３４は、全天球カメラであるが、カメラ１３４は通常のカメラであってもよい。カメラ１３４はツノ１１２に内蔵されてもよいし、目１１０に内蔵されてもよい。また、全天球カメラと通常のカメラの双方が内蔵されてもよい。 Although the camera 134 in the present embodiment is a spherical camera, the camera 134 may be a normal camera. The camera 134 may be built in the horn 112 or may be built in the eye 110. Further, both a spherical camera and a normal camera may be incorporated.

図８においては、重心ベクトルは複数のマスタベクトルの相加平均により形成されるが、変形例としては複数のマスタベクトルの中央値を重心ベクトルの成分としてもよい。たとえば、図８においてａ１＜ａ２＜ａ３であれば、重心ベクトルのａ成分はａ２としてもよい。 In FIG. 8, the center of gravity vector is formed by arithmetic mean of a plurality of master vectors, but as a modification, the median of the plurality of master vectors may be used as a component of the center of gravity vector. For example, if a1 <a2 <a3 in FIG. 8, the a component of the center of gravity vector may be a2.
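As a non-limiting illustration, the two ways of forming the center-of-gravity vector might be sketched as follows. The function name `centroid` and the `use_median` flag are assumptions for this sketch; the component-wise arithmetic mean and median come from the description above.

```python
from statistics import mean, median


def centroid(vectors, use_median=False):
    """Component-wise mean (default) or median of equal-length master vectors."""
    aggregate = median if use_median else mean
    # zip(*vectors) yields one tuple per component: all a-values, all b-values, ...
    return [aggregate(component) for component in zip(*vectors)]


vectors = [[1.0, 10.0], [2.0, 20.0], [6.0, 30.0]]
mean_centroid = centroid(vectors)                      # [3.0, 20.0]
median_centroid = centroid(vectors, use_median=True)   # [2.0, 20.0]
```

In this example the first component illustrates the difference: one unusually large value (6.0) pulls the mean to 3.0, whereas the median stays at the middle value 2.0, matching the a1 < a2 < a3 case in which a2 is used.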

In the quality inspection of a master candidate image, a plurality of evaluation items may be weighted. Possible evaluation items include (E1) whether the user is facing the front, (E2) whether the light amount is appropriate, and (E3) whether the user's eyes are open. The master candidate image may be scored on each evaluation item, and its quality may be determined from the weighted average of the item scores. For example, if coefficients p1, p2, and p3 are set for E1 to E3 (p1 + p2 + p3 = 1) and the item scores for E1 to E3 are s1, s2, and s3, the total score is p1·s1 + p2·s2 + p3·s3. If the total score is equal to or greater than a predetermined threshold, the master candidate image is adopted as a master image, and its master vector is registered in the master information 224.
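As a non-limiting illustration, the weighted quality inspection might be sketched as follows. The function names, the score scale of 0 to 1, and the example weights and threshold are assumptions for this sketch; only the weighted sum with coefficients summing to 1 and the threshold comparison come from the description above.

```python
def quality_score(scores, weights):
    """Weighted total p1*s1 + p2*s2 + ... for one master candidate image."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return sum(p * s for p, s in zip(weights, scores))


def passes_inspection(scores, weights, threshold):
    """True when the total score is at or above the threshold."""
    return quality_score(scores, weights) >= threshold


# Hypothetical item scores for E1 (facing front), E2 (light amount), E3 (eyes open).
scores = [1.0, 0.5, 0.0]
weights = [0.5, 0.25, 0.25]
total = quality_score(scores, weights)  # 0.5*1.0 + 0.25*0.5 + 0.25*0.0 = 0.625
```

With these example values, a threshold of 0.6 would accept the image even though the eyes-open item scored zero, which shows how weighting differs from requiring every item to pass individually.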

The guided motion may be executed at times other than when the robot 100 is held. The operation control unit 150 may execute the guided motion when the distance between the robot 100 and the user is within a predetermined range. For example, the guided motion may be executed when the user is in a master shot range shown in FIG. 13, and a master candidate image may then be captured. In summary, while interacting with the user in various ways, the robot 100 extracts master vectors without missing "photo opportunities" that are effective for grasping the user's physical and behavioral characteristics, and can thereby collect diverse, high-quality master vectors without the user being aware of it. The robot 100 can also actively create "photo opportunities" through the non-verbal prompting of the guided motion.

本実施形態における誘導モーションは、非言語コミュニケーションの一種である。ここでいう非言語モーションは動物の鳴き声のように言語としての意味をなさない音声を含んでもよい。変形例として、ロボット１００は簡単な言語によりユーザによびかけてもよい。 Guided motion in the present embodiment is a type of non-verbal communication. Here, the non-verbal motion may include a sound that does not make sense as a language, such as the sound of an animal. As a variant, the robot 100 may call the user in a simple language.

When held by the user, the robot 100 may acquire three face images, namely the front, the right side, and the left side of the face, as master images. The recognition unit 156 may determine from which direction the robot 100 is viewing the user by recognizing the user's ears and nose. The robot 100 may include a gyroscope as one of the internal sensors 128. The recognition unit 156 may use the gyroscope to detect the tilt direction of the robot 100 when it is held by the user and thereby determine from which direction it is viewing the user.

In addition to the method shown in FIGS. 9 and 10 (hereinafter referred to as the "distance determination method"), user identification may be performed using the Mahalanobis distance. In FIG. 10, when a plurality of master vectors have been obtained, the person recognition unit 214 takes their variance into account and obtains the Mahalanobis distance between the inspection vector DX and the master vector group of the user A. Similarly, the person recognition unit 214 obtains the Mahalanobis distance between the inspection vector DX and the master vector group of the user B. Then, based on the Mahalanobis distance to each group, it may determine whether the unknown user X is the user A or the user B by a known discriminant analysis method (hereinafter referred to as the "Mahalanobis determination method").
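As a non-limiting illustration, a simplified form of the Mahalanobis determination method might be sketched as follows. This sketch assumes the vector components are uncorrelated, so only the per-component variance is used (a diagonal covariance matrix); a full implementation would use the inverse of the complete covariance matrix. The function names and the small constant `eps`, which avoids division by zero, are also assumptions.

```python
import math
from statistics import mean, pvariance


def mahalanobis_diag(x, group, eps=1e-9):
    """Mahalanobis distance from x to a group of vectors, assuming uncorrelated components."""
    total = 0.0
    for xi, column in zip(x, zip(*group)):
        mu = mean(column)
        var = pvariance(column, mu) + eps  # population variance of this component
        total += (xi - mu) ** 2 / var
    return math.sqrt(total)


def classify(x, groups):
    """Return the name of the group with the smallest distance to x."""
    return min(groups, key=lambda name: mahalanobis_diag(x, groups[name]))


# Hypothetical master vector groups for user A and user B.
groups = {
    "A": [[0.0, 0.0], [2.0, 2.0]],
    "B": [[10.0, 10.0], [12.0, 12.0]],
}
```

Unlike a plain Euclidean distance to the center of gravity, dividing by the variance means that a deviation along a component in which a user's master vectors vary widely counts for less than the same deviation along a component in which they are tightly clustered.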

人物認識部２１４は、各ユーザのマスタベクトル・グループを教師データとするニューラル・ネットワークを形成し、未知ユーザＸの検査ベクトルとマスタベクトルとの当てはまりのよさに基づいてユーザ識別を実行してもよい（以下、「ニューラル・ネットワーク判定法」とよぶ）。 The person recognizing unit 214 may form a neural network using the master vector group of each user as teacher data, and may execute user identification based on the goodness of fit between the test vector of the unknown user X and the master vector. (Hereinafter, it is called “neural network judgment method”).

人物認識部２１４は、距離判定法、マハラノビス判定法、ニューラル・ネットワーク判定法のうち、複数を組み合わせてユーザを識別してもよい。また、検査ベクトルとマスタベクトルの比較だけでなく、登録ユーザのマスタベクトルと未知ユーザのマスタベクトルを比較するときにも、上述の各種方法により類似判定をしてもよい。 The person recognizing unit 214 may identify the user by combining a plurality of the distance determination method, the Mahalanobis determination method, and the neural network determination method. In addition to comparing the test vector with the master vector, the similarity determination may be performed by the above-described various methods when comparing the master vector of the registered user with the master vector of the unknown user.

In the present embodiment, the imaging control unit 154 has been described as acquiring a master image at the time of holding, touching, and the like. As a modification, the imaging control unit 154 may image the user periodically, and the recognition unit 156 may sift through the many captured images as master candidate images. For example, the user is imaged once every 10 seconds, and the recognition unit 156 performs the quality inspection on each captured image as a master candidate image. The recognition unit 156 extracts master vectors from the master images that pass. With this method, a master vector can also be extracted from a good captured image obtained by chance.

人物認識部２１４は、マスタベクトルの数が所定数以上となったとき、古いマスタベクトルを個人データ格納部２１８から削除してもよい。あるいは、古いマスタベクトル、たとえば、３年以上前に取得されたマスタベクトルを削除してもよい。このような制御方法によれば、個人データ格納部２１８のデータ量を抑制できるだけではなく、ユーザの加齢や成長にともなう身体的特徴の変化にも対応できる。 When the number of master vectors becomes equal to or more than a predetermined number, the person recognizing unit 214 may delete the old master vector from the personal data storage unit 218. Alternatively, an old master vector, for example, a master vector obtained three years or more ago may be deleted. According to such a control method, not only can the data amount of the personal data storage unit 218 be suppressed, but it is also possible to cope with changes in physical characteristics due to aging and growth of the user.
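As a non-limiting illustration, the deletion of old master vectors might be sketched as follows. Applying the count limit and the age limit together in one function is a choice made for this sketch (the description presents them as alternatives), and the record layout of `(acquired_at, vector)` pairs and the parameter names are assumptions.

```python
def prune_master_vectors(records, now, max_count, max_age):
    """Return the records to keep, newest first.

    records is a list of (acquired_at, vector) pairs for one user.
    Records older than max_age are dropped, then at most max_count
    of the newest remaining records are kept.
    """
    fresh = [rec for rec in records if now - rec[0] <= max_age]
    fresh.sort(key=lambda rec: rec[0], reverse=True)
    return fresh[:max_count]


# Hypothetical timestamps in arbitrary units.
records = [(0, "v0"), (5, "v1"), (9, "v2"), (10, "v3")]
kept = prune_master_vectors(records, now=10, max_count=2, max_age=6)
```

Keeping the newest records favors the user's current appearance, which is what allows the stored vectors to follow changes due to aging and growth.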

本実施形態においてはロボット１００において特徴ベクトルを抽出し、サーバ２００において特徴ベクトルの比較を行うことでユーザ識別するとして説明した。変形例として、ロボット１００は、撮像画像をサーバ２００に送り、サーバ２００の人物認識部２１４が特徴ベクトルの抽出およびユーザ識別の双方を実行してもよい。あるいは、ロボット１００は、サーバ２００の処理能力に頼ることなく、認識部１５６においてユーザ識別処理を実行してもよい。この場合には、ロボット１００は各ユーザのマスタベクトルをロボット１００のデータ格納部１４８において管理してもよい。 In the present embodiment, it has been described that the user is identified by extracting the feature vector in the robot 100 and comparing the feature vector in the server 200. As a modification, the robot 100 may send the captured image to the server 200, and the person recognizing unit 214 of the server 200 may perform both the extraction of the feature vector and the user identification. Alternatively, the robot 100 may execute the user identification process in the recognition unit 156 without depending on the processing capability of the server 200. In this case, the robot 100 may manage the master vector of each user in the data storage unit 148 of the robot 100.

ロボット１００に内蔵されるカメラや各種センサに限らず、外部センサ１１４に内蔵されるセンサによりユーザの身体的・行動的特徴を抽出してもよい。外部センサ１１４はユーザが近くにいるときにユーザを撮像し、撮像画像をロボット１００に送信する。ロボット１００の認識部１５６は、この撮像画像の品質検査や成分抽出を実行してもよい。 Not only the camera and various sensors built into the robot 100 but also the sensor built into the external sensor 114 may extract the physical and behavioral characteristics of the user. The external sensor 114 captures an image of the user when the user is nearby, and transmits the captured image to the robot 100. The recognition unit 156 of the robot 100 may execute the quality inspection and the component extraction of the captured image.

本実施形態においては、個人データ格納部２１８はマスタ画像ではなくマスタベクトルを保存するとして説明したが、マスタ画像とマスタベクトルの双方を保存してもよい。 In the present embodiment, the personal data storage unit 218 stores the master vector instead of the master image. However, the personal data storage unit 218 may store both the master image and the master vector.

ロボットシステム３００は、工場出荷時からマスタベクトルによるユーザ識別機能を備える必要はない。たとえば、ロボットシステム３００は、ディープラーニングを応用したクラスタリング技術によりユーザ識別を行ってもよい。ロボットシステム３００の出荷後に、通信ネットワークを介してマスタベクトルによるユーザ識別機能を実現する行動制御プログラムをダウンロードすることにより、ロボットシステム３００の機能強化が実現されてもよい。 The robot system 300 does not need to have a user identification function using a master vector from the time of shipment from the factory. For example, the robot system 300 may perform user identification by a clustering technique using deep learning. After shipping the robot system 300, the function of the robot system 300 may be enhanced by downloading an action control program for realizing a user identification function using a master vector via a communication network.

As described above, the recognition unit 156 selects a captured image taken when the robot 100 is lifted as a master candidate image. The recognition unit 156 may detect the position and orientation of the user's face using a temperature sensor such as a thermal camera, and may detect the distance between the user and the robot 100 using a distance measurement sensor. The recognition unit 156 may select, as a master candidate image, a captured image taken when a predetermined condition is satisfied for either or both of the temperature information from the thermal camera and the distance information from the distance measurement sensor. For example, the recognition unit 156 may select as a master candidate image a captured image taken when the thermal camera confirms that the user is facing the robot 100 and the distance measurement sensor indicates that the distance between the user and the robot 100 is within a predetermined range. Such a control method makes it easier to select appropriate master candidate images carefully based on multiple types of sensors.

The camera mounted on the robot 100 may be an omnidirectional camera. When the robot 100 is held by the user from behind, in other words, even when the robot 100 and the user are not facing each other, the robot 100 can image the user behind it with the omnidirectional camera. Therefore, even when the robot 100 is held from behind, the recognition unit 156 can acquire an appropriate master candidate image, which increases the opportunities for acquiring master candidate images. Alternatively, the recognition unit 156 may specify a master candidate image only on condition that the robot 100 and the user are facing each other.

When a plurality of users appear in the image captured while the robot 100 is held, the recognition unit 156 may refrain from selecting this captured image as a master candidate image.

When a registered user P1 is detected in a captured image containing a plurality of users, the recognition unit 156 may extract the feature vector of the registered user P1 from the captured image and additionally register it as a new master vector of the registered user P1. When no registered user is detected in a captured image containing a plurality of users, in other words, when a captured image containing only unknown users is obtained, the recognition unit 156 may extract a feature vector for an unknown user P2 who satisfies a predetermined condition, such as facing the front, and newly register it as the master vector of the unknown user P2.

認識部１５６は、撮影に際して、マイクロフォンによりユーザの声（音声情報）も取得してもよい。マスタベクトルは、画像情報に限らず、音声情報に基づく特徴ベクトルを含んでもよい。同様にして、認識部１５６は、撮影に際して、ニオイセンサによりユーザの匂い（嗅覚情報）を取得してもよい。このように登録ユーザを特定するための情報として、画像情報のほか、音声情報や嗅覚情報など多様なセンサ情報が含まれてもよい。 The recognizing unit 156 may also acquire the voice (voice information) of the user by using a microphone at the time of shooting. The master vector is not limited to image information, and may include a feature vector based on audio information. Similarly, the recognition unit 156 may acquire the user's smell (smell information) by the odor sensor at the time of photographing. As described above, the information for specifying the registered user may include various sensor information such as audio information and olfactory information in addition to the image information.

The robot 100 may include a plurality of microphones. For voice registration, voice may be detected only from the microphone corresponding to the direction in which the user is present, for example, a microphone mounted on the front of the robot 100. The recognition unit 156 may disable the other microphones. With this control method, environmental sounds other than the user's voice are less likely to be incorporated into the master vector. The microphones, particularly the front-mounted microphone, are preferably directional.
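The microphone selection described above might be sketched as below. This assumes, purely for illustration, that each microphone has a known mounting angle in degrees around the body and that the user's bearing is available in the same coordinate system; the function name select_microphone and the microphone labels are hypothetical.

```python
from typing import Dict


def select_microphone(mic_angles: Dict[str, float], user_bearing: float) -> Dict[str, bool]:
    """Enable only the microphone mounted closest to the user's direction."""
    def angular_gap(a: float, b: float) -> float:
        # Smallest angle between two bearings, in the range 0 to 180 degrees.
        diff = abs(a - b) % 360.0
        return min(diff, 360.0 - diff)

    best = min(mic_angles, key=lambda name: angular_gap(mic_angles[name], user_bearing))
    # True means enabled; every other microphone is disabled.
    return {name: name == best for name in mic_angles}
```

For a user standing almost directly in front of the robot, only the front microphone would remain enabled in this sketch.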

The recognition unit 156 may acquire the user's audio information as part of the master vector on the condition that the audio was obtained while movement of the lips of the user appearing in the captured image was detected. When an unknown user is detected, the recognition unit 156 may cause the robot to approach the unknown user and execute a motion of asking to be hugged.
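The lip-movement condition above can be illustrated with a small sketch. How lip movement is detected is not specified; here it is assumed, hypothetically, that each camera frame yields a mouth-opening measurement and that a change larger than a threshold between consecutive frames counts as movement. The function names and the threshold value are illustrative only.

```python
from typing import List, Sequence


def lip_motion_flags(openings: Sequence[float], threshold: float = 0.05) -> List[bool]:
    """Flag frames whose mouth opening changed by more than the threshold."""
    if not openings:
        return []
    flags = [False]  # the first frame has no predecessor to compare with
    for prev, cur in zip(openings, openings[1:]):
        flags.append(abs(cur - prev) > threshold)
    return flags


def gate_audio_by_lip_motion(chunks: Sequence[str], lip_moving: Sequence[bool]) -> List[str]:
    """Keep only the audio chunks recorded while the lips were moving."""
    if len(chunks) != len(lip_moving):
        raise ValueError("audio chunks and lip flags must be time-aligned")
    return [chunk for chunk, moving in zip(chunks, lip_moving) if moving]
```

Only the retained chunks would then be passed on to voice feature extraction, so sound captured while the user's mouth is still would not contribute to the master vector.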

Claims

An autonomous robot comprising:
an imaging control unit that controls a camera;
a recognition unit that determines a moving object based on a feature vector extracted from a captured image of the moving object;
an operation selection unit that selects a motion of the robot according to the determination result;
a drive mechanism that executes the motion selected by the operation selection unit; and
a motion detection unit that detects that the robot is being held by the moving object,
wherein the recognition unit sets a captured image obtained when the robot is held by the moving object as a master image, and sets a determination criterion for the moving object based on a feature vector extracted from the master image.

The autonomous robot according to claim 1, wherein the recognition unit sets a plurality of captured images of the moving object from a plurality of angles as a master image.

The autonomous robot according to claim 1, wherein the operation selection unit causes the drive mechanism to execute a predetermined guidance motion, and
the imaging control unit captures a master image of the moving object when the guidance motion is executed.

The autonomous robot according to claim 3, wherein the guidance motion is a non-verbal motion.

The autonomous robot according to claim 1, wherein the recognition unit further sets a captured image obtained when the robot is put down by the moving object as a master image.

The autonomous robot according to claim 1, wherein the operation selection unit selects a motion of tracking the moving object after the robot is put down by the moving object, and
the imaging control unit captures a master image of the moving object during the tracking.

The autonomous robot according to claim 1, wherein the imaging control unit tracks the moving object even after the robot is put down by the moving object, and captures a second master image of the moving object at a predetermined timing, and
the recognition unit extracts a plurality of feature vectors related to the moving object by associating a first master image, acquired when the robot is held by the moving object, with the second master image.

The autonomous robot according to claim 1, wherein the recognition unit determines a moving object based on feature vectors extracted from both a captured image of the moving object and audio information, also acquires audio information from the moving object when acquiring the master image, and sets a determination criterion for the moving object based on a feature vector extracted from the audio information.

A behavior control program, being a computer program for object recognition by a robot, the program causing the robot to perform:
a function of setting, as a master image, a captured image of a moving object obtained when the robot is held by the moving object;
a function of setting a determination criterion for the moving object based on a feature vector extracted from the master image; and
a function of determining the moving object based on a feature vector extracted from a captured image of the moving object.