JP2000187737A

JP2000187737A - Visual corresponding method/device

Info

Publication number: JP2000187737A
Application number: JP22375598A
Authority: JP
Inventors: Toru Fukuda; 徹福田
Original assignee: MEDIA LINK SYSTEM KK
Current assignee: MEDIA LINK SYSTEM KK
Priority date: 1998-07-24
Filing date: 1998-07-24
Publication date: 2000-07-04

Abstract

PROBLEM TO BE SOLVED: To make corresponding scenes before and behind to be smooth by making the corresponding scenes and connection scenes into videos with the mapping of the model of a man, constituting a database and inserting the selected connection scene between the corresponding scenes before and behind. SOLUTION: A visual corresponding device is constituted into the form of a visual robot 2 corresponding to a customer 1. Various pieces of software obtained by making the multiple corresponding scenes and the multiple connection scenes, which are made into patterns by mapping the model of the man, into videos are recorded in RAM 6. A sensor 3 and a speech recognition means 4 are used as the detection means of the contents of the correspondence of a user. The visual robot 2 detects the action (rock, scissors and paper) of the customer 1 by the sensor 3 and the speech recognition means 4 recognizes the speech of the customer 1, 'rock-scissors-paper', 'a draw' and 'face overthere', for example, and reproduces the appropriate corresponding scenes for them and the connection scenes on a display 5.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、実写された多数
の応対シーンとつなぎシーンを使用するビジュアル応対
方法および装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a visual response method and apparatus using a plurality of response scenes and a connection scene which are actually photographed.

【０００２】[0002]

【従来の技術】従来の情報サービスでは、文字、写真な
どからなる静止画像と、それに対応した音声のマルチメ
ディアデータを表示し、その表示に対するユーザの反応
（キー操作やタッチパネル操作等）により、あらかじめ
決められた静止画像及び声を出していた。これらの静止
画像と静止画像の間には画像の連続性の必要性は全くな
かった。2. Description of the Related Art In a conventional information service, a still image composed of characters, photographs, and the like and multimedia data of audio corresponding thereto are displayed, and a user's reaction to the display (key operation, touch panel operation, etc.) is performed in advance. The determined still image and voice were uttered. There was no need for image continuity between these still images.

【０００３】また、駅ホームでは、音声を音節毎にデー
タベース化し、その音節を組み合わせることにより、音
声を表示していた。例えば、「まもなく」、「３番線
に」、「特急電車がまいります」、「危ないですから白
線まで下がって下さい」という音声を組み合わせた案内
が実用化されている。音声だけの表示の場合にも、画像
をスムーズにつなぐことは必要としない。On the other hand, in a station platform, a voice is displayed in a database for each syllable, and the syllable is combined to display the voice. For example, guidance combining voices “soon”, “on line 3”, “express train is coming”, and “go down to the white line because it is dangerous” has been put to practical use. Even in the case of displaying only audio, it is not necessary to connect images smoothly.

【０００４】[0004]

【発明が解決しようとする課題】多数の応対シーンとつ
なぎシーンを使用してビジュアル応対方法および装置を
実現しようとするとき、まず考えられるのは、コンピュ
ータグラフィクスで人間の顔のロボットを画面に再生で
きるように予め制作しておき、現場でユーザの操作や反
応によってロボットの顔が画面上で一連のスムーズな動
きをして表情を変えたりセリフをしゃべるようにするこ
とである。When realizing a visual response method and apparatus using a large number of response scenes and connecting scenes, the first possibility is to reproduce a human face robot on a screen by computer graphics. It is to make it in advance so that it can be done, and the robot's face makes a series of smooth movements on the screen by changing the facial expression and speaks by the user's operation and reaction at the site.

【０００５】しかし、コンピュータグラフィクスを利用
すると、多大な制作時間と費用が必要となる。そのた
め、多数の応対シーンとつなぎシーンを使用するビジュ
アル応対方法および装置の実現が実際上困難であった。However, the use of computer graphics requires a great deal of production time and cost. For this reason, it has been practically difficult to realize a visual response method and apparatus using a large number of response scenes and connecting scenes.

【０００６】実在の人間のモデルを撮影（これを実写と
いう）して、多数の応対シーンをデータベース化してお
き、多数の応対シーンとつなぎシーンを使用するビジュ
アル応対方法および装置は、制作時間と費用が比較的少
くてすむ。[0006] A visual response method and apparatus using a large number of response scenes in which a real human model is photographed (this is referred to as a real photograph) and a large number of response scenes are used as a database, requires a lot of production time and cost. But relatively few.

【０００７】しかし、実際に使用するときに、ユーザの
操作に応じて、その場で応対シーンを編集して表示しよ
うとすると、ユーザの反応が遅い場合、応対シーンを一
時停止したり、互いに相違する顔の動画と顔の動画を接
続したカットで顔の位置や表情が突然変わったりして、
不自然な感じとなる。However, in actual use, if the response scene is edited and displayed on the spot according to the operation of the user, if the response of the user is slow, the response scene may be temporarily stopped or may differ from each other. The position and expression of the face suddenly change with the cut that connects the face video and the face video,
It feels unnatural.

【０００８】本発明の目的は、実写した多数の応対シー
ンとつなぎシーンを使用しているにもかかわらず、前後
の応対シーンがスムーズになるビジュアル応対方法およ
び装置を提供することである。SUMMARY OF THE INVENTION It is an object of the present invention to provide a visual response method and apparatus in which the preceding and succeeding response scenes are smooth despite the use of a large number of captured response scenes and a connection scene.

【０００９】[0009]

【課題を解決するための手段】本発明の第１の解決手段
は、パターン化された多数の応対シーンとつなぎシーン
を人間のモデルの実写により映像化して、それらの多数
の応対シーンとつなぎシーンをデータベースにする工程
と、ユーザの反応に応じて、そのデータベースから適当
な応対シーンを選択して画面に再生する工程と、再生す
る前後の応対シーンと応対シーンに適したつなぎシーン
を選択して、それらの前後の応対シーンと応対シーンの
間にそのつなぎシーンを挿入することにより前後の応対
シーンと応対シーンとの映像上のつなぎをスムーズにす
る工程を含むビジュアル応対方法である。A first solution of the present invention is to convert a large number of patterned reception scenes and connection scenes into real images of a human model, and to generate images of the multiple reception scenes and connection scenes. And a process of selecting an appropriate response scene from the database according to the user's reaction and reproducing it on the screen, and selecting a response scene before and after the reproduction and a connection scene suitable for the response scene. A visual response method including a step of inserting a connecting scene between the preceding and following response scenes to smoothly connect the preceding and subsequent response scenes on the video.

【００１０】本発明の第２の解決手段は、人間のモデル
を実写することによりパターン化された多数の応対シー
ンと多数のつなぎシーンを映像化したデータベースと、
データベース内の応対シーンとつなぎシーンを再生する
画面と、ユーザの反応を検出する検出手段と、その検出
手段によって検出された、ユーザの反応に応じて、デー
タベースから適当な応対シーンを画面に再生するために
選択する応対シーン選択手段と、連続する前後の応対シ
ーンと応対シーンの間に挿入する適当なつなぎシーンを
選択するつなぎシーン選択手段を備えたビジュアル応対
装置である。[0010] A second solution of the present invention is to provide a database in which a number of response scenes and a number of connection scenes that are patterned by actually photographing a human model are imaged;
A screen for reproducing a response scene and a connection scene in the database, a detecting means for detecting a user's reaction, and an appropriate response scene from the database is reproduced on the screen according to the user's reaction detected by the detecting means. The present invention is directed to a visual response apparatus provided with a response scene selection means for selecting a connection scene and a connection scene selection means for selecting an appropriate connection scene to be inserted between consecutive reception scenes before and after.

【００１１】好ましくは、つなぎシーンと応対シーンが
ランダム・アクセス・メモリーの形で記録されている。
つなぎシーンが、人間のモデルを実写することにより多
種多様の映像をデータ化した多数のインタラクティブな
画像からなる。応対シーンの最初のフレームと最後のフ
レームに最も一致したフレームをつなぎシーンの多数の
画像の各々から検出し、応対シーンのデータベースに最
初のフレームと最後のフレームに最も一致したつなぎシ
ーンの画像番号を予め設定しておく。前後の応対シーン
と応対シーンをスムーズにつなぐために、前の応対シー
ンを画面に再生した後、その応対シーンの最後のフレー
ムに最も一致しているつなぎシーンのフレームを再生
し、ユーザの反応により選択された後の応対シーンの最
初のフレームに一致しているつなぎシーンのフレームま
でを再生し、その後、後の応対シーンを再生する。つな
ぎシーンの再生は、一致フレームと一致フレームの位置
関係により、時間軸に対して順方向又は逆方向とする。
前後の応対シーンと応対シーンの間に挿入するつなぎシ
ーンが複数のつなぎシーンで構成され、つなぎシーンと
つなぎシーンの間に一致フレームをあらかじめ見つけて
おき、その一致フレーム同士をつないでいくことで、よ
り多くのつなぎシーンを前後の応対シーンと応対シーン
の間に挿入する。つなぎシーンの再生の際に、連続して
複数のつなぎシーンを連続して再生するが、乱数を利用
してランダムにそれらのつなぎシーンの再生順番や再生
方向を変える。Preferably, the connection scene and the response scene are recorded in the form of a random access memory.
The connecting scene is composed of a large number of interactive images obtained by converting various kinds of images into data by actually photographing a human model. The frame that best matches the first and last frames of the reception scene is detected from each of the multiple images of the connection scene, and the image number of the connection scene that best matches the first frame and the last frame is found in the database of the reception scene. Set in advance. In order to smoothly connect the front and back reception scenes and the reception scene smoothly, after playing the previous reception scene on the screen, play the frame of the connection scene that most closely matches the last frame of the reception scene, and depending on the user's reaction The frames up to the frame of the connected scene that matches the first frame of the selected reception scene are reproduced, and then the subsequent reception scene is reproduced. The reproduction of the connected scene is performed in the forward direction or the reverse direction with respect to the time axis depending on the positional relationship between the matching frames.
The connecting scene to be inserted between the front and rear reception scenes and the reception scene is composed of multiple connection scenes, a matching frame is found in advance between the connection scenes and the connection scene, and by connecting the matching frames, More connecting scenes are inserted between the preceding and following reception scenes. When the connected scenes are played back, a plurality of connected scenes are continuously played back, and the playback order and the playback direction of the connected scenes are randomly changed using random numbers.

【００１２】なお、本明細書において、「一致」とは、
人間の肉眼で見て不自然に感じない程度に一致している
ことを意味し、画像が完全に一致している場合だけでな
く、類似している場合も含む。[0012] In this specification, "match" means
This means that the images match to such an extent that they do not look unnatural when viewed by the naked eye of a human being, and include not only cases where images match completely but also cases where images are similar.

【００１３】[0013]

【発明の実施の形態】本発明者は、ランダム・アクセス
・メモリーが大容量化し安価になるにしたがって、静止
画や音声だけの情報案内サービスから、動画を利用した
情報案内サービスへとマーケットが変革していくことを
究明した。DESCRIPTION OF THE PREFERRED EMBODIMENTS The present inventor has changed the market from an information guide service using only still images and voices to an information guide service using moving images as the random access memory becomes larger and cheaper. I decided to do it.

【００１４】そのような状況の変化の中で、画面（ディ
スプレイ）の中でスムーズにユーザに反応する顔又は身
体（上半身、全身）のロボットはニーズが多いことを発
見した。ディスプレイ中の顔又は身体（上半身、全身）
のビジュアルロボットが、ユーザの操作に反応している
ように見られるためには、一連のスムーズな動きが重要
である。すなわち、ユーザの反応（操作等）に対して、
顔又は身体（上半身、全身）の動画が１つ１つ断片的に
応対するのでなく、ディスプレイ中の顔又は身体（上半
身、全身）が一連の動きの中で、常にユーザに自然な形
で対応しているように感じられることが重要である。In such a change of situation, it has been found that there is a great need for a face or body (upper body, whole body) robot that responds smoothly to a user on a screen (display). Face or body in display (upper body, whole body)
In order for the visual robot to appear to be responding to user operations, a series of smooth movements is important. That is, in response to a user's reaction (operation, etc.),
The face or body (upper body, whole body) in the display does not respond to the face or body (upper body, whole body) one by one, but always responds naturally to the user in a series of movements. It's important to feel like you're doing it.

【００１５】ビジュアルロボを実写モデルで制作する場
合、無背景で、あらかじめデータベースで決められた応
対シーン別に撮影を行う。全ての応対シーンの先頭フレ
ームと、最後尾フレームのモデルの顔又は身体が肉眼で
見て一致していれば、前後の応対シーンと応対シーンを
スムーズにつなぐことは問題ないが、撮影の時、実際の
モデルにそのような動作をしてもらうことは不可能であ
る。When a visual robot is produced using a real-life model, shooting is performed for each response scene determined in advance in a database without any background. If the face or body of the model of the first frame and the last frame of all the response scenes match with the naked eye, it is not a problem to smoothly connect the front and rear reception scenes and the reception scene, but at the time of shooting, It is impossible to have an actual model perform such an operation.

【００１６】前後の応対シーンと応対シーンがスムーズ
につなげられるように、撮影がうまくいったとしても、
実際のサービス等で利用するケースを考えると、ユーザ
の操作に時間がかかり、次のシーンが決まるまで、前の
応対シーンをストップしなければならない場合が考えら
れる。In order to smoothly connect the front and rear reception scenes and the reception scene, even if the shooting is successful,
Considering a case where the service is used in an actual service or the like, it may take a long time for the user to perform an operation, and the previous service scene may need to be stopped until the next scene is determined.

【００１７】本発明では、このような問題を解消するた
めに、多数のつなぎシーンを使用する。例えば、多数の
応対シーンを撮影するとき、応対シーンの映像をスムー
ズにつなげるためのシーン（これをつなぎシーンとよ
ぶ）を数多く撮影しておく。そして、サービスの運用時
に、応対シーンと応対シーンの間につなぎシーンを挿入
する。それにより、応対シーン間のスムーズな接続と、
応対シーンと応対シーンの時間調整を行う。In the present invention, in order to solve such a problem, a large number of connection scenes are used. For example, when shooting a large number of reception scenes, a number of scenes (referred to as connection scenes) for smoothly connecting the images of the reception scenes are shot. Then, during operation of the service, a connection scene is inserted between the reception scenes. As a result, a smooth connection between the reception scenes,
The reception scene and the reception scene are adjusted in time.

【００１８】本発明の好ましい実施態様を説明すると、
応対シーンを再生した後、その最後のフレームと最も一
致するつなぎシーンのフレームとをスムーズにつなぎ、
そこから、つなぎシーンを順方向または逆方向に再生し
て、時間調整を行う。客（ユーザ）の操作により次の応
対シーンが選択されたら、その応対シーンの先頭フレー
ムと一致するフレームになるまで、つなぎシーンを再生
し、その応対シーンへとスムーズにつなげるのである。In describing a preferred embodiment of the present invention,
After playing the reception scene, smoothly connect the last frame to the frame of the connection scene that matches the most,
From there, the connected scene is played back in the forward or reverse direction to adjust the time. When the next service scene is selected by the operation of the customer (user), the connected scene is reproduced until the frame corresponding to the first frame of the service scene is reached, and the connected scene is smoothly connected to the service scene.

【００１９】本明細書において、ユーザの操作や反応と
は、キーボードの操作のみでなく、音声や、身振り、手
振り、顔の表情なども含み、さらには人体センサーによ
り検出される脈搏、発汗その他も含むものとする。In this specification, the user's operation or reaction includes not only keyboard operation but also voice, gesture, hand gesture, facial expression, and the like, and further includes pulse, sweat, and the like detected by a human body sensor. Shall be included.

【００２０】つなぎシーンを複数用意しておくことによ
り、また、つなぎシーンの再生順をランダムに変えるこ
とにより、より多様なつなぎを再生することができる。By preparing a plurality of connection scenes and by randomly changing the reproduction order of the connection scenes, it is possible to reproduce more various connection scenes.

【００２１】本発明の利用分野は、多種多様である。典
型的な本発明の利用分野はサービス産業での客との応対
や情報案内などの分野である。音声だけのガイドより、
顔又は身体がディスプレイに映りユーザにガイドする方
が効果的な場合が多い。ビジュアルロボットとして画面
（ディスプレイ）に表示して次のような形態で利用する
ことができる。The fields of application of the invention are many and varied. A typical field of use of the present invention is in the field of dealing with customers and providing information guidance in the service industry. From a voice-only guide,
It is often more effective to show the face or body on the display and guide the user. It can be displayed on a screen (display) as a visual robot and used in the following forms.

【００２２】案内・会社の受付、テーマパークやイベント会場の案内、ホ
テルのフロント等。Guidance-Reception of the company, information on theme parks and event venues, hotel reception, etc.

【００２３】・ファーストフーズの店や自動販売機で客
のオーダーの受付や販売。Accepting and selling customer orders at First Foods stores and vending machines.

【００２４】・テーマパークやイベント会場のキャラク
ターを利用したロボ販売員。Robo salespeople who use characters from theme parks and event venues.

【００２５】・病院の窓口相談。・ Consultation at the hospital counter.

【００２６】教育・英語など語学は口の動きや表情が重要な要素。繰り返
しプレイも簡単。Education ・ Language such as English is an important element of mouth movement and facial expression. Easy to play repeatedly.

【００２７】・Ｑ＆Ａタイプの教育ソフトのように、先
生とインタラクティブに学習を進めていくことができ
る。[0027] Like Q & A type educational software, learning can be advanced interactively with the teacher.

【００２８】フレンド・占い、アドバイスをする。Friends ・ Fortune-telling and giving advice.

【００２９】・ジョークを言ったりお話をする。・ Speak or tell a joke.

【００３０】以上のような形態で、人間が行っていたサ
ービスをビジュアルロボとして代行させることができ
る。それにより、省力化がはかられる。In the above-described manner, services provided by humans can be substituted for visual robots. This saves labor.

【００３１】すでに、音声サービス分野では、駅の構内
放送、バスの車内放送、デパートの催し物案内等におい
てボイスロボットが活躍しているが、これらのサービス
の一部は本発明の１つの実施形態であるビジュアルロボ
に置き換えることが可能である。In the voice service field, voice robots are already active in station premises broadcasting, bus car broadcasting, department store entertainment guides, etc., but some of these services are part of the embodiment of the present invention. It is possible to replace it with a certain visual robot.

【００３２】更に、次のような実施形態も、本発明の範
囲内にある。Further, the following embodiments are also within the scope of the present invention.

【００３３】・パソコンやテレビの画面上での遊戯ゲー
ム・パソコンやテレビの画面上でのビジネスゲーム・パソコンやテレビの画面上での恋愛ゲーム本発明の典型的な実施態様の１つは、ビジュアルロボで
ある。ビジュアルロボは、実写モデルの画像をＰＣに取
り込んで制作する。例えば、コンピュータのディスプレ
イ上に表示される顔又は身体を主とする映像で構成し、
ユーザの反応や操作に対してスムーズに対応するインタ
ラクティブなものとする。サービス産業を例にとれば、
客の動作や反応に対して、あらかじめ定められた表情や
説明をさせることにより、案内員や販売員の代行をする
ことができる。このような実写モデルによる顔のビジュ
アルロボットは、サービス産業分野での応用範囲が非常
に広い。A game played on a personal computer or television screen A business game played on a personal computer or television screen A love game played on a personal computer or television screen One of the typical embodiments of the present invention is a visual game. Robo. The visual robot takes in an image of a live-action model into a PC and produces it. For example, a video composed mainly of a face or a body displayed on a computer display,
It should be an interactive one that responds smoothly to user reactions and operations. Taking the service industry as an example,
By giving a predetermined expression or explanation to the operation or reaction of the customer, it is possible to act on behalf of a guide or a salesperson. Such a visual robot based on a live-action model has a very wide range of applications in the service industry.

【００３４】ビジュアルロボのグラフィックス（主とし
て顔）は実写モデルを使用する。実写モデルの場合、有
名なタレントやスポーツ選手をビジュアルロボにするこ
とができるので、話題性や親近感のある客サービス用ビ
ジュアルロボットが実現できる。Visual robot graphics (mainly faces) use a live-action model. In the case of a live-action model, famous talents and athletes can be used as visual robots, so that a visual robot for customer service with topicality and a sense of closeness can be realized.

【００３５】ビジュアルロボの制作に際しては、ディジ
タル技術を駆使して、肉眼レベルでの画像の一致性を検
出したり、画像フレームのスムーズなつなぎを行ったり
して、映像処理をする。At the time of producing a visual robot, image processing is performed by making full use of digital technology to detect consistency of images at the level of the naked eye and to smoothly connect image frames.

【００３６】実写モデルを使用してビジュアルロボを制
作する場合、あらかじめユーザ（客）の反応や操作など
に対して適切なモデルの応対シーンを撮影し、データベ
ースにしておく。When a visual robot is produced by using a live-action model, a response scene of a model appropriate to the reaction and operation of the user (customer) is photographed in advance and stored in a database.

【００３７】撮影した応対シーンのデータベースをビジ
ュアルロボとして実際に使用する場合、ユーザの反応や
操作により、適当な応対シーンをデータベースから選択
し、ディスプレイに表示する。その時、応対シーンと応
対シーンの接続に以下のような問題がある。When the database of the photographed reception scenes is actually used as a visual robot, an appropriate reception scene is selected from the database and displayed on the display according to the reaction or operation of the user. At that time, there is the following problem in connection between the reception scene and the reception scene.

【００３８】・応対シーンの終わりと次の応対シーンの
初めの画像が一致していないと、つなぎが不連続で、不
自然な感じとなる。If the end image of the reception scene does not match the first image of the next reception scene, the connection is discontinuous, giving an unnatural feeling.

【００３９】・応対シーンの再生が終わった後、ユーザ
が次の反応または操作するまで、前の応対シーンの最後
の画像で止めておかなければならないので、映像がスム
ーズにつながらない。After the reproduction of the reception scene, the user must stop at the last image of the previous reception scene until the user performs the next reaction or operation, so that the video is not smoothly connected.

【００４０】そこで、本発明においては、次のような構
成を採用する。Therefore, the present invention employs the following configuration.

【００４１】・応対シーンと応対シーンの間にバラエテ
ィーなシーンを挿入して、ビジュアルロボの動作を自然
な感じにする。Inserting a variety of scenes between reception scenes to make the operation of the visual robot natural.

【００４２】・適当な長さのつなぎシーン（複数）を撮
影しておき、実際にビジュアルロボを客に応対させるた
めに、応対シーンと応対シーンを接続する時に、それら
の間につなぎシーンを組み合わせて挿入する。つまり、
つなぎ編集を行う。When a plurality of connecting scenes having an appropriate length are photographed, and the reception scenes are connected to each other in order to actually serve the visual robot to the customer, the connecting scenes are combined between them. Insert. That is,
Perform joint editing.

【００４３】・つなぎ部分が不自然に感じられないよう
に、応対シーンの先頭と最後の画像と最も一致している
つなぎシーンの画像を検出し、応対シーンからつなぎシ
ーン、つなぎシーンからつなぎシーン、つなぎシーンか
ら応対シーンへとつなぐパスをデータベース化してお
く。In order to prevent the connection portion from feeling unnatural, an image of the connection scene that best matches the first and last images of the reception scene is detected, and a connection scene from the reception scene, a connection scene from the connection scene, A database connecting paths from the connection scene to the reception scene is stored in advance.

【００４４】・応対シーンを再生した後、次の応対シー
ンが選択されるまでの間、つなぎシーンを再生し、時間
調整を行う。時間調整は、つなぎシーンの再生を順方
向、逆方向、順方向から逆方向への変更等により行う。After the reception scene is reproduced, the connection scene is reproduced and the time is adjusted until the next reception scene is selected. The time adjustment is performed by, for example, reproducing the connected scene in the forward direction, the reverse direction, or changing the forward direction to the reverse direction.

【００４５】・つなぎシーンを再生する場合、順方向、
逆方向、順方向から逆方向への任意の画像での切替（逆
方法／順方向）をランダムに行うことにより、つなぎシ
ーンにバラエティーさを持たせる。When playing back a connected scene,
By randomly performing switching (reverse method / forward direction) with an arbitrary image from the backward direction or the forward direction to the backward direction, a variety of connection scenes can be provided.

【００４６】[0046]

【実施例】図面を参照して、以下、本発明の実施例の１
つであるビジュアルロボを詳細に説明する。BRIEF DESCRIPTION OF THE DRAWINGS FIG.
One of the visual robots will be described in detail.

【００４７】（Ａ）ビジュアルロボのデータベースビジュアルロボのデータベースは、客の操作をビジュア
ルロボの応対シーンに関する応対シーンデータベース
と、前後の応対シーンと応対シーンをスムーズに接続す
るためのつなぎシーンデータベースからなる。(A) Visual Robo Database The Visual Robo database is composed of a response scene database relating to the customer's operation scenes of the visual robot, and a connecting scene database for smoothly connecting the preceding and following response scenes to the response scene. .

【００４８】１）応対シーンデータベース例図１は、応対シーンのデータベースの１例を示す。左の
欄に、客の操作や動作が記載されている。右の欄に、ビ
ジュアルロボの応対シーンに対応する音声の内容が記載
されている。1) Example of a Service Scene Database FIG. 1 shows an example of a service scene database. In the left column, the operations and actions of the customer are described. In the right column, the contents of audio corresponding to the response scene of the visual robot are described.

【００４９】２）つなぎシーンのデータベース例応対シーンからつなぎシーン、つなぎシーンからつなぎ
シーン、つなぎシーンから応対シーンへと画像をスムー
ズにつなぐために、それぞれのシーンの最後のフレーム
に接続すべき（最も一致した）次のシーンの最初のフレ
ームをデータベース化しておく。2) Example of a database of connection scenes In order to smoothly connect images from the reception scene to the connection scene, from the connection scene to the connection scene, and from the connection scene to the reception scene, the connection should be made to the last frame of each scene (most The first frame of the next (matched) scene is stored in a database.

【００５０】図２は、つなぎシーンのデータベースの１
例を示す。左の欄に、シーンの名称が記載されており、
右の欄に、画像フレームの番号が記載されている。FIG. 2 shows a database 1 of the connection scene.
Here is an example. The name of the scene is described in the left column,
In the right column, the numbers of the image frames are described.

【００５１】３）一致フレームの選択一致フレームの判定には、画像（フレーム）を構成する
各画素（ドット）毎のデータ（カラー情報、明度情報）
の一致性を利用する。すなわち、応対シーンの第１フレ
ームのデータとつなぎシーン１の全フレームデータの一
致性を判定する。つなぎシーン２，３とも同じ判定を行
い、最も一致しているフレームを選択する。同様に最後
のフレームに一致しているフレームも選択する。これら
の２つのフレーム番号を応対シーンの接続フレームデー
タベースとする。3) Selection of Matching Frame To determine a matching frame, data (color information, brightness information) for each pixel (dot) constituting an image (frame) is used.
Use the consistency of That is, the matching between the data of the first frame of the response scene and all the frame data of the connection scene 1 is determined. The same determination is made for the connection scenes 2 and 3, and the most matching frame is selected. Similarly, a frame that matches the last frame is also selected. These two frame numbers are used as a connection frame database of the response scene.

【００５２】リンクフレームはつなぎシーンとつなぎシ
ーンをスムーズにつなぐために、つなぎシーンとつなぎ
シーンの中で最も一致しているフレームを選択しておく
もので、つなぎシーンのフレーム同士を比較し、一致性
の最も高いフレームをリンクフレームのデータベースと
する。The link frame is used to select a frame that is the best match between the connection scenes and the connection scene in order to smoothly connect the connection scene and the connection scene. The frame with the highest probability is used as a link frame database.

【００５３】４）応対シーンのスムーズなつなぎと時間
調整方法図２の例を使用して、応対シーンのスムーズなつなぎと
時間調整方法を説明すると、応対シーン３の映像からつ
なぎシーンへリンクし、その後、ユーザの操作により対
応シーン２が選択され、つなぎシーンからその応対シー
ン２へスムーズに接続する。4) Method for smooth connection of reception scenes and time adjustment method Using the example of FIG. 2, a method for smooth connection and time adjustment of reception scenes will be described. The video of reception scene 3 is linked to the connection scene. Thereafter, the corresponding scene 2 is selected by the user's operation, and the connection scene is smoothly connected to the corresponding scene 2.

【００５４】図３は、前述のシーンの流れを示してい
る。つなぎシーンの再生は、順方向、逆方向、任意フレ
ームでの折り返しが自由にできる。応対シーン２が選択
された時に再生しているつなぎシーンのフレームからフ
レームＴ３−３３３迄は最短パスを通る。FIG. 3 shows the flow of the aforementioned scene. In the playback of the connected scene, the wrapping in the forward direction, the reverse direction, and an arbitrary frame can be freely performed. The shortest path passes from the frame of the connected scene being reproduced when the response scene 2 is selected to the frame T3-333.

【００５５】５）画像の一致判別方法画像の一致判別には既知の種々の方法を採用できるが、
２枚の静止画のヒストグラムを比較する方法について以
下に説明する。5) Method of judging image coincidence Various known methods can be adopted for image coincidence judgment.
A method for comparing the histograms of two still images will be described below.

【００５６】応対シーンの最後のフレームとつなぎシー
ンの一致フレームの判別や、つなぎフレームと応対シー
ンの最初のフレームの一致フレームの判別は、以下のよ
うにして行う。The determination of the matching frame between the last frame of the response scene and the connecting scene and the determination of the matching frame of the connecting frame and the first frame of the receiving scene are performed as follows.

【００５７】応対シーンの先頭フレームとつなぎシー
ンの各フレームについて、画面を構成する各ドットデー
タのヒストグラムの差をとる。フレームのＲＧＢ成分を
ＹＵＶに変換し、Ｙ（明るさ）成分を取り出す。各ドッ
ト毎のＹ値の差を求める。Ｙ値の差が０の場合、比較し
たフレームのそのドット（位置）は同じ明るさであるこ
とを示す。For the first frame of the response scene and each frame of the connected scene, the difference between the histograms of the dot data constituting the screen is calculated. The RGB components of the frame are converted to YUV, and the Y (brightness) component is extracted. The difference between the Y values for each dot is determined. If the Y value difference is 0, it indicates that the dot (position) of the compared frame has the same brightness.

【００５８】各ドットにおける差を合計する。The differences at each dot are summed.

【００５９】ドット毎でなく、画面をＮ×Ｍの区分に
分けて、各区分毎に前述の、の処理を行ってその差
を合計することもできる。It is also possible to divide the screen into N × M sections instead of each dot, perform the above-described processing for each section, and sum the differences.

【００６０】つなぎシーンの中で合計値が最も小さい
フレームが、応対シーンの先頭フレームと一致している
ものとする。It is assumed that the frame having the smallest total value in the connection scene matches the head frame of the response scene.

【００６１】（Ｂ）ファーストフーズ店向けビジュアル
ロボファーストフーズ店では、販売員がカウンター越しに客
からのオーダーを聞きとり、それをレジに打ち込み合計
金額を客に告げ集金を行う。販売員の接客は完全マニュ
アル化されている場合が多く、各販売員はいかにそのマ
ニュアルに沿ってスムーズに客応対をするかが求められ
る。(B) Visual Robo for First Foods Store At the first foods store, a sales clerk listens to an order from a customer through a counter, inputs the order into a cash register, tells the customer the total amount, and collects money. In many cases, sales staff are completely manualized, and it is required that each sales staff handle customers smoothly in accordance with the manual.

【００６２】ビジュアルロボは、音声認識機能を使用す
ることにより客の声に反応して注文を聞くシステムにす
ることもできるが、ここでは、音声認識の代わりに、客
によるキー操作を前提とした接客サービスについての例
を示す。The visual robot can be a system that listens to an order in response to a customer's voice by using a voice recognition function. However, here, it is assumed that a key operation by the customer is performed instead of voice recognition. The example about a customer service is shown.

【００６３】１）データベースの作成客との応対を分類する。1) Creation of a database Classification of reception with customers.

【００６４】応対フローチャート、応対ソフトを作成す
る。A service flow chart and service software are created.

【００６５】応対シーンの演出とセリフを決める。The production of the response scene and the dialogue are decided.

【００６６】各応対シーンをモデルに演じてもらい撮
影する。A model is shot for each response scene by the model.

【００６７】つなぎシーンを数カット撮影する。Several shots are taken of the connected scene.

【００６８】応対シーン、つなぎシーンの一致フレー
ムを自動検出する。A matching frame between the response scene and the connection scene is automatically detected.

【００６９】各応対シーンとつなぎシーンをデータベ
ース化する。A database is created for each response scene and the connection scene.

【００７０】データベース、応対シーン、つなぎシー
ン、応対ソフト、動画つなぎソフトをビジュアルロボシ
ステムに組み込む。The database, the reception scene, the connection scene, the reception software, and the moving image connection software are incorporated in the visual robot system.

【００７１】図４は、このようなデータベースの１例を
概念的に示している。FIG. 4 conceptually shows one example of such a database.

【００７２】２）フローチャートの作成従来、客が声で注文していたのをキー操作に改正す
る。2) Creation of Flowchart Conventionally, customers have ordered by voice, but amended to key operation.

【００７３】客の動作／操作とビジュアルロボの反応
を検討する。Consider the customer's actions / operations and the response of the visual robot.

【００７４】全体の流れを検討しフローチャートを作
成する。The overall flow is examined and a flowchart is created.

【００７５】図５は、左側にフローチャートを示し、右
側につなぎシーン（ただし１部のみ）の挿入例を示して
いる。FIG. 5 shows a flowchart on the left side, and shows an example of insertion of a connection scene (only one copy) on the right side.

【００７６】３）応対ソフト／動画再生ソフト実際にビジュアルロボをファーストフーズ店などで使用
する場合、フローチャートに従ってユーザ（客）の反応
・操作に応じた応対シーンを再生することになる。3) Reception software / moving image reproduction software When a visual robot is actually used in a first foods store or the like, a reception scene corresponding to a reaction (operation) of a user (customer) is reproduced according to a flowchart.

【００７７】応対ソフトは、ユーザの反応・操作によ
り、どの応対シーンを再生するか決定し、動画再生ソフ
トへ指示をする。The response software determines which response scene is to be reproduced according to the reaction and operation of the user, and instructs the moving image reproduction software.

【００７８】応対ソフトから、再生するシーンのフレー
ム番号を指示された動画再生ソフトは、その応対シーン
の再生を行う。The moving image reproducing software, in which the frame number of the scene to be reproduced is designated from the response software, reproduces the response scene.

【００７９】応対シーンの再生が終わったら、その最後
のフレームと最も一致しているフレームからつなぎシー
ンを再生する。After the reproduction of the response scene is completed, the connected scene is reproduced from the frame that most closely matches the last frame.

【００８０】ユーザが次の反応・操作をし、応対ソフト
から次の応対シーンの指示が来たら、動画再生ソフト
は、再生中のつなぎシーンから最短時間で次の応対シー
ンの先頭フレームに一致のフレームまで再生し、スムー
ズに次の応対シーンへとつなげる。When the user performs the next reaction / operation and receives an instruction of the next response scene from the response software, the moving image reproduction software promptly matches the first frame of the next response scene with the shortest time from the connected scene being reproduced. Play back to the frame and smoothly connect to the next reception scene.

【００８１】動画再生ソフトは、つなぎシーンの再生
順、方向をランダムに変更し、間を持たせると共に、バ
ラエティーな感じを演出する。The moving image reproduction software randomly changes the reproduction order and the direction of the connected scene to provide a pause and produce a variety of feelings.

【００８２】図６は、前述のような客とビジュアルロボ
との関係をシステム図として示している。FIG. 6 is a system diagram showing the relationship between the customer and the visual robot as described above.

【００８３】（Ｃ）受付の応対ビジュアルロボ会社やデパートなどの受付・案内嬢に代わるビジュアル
ロボの場合の手順を以下に説明する。(C) Visual Robo for Reception The procedure in the case of a visual robo in place of a reception / information guide for a company or department store will be described below.

【００８４】１）シナリオ、フローチャートの作成シナリオ：会社の受付に来た人に、面会相手の部やセク
ションをキーボードで操作してもらうために、ビジュア
ルロボの案内内容、客の操作に対する応対シーン、電話
の取り次ぎ方法などのシナリオを作成する。シナリオ作
成段階で、客の反応に合わせたいろいろな応対シーンを
設定する。1) Creation of Scenarios and Flowcharts Scenario: In order to have a person who comes to the company operate a part or section of a visitor with a keyboard, guidance contents of a visual robot, a response scene to a customer operation, Create scenarios such as how to route calls. In the scenario creation stage, various response scenes are set according to the customer's reaction.

【００８５】図７は、そのような応対シーンの１例を示
す。FIG. 7 shows an example of such a response scene.

【００８６】２）シーン撮影応対シーンと、つなぎシーンの一致フレームは、肉眼で
見たとき、なるべく同じであることが望ましいので、シ
ーンの撮影では、モデルからカメラの距離を一定にし
て、モデルは画面の中央の位置に設定する。2) Scene Shooting It is desirable that the matching frame between the response scene and the connecting scene be as close as possible when viewed with the naked eye. Therefore, in shooting a scene, the distance between the model and the camera is fixed, and the model is used. Set to the center of the screen.

【００８７】・応対シーン撮影シナリオ、フローチャートにより、応対シーンを決め、
実在のモデルの撮影を行う。Reception scene shooting A reception scene is determined based on a scenario and a flowchart.
Shoot a real model.

【００８８】図８は、受付けで想定されるいろいろな応
対シーンと、それらに対応するモデルの演技の例を示
す。FIG. 8 shows various response scenes assumed in the reception and examples of the performance of the models corresponding to them.

【００８９】・つなぎシーン撮影応対シーンと同じようなカメラアングルとモデルの位置
でつなぎシーンを複数枚撮影する。Shooting of a connection scene A plurality of connection scenes are shot at the same camera angle and model position as the response scene.

【００９０】図９は、各種つなぎシーンと、それらに対
応するモデルの演技の例を示す。FIG. 9 shows examples of various connection scenes and the performance of models corresponding thereto.

【００９１】３）一致フレームの検出応対シーン（１）の先頭フレームをつなぎシーンの各フ
レームと比較し、最も一致したフレームを自動的に特定
する。同様に応対シーン（１）の最終フレームと最も一
致したつなぎシーンのフレームも特定する。3) Detection of Matching Frame The first frame of the response scene (1) is connected to each frame of the connecting scene, and the most matching frame is automatically specified. Similarly, the frame of the connecting scene that best matches the final frame of the response scene (1) is also specified.

【００９２】応対シーン（２）以下も同様に先頭フレー
ムと最終フレームに最も一致したつなぎシーンのフレー
ムを特定する。In the response scene (2) and thereafter, similarly, the frame of the connected scene that best matches the first frame and the last frame is specified.

【００９３】つなぎシーンとつなぎシーンをスムーズに
接続するためのフレームを自動的に特定する。つなぎシ
ーン（１）とつなぎシーン（２）の各フレームの中で最
も一致しているフレームを特定する。同様に、つなぎシ
ーン（２）と（３）、（３）と（４）その他について
も、一致フレームを特定する。The frames for smoothly connecting the connection scenes to each other are automatically specified. The most identical frame among the frames of the connection scene (1) and the connection scene (2) is specified. Similarly, the matching frames are specified for the connection scenes (2) and (3), (3) and (4), and the like.

【００９４】４）データベース化・応対シーンデータベース客の反応、操作に対応する応対シーンをデータベース
（チャート）化する。客の反応、操作と応対シーンは原
則として１：１となるようにする。4) Creation of a database ・ Response scene database A database (chart) of response scenes corresponding to customer reactions and operations. The reaction, operation and reception scene of the customer are in principle 1: 1.

【００９５】・つなぎシーンデータベース応対シーン、つなぎシーンをスムーズにつなぐための一
致フレームをデータベース化する。Connection scene database A database of matching frames for smoothly connecting response scenes and connection scenes.

【００９６】５）ビジュアルデータの再生要領・客の反応・操作に合わせ、どの映像データ（応対シー
ン）が呼び出されるかがチャートにより自動的に決定さ
れる。5) Reproduction Procedure of Visual Data The video data (service scene) to be called is automatically determined by a chart according to the reaction and operation of the customer.

【００９７】・映像データ（応対シーン）が終わった
後、最後のフレームへ接続すべきつなぎデータベースの
一致フレームを再生し、その後、ランダム関数により順
／逆方向へ再生する。After the video data (response scene) is over, the matching frame of the connecting database to be connected to the last frame is reproduced, and then reproduced in the forward / reverse direction by a random function.

【００９８】・客が次の操作をしたら、チャートでそれ
に対応した映像データ（応対シーン）の再生準備をする
と同時に、現在再生中のつなぎシーンから、その応対シ
ーンの先頭フレームと一致しているつなぎシーンまで最
短パスで再生する。When the customer performs the following operation, the video data (corresponding scene) corresponding to the corresponding operation is prepared for reproduction on the chart, and at the same time, from the currently reproduced concatenated scene, the concordance with the first frame of the corresponding response scene is obtained. Play the shortest path to the scene.

【００９９】・つなぎシーンから応対シーンへ映像を切
り替えて再生する。Switching and reproducing the video from the connection scene to the response scene.

【０１００】・応対シーンが終わったら、つなぎシーン
へと接続し、今までのフローを繰り返す。When the reception scene is over, a connection is made to the connection scene, and the flow up to now is repeated.

【０１０１】（Ｄ）ゲームビジュアルロボ周知のゲーム「ジャンケン、ポン。あっち向いて、ホ
イ」をビジュアルロボにさせる場合の例を以下に説明す
る。(D) Game Visual Robot An example in which the well-known game “Janken, Pong.

【０１０２】１）ビジュアルロボのシーンとインターフ
ェース・応対シーン「ジャンケン、ポン」、（パー）「ジャンケン、ポン」、（グー）「ジャンケン、ポン」、（チョキ）「あいこでしょ」、（パー）「あいこでしょ」、（グー）「あいこでしょ」、（チョキ）「あっち向いてホイ」、（右）「あっち向いてホイ」、（左）右向き左向き・つなぎシーン笑いこける舌を出す頭をかくウィンクをする。1) Visual Robot scene and interface ・ Response scene “Janken, Pong”, (Par) “Janken, Pong”, (Goo) “Janken, Pong”, (Choki) “Aikodosho”, (Par) “ "Aiko desho", (Goo) "Aiko desho", (Choki) "Over there, hoi", (Right) "Over there, hoi", (Left) Right to left ・ Connecting scene Laughing tongue Out I do.

【０１０３】・客の反応センサーグラブインターフェース（下記の手の動きを判別）グー、パー、チョキ、右、左ヘッドセンサー（頭の動きを感知）右、左音声認識ジャンケン、ポンあいこでしょあっちむいて、ホイ２）ユーザとビジュアルロボのフロー図１０は、ユーザのフローとビジュアルロボのフローを
それぞれ左側と右側に例示的に示す。-Customer reaction sensor Grab interface (discriminating hand movements below) Goo, par, choki, right, left Head sensor (detects head movement) Right, left Speech recognition Janken, Pong Aikodyo 2) Flow of User and Visual Robo FIG. 10 exemplarily shows the flow of the user and the flow of Visual Robo on the left and right sides, respectively.

【０１０４】３）ビジュアルロボのシステム図図１１は、客との関連でビジュアルロボのシステム図を
示す。ビジュアルロボは、客の動作（グー、チョキ、パ
ーなど）をセンサーで検出したり、音声認識手段で客の
音声たとえば「ジャンケン、あいこで、あっちむいて」
などを音声認識し、それらに対する適当な応対シーンと
つなぎシーンをディスプレー（画面）に再生する。3) Visual Robot System Diagram FIG. 11 shows a visual robot system diagram in relation to customers. Visual Robo detects the customer's movements (goo, choki, par, etc.) with a sensor, and uses the voice recognition means to say the customer's voice, for example, "Janken, Aiko, go around"
And the like, and an appropriate response scene and a connection scene corresponding thereto are reproduced on a display (screen).

【０１０５】例えば、本発明によるビジュアル応対装置
は、客１に応対するビジュアルロボ２の形に構成され、
センサー３、音声認識手段４、ディスプレー（画面）
５、ランダム・アクセス・メモリー６などを備えてい
る。ランダム・アクセス・メモリー６には、人間のモデ
ルを実写することによりパターン化された多数の応対シ
ーンと多数のつなぎシーンを映像化した各種ソフトが記
録されている。ディスプレー（画面）５は、データベー
ス内の応対シーンとつなぎシーンを再生するものであ
る。センサー３と音声認識手段４は、ユーザの応対の内
容を検出する検出手段として使用するものである。For example, the visual response device according to the present invention is configured in the form of a visual robot 2 responding to a customer 1,
Sensor 3, voice recognition means 4, display (screen)
5, a random access memory 6, and the like. The random access memory 6 stores various software that visualizes a large number of response scenes and a large number of connection scenes that are patterned by actually photographing a human model. The display (screen) 5 reproduces a response scene and a connection scene in the database. The sensor 3 and the voice recognition means 4 are used as detection means for detecting the contents of the response of the user.

【０１０６】更に、本発明によるビジュアル応対装置
は、図には示されていないが、その検出手段によって検
出された、ユーザの応対の内容に応じて、データベース
から適当な応対シーンを画面に再生するために選択する
応対シーン選択手段と、連続する前後の応対シーンと応
対シーンの間に挿入する適当なつなぎシーンを選択する
つなぎシーン選択手段を備えている。Further, the visual response device according to the present invention reproduces an appropriate response scene from the database on the screen according to the content of the response of the user, which is not shown in the figure, but is detected by the detecting means. And a connecting scene selecting means for selecting an appropriate connecting scene to be inserted between the consecutive preceding and following receiving scenes.

【０１０７】図１２は、応対シーン１，３，７が選択さ
れ、前後の応対シーン１と３の間につなぎシーン２が挿
入され、更に、前後の応対シーン３と７の間につなぎシ
ーン４，５が挿入された例を示している。各シーンの右
側に示されているのは、画像フレーム番号である。矢印
で示されているように、つなぎが行われる。FIG. 12 shows that the response scenes 1, 3, and 7 are selected, the connection scene 2 is inserted between the front and rear reception scenes 1 and 3, and the connection scene 4 is inserted between the front and rear reception scenes 3 and 7. , 5 are inserted. Shown on the right side of each scene is the image frame number. As indicated by the arrow, the connection is made.

【０１０８】[0108]

【発明の効果】本発明によれば、実写モデルの画像を使
用するので、ＣＧやアニメに比較して制作費が格段に安
い。また、実在の有名タレントをモデルとして採用でき
るので、ユーザに親近感をもたせやすい。According to the present invention, since an image of a live-action model is used, the production cost is significantly lower than that of CG or animation. Also, since a real famous talent can be adopted as a model, it is easy to give the user a sense of closeness.

【０１０９】つなぎシーンが挿入されるので、ユーザの
反応に合わせて前後の応対シーンを自然な形で繋いで再
生できる。さらに、ユーザの次の反応や操作を待つと
き、時間的な間をもたせるために、繋ぎシーンをランダ
ムに再生できる。その結果、より自然な応対が可能とな
る。Since the connecting scene is inserted, the preceding and following response scenes can be connected and reproduced in a natural manner according to the user's reaction. Furthermore, when waiting for the user's next reaction or operation, a connection scene can be reproduced at random in order to allow time. As a result, a more natural response becomes possible.

【０１１０】つなぎシーンの再生パスを自動的にランダ
ム化すると、バラエティーに富んだ応対をすることがで
きる。By automatically randomizing the reproduction path of the connection scene, a variety of responses can be made.

[Brief description of the drawings]

【図１】応対シーンのデータベースの１例を示す。FIG. 1 shows an example of a database of response scenes.

【図２】つなぎシーンのデータベースの１例を示す。FIG. 2 shows an example of a connection scene database.

【図３】各種シーンの流れを示している。FIG. 3 shows the flow of various scenes.

【図４】データベースの１例を概念的に示している。FIG. 4 conceptually shows an example of a database.

【図５】左側にフローチャートを示し、右側につなぎシ
ーンの挿入例を示している。FIG. 5 is a flowchart on the left side, and shows an example of insertion of a connection scene on the right side.

【図６】客とビジュアルロボとの関係をシステム図とし
て示している。FIG. 6 shows a relationship between a customer and a visual robot as a system diagram.

【図７】応対シーンの１例を示す。FIG. 7 shows an example of a reception scene.

【図８】受付けで想定されるいろいろな応対シーンと、
それらに対応するモデルの演技の例を示す。FIG. 8 shows various reception scenes assumed in the reception;
Here are examples of the acting of the models corresponding to them.

【図９】各種つなぎシーンと、それらに対応するモデル
の演技の例を示す。FIG. 9 shows examples of various connection scenes and acting of models corresponding to the connection scenes.

【図１０】ユーザのフローとビジュアルロボのフローを
それぞれ左側と右側に例示的に示す。FIG. 10 exemplarily shows the flow of the user and the flow of the visual robot on the left and right sides, respectively.

【図１１】客との関連でビジュアルロボのシステム図を
示す。FIG. 11 shows a system diagram of a visual robot in relation to a customer.

【図１２】応対シーンの選択と、前後の応対シーン間へ
のつなぎシーンの挿入例を示している。FIG. 12 shows an example of selecting a response scene and inserting a connecting scene between the front and rear response scenes.

[Explanation of symbols]

１客２ビジュアルロボ３センサー４音声認識手段５ディスプレー（画面）６ランダム・アクセス・メモリー 1 customer 2 visual robot 3 sensor 4 voice recognition means 5 display (screen) 6 random access memory

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考） // Ｇ０６Ｆ 3/00 ６５４Ｈ０４Ｎ 5/91 Ｎ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) // G06F 3/00 654 H04N 5/91 N

Claims

[Claims]

1. A process in which a number of patterned reception scenes and connecting scenes are visualized by a real photograph of a human model, and the plurality of reception scenes and connecting scenes are converted into a database. A step of selecting an appropriate reception scene from the database and reproducing it on the screen; selecting a reception scene before and after reproduction and a connecting scene suitable for the reception scene; A visual reception method including a step of inserting a connection scene to smoothly connect a front-back reception scene and a reception scene on a video.

2. The visual response method according to claim 1, wherein the connecting scene comprises a large number of interactive images obtained by converting various kinds of images into data by actually photographing a human model.

3. The connecting frame which is most similar to the first frame and the last frame of the serving scene is detected from each of a number of images of the connecting scene, and the connecting frame which best matches the first frame and the last frame in the database of the serving scene. 3. The image number of a scene is set in advance.
Visual response method described in.

4. In order to smoothly connect the front and rear reception scenes and the reception scene, the previous reception scene is reproduced on the screen, and then the frame of the connection scene that most closely matches the last frame of the reception scene is reproduced. , Until the frame of the splicing scene that matches the first frame of the response scene after being selected by the user's reaction,
4. A method according to claim 1, wherein a subsequent response scene is reproduced.
Visual response method described in section.

5. The visual response method according to claim 1, wherein reproduction of the connected scene is performed in a forward direction or a reverse direction with respect to a time axis depending on a positional relationship between the matching frames. .

6. A connecting scene to be inserted between the preceding and following reception scenes and the reception scene is composed of a plurality of connection scenes,
A matching frame is found in advance between connecting scenes and connecting scenes, and by connecting the matching frames, more connecting scenes are inserted between the preceding and following reception scenes. 6. The visual response method according to any one of 5.

7. A playback system according to claim 1, wherein a plurality of connected scenes are continuously played back at the time of playing back the connected scenes, and the playback order of the connected scenes is changed at random using random numbers. The visual response method according to any one of claims 1 to 4.

8. A database in which a number of response scenes and a number of connection scenes patterned by actually photographing a human model are visualized, a screen for reproducing the reception scene and the connection scene in the database, and a user reaction. Detecting means for detecting, and detected by the detecting means,
Response scene selection means for selecting an appropriate response scene from the database to be reproduced on the screen in accordance with the user's reaction, and a connection for selecting an appropriate connection scene to be inserted between the consecutive reception scenes before and after the reception scene. Visual response device with scene selection means.

9. The connection scene and the reception scene are random.
9. The visual response device according to claim 8, wherein the visual response device is recorded in an access memory.

10. A connecting scene which best matches the first frame and the last frame of each of the service scenes from each image of the connecting scene, and is stored in the service scene database. The visual response device according to claim 8 or 9, wherein an image number is added.

11. In order to smoothly connect the preceding and following reception scenes and the reception scene, after reproducing the previous reception scene, a frame of the connection scene most matching the last frame of the reception scene is reproduced. 11. The frame according to any one of claims 8 to 10, wherein up to a frame of the connecting scene corresponding to the first frame of the response scene selected after the reaction is reproduced, and thereafter, the subsequent response scene is reproduced. Visual response device.

12. The visual response apparatus according to claim 8, wherein reproduction of the connected scene is performed in a forward direction or a reverse direction with respect to a time axis depending on a positional relationship between the coincident frames. .

13. A connecting scene to be inserted between the preceding and following service scenes is composed of a plurality of connecting scenes, a matching frame is found in advance between the connecting scenes, and the matching frames are connected. The visual response device according to any one of claims 8 to 12, wherein more connection scenes are inserted between the front and rear response scenes by performing the following steps.

14. A system according to claim 8, wherein a plurality of connected scenes are successively reproduced at the time of reproducing the connected scenes, and the reproduction order of the connected scenes is changed at random using random numbers. The visual response device according to any one of claims 1 to 4.