JPH07200632A

JPH07200632A - Information processor

Info

Publication number: JPH07200632A
Application number: JP5351055A
Authority: JP
Inventors: Hisashi Aoki; 恒青木; Shunichi Numazaki; 俊一沼崎; Toshimitsu Kaneko; 敏充金子
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1993-12-28
Filing date: 1993-12-28
Publication date: 1995-08-04
Anticipated expiration: 2018-01-27
Also published as: JP3372096B2

Abstract

PURPOSE:To provide an information processor which can produce an index for retrieval of the image information by the also effective processing and also can properly retrieve a desired image without applying any burden to a user. CONSTITUTION:A position detector 417 detects the eye positions of a user set to the image information when an access is performed to a storage medium 410 which records the image information. Then the detected eye positions are stored in relation to the image information. When the correspondence is analyzed between an image and the eyes of the user, a marked point decider 419 calculates the coordinates of the marked point. Then an edge recognizing part 404 and an image part extractor 405 extract a partial image. Based on this extracted image, an index is generated for retrieval of images. Then the index image is shown when the image information is reproduced so that a list is generated for selection of a desired image.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、情報処理装置に係り、
特に画像情報の検索に適した画像情報アクセス装置、簡
易な遠隔操作を実現する遠隔操作装置及びファイルのリ
ンク構造を表示装置上にマップとして表示させる機能を
有する情報記録装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information processing device,
In particular, the present invention relates to an image information access device suitable for searching image information, a remote operation device for realizing simple remote operation, and an information recording device having a function of displaying a file link structure as a map on a display device.

【０００２】[0002]

【従来の技術】画像を記録する装置は、使用者が撮影し
たい風景の光学的情報を直接フィルム上の粒子の化学変
化に変換、あるいは同情報を電磁的信号に変換して媒体
に残存させる。直接フィルムに記録するものとして写真
カメラや映画撮影機、信号として媒体に記録するものと
して電子スチルカメラやビデオレコーダーが実用化され
ており、媒体には磁性体のテープやディスク、光学的な
特性を部分的に変化させる光ディスクや記憶を保持する
機能を備えた集積回路などが用いられている。記憶媒体
の性能の向上や記録方法の改善により、単一の媒体に記
録できる画像情報の量は増大し、今後も更なる向上が期
待されている。2. Description of the Related Art An apparatus for recording an image directly converts optical information of a scene that a user wants to take into a chemical change of particles on a film, or converts the information into an electromagnetic signal to be left on a medium. Photo cameras and movie cameras are directly used for recording on film, and electronic still cameras and video recorders are used for recording on media as signals, and magnetic media such as tapes and disks, and optical characteristics are used for media. An optical disk that partially changes, an integrated circuit having a function of holding a memory, and the like are used. With the improvement of the performance of the storage medium and the improvement of the recording method, the amount of image information that can be recorded on a single medium has increased, and further improvement is expected in the future.

【０００３】このような画像記録装置では、蓄積された
画像情報の中から別途、目的の画像を探す際に効率よく
探索できるような目印をつけることが望まれる。これは
写真カメラやビデオカメラで撮影日付・時間をフィルム
に写し込む機能や、ＶＴＲ（ビデオテープレコーダー）
で録画開始の位置に特別な信号を記録するインデックス
機能として実用化されている。しかしこれらの場合、画
像内容に関する情報管理は人間に任されており、目的と
する内容を含む画像部分を検索するためには、記録場所
についての使用者の記憶をたどるか、記憶がなければす
べての画像を精査する必要がある。あるいは、あらかじ
め手作業で内容に関する記述と対応する記録場所とを記
録し、保存しておかなければならない。In such an image recording apparatus, it is desired to separately provide a mark from the accumulated image information so that the desired image can be searched efficiently. This is a function to copy the shooting date and time on a film with a photo camera or a video camera, and a VTR (video tape recorder).
Has been put into practical use as an index function for recording a special signal at the position where the recording starts. However, in these cases, information management regarding the image contents is left to humans, and in order to search for the image portion containing the desired contents, it is necessary to follow the user's memory of the recording location or Need to scrutinize the image of. Alternatively, the description about the content and the corresponding recording location must be manually recorded and saved in advance.

【０００４】この場合、不特定の使用者からの検索に対
応した内容記述を行うためには、画像中の検索対象とな
りうる、より多くの物体について客観的に記述しておか
なければならない一方で、物体の状態や相互の関係から
の検索を許容するためには、著述的に内容を描写してお
く必要がある。これは利用者に大きな作業量を要求する
だけでなく、索引情報に極めて大きい記憶領域を用意す
る必要をも生じ、媒体の使用量が増大する。In this case, in order to describe the contents corresponding to the search from an unspecified user, it is necessary to objectively describe more objects that can be searched in the image. , In order to allow retrieval from the state of objects and mutual relationships, it is necessary to depict the contents in writing. This not only requires a large amount of work for the user, but also needs to prepare an extremely large storage area for the index information, which increases the amount of medium used.

【０００５】内容の理解・記述をすべて自動化しようと
すると、様々な物体が明るさと色の違いだけで表されて
いる画面の中から、すべての物体について、物体同士の
分離、物体の認識、状態の記述といった過程をたどらな
ければならず、高度な知能的処理を備えたシステムが必
要となる。また前記のような著述的な描写には、さらに
大規模な知識データベースが必要とされ、この理由から
いまだ完成されたものは見られない。If all attempts are made to automate the understanding and description of the contents, all objects are separated from each other, the recognition of the objects, and the state of the objects are displayed on the screen in which various objects are represented only by the difference in brightness and color. It is necessary to follow a process such as the description of, and a system with advanced intelligent processing is required. Further, the above-mentioned writing description requires a larger-scale knowledge database, and for this reason, no one has been completed yet.

【０００６】したがって従来、特定の画像を検索する際
には、画像を総覧しながら目的の部分を探索するか、内
容についてあらかじめ緻密に記述された索引を参照して
探索するか、あるいは高度な処理系によって自動的に付
与された客観的で細分化された表現から探索することが
要求され、それぞれ使用者、索引製作者、システム設計
者に多大な負担を必要とした。また、意図した画像を装
置に理解させるためには、キーボードやマウスなど入力
装置による煩雑な作業を必要とし、さらに、使用者以外
の者あるいはシステムが用意した索引は、必ずしも特定
の使用者に適合した索引ではないという問題もあった。Therefore, conventionally, when retrieving a specific image, a target portion is searched while viewing the image, or an index in which contents are described in detail is referred to, or an advanced process is performed. It was required to search from the objective and subdivided expressions automatically given by the system, which required a great burden on the user, the index maker, and the system designer, respectively. In addition, in order for the device to understand the intended image, it is necessary to perform complicated work with an input device such as a keyboard and a mouse, and an index prepared by a person other than the user or the system is not always suitable for a specific user. There was also the problem that it was not an index.

【０００７】一方、従来、実現されていた遠隔操作装置
は、操作者が手に持って操作を行うものであった。家庭
用で、最も普及している遠隔操作装置はいわゆる赤外線
リモコンである。これは操作者が手に持って、赤外線放
射部を機械に向けてボタンを押すことにより、機械に赤
外線を投射し、所定の指示を出すものである。On the other hand, the remote operation device that has been realized conventionally has been operated by an operator holding it in his hand. The most popular remote control device for home use is a so-called infrared remote controller. In this, an operator holds the infrared ray emitting section toward the machine and pushes a button to project infrared rays on the machine and give a predetermined instruction.

【０００８】また、画像認識技術を利用して手振り認識
や身ぶり認識などを行い、これを遠隔操作に応用すると
いう研究もなされている。しかし画像処理によりこれら
を認識するためには背景を単調にしなければならないと
か、画面の中央にどのくらいの大きさで手が写っていな
くてはならないとか、制約条件が厳しく、いまだ実用化
には至っていない。[0008] Further, studies have also been conducted to recognize hand gestures and gestures by utilizing image recognition technology and apply this to remote control. However, in order to recognize these by image processing, the background must be monotonous, how large the hand must be in the center of the screen, and there are severe constraints, so it is still in practical use. Not in.

【０００９】この他、三角測量の原理を用いたステレオ
法によって物体の３次元的な形状を把握し、認識に役立
てようとする研究もある。[0009] In addition, there is also a study to grasp the three-dimensional shape of an object by a stereo method using the principle of triangulation and to utilize it for recognition.

【００１０】一方、コンピュータ等の情報機器におい
て、ある記録媒体上にどのようなファイルが記録されて
いるのかを確認する機能は、従来からほとんどのシステ
ムにおいて使われていた。これは、記録媒体上に記録さ
れた情報が直接目でみることができないため、ユーザは
システムに情報を読みとらせ、ファイル名等のファイル
に関する情報を表示装置を使って表示するといったもの
である。On the other hand, the function of confirming what kind of file is recorded on a certain recording medium in an information device such as a computer has been conventionally used in most systems. This is because the information recorded on the recording medium cannot be seen directly by the user, so that the user can make the system read the information and display the information about the file such as the file name on the display device. .

【００１１】カセットテープやビデオテープに音楽、ビ
デオを記録した場合には、ユーザは自分で記録内容を記
述したシールをテープに直接張り付けることによって記
録媒体上の情報ファイルの内容がわかるようにしてい
る。このような方法は記録媒体上に記録できるファイル
数が少ない場合のみ可能である。ファイル数が多くなる
場合には、ファイルを構造化し、記録しておくことと、
それを見やすく表示することが必要になってくる。When music or video is recorded on a cassette tape or a video tape, the user directly attaches a sticker describing the recorded contents to the tape so that the contents of the information file on the recording medium can be understood. There is. Such a method is possible only when the number of files that can be recorded on the recording medium is small. If the number of files is large, structure the files and record them,
It becomes necessary to display it easily.

【００１２】ファイルの構造で最も単純なのは、ファイ
ルが１次元的に順番に並んでいるものである。このよう
なファイルを表示する場合には、ファイル名を順番に表
示するとか、ファイルをアイコンで表現して順番に並べ
るといった方法がとられる。ときには、表示の順番をフ
ァイルの持つ何らかの属性、例えばファイル名のアルフ
ァベット順や記録時間に従って並べ替え、表示すること
もある。The simplest file structure is that the files are one-dimensionally arranged in order. When displaying such a file, a method of displaying the file name in order, or expressing the file by an icon and arranging them in order can be adopted. At times, the display order may be rearranged and displayed according to some attribute of the file, for example, alphabetical order of file names or recording time.

【００１３】もう少し複雑なファイル構造としては、木
構造がある。これはディレクトリと呼ばれる単位にファ
イルが属しており、このディレクトリは親ディレクトリ
と子ディレクトリとを持つ。一つのディレクトリは複数
の子ディレクトリを持つこともあるが、親ディレクトリ
は一般に一つである。このようにファイルを木構造にす
ることにより、ユーザは多くのファイルを扱うことが容
易になった。すなわち、先に述べた１次元的なファイル
構造は、一つのディレクトリに全てのファイルが存在し
ていることになり、これを表示させるとファイル数が多
い場合には目的のファイルを見つけるのが困難であっ
た。そこで、ユーザは関係のあるファイルを同じディレ
クトリにまとめることにより、関係のないファイルを見
ることなく、比較的少ないファイルの表示の中から目的
のファイルを探すことができるようになった。このよう
なファイル構造の表示方法としては、一つのディレクト
リの中のファイルだけを表示する方法、ディレクトリの
木構造をマップとして表示する方法などがある。A tree structure is a slightly more complicated file structure. Files belong to a unit called a directory, and this directory has a parent directory and a child directory. A directory may have multiple child directories, but generally there is only one parent directory. The tree structure of the files makes it easy for the user to handle many files. That is, the one-dimensional file structure described above means that all files exist in one directory, and when this is displayed, it is difficult to find the target file when the number of files is large. Met. Therefore, by collecting related files in the same directory, the user can search for a target file from the display of relatively few files without looking at unrelated files. As a method of displaying such a file structure, there are a method of displaying only files in one directory, a method of displaying a tree structure of a directory as a map, and the like.

【００１４】これに対し、最近ではファイルの木構造で
は表現できないようなファイルの関係も表せるような構
造を作ろうとすることがなされている。従来の木構造で
は、ファイルは必ず一つのディレクトリに属し、これと
別のディレクトリに属しているファイルとの関係は自由
に定義できなかった。これに対して、関係しているファ
イル同士をリンクで結び、関係のあることを示すという
構造が取り入れられるようになってきた。これはいわゆ
るハイパーテキストと呼ばれる構造で、このような構造
ならば任意の２つのファイルの関係を自由に表せる。ま
た、ファイル同士だけでなく、ファイルの中にまで、立
ち入ることもできる。例えば、画像ファイルとして風景
の写真の中に猫がいる画像があった場合に、この画像の
猫の部分と猫の鳴き声の音声ファイルとをリンクで結ぶ
ことができる。このようなことまで考えると、ハイパー
テキストは非常に複雑な関係までも表現が可能な構造と
なっている。On the other hand, recently, it has been attempted to create a structure that can represent the relationship of files that cannot be expressed by the tree structure of files. In the conventional tree structure, a file always belongs to one directory, and the relation between this and a file belonging to another directory cannot be freely defined. On the other hand, a structure has been introduced in which related files are linked with each other to show that they are related. This is a so-called hypertext structure, and with such a structure, the relationship between any two files can be freely expressed. You can also enter not only files but also files. For example, when there is an image of a cat in a landscape photograph as an image file, the cat portion of this image and the voice file of the bark of the cat can be linked by a link. Considering such things, hypertext has a structure that can express even a very complicated relationship.

【００１５】しかしその一方で、構造が複雑になりす
ぎ、ユーザ自身がどのような構造になっているのか把握
しづらいといった問題も生じてきた。これを解消するた
めの方法の一つに、ファイルのハイパーテキスト構造を
マップとして表示するという方法がある。これはファイ
ルをアイコンで表し、さらにファイルに張られたリンク
を線で表現し、ファイル間を結ぶことにより構造を表現
するものである。マップによりファイルのハイパーテキ
スト構造を直接表示することで、ユーザは全体の構造、
注目しているファイルの他のファイルとの関係などを知
ることができるようになる。On the other hand, however, there has been a problem that the structure becomes too complicated and it is difficult for the user to grasp what the structure is. One way to solve this is to display the hypertext structure of the file as a map. This represents a file by an icon, a link drawn in the file by a line, and a structure by connecting the files. By displaying the hypertext structure of the file directly through the map, the user can
You will be able to know the relationship between the file you are interested in and other files.

【００１６】しかし、このようにファイルのハイパーテ
キスト構造をマップ表示した場合に、ファイルアイコン
をファイル名のアルファベット順、記録時間順などによ
って並べると、ファイル間に張られたリンクに従ってフ
ァイルアイコンを線で結んだ場合にリンク線が複雑にな
り、結局構造がよくわからないということになる。ま
た、一見近くにあるファイル同士が関係ありそうに見え
るのにリンクは張られておらず無関係で、遠くにあるフ
ァイル同士が実はリンクで結ばれていたりといった不具
合も生じてしまう。これは、マップ表示する際に、ファ
イルの位置とリンクが密接な関係を持っているにも関わ
らず、ファイル間に張られたリンクを考慮していないた
めである。However, when the hypertext structure of a file is displayed in a map in this way, if the file icons are arranged in alphabetical order of file name, recording time order, etc., the file icons are drawn as lines according to the links provided between the files. When they are connected, the link line becomes complicated and the structure is not well understood. Also, it seems that files that are near each other seem to be related, but they are not linked because they are not linked, and there is a problem that files that are far away are actually linked by a link. This is because when the map is displayed, the links between the files are not taken into consideration, even though the positions of the files and the links have a close relationship.

【００１７】[0017]

【発明が解決しようとする課題】上述したように従来の
画像記録再生装置では、使用者の意図する画像を検索す
るための索引を、検索しやすい形で利用するためには、
大規模な索引を映像製作者側が用意するか、複雑で高度
な画像認識システムに客観的な内容記述をさせることが
要求され、索引が冗長で、使用者にとって必ずしも効率
的でない、処理時間が増大する、処理系が複雑で高価で
ある、使用者の負担が大きいという問題があった。As described above, in the conventional image recording / reproducing apparatus, in order to use the index for retrieving the image intended by the user in a form that is easy to retrieve,
It is required that the video producer prepares a large-scale index or makes a complex and sophisticated image recognition system objectively describe the content. The index is redundant, not always efficient for the user, and the processing time increases. However, there are problems that the processing system is complicated and expensive, and that the burden on the user is heavy.

【００１８】また、従来の遠隔操作装置は、手に持って
操作しなくてはならず、装置が手元にない場合は操作す
ることができない。また汚れた手で操作するのは好まし
くないという問題があった。画像処理による手振り認識
などの技術においても、認識技術の未熟さから、ノイズ
の影響を受け、操作者の手を正確に認識することが難し
い。このため、特に家庭内において、リモコン装置など
のいかなるデバイスも手に持ったり装着したりすること
なく、素手のみで遠隔操作できる環境を提供すること
は、非常に望ましいにも拘らず実現されていない。The conventional remote control device must be operated by holding it in the hand, and cannot be operated unless the device is at hand. There is also a problem that it is not preferable to operate with dirty hands. Even in techniques such as hand gesture recognition by image processing, it is difficult to accurately recognize the operator's hand due to the influence of noise due to immaturity of the recognition technique. For this reason, it has been extremely desirable to provide an environment in which it is possible to perform remote control only with bare hands, without holding or wearing any device such as a remote control device in the home, especially, even at home. .

【００１９】更に、従来の情報記録装置では、記録され
たファイルのハイパーテキスト構造をマップ表示した場
合、ファイルアイコンの位置とリンクとの関係が考慮さ
れていないためにファイル構造が複雑でわかり難いとい
う問題があった。Further, in the conventional information recording apparatus, when the hypertext structure of the recorded file is displayed as a map, the file structure is complicated and difficult to understand because the relationship between the position of the file icon and the link is not taken into consideration. There was a problem.

【００２０】本発明は、使用者に負担をかけずに、効率
的な処理で画像情報検索のための索引を作成し、目的と
する画像を適切に検索することができる画像情報アクセ
ス装置を提供することを第１の目的とする。The present invention provides an image information access apparatus which can create an index for image information retrieval by an efficient process and appropriately retrieve a target image without imposing a burden on the user. The first purpose is to do so.

【００２１】また、本発明は、いかなるデバイスを手に
持つことなく、操作者の素手のみでシステムを操作する
ことができる遠隔操作装置を提供することを第２の目的
とする。A second object of the present invention is to provide a remote control device capable of operating the system with only the bare hands of the operator without holding any device in the hand.

【００２２】更に、本発明は、ファイルのリンク構造が
容易に分かる情報記録装置を提供することを第３の目的
とする。Further, a third object of the present invention is to provide an information recording device in which the file link structure can be easily understood.

【００２３】[0023]

【課題を解決するための手段】本発明に係る画像情報ア
クセス装置は、画像情報を記録する画像情報記録手段
と、この画像情報記録手段に対するアクセスの際に使用
者の前記画像情報に対する視線の位置を検出する視線検
出手段と、この視線検出手段で検出された視線の位置を
前記画像情報と関連付けて記憶する視線位置情報記憶手
段と、この視線位置情報記憶手段に記憶された視線の位
置に基づいて前記画像情報記録手段に記録された画像情
報中の注目部分を抽出し前記画像情報の検索のための索
引情報を生成する索引情報生成手段とを備えたことを特
徴とする。An image information access device according to the present invention includes an image information recording means for recording image information and a position of a line of sight of a user with respect to the image information when accessing the image information recording means. Based on the position of the line of sight stored in the line-of-sight position information storage unit, and a line-of-sight position information storage unit that stores the position of the line-of-sight detected by the line-of-sight detection unit in association with the image information. Index information generating means for extracting a target portion in the image information recorded in the image information recording means and generating index information for searching the image information.

【００２４】本発明に係る遠隔操作装置は、距離画像を
入力する距離画像入力手段と、この距離画像入力手段に
入力された距離画像の変化量を検出する変化量検出手段
と、得られた距離画像およびその変化量から動いている
物体の大きさ、方向及び移動距離を算出し、これらの情
報から動いている物体が操作者の手であることを認識す
る動き物体検出手段と、この動き物体検出手段で検出さ
れた手の画面内での動きを追跡する動き物体追跡手段
と、この動き物体追跡手段での追跡結果に従って画面上
にカーソルを表示するカーソル表示手段と、前記動き物
体追跡手段で追跡していた手が所定時間静止したことを
検出したら、又は予め決められた動きをしたことを検出
したら、そのカーソル位置に基づいて予め登録された所
定のコマンドを実行するコマンド実行手段とを備えたこ
とを特徴とする。The remote control device according to the present invention includes a distance image input means for inputting a distance image, a change amount detecting means for detecting a change amount of the distance image input to the distance image input means, and the obtained distance. A moving object detection unit that calculates the size, direction, and moving distance of a moving object from an image and its change amount, and recognizes that the moving object is the operator's hand, and the moving object. In the moving object tracking means, a moving object tracking means for tracking the motion of the hand detected by the detection means in the screen, a cursor display means for displaying a cursor on the screen according to the tracking result by the moving object tracking means, and the moving object tracking means. When it detects that the hand being tracked has been stationary for a predetermined time or when it has detected a predetermined movement, it executes a predetermined command registered in advance based on the cursor position. Characterized by comprising a that command execution unit.

【００２５】本発明に係る情報記録装置は、複数のファ
イルを関連のあるもの同士リンクによって相互に関連付
けて記憶する記録手段と、この記録手段に記録された複
数のファイルとそれらの間のリンクの構造を表示する際
の見やすさを決定するパラメータを評価して最適なパラ
メータを求める最適化手段と、この最適化手段によって
求められた最適なパラメータによって特定される前記複
数のファイルの間のリンク構造をファイルアイコンとそ
れを関連付ける情報で表示する表示手段とを備えたこと
を特徴とする。The information recording apparatus according to the present invention includes a recording means for storing a plurality of files in association with each other by linking the related files, a plurality of files recorded in the recording means, and a link between them. Optimizing means for evaluating the parameters that determine the visibility when displaying the structure and obtaining the optimum parameters, and the link structure between the plurality of files specified by the optimum parameters obtained by the optimizing means Is provided with a display means for displaying the file icon and information associating the file icon with the file icon.

【００２６】[0026]

【作用】本発明の画像情報アクセス装置によれば、画像
記録または再生時の使用者の視線の位置を検出して画像
情報と関連付けて記録し、その記録内容から画像情報内
での視線の位置、動き、および停留時間等を根拠に、画
像情報内の使用者が注目している部分を選択的に抽出
し、画像情報の検索のための索引を生成する。これによ
り、使用者がとりわけ興味をひいた物体や部分を全画像
の中から選択し、選択された部分を索引とすることがで
きる。According to the image information access device of the present invention, the position of the line of sight of the user at the time of recording or reproducing the image is detected and recorded in association with the image information, and the position of the line of sight within the image information is recorded from the recorded content. , A motion, a dwell time, and the like are used to selectively extract a portion of the image information that the user is paying attention to, and generate an index for searching the image information. This allows the user to select an object or part that is of particular interest from all the images and use the selected part as an index.

【００２７】本発明では、画像の通常の観察、記録又は
再生の過程で、特別な情報入力操作を必要とせずに関心
の高い部分のみが特定されて索引が生成されるため、使
用者の負担を大幅に削減することができる。また、注目
点を中心に解析を行うので、全画像に関する認識、理解
を必要とせず、処理が簡素化される。According to the present invention, during normal observation, recording, or reproduction of an image, only a portion of high interest is specified and an index is generated without requiring a special information input operation, so that the burden on the user is increased. Can be significantly reduced. Further, since the analysis is performed focusing on the points of interest, the recognition and understanding of all the images are not required, and the processing is simplified.

【００２８】また、本発明の遠隔操作装置によれば、形
状の分析によって手であると判断するのではなく、大体
の形状とその動きの情報から、手であることを判断す
る。具体的には、例えば操作者が手を前方に（これから
操作を行うという意思と共に）伸ばしたとき、それを手
であると判断できるように、手と同程度の大きさのもの
がある程度の距離（例えば片腕の長さの半分くらい）だ
け、前方に移動したことを検出し、それを手であると判
断する。そのあとはその部分を追跡することにより、手
の動きが入力できるので、画面上のポインティングデバ
イスとして用いることができる。したがって、例えば一
定時間停止したり、特定の動きをした場合に、カーソル
位置に基づいて特定のコマンドを実行することができ
る。Further, according to the remote control device of the present invention, it is not judged that the hand is a hand by analyzing the shape, but the hand is judged from the information of the general shape and the movement thereof. Specifically, for example, when the operator extends his / her hand forward (with the intention of performing an operation), it can be determined that it is a hand, so that the size of the hand is similar to a certain distance. It detects that it has moved forward (for example, about half the length of one arm) and determines that it is a hand. After that, the movement of the hand can be input by tracing the portion, so that it can be used as a pointing device on the screen. Therefore, for example, when stopped for a certain period of time or when a specific movement is performed, a specific command can be executed based on the cursor position.

【００２９】本発明によって、リモコン装置などの一切
のデバイスを手に持つことなく、また装着することな
く、素手のみでシステムを操作する環境が実現できる。According to the present invention, it is possible to realize an environment in which the system can be operated only with bare hands without holding or mounting any device such as a remote controller on the hand.

【００３０】更に、本発明の情報記録装置によれば、マ
ップの見やすさのパラメータとして、例えば表示したと
きのリンクを表す線の交差数、リンクを表す線の分散、
リンクを表す線の長さの総和の一つ、もしくはいくつか
を複合して用い、このパラメータの値を最適にするファ
イルアイコンの表示位置の組み合わせを組み合わせ最適
化手段を用いて見つけ出し、表示させるようにしている
ので、表示装置上にファイルのハイパーテキスト構造を
マップとして表示したときに、あるパラメータを最適に
する位置にファイルアイコンが表示される。このパラメ
ータは見やすさを代表するものを選んでいるので、この
値を最適にする表示位置は見やすさの点で優れているは
ずである。このようにマップ表示することにより、マッ
プ表示の本来の役割である。ユーザがファイル構造を理
解する、という意味において優れた表示が可能になる。Further, according to the information recording apparatus of the present invention, as the parameter of the visibility of the map, for example, the number of intersections of the lines representing the links when displayed, the distribution of the lines representing the links,
Use one or some of the total lengths of the lines that represent links to combine and use the combination optimizing means to find and display the combination of file icon display positions that optimizes the value of this parameter. Therefore, when the hypertext structure of the file is displayed as a map on the display device, the file icon is displayed at the position where a certain parameter is optimized. Since this parameter is selected to represent the legibility, the display position that optimizes this value should be excellent in terms of legibility. By displaying the map in this way, it is the original role of the map display. The display is excellent in the sense that the user understands the file structure.

【００３１】[0031]

【実施例】以下、図面を参照して本発明の実施例につい
て説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００３２】まず、図１〜図６を参照して、本発明の実
施例に係る画像情報アクセス装置について説明する。First, an image information access apparatus according to an embodiment of the present invention will be described with reference to FIGS.

【００３３】図１は、画像記録装置の構成を示すブロッ
ク図で、光ディスクを用いたディジタルビデオ装置の例
を示している。この装置は、撮影者が自分で被写体に装
置を向ける場合で、動画撮影時に撮影者の視線を観察
し、画像中の索引となりうる物体を選択する例である。FIG. 1 is a block diagram showing the arrangement of an image recording apparatus, showing an example of a digital video apparatus using an optical disc. This device is an example in which the photographer views the photographer's line of sight at the time of shooting a moving image and selects an object that can be an index in an image when the device himself or herself points the device at the subject.

【００３４】記録される映像信号は入力端子１０１を介
して入力され、Ａ／Ｄ変換装置１０２によって量子化さ
れた後、符号化装置１０３によって記録に適した信号に
変換され、記録信号発生装置１０４を通じて記憶媒体１
０５に記録される。A video signal to be recorded is input through an input terminal 101, quantized by an A / D converter 102, converted into a signal suitable for recording by an encoder 103, and a recording signal generator 104. Through storage medium 1
Recorded in 05.

【００３５】これと同時に入力端子１０６には、撮影者
の眼球の動きが電気信号で供給される。この眼球運動信
号はＡ／Ｄ変換装置１０７によって量子化された後、位
置検出器１０８によって画像中の視点位置情報として計
算される。計算された視点位置情報は符号化装置１０９
によって記録に適した信号に変換されて、映像信号とと
もに記録信号発生装置１０４の出力により記憶媒体１０
５に記録される。At the same time, the movement of the photographer's eyeball is supplied to the input terminal 106 as an electric signal. This eye movement signal is quantized by the A / D converter 107, and then calculated by the position detector 108 as viewpoint position information in the image. The calculated viewpoint position information is used as the encoding device 109.
Is converted into a signal suitable for recording by the storage medium 10 by the output of the recording signal generator 104 together with the video signal.
Recorded in 5.

【００３６】ここで、視点位置情報は映像信号の記録信
号発生装置１０４と異なる記録信号発生装置によって、
記憶媒体１０５の映像信号と異なる領域に格納してもよ
い。また記憶信号発生装置、記憶媒体ともに映像信号と
異なるものを用いて記録してもよい。Here, the viewpoint position information is obtained by a recording signal generator different from the recording signal generator 104 for the video signal.
You may store in the area | region different from the video signal of the storage medium 105. Further, both the storage signal generator and the storage medium may be recorded by using different ones from the video signal.

【００３７】入力端子１１０にはスイッチ、ボタンなど
を通じて撮影者の指示をシステムコントローラ１１１に
伝える信号が供給される。この信号により画像記録装置
は録画の開始、終了やズームイン、ズームアウトなどを
制御する。装置の制御情報１１２は、符号化装置１１３
によって、やはり記録に適した信号に変換されて映像信
号、視線位置情報とともに、記録信号発生装置１０４の
出力により記憶媒体１０５に記録される。A signal for transmitting a photographer's instruction to the system controller 111 is supplied to the input terminal 110 through a switch, a button and the like. With this signal, the image recording apparatus controls the start and end of recording, zoom-in, zoom-out, and the like. The device control information 112 is the encoding device 113.
Then, the signal is also converted into a signal suitable for recording, and is recorded in the storage medium 105 by the output of the recording signal generator 104 together with the video signal and the line-of-sight position information.

【００３８】撮影者による指示方法としては、索引とし
たい物体に注目しつつ撮影している際に装置にそれを知
らせる入力信号を用意して、入力端子１１０から上記の
経路を経る装置制御情報１１２に含めてもよい。As the instruction method by the photographer, an input signal for notifying the device of the object to be indexed is prepared, and the device control information 112 from the input terminal 110 via the above path is prepared. May be included in.

【００３９】ここで装置制御情報は、符号化装置１１３
の後、映像信号の記録信号発生装置１０４と異なる記録
信号発生装置によって、記憶媒体１０５の映像信号と異
なる領域に格納してもよい。また記憶信号発生装置、記
憶媒体ともに映像信号と異なるものを用いて記録しても
よい。Here, the device control information is the coding device 113.
After that, it may be stored in an area of the storage medium 105 different from that of the video signal by a recording signal generator different from the video signal recording signal generator 104. Further, both the storage signal generator and the storage medium may be recorded by using different ones from the video signal.

【００４０】図２は索引作成装置の構成を示すブロック
図である。この装置は、図１の画像記録装置によって記
録された映像信号、視線位置情報および装置制御情報に
基づいて、索引を作成するものである。FIG. 2 is a block diagram showing the structure of the index creating device. This apparatus creates an index on the basis of the video signal recorded by the image recording apparatus shown in FIG. 1, the line-of-sight position information, and the apparatus control information.

【００４１】記憶媒体２０１に記憶された映像信号、視
線位置情報および装置制御情報は記憶信号読出装置２０
２によって取り出され、映像信号２０３、視線位置情報
２１７および装置制御情報２１２に分けられる。ここ
で、記憶媒体は前記のように映像信号、視線位置情報、
装置制御情報で異なるものを用いてもよい。また、同一
記憶媒体を用いても異なる読みだし装置を用いて、あら
かじめ記録された異なる領域から取り出してもよい。映
像信号２０３は復号器２０４によって記録信号からディ
ジタル画像信号に変換され、画像蓄積部２０５に一定時
間分の画像が蓄えられる。The video signal, line-of-sight position information and device control information stored in the storage medium 201 are stored in the storage signal reading device 20.
2 and is divided into a video signal 203, line-of-sight position information 217, and device control information 212. Here, the storage medium is, as described above, a video signal, line-of-sight position information,
Different device control information may be used. In addition, the same storage medium may be used, or different reading devices may be used to retrieve from different prerecorded areas. The video signal 203 is converted from a recording signal into a digital image signal by the decoder 204, and an image for a fixed time is stored in the image storage unit 205.

【００４２】一方、装置制御情報２１２も同様に復号器
２１３によって記録信号からディジタル制御信号に変換
され、システムコントローラ２１４に供給される。シス
テムコントローラ２１４では、入力端子２１５から入力
される使用者の指示を受け、装置全体を管理するととも
に、記録されていた装置制御情報から連続した録画が途
切れる箇所（カット）を検出し、これをカット情報２１
６として前記画像蓄積部２０５に供給する。画像蓄積部
２０５はこれを受けて、カット単位で画像を蓄える。On the other hand, the device control information 212 is similarly converted from the recording signal to the digital control signal by the decoder 213 and supplied to the system controller 214. The system controller 214 receives the user's instruction input from the input terminal 215, manages the entire apparatus, detects a portion (cut) at which continuous recording is interrupted from the recorded apparatus control information, and cuts this. Information 21
6 is supplied to the image storage unit 205. The image storage unit 205 receives this and stores the image in units of cuts.

【００４３】さらに視線位置情報２１７も同様に復号器
２１８によって記録信号からディジタル信号に変換され
る。この段階の信号は撮影の各フレームでの視線の位置
を表しているにすぎない。眼球運動は、一点を凝視して
いる時にも微妙な振動をする固視微動と、運動する対象
を追跡する際の随従運動、および視対象間を移動する際
の跳躍運動の３つに大別される。種別判定機２１９は、
復号器２１８によってディジタル化された視線位置情報
から、こうした運動の性格を判断し、その種類に関する
情報を視線位置情報に付加する。眼球運動の種類に関す
る情報を付加された視線位置情報は、注視点決定器２２
０に供給される。注視点決定器２２０では、種別判定器
２１９からの運動種情報に基づいて注視点と判断できる
画面上の座標をエッジ認識装置２０６に供給する。Similarly, the line-of-sight position information 217 is also converted from a recording signal into a digital signal by the decoder 218. The signal at this stage merely represents the position of the line of sight in each frame of shooting. Eye movements are roughly divided into three categories: fixational eye movements that cause subtle vibrations even when gazing at one point, following movements when tracking a moving object, and jumping movements when moving between visual objects. To be done. The type determination machine 219
From the line-of-sight position information digitized by the decoder 218, the nature of such movement is determined, and information regarding the type is added to the line-of-sight position information. The line-of-sight position information added with the information regarding the type of eye movement is the gazing point determiner 22.
Supplied to zero. The gazing point determiner 220 supplies the coordinates on the screen that can be determined to be the gazing point to the edge recognition device 206 based on the motion type information from the type determiner 219.

【００４４】一方、エッジ認識装置２０６には前記画像
蓄積部２０５から一連のカット内の画像が画像の走査単
位（１フレーム）、または連続したフレームのまとまり
として供給され、その画像情報と注視点決定器２２０か
ら供給される注視点座標情報とから、注視している対象
物とその背景および前景との境界線（エッジ）を判定す
る。ここでエッジ認識装置２０６にはシステムコントロ
ーラ２１４を通じて使用者の指示を与え、指示に即して
エッジを認識させてもよい。On the other hand, the image in the series of cuts is supplied from the image storage unit 205 to the edge recognition device 206 as a scanning unit (one frame) of the image or as a group of continuous frames, and the image information and the gazing point are determined. Based on the gazing point coordinate information supplied from the device 220, a boundary line (edge) between the gazing target object and its background and foreground is determined. Here, the edge recognition device 206 may be instructed by the user through the system controller 214 so that the edge is recognized in accordance with the instruction.

【００４５】画像情報は、注視点の存在するフレームに
は注視点を含む物体の輪郭に関する情報が付加されて画
像部分抽出器２０７に供給される。画像部分抽出器２０
７では注視点を含む物体（注目物体）を、注視点の存在
するフレームすべてについて部分画像として抽出し、重
複検査器２０８に供給する。ここで注目物体はカット内
や同一フレーム内に複数存在してもよい。The image information is supplied to the image partial extractor 207 with information about the contour of the object including the gazing point added to the frame where the gazing point exists. Image part extractor 20
At 7, the object including the gazing point (target object) is extracted as a partial image for all the frames in which the gazing point exists, and is supplied to the duplication checker 208. Here, a plurality of target objects may exist in the cut or in the same frame.

【００４６】重複検査器２０８では、異なるフレームで
異なる姿勢であったり、ズームによって大きさが異なっ
ている同一物体について、それらが同一のものであると
いうことを検査し、供給された注目物体の画像とともに
重複情報として画像選定器２０９に供給する。ここで重
複検査器２０８は、検査中のカットだけでなくカットを
越えて重複の有無を検査してもよい。また、前記の画像
記録装置であらかじめ索引としたい物体を見ているとき
にそれを知らせる指示が記録されている場合、これを直
接索引候補の物体に加えてもよい。The duplication inspector 208 inspects that the same object has a different posture in different frames or has a different size due to zooming, and that they are the same, and the supplied image of the object of interest is supplied. At the same time, it is supplied to the image selector 209 as overlapping information. Here, the duplication inspector 208 may inspect not only the cut being inspected but also the presence or absence of duplication across the cut. Further, when the image recording apparatus has previously recorded an instruction for notifying the user of an object to be used as an index, this may be directly added to the object of the index candidate.

【００４７】画像選定器２０９は、注目物体画像と重複
情報に加えて登場回数、注視時間、大きさなど注目物体
画像から得られる副次的情報に基づいて、複数供給され
た注目物体の中から索引に適した対象物を選定するとと
もに、重複した同一物体の画像の中で、索引に適した状
態のものを選定する。ここで選定される索引は、カット
継続時間に対するその物体への注視時間の割合が一定値
を越えるものすべてを候補にするなどの方法を用いて、
カット内で複数存在してもよい。また索引に、同一物体
が違う状態で複数存在してもよい。さらにここでの選定
作業はシステムコントローラ２１４を通じて使用者の指
示を与え、指示に即して索引とする画像を選択してもよ
い。The image selector 209 selects from among a plurality of supplied target objects, based on secondary information obtained from the target object image such as the number of appearances, gaze time, and size, in addition to the overlapping information with the target object image. An object suitable for indexing is selected, and an image of the same object duplicated is selected in a state suitable for indexing. The index selected here uses a method such as making all candidates for which the ratio of the gaze time to the object with respect to the cut duration exceeds a certain value as a candidate,
There may be more than one in the cut. Further, a plurality of same objects may exist in the index in different states. Further, the selection work here may be performed by giving a user's instruction through the system controller 214 and selecting an image as an index according to the instruction.

【００４８】以上のようにして索引と決定された対象物
は、抽出された部分画像そのもの、あるいは索引とされ
た画像のフレームを表す情報および索引とする画像領域
を表す情報として出力され、あわせてその対象物が含ま
れる他のフレームの情報（関連フレーム情報）も重複情
報に基づいて作成され、出力される。The object determined to be the index as described above is output as the extracted partial image itself, or information indicating the frame of the indexed image and information indicating the image area to be indexed. Information of other frames (related frame information) including the object is also created and output based on the duplicated information.

【００４９】画像選定器２０９から出力された索引画像
あるいは索引画の存在領域情報、関連フレーム情報は符
号化装置２１０を通じて記録に適した信号に変換され、
記録信号発生装置２１１の出力により記憶媒体２０１に
再記録される。ここで、索引の記録に用いる記憶媒体
は、元の画像および視線位置情報が記録された記憶媒体
２０１と異なるものを用いてもよい。The index image or the area information of the index image and the related frame information output from the image selector 209 are converted into a signal suitable for recording by the encoding device 210,
It is re-recorded in the storage medium 201 by the output of the recording signal generator 211. Here, the storage medium used for recording the index may be different from the storage medium 201 in which the original image and the line-of-sight position information are recorded.

【００５０】図３は画像再生検索装置の構成を示すブロ
ック図である。この装置は図２の索引作成装置によって
記録された映像信号、索引画信号および関連フレーム情
報に基づいて、索引を提示して使用者の目的画像の検索
を可能にするものである。FIG. 3 is a block diagram showing the structure of the image reproduction / retrieval device. This device presents an index based on the video signal, index image signal and related frame information recorded by the index creating device of FIG. 2 to enable the user to search for the target image.

【００５１】記憶媒体３０１に記憶された映像信号、索
引画信号および関連フレーム情報は記憶信号読出装置３
０２によって取り出され、映像信号３０３、索引画信号
３１４および関連フレーム情報３０８に分けられる。こ
こで、記憶媒体は前記のように映像信号、索引画信号、
関連フレーム情報で異なるものを用いてもよい。また、
同一記憶媒体を用いても異なる読み出し装置を用いて、
あらかじめ記録された異なる領域から取り出してもよ
い。The video signal, index image signal and related frame information stored in the storage medium 301 are stored in the storage signal reading device 3.
02, and is divided into a video signal 303, an index image signal 314, and related frame information 308. Here, the storage medium is a video signal, an index image signal,
Different pieces of related frame information may be used. Also,
Even if the same storage medium is used and different reading devices are used,
It may be taken out from a different area recorded in advance.

【００５２】索引参照を必要とする通常の再生の場合、
映像信号３０３は復号器３０４によって記録信号からデ
ィジタル画像信号に変換され、入力端子３１１を通じて
使用者の指示を受けたシステムコントローラ３１２の指
示に基づいて、画面作成器３０５で色、表示範囲、字幕
などの加工を受け、Ｄ／Ａ変換器３０６でモニターテレ
ビに出力されるべき信号に変換された後、出力される。
ここで、モニターテレビがディジタルデータを許容する
ものである場合、Ｄ／Ａ変換器３０６は作用しなくても
よい。For normal playback, which requires index lookup,
The video signal 303 is converted from a recording signal into a digital image signal by a decoder 304, and colors, display ranges, captions, etc. are displayed by a screen creator 305 based on an instruction from a system controller 312 which receives an instruction from a user through an input terminal 311. Is processed, converted into a signal to be output to the monitor television by the D / A converter 306, and then output.
Here, the D / A converter 306 does not have to operate when the monitor television is one that allows digital data.

【００５３】一方、使用者による検索に備えて索引画信
号３１４は復号器３１５によりディジタル画像信号に変
換され、索引画格納部３１６に蓄積される。ここで前記
のように索引画ではなく、索引画となるべき画像の存在
位置（アドレス）およびそのフレームの中での表示領域
を記録する方法の場合、この存在位置、表示領域情報に
基づいて画像データを参照し、画像として同様に索引画
格納部３１６に蓄積される。On the other hand, the index image signal 314 is converted into a digital image signal by the decoder 315 and stored in the index image storage unit 316 in preparation for the search by the user. Here, as described above, in the case of the method of recording the position (address) of the image to be the index image and the display area in the frame instead of the index image, the image is displayed based on this position and display area information. The data is referred to and similarly stored as an image in the index image storage unit 316.

【００５４】索引で参照すべき物体が存在するフレーム
を記録した関連フレーム情報３０８は、復号器３０９に
よってディジタル信号に変換され、索引関連情報格納部
３１０に蓄積される。この索引関連情報格納部３１０で
は索引として指示する画像の索引画格納部３１６内での
蓄積場所や、その索引と対応する物体の存在するフレー
ムのアドレスが蓄積されており、システムコントローラ
３１２にこの情報を供給したり、入力端子３１１から使
用者の指示を受けたシステムコントローラ３１２の指示
によってこの対応関係を変更、更新できる。The related frame information 308 in which the frame in which the object to be referred to by the index exists is recorded is converted into a digital signal by the decoder 309 and stored in the index related information storage unit 310. In the index related information storage unit 310, the storage location in the index image storage unit 316 of the image designated as the index and the address of the frame in which the object corresponding to the index exists are stored, and this information is stored in the system controller 312. Can be supplied, or this correspondence can be changed or updated by an instruction from the system controller 312 which receives an instruction from the user through the input terminal 311.

【００５５】システムコントローラ３１２は、使用者の
検索要求指示に従って一覧表作成装置３１７に索引一覧
表を作成させる。一覧表作成装置３１７は索引画格納部
３１６から使用者の希望する範囲の索引画の供給を受
け、画面作成器３０５にそれを送る。画面作成器３０５
ではシステムコントローラ３１３の指示を通じて使用者
に提示すべき画面を、元の映像信号、索引一覧表を組み
合わせて作成し、Ｄ／Ａ変換器３０６に供給する。Ｄ／
Ａ変換器３０６はモニターテレビに出力されるべき信号
に変換し、出力端子３０７に出力する。ここで、モニタ
ーテレビがディジタルデータを許容するものである場
合、Ｄ／Ａ変換器３０６は作用しなくてもよい。The system controller 312 causes the list creation device 317 to create an index list according to the user's search request instruction. The list creation device 317 receives the index image in the range desired by the user from the index image storage unit 316 and sends it to the screen creator 305. Screen creator 305
Then, the screen to be presented to the user through the instruction of the system controller 313 is created by combining the original video signal and the index list, and is supplied to the D / A converter 306. D /
The A converter 306 converts the signal to be output to the monitor TV and outputs the signal to the output terminal 307. Here, the D / A converter 306 does not have to operate when the monitor television is one that allows digital data.

【００５６】使用者は指示された索引から、目的とする
対象物を選択しスイッチ、ボタンなどの入力手段を通じ
て入力端子３１１に指示を伝達する。ここで目的となる
対象物は複数でもよい。また索引を選択する手段として
再び視線を利用し、選択したい画像を注視したことを選
択の意志表示ととらえる方法を用いてもよい。The user selects the target object from the instructed index and transmits the instruction to the input terminal 311 through the input means such as the switch and the button. A plurality of target objects may be used here. Alternatively, a method may be used in which the line of sight is used again as a means for selecting an index, and the fact that the user gazes at the image to be selected is recognized as the intention of selection.

【００５７】システムコントローラ３１２は選択された
索引と関連するフレームを索引関連情報格納部３１０に
問い合わせ、通常の再生と同様の方法で画面に目的の画
像を供給する。The system controller 312 inquires of the index related information storage unit 310 about the frame associated with the selected index, and supplies the target image to the screen in the same manner as in normal reproduction.

【００５８】上記実施例では視線位置の検出と索引画の
作成および検索再生をそれぞれ別の装置で行っている
が、これらの中の任意の二つ、または三つすべてを同一
の装置で行ってもよい。In the above-described embodiment, the line-of-sight position detection, the index image creation, and the search / playback are performed by different devices, but any two or all three of these are performed by the same device. Good.

【００５９】図４は画像記録装置兼索引作成装置の構成
を示すブロック図で、視線位置の検出と索引画の作成を
同一の装置で行う場合の実施例を示す図ある。FIG. 4 is a block diagram showing the structure of the image recording device / index creating device, and is a diagram showing an embodiment in which the line-of-sight position detection and index image creation are performed by the same device.

【００６０】記録される映像信号は入力端子４０１に供
給され、Ａ／Ｄ変換装置４０２によって量子化された
後、符号化装置４１１によって記録に適した信号に変換
され、記録信号発生装置４０９を通じて記憶媒体４１０
に記録される。同時に画像蓄積部４０３に一定時間分の
画像が蓄えられる。The video signal to be recorded is supplied to the input terminal 401, quantized by the A / D converter 402, converted into a signal suitable for recording by the encoder 411, and stored by the recording signal generator 409. Medium 410
Recorded in. At the same time, images for a certain period of time are stored in the image storage unit 403.

【００６１】一方、同時に入力端子４１５には撮影者の
眼球の動きが電気信号で供給される。この眼球運動信号
はＡ／Ｄ変換装置４１６によって量子化された後、位置
検出器４１７によって画像中の視点位置情報として計算
される。計算された視点位置情報は種別判定器４１８に
よってその運動の性格を判定される。眼球運動の種類に
関する情報を付加された視線位置情報は、注視点決定器
４１９に供給される。注視点決定器４１９では、種別判
定器４１８からの運動種情報に基づいて注視点と判断で
きる画面上の座標をエッジ認識装置４０４に供給する。On the other hand, at the same time, the movement of the photographer's eyeball is supplied to the input terminal 415 as an electric signal. The eye movement signal is quantized by the A / D conversion device 416 and then calculated by the position detector 417 as viewpoint position information in the image. The calculated viewpoint position information is judged by the type judging device 418 as to the nature of the motion. The line-of-sight position information added with the information regarding the type of eye movement is supplied to the gazing point determiner 419. The gazing point determiner 419 supplies the coordinates on the screen that can be determined to be the gazing point to the edge recognition device 404 based on the motion type information from the type determiner 418.

【００６２】入力端子４１２にはスイッチ、ボタンなど
を通じて撮影者の指示をシステムコントローラ４１３に
伝える信号が供給される。この信号により画像記録装置
は録画の開始、終了やズームイン、ズームアウトなどを
制御する。装置の制御情報４２０は、符号化装置４２１
によって、やはり記録に適した信号に変換されて映像信
号、索引画情報とともに、記録信号発生装置４０９の出
力により記憶媒体４１０に記録される。また、システム
コントローラ４１３は、連続した録画の単位（カット）
を検出し、これをカット情報４１４として画像蓄積部４
０３に供給する。画像蓄積部４０３はこれを受けて、カ
ット単位で画像を蓄える。A signal for transmitting a photographer's instruction to the system controller 413 is supplied to the input terminal 412 through a switch, a button, or the like. With this signal, the image recording apparatus controls the start and end of recording, zoom-in, zoom-out, and the like. The device control information 420 includes an encoding device 421.
Is converted into a signal suitable for recording, and is recorded in the storage medium 410 by the output of the recording signal generator 409 together with the video signal and index image information. In addition, the system controller 413 is a unit (cut) of continuous recording.
Is detected and used as cut information 414 in the image storage unit 4
Supply to 03. The image storage unit 403 receives this and stores images in cut units.

【００６３】一方、エッジ認識装置４０４には画像蓄積
部４０３から一連のカット内の画像が画像の走査単位
（１フレーム）、または連続したフレームのまとまりと
して供給され、その画像情報と注視点決定器４１９から
供給される注視点座標情報とから、注視している対象物
と、その背景および前景との境界線（エッジ）を判定す
る。ここでエッジ認識装置４０４にはシステムコントロ
ーラ４１３を通じて使用者の指示を与え、指示に即して
エッジを認識させてもよい。On the other hand, the image in the series of cuts is supplied from the image storage unit 403 to the edge recognition device 404 as an image scanning unit (one frame) or as a group of continuous frames, and the image information and the gazing point determiner are supplied. Based on the gazing point coordinate information supplied from 419, the boundary line (edge) between the gazing target object and its background and foreground is determined. Here, the edge recognition device 404 may be instructed by the user through the system controller 413 to recognize the edge in accordance with the instruction.

【００６４】画像情報は、注視点の存在するフレームに
は注視点を含む物体の輪郭に関する情報が付加されて画
像部分抽出器４０５に供給される。画像部分抽出器４０
５では注視点を含む物体（注目物体）を、注視点の存在
するフレームすべてについて部分画像として抽出し、重
複検査器４０６に供給する。ここで注目物体はカット内
や同一フレーム内に複数存在してもよい。The image information is supplied to the image part extractor 405 with information about the contour of the object including the gazing point added to the frame in which the gazing point exists. Image part extractor 40
In 5, the object including the gazing point (target object) is extracted as a partial image for all the frames in which the gazing point exists, and is supplied to the duplication checker 406. Here, a plurality of target objects may exist in the cut or in the same frame.

【００６５】重複検査器４０６では、異なるフレームで
異なる姿勢であったり、ズームによって大きさが異なっ
ている同一物体について、それらが同一のものであると
いうことを検査し、供給された注目物体の画像とともに
重複情報として画像選定器４０７に供給する。ここで重
複検査器４０６は、検査中のカットだけでなくカットを
越えて重複の有無を検査してもよい。The duplication inspector 406 inspects that the same objects having different postures in different frames or different sizes due to zooming are the same, and the supplied image of the object of interest. Along with this, it is supplied to the image selector 407 as overlapping information. Here, the duplication inspector 406 may inspect not only the cut being inspected but also the presence or absence of duplication across the cut.

【００６６】画像選定器４０７は、注目物体画像と重複
情報に加えて登場回数、注視時間、大きさなど注目物体
画像から得られる副次的情報に基づいて、複数供給され
た注目物体の中から索引に適した対象物を選定するとと
もに、重複した同一物体の画像の中で、索引に適した状
態のものを選定する。ここで選定される索引は、カット
継続時間に対するその物体への注視時間の割合が一定値
を越えるものすべてを候補にするなどの方法を用いて、
カット内で複数存在してもよい。また索引に、同一物体
が違う状態で複数存在してもよい。さらにここでの選定
作業はシステムコントローラ４１３を通じて使用者の指
示を与え、指示に即して索引とする画像を選択してもよ
い。The image selector 407 selects from among a plurality of supplied objects of interest, based on side information obtained from the object of interest image such as the number of appearances, gaze time, and size in addition to the overlapping information of the object of interest image. An object suitable for indexing is selected, and an image of the same object duplicated is selected in a state suitable for indexing. The index selected here uses a method such as making all candidates for which the ratio of the gaze time to the object with respect to the cut duration exceeds a certain value as a candidate,
There may be more than one in the cut. Further, a plurality of same objects may exist in the index in different states. Further, the selection work here may be performed by giving a user's instruction through the system controller 413 and selecting an image as an index in accordance with the instruction.

【００６７】以上のようにして索引と決定された対象物
は、抽出された部分画像そのもの、あるいは索引とされ
た画像のフレームを表す情報および索引とする画像領域
を表す情報として出力され、あわせてその対象物が含ま
れる他のフレームの情報も重複情報に基づいて作成さ
れ、出力される。The object determined to be the index as described above is output as the extracted partial image itself, or the information indicating the frame of the indexed image and the information indicating the image area to be indexed. Information on other frames including the object is also created and output based on the duplicate information.

【００６８】画像選定器４０７から出力された索引画像
あるいは索引画の存在領域情報、関連フレーム情報は符
号化装置４０８を通じて記録に適した信号に変換され、
記録信号発生装置４０９の出力により記憶媒体４１０に
記録される。ここで、索引の記録に用いる記憶媒体は、
元の画像および装置制御情報が記録された記憶媒体４１
０と異なるものを用いてもよい。The index image or the area information of the index image and the related frame information output from the image selector 407 are converted into a signal suitable for recording by the encoder 408,
It is recorded in the storage medium 410 by the output of the recording signal generator 409. Here, the storage medium used to record the index is
Storage medium 41 in which the original image and device control information are recorded
A value different from 0 may be used.

【００６９】撮影者の指示としては、索引としたい物体
に注目しつつ撮影している際に装置にそれを知らせる入
力信号を用意して、入力端子４１２から上記の経路を経
る装置制御情報４２０に含めてもよい。As the photographer's instruction, an input signal for notifying the device of the object to be indexed while photographing is prepared, and input from the input terminal 412 to the device control information 420 passing through the above path. May be included.

【００７０】以上の中で映像信号、索引画信号と関連フ
レーム情報、装置制御情報は符号化装置４２１の後、そ
れぞれ異なる記録信号発生装置によって、記憶媒体４１
０の異なる領域に格納してもよい。また記憶信号発生装
置、記憶媒体ともにそれぞれ異なるものを用いて記録し
てもよい。In the above, the video signal, the index picture signal, the related frame information, and the device control information are stored in the storage medium 41 by the different recording signal generators after the encoding device 421.
It may be stored in different areas of 0. Further, different recording signal generators and different storage media may be used for recording.

【００７１】図５は、他の光ディスクを用いたディジタ
ルビデオ装置等の画像記録装置の概念を示す図で、動画
撮影時に撮影者の視線を観察し、視線の方向に自動的に
装置を向けつつ、画像中の索引となりうる物体を同時記
録する例を示している。本実施例は、撮影者自身が記録
装置の撮影方向を変化させない点で先の実施例とは異な
る。また、ここでは例として索引画像を記録するための
専用の光学系をもち、撮影者が意図した撮影画像を記録
する媒体と索引画像を記録する媒体が異なる場合を説明
する。FIG. 5 is a diagram showing the concept of an image recording device such as a digital video device using another optical disk. While observing the line of sight of the photographer at the time of shooting a moving image, the device is automatically oriented in the direction of the line of sight. , An example of simultaneously recording an object that can be an index in an image is shown. This embodiment is different from the previous embodiments in that the photographer himself does not change the shooting direction of the recording apparatus. Further, as an example, a case will be described in which a medium having a dedicated optical system for recording an index image and a medium for recording a photographed image intended by a photographer and a medium for recording an index image are different.

【００７２】画像記録装置本体５０１は、前記のように
撮影者が意図した撮影をするための光学系と、索引画を
撮影するための光学系の２つを持っている。撮影者５０
４の目の部分には、視線検出器５０２が装着され、本体
５０１とは有線または無線で結ばれている。本体５０１
あるいは視線検出器５０２は、互いの位置関係を察知す
る機能を持っており、その位置関係は本体５０１によっ
て把握される。撮影者５０４は端末５０３を手に持ち、
これによって装置本体５０１を制御する。端末５０３
は、本体５０１とは有線または無線で結ばれている。５
０５は仮に撮影者５０４が撮影しようとしている被写体
である。The image recording apparatus main body 501 has two optical systems, one for taking a photograph intended by the photographer and the other for taking an index image, as described above. Photographer 50
A line-of-sight detector 502 is attached to the fourth eye portion and is connected to the main body 501 by wire or wirelessly. Body 501
Alternatively, the line-of-sight detector 502 has a function of sensing the positional relationship between each other, and the positional relationship is grasped by the main body 501. The photographer 504 holds the terminal 503,
This controls the apparatus main body 501. Terminal 503
Is wired or wirelessly connected to the main body 501. 5
Reference numeral 05 denotes a subject that the photographer 504 is about to photograph.

【００７３】本体５０１によって撮影されている画像５
０６，５０８のうち、一方の画像５０６は、撮影者５０
４の指示によるズームを行っているもの（ユーザ画像）
である。撮影者５０４が被写体５０５に注視すると、本
体５０１は自動で被写体５０５の方向へ向きを変え、端
末５０３によって撮影者５０４に指示された拡大率で撮
影を行う。ユーザ画像５０６中の記号５０７は、仮に撮
影者５０４の視点が表示されているものである。他方の
画像５０８は索引画像で、ユーザ画像５０６と別の光学
系を用い、装置側であらかじめ決められた拡大率で注視
点を中心に撮影されるものである。Image 5 taken by the main body 501
Of the images 06 and 508, one image 506 is the photographer 50.
What is zoomed by the instruction of 4 (user image)
Is. When the photographer 504 gazes at the subject 505, the main body 501 automatically turns to the direction of the subject 505, and photographs with the enlargement ratio instructed by the photographer 504 by the terminal 503. The symbol 507 in the user image 506 indicates that the viewpoint of the photographer 504 is displayed. The other image 508 is an index image, and is captured around the gazing point using a different optical system from the user image 506 and at a magnification rate that is predetermined by the device side.

【００７４】図６はこの画像記録装置の構成を示すブロ
ック図であり、図５の概念図では本体５０１の内部の処
理装置に対応する。FIG. 6 is a block diagram showing the structure of this image recording apparatus, and corresponds to the processing apparatus inside the main body 501 in the conceptual diagram of FIG.

【００７５】二つの光学系から入力された映像信号はユ
ーザ画像信号６０１、索引画像信号６０６として装置に
供給される。ユーザ画像信号はＡ／Ｄ変換装置６０２に
よってディジタル信号に変換された後、符号化装置６０
３によってそのまま符号化され、記録信号発生装置６０
４の出力を介して記憶媒体６０５に記録される。ここで
符号化の前に電気信号上での拡大処理であるディジタル
ズームを施してもよい。Video signals input from the two optical systems are supplied to the apparatus as a user image signal 601 and an index image signal 606. The user image signal is converted into a digital signal by the A / D converter 602, and then encoded by the encoder 60.
And the recording signal generator 60
It is recorded in the storage medium 605 via the output of No. 4. Here, a digital zoom, which is an enlargement process on the electric signal, may be performed before the encoding.

【００７６】一方、入力端子６０６から供給された索引
画像信号も同様にＡ／Ｄ変換装置６０７によってディジ
タル信号に変換され、ディジタルズーム６０８に供給さ
れる。カメラは撮影者の注視点を向いているため、その
まま入力された索引画像信号は注目物体を含んでいる。
ディジタルズーム６０８はその注視点を中心にあらかじ
め決められたズームを施した画像信号を画像蓄積部６０
９に供給する。On the other hand, the index image signal supplied from the input terminal 606 is similarly converted into a digital signal by the A / D converter 607 and supplied to the digital zoom 608. Since the camera faces the gazing point of the photographer, the index image signal input as it is includes the target object.
The digital zoom 608 outputs an image signal, which is subjected to a predetermined zoom centering on the gazing point, to the image storage unit 60.
Supply to 9.

【００７７】撮影と同時に入力端子６２０からは眼球運
動信号が、入力端子６２４からは視線検出器と本体（撮
影装置）との位置関係を示す信号が供給される。この二
信号はそれぞれＡ／Ｄ変換器６２１、Ａ／Ｄ変換器６２
５によってディジタル信号に変換される。この後、眼球
運動信号は位置検出器６２２によって視野内での視線の
位置に変換される。これと並行して検出器位置信号が位
置検出器６２６によって本体の向き、検出器の向きと本
体・検出器の向きと距離などの情報に変換され、前記の
視線情報とともに注視点決定器６２３に供給される。Simultaneously with photographing, an eye movement signal is supplied from the input terminal 620, and a signal indicating the positional relationship between the line-of-sight detector and the main body (imaging device) is supplied from the input terminal 624. These two signals are respectively A / D converter 621 and A / D converter 62.
5 is converted into a digital signal. After this, the eye movement signal is converted into the position of the line of sight within the visual field by the position detector 622. In parallel with this, the detector position signal is converted by the position detector 626 into information such as the orientation of the main body, the orientation of the detector and the orientation and distance of the main body / detector, and the eye gaze information to the gazing point determiner 623. Supplied.

【００７８】注視点決定器６２３では、本体、検出器の
位置関係、および撮影者の両眼の視線方向から、撮影者
の注視点の絶対位置を決定し、三次元座標としてシステ
ムコントローラ６１５に伝える。システムコントローラ
６１５では得られた注視点座標からカメラを向けるべき
方向、光学系の焦点を合わせるべき距離を計算し、カメ
ラ方向変位信号として出力端子６１９に出力する。The gazing point determiner 623 determines the absolute position of the gazing point of the photographer from the positional relationship between the main body and the detector, and the line-of-sight directions of the photographer's eyes, and informs the system controller 615 as three-dimensional coordinates. . The system controller 615 calculates the direction to which the camera should be directed and the distance to which the optical system should be focused from the obtained gazing point coordinates, and outputs it to the output terminal 619 as a camera direction displacement signal.

【００７９】一方、撮影者は制御端末から装置に指示を
与えるが、その信号が入力端子６１８からシステムコン
トローラ６１５に提供される。システムコントローラ６
１５は、指示に従ってユーザ画像ズームフォーカス信号
として出力端子６１４にズーム指示を伝達し、光学系の
ズームを行う。ここで、前記の符号化装置６０３の前に
ディジタルズームを設けるなどして、光学的なズームお
よび焦点合わせの必要がないときにはこの出力はなくて
もよい。また、カメラと被写体との距離の情報に基づ
き、あらかじめ決められた拡大率で注視点周辺のズーム
画を撮影するためのズームコントロール信号を出力端子
６１７から出力する。On the other hand, the photographer gives an instruction to the apparatus from the control terminal, and the signal is provided from the input terminal 618 to the system controller 615. System controller 6
In accordance with the instruction, 15 transmits a zoom instruction to the output terminal 614 as a user image zoom focus signal, and zooms the optical system. Here, when there is no need for optical zooming and focusing by providing a digital zoom in front of the encoding device 603, this output may be omitted. In addition, a zoom control signal for shooting a zoom image around the gazing point at a predetermined enlargement ratio is output from the output terminal 617 based on the information on the distance between the camera and the subject.

【００８０】カット検出器６１６は、入力端子６１８か
らシステムコントローラ６１５に入力した撮影者の指示
の中から、録画の開始、終了に関するものをカット信号
として取り出し、画像蓄積部６０９に蓄積の単位として
供給する。画像蓄積部６０９は画像信号を１フレームま
たは連続したフレームのまとまりとして画像選定器６１
０に送る。画像選定器６１０では注目物体が移っている
画像の中から、登場回数、注視時間など注目物体画像か
ら得られる副次的情報に基づいて、複数供給された注目
物体の中から索引に適した対象物を選定するとともに、
重複した同一物体の画像の中で、索引に適した状態のも
のを選定する。ここで選定される索引は、カット継続時
間に対するその物体への注視時間の割合が一定値を越え
るものすべてを候補にするなどの方法を用いて、カット
内で複数存在してもよい。また索引に、同一物体が違う
状態で複数存在してもよい。ここでの選定作業はシステ
ムコントローラ６１８を通じて使用者の指示を与え、指
示に即して索引とする画像を選択してもよい。The cut detector 616 takes out from the photographer's instructions input from the input terminal 618 to the system controller 615, a cut signal relating to the start and end of recording and supplies it as a cut signal to the image storage unit 609. To do. The image storage unit 609 uses the image selector 61 as an image signal as one frame or a group of consecutive frames.
Send to 0. In the image selector 610, a target suitable for indexing is supplied from a plurality of supplied target objects based on side information obtained from the target object image such as the number of appearances and gaze time from the image in which the target object is moved. While selecting the item,
Among the duplicated images of the same object, the one suitable for indexing is selected. A plurality of indexes selected here may be present in the cut by using a method in which all the indexes in which the ratio of the gaze time to the object with respect to the cut duration exceeds a certain value are set as candidates. Further, a plurality of same objects may exist in the index in different states. The selection work here may be performed by giving a user's instruction through the system controller 618 and selecting an image as an index in accordance with the instruction.

【００８１】こうして決定された索引画像は符号装置６
１１によって記録に適した信号に変換され、記録信号発
生装置６１２の出力により、記憶媒体６１３に記録され
る。ここで、記憶媒体に映像信号の記録媒体６０５と同
一のものを用いて、媒体６０５の映像信号と異なる領域
に記憶させてもよい。また、符号化装置６１１によって
符号化された索引画像信号を、符号化装置６０３によっ
て符号化された映像信号と混合し、記録信号発生装置６
０４の出力により記憶媒体６０５に記憶させてもよい。The index image thus determined is the encoding device 6.
The signal is converted into a signal suitable for recording by 11, and is recorded in the storage medium 613 by the output of the recording signal generator 612. Here, the same storage medium as the video signal recording medium 605 may be used as the storage medium and may be stored in an area different from the video signal of the medium 605. Further, the index image signal encoded by the encoding device 611 is mixed with the video signal encoded by the encoding device 603, and the recording signal generating device 6
The output of 04 may be stored in the storage medium 605.

【００８２】さらに本実施例では注視点周辺の画像の拡
大のみを索引として用いたが、実施例の第一と同様に、
対象物のエッジを認識して部分画像として取り出しても
よい。このとき、映像および視線記録装置と索引作成装
置は一体でもよく、異なる装置でもよい。Further, in the present embodiment, only the enlargement of the image around the gazing point is used as an index, but as in the first embodiment,
You may recognize the edge of a target object and take it out as a partial image. At this time, the image and line-of-sight recording device and the index creating device may be integrated or may be different devices.

【００８３】その他の実施例として、動画再生時に視
（聴）者の視線を観察し、画像中の索引となりうる物体
を記録する例を説明する。本実施例は、録画時ではなく
再生時の使用者の視線を観察する点で先の実施例と異な
る。As another embodiment, an example of observing the line of sight of a viewer (listener) during reproduction of a moving image and recording an object that can serve as an index in an image will be described. This embodiment is different from the previous embodiment in that the user's line of sight is observed during reproduction, not during recording.

【００８４】本実施例の構成を示すブロック図は図４と
類似であるので、図４を参照しながら、先の実施例との
相違を示しつつ説明する。なお本実施例は映像信号と索
引信号を同一の媒体に記録するディジタルビデオ装置で
ある。Since the block diagram showing the configuration of the present embodiment is similar to that of FIG. 4, it will be described with reference to FIG. 4 while showing the differences from the previous embodiment. Note that this embodiment is a digital video device that records a video signal and an index signal on the same medium.

【００８５】映像信号は再生画面として視（聴）者に提
供されると同時に入力端子４０１にも供給される。映像
信号はＡ／Ｄ変換装置４０２によってディジタル信号に
変換され、画像蓄積部４０３に一定時間分の画像が蓄え
られる。画像蓄積部４０３では提供された画像の中から
カットを検出し、それを蓄積の単位とする。なお本実施
例ではＡ／Ｄ変換装置４０２と画像蓄積部４０３の中間
から符号化装置４１１を結ぶ流路、および符号化装置４
１１は必要としない。The video signal is supplied to the viewer (listener) as a reproduction screen and is also supplied to the input terminal 401. The video signal is converted into a digital signal by the A / D converter 402, and an image for a fixed time is stored in the image storage unit 403. The image storage unit 403 detects a cut from the provided images and uses it as a unit of storage. In this embodiment, the channel connecting the encoding device 411 from the middle of the A / D conversion device 402 and the image storage unit 403, and the encoding device 4
11 is not needed.

【００８６】一方、視（聴）者の眼球運動は、眼球運動
検出器と再生画面との位置関係の情報も含めて入力端子
４１５からＡ／Ｄ変換装置４１６に供給され、ディジタ
ル信号に変換される。変換されたディジタル信号は位置
検出器４１７で画面内での視線の位置として計算され
る。計算の結果は種別判定器４１８に供給され、眼球運
動の種類が解析される。その解析結果は注視点決定器４
１９に与えられ、画面内での注視点としてエッジ認識装
置４０４に注視点情報を与える。On the other hand, the eye movement of the viewer (listener) is supplied from the input terminal 415 to the A / D conversion device 416 including the information on the positional relationship between the eye movement detector and the reproduction screen and converted into a digital signal. It The converted digital signal is calculated by the position detector 417 as the position of the line of sight on the screen. The calculation result is supplied to the type determiner 418, and the type of eye movement is analyzed. The analysis result is the gazing point determiner 4.
19 and gives the point of interest information to the edge recognition device 404 as the point of interest on the screen.

【００８７】エッジ認識装置４０４には前記画像蓄積部
４０３から一連のカット内の画像が１フレーム、または
連続したフレームのまとまりとして供給され、その画像
情報と注視点決定器４１９から供給される注視点座標情
報とから、注視している対象物のエッジを判定する。こ
こでエッジ認識装置４０４にはシステムコントローラ４
１３を通じて使用者の指示を与え、指示に即してエッジ
を認識させてもよい。The image in the series of cuts is supplied to the edge recognition device 404 from the image storage unit 403 as one frame or a group of continuous frames, and the image information and the gazing point supplied from the gazing point determiner 419. From the coordinate information, the edge of the gazed object is determined. Here, the edge recognition device 404 includes a system controller 4
A user's instruction may be given through 13 and the edge may be recognized in accordance with the instruction.

【００８８】画像情報は、注視点の存在するフレームに
は注視点を含む物体の輪郭に関する情報とともに画像部
分抽出器４０５に供給される。画像部分抽出器４０５で
は注視点を含む物体（注目物体）を、注視点の存在する
フレームすべてについて部分画像として抽出し、重複検
査器４０６に供給する。ここで注目物体はカット内や同
一フレーム内に複数存在してもよい。The image information is supplied to the image part extractor 405 together with the information about the contour of the object including the gazing point in the frame where the gazing point exists. The image portion extractor 405 extracts an object (object of interest) including the gazing point as a partial image for all frames in which the gazing point exists, and supplies it to the duplication checker 406. Here, a plurality of target objects may exist in the cut or in the same frame.

【００８９】重複検査器４０６では、異なるフレームで
異なる姿勢であったり、ズームによって大きさが異なっ
ている同一物体について、それらが同一のものであると
いうことを検査し、供給された注目物体の画像とともに
重複情報として画像選定器４０７に供給する。ここで重
複検査器４０６は、検査中のカットだけでなくカットを
越えて重複の有無を検査してもよい。The overlap inspector 406 inspects that the same object has a different posture in different frames or has a different size due to zooming, and that they are the same, and the supplied image of the object of interest is supplied. Along with this, it is supplied to the image selector 407 as overlapping information. Here, the duplication inspector 406 may inspect not only the cut being inspected but also the presence or absence of duplication across the cut.

【００９０】画像選定器４０７は、注目物体画像と重複
情報に加えて登場回数、注視時間、大きさなど注目物体
画像から得られる副次的情報に基づいて、複数供給され
た注目物体の中から索引に適した対象物を選定するとと
もに、重複した同一物体の画像の中で、索引に適した状
態のものを選定する。ここで選定される索引は、カット
継続時間に対するその物体への注視時間の割合が一定値
を越えるものすべてを候補にするなどの方法を用いて、
カット内で複数存在してもよい。また索引に、同一物体
が違う状態で複数存在してもよい。さらにここでの選定
作業はシステムコントローラ４１３を通じて使用者の指
示を与え、指示に即して索引とする画像を選択してもよ
い。The image selector 407 selects from among a plurality of supplied objects of interest, based on secondary information obtained from the object of interest image such as the number of appearances, gaze time, and size in addition to the overlapping information of the object of interest image. An object suitable for indexing is selected, and an image of the same object duplicated is selected in a state suitable for indexing. The index selected here uses a method such as making all candidates for which the ratio of the gaze time to the object with respect to the cut duration exceeds a certain value as a candidate,
There may be more than one in the cut. Further, a plurality of same objects may exist in the index in different states. Further, the selection work here may be performed by giving a user's instruction through the system controller 413 and selecting an image as an index in accordance with the instruction.

【００９１】以上のようにして索引と決定された対象物
は、抽出された部分画像、あるいは索引とされた画像の
フレームを表す情報および索引とする画像領域を表す情
報として出力され、あわせてその対象物が含まれるフレ
ームの情報も重複情報に基づいて作成され、出力され
る。The object determined to be the index as described above is output as the information indicating the extracted partial image or the frame of the indexed image and the information indicating the image area to be indexed, and together with that information. The information of the frame including the object is also created and output based on the duplicate information.

【００９２】画像選定器４０７から出力された索引画像
あるいは索引画の存在領域情報、関連フレーム情報は符
号化装置４０８を通じて記録に適した信号に変換され、
記録信号発生装置４０９の出力により記録媒体４１０に
記録される。ここで、索引の記録に用いる記憶媒体は、
元の画像および装置制御情報が記録された記憶媒体４１
０と異なるものを用いてよい。The index image or the area information of the index image and the related frame information output from the image selector 407 are converted into a signal suitable for recording by the encoder 408,
It is recorded on the recording medium 410 by the output of the recording signal generator 409. Here, the storage medium used to record the index is
Storage medium 41 in which the original image and device control information are recorded
A value different from 0 may be used.

【００９３】視（聴）者の指示として、画面の中の索引
としたい物体を注目している際に装置にそれを知らせる
入力信号を用意して、入力端子４１２から上記の経路を
経る装置制御情報４２０に含めてもよい。As an instruction of a viewer (listener), an input signal for notifying the device when an object to be used as an index on the screen is focused is prepared, and device control from the input terminal 412 through the above path is performed. It may be included in the information 420.

【００９４】以上の中で、索引画信号と索引関連情報、
装置制御情報は符号化装置４２１の後、それぞれ異なる
記録信号発生装置によって、記憶媒体４１０の異なる領
域に格納してもよい。また記憶信号発生装置、記憶媒体
ともにそれぞれ異なるものを用いて記録してもよい。In the above, the index image signal and the index related information,
After the encoding device 421, the device control information may be stored in different areas of the storage medium 410 by different recording signal generation devices. Further, different recording signal generators and different storage media may be used for recording.

【００９５】本実施例では視線の検出と索引の作成を同
一の装置で行ったが、視線検出装置と索引作成装置が異
なるものでもよい。また、眼球運動検出器を複数用意し
て、同時に複数名の視点を分析し、それを画像選定器４
０７の判断基準に用いてもよい。さらに、同一画像が繰
り返し再生された際に、前回までの索引と新たな視点情
報とを併せて判断基準として画像選定器４０７に入力
し、索引を更新してもよい。In this embodiment, the line-of-sight detection and the index creation are performed by the same device, but the line-of-sight detection device and the index creation device may be different. In addition, a plurality of eye movement detectors are prepared, the viewpoints of a plurality of persons are analyzed at the same time, and the image selector 4
You may use it for the judgment criterion of 07. Further, when the same image is repeatedly reproduced, the index up to the previous time and the new viewpoint information may be input to the image selector 407 as a criterion to update the index.

【００９６】この実施例では、画像再生検索装置として
図３の実施例と同じものを用いる。ここで、上記視線検
出装置兼索引生成装置と画像再生検索装置が一体のもの
でもよい。In this embodiment, the same image reproduction / retrieval device as that in the embodiment of FIG. 3 is used. Here, the line-of-sight detection / cumulative index generation device and the image reproduction / retrieval device may be integrated.

【００９７】以上の実施例の中では、記録方式がディジ
タル方式であるとして説明を行ったが、アナログ方式の
ビデオに用いてもよい。また記録媒体としては光ディス
クのほか、磁気ディスク、磁気テープ、メモリ素子など
を用いてもよい。また動画に関して説明を行ったが、静
止画を記録する装置に本発明を用いてもよい。また視点
検出装置内で画像内での視線の位置を計算する例で説明
を行ったが、視点の方向だけを記録し、索引作成装置で
画像内の視線の位置を計算させてもよい。また先の実施
例では、対象物体のエッジを利用して対象物の抽出を行
ったが、撮影時に画像内のすべての物体に対してカメラ
からの距離を計測して記録し、その情報から注視点を含
む同程度の距離の集合として対象物を抽出してもよい。
また図５の実施例に関して、索引画像撮影用にユーザ画
像撮影用とは異なる光学系を用いたが、これを同一の光
学系を用いて撮影してもよい。この場合には一つの入力
画像を二つに分岐させ、前記実施例と同じ処理を行って
もよいし、時間的に分割してユーザ画像撮影と索引画像
撮影を交互に行ってもよい。In the above embodiments, the description has been made assuming that the recording system is the digital system, but the recording system may be used for the analog system video. In addition to the optical disc, a magnetic disc, a magnetic tape, a memory element, or the like may be used as the recording medium. Further, although the description has been made regarding the moving image, the present invention may be applied to an apparatus for recording a still image. In addition, the example in which the position of the line of sight in the image is calculated in the viewpoint detection device has been described, but only the direction of the viewpoint may be recorded and the position of the line of sight in the image may be calculated by the index creation device. In the previous example, the target object was extracted using the edge of the target object, but at the time of shooting, the distances from the camera to all the objects in the image were measured and recorded, and the information was extracted from the information. The object may be extracted as a set of similar distances including the viewpoint.
Further, regarding the embodiment of FIG. 5, although an optical system different from that for user image photographing is used for photographing the index image, it may be photographed using the same optical system. In this case, one input image may be branched into two and the same processing as that in the above-described embodiment may be performed, or the user image shooting and the index image shooting may be performed alternately by temporally dividing.

【００９８】以上の画像情報アクセス装置によれば、索
引をつけるべき画像を提示した際の人間の眼球運動を観
測し、その運動状態から注視点を判定し、注視点を中心
とした画像部分に特徴となる画像要素を求めるので、画
像中のすべての物体に対する認識を行うことなく索引を
作成することができる。また、そうして作成された索引
には撮影者が関心を示した物体が抽出されており、効率
のよい索引を作成することができる。According to the above-mentioned image information access device, the eye movement of a human being when an image to be indexed is presented is observed, the gazing point is judged from the movement state, and the image portion centered on the gazing point is displayed. Since the characteristic image element is obtained, the index can be created without recognizing all the objects in the image. In addition, since the object that the photographer is interested in is extracted from the index thus created, an efficient index can be created.

【００９９】さらに画像認識で取り扱うべき領域が減少
するため、索引作成処理が簡素化され、処理速度の向上
が望める。また記録量も減少するため、記録媒体の節約
が可能になる。Further, since the area to be handled by the image recognition is reduced, the index creation process can be simplified and the processing speed can be improved. Further, since the recording amount is reduced, the recording medium can be saved.

【０１００】次に、図７〜図９を参照して、本発明の実
施例に係る遠隔操作装置について説明する。Next, a remote control device according to an embodiment of the present invention will be described with reference to FIGS.

【０１０１】図７は、この遠隔操作装置の機能ブロック
図である。FIG. 7 is a functional block diagram of this remote control device.

【０１０２】まずユーザの手の存在する領域を距離画像
入力手段７０１で入力し、変化量検出手段７０２で距離
画像の時間差分を求める。動き物体検出手段７０３は、
距離画像とその時間差分から、どの程度の大きさの物体
がどの方向へ移動したかを計算する。この一連の処理に
より、ユーザが手を前方に伸ばしたときに、その動きを
検出することができる。First, the area where the user's hand is present is input by the distance image input means 701, and the change amount detection means 702 obtains the time difference of the distance image. The moving object detection means 703
From the distance image and its time difference, the size of the object moved in which direction is calculated. By this series of processing, when the user extends his / her hand forward, the movement can be detected.

【０１０３】上述したそれぞれの手段について、より詳
細に説明する。Each of the above means will be described in more detail.

【０１０４】図９のステップＳ１〜Ｓ４は、この装置で
の処理の流れを示している。Steps S1 to S4 in FIG. 9 show the flow of processing in this apparatus.

【０１０５】先ず、距離画像から動き領域を抽出する
（Ｓ１）。距離画像入力手段７０１としては様々なもの
が提案されている。非接触で距離を測定できるものとし
て、三角測量の原理を利用したステレオ法、レンズ焦点
面の検出による方法、変調光の投射と反射光の位相差ま
たは時間差による計測などがある。変化量検出手段７０
２では、時間軸で連続した２つのフレームの差分をとる
ことにより、変化のない背景画像を除去する。動き物体
検出手段７０３は距離画像および距離画像の変化量から
動きのある物体を検出する。First, the moving area is extracted from the distance image (S1). Various types of distance image input means 701 have been proposed. Non-contact distance measurement methods include a stereo method that uses the principle of triangulation, a method that detects the focal plane of the lens, and a method that measures the phase difference or time difference between the projected modulated light and the reflected light. Change amount detecting means 70
In 2, the background image having no change is removed by taking the difference between two consecutive frames on the time axis. The moving object detection unit 703 detects a moving object from the distance image and the change amount of the distance image.

【０１０６】次に動き領域の中で距離が近い画素を分離
して同一オブジェクトであると判定する（Ｓ２）。そし
て、オブジェクトの距離と大きさとから手であるかどう
かを判定する（Ｓ３）。手を伸ばしたときにそれを検出
することを目的とするならば、手と同程度の大きさの物
体を検出できれば良い。この時たまたま手に何かを持っ
たまま操作をすることも考えられるため、厳密に手の形
状を判定する必要はない。むしろ、手と同程度の物体を
誤って手と判断してしまわないよう、この物体がどのよ
うな動きをしたときに手と判断するかというところを最
適に決めねばならない。ここでは、手を前方に伸ばした
ときに検出されるようにする。したがって手と同程度の
大きさの物体が、検出器に向かってある範囲の距離だけ
前進してきたことを検出する（Ｓ４）。Next, pixels having a short distance in the motion area are separated and it is determined that they are the same object (S2). Then, it is determined from the distance and size of the object whether it is a hand (S3). For the purpose of detecting the hand when it is extended, it is only necessary to be able to detect an object having the same size as the hand. At this time, it is conceivable that the operator happens to hold something in his hand, so it is not necessary to strictly determine the shape of the hand. Rather, in order not to mistakenly judge an object as large as a hand to be a hand, it is necessary to optimally decide what movement of this object should be judged as a hand. Here, it is detected when the hand is extended forward. Therefore, it is detected that an object about the same size as the hand has advanced toward the detector by a certain range of distance (S4).

【０１０７】ノイズを低減したいならば、手と同程度の
物体が近づいてきた後、一定時間静止したとき手と判断
させるようにすれば良い。また、指の太さより細かい空
間分解能があれば、手と同程度の大きさの物体があり、
さらにその一部が局所的に検出器の方向に出っぱってい
るときにも手であると判定する方法もある。この場合
は、指を検出器の方向に向けて、手を伸ばしたときに検
出される。また、ペンなどの棒状のものを検出器に向け
て手を伸ばしても同様の効果が得られ、人がポイントす
る作業において自然な形である。If it is desired to reduce the noise, it may be determined that the object is a hand when the object of the same size as the hand approaches and then remains stationary for a certain period of time. Also, if there is a spatial resolution smaller than the thickness of a finger, there is an object that is about the size of a hand,
Further, there is also a method of determining that it is a hand even when a part of it locally protrudes toward the detector. In this case, it is detected when the hand is extended with the finger directed toward the detector. Moreover, the same effect can be obtained by extending a hand with a rod-shaped object such as a pen toward the detector, which is a natural shape in a work pointed by a person.

【０１０８】図８は本発明の他の遠隔操作装置の実施例
を示す機能ブロック図である。FIG. 8 is a functional block diagram showing another embodiment of the remote control device of the present invention.

【０１０９】距離画像入力手段８０１、変化量検出手段
８０２及び動き物体検出手段８０３は図１と同じもので
あり、動き物体検出手段８０３が手を検出したあとの処
理を下側の部分で行っている。図９は、この装置の処理
を示している。The distance image input means 801, the change amount detecting means 802 and the moving object detecting means 803 are the same as those in FIG. 1, and the processing after the moving object detecting means 803 detects the hand is performed in the lower part. There is. FIG. 9 shows the processing of this device.

【０１１０】動き物体追跡手段８０３が手と同程度の大
きさの物体が手前に近付いて来たことを検出すると（Ｓ
４）、以降の処理は動き物体追跡手段８０４に任され
る。処理が動き物体検出手段８０３に移ると、カーソル
表示手段８０５は動き物体追跡手段８０４の追跡結果に
したがって手の指し示す部分にカーソルを表示する（Ｓ
５）。動き物体追跡手段８０４で検出された動きベクト
ルは順次動きベクトル蓄積部８０６に蓄積されていく
（Ｓ６）。動きコマンド判定部８０７は、手があらかじ
め与えられた特別な動きをしたとき、それを検知する
（Ｓ７）。特別な動きが検出された場合には、コマンド
実行部８０９が、あらかじめ登録されたコマンドを実行
する（Ｓ８）。どのうよな動きをしたときにどのような
コマンドを実行するかは、あらかじめ動きコマンド登録
部８０８に登録されている。ここに登録される手の動き
及びそれに対応するコマンドとしては、例えば、一定時
間静止した場合、そのポイントにあるメニューを選択す
る、一定時間静止した後わずかな距離だけに下方に動
き、またもとに戻ってきたとき（つまり、ボタンを“押
す”ような動作をしたとき）、そのポイントにあるメニ
ューを選択する、ある場所で、開いていた手を握り、動
かしてまた開くことで、マウスのドラッグ操作をする
（開いた閉じたは、手と検知した物体の大きさの変化を
捕らえる）、など様々考えられる。When the moving object tracking means 803 detects that an object having the same size as the hand approaches the front (S
4), the subsequent processing is entrusted to the moving object tracking means 804. When the processing moves to the moving object detecting means 803, the cursor display means 805 displays the cursor on the portion pointed by the hand according to the tracking result of the moving object tracking means 804 (S).
5). The motion vector detected by the moving object tracking unit 804 is sequentially stored in the motion vector storage unit 806 (S6). The movement command determination unit 807 detects when the hand makes a special movement given in advance (S7). When a special movement is detected, the command execution unit 809 executes a command registered in advance (S8). The command to be executed when performing any movement is registered in advance in the motion command registration unit 808. The hand movements registered here and the commands corresponding thereto include, for example, when the user has been stationary for a certain period of time, the menu at that point is selected, and after moving for a certain period of time, it moves downward only a short distance, and When you come back to (that is, when you do something like "push" a button), select the menu at that point, hold your open hand at some point, move it and open it again There are various possibilities such as a drag operation (open / closed captures a change in the size of an object detected as a hand).

【０１１１】次に、図１０〜図１２を参照して、本発明
の実施例に係る情報記録装置について説明する。Next, with reference to FIGS. 10 to 12, an information recording apparatus according to an embodiment of the present invention will be described.

【０１１２】図１０は、本発明の実施例に係る情報再生
機器の機能ブロック図である。FIG. 10 is a functional block diagram of the information reproducing apparatus according to the embodiment of the present invention.

【０１１３】本実施例では一つの動画像、静止画像、音
声、テキストなどの情報ファイルはマップ上では一つの
アイコンで表現し、情報ファイル間に張られたリンクは
それぞれの情報ファイルに対応するアイコンを線で結ぶ
ことにより表現する。In this embodiment, one moving image, still image, voice, text, etc. information file is represented by one icon on the map, and the link between the information files is an icon corresponding to each information file. Express by connecting with a line.

【０１１４】入力インタフェース９０１は、リモコンボ
タン、手書き入力装置、キーボード、またはマウス、ト
ラックボールなどのポインティングデバイスである。情
報ファイルとリンクの構造をマップ表示させる命令は、
ユーザが入力インタフェース９０１を通して行う。情報
ファイル・リンク記録装置９０２は、ハードディスクや
光ディスク等の大容量記録媒体である。ここには、動画
像、静止画像データのほか、音声データやテキストデー
タも記録することができ、また、これらの間に張られた
リンクの情報も記録されている。表示装置９０３は、映
像、メッセージなど、ユーザへの表示を行う。組み合わ
せ最適化手段９０４は、情報ファイルのアイコンの配置
の中から、与えられたパラメータを最大、もしくは最小
にする組み合わせを計算し、出力する。最も単純な組み
合わせ最適化手段９０４は全探索を行うもので、すべて
の組み合わせについてパラメータの値を計算し、この値
を最大もしくは最小にする組み合わせを出力する。制御
装置９０５は、システム全体の制御を行う。The input interface 901 is a remote control button, a handwriting input device, a keyboard, or a pointing device such as a mouse or a trackball. The command to display the structure of the information file and the link is as follows:
The user does this through the input interface 901. The information file / link recording device 902 is a large-capacity recording medium such as a hard disk or an optical disk. In addition to moving image and still image data, voice data and text data can be recorded here, and information on links between them can also be recorded. The display device 903 displays images, messages, etc. to the user. The combination optimizing unit 904 calculates and outputs a combination that maximizes or minimizes a given parameter from the arrangement of icons in the information file. The simplest combination optimizing means 904 performs a full search, calculates parameter values for all combinations, and outputs the combinations that maximize or minimize this value. The control device 905 controls the entire system.

【０１１５】ユーザは、情報ファイルとそのファイル間
に張られたリンクの構造が知りたいときには、入力イン
タフェース９０１にマップ表示の命令を入力する。マッ
プ表示の命令が入力されると、制御装置９０５は情報フ
ァイル・リンク記録装置９０２から情報ファイル及び情
報ファイル間に張られたリンク情報を読みとる。情報フ
ァイル・リンク記録装置９０２がハードディスク、光デ
ィスク、又はフロッピーディスクである場合には、ファ
イル名、どのようなアイコンを表示させるかといった情
報ファイルに関する情報、及びリンク情報をいったんバ
ッファに読み込んでおき、マップ表示の命令があったと
きには、バッファから情報ファイルに関する情報、リン
ク情報を読み込んでも良い。読み込まれた情報ファイル
に関する情報・リンク情報は、組み合わせ最適化手段９
０４に送られ、表示装置９０３上での情報ファイル・リ
ンクの表示位置が決定される。When the user wants to know the structure of the information file and the link provided between the files, the user inputs a map display command to the input interface 901. When the map display command is input, the control device 905 reads the information file and the link information set between the information files from the information file / link recording device 902. If the information file / link recording device 902 is a hard disk, an optical disk, or a floppy disk, the file name, information about the information file such as what icon to display, and the link information are read into the buffer once and the map is created. When there is a display command, information about the information file and link information may be read from the buffer. The information / link information about the read information file is the combination optimizing means 9.
04, and the display position of the information file link on the display device 903 is determined.

【０１１６】この表示位置の決定により、例えば、通常
ならば図１１のように表示される情報ファアル・リンク
のマップが、図１２のようにより見やすくなって表示さ
れる。ただし、（１）〜（２）は映像ファイルのアイコ
ン、これらを結ぶ線は情報ファイルの間に張られている
リンクを表している。By the determination of the display position, for example, the map of the information fal link which is normally displayed as shown in FIG. 11 is displayed more easily as shown in FIG. However, (1) and (2) represent icons of video files, and the lines connecting these represent the links provided between the information files.

【０１１７】表示位置決定は、例えば次のように行われ
る。まず、表示装置９０３の表示用画面をいくつかの区
画に分割する。この区画には情報ファイルを表す情報フ
ァイルアイコンが表示される。一つの区画には最大で１
つの情報ファイルアイコンが表示される。リンクの張ら
れたファイル同士はそれを表す情報ファイルアイコン同
士が、リンクを表す線で結ばれるように表示される。情
報ファイルアイコンの表示位置の組み合わせは、情報フ
ァイルアイコンの個数をＮ，区画の数をＭとすると、Ｍ
！／（Ｍ−Ｎ）！通りある。この組み合わせの中から、
組み合わせ最適化手段９０４は一つの組み合わせを選ぶ
のだが、それは組み合わせによって特定される見やすさ
を代表するパラメータの値により決められる。すなわ
ち、見やすさの代表パラメータとして、リンクを表す線
の交差の数が採用される場合には、この値を最小にする
組み合わせを選ぶ、パラメータを最小にする組み合わせ
が複数存在する場合にはどれを選んでも良い。この見や
すさの代表パラメータとしては交差しているリンク線の
数のほかに、リンク線の長さの分散、リンク線の長さの
総和なども使うことができる。これらの一つを使って最
適な組み合わせを選ぶこともできるし、また、いくつか
のパラメータを複合して新たに一つのパラメータを作
り、それを使って最適な組み合わせを選んでも良い。こ
のほか、一画面内に表示できるか、情報ファイルアイコ
ンの表示に片寄りがないか等を考慮しても良い。The display position is determined, for example, as follows. First, the display screen of the display device 903 is divided into some sections. An information file icon representing an information file is displayed in this section. Maximum of 1 in one section
Two information file icons are displayed. Files with links are displayed so that information file icons representing them are connected by a line representing a link. The combination of the display positions of the information file icons is M, where N is the number of information file icons and M is the number of sections.
!! / (M-N)! There is a street. From this combination,
The combination optimizing means 904 selects one combination, which is determined by the value of the parameter representing the legibility specified by the combination. That is, when the number of intersections of lines that represent links is adopted as a representative parameter of visibility, select a combination that minimizes this value, and if there are multiple combinations that minimize the parameter, select which one. You may choose. In addition to the number of intersecting link lines, the distribution of the lengths of the link lines, the total sum of the lengths of the link lines, and the like can be used as the representative parameter of the visibility. It is possible to use one of these to select the optimum combination, or to combine several parameters to create a new parameter and use that to select the optimum combination. In addition, it may be possible to consider whether the information can be displayed on one screen, or whether the information file icons are displayed in one direction.

【０１１８】組み合わせ最適化手段９０４はこれらパラ
メータを最適にする組み合わせを選ぶのだが、それには
最も単純な方法として、すべての組み合わせについてパ
ラメータを計算し、最適なものを選ぶという方法があ
る。一般にこの方法はＮやＭが大きくなると大変な計算
を要する。そのため、すべてについて計算するのではな
く、一部の組み合わせについてパラメータを計算し、そ
の中で最適なものを選び、実際の最適解に近似するとい
う方法もある。一般には、全ての組み合わせについて計
算しなければ最適な組み合わせは得られないが、必ずし
も最適でなくてもある程度の最適化が計られていれば良
い場合には、最適解の近似法を用いることが多い。本実
施例においても、このような近似を用いても良い。The combination optimizing means 904 selects a combination that optimizes these parameters. The simplest method is to calculate parameters for all combinations and select the optimum one. Generally, this method requires a great deal of calculation as N and M increase. Therefore, there is also a method in which the parameters are calculated for some combinations instead of calculating all, and the optimum one is selected and approximated to the actual optimum solution. In general, the optimum combination cannot be obtained unless calculation is performed for all combinations. However, if it is not always the optimum combination and some optimization is sufficient, use the approximation method of the optimum solution. Many. Also in this embodiment, such an approximation may be used.

【０１１９】組み合わせ最適化手段９０４により情報フ
ァイルアイコン・リンクの表示位置が決まると、それに
従って表示装置９０３にマップが表示される。このと
き、マップの部分的なズームやパンを入力インタフェー
ス９０１を介しての命令入力によって行うことも可能で
ある。When the display position of the information file icon / link is determined by the combination optimizing means 904, the map is displayed on the display device 903 accordingly. At this time, it is also possible to partially zoom or pan the map by inputting a command via the input interface 901.

【０１２０】[0120]

【発明の効果】以上述べたように、本発明の画像情報ア
クセス装置によれば、画像記録または再生時の使用者の
視線を検出し、それを画像情報とともに記録して、使用
者にとって特徴的な物体を自動的かつ選択的に抽出する
ようにしているので、使用者の負担が少なく、簡単な処
理系で効率的な索引作成、および処理時間や記録媒体消
費量の削減が可能になる。As described above, according to the image information access apparatus of the present invention, the line of sight of the user at the time of recording or reproducing an image is detected, and the line of sight of the user is recorded together with the image information, which is characteristic for the user. Since various objects are automatically and selectively extracted, the burden on the user is small, and it is possible to efficiently create an index with a simple processing system and reduce processing time and recording medium consumption.

【０１２１】また、本発明の遠隔操作装置によれば、リ
モコン装置などの一切のデバイスを手に持つことなく、
また装着することなく、素手のみでシステムを操作する
環境を実現することができる。これにより、家庭内での
現在のリモコンの置き換えに留まらず、手にものを持ち
ながら操作を行いたい、汚れた手で機械を操作したいと
いった要求にも応えることができる。Further, according to the remote control device of the present invention, without holding any device such as a remote control device in the hand,
In addition, it is possible to realize an environment in which the system can be operated only with bare hands without wearing it. As a result, it is possible not only to replace the current remote controller in the home, but also to meet the demands such as wanting to operate while holding something in the hand and operating the machine with dirty hands.

【０１２２】また、本発明の情報記録装置によれば、フ
ァイルアイコンの表示位置とファイル間に張られている
リンクの表示の関係をも考慮してファイルアイコンの表
示位置を決定しているため、複雑な構造であるハイパー
テキスト構造になっているファイルのマップ表示におい
ても、ユーザにとって見やすい表示ができるようにな
る。その結果、ユーザが自分の探しているファイルを素
早く見つけたり、自分の見ているファイルの位置づけの
把握やハイパーテキスト構造を容易に再構築したりとい
うことが可能になる。本発明は、ハイパーテキスト構造
を有する全てのもののマップ表示に適用可能である。Further, according to the information recording apparatus of the present invention, the display position of the file icon is determined in consideration of the display position of the file icon and the display of the link between the files. Even in the map display of a file having a hypertext structure, which is a complicated structure, the user can easily view the map. As a result, the user can quickly find the file he is looking for, grasp the position of the file he is looking at, and easily reconstruct the hypertext structure. The present invention is applicable to map displays of anything with a hypertext structure.

[Brief description of drawings]

【図１】本発明の実施例に係る画像記録装置の構成を
示すブロック図。FIG. 1 is a block diagram showing the configuration of an image recording apparatus according to an embodiment of the present invention.

【図２】本発明の実施例に係る索引作成装置の構成を
示すブロック図。FIG. 2 is a block diagram showing a configuration of an index creating device according to an embodiment of the present invention.

【図３】本発明の実施例に係る画像再生検索装置の構
成を示すブロック図。FIG. 3 is a block diagram showing a configuration of an image reproduction / retrieval device according to an embodiment of the present invention.

【図４】本発明の実施例に係る画像記録装置兼索引装
置の構成を示すブロック図。FIG. 4 is a block diagram showing the configuration of an image recording device / index device according to an embodiment of the present invention.

【図５】本発明の実施例に係る他の画像記録装置の構
成を示す概念図。FIG. 5 is a conceptual diagram showing the configuration of another image recording apparatus according to the embodiment of the invention.

【図６】図５の画像記録装置の構成を示すブロック
図。6 is a block diagram showing the configuration of the image recording apparatus of FIG.

【図７】本発明の実施例に係る遠隔操作装置の機能ブ
ロック図。FIG. 7 is a functional block diagram of the remote control device according to the embodiment of the present invention.

【図８】本発明の実施例に係る他の遠隔操作装置の機
能ブロック図。FIG. 8 is a functional block diagram of another remote control device according to the embodiment of the present invention.

【図９】同装置の処理を示すフローチャート。FIG. 9 is a flowchart showing processing of the apparatus.

【図１０】本発明の実施例に係る情報再生機器の機能
ブロック図。FIG. 10 is a functional block diagram of an information reproducing device according to an embodiment of the present invention.

【図１１】同装置における最適化前のマップを示す
図。FIG. 11 is a diagram showing a map before optimization in the apparatus.

【図１２】同装置における最適化後のマップを示す
図。FIG. 12 is a diagram showing a map after optimization in the apparatus.

[Explanation of symbols]

１０１…映像信号、１０４…記録信号発生装置、１０５
…記憶媒体、１０６…眼球運動信号、１０８…眼球位置
検出器、１１０…操作信号、２０２… 記憶信号読出装
置、２０６…エッジ認識装置、２０７…画像部分抽出
器、２０８…重複検査器、２０９…画像選定器、２１９
…種別判定器、２２０…注視点決定器、３０３…映像信
号、３０５…画面作成器、３０７…モニター出力端子、
３０８…関連フレーム情報、３１７…一覧表作成装置、
４０１…映像信号、４１２…操作信号、４１５…眼球運
動信号、５０１…画像記録装置本体、５０２…視線検出
器、５０４…撮影者、５０５…被写体、５０８…索引画
像、６０１…ユーザ画像信号、６０６…索引画像信号、
６０８…ディジタルズーム、６１４…ユーザ画像ズーム
フォーカス信号、６１６…カット検出器、６１７…ズー
ムコントロール信号、６１８…操作信号、６１９…カメ
ラ方向変位信号、６２０…眼球運動信号、６２４…検出
器位置信号、７０１，８０１…距離画像入力手段、７０
２，８０２…変化量検出手段、７０３，８０３…動き物
体検出手段、８０４…動き物体追跡手段、８０７…動き
コマンド判定部、９０１…入力インタフェース、９０２
…情報ファアル・リンク記録装置、９０３…表示装置、
９０４…組み合わせ最適化手段、９０５…制御装置。101 ... Video signal, 104 ... Recording signal generator, 105
Storage medium, 106 ... Eye movement signal, 108 ... Eye position detector, 110 ... Operation signal, 202 ... Storage signal reading device, 206 ... Edge recognition device, 207 ... Image part extractor, 208 ... Duplication checker, 209 ... Image selector 219
... Type judging device, 220 ... Gaze point deciding device, 303 ... Video signal, 305 ... Screen creating device, 307 ... Monitor output terminal,
308 ... Related frame information, 317 ... List creation device,
401 ... Video signal, 412 ... Operation signal, 415 ... Eye movement signal, 501 ... Image recording device main body, 502 ... Line-of-sight detector, 504 ... Photographer, 505 ... Subject, 508 ... Index image, 601 ... User image signal, 606 … Index image signal,
608 ... Digital zoom, 614 ... User image zoom focus signal, 616 ... Cut detector, 617 ... Zoom control signal, 618 ... Operation signal, 619 ... Camera direction displacement signal, 620 ... Eye movement signal, 624 ... Detector position signal, 701, 801 ... Distance image input means, 70
2, 802 ... Change amount detecting means, 703, 803 ... Moving object detecting means, 804 ... Moving object tracking means, 807 ... Motion command determining section, 901 ... Input interface, 902
... information false link recording device, 903 ... display device,
904 ... Combination optimization means, 905 ... Control device.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｇ０６Ｔ 1/00 7/00 Ｇ０６Ｆ 15/62 ３８０４１５ ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification code Internal reference number FI Technical display location G06T 1/00 7/00 G06F 15/62 380 415

Claims

[Claims]

1. An image information recording means for recording image information, a visual axis detecting means for detecting a position of a visual axis of a user with respect to the image information when accessing the image information recording means, and the visual axis detecting means for detecting the visual axis position. Line-of-sight position information storage means for storing the position of the selected line-of-sight in association with the image information, and in the image information recorded in the image information recording means based on the position of the line-of-sight stored in the line-of-sight position information storage means. An image information access device, comprising: index information generating means for extracting a portion of interest and generating index information for searching the image information.

2. A range image input means for inputting a range image, a change amount detecting means for detecting a change amount of the range image input to the range image input means, and a range image obtained and a movement based on the change amount. A moving object detecting means for recognizing that the moving object is the operator's hand by calculating the size, direction and moving distance of the moving object, and the hand detected by the moving object detecting means. A moving object tracking means for tracking the movement of the moving object on the screen, a cursor display means for displaying a cursor on the screen according to the tracking result by the moving object tracking means, and a hand tracked by the moving object tracking means is predetermined. A remote control device comprising: a command execution means for executing a predetermined command registered in advance based on the cursor position when it is detected that the user has stopped for a certain time.

3. A range image input means for inputting a range image, a change amount detecting means for detecting a change amount of the range image input to the range image input means, and a range image obtained and a movement based on the change amount. A moving object detecting means for recognizing that the moving object is the operator's hand by calculating the size, direction and moving distance of the moving object, and the hand detected by the moving object detecting means. A moving object tracking means for tracking the movement in the screen, a cursor display means for displaying a cursor on the screen according to the tracking result by the moving object tracking means, and a hand that has been tracked by the moving object tracking means in advance. A remote control device comprising: a command executing means for executing a predetermined command registered in advance based on the position of the cursor when detecting that a predetermined movement is made.

4. A recording means for storing a plurality of files in association with each other by linking them with each other and a structure for displaying a plurality of files recorded in the recording means and a structure of links between them, which is easy to see. And a linking structure between the plurality of files specified by the optimal parameter obtained by this optimizing means and the optimizing means for evaluating the parameter that determines An information recording apparatus comprising: a display unit for displaying information.