JP3380937B2

JP3380937B2 - Method for identifying object in moving image and apparatus for identifying object in moving image

Info

Publication number: JP3380937B2
Application number: JP19071694A
Authority: JP
Inventors: 精一紺谷; 潮井上; 哲司佐藤; 良治片岡
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1994-08-12
Filing date: 1994-08-12
Publication date: 2003-02-24
Anticipated expiration: 2018-02-24
Also published as: JPH0855131A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、動画像内の物体の同定
方法及び動画像内の物体の同定装置に係り、利用者が動
画像を見ながら指示した物体を同定する動画像内の物体
の同定方法及び動画像内の物体の同定装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method of identifying an object in a moving image and an apparatus for identifying an object in a moving image, and an object in a moving image for identifying an object designated by a user while looking at the moving image. And an apparatus for identifying an object in a moving image.

【０００２】詳しくは、利用者が指示した動画像情報内
の物体に関する情報等を提示する動画像ナビゲーション
システム等において、利用者の指示した動画像情報内の
物体を正しく同定する動画像内の物体の同定方法及び動
画像内の物体の同定装置に関する。More specifically, in a moving image navigation system or the like that presents information about an object in moving image information specified by a user, an object in a moving image that correctly identifies an object in moving image information specified by a user. And an apparatus for identifying an object in a moving image.

【０００３】[0003]

【従来の技術】従来、動画像内の物体を同定する方法と
しては以下の方法がある。従来の方法は、動画像を二次
元の画像と時間軸からなる三次元の時空間としてとら
え、動画像内の物体毎に、当該物体を内包する領域を管
理情報として設定しておき、利用者が指示した画像の位
置と時間の組、即ち利用者が指示した動画像内の座標を
含む領域を持つ物体を管理情報を用いて選択することで
動画像内の物体を同定している（高野元、的場ひろ
し、原良憲：“ビデオ・ハイパーメディアのナビゲー
ション方式”、8th Symposium on Human Interface，pp
607-612,Oct.21-23,1992）。2. Description of the Related Art Conventionally, there have been the following methods for identifying an object in a moving image. The conventional method considers a moving image as a three-dimensional space-time consisting of a two-dimensional image and a time axis, sets an area containing the object as management information for each object in the moving image, and The object in the moving image is identified by using the management information to select an object having a position and time pair of the image specified by, that is, an object having a region including the coordinates in the moving image specified by the user (Takano Former, Hiroshi Matoba, Yoshinori Hara: “Navigation method of video and hypermedia”, 8th Symposium on Human Interface, pp
607-612, Oct. 21-23, 1992).

【０００４】図１０は、従来の方法を説明する図を示
す。図１０（Ａ）は、図１０（Ｂ）に示す映像フレーム
１０内を、図１０（Ｃ）に示す三角形の物体２０が、時
刻ｔ₀から時刻ｔ₄（ｔ₀＜ｔ₄）までの時間経過に伴
い映像フレーム１０内を移動している様子を示してお
り、時刻ｔ₀における映像フレーム１０（座標（０，
０，０）、（ｘ，０，ｔ₀）、（ｘ，ｙ，ｔ₀）、
（０，ｙ，ｔ₀）の４点を頂点とする四角形の領域）か
ら時刻ｔ₄における映像フレーム１０までの５枚の映像
フレーム１０を、時刻の昇順に時間軸（ｔ）方向に平行
に並べた例を示している。映像フレーム１０内の物体２
０は、時刻ｔ₀から時刻ｔ₄への時間経過に伴い、各映
像フレーム１０内を右下から左上に向かって移動してい
る、映像情報として記録されている物体２０である。ま
た、各映像フレーム１０内の物体２０を内包する四角形
は、物体２０を同定することが可能な領域を示してお
り、映像情報として記録されているものではなく上述の
管理情報として蓄積されている。図１０（Ｄ）は、図１
０（Ａ）に示す時刻ｔ₀から時刻ｔ₄までの各映像フレ
ーム１０内で物体２０を同定することが可能な領域を示
している。この、物体２０を同定することが可能な領域
というのは、利用者が図１０（Ａ）に示す動画像を見な
がら当該動画像内の物体２０を指示する際に、物体２０
の内側の領域でなくても、物体２０の周囲の四角形の領
域内を指示していれば、利用者が物体２０を指示したも
のとして同定することが可能な領域のことである。映像
フレーム１０内の物体２０は時間の経過に伴ってその位
置を右下から左上へと変えているため、例えば、利用者
が時刻ｔ₂の映像フレーム１０内で物体２０の右下を指
示したつもりでも、指示しようと考えてから実際に指示
するまでには多少の時間（仮に、Δｔ＝ｔ₃−ｔ₂とす
る）を要する。そのため、指示した座標が入力された時
（ｔ₃＝ｔ₂＋Δｔ）には、時刻ｔ₃の映像フレーム１
０となってしまい、当該映像フレーム内の指示された座
標には既に物体２０は存在していないという場合が考え
られ、利用者の意図した物体２０を正しく指示すること
ができない場合がある。このような場合、特に物体２０
の移動速度が速い場合や、物体２０の大きさが小さい場
合などであっても利用者が物体２０をできるだけ正しく
指示することができるように、映像フレーム１０内の各
物体２０毎に実際の物体２０よりも広い範囲を当該物体
を同定することが可能な領域として予め設定しているの
である。FIG. 10 shows a diagram for explaining a conventional method. 10A shows the time from the time t ₀ to the time t ₄ (t ₀ <t ₄ ) of the triangular object 20 shown in FIG. 10C in the video frame 10 shown in FIG. 10B. elapsed shows a state in which moving image frame 10 with the video frame 10 (coordinates (0 at time t _0,
0,0), (x, 0, t ₀ ), (x, y, t ₀ ),
Five video frames 10 from (quadrangle region having four points of (0, y, t ₀ ) as vertices) to video frame 10 at time t ₄ are arranged in parallel in the time axis (t) direction in ascending order of time. An example of arranging is shown. Object 2 in video frame 10
0 is an object 20 recorded as video information, which is moving from the lower right to the upper left in each video frame 10 as time elapses from time t ₀ to time t ₄ . Further, a quadrangle including the object 20 in each video frame 10 indicates a region in which the object 20 can be identified, and is not recorded as video information but is stored as the management information described above. . FIG. 10D is the same as FIG.
The area in which the object 20 can be identified is shown in each video frame 10 from time t ₀ to time t ₄ shown in 0 (A). This area in which the object 20 can be identified means that when the user points to the object 20 in the moving image while looking at the moving image shown in FIG.
Even if the area is not inside, the area can be identified by the user as having instructed the object 20 if the area within the rectangular area around the object 20 is instructed. Since the position of the object 20 in the video frame 10 changes from the lower right to the upper left with the passage of time, for example, the user instructs the lower right of the object 20 in the video frame 10 at time t ₂ . Even if it is intentional, it takes some time (assuming Δt = t ₃ −t ₂ ) from the time of thinking to instruct to the time of actually instructing. Therefore, when the designated coordinates are input (t ₃ = t ₂ + Δt), the video frame 1 at time t ₃
It may be 0, and the object 20 may not already exist at the specified coordinates in the video frame, and the object 20 intended by the user may not be correctly specified. In such a case, especially the object 20
In order to allow the user to indicate the object 20 as accurately as possible even when the moving speed of the object is fast or the size of the object 20 is small, the actual object for each object 20 in the video frame 10 A range wider than 20 is preset as a region in which the object can be identified.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記従
来の方法では、以下に示す４つの問題点がある。However, the above-mentioned conventional method has the following four problems.

【０００６】第１の問題点は、利用者が指示した映像フ
レーム１０内の物体２０をシステムに正しく同定させる
ためには、利用者が当該システムに予め設定されている
各物体２０を同定することができる領域内を指示しなけ
ればならない点である。利用者が、予めシステムに設定
されている物体２０を同定することができる領域に対し
て少しでも外側の座標を指示してしまうと、システムは
物体２０を指示された物体２０として正しく同定するこ
とはできないため、利用者は注意深く物体２０を指示し
なければならない。The first problem is that in order for the system to correctly identify the object 20 in the video frame 10 designated by the user, the user must identify each object 20 preset in the system. The point is that you must specify the area within which you can do it. When the user designates coordinates outside even for a region in which the object 20 set in the system can be identified in advance, the system correctly identifies the object 20 as the designated object 20. Therefore, the user must carefully point the object 20.

【０００７】第２の問題点は、上記従来の方法では、第
１の問題点による影響を軽減するために、映像フレーム
１０内に存在する物体の数が少ない場合には、個々の物
体を同定することができる領域をできるだけ広くとり、
また、映像フレーム１０内に存在する物体の数が多い場
合には、個々の物体を同定することができる領域を狭く
とる等の調整を行なわなければならないという点であ
る。そのため、個々の物体に対する領域の設定に多大の
工程を要することになる。The second problem is that in the above conventional method, in order to reduce the influence of the first problem, when the number of objects existing in the video frame 10 is small, individual objects are identified. Take as much area as you can
In addition, when the number of objects existing in the video frame 10 is large, it is necessary to make adjustments such as narrowing the area where each object can be identified. Therefore, a large number of steps are required to set the area for each object.

【０００８】第３の問題点は、動きの速い物体を同定す
る場合、利用者が物体を指示する操作が物体の移動に追
いつかないことがあるため、物体を同定することができ
る領域を物体の移動速度を考慮した領域として設定して
おかなければならない点である。この場合にも設定に多
くの工程を要する。A third problem is that when a fast-moving object is identified, the operation of pointing the object by the user may not catch up with the movement of the object. This is a point that must be set as an area considering the moving speed. Also in this case, many steps are required for setting.

【０００９】第４の問題点は、動画像の再生速度が速い
場合などにも、上記第３の問題と同様の問題が発生する
ため、動画像の再生速度に応じて物体を同定することが
可能な領域を複数設定しておかなければならないという
点である。この問題も、上記第２の問題点、第３の問題
点と同様設定に多くの工程を要する。A fourth problem is that the same problem as the third problem occurs even when the moving image reproducing speed is high, so that an object can be identified according to the moving image reproducing speed. The point is that multiple possible areas must be set. This problem also requires many steps for setting like the second problem and the third problem.

【００１０】本発明は、上記の点に鑑みなされたもの
で、予め動画像情報内の個々の物体毎に、当該物体を同
定することができる領域を設定して管理情報として蓄積
しておくことなく、正しく物体を同定することが可能な
動画像内の物体の同定方法及び動画像内の物体の同定装
置を提供することを目的とする。The present invention has been made in view of the above points. For each individual object in moving image information, an area in which the object can be identified is set in advance and stored as management information. It is an object of the present invention to provide a method for identifying an object in a moving image and an apparatus for identifying an object in a moving image that can correctly identify an object.

【００１１】また、指示された座標が物体の外側であっ
たり、動画像情報内の物体の数が多い場合であっても、
利用者の意図した物体を正しく同定することが可能な動
画像内の物体の同定方法及び動画像内の物体の同定装置
を提供することを目的とする。Further, even when the designated coordinates are outside the object or the number of objects in the moving image information is large,
An object of the present invention is to provide a method for identifying an object in a moving image and an apparatus for identifying an object in a moving image that can correctly identify an object intended by a user.

【００１２】また、動画像情報内を移動する物体の移動
速度が速い場合や、動画像情報の再生速度が速い場合等
であっても、利用者の意図した物体を正しく同定するこ
とが可能な動画像内の物体の同定方法及び動画像内の物
体の同定装置を提供することを目的とする。Further, even if the moving speed of the object moving in the moving image information is high, or the moving image information reproducing speed is high, the object intended by the user can be correctly identified. An object is to provide a method for identifying an object in a moving image and an apparatus for identifying an object in a moving image.

【００１３】[0013]

【課題を解決するための手段】図１は、本発明の原理説
明図を示す。FIG. 1 shows the principle of the present invention.

【００１４】本発明の、動画像内の物体の同定方法は、
予め蓄積された動画像情報を表示し（ステップ１）、利
用者が表示された動画像情報を見ながら、当該動画像情
報内の座標を指示する（ステップ２）。The method of identifying an object in a moving image according to the present invention is
The moving image information accumulated in advance is displayed (step 1), and the user indicates the coordinates in the moving image information while looking at the displayed moving image information (step 2).

【００１５】利用者により動画像情報内の座標が指示さ
れると、予め蓄積された当該動画像情報内の物体の位置
情報から物体を検索する（ステップ３）。When the user designates the coordinates in the moving image information, the object is searched from the position information of the object in the moving image information stored in advance (step 3).

【００１６】利用者によって指示された座標と上記ステ
ップ３で検索された物体との尤度を計算し、計算された
尤度に基づいて利用者の指示した座標に対応する動画像
情報内の物体を同定する（ステップ４）。The likelihood between the coordinate designated by the user and the object retrieved in step 3 is calculated, and the object in the moving image information corresponding to the coordinate designated by the user is calculated based on the calculated likelihood. Are identified (step 4).

【００１７】図２は、本発明の原理構成図を示す。FIG. 2 shows the principle configuration of the present invention.

【００１８】本発明の、動画像内の物体の同定装置１０
０は、動画像情報を蓄積する画像蓄積手段１１０と、画
像蓄積手段１１０に蓄積された動画像情報を表示する画
像表示手段１２０と、画像表示手段１２０に表示された
動画像情報を見ながら動画像情報内の座標を指示する座
標指示手段１３０と、動画像情報内の個々の物体の位置
情報を蓄積する位置情報蓄積手段１４０と、位置情報蓄
積手段１４０の中から物体を検索する検索手段１５０
と、動画像情報内の各座標と個々の物体の位置情報の組
に対して実数を対応させて尤度とする尤度計算手段１６
０と、指示された座標と該検索手段１５０により検索さ
れた物体の位置情報との組を、尤度計算手段１６０に出
力して尤度を計算し、計算された尤度に基づいて指示さ
れた座標に対応する物体を同定する同定手段１７０とを
有する。An apparatus 10 for identifying an object in a moving image according to the present invention.
0 is a moving image while watching the image storage means 110 for storing the moving picture information, the image display means 120 for displaying the moving picture information stored in the image storage means 110, and the moving picture information displayed on the image display means 120. Coordinate pointing means 130 for pointing the coordinates in the image information, position information storage means 140 for storing the position information of each object in the moving image information, and search means 150 for searching the object from the position information storage means 140.
And a likelihood calculating means 16 for calculating a likelihood by associating a real number with a set of each coordinate in the moving image information and position information of an individual object.
A set of 0, the designated coordinate, and the position information of the object searched by the searching means 150 is output to the likelihood calculating means 160 to calculate the likelihood, and the instruction is made based on the calculated likelihood. Identification means 170 for identifying an object corresponding to the coordinate set.

【００１９】上記検索手段１５０は、動画像情報内の各
座標に対して動画像情報内に検索領域を対応させて設定
する検索領域設定手段と、指示された座標を検索領域設
定手段に出力して、指示された座標に対する検索領域を
設定するとともに、位置情報蓄積手段１４０の中の検索
領域内に位置情報を有する物体を検索する検索領域内検
索手段とを有する。The search means 150 outputs a search area setting means for setting a search area in the moving image information corresponding to each coordinate in the moving image information and the designated coordinates to the search area setting means. In addition, the search area in the search area for the designated coordinates is set, and the search area in the search area in the position information storage means 140 is searched for an object having position information.

【００２０】上記検索領域設定手段は、指示された座標
に対して、動画像情報内の物体を確認してから指示する
までの遅延時間に基づいて、動画像情報内に検索領域を
設定する。The search area setting means sets the search area in the moving image information based on the delay time from the confirmation of the object in the moving image information to the instruction at the designated coordinates.

【００２１】上記尤度計算手段１６０は、指示された座
標と検索手段１５０によって検索された物体の位置情報
とから距離を計算し、計算された距離に基づいて尤度を
決定する。The likelihood calculating means 160 calculates the distance from the designated coordinates and the position information of the object searched by the searching means 150, and determines the likelihood based on the calculated distance.

【００２２】[0022]

【作用】本発明は、動画像情報内の指示された座標に基
づき物体を検索し、検索の結果に基づいて指示された物
体を同定することにより、予め動画像情報内の各物体毎
に、当該物体を同定することができる領域を設定して蓄
積していなくても、物体を同定することが可能となる。According to the present invention, by searching for an object based on the designated coordinates in the moving image information and identifying the designated object based on the result of the search, each object in the moving image information is previously identified. It is possible to identify an object without setting and accumulating a region in which the object can be identified.

【００２３】また、予め動画像情報内の各物体毎に、当
該動画像情報内における位置情報を蓄積しているため、
指示された座標に基づき全ての物体を検索することが可
能となる。Further, since the position information in the moving image information is previously stored for each object in the moving image information,
It becomes possible to search for all objects based on the designated coordinates.

【００２４】また、指示された座標に基づいて、動画像
情報内に物体を検索するための検索領域を設定して物体
を検索することにより、検索量、検索時間を短縮するこ
とが可能となる。Further, by setting a search area for searching an object in the moving image information based on the designated coordinates and searching for the object, it is possible to shorten the search amount and the search time. .

【００２５】また、表示された動画像情報内の物体を確
認してから実際に指示するまでの遅延時間を考慮し、動
画像情報内の物体を検索するための検索領域を当該遅延
時間に基づき設定することにより、動きの速い物体や、
再生速度の速い動画像情報等であっても、実際に座標が
指示された時刻の動画像情報内での物体の検索が可能と
なる。Further, considering the delay time from the confirmation of the object in the displayed moving image information to the actual instruction, a search area for searching for the object in the moving image information is based on the delay time. By setting, fast moving objects,
Even with moving image information or the like having a high reproduction speed, it is possible to search for an object in the moving image information at the time when the coordinates are actually designated.

【００２６】また、指示された座標と検索された各物体
の位置情報に基づき尤度を計算することで、利用者の指
示した物体を尤度に基づき一意に同定することが可能と
なる。Further, by calculating the likelihood based on the designated coordinates and the position information of each retrieved object, the object designated by the user can be uniquely identified based on the likelihood.

【００２７】[0027]

【実施例】以下、図面と共に本発明の実施例を詳細に説
明する。Embodiments of the present invention will now be described in detail with reference to the drawings.

【００２８】図３は、本発明の一実施例の構成図を示
す。同図に示す動画像内の物体の同定装置１００は、画
像蓄積部１１０、画像表示部１２０、座標指示部１３
０、位置情報蓄積部１４０、検索領域設定部１８０、尤
度計算部１６０、制御部２００から構成される。FIG. 3 shows a block diagram of an embodiment of the present invention. The apparatus 100 for identifying an object in a moving image shown in the figure includes an image storage unit 110, an image display unit 120, and a coordinate designating unit 13.
0, position information storage unit 140, search area setting unit 180, likelihood calculation unit 160, and control unit 200.

【００２９】画像蓄積部１１０は、動画像情報を蓄積し
ているデータベースであり、ｘ軸、ｙ軸、ｔ軸で指定さ
れる、０≦ｘ≦Ｘ、０≦ｙ≦Ｙ、０≦ｔ≦Ｔの範囲にあ
る動画像情報であり、ｘ軸、ｙ軸から構成される映像フ
レームを時刻（ｔ）毎に蓄積している。The image storage unit 110 is a database that stores moving image information, and is specified by the x-axis, y-axis, and t-axis, 0 ≦ x ≦ X, 0 ≦ y ≦ Y, 0 ≦ t ≦. This is moving image information in the range of T, and video frames composed of an x axis and ay axis are accumulated at each time (t).

【００３０】画像表示部１２０は、画像蓄積部１１０に
蓄積されている動画像情報を表示するディスプレイであ
る。The image display unit 120 is a display for displaying the moving image information stored in the image storage unit 110.

【００３１】座標指示部１３０は、画像表示部１２０に
表示された動画像情報内の座標（ｘ，ｙ，ｔ）を指示す
るマウスである。画像表示部１２０に表示された動画像
情報を見ながら、当該動画像情報内のある座標（ｘ，
ｙ）の位置で、当該位置を指示するためのボタンを押す
ことにより、ボタンを押した時刻ｔにおける動画像情報
内の座標として、座標（ｘ，ｙ，ｔ）の指定をすること
ができる。以下、座標指示部１３０の指示する座標は、
（ｐｘ，ｐｙ，ｐｔ）と表記する。The coordinate designating unit 130 is a mouse for designating coordinates (x, y, t) in the moving image information displayed on the image display unit 120. While watching the moving image information displayed on the image display unit 120, a certain coordinate (x,
By pressing the button for indicating the position at the position y), the coordinates (x, y, t) can be specified as the coordinates in the moving image information at the time t when the button is pressed. Hereinafter, the coordinates designated by the coordinate designating unit 130 are
Notated as (px, py, pt).

【００３２】位置情報蓄積部１４０は、画像蓄積部１１
０に蓄積される動画像情報内の各物体毎の代表点として
物体の中心の位置情報を蓄積している。各物体の位置情
報は、当該動画像情報内の各映像フレーム毎に、フレー
ム内の二次元の座標（ｘ，ｙ）と時刻ｔとからなる三次
元空間内の座標（ｘ，ｙ，ｔ）として表現される、動画
像情報の再生に伴って描く軌跡として蓄積されている。
画像蓄積部１１０に蓄積されている動画像情報内の物体
ｉの位置情報を、The position information storage unit 140 is the image storage unit 11.
Position information of the center of the object is stored as a representative point for each object in the moving image information stored in 0. The position information of each object is the coordinates (x, y, t) in the three-dimensional space consisting of the two-dimensional coordinates (x, y) in the frame and the time t for each video frame in the moving image information. Is stored as a locus drawn as the moving image information is reproduced.
The position information of the object i in the moving image information stored in the image storage unit 110 is

【００３３】[0033]

【数１】 [Equation 1]

【００３４】と表記し、物体ｉの時刻ｔにおける動画像
情報内のｘ座標とｙ座標はそれぞれ以下のように表記す
る。The x coordinate and the y coordinate in the moving image information of the object i at time t are expressed as follows.

【００３５】[0035]

【数２】 [Equation 2]

【００３６】以下に、図を用いて物体ｉの位置情報につ
いて説明する。図４は、本発明の一実施例の物体ｉの時
刻ｔにおける位置を説明する図を示す。同図に示す太い
実線は物体ｉの時刻０から時刻ｔまでの映像フレーム内
の移動の軌跡を示しており、太い破線は物体ｉの時刻ｔ
以後の移動の軌跡を示している。図中のＡで示した太い
実線と太い破線との接続点（Ａ）は、時刻ｔにおける座
標（Ｘ，０，ｔ）、（Ｘ，Ｙ，ｔ）、（０，Ｙ，ｔ）、
（０，０，ｔ）の４つの座標を頂点とする四角形で示さ
れる映像フレーム１０内の物体ｉの位置を示している。
図５は、本発明の一実施例の位置情報蓄積部の蓄積例を
示す図である。同図に示す位置情報蓄積部１４０の蓄積
例は、画像蓄積部１１０に蓄積されている動画像情報と
対応しており、当該動画像情報内に存在する個々の物体
について、時刻、物体番号、物体番号に対応する物体の
代表点の位置情報としてｘ座標とｙ座標とを蓄積する例
である。時刻ｔ₁における動画像情報内には、座標
（３，７）、（１２，７）、（６，２）の位置に代表点
を持つ物体番号が２１、２２、２３の３つの物体が表示
されていることを示している。The position information of the object i will be described below with reference to the drawings. FIG. 4 is a diagram illustrating the position of the object i at time t according to the embodiment of the present invention. The thick solid line shown in the figure shows the locus of movement of the object i in the video frame from time 0 to time t, and the thick broken line shows the time t of the object i.
The locus of movement after that is shown. The connection point (A) between the thick solid line and the thick broken line indicated by A in the figure is the coordinates (X, 0, t), (X, Y, t), (0, Y, t) at time t,
The position of the object i in the video frame 10 indicated by a quadrangle having four coordinates (0, 0, t) as vertices is shown.
FIG. 5 is a diagram showing a storage example of the position information storage unit of the embodiment of the present invention. The accumulation example of the position information accumulation unit 140 shown in the figure corresponds to the moving image information accumulated in the image accumulation unit 110, and the time, object number, and This is an example of accumulating x-coordinates and y-coordinates as position information of the representative point of the object corresponding to the object number. In the moving image information at time t ₁ , three objects with object numbers 21, 22, and 23 having representative points at the coordinates (3, 7), (12, 7), and (6, 2) are displayed. It has been shown that.

【００３７】尤度計算部１６０は、座標指示部１３０に
より指示された座標と、物体ｉの位置情報とから、尤度
を計算する尤度計算器Ｌであり、The likelihood calculator 160 is a likelihood calculator L that calculates a likelihood from the coordinates designated by the coordinate designating unit 130 and the position information of the object i.

【００３８】[0038]

【数３】 [Equation 3]

【００３９】と表記する。尤度計算器Ｌの計算式につい
ては、後述の具体例で示す。Notated as The calculation formula of the likelihood calculator L will be shown in a specific example described later.

【００４０】検索領域設定部１８０は、座標指示部１３
０により指示された座標に基づき物体を検索するための
検索領域を動画像情報内に設定する検索領域設定器Ｒで
あり、Ｒ（ｐｘ，ｐｙ，ｐｔ）と表記する。検索領域設定器Ｒの右辺については、後述
の具体例で示す。The search area setting unit 180 includes a coordinate designating unit 13
A search area setting unit R that sets a search area for searching an object based on the coordinates designated by 0 in the moving image information, and is expressed as R (px, py, pt). The right side of the search area setting unit R will be shown in a specific example described later.

【００４１】制御部２００については、以下に図を用い
て詳しく説明する。図６は、本発明の一実施例の制御部
の構成図を示す。同図に示す制御部２００は、検索領域
内検索部１９０、同定部１７０から構成される。検索領
域内検索部１９０は、画像蓄積部１１０から動画像情報
を読み出して画像表示部１２０に表示する。また、表示
された動画像情報を見ながら座標指示部１３０により動
画像情報内の座標が指示され、指示座標が入力される
と、指示座標を検索領域設定部１８０へ出力して、当該
指示座標に対応する動画像情報内の物体を検索するため
の検索領域を設定する。検索領域が設定されると、位置
情報蓄積部１４０の中の検索領域内の位置情報を有する
物体を検索して同定部１７０へ出力する。The control unit 200 will be described in detail below with reference to the drawings. FIG. 6 is a block diagram of the control unit according to the embodiment of the present invention. The control unit 200 shown in the figure includes an in-search area search unit 190 and an identification unit 170. The search area search unit 190 reads the moving image information from the image storage unit 110 and displays it on the image display unit 120. Further, when the coordinates in the moving image information are instructed by the coordinate instructing unit 130 while watching the displayed moving image information and the instructed coordinates are input, the instructed coordinates are output to the search area setting unit 180, and the instructed coordinates A search area for searching for an object in the moving image information corresponding to is set. When the search area is set, the object having the position information in the search area in the position information storage unit 140 is searched and output to the identification unit 170.

【００４２】同定部１７０は、指示座標と検索領域内検
索部１９０の出力する各物体の位置情報との組を尤度計
算部１６０へ出力して尤度を計算し、計算した結果の尤
度が最も大きい値の物体を、指示座標に対応する動画像
情報中の物体として同定し、同定結果を出力する。The identification unit 170 outputs the set of the designated coordinates and the position information of each object output by the search area search unit 190 to the likelihood calculation unit 160 to calculate the likelihood, and the likelihood of the calculated result. The object with the largest value is identified as the object in the moving image information corresponding to the designated coordinates, and the identification result is output.

【００４３】以下に、上記動画像内の物体の同定装置１
００の動作を図を用いて説明する。図７は、本発明の一
実施例の動作を示すフローチャートを示す。Below, an apparatus 1 for identifying an object in the moving image is described.
The operation of 00 will be described with reference to the drawings. FIG. 7 is a flow chart showing the operation of the embodiment of the present invention.

【００４４】（ステップ２０）制御部２００の検索領
域内検索部１９０は、動画像蓄積部１１０から動画像情
報を読み出し画像表部１２０へ表示する。(Step 20) The in-search area search unit 190 of the control unit 200 reads out moving image information from the moving image storage unit 110 and displays it on the image table unit 120.

【００４５】（ステップ２１）検索領域内検索部１９
０は、座標指示装置１３０により利用者が動画像情報を
見ながら当該動画像情報内の座標を指示したか否かをチ
ェックし、指示されていなければ上記ステップ２０へ移
行する。(Step 21) Retrieval area retrieval unit 19
In 0, it is checked whether or not the user has instructed the coordinates in the moving image information while watching the moving image information by the coordinate instruction device 130, and if not instructed, the process proceeds to step 20.

【００４６】（ステップ２２）検索領域内検索部１９
０は、上記ステップ２１で、利用者が動画像情報内の座
標を指示した場合に、指示した座標（ｐｘ，ｐｙ，ｐ
ｔ）を検索領域設定部１８０へ出力し、検索領域設定器
Ｒで検索領域を設定する。(Step 22) Retrieval area retrieval unit 19
0 is the coordinate (px, py, p) instructed when the user instructed the coordinate in the moving image information in step 21.
t) is output to the search area setting unit 180, and the search area setting unit R sets the search area.

【００４７】（ステップ２３）検索領域内検索部１９
０は、検索領域設定部１８０によって設定された検索領
域に基づいて、位置情報蓄積部１６０の中から当該検索
領域内の位置情報を有する物体を検索し、検索結果とし
て検索された物体の位置情報を同定部１７０へ出力す
る。(Step 23) Search Area Search Unit 19
0 is a search area set by the search area setting unit 180, based on the search area, the position information storage unit 160 is searched for an object having position information in the search area, and the position information of the object searched as the search result. Is output to the identification unit 170.

【００４８】（ステップ２４）制御部２００の同定部
１７０は、検索領域内検索部１９０の出力する物体の位
置情報に基づいて、利用者の指示した座標（ｐｘ，ｐ
ｙ，ｐｔ）と各物体の位置情報とを尤度計算部１６０へ
出力し、尤度計算器Ｌで尤度を計算する。(Step 24) The identification unit 170 of the control unit 200, based on the position information of the object output by the in-search area search unit 190, coordinates (px, p) instructed by the user.
y, pt) and the position information of each object are output to the likelihood calculator 160, and the likelihood calculator L calculates the likelihood.

【００４９】（ステップ２５）同定部１７０は、上記
ステップ２３で検索領域内検索部１９０の出力した全て
の物体の位置情報に基づいて、ステップ２４の尤度の計
算を行なったか否かをチェックし、尤度の計算を行なっ
ていない物体がある場合にはステップ２４へ移行する。(Step 25) The identification unit 170 checks whether or not the likelihood calculation of Step 24 has been performed based on the position information of all the objects output from the search area search unit 190 in Step 23. If there is an object for which likelihood calculation is not performed, the process proceeds to step 24.

【００５０】（ステップ２６）同定部１７０は、上記
ステップ２５で、全ての物体について尤度の計算が完了
した場合に、計算した物体の尤度の中から最も値の大き
な尤度の物体を選んで、選んだ物体を利用者の指示した
座標に対応する物体であると同定して、同定結果を出力
する。(Step 26) When the likelihood calculation is completed for all the objects in step 25, the identifying section 170 selects the object having the largest likelihood from the calculated likelihoods of the objects. Then, the selected object is identified as the object corresponding to the coordinates designated by the user, and the identification result is output.

【００５１】以下に、上記本発明の一実施例における、
動画像内の物体の同定装置１００の具体的な例を説明す
る。以下の説明に際しては、領域設定部１８０の領域設
定器Ｒ及び尤度計算部１６０の尤度計算器Ｌを以下のも
のとする。In the following, one embodiment of the present invention will be described.
A specific example of the apparatus 100 for identifying an object in a moving image will be described. In the following description, the area setter R of the area setting unit 180 and the likelihood calculator L of the likelihood calculation unit 160 will be described below.

【００５２】Ｒ（ｐｘ，ｐｙ，ｐｔ）＝｛（ｒｘ，ｒｙ，ｒｔ）｜０
≦ｒｘ≦Ｘ，０≦ｒｙ≦Ｙ，ｒｔ＝ｐｔ｝R (px, py, pt) = {(rx, ry, rt) | 0
≦ rx ≦ X, 0 ≦ ry ≦ Y, rt = pt}

【００５３】[0053]

【数４】 [Equation 4]

【００５４】また、本動画像内の物体の同定装置１００
の利用者は動画像情報を見ながら、図８に示す動画像情
報が表示された時刻ｔ₁に、座標指示装置１３０で図中
の太い矢印で示す座標指示カーソル１１の位置を指示す
るものとし、指示された座標を（４、６、ｔ₁）として
説明する。図８のｒ１、ｒ２、ｒ３は、指示された座標
と各物体２１、２２、２３との距離を示しており、距離
に−１を乗じた値、−ｒ１、−ｒ２、−ｒ３が尤度とな
る。Further, the apparatus 100 for identifying an object in the main moving image.
While observing the moving image information, at time t ₁ when the moving image information shown in FIG. 8 is displayed, it is assumed that the coordinate indicating device 130 indicates the position of the coordinate indicating cursor 11 indicated by the thick arrow in the figure. , The designated coordinates will be described as (4, 6, t ₁ ). In FIG. 8, r1, r2, and r3 indicate the distance between the instructed coordinate and each of the objects 21, 22, and 23. The value obtained by multiplying the distance by -1, and -r1, -r2, and -r3 are likelihoods. Becomes

【００５５】また、位置情報蓄積部１４０には、図５に
示した位置情報が蓄積されているものとする。Further, it is assumed that the position information storage section 140 stores the position information shown in FIG.

【００５６】制御部２００の検索領域内検索部１９０
が、動画像蓄積部１１０から動画像情報を読み出し画像
表部１２０へ表示する（ステップ２０）。Search area search section 190 of control section 200
However, the moving image information is read from the moving image storage unit 110 and displayed on the image table unit 120 (step 20).

【００５７】検索領域内検索部１９０が、座標指示装置
１３０により利用者が動画像情報を見ながら当該動画像
情報内の座標を指示したか否かをチェックすると、利用
者により座標、（ｐｘ，ｐｙ，ｐｔ）＝（４，６，ｔ₁）が指示されたので（ステップ２１、ＹＥＳ）、検索領域
内検索部１９０は、利用者が指示した座標、（ｐｘ，ｐｙ，ｐｔ）＝（４，６，ｔ₁）を検索領域設定部１８０へ出力し、検索領域設定器Ｒ
で、｛（ｒｘ，ｒｙ，ｒｔ）｜０≦ｒｘ≦Ｘ，０≦ｒｙ≦
Ｙ，ｒｔ＝ｔ₁｝の検索領域を設定する（ステップ２２）。When the in-search area search unit 190 checks whether or not the user has instructed the coordinates in the moving image information while looking at the moving image information by the coordinate instructing device 130, the coordinates of the user are (px, Since py, pt) = (4,6, t ₁ ) has been instructed (step 21, YES), the search area retrieval unit 190 causes the user to instruct the coordinates, (px, py, pt) = (4 , 6, t ₁ ) to the search area setting unit 180, and the search area setting unit R
Where {(rx, ry, rt) | 0 ≦ rx ≦ X, 0 ≦ ry ≦
A search area of Y, rt = t ₁ } is set (step 22).

【００５８】検索領域内検索部１９０は、検索領域設定
部１８０によって設定された検索領域、｛（ｒｘ，ｒｙ，ｒｔ）｜０≦ｒｘ≦Ｘ，０≦ｒｙ≦
Ｙ，ｒｔ＝ｔ₁｝に基づいて、位置情報蓄積部１６０の中から当該検索領
域内の位置情報を有する物体を検索し、検索結果として
物体２１、２２、２３の３つの物体の位置情報、The in-search area search unit 190 uses the search area set by the search area setting unit 180 as follows: {(rx, ry, rt) | 0 ≦ rx ≦ X, 0 ≦ ry ≦
Y, rt = t ₁ } based on Y, rt = t ₁ }, an object having the position information in the search area is searched from the position information storage unit 160, and the position information of the three objects 21, 22, 23 is searched as the search result,

【００５９】[0059]

【数５】 [Equation 5]

【００６０】を取得し、取得した位置情報を同定部１７
０へ出力する（ステップ２３）。The position information thus obtained is identified by the identification unit 17
It is output to 0 (step 23).

【００６１】制御部２００の同定部１７０は、検索領域
内検索部１９０の出力する物体の位置情報に基づいて、
利用者の指示した座標、（ｐｘ，ｐｙ，ｐｔ）＝（４，６，ｔ₁）と各物体２１、２２、２３の位置情報とを尤度計算部１
６０へ出力し、尤度計算器Ｌで各物体２１、２２、２３
の尤度として下記の３つの計算結果を得る（ステップ２
４）。The identification unit 170 of the control unit 200, based on the position information of the object output from the search area search unit 190,
The likelihood calculation unit 1 calculates the coordinates (px, py, pt) = (4, 6, t ₁ ) specified by the user and the position information of each object 21, 22, 23.
60, and each of the objects 21, 22, 23 is calculated by the likelihood calculator L.
The following three calculation results are obtained as the likelihood of (step 2
4).

【００６２】[0062]

【数６】 [Equation 6]

【００６３】同定部１７０は、上記ステップ２３で検索
領域内検索部１９０の出力した全ての物体の位置情報に
基づいて、ステップ２４の尤度の計算を完了したので
（ステップ２５、ＹＥＳ）、計算した物体の尤度の中か
ら最も値の大きな尤度の物体として物体２１を選択し、
物体２１を利用者の指示した座標に対応する物体として
と同定する（ステップ２６）。Since the identifying section 170 has completed the calculation of the likelihood in step 24 based on the position information of all the objects output by the in-search area searching section 190 in step 23 (step 25, YES), The object 21 is selected as the object having the largest likelihood from the likelihoods of the selected objects,
The object 21 is identified as an object corresponding to the coordinates designated by the user (step 26).

【００６４】以下に、上記本発明の一実施例の動画像内
の物体の同定装置１００において、領域設定部１８０の
領域設定器Ｒ及び、尤度計算部１６０の尤度計算器Ｌを
以下のものとする、他の実施例についての具体的な例を
説明する。領域設定部１８０の領域設定器Ｒを、Ｒ（ｐｘ，ｐｙ，ｐｔ）＝｛（ｒｘ，ｒｙ，ｒｔ）｜０
≦ｒｘ≦Ｘ，０≦ｒｙ≦Ｙ，ｒｔ＝ｐｔ−Δｔ｝とし、尤度計算部１６０の尤度計算器Ｌを、In the apparatus 100 for identifying an object in a moving image according to one embodiment of the present invention, the area setting unit R of the area setting unit 180 and the likelihood calculator L of the likelihood calculating unit 160 will be described below. A specific example of another embodiment will be described. The area setting unit R of the area setting unit 180 is set to R (px, py, pt) = {(rx, ry, rt) | 0
≦ rx ≦ X, 0 ≦ ry ≦ Y, rt = pt−Δt}, and the likelihood calculator L of the likelihood calculator 160 is set to

【００６５】[0065]

【数７】 [Equation 7]

【００６６】とする。It is assumed that

【００６７】また、本動画像内の物体の同定装置１００
の利用者は動画像情報を見ながら、図９（Ａ）に示す外
側の四角形で示す映像フレーム１０₁が表示された時刻
ｔ₂に、座標指示装置１３０で図９（Ａ）中の破線の矢
印で示す座標指示カーソル１２の位置（８，５，ｔ₂）
によって四角形の物体２４を指示しようとしているもの
とする。しかし、同図に示す物体２３、２４は移動速度
が速く、利用者が図９（Ａ）の映像フレーム１０₁内の
物体２４を確認してから実際に座標指示部１３０を操作
し終わるまでの遅延時間（Δｔ）のうちに右方向へ移動
してしまい、実際に利用者が指示した結果は、図９
（Ｂ）に示す時刻ｔ₃（ｔ₃＝ｔ₂＋Δｔ）における外
側の四角形で示す映像フレーム１０₃内の矢印で示す座
標指示カーソル１３の位置となり、指示された座標は
（８，５，ｔ₂）ではなく、（８，５，ｔ₃）になるも
のとして説明する。Further, the apparatus 100 for identifying an object in the main moving image.
9A while watching the moving image information, at time t ₂ when the image frame 10 ₁ shown by the outer quadrangle shown in FIG. 9A is displayed, the coordinate pointing device 130 causes the broken line in FIG. Position of coordinate indication cursor 12 indicated by arrow (8, 5, t ₂ )
It is assumed that a rectangular object 24 is to be pointed by. However, the objects 23 and 24 shown in the same figure have a high moving speed, and the time from when the user confirms the object 24 in the video frame 10 ₁ of FIG. As a result of the user actually instructing, the result of moving to the right within the delay time (Δt) is shown in FIG.
At time t ₃ (t ₃ = t ₂ + Δt) shown in (B), the position is the position of the coordinate designating cursor 13 indicated by the arrow in the video frame 10 ₃ indicated by the outer quadrangle, and the designated coordinates are (8, 5, t). It will be described as (8, 5, t ₃ ) instead of ₂ ).

【００６８】また、位置情報蓄積部１４０には、図５に
示した位置情報が蓄積されているものとする。Further, it is assumed that the position information storage section 140 stores the position information shown in FIG.

【００６９】制御部２００の検索領域内検索部１９０
が、動画像蓄積部１１０から動画像情報を読み出し画像
表部１２０へ表示する（ステップ２０）。Search area search section 190 of control section 200
However, the moving image information is read from the moving image storage unit 110 and displayed on the image table unit 120 (step 20).

【００７０】検索領域内検索部１９０が、座標指示装置
１３０により利用者が動画像情報を見ながら当該動画像
情報内の座標を指示したか否かをチェックすると、利用
者により座標、（ｐｘ，ｐｙ，ｐｔ）＝（８，５，ｔ₃）が指示されたので（ステップ２１、ＹＥＳ）、検索領域
内検索部１９０は、利用者が指示した座標、（ｐｘ，ｐｙ，ｐｔ）＝（８，５，ｔ₃）を検索領域設定部１８０へ出力し、検索領域設定器Ｒ
で、｛（ｒｘ，ｒｙ，ｒｔ）｜０≦ｒｘ≦Ｘ，０≦ｒｙ≦
Ｙ，ｒｔ＝ｔ₂｝の検索領域を設定する（ステップ２２）。When the in-search area search unit 190 checks whether or not the user has instructed the coordinates in the moving image information while looking at the moving image information by the coordinate instructing device 130, the coordinates of the user are (px, Since py, pt) = (8,5, t ₃ ) is instructed (step 21, YES), the search area search unit 190 causes the search area search unit 190 to specify the coordinates (px, py, pt) = (8 , 5, t ₃ ) to the search area setting unit 180, and the search area setting unit R
Where {(rx, ry, rt) | 0 ≦ rx ≦ X, 0 ≦ ry ≦
A search area of Y, rt = t ₂ } is set (step 22).

【００７１】検索領域内検索部１９０は、検索領域設定
部１８０によって設定された検索領域、｛（ｒｘ，ｒｙ，ｒｔ）｜０≦ｒｘ≦Ｘ，０≦ｒｙ≦
Ｙ，ｒｔ＝ｔ₂｝に基づいて、位置情報蓄積部１６０の中から当該検索領
域内の位置情報を有する物体を検索し、検索結果として
物体２３、２４の２つの物体の位置情報、The search area in-search section 190 has a search area set by the search area setting section 180: {(rx, ry, rt) | 0 ≦ rx ≦ X, 0 ≦ ry ≦
Y, rt = t ₂ } based on Y, rt = t ₂ }, the object having the position information in the search area is searched from the position information storage unit 160, and the position information of the two objects 23 and 24 is obtained as the search result.

【００７２】[0072]

【数８】 [Equation 8]

【００７３】を取得し、取得した位置情報を同定部１７
０へ出力する（ステップ２３）。The position information thus obtained is identified by the identification unit 17
It is output to 0 (step 23).

【００７４】制御部２００の同定部１７０は、検索領域
内検索部１９０の出力する物体の位置情報に基づいて、
利用者の指示した座標、（ｐｘ，ｐｙ，ｐｔ）＝（８，５，ｔ₃）と各物体２３、２４の位置情報とを尤度計算部１６０へ
出力し、尤度計算器Ｌで各物体２３、２４の尤度として
下記の２つの計算結果を得る（ステップ２４）。The identification unit 170 of the control unit 200, based on the position information of the object output by the in-search area search unit 190,
The coordinates instructed by the user, (px, py, pt) = (8, 5, t ₃ ) and the position information of each of the objects 23 and 24 are output to the likelihood calculation unit 160, and each of them is calculated by the likelihood calculator L. The following two calculation results are obtained as the likelihoods of the objects 23 and 24 (step 24).

【００７５】[0075]

【数９】 [Equation 9]

【００７６】同定部１７０は、上記ステップ２３で検索
領域内検索部１９０の出力した全ての物体の位置情報に
基づいて、ステップ２４の尤度の計算を完了したので
（ステップ２５、ＹＥＳ）、計算した物体の尤度の中か
ら最も値の大きな尤度の物体として物体２４を選択し、
物体２４を利用者の指示した座標に対応する物体として
と同定する（ステップ２６）。Since the identifying section 170 has completed the calculation of the likelihood of step 24 based on the position information of all the objects output by the in-search area search section 190 in step 23 (step 25, YES), The object 24 is selected as the object having the largest likelihood from the likelihoods of the selected objects,
The object 24 is identified as an object corresponding to the coordinates designated by the user (step 26).

【００７７】上記他の実施例のように動画像情報内の物
体の移動速度が速い場合、利用者が物体２４を指示しよ
うとして座標指示部１３０を操作した結果の座標は、利
用者の意図した座標（８，５，ｔ₂）ではなく、Δｔ遅
い、座標（８，５，ｔ₃）となってしまうことがある。
しかし、上記他の実施例によれば、利用者が物体の移動
速度に追いつかずに、物体を指示しようとしてから実際
に指示するまでに遅延時間（Δｔ）がある場合であって
も、入力される指示座標（８，５，ｔ₃）ではなく遅延
時間を考慮した（８，５，ｔ₂）に基づく物体の同定が
可能となり、利用者の意図した物体を正しく同定するこ
とが可能となる。遅延時間Δｔは、利用者、動画像情報
内の物体の移動速度や、動画像情報等の再生速度に応じ
て設定することが可能であり、それぞれのケースで最適
な値を使用することにより物体を正しく同定することが
可能となる。When the moving speed of the object in the moving image information is high as in the other embodiments, the coordinates obtained as a result of the user operating the coordinate instructing section 130 to instruct the object 24 have the coordinates intended by the user. coordinates (8,5, t ₂₎ rather than, Delta] t slow, sometimes becomes the coordinates (8,5, t _3).
However, according to the other embodiment described above, even when the user does not catch up with the moving speed of the object and there is a delay time (Δt) from when the user tries to instruct the object until when the user actually instructs the object, the input is performed. It becomes possible to identify the object based on (8,5, t ₂ ) considering the delay time instead of the designated coordinate (8,5, t ₃ ), and it is possible to correctly identify the object intended by the user. . The delay time Δt can be set according to the user, the moving speed of the object in the moving image information, the reproduction speed of the moving image information, and the like, and by using an optimum value in each case, the object Can be correctly identified.

【００７８】また、上記他の実施例において動画像情報
の再生速度（表示速度）を変えた場合には、Δｔに再生
速度の比を乗じることで物体を正しく同定することが可
能になる。Further, when the reproduction speed (display speed) of the moving image information is changed in the other embodiment, the object can be correctly identified by multiplying Δt by the reproduction speed ratio.

【００７９】また、上記実施例によれば、利用者の座標
の指示動作に遅れがある場合であっても尤度計算器Ｌを
変えることによって、利用者の指示する座標が動画像情
報中の物体付近であれば正しく物体を同定することが可
能となる。Further, according to the above-described embodiment, even if the user's coordinate pointing operation is delayed, the user can change the likelihood calculator L so that the user's designated coordinates are included in the moving image information. It becomes possible to correctly identify the object in the vicinity of the object.

【００８０】なお、上記実施例では位置情報蓄積部１４
０に蓄積する個々の物体の位置情報として物体の１つの
代表点を蓄積しているが、１つの物体に対して複数の位
置情報を設定することにより、複数の物体が重なり合っ
ている場合であっても正しく同定することが可能とな
る。In the above embodiment, the position information storage unit 14
One representative point of an object is stored as the position information of each object to be stored in 0. However, by setting a plurality of position information for one object, a plurality of objects may overlap each other. However, it is possible to identify correctly.

【００８１】また、上記実施例の検索領域設定部１８０
の検索領域設定器Ｒでは、ある時刻における動画像情報
を検索領域として設定する例を示したが、時間軸方向に
幅を持つ検索領域を設定することにより動画像情報内を
特定の方向に移動する物体を選択的に同定することも可
能となる。Further, the search area setting unit 180 of the above embodiment
In the search area setting device R, the example in which the moving image information at a certain time is set as the search area is shown. However, by setting the search area having a width in the time axis direction, the moving image information is moved in a specific direction. It is also possible to selectively identify an object that does.

【００８２】また、上記実施例の尤度検査部１６０の尤
度計算器Ｌでは、指示された座標と物体との尤度をｘ
軸、ｙ軸からなる二次元平面上での距離に基づき計算す
る例を示したが、ｘ軸、ｙ軸に更にｔ軸をも含めた三次
元空間内での距離に基づき計算することにより、時間軸
上に幅を有する検索領域が設定された場合であっても尤
度の計算に基づき物体を同定することが可能となる。In the likelihood calculator L of the likelihood checking section 160 of the above embodiment, the likelihood between the designated coordinates and the object is x.
Although an example of calculating based on the distance on the two-dimensional plane composed of the axes and the y-axis has been shown, by calculating based on the distance in the three-dimensional space including the t-axis in addition to the x-axis and the y-axis, Even when a search area having a width on the time axis is set, it is possible to identify the object based on the calculation of the likelihood.

【００８３】また、上記実施例では、座標指示部１３０
としてマウスを使用した例を示したが、画像表示部１２
０に表示された動画像情報内の任意の座標を指示するこ
とが可能であれば、アイカメラのような利用者の視線を
基に座標を指示するものであってもよく、座標指示部１
３０をマウスに限定するものではない。Further, in the above embodiment, the coordinate designating section 130 is used.
As an example, a mouse is used as the image display unit 12.
If it is possible to instruct arbitrary coordinates in the moving image information displayed at 0, the coordinates may be instructed based on the line of sight of the user such as an eye camera.
30 is not limited to mice.

【００８４】また、上記実施例では、各物体の位置情報
として当該物体の中心点を使用する例を示したが、当該
物体を代表する点として予め決めた点であれば中心点で
なくてもよく、位置情報を中心点に限定するものではな
い。In the above embodiment, the center point of the object is used as the position information of each object. However, if the point is a predetermined point representative of the object, the center point may not be the center point. Of course, the position information is not limited to the center point.

【００８５】また、上記実施例では、制御部２００の検
索領域内検索部１９０に画像情報蓄積部１１０から動画
像情報を読み出し画像表示部１３０へ表示する機能を持
たせたが、制御部内に画像情報蓄積部１１０から動画像
情報を読み出し画像表示部１３０へ表示する動画像情報
再生部等を設ける事も可能であり、動画像情報の表示を
検索領域内検索部１９０が行なうことに限定するもので
はない。Further, in the above embodiment, the in-search area search unit 190 of the control unit 200 is provided with the function of reading moving image information from the image information storage unit 110 and displaying it on the image display unit 130. It is also possible to provide a moving image information reproducing unit or the like for reading moving image information from the information storage unit 110 and displaying it on the image display unit 130, and it is limited to displaying the moving image information by the search area search unit 190. is not.

【００８６】更に、本実施例では領域設定部１８０と尤
度計算部１６０に、特定の領域設定器Ｒ及び尤度計算器
Ｌを設定して物体の同定を行なう例を示したが、領域設
定部１８０と尤度計算部１６０に、それぞれ複数の領域
設定器Ｒ及び尤度計算器Ｌを設定して適宜選択して物体
の同定を行なうことも可能であり、領域設定部１８０と
尤度計算部１６０に設定する領域設定器Ｒ及び尤度計算
器Ｌを１つに限定するものではない。Further, in the present embodiment, an example has been shown in which a specific area setter R and likelihood calculator L are set in the area setting section 180 and the likelihood calculating section 160 to identify an object. It is also possible to set a plurality of region setting units R and likelihood calculating units L in the unit 180 and the likelihood calculating unit 160, respectively, and select them appropriately to identify an object. The area setter R and the likelihood calculator L set in the unit 160 are not limited to one.

【００８７】[0087]

【発明の効果】上述のように、本発明によれば、利用者
の指示した座標に基づき物体を検索し、検索された物体
の中から利用者の指示した座標に対応する物体を同定す
るため、予め動画像情報内の個々の物体毎に当該物体を
同定することができる領域を設定して蓄積しておかなく
ても利用者が物体の外側を指示した場合や動画像情報中
の物体の数が多い場合等であっても、利用者の指示する
座標に対応する物体を正しく同定することができる。As described above, according to the present invention, the object is searched based on the coordinates designated by the user, and the object corresponding to the coordinates designated by the user is identified from the searched objects. , If the user indicates the outside of the object or the object in the moving image information without setting and accumulating the area in which the object can be identified for each individual object in the moving image information in advance. Even if the number is large, the object corresponding to the coordinates designated by the user can be correctly identified.

【００８８】また、予め動画像情報内の各物体毎の当該
動画像情報内における位置情報を蓄積しているため、指
示された座標に基づく物体を高速かつ、確実に検索する
ことができ、利用者の指示する座標に対応する物体を精
度よく同定することが可能となる。Further, since the position information in the moving image information for each object in the moving image information is stored in advance, the object based on the designated coordinates can be searched at high speed and reliably. It is possible to accurately identify the object corresponding to the coordinates designated by the person.

【００８９】また、利用者の指示した座標に基づいて、
動画像情報中の物体を検索する検索領域を設定して検索
することにより、物体の検索量、検索時間を短縮するこ
とが可能となる。Further, based on the coordinates designated by the user,
By setting a search area for searching an object in the moving image information and performing a search, it is possible to reduce the search amount and the search time of the object.

【００９０】また、利用者が表示された動画像情報内の
物体を確認してから実際に指示するまでの遅延時間を考
慮し、物体の検索を行なう検索領域を遅延時間分だけ過
去の動画像情報に設定することにより、動きの速い物体
や、再生速度の速い動画像情報等であっても正しく物体
を同定することが可能となる。Also, in consideration of the delay time from when the user confirms the object in the displayed moving image information until when the user actually gives an instruction, the search area for searching the object is moved past the moving image by the delay time. By setting the information, it is possible to correctly identify the object even if it is a fast-moving object or moving image information having a high reproduction speed.

【００９１】また、指示された座標と検索された各物体
の位置情報とから距離を計算し、計算した距離に基づき
尤度を決定することにより、検索された複数の物体の中
から利用者の指示した物体として同定するに尤もな物体
を、尤度に基づいて同定することが可能となる。Further, the distance is calculated from the instructed coordinates and the position information of each searched object, and the likelihood is determined based on the calculated distance. An object most likely to be identified as the instructed object can be identified based on the likelihood.

【００９２】更に上記効果によって、予め動画像情報内
の物体を同定するための領域を、物体の大きさ、物体の
数、動画像情報の再生速度、利用者の反応速度等を考慮
して作成するといった多くの工程を省略することができ
るため、作成に要する時間、費用等を抑えることが可能
となる。Further, with the above effect, an area for identifying an object in the moving image information is created in advance in consideration of the size of the object, the number of objects, the reproduction speed of the moving image information, the reaction speed of the user, and the like. Since it is possible to omit many steps such as the above, it is possible to reduce the time and cost required for the production.

【００９３】また、動画像内の物体の同定装置を利用す
る人にとっては、動きの速い物体や小さな物体であって
も、物体を注意深く指示することなく目的とする物体を
容易に指示することが可能となる。Further, for a person who uses the apparatus for identifying an object in a moving image, even if the object is a fast-moving object or a small object, it is possible to easily specify the target object without carefully instructing the object. It will be possible.

[Brief description of drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】本発明の原理構成図である。FIG. 2 is a principle configuration diagram of the present invention.

【図３】本発明の一実施例の構成図である。FIG. 3 is a configuration diagram of an embodiment of the present invention.

【図４】本発明の一実施例の物体ｉの時刻ｔにおける位
置を説明する図である。FIG. 4 is a diagram illustrating a position of an object i at time t according to an embodiment of the present invention.

【図５】本発明の一実施例の位置情報蓄積部の蓄積例を
示す図である。FIG. 5 is a diagram showing a storage example of a position information storage unit according to an embodiment of the present invention.

【図６】本発明の一実施例の制御部の構成図である。FIG. 6 is a configuration diagram of a control unit according to an embodiment of the present invention.

【図７】本発明の一実施例の動作を示すフローチャート
である。FIG. 7 is a flowchart showing the operation of one embodiment of the present invention.

【図８】本発明の一実施例の物体の同定を説明する図
（１）である。FIG. 8 is a diagram (1) illustrating identification of an object according to an embodiment of the present invention.

【図９】本発明の一実施例の物体の同定を説明する図
（２）である。FIG. 9 is a diagram (2) explaining identification of an object according to an embodiment of the present invention.

【図１０】従来の方法を説明する図である。FIG. 10 is a diagram illustrating a conventional method.

[Explanation of symbols]

１０映像フレーム１１座標指示カーソル１２座標指示カーソル１３座標指示カーソル２０物体２１物体２２物体２３物体２４物体３０領域１００動画像内の物体の同定装置１１０画像蓄積手段、画像蓄積部１２０画像表示手段、画像表示部１３０座標指示手段、座標指示部１４０位置情報蓄積手段、位置情報蓄積部１５０検索手段１６０尤度計算手段、尤度計算部１７０同定手段、同定部１８０検索領域設定部１９０検索領域内検索部２００制御部 10 video frames 11 Coordinate pointing cursor 12 Coordinate pointing cursor 13 Coordinate pointing cursor 20 objects 21 objects 22 objects 23 objects 24 objects 30 areas 100 Object identification device in moving image 110 image storage means, image storage section 120 image display means, image display section 130 Coordinate instruction means, coordinate instruction section 140 Position Information Storage Means, Position Information Storage Unit 150 Search method 160 Likelihood calculator, likelihood calculator 170 Identification means, identification section 180 Search area setting section 190 Search area search section 200 Control unit

フロントページの続き (72)発明者片岡良治東京都千代田区内幸町１丁目１番６号日本電信電話株式会社内 (56)参考文献特開平５−189153（ＪＰ，Ａ) 特開平４−352076（ＪＰ，Ａ) 特開平３−52070（ＪＰ，Ａ) 片岡良治他２名，視覚誘導情報獲得モデルに基づくマルチメディア情報システム−ＶｉｄｅｏＲｅａｌｉｔｙ−，情報処理学会研究報告（94−ＤＢＳ− 97），日本，情報処理学会，1994年３月 18日，第94巻，第30号，ＰＰ．21−30 紺谷精一他３名，マルチメディア情報システムにおける動画像オブジェクトの管理モデル，情報処理学会第49回（平成６年後期）全国大会講演論文集（３），日本，情報処理学会，1994年９月30日，331−332 原義憲他７名，ハイパーメディアプラットホーム”雅（みやび）”の概要, 情報処理学会研究報告（92−ＤＢＳ− 90），日本，情報処理学会，1992年９月 11日，第92巻，第71号，ＰＰ．29−38 佐藤哲司他２名，ビデオリアリティ：映像を用いた情報検索手法の高度化，情報処理学会研究報告（94−ＤＢＳ −99），日本，情報処理学会，1994年７月22日，第94巻，62号，ｐｐ．281−286 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 17/30 G06T 1/00 G06T 7/00 ＪＩＣＳＴファイル（ＪＯＩＳ)Front page continuation (72) Inventor Ryoji Kataoka 1-6 Uchisaiwaicho, Chiyoda-ku, Tokyo Inside Nippon Telegraph and Telephone Corporation (56) Reference JP-A-5-189153 (JP, A) JP-A-4-352076 (JP, A) JP-A-3-52070 (JP, A) Ryoji Kataoka and 2 others, Multimedia Information System based on Visual Guidance Information Acquisition Model-VideoReality-, Information Processing Society Research Report (94-DBS-97 ), Japan, IPSJ, March 18, 1994, Volume 94, No. 30, PP. 21-30 Seiichi Konya and 3 others, Management model of moving image objects in multimedia information system, Proc. Of the 49th National Conference of IPSJ (6th semester) (3), Japan, IPSJ , September 30, 1994, 331-332 Yoshinori Hara and 7 others, Outline of hypermedia platform "Miyabi", Research Report of Information Processing Society of Japan (92-DBS-90), Japan, Information Processing Society of Japan, September 11, 1992, Vol. 92, No. 71, PP. 29-38 Tetsuji Sato and 2 others, Video Reality: Advanced Information Retrieval Method Using Video, Information Processing Society of Japan Research Report (94-DBS-99), Japan, Information Processing Society of Japan, July 22, 1994, Vol. 94, No. 62, pp. 281-286 (58) Fields surveyed (Int.Cl. ⁷ , DB name) G06F 17/30 G06T 1/00 G06T 7/00 JISST file (JOIS)

Claims

(57) [Claims]

1.In the method of identifying an object in a moving image, A step of displaying the moving image information accumulated in advance, While watching the displayed moving image information, the position in the moving image information
The step of indicating the mark, Is the position information of the object in the moving image information stored in advance?
A search step to search for objects from The designated coordinates and the object retrieved by the retrieval step
The likelihood with the body is calculated, and the finger is calculated based on the calculated likelihood.
An identification step to identify the object corresponding to the indicated coordinates and
HaveThen The search step is Search in the moving image information based on the designated coordinates
Search area setting step to set the area, Position information of the object in the moving image information stored in advance
Search area that searches for objects located in the search area
With internal search step In a moving image characterized by
Method of object identification.

2. The search area has a predetermined width in the time axis direction.
The method for identifying an object in a moving image according to claim 1, which is an area to be held .

3. The search area setting step includes, based on the designated coordinates, the moving image information that is past by a delay time from when the object in the moving image information is confirmed to when the object is designated, Claim set as a search area
1. A method for identifying an object in a moving image according to 1 .

4. The identifying step calculates the likelihood based on a distance between the designated coordinates and position information of an object obtained as a result of the search, and the calculated likelihood becomes maximum. The method for identifying an object in a moving image according to claim 1, wherein the object is identified as an object corresponding to the designated coordinates.

5.In the identification device of the object in the moving image, An image storage means for storing moving image information, An image for displaying the moving image information stored in the image storage means.
Image display means, While watching the moving image information displayed on the image display means,
Coordinate indicating means for indicating the coordinates in the moving image information, Positions for accumulating position information of individual objects in the moving image information
Information storage means, Search means for searching an object from the position information storage means
When, In the set of each coordinate in the moving image information and the position information of each object
On the other hand, a likelihood calculating means that associates a real number with the likelihood, The designated coordinates and the object retrieved by the retrieval means
The pair with the position information is output to the likelihood calculation means to calculate the likelihood.
Calculated and the indicated coordinates based on the calculated likelihood
And an identification means for identifying an object corresponding toThen The search means is Search in the moving image information based on the designated coordinates
Search area setting means for setting an area, The position information is stored in the search area in the position information storage means.
A search area search means for searching an object To do
An apparatus for identifying an object in a moving image, characterized by.

6. The search area has a predetermined width in the time axis direction.
The apparatus for identifying an object in a moving image according to claim 5, wherein the area is a region having a .

7. The search area setting means is configured to perform the search in the moving image information based on a delay time from confirmation of an object in the moving image information to instruction of the specified coordinate. The apparatus for identifying an object in a moving image according to claim 5, wherein an area is set.

8. The likelihood calculating means calculates a distance from the designated coordinates and the position information of the object searched by the searching means, and the distance to the designated coordinates is calculated based on the calculated distance. The apparatus for identifying an object in a moving image according to claim 5, which determines the likelihood.