JPH11266449A

JPH11266449A - Video structural processing device and recording medium storing program for video processing

Info

Publication number: JPH11266449A
Application number: JP10067225A
Authority: JP
Inventors: Susumu Kubota; 田進窪; Osamu Hori; 修堀; Toshimitsu Kaneko; 子敏充金; Hisashi Aoki; 木恒青
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1998-03-17
Filing date: 1998-03-17
Publication date: 1999-09-28

Abstract

PROBLEM TO BE SOLVED: To apply structural processing to various video images by combining basic processing systems from basis processing groups required for applying structural processing to a video image so as to establish a series of processing systems to realize the structural processing to the video image, charging only the procedure of combination of the processing systems. SOLUTION: The video image structural processing device is provided with a video class discrimination section 7 that designates a kind of a video image, a video image structural processing procedure storage section 3 that stores a structural processing procedure of a corresponding video image for each kind of the video image, a video image structural processing 9 that calls structural procedure corresponding to the class of the video image designated by the video class discrimination section 7 from the video image structural processing procedure storage, section 3 and that executes the video image structural processing, and a video image structural processing procedure storage section 4 that stores information from the video image structural processing 9. The optimum structural processing to each class of the video image is called from the video image structural processing procedure storage section 3, and the video image structural processing 9 executes the processing to obtain the optimum structural information.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、映像構造化装置及
びそのためのプログラム記録媒体に係り、特に、衛星放
送やＣＡＴＶなどの放送において、視聴者が、大量の映
像の中から所望の映像を選択的に視聴したり、または全
体の内容を短時間に把握したりすることを可能にする、
映像の構造化を行うための装置の構成に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image structuring apparatus and a program recording medium therefor, and more particularly to a viewer selecting a desired image from a large amount of images in broadcasting such as satellite broadcasting and CATV. To watch the content in a timely manner or to grasp the entire content in a short time,
The present invention relates to a configuration of an apparatus for structuring an image.

【０００２】[0002]

【従来の技術】近年、衛星放送や、ＣＡＴＶなどによ
り、家庭に配信される放送のチャンネル数は、増加の一
途をたどっている。また、今後、マルチメディアの基盤
が整備されて来るのに伴い、今後は、様々な場所に蓄積
される大量の映像情報に容易にアクセスできるような環
境が整ってくることが期待されている。2. Description of the Related Art In recent years, the number of broadcast channels distributed to homes by satellite broadcasting, CATV, and the like has been steadily increasing. Also, with the development of the multimedia infrastructure in the future, it is expected that in the future, an environment for easily accessing a large amount of video information stored in various places will be prepared.

【０００３】さて、大量の映像情報が、視聴できるよう
な環境が整ってきた場合、これらの映像情報の内容を、
短時間に把握し、所望の映像を選択的に視聴することに
対する要求の高まりが予想される。[0003] Now, when an environment has been established in which a large amount of video information can be viewed, the content of the video information is
It is expected that demand for grasping in a short time and selectively viewing a desired video will increase.

【０００４】以上のような要求に応えるためには、映像
情報へのアクセスを簡略化する必要があるが、この場
合、映像を何らかの形で、構造化し、これをもとにし
て、映像を縮約化して提示したり、または映像の一部を
選択的に提示するような機構が必要になってくる。In order to meet the above demands, it is necessary to simplify access to video information. In this case, however, the video is structured in some way, and based on this, the video is reduced. There is a need for a mechanism for presenting in a reduced form or for selectively presenting a part of an image.

【０００５】映像を縮約化し、表示するための技術とし
ては、特開平０３−２１４３６４号公報や特開平０４−
２１９８７８号公報などに開示されるような、カット検
出手法が提案されている。カット検出は、映像の変わり
目を自動的に検出する技術である。[0005] Techniques for reducing and displaying images are disclosed in Japanese Patent Application Laid-Open Nos. 03-214364 and 04-214364.
A cut detection method as disclosed in, for example, Japanese Patent No. 219878 has been proposed. Cut detection is a technique for automatically detecting a transition between images.

【０００６】しかしながら、映像を縮約化するための、
カット検出では、映像をショットに分割することはでき
るが、ショットとショットを関連付けて、映像を構造化
することができないという問題点がある。However, in order to reduce the image,
In the cut detection, the video can be divided into shots, but there is a problem that the shots cannot be linked to each other to structure the video.

【０００７】したがって、大量の映像情報のアクセスの
ためには、映像の種類や、番組ごとに、ユーザの目的に
沿った構造化の手続きを作る必要がある。Therefore, in order to access a large amount of video information, it is necessary to create a structuring procedure according to the purpose of the user for each type of video and each program.

【０００８】しかし、映像の構造化は、映像の種類や、
ユーザの目的によって大きく異なり、あらゆる映像に一
律的に適用できる構造化のルールを構築するには、非常
な困難が予想される。そして、種類も目的も異なる、ま
た様々なユーザニーズにも適合した構造化を行うための
決定的な手法は今のところ確立されていないというのが
現実である。[0008] However, the structuring of images depends on the types of images,
It is very difficult to construct structuring rules that vary widely depending on the purpose of the user and can be applied uniformly to all images. It is the reality that no definitive method has been established so far for performing structuring of different types and purposes and adapted to various user needs.

【０００９】これまでにも、例えば、ニュース番組や、
スポーツ番組など、対象を限定した個別の事例研究はな
されているが、これらを統合しても、対象ごとにまった
く異なる処理系を必要とするので、これを更に幅広く展
開しようとすると、膨大な設備や、処理手順が必要にな
ってくる。[0009] So far, for example, news programs,
Although individual case studies have been conducted on specific subjects such as sports programs, even if these are integrated, a completely different processing system is required for each subject. Also, a processing procedure is required.

【００１０】[0010]

【発明が解決しようとする課題】以上述べたように、従
来の映像構造化装置では、異なる映像ごとに、異なる構
造化を必要とするため、膨大な設備や処理を考えると、
現実的な解決手法とは言えない。特に、それぞれの構造
化が、まったく異なる処理系で行われるというのでは、
簡単にユーザニーズに適合させるという観点から見ても
現実的でなく、現在、種々の映像の構造化を扱えるよう
な、より汎用性の高い処理系を構築することが強く望ま
れている。As described above, in the conventional video structuring device, different structuring is required for each different video.
It is not a realistic solution. In particular, if each structuring is done in a completely different processing system,
It is not realistic from the viewpoint of easily adapting to user needs, and at present, it is strongly desired to construct a more versatile processing system that can handle various types of video structuring.

【００１１】したがって、本発明は、上記のような従来
技術の問題点を解消し、映像の構造化を行う上で必要と
される基本的な処理群を備え、これらの基本的な処理を
組み合わせにより、映像の構造化を実現するような処理
系を構築し、処理の組み合わせの手続きのみを入れ替え
るだけで、種々の映像の構造化を同じ処理系で行うこと
を可能とした映像構造化装置を提供することを目的とす
る。Accordingly, the present invention solves the above-mentioned problems of the prior art, includes a basic processing group required for structuring an image, and combines these basic processing. By constructing a processing system that realizes the structuring of video, a video structuring apparatus that can perform structuring of various videos with the same processing system only by changing the procedure of the combination of processes The purpose is to provide.

【００１２】[0012]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、映像の種類を指定する指定手段と、映像
の種類ごとに対応する映像の構造化手続を蓄積する手続
蓄積手段と、前記指定手段により指定された映像の種類
に対応する構造化手続を、前記手続蓄積手段より呼びだ
し、映像構造化を実施する構造化手段と、前記構造化手
段からの情報を蓄積する構造化情報蓄積手段と、を備え
る映像構造化装置を提供するものである。In order to achieve the above object, the present invention provides a designating means for designating an image type, and a procedure accumulating means for accumulating a video structuring procedure corresponding to each image type. A structuring procedure for calling a structuring procedure corresponding to the type of video specified by the specifying means from the procedure storage means, and performing video structuring; and structured information for storing information from the structuring means. And a storage unit.

【００１３】さらに、映像の構造化を行うプログラムを
コンピュータで実行可能なように記録した記録媒体であ
って、前記プログラムは、指定された映像の種類に対応
する予め記憶された構造化手続を呼び出し、呼び出した
構造化手続きのものに映像の構造化も行い、この構造化
した情報を蓄積させる手順を含むことを特徴とする記録
媒体を提供するものである。[0013] Further, the present invention is a recording medium on which a program for structuring a video is recorded so as to be executable by a computer, wherein the program calls a pre-stored structuring procedure corresponding to a specified type of video. In addition, the present invention provides a recording medium characterized by including a procedure for structuring an image in the called structured procedure and storing the structured information.

【００１４】[0014]

【発明の実施の形態】以下、図面を参照しながら、本発
明の実施形を説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１５】図１は、本発明の実施系の映像構造化装置
のブロック図である。FIG. 1 is a block diagram of a video structuring apparatus according to an embodiment of the present invention.

【００１６】図において、各構成要素の働きは以下の通
りである。In the figure, the function of each component is as follows.

【００１７】映像データ蓄積部１は、各種の映像データ
が蓄積される。The video data storage unit 1 stores various video data.

【００１８】入力装置５と、表示装置１１は、対話的に
情報を入出力するための端末を構成している。The input device 5 and the display device 11 constitute a terminal for interactively inputting and outputting information.

【００１９】映像モデルスクリプト蓄積部２は、映像デ
ータ蓄積部１に蓄積されている映像の種別ごとに対応す
るような映像モデルスクリプトを蓄積する。The video model script storage unit 2 stores a video model script corresponding to each type of video stored in the video data storage unit 1.

【００２０】映像モデルスクリプト加工編集部６は、表
示装置１１と入力装置５を介して、対話的に映像モデル
スクリプトの加工編集を行い、これを映像モデルスクリ
プト蓄積部２に蓄積しておく。The video model script processing / editing section 6 interactively processes and edits a video model script via the display device 11 and the input device 5, and stores the processed / edited video model script in the video model script storage section 2.

【００２１】映像種別判定部７は、映像データ蓄積部１
に蓄積されている映像データの種別を判定し、映像デー
タに対応する映像構造化手続きを特定する。The video type determination unit 7 is a video data storage unit 1
The type of the video data stored in the video data is determined, and a video structuring procedure corresponding to the video data is specified.

【００２２】映像構造化手続き生成部８は、映像の種別
ごとに、映像モデルスクリプト蓄積部２に蓄積された映
像モデルスクリプトに基づいて、映像構造化手続きを生
成し、映像構造化手続き蓄積部３に蓄積しておく。The video structuring procedure generator 8 generates a video structuring procedure based on the video model script stored in the video model script storage 2 for each type of video, and stores the video structuring procedure storage 3 To be stored.

【００２３】映像構造化処理部９は、映像種別判定部７
により特定された映像構造化手続きを、映像構造化手続
き蓄積部３から呼びだし、映像を構造化し、獲得された
映像構造化情報を、映像構造化情報蓄積部４に蓄積す
る。なお、映像構造化情報は、表示装置１１に表示する
ことができる。The video structuring unit 9 includes a video type determining unit 7
The video structuring procedure specified by the above is called from the video structuring procedure storage unit 3, the video is structured, and the obtained video structuring information is stored in the video structuring information storage unit 4. The video structuring information can be displayed on the display device 11.

【００２４】なお、映像構造化情報加工編集部１０は、
映像構造化情報蓄積部４に蓄積された映像構造化情報に
基づいて、映像を表示装置１１に表示させながら、入力
装置５を介して、対話的に映像構造化情報の加工編集を
行う。The video structuring information processing / editing unit 10
Based on the video structuring information stored in the video structuring information storage unit 4, the video structuring information is interactively processed and edited via the input device 5 while displaying the video on the display device 11.

【００２５】なお、上記の、映像データ蓄積部１、映像
モデルスクリプト蓄積部２、映像構造化手続き蓄積部
３、映像構造化情報蓄積部４、入力装置５、映像モデル
スクリプト加工編集部６、映像種別判定部７、映像構造
化手続き生成部８、映像構造化処理部９、映像構造化情
報加工編集部１０、表示装置１１は、これらのすべてを
統合するバス１２に接続される。The above-mentioned video data storage unit 1, video model script storage unit 2, video structuring procedure storage unit 3, video structuring information storage unit 4, input device 5, video model script processing / editing unit 6, video The type determination unit 7, the video structuring procedure generation unit 8, the video structuring processing unit 9, the video structuring information processing and editing unit 10, and the display device 11 are connected to a bus 12 that integrates all of them.

【００２６】さて、上記のような構成における動作を説
明する。The operation of the above configuration will now be described.

【００２７】なお、本実施形においては、以下のような
定義を適用する。In the present embodiment, the following definitions are applied.

【００２８】映像の変わり目をカットと呼び、カットと
カットの間に挟まれた区間の映像をショットと呼ぶこと
とする。A transition between images is called a cut, and an image in a section sandwiched between cuts is called a shot.

【００２９】映像構造は、ノードにより表される。そし
て、ノードは、構成要素と属性情報を持つ。An image structure is represented by nodes. Each node has constituent elements and attribute information.

【００３０】ノードの構成要素は、ノードと一連の映像
区間からなる。The components of a node include a node and a series of video sections.

【００３１】ノードの属性情報は、ノードの名称、代表
フレームまたは代表フレーム群から構成される。The attribute information of a node includes a node name, a representative frame, or a representative frame group.

【００３２】なお、本実施形では、構造化は、必ずしも
厳密なツリー構造を採るものではなく、下位のノードが
構成要素として、より上位のノードを含むことも許容す
るが、表示の便宜上、ひとつの構造化された映像は、ひ
とつ上位のノードを持つものとする。In the present embodiment, the structuring does not always adopt a strict tree structure, and a lower node is allowed to include a higher node as a constituent element. It is assumed that the structured video has one upper node.

【００３３】また、映像モデルスクリプトは、映像構造
モデルを記述するものである。そして、映像モデルスク
リプトから映像構造化手続きが生成される。また、映像
モデルスクリプトは、インデックスの記述と、ノードの
記述からなる。The video model script describes a video structure model. Then, a video structuring procedure is generated from the video model script. The video model script includes a description of an index and a description of a node.

【００３４】インデックスは、映像をノードに文節化す
るための手がかりとなるフレームまたはショットであ
る。インデックスの記述は、例えば、画像の全体または
一部を、そのまま用いたり、ショットの繰り返しパター
ンにより記述するなど、抽象度の異なる様々な表現が用
いられる。本実施形では、画像の全体または一部と、情
報や画像に現れる文字情報などを用いることも可能にし
ている。The index is a frame or a shot that is a clue for segmenting a video into nodes. For the description of the index, for example, various expressions having different levels of abstraction are used, such as using the whole or a part of the image as it is or describing the image using a repeated shot pattern. In the present embodiment, it is also possible to use the whole or a part of the image, information, and character information appearing in the image.

【００３５】ノードは、映像に付与されたインデックス
を基に特定される一連の映像区間や、ノードの組み合わ
せにより記述される。A node is described by a series of video sections specified based on an index given to a video and a combination of nodes.

【００３６】さて、映像の構造化は、図２のフローチャ
ートに示すような手順により実施される。The structuring of an image is performed according to the procedure shown in the flowchart of FIG.

【００３７】前処理ルーティン１３においては、映像デ
ータ蓄積部１に蓄積された映像データに対する、カット
検出処理Ｓ１とショットクラスタリング処理Ｓ２が行わ
れる。これらの処理は、映像モデルスクリプト加工編集
部６において実施される。In the pre-processing routine 13, cut detection processing S1 and shot clustering processing S2 are performed on the video data stored in the video data storage unit 1. These processes are performed in the video model script processing / editing unit 6.

【００３８】まず、カット検出処理Ｓ１により、映像の
変わり目を検出することにより、映像をショットに分割
する。First, the video is divided into shots by detecting a transition of the video in the cut detection processing S1.

【００３９】続いて、画像の類似度にしたがって、ショ
ットクラスタリング処理Ｓ２を行う。この処理は、例え
ば、本件出願人が、特開平９−２７０００６号公報にお
いて説明した方法を適用する。ショットのクラスタリン
グにより、類似ショットには同一のラベルが付与され
る。Subsequently, a shot clustering process S2 is performed according to the similarity of the images. For this processing, for example, the method described in Japanese Patent Application Laid-Open No. 9-270006 by the present applicant is applied. Due to the clustering of shots, similar shots are given the same label.

【００４０】前処理ルーティン１３の処理に続いて、映
像構造化処理ルーティン１４の処理に入るが、ここで
は、映像構造化手続き生成処理Ｓ３、インデックス付与
処理Ｓ４、ノード作成処理Ｓ５、構造化情報記録処理Ｓ
６の処理が行われる。Following the processing of the preprocessing routine 13, the processing of the video structuring processing routine 14 is started. Here, the video structuring procedure generation processing S3, the index assignment processing S4, the node creation processing S5, the structuring information recording Processing S
6 is performed.

【００４１】まず、映像構造化手続き生成処理Ｓ３にお
いては、自動または手動により、映像種別判定部７での
処理を通じて、映像の種別を決定し、その映像用の映像
モデルスクリプトを映像モデルスクリプト蓄積部２から
読み出し、この映像モデルスクリプトから、映像構造化
手続き生成部８を通じて、映像構造化手続きを生成す
る。ここで生成した映像構造化手続きは、映像構造化手
続き蓄積部３に蓄積される。First, in the video structuring procedure generation process S3, the type of the video is determined automatically or manually through the process of the video type determination unit 7, and the video model script for the video is stored in the video model script storage unit. 2 and generates a video structuring procedure from the video model script through the video structuring procedure generation unit 8. The generated video structuring procedure is stored in the video structuring procedure storage unit 3.

【００４２】続くインデックス付与処理Ｓ４において、
映像構造化手続きにより指定される画像やショットの繰
り返しパターンに合致するフレームまたはショットに、
インデックスを付与する。In the following indexing process S4,
Frames or shots that match the repeating pattern of images or shots specified by the video structuring procedure,
Assign an index.

【００４３】そして、ノード作成処理Ｓ５において、イ
ンデックスの付与に続いて、映像構造化手続きに従い、
映像からノードを抽出し、ノードごとに代表フレームま
たは代表フレーム群を選択する。Then, in the node creation processing S5, following the assignment of the index, in accordance with the video structuring procedure,
A node is extracted from the video, and a representative frame or a representative frame group is selected for each node.

【００４４】以上の、インデックス付与処理Ｓ４および
ノード作成処理Ｓ５は、映像構造化処理９における処理
を通じて実行される。The above-described index assignment processing S4 and node creation processing S5 are executed through the processing in the video structuring processing 9.

【００４５】最後に、構造化情報記録処理Ｓ６におい
て、以上のようにして獲得された構造化情報を、映像構
造化情報蓄積部４に蓄積記録する。Finally, in the structured information recording process S6, the structured information obtained as described above is stored and recorded in the video structured information storage unit 4.

【００４６】表示装置１１は、映像を、その構造化情報
に基づき、構造化表示し、入力装置５と共に対話的に、
順次ユーザの所望の映像を提示する。The display device 11 displays the video in a structured manner based on the structured information, and interactively with the input device 5
The video desired by the user is sequentially presented.

【００４７】続いて、表示装置１１における処理の流れ
を説明する。Next, the flow of processing in the display device 11 will be described.

【００４８】表示は、最上位ノードから始められ、画面
には、最上位ノードの名称と、代表フレーム（群）が表
示される。ユーザは、代表フレームを選択し、そのノー
ドの映像の再生を行ったり、そのノードを展開したり、
展開されたノードを元に戻したりすることができる。ノ
ードが展開されると、そのノードに含まれる全てのノー
ドが名前と代表フレーム、または代表フレーム群により
一覧表示される。The display is started from the top node, and the screen displays the name of the top node and a representative frame (group). The user selects a representative frame, plays the video of the node, expands the node,
The expanded node can be restored. When a node is expanded, all nodes included in the node are displayed in a list by name and representative frame or representative frame group.

【００４９】以下、同様の手続を繰り返すことにより、
ユーザは、映像内容を短時間に把握し、所望の映像のみ
を選択的に視聴することが可能となる。Hereinafter, by repeating the same procedure,
The user can grasp the content of the video in a short time and selectively view only the desired video.

【００５０】なお、映像構造化処理部９による構造化処
理だけでは、映像の構造化結果が、ユーザの希望する形
になっていないような場合も発生し得る。そこで、本実
施形では、ユーザが入力装置を介して、画面上で対話的
に、ノードの変更、追加、削除などの処理を行うための
映像構造化情報加工編集部１０を備えており、学習的
に、ユーザの要望を反映できるような構造となってい
る。It should be noted that the structuring process by the video structuring unit 9 alone may cause a case where the structuring result of the video is not in the form desired by the user. Therefore, in the present embodiment, the video structuring information processing / editing unit 10 is provided for the user to interactively change, add, or delete nodes on the screen via the input device. The structure is such that the demands of the user can be reflected.

【００５１】以下、ニュース番組の映像を例にとって、
具体的な構造化手順について説明する。Hereinafter, taking a video of a news program as an example,
A specific structuring procedure will be described.

【００５２】一連の映像に対しては、まず、カット検出
処理と、ショットのクラスタリングが施される。For a series of videos, first, cut detection processing and shot clustering are performed.

【００５３】図３は、ニュースの映像にカット検出処理
を施し、ショットごとに１こまのフレームを並べたもの
である。フレームの左上の記号は、ショットのクラスタ
リングの結果を表すものであり、アルファベットａ、
ｂ、ｃ・・・はショットのクラス、数字１、２、３・・
・は出現順序を表す。FIG. 3 shows a news image subjected to cut detection processing, and one frame is arranged for each shot. The symbol at the upper left of the frame indicates the result of the clustering of the shot, and includes the alphabet a,
.. b, c... are shot classes, numbers 1, 2, 3,.
* Indicates the order of appearance.

【００５４】ニュース映像において、最も出現頻度の高
いショットは、キャスターのショットであると考えられ
る。そこで、最も多く繰り返されているショットをキャ
スターのショットとしてインデックシング（ｂ１、ｂ
２、ｂ３・・・）する。In the news video, the shot with the highest appearance frequency is considered to be a caster shot. Therefore, the most repeated shot is indexed as a caster shot (b1, b
2, b3 ...).

【００５５】キャスターのショットから、次のキャスタ
ーのショットの直前のショットまでをノード（ｂ１、ｃ
１、ｄ１、ｅ１の群、他）にまとめる。こうしてできた
ノードのなかで、他のノード内のショットと類似するシ
ョットを最も多く含むノードをヘッドラインとして、そ
の他のノードをトピックとする。From the shot of the caster to the shot immediately before the shot of the next caster, the nodes (b1, c)
1, d1, e1, etc.). Among the nodes thus formed, a node containing the largest number of shots similar to shots in other nodes is set as a headline, and other nodes are set as topics.

【００５６】トピックスのノードの中で、ヘッドライン
のノード内のショットと類似のショットを持つノード
を、ヘッドラインのノードにつけ加える。A node having a shot similar to the shot in the headline node among the topics nodes is added to the headline node.

【００５７】次に、最初に出現するキャスターのショッ
トまでをオープンニング、最後に出現するキャスターの
ショット以降をエンディングとする。Next, the opening up to the first caster shot is defined as opening, and the shot after the last caster shot is defined as ending.

【００５８】続いて、以上の処理でできたノードを全て
まとめて、ひとつのノードにし、これを最上位ノードと
する。Subsequently, all the nodes obtained by the above processing are put together into one node, which is set as the highest node.

【００５９】最後に、各ノードに含まれるショットまた
は、ショット群から、代表フレームを、一枚または複数
枚、選び、ノードの代表フレームまたは代表フレーム群
とする。Finally, one or a plurality of representative frames are selected from shots or shot groups included in each node, and are set as a representative frame or a representative frame group of the node.

【００６０】図４は、以上で説明したニュース映像の構
造を表す映像モデルスクリプトの例である。図１の映像
構造化手続き生成部８は、図４の記述にしたがって、以
下のような処理を行う。FIG. 4 is an example of a video model script representing the structure of the news video described above. The video structuring procedure generation unit 8 of FIG. 1 performs the following processing according to the description of FIG.

【００６１】インデクシング処理Ｓ４１において、ま
ず、インデックス定義Ｓ４２の記述において、最も頻繁
に現れるショットとして、キャスターの映像をインデッ
クス定義する。In the indexing process S41, first, in the description of the index definition S42, the caster video is index-defined as the shot that appears most frequently.

【００６２】続くノード処理Ｓ４３において、まず、ノ
ード定義Ｓ４４の記述で、最初のショットから最初のキ
ャスターのショットまでを代表フレームとして、オープ
ンニングとして定義する。In the subsequent node processing S43, first, in the description of the node definition S44, an opening from the first shot to the first caster shot is defined as a representative frame and defined as opening.

【００６３】続いて、ノード定義Ｓ４５の記述で、各キ
ャスターのショットから、次のキャスターのショットま
でのノードを、各ショットの第１フレームを代表フレー
ムとして、トピックスとして定義する。Subsequently, in the description of the node definition S45, the nodes from the shot of each caster to the shot of the next caster are defined as topics using the first frame of each shot as a representative frame.

【００６４】次に、ノード定義Ｓ４６の記述で、最後の
キャスターのショットから、最後のショットまでのノー
ドを、最後のショットの最後のフレームを代表フレーム
として、エンディングとして定義する。Next, in the description of the node definition S46, the nodes from the shot of the last caster to the last shot are defined as the ending with the last frame of the last shot as the representative frame.

【００６５】続いて、ノード名称変更Ｓ４７の記述で、
他のショットへのリンクを有するショットの数が最大で
あるトピックスを、ヘッドラインとする。Subsequently, in the description of the node name change S47,
The topic in which the number of shots having links to other shots is the largest is defined as a headline.

【００６６】次に、追加処理Ｓ４８の記述で、ヘッドラ
インとされたノードを、ヘッドラインに追加する。Next, the node designated as the headline in the description of the addition processing S48 is added to the headline.

【００６７】最後に、ノード定義Ｓ４９で、トップノー
ドをニュースとして定義するが、これはオープンニング
を代表フレームとし、ヘッドライン、トピックス、エン
ディングとして定義されることになる。Finally, in the node definition S49, the top node is defined as news, which is defined as a headline, topics, and ending with opening as a representative frame.

【００６８】以上のような記述にしたがって、図５に示
すような映像構造化手続を実施し、映像構造化処理９に
より映像の構造化情報が獲得される。ちなみに、図５の
情報は、キャスターのインデクシングＳ５１、オープン
ニングのノード定義Ｓ５２、トピックスのノード定義Ｓ
５３、エンディングのノード定義Ｓ５４、ヘッドライン
のノード定義Ｓ５５で構成される。According to the above description, a video structuring procedure as shown in FIG. 5 is performed, and video structuring processing 9 obtains video structuring information. Incidentally, the information of FIG. 5 includes the indexing S51 of the caster, the node definition S52 of the opening, and the node definition S of the topics.
53, an ending node definition S54, and a headline node definition S55.

【００６９】以上のような処理の結果、例えば、最上位
ノードは、図６の説明図に示すように展開される。図６
では、オープンニングに続き、ヘッドライン、５個のト
ピックス、エンディングと展開される例を示している。As a result of the above processing, for example, the highest node is expanded as shown in the explanatory diagram of FIG. FIG.
Shows an example in which headlines, five topics, and endings are developed after opening.

【００７０】以上の処理は、ショットのインデクシング
に繰り返しパターンを用いた場合を例示したが、画像の
全体または一部を用いてインデクシングを行う場合を、
天気予報の映像を例にとって、以下に説明する。In the above processing, the case where a repetitive pattern is used for indexing shots is exemplified. However, the case where indexing is performed using the whole or a part of an image is described.
This will be described below using a weather forecast image as an example.

【００７１】天気予報の場合は、同一の番組において、
例えば、「明日の天気」や「週間天気予報」など、トピ
ックスごとに、毎回同じ構図の画像が使われる。したが
って、トピックスごとに、そのトピックスを同定できる
ような画像の全体または一部を格納し、これを用いて、
インデクシングを行うことにより、容易に構造化を行う
ことができる。In the case of the weather forecast, in the same program,
For example, an image having the same composition is used for each topic, such as “tomorrow's weather” and “weekly weather forecast”. Therefore, for each topic, the whole or a part of the image that can identify the topic is stored, and by using this,
By performing indexing, structuring can be easily performed.

【００７２】図７は、天気予報の映像の構造を表す映像
モデルスクリプトの例である。図１の映像構造化手続き
生成部８は、図７の記述にしたがって、以下のような処
理を行う。FIG. 7 shows an example of a video model script representing the structure of a weather forecast video. The video structuring procedure generation unit 8 of FIG. 1 performs the following processing according to the description of FIG.

【００７３】まず、インデクシング処理Ｓ７１におい
て、まず、インデックス定義Ｓ７２の記述において、今
日の天気予報のイメージを含むショットを、今日の天気
としてインデックス定義する。First, in the indexing process S71, first, in the description of the index definition S72, a shot including an image of today's weather forecast is index-defined as today's weather.

【００７４】続いて、同様に、例えば、週間天気のイメ
ージや、明日の天気のイメージや、などを続けてインデ
ックス定義し、インデックス定義Ｓ７３の記述におい
て、世界の天気予報のイメージを含むショットを、世界
の天気としてインデックス定義するまで、同様の処理を
繰り返す。Subsequently, similarly, for example, a weekly weather image, a tomorrow's weather image, and the like are successively index-defined, and in the description of the index definition S73, a shot including the image of the world weather forecast is defined as: The same process is repeated until an index is defined as the weather of the world.

【００７５】次のノード処理Ｓ７４において、ノード定
義Ｓ７５の記述で、インデックスを有するあらゆるショ
ットから、インデックスを有する次のショットまでを、
無題として定義する。In the next node processing S74, in the description of the node definition S75, from every shot having an index to the next shot having an index,
Define as untitled.

【００７６】続いて、ノード定義Ｓ７６の記述で、各キ
ャスターのショットから、次のキャスターのショットま
でのノードを、各ショットの第１フレームを代表フレー
ムとして、トピックスとして定義する。Subsequently, in the description of the node definition S76, the nodes from the shot of each caster to the shot of the next caster are defined as topics using the first frame of each shot as a representative frame.

【００７７】次に、ノード名称変更Ｓ７７の記述で、最
初のショットのインデックスを、あらゆるノードのノー
ド名とする。Next, in the description of the node name change S77, the index of the first shot is set as the node name of every node.

【００７８】以上のような記述にしたがって、図８に示
すような映像構造化手続を実施し、映像構造化処理９に
より映像の構造化情報が獲得される。ちなみに、図８の
情報は、インデクシング処理Ｓ８１、ノード処理Ｓ８２
で構成される。According to the above description, a video structuring procedure as shown in FIG. 8 is performed, and video structuring processing 9 acquires video structuring information. Incidentally, the information in FIG. 8 includes an indexing process S81 and a node process S82.
It consists of.

【００７９】以上のような映像構造化においては、映像
ごとに、その映像モデルスクリプトを適切に作成するこ
とが必要である。しかしながら、映像を希望する形に構
造化するような映像モデルスクリプトを作るのは、そう
容易なことではない。In the above-described image structuring, it is necessary to appropriately create an image model script for each image. However, it is not so easy to create a video model script that structures the video to the desired shape.

【００８０】これに対して、本実施形では、映像構造化
情報加工編集部１０を用いることにより、まず、ひな形
となる映像モデルスクリプトを作成し、これに基づい
て、映像を構造化し、その結果を参考にして、映像モデ
ルスクリプトの修正を行うという手順を繰り返すことに
より、比較的容易に、所望の構造化を実現するために必
要な構造モデルスクリプトを作ることを可能としてい
る。On the other hand, in this embodiment, by using the video structuring information processing / editing unit 10, first, a video model script serving as a model is created, and based on this, the video is structured, and the video is structured. By repeating the procedure of correcting the video model script with reference to the result, it is possible to relatively easily create a structural model script required to realize a desired structuring.

【００８１】以上の述べたうちの各手順は、コンピュー
タが実行可能なプログラムとして各種の記録媒体、例え
ばＣＤＲＯＭのような光ディスク、フロッピーディスク
のような磁気ディスク／媒体、あるいはＭＤのような光
磁気ディスク等に記録することもできる。Each of the above-described procedures is performed by a computer as a program that can be executed by various recording media, for example, an optical disk such as a CDROM, a magnetic disk / medium such as a floppy disk, or a magneto-optical disk such as an MD. Etc. can also be recorded.

【００８２】[0082]

【発明の効果】以上述べたように、本発明の映像構造化
装置は、映像の種別ごとに、映像構造化手続を準備する
ことにより、種々の映像の構造化を同じ処理系で実現で
き、ユーザが映像の内容を短時間で把握し、所望の映像
のみを選択的に視聴するような環境を容易に構築できる
という効果がある。As described above, the video structuring apparatus of the present invention can realize various video structuring by the same processing system by preparing a video structuring procedure for each video type. There is an effect that an environment in which the user can grasp the contents of the video in a short time and selectively view only the desired video can be easily constructed.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の実施系の映像構造化装置のブロック図
である。FIG. 1 is a block diagram of a video structuring apparatus according to an embodiment of the present invention.

【図２】図１の構成の動作を説明するためのフローチャ
ートである。FIG. 2 is a flowchart for explaining the operation of the configuration of FIG. 1;

【図３】ニュースの映像にカット検出処理を施し、ショ
ットごとに１こまのフレームを並べた説明図である。FIG. 3 is an explanatory diagram in which cut detection processing is performed on a news video, and one frame is arranged for each shot.

【図４】ニュース映像の構造を表す映像モデルスクリプ
トの例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of a video model script representing the structure of a news video.

【図５】図４の記述に基づく映像構造化手続の情報の例
を示す説明図である。FIG. 5 is an explanatory diagram showing an example of information of a video structuring procedure based on the description in FIG. 4;

【図６】図４、図５のような処理の結果得られる最上位
ノードの展開例を示す説明図である。FIG. 6 is an explanatory diagram showing an example of expanding a top node obtained as a result of the processing shown in FIGS. 4 and 5;

【図７】天気予報映像の構造を表す映像モデルスクリプ
トの例を示す説明図である。FIG. 7 is an explanatory diagram showing an example of a video model script representing a structure of a weather forecast video.

【図８】図７の記述に基づく映像構造化手続の情報の例
を示す説明図である。FIG. 8 is an explanatory diagram showing an example of information of a video structuring procedure based on the description of FIG. 7;

[Explanation of symbols]

１映像データ蓄積部２映像モデルスクリプト蓄積部３映像構造化手続き蓄積部４映像構造化情報蓄積部５入力装置６映像モデルスクリプト加工編集部７映像種別判定部８映像構造化手続き生成部９映像構造化処理１０映像構造化情報加工編集部１１表示装置１２バス Reference Signs List 1 video data storage unit 2 video model script storage unit 3 video structuring procedure storage unit 4 video structuring information storage unit 5 input device 6 video model script processing / editing unit 7 video type determination unit 8 video structuring procedure generation unit 9 video structure Processing 10 Video structured information processing and editing unit 11 Display device 12 Bus

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号ＦＩＨ０４Ｎ 5/91 Ｎ (72)発明者青木恒神奈川県川崎市幸区小向東芝町１株式会社東芝研究開発センター内────────────────────────────────────────────────── ─── Continued on the front page (51) Int.Cl. ⁶ Identification symbol FI H04N 5/91 N (72) Inventor Tsune Aoki 1 Komukai Toshiba-cho, Saiwai-ku, Kawasaki-shi, Kanagawa Prefecture Toshiba R & D Center Co., Ltd.

Claims

[Claims]

1. A designating means for designating a type of video, a procedure storing means for storing a structuring procedure of a video corresponding to each type of video, and a structuring corresponding to the type of video specified by the designating means An image structuring apparatus, comprising: a structuring unit that calls a procedure from the procedure storing unit and implements video structuring; and a structured information storing unit that stores information from the structuring unit.

2. Display means for displaying a video based on the video structured information stored by said structured information storage means.
The video structuring device according to claim 1, further comprising:

3. The video structuring apparatus according to claim 2, further comprising: means for storing a video model script describing a video structure model; and a procedure generation unit for generating a video structuring means based on the video model script. .

4. The video structuring apparatus according to claim 3, further comprising video model script processing / editing means for processing / editing the video model script.

5. The video structure according to claim 2, further comprising video structure information processing and editing means for interactively processing and editing the structured information obtained by said structuring means while displaying said structured information on said display means. Device.

6. A recording medium on which a program for structuring an image is recorded in a computer-executable manner, wherein the program calls a pre-stored structuring procedure corresponding to a specified type of image. A recording medium characterized by including a procedure for also structuring an image in the called structured procedure and storing the structured information.