JP4452122B2

JP4452122B2 - Metadata generation apparatus and metadata generation program

Info

Publication number: JP4452122B2
Application number: JP2004141595A
Authority: JP
Inventors: 雅規佐野; 英樹住吉; 正啓柴田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2004-05-11
Filing date: 2004-05-11
Publication date: 2010-04-21
Anticipated expiration: 2024-05-11
Also published as: JP2005322173A

Description

The present invention relates to a metadata generation apparatus and a metadata generation program that generate metadata describing information about content, based on various information included in that content.

In recent years, research has been conducted on generating metadata that describes information about content (video content) such as programs from the various information contained in the content, and various metadata generation methods have been disclosed. For example, there are methods that analyze video content and use its features as metadata (see Patent Document 1), methods that extract keywords from text information in the content and use them as metadata (see Patent Document 2), and methods that analyze sound and use information for each music section or speech section as metadata (see Non-Patent Document 1).
More recently, research has been conducted on so-called multimodal processing, which handles diverse information such as video and audio in an integrated manner, and there is a technique that generates new metadata by integrating metadata from multiple media, for example metadata generated by video processing and metadata generated by audio processing (see Non-Patent Document 2).
Patent Document 1: Japanese Patent Laying-Open No. 2004-7535 (FIG. 1)
Patent Document 2: Japanese Patent Laying-Open No. 2003-228569 (FIG. 1)
Non-Patent Document 1: Kenichi Minami and three others, "Video indexing using sound information and its applications", IEICE Transactions, Vol. J81-D2, No. 3, pp. 529-537
Non-Patent Document 2: Shigekazu Sasamori and two others, "Event extraction from video media by intermodal collaboration", IEICE PRMU 98-189, pp. 99-106 (1999)

Because each of the conventional metadata generation methods described above generates metadata by its own specialized technique, the data format of the generated metadata also differs from method to method. Consequently, when metadata relating to video, audio, and so on is generated from a single piece of content, multiple pieces of metadata with different data formats result, and producing one standardized set of metadata for the content requires editing each of them individually, which is laborious.
In addition, no tool (apparatus or program) for generating and editing individual metadata in a general-purpose manner has been reported so far, and there is growing demand for a tool that can generate and edit various kinds of metadata uniformly, in the same operating environment, through a common interface.

The present invention has been made to solve the above problems. Its object is to provide a metadata generation apparatus and a metadata generation program that offer the user a common operating environment when generating metadata for content, and that can generate and edit metadata that differs between video, audio, and so on as metadata in a unified format.

The present invention was devised to achieve the above object. First, the metadata generation apparatus according to claim 1 generates section metadata, which is information for each event section of a given event in content, in which semantic information indicating the substance of the content is associated with each event section. The apparatus comprises section information extraction means, event selection means, section information acquisition processing means, semantic information extraction means, type selection means, and semantic information acquisition processing means.

With this configuration, the metadata generation apparatus includes a plurality of section information extraction means, each of which detects an event in the content and extracts at least the start time and end time of the event as section information. When one of a plurality of events is designated as the selected event through the event selection means, the apparatus selects the section information extraction means corresponding to that selected event. This allows the user of the metadata generation apparatus to select the section information extraction means that extracts, from the content, section information indicating the event sections in which the desired event occurs. Here, an event is a characteristic phenomenon or change occurring in the content, for example a scene change in the video or a change in audio frequency. An event section is a section that shows a feature of the video, audio, or other content, for example the section between cut points at which the video scene changes, or an audio section in a certain frequency range.
The section information acquisition processing means then acquires the section information of the selected event in the content from the section information extraction means selected by the event selection means, and generates first section metadata. Section metadata containing at least the start time and end time of the event is thereby generated.

The metadata generation apparatus further includes a plurality of semantic information extraction means, each of which analyzes the content and extracts semantic information indicating the substance of the content in association with the start time and end time in the section information. When one of the types of semantic information is designated as the selected type through the type selection means, the apparatus selects the semantic information extraction means corresponding to that selected type. This allows the user of the metadata generation apparatus to select the semantic information extraction means that extracts the desired semantic information from the content. Here, semantic information is information indicating the meaning or substance of a section in the content (here, the event section indicated by the section information), for example the name of a person appearing in the video and that person's actions, or comments spoken by an announcer in a sports broadcast.
The semantic information acquisition processing means then acquires semantic information from the semantic information extraction means selected by the type selection means, and generates second section metadata in which the section information and the semantic information are associated. Section metadata in which semantic information is associated with each event section of the content is thereby generated.
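The two-stage flow described above (select an extractor for an event, build first section metadata, then select an extractor for a type of semantic information and attach its results) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `SectionMetadata`, `SECTION_EXTRACTORS`, `SEMANTIC_EXTRACTORS`, the toy extractors, and the dictionary layout of `content` do not come from the patent.

```python
from dataclasses import dataclass, field

@dataclass
class SectionMetadata:
    start: float                      # start time of the event section (seconds)
    end: float                        # end time of the event section (seconds)
    semantics: dict = field(default_factory=dict)

def sections_between_cuts(content):
    """Toy section extractor: one section between each pair of adjacent cut points."""
    cuts = content["cut_points"]
    return list(zip(cuts, cuts[1:]))

def speaker_in_section(content, start, end):
    """Toy semantic extractor: speakers whose timestamp falls inside the section."""
    return ", ".join(name for t, name in content["speakers"] if start <= t < end)

# Registries standing in for the "plurality of extraction means" (hypothetical).
SECTION_EXTRACTORS = {"cut": sections_between_cuts}
SEMANTIC_EXTRACTORS = {"speaker": speaker_in_section}

def generate_first_metadata(content, event):
    """Event selection, then section information acquisition."""
    extractor = SECTION_EXTRACTORS[event]
    return [SectionMetadata(s, e) for s, e in extractor(content)]

def add_semantics(content, metadata, kind):
    """Type selection, then semantic information acquisition."""
    extractor = SEMANTIC_EXTRACTORS[kind]
    for m in metadata:
        m.semantics[kind] = extractor(content, m.start, m.end)
    return metadata

content = {"cut_points": [0.0, 10.0, 25.0],
           "speakers": [(3.0, "Player A"), (12.0, "Player B")]}
first = generate_first_metadata(content, "cut")     # sections only
second = add_semantics(content, first, "speaker")   # sections plus semantics
```

Calling `add_semantics` again with another registered type would add a further entry to the same sections, which is one way to read the repeated association of semantic information described later in the text.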

The metadata generation apparatus according to claim 2 is the metadata generation apparatus according to claim 1, wherein the section information extraction means holds in advance a character string indicating its function, and the event selection means acquires the character string from the section information extraction means and displays the character strings as a list on the screen of a display device.

With this configuration, each section information extraction means holds in advance a character string indicating its function, and the event selection means acquires that character string from the section information extraction means. The metadata generation apparatus then displays the character strings as a list on the screen of the display device. The user can thus see which section information extraction means are available and what function each one has.

Furthermore, the metadata generation apparatus according to claim 3 is the metadata generation apparatus according to claim 1 or claim 2, wherein the section information extraction means holds in advance the parameter items it uses to extract the section information, and the section information acquisition processing means acquires the parameter items from the section information extraction means and displays, on a display device, a parameter input screen showing those parameter items.

With this configuration, each section information extraction means holds in advance the parameter items for extracting section information, and the section information acquisition processing means acquires those parameter items from the section information extraction means. The metadata generation apparatus then prompts the user to enter parameters by displaying, on the display device, a parameter input screen showing the parameter items. The apparatus can thus provide the user with a common interface even when different section information extraction means are operated.
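As a rough illustration of this common interface, the following Python sketch has each extractor report a function string and its parameter items, from which a host could build a selection list and an empty input form. The class names, method names, and parameter names are hypothetical; the patent specifies only that the string and the parameter items are held by each extraction means and acquired from it.

```python
from abc import ABC, abstractmethod
from typing import Dict, List

class SectionExtractor(ABC):
    """Hypothetical common interface every section extractor implements."""

    @abstractmethod
    def description(self) -> str:
        """Character string indicating the extractor's function."""

    @abstractmethod
    def parameter_items(self) -> List[str]:
        """Names of the parameters the extractor needs."""

class LoudSectionExtractor(SectionExtractor):
    def description(self) -> str:
        return "Sections above a volume threshold"

    def parameter_items(self) -> List[str]:
        return ["threshold", "min_duration"]

class CutSectionExtractor(SectionExtractor):
    def description(self) -> str:
        return "Sections between cut points"

    def parameter_items(self) -> List[str]:
        return ["sensitivity"]

def menu_entries(extractors: List[SectionExtractor]) -> List[str]:
    """Strings the host would show in its selection list."""
    return [e.description() for e in extractors]

def input_form(extractor: SectionExtractor) -> Dict[str, str]:
    """Empty form, one field per parameter item, for the user to fill in."""
    return {item: "" for item in extractor.parameter_items()}

extractors = [LoudSectionExtractor(), CutSectionExtractor()]
menu = menu_entries(extractors)
form = input_form(extractors[0])
```

Because the host only ever calls `description()` and `parameter_items()`, adding a new extractor does not require changing the host, which matches the extensibility the text attributes to the common interface.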

The metadata generation apparatus according to claim 4 is the metadata generation apparatus according to claim 1, wherein the semantic information extraction means holds in advance a character string indicating its function, and the type selection means acquires the character string from the semantic information extraction means and displays the character strings as a list on the screen of a display device.

With this configuration, each semantic information extraction means holds in advance a character string indicating its function, and the type selection means acquires that character string from the semantic information extraction means. The metadata generation apparatus then displays the character strings as a list on the screen of the display device. The user can thus see which semantic information extraction means are available and what function each one has.

Furthermore, the metadata generation apparatus according to claim 5 is the metadata generation apparatus according to claim 1 or claim 4, wherein the semantic information extraction means holds in advance the parameter items it uses to extract the semantic information, and the semantic information acquisition processing means acquires the parameter items from the semantic information extraction means and displays, on a display device, a parameter input screen showing those parameter items.

With this configuration, each semantic information extraction means holds in advance the parameter items for extracting semantic information, and the semantic information acquisition processing means acquires those parameter items from the semantic information extraction means. The metadata generation apparatus then prompts the user to enter parameters by displaying, on the display device, a parameter input screen showing the parameter items. The apparatus can thus provide the user with a common interface even when different semantic information extraction means are operated.

The metadata generation apparatus according to claim 6 is the metadata generation apparatus according to any one of claims 1 to 5, wherein, each time the semantic information acquisition processing means acquires semantic information, it generates section metadata in which a plurality of pieces of semantic information are associated with the section information.

With this configuration, each time a different semantic information extraction means is selected by the type selection means and semantic information is acquired by the semantic information acquisition processing means, the metadata generation apparatus generates section metadata in which the different semantic information is added in turn to a single piece of section information representing the same event section.

Furthermore, the metadata generation program according to claim 7 causes a computer to function as event selection means, section information acquisition processing means, type selection means, and semantic information acquisition processing means, in order to generate section metadata, which is information for each event section in content. It does so on the basis of a plurality of section information extraction modules, which differ for each event and each of which extracts from the content at least the start time and end time of an event as section information, and a plurality of semantic information extraction modules, which differ for each type of semantic information and each of which analyzes the content and extracts semantic information indicating the substance of the content in association with time.

With this configuration, when one of a plurality of events is designated as the selected event through the event selection means, the metadata generation program selects the section information extraction module corresponding to that selected event. This makes it possible to select the section information extraction module that extracts, from the content, section information indicating the event sections in which the event desired by the user occurs.
The section information acquisition processing means then acquires the section information of the selected event in the content from the section information extraction module selected by the event selection means, and generates first section metadata.

Furthermore, when one of the types of semantic information is designated as the selected type through the type selection means, the metadata generation program selects the semantic information extraction module corresponding to that selected type. This makes it possible to select the semantic information extraction module that extracts, from the content, the semantic information desired by the user.
The semantic information acquisition processing means then acquires semantic information from the semantic information extraction module selected by the type selection means, and generates second section metadata in which the section information and the semantic information are associated.

According to the invention of claim 1, the metadata generation apparatus can generate section metadata in which section information and semantic information are associated for each event section of the content. The user can generate section metadata with the simple operation of selecting, from the plurality of section information extraction means and semantic information extraction means, the means that perform the desired functions. Moreover, the section metadata generated by the apparatus has a common format containing section information and semantic information even when the functions of the section information extraction means and semantic information extraction means differ. The metadata generation apparatus therefore serves as a platform that can handle in a common way metadata that was conventionally generated separately, and it enables various forms of distribution, such as processing metadata (section metadata) in a distributed manner or exchanging metadata between parties.

According to the invention of claim 2 or claim 4, the metadata generation apparatus displays on the screen a list of the character strings, held by the section information extraction means or the semantic information extraction means, that indicate their functions, so the user can select the means that performs the desired function with the simple operation of choosing one of the listed items. In addition, when a section information extraction means or semantic information extraction means is added, its function string can be added to the list display simply by giving the new means, as a common interface, a function that reports the character string.

According to the invention of claim 3 or claim 5, the metadata generation apparatus displays the parameter items held by the section information extraction means or the semantic information extraction means on the input screen and prompts the user to enter the parameters, so even when a means with a different function is selected, the user can easily recognize and enter the parameters needed to perform that function. In addition, when a section information extraction means or semantic information extraction means is added, its parameter items can be shown on the input screen simply by giving the new means, as a common interface, a function that reports the parameter items.

According to the invention of claim 6, the metadata generation apparatus can associate a plurality of pieces of semantic information with one piece of section information, and can generate section metadata in a common format even when different semantic information extraction means are used. The metadata generation apparatus can thereby function as an integrated platform for generating metadata.

According to the invention of claim 7, the metadata generation program can generate section metadata in which section information and semantic information are associated for each event section of the content. Because the program uses modularized programs as the means for extracting section information and semantic information, functions can be added easily by creating modules that have the common interface.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.
[Configuration of metadata generation system]
First, the configuration of the metadata generation system according to the present invention will be described with reference to FIG. 1. FIG. 1 is a block diagram showing the configuration of the metadata generation system according to the present invention.
As shown in FIG. 1, the metadata generation system 100 extracts, from content (video, audio, text information), information for each section (segment) of a certain time width on the basis of the content's features, and associates a meaning (label) with each section, thereby generating and editing metadata for each extracted section (section metadata). The metadata generation system 100 includes a metadata generation apparatus 1, a module storage device 2, and a content storage device 3.

The metadata generation apparatus 1 selects modules stored in the module storage device 2, incorporates them into itself, and runs them, thereby generating section metadata from the content and editing it. The detailed configuration of the metadata generation apparatus 1 will be described later.

The module storage device 2 is a storage device, such as a hard disk, that stores programs that extract information for each section (segment) from the content (modules: section information extraction modules SM (SM1, SM2, SM3, ...)) and programs that associate a meaning (label) with each section (modules: semantic information extraction modules LM (LM1, LM2, LM3, ...)).

The section information extraction module SM extracts, on the basis of a feature of the content, the sections exhibiting that feature as start times and end times. The section information extraction module SM is selected by the section information extraction module selection means 10 of the metadata generation apparatus 1, described later, and operates by being incorporated as the section information extraction means 10a of the metadata generation apparatus 1. The section information extraction module SM can be created, for example, as a dynamic link library (DLL). Here, the section information extraction modules (SM1, SM2, SM3, ...) each extract section information on the basis of a different feature.

For example, when the program (content) is a soccer broadcast, the section information extraction module SM1 extracts the time sections in which the volume reaches a certain level from an audio file of the digitized cheers of the crowd (audio A1 in the content storage device 3). The section information extraction module SM is not limited to this, as long as it extracts the specific sections of interest from the content. For example, it may detect cut points, which are the frames at which the video switches, and extract the sections between cut points (start time, end time) as section information.
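A volume-based extractor of the kind attributed to SM1 could look like the following Python sketch. It assumes the audio has already been reduced to one volume value per fixed-length frame, and it interprets "a certain level" as "at or above a threshold"; the function name `loud_sections`, the `threshold` and `frame_sec` parameters, and the sample data are all illustrative, not taken from the patent.

```python
def loud_sections(volumes, threshold, frame_sec=1.0):
    """Return (start, end) times, in seconds, of runs of frames whose
    volume is at or above the threshold."""
    sections = []
    start = None                      # index of the first frame in the current run
    for i, v in enumerate(volumes):
        if v >= threshold and start is None:
            start = i                 # a loud run begins
        elif v < threshold and start is not None:
            sections.append((start * frame_sec, i * frame_sec))
            start = None              # the loud run ended at frame i
    if start is not None:             # run still open at the end of the audio
        sections.append((start * frame_sec, len(volumes) * frame_sec))
    return sections

# Frames 1-2 and frame 4 are loud.
print(loud_sections([0.1, 0.8, 0.9, 0.2, 0.7], threshold=0.5))
# -> [(1.0, 3.0), (4.0, 5.0)]
```

Each returned pair corresponds to the start time and end time that the text calls section information.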

The semantic information extraction module LM extracts, from the content, the substance of the content as semantic information in association with the section information extracted by the section information extraction module SM. The semantic information extraction module LM is selected by the semantic information extraction module selection means 12 of the metadata generation apparatus 1, described later, and operates by being incorporated as the semantic information extraction means 12a of the metadata generation apparatus 1. Like the section information extraction module SM, the semantic information extraction module LM is created as a dynamic link library (DLL), and a plurality of semantic information extraction modules (LM1, LM2, LM3, ...) that extract different semantic information from the same content are stored in the module storage device 2.

For example, when the program (content) is a soccer broadcast, the semantic information extraction module LM1 uses a general face recognition technique to detect, from the video file (video V1 in the content storage device 3), the players appearing in the video within the sections extracted by the section information extraction module SM1, and associates the player names with the section information as semantic information. The semantic information extraction module LM is not limited to this, as long as it extracts the target substance from the content as semantic information. For example, it may extract the comments corresponding to the section information from text information in which the announcer's comments in the soccer broadcast have been transcribed (text information T1 in the content storage device 3).
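The text-based variant mentioned last is simple enough to sketch. The Python example below assumes the transcribed comments are available as (time in seconds, text) pairs, which is a guess at the layout of text information T1, and it treats a comment as belonging to a section when its time lies in the half-open interval from start to end. The function name and sample comments are hypothetical.

```python
def comments_in_section(timed_comments, start, end):
    """Return the comments whose timestamp t satisfies start <= t < end."""
    return [text for t, text in timed_comments if start <= t < end]

# Hypothetical transcribed commentary: (time in seconds, comment text).
transcript = [
    (5.0, "Kick-off."),
    (62.5, "A long pass down the left."),
    (64.0, "Goal!"),
    (120.0, "Play resumes."),
]

# Comments for a section running from 60 s to 70 s.
labels = comments_in_section(transcript, 60.0, 70.0)
```

The resulting list could then be stored as the semantic information for that section.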

The content storage device 3 is a storage device, such as a hard disk, that stores content. Here, the content storage device 3 stores video (V1, V2, ...), audio (A1, A2, ...), and text information (T1, T2, ...) as content.
The video (V1 etc.) is a video file in a format such as MPEG. The audio (A1 etc.) is an audio file containing the audio recorded at the same time as the video (V1 etc.). For example, the audio A1 is an audio file in the standard WAVE format of Windows (registered trademark) (PCM-recorded audio data). The text information (T1 etc.) is a text file in which various information associated with the playback time of the video (V1 etc.) is written as text data. For example, the text information T1 is a text file in which the announcer's comments are transcribed time by time.

In this way, the metadata generation apparatus 1 selects a module that extracts section information (a section information extraction module SM) from the module storage device 2 and uses that module to divide (segment) the content into sections. The metadata generation apparatus 1 then selects a module that extracts semantic information (a semantic information extraction module LM) from the module storage device 2 and uses that module to associate semantic information with the segmented section information, thereby generating section metadata.

Here, the section metadata will be described with reference to FIG. 2. FIG. 2 is an explanatory diagram for explaining the structure of the section metadata, in which (a) shows the structure of the section metadata and (b) shows an example of the section metadata.

As shown in FIG. 2(a), section metadata is data that describes the content and includes identification information, section information, and semantic information.
Here, the identification information is unique information for identifying the section metadata and is an identifier (ID) such as a numerical value.
The section information indicates the time section of the content to which the section metadata applies and consists of a start time (In) and an end time (Out).

The semantic information associates the meaning and content of the time section delimited by the section information with a key (Key) and a value (Value). The key indicates the type of meaning or content, and the value gives the detailed information for the type indicated by the key. In addition, information indicating the relationship between pieces of section metadata (the section relationship) is added here as semantic information. This semantic information associates the section relationship with a type (Type) and a target (Target). The type indicates what significance another section has with respect to the section in question, and the target is information (an identifier) indicating the other section to which the relationship applies. A plurality of pieces of semantic information can be associated within the same section metadata.

A specific example of the section metadata will be described with reference to FIG. 2(b).
As shown in FIG. 2(b), section metadata M1, M2, and M3 are identified by the identifiers ID ("001", "002", "003"), which are the identification information.
The start time In and end time Out of each piece of section metadata indicate which time section of the content time (program time) the metadata covers. For example, section metadata M1 is the metadata for the time section from "09:00:32-00" to "09:00:54-00". The time notation "XX:XX:XX-XX" denotes "hour:minute:second-frame number".
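The "hour:minute:second-frame number" notation can be handled mechanically. The following is a minimal sketch, not part of the described apparatus: the names `Timecode`, `parse_timecode`, and `to_frames` are hypothetical, and the frame rate of 30 frames per second is an assumption, since the description does not state one.

```python
import re
from typing import NamedTuple


class Timecode(NamedTuple):
    """A time written as hour:minute:second-frame, e.g. "09:00:32-00"."""
    hour: int
    minute: int
    second: int
    frame: int


# Two digits per field, with a hyphen before the frame number.
_TIMECODE = re.compile(r"^(\d{2}):(\d{2}):(\d{2})-(\d{2})$")


def parse_timecode(text: str) -> Timecode:
    """Parse "hh:mm:ss-ff" into a Timecode, rejecting anything else."""
    match = _TIMECODE.match(text)
    if match is None:
        raise ValueError(f"not an hh:mm:ss-ff timecode: {text!r}")
    return Timecode(*(int(group) for group in match.groups()))


def to_frames(tc: Timecode, fps: int = 30) -> int:
    """Convert to a frame count. fps=30 is an assumed value, not from the source."""
    return ((tc.hour * 60 + tc.minute) * 60 + tc.second) * fps + tc.frame


# The In and Out times of section metadata M1 in FIG. 2(b).
start = parse_timecode("09:00:32-00")
end = parse_timecode("09:00:54-00")
duration_frames = to_frames(end) - to_frames(start)
```

Because `Timecode` is a tuple ordered from hour down to frame, two timecodes can be compared directly (`start < end`) without knowing the frame rate; the frame rate only matters when converting to a frame count.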

The section metadata also indicates what the content in the section contains by means of a key (Key) and a value (Value) expressing its meaning and content. For example, section metadata M1 expresses the meaning and content of its section with the key "event", which indicates that some event occurred, and the value "shot", which is the detailed information about that event. Section metadata M2 expresses the meaning and content of its section with the key "player", which indicates that a player appears in the video, and the value "Player B", the player's name, which is the detailed information about that player.

Furthermore, the section metadata indicates the relationship between the section and other sections by means of a type (Type) and a target (Target). For example, section metadata M1 takes another section (here, identifier "003") as its target and indicates by its type that the video of that other section is slow-motion replay video ("slow playback") of this section.
Defining the section metadata in this way allows a common section metadata format to be used even when section information is extracted from the content on the basis of different features or when different semantic information is extracted.
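The structure of FIG. 2 maps naturally onto a small record type. The sketch below is one possible in-memory representation under stated assumptions: the class and field names (`SectionMetadata`, `Semantic`, `Relation`, `in_time`, `out_time`) and the helper `related_ids` are hypothetical, and only the values given for M1 in the description are used.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Semantic:
    """Meaning/content of a section as a Key/Value pair."""
    key: str
    value: str


@dataclass
class Relation:
    """Section relationship: how another section (target) relates to this one."""
    type: str
    target: str  # identifier (ID) of the other section metadata


@dataclass
class SectionMetadata:
    """Identification information, section information, and semantic information."""
    id: str
    in_time: str   # start time (In), "hh:mm:ss-ff"
    out_time: str  # end time (Out), "hh:mm:ss-ff"
    semantics: List[Semantic] = field(default_factory=list)
    relations: List[Relation] = field(default_factory=list)


def related_ids(meta: SectionMetadata, relation_type: str) -> List[str]:
    """Return the target IDs that have the given relationship type."""
    return [r.target for r in meta.relations if r.type == relation_type]


# Section metadata M1 as described for FIG. 2(b).
m1 = SectionMetadata(
    id="001",
    in_time="09:00:32-00",
    out_time="09:00:54-00",
    semantics=[Semantic(key="event", value="shot")],
    relations=[Relation(type="slow playback", target="003")],
)
```

Keeping `semantics` and `relations` as lists reflects the statement that several pieces of semantic information may be attached to the same section metadata.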
Hereinafter, referring back to FIG. 1, the detailed configuration of the metadata generation apparatus will be described.

[Configuration of metadata generation apparatus]
As shown in FIG. 1, the metadata generation apparatus 1 includes a section information extraction module selection unit 10, a section information acquisition processing unit 11, a semantic information extraction module selection unit 12, a semantic information acquisition processing unit 13, a section metadata generation control unit 14, a metadata storage unit 15, a selected video playback unit 16, a section metadata editing unit 17, a user instruction receiving unit 18, and a section metadata input/output unit 19.

The section information extraction module selection unit (event selection unit) 10 selects, from the module storage device 2, the section information extraction module SM corresponding to a selected event when the user designates, via the user instruction receiving unit 18 described later, one of a plurality of events as the selected event (for example, the cheering of spectators in a soccer broadcast). The section information extraction module selection unit 10 also acquires from the plurality of section information extraction modules SM the character strings describing their individual functions and presents them to the user via the user instruction receiving unit 18, giving the user information that helps in selecting a section information extraction module SM.

The section information extraction module selection unit 10 then loads the selected section information extraction module SM into a memory or the like (not shown) and runs it, thereby causing the module to function as the section information extraction unit 10a. This section information extraction unit (section information extraction module) 10a operates on instructions from the section information acquisition processing unit 11 described later.

Here, the section information extraction unit 10a represents a single module selected from the plurality of section information extraction modules SM. Alternatively, a plurality of section information extraction modules SM may be built into the metadata generation apparatus 1 in advance as a plurality of section information extraction units 10a. However, when section information extraction modules are provided as general-purpose modules, the form of this embodiment, in which the user selects the required module and it operates by being incorporated into the metadata generation apparatus 1, is preferable because it is more versatile.

The section information acquisition processing unit 11 runs the section information extraction unit (section information extraction module) 10a selected by the section information extraction module selection unit 10, acquires from it the section information of the selected event in the content, and generates section metadata (first section metadata). Here, the section information acquisition processing unit 11 includes a parameter setting unit 11a and a section information acquisition unit 11b.

The parameter setting unit 11a acquires from the section information extraction unit 10a the parameter items required to extract section information from the content, and sets the parameter values entered by the user in the section information extraction unit 10a. The parameter setting unit 11a presents the parameter items acquired from the section information extraction unit 10a to the user via the user instruction receiving unit 18. It then acquires the parameter values entered by the user from the user instruction receiving unit 18 and sets them in the section information extraction unit 10a. A parameter required to extract section information is, for example, the file name of the audio file that indicates which audio (A1 etc.) the section information is to be extracted from.
Because the required parameter items are acquired from the section information extraction unit 10a and presented to the user in this way, the operation interface presented to the user can be unified even when the section information extraction unit 10a is a different section information extraction module SM.
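The idea that each module reports its own parameter items, so that one generic input step serves every module, can be sketched as a small plug-in interface. This is an illustrative sketch only: the class and method names (`SectionExtractionModule`, `parameter_items`, `set_parameters`, `extract`, `run_module`) are hypothetical, the parameter item names follow the FIG. 4 example in the description, the file names are made up, and the extraction result is a fixed placeholder rather than real audio analysis.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, List, Tuple

Section = Tuple[str, str]  # (start time In, end time Out)


class SectionExtractionModule(ABC):
    """Hypothetical common interface that every extraction module exposes."""

    description: str  # character string describing the module's function

    @abstractmethod
    def parameter_items(self) -> List[str]:
        """Names of the parameters this module needs."""

    @abstractmethod
    def set_parameters(self, values: Dict[str, str]) -> None:
        """Receive the values the user entered for those parameters."""

    @abstractmethod
    def extract(self) -> List[Section]:
        """Run the extraction and return the detected sections."""


class CheerSegmenter(SectionExtractionModule):
    description = "Cheer segmentation module"

    def __init__(self) -> None:
        self._values: Dict[str, str] = {}

    def parameter_items(self) -> List[str]:
        return ["Analysis target file name",
                "Maximum entropy method model file name"]

    def set_parameters(self, values: Dict[str, str]) -> None:
        missing = [k for k in self.parameter_items() if k not in values]
        if missing:
            raise ValueError(f"missing parameters: {missing}")
        self._values = dict(values)

    def extract(self) -> List[Section]:
        # Placeholder result; a real module would analyze the audio file.
        return [("09:00:32-00", "09:00:54-00")]


def run_module(module: SectionExtractionModule,
               ask: Callable[[str], str]) -> List[Section]:
    """Generic host-side flow: query items, collect values, set, extract."""
    values = {item: ask(item) for item in module.parameter_items()}
    module.set_parameters(values)
    return module.extract()


# `ask` stands in for the user input step; the file names are hypothetical.
answers = {"Analysis target file name": "A1.wav",
           "Maximum entropy method model file name": "cheer.model"}
sections = run_module(CheerSegmenter(), answers.__getitem__)
```

The host function `run_module` never names a specific parameter, which is what lets the same input screen serve modules with different requirements.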

The section information acquisition unit 11b instructs the section information extraction unit 10a to extract section information, causes it to perform the extraction, and acquires the resulting section information. The acquired section information is written to the metadata storage unit 15 as section metadata (M1, M2, ...). At this stage the section information acquisition unit 11b writes to the section metadata (M1, M2, ...) only the identifier ID, which is the identification information shown in FIG. 2, and the start time In and end time Out, which are the section information. The first section metadata recited in the claims is metadata in which only section information is described in association with identification information.

The semantic information extraction module selection unit (type selection unit) 12 selects, from the module storage device 2, the semantic information extraction module LM corresponding to a selected type when the user designates, via the user instruction receiving unit 18 described later, one of a plurality of types of analysis method for analyzing semantic information as the selected type (for example, face recognition analysis or announcer comment analysis). The semantic information extraction module selection unit 12 also acquires from the plurality of semantic information extraction modules LM the character strings describing their individual functions and presents them to the user via the user instruction receiving unit 18, giving the user information that helps in selecting a semantic information extraction module LM.

The semantic information extraction module selection unit 12 then loads the selected semantic information extraction module LM into a memory or the like (not shown) and runs it, thereby causing the module to function as the semantic information extraction unit 12a. This semantic information extraction unit (semantic information extraction module) 12a operates on instructions from the semantic information acquisition processing unit 13 described later.

Here, the semantic information extraction unit 12a represents a single module selected from the plurality of semantic information extraction modules LM. Alternatively, a plurality of semantic information extraction modules LM may be built into the metadata generation apparatus 1 in advance as a plurality of semantic information extraction units 12a. However, when semantic information extraction modules are provided as general-purpose modules, the form of this embodiment, in which the user selects the required module and it operates by being incorporated into the metadata generation apparatus 1, is preferable because it is more versatile.

The semantic information acquisition processing unit 13 runs the semantic information extraction unit (semantic information extraction module) 12a selected by the semantic information extraction module selection unit 12, acquires from it the semantic information extracted by the selected analysis method, and generates section metadata (second section metadata). Here, the semantic information acquisition processing unit 13 includes a parameter setting unit 13a and a semantic information acquisition unit 13b.

The parameter setting unit 13a acquires from the semantic information extraction unit 12a the parameter items required to extract semantic information from the content, and sets the parameter values entered by the user in the semantic information extraction unit 12a. The parameter setting unit 13a presents the parameter items acquired from the semantic information extraction unit 12a to the user via the user instruction receiving unit 18. It then acquires the parameter values entered by the user from the user instruction receiving unit 18 and sets them in the semantic information extraction unit 12a. Parameters required to extract semantic information are, for example, the file name of the video file that indicates which video (V1 etc.) the semantic information is to be extracted from, and the file name of a text file that associates facial features with player names for use in face recognition.

Because the required parameter items are acquired from the semantic information extraction unit 12a and presented to the user in this way, a uniform parameter input screen can be displayed and the operation interface presented to the user can be unified even when the semantic information extraction unit 12a is a different semantic information extraction module LM.
The section information (start time, end time) of the section metadata (M1, M2, ...) stored in the metadata storage unit 15 is set in the semantic information extraction unit 12a as a parameter without modification.

The semantic information acquisition unit 13b instructs the semantic information extraction unit 12a to extract semantic information, causes it to perform the extraction, and acquires the resulting semantic information. The acquired semantic information is associated with the section information (start time, end time) of the section metadata (M1, M2, ...) stored in the metadata storage unit 15 and is additionally written to that section metadata. As a result, the section metadata (M1, M2, ...) contains identification information, section information, and semantic information. The second section metadata recited in the claims is metadata in which section information and semantic information are described in association with identification information.
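The two-stage flow described so far (first section metadata holding only an ID and In/Out times, then second section metadata with semantic information added) can be sketched as two functions. The sketch rests on several assumptions: the function names are hypothetical, sequential three-digit IDs are assumed from the "001", "002" examples, and `fake_face_labeler` is a stand-in that returns a fixed label instead of performing face recognition.

```python
from typing import Any, Callable, Dict, List, Tuple

Section = Tuple[str, str]            # (In, Out)
Label = Tuple[str, str]              # (Key, Value)
Metadata = Dict[str, Any]


def build_first_metadata(sections: List[Section]) -> List[Metadata]:
    """Stage 1: write only identification and section information."""
    return [
        {"id": f"{number:03d}", "in": start, "out": end}
        for number, (start, end) in enumerate(sections, start=1)
    ]


def add_semantics(first: List[Metadata],
                  labeler: Callable[[str, str], List[Label]]) -> List[Metadata]:
    """Stage 2: pass each section's In/Out to the labeler and attach the result."""
    second = []
    for meta in first:
        labeled = dict(meta)  # leave the first section metadata unchanged
        labeled["semantics"] = labeler(meta["in"], meta["out"])
        second.append(labeled)
    return second


def fake_face_labeler(start: str, end: str) -> List[Label]:
    """Stand-in for a semantic extraction module; returns a fixed label."""
    return [("player", "Player B")]


first = build_first_metadata([("09:00:32-00", "09:00:54-00")])
second = add_semantics(first, fake_face_labeler)
```

Passing the In and Out times straight through to the labeler mirrors the statement that the stored section information is set in the semantic information extraction unit as a parameter without modification.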

The section metadata generation control unit 14 is the main control unit that controls the whole of the metadata generation apparatus 1. It decides what operation to perform on the basis of the control information notified by each unit and issues instructions to the other units. For example, when the section information extraction module selection unit 10 notifies the section metadata generation control unit 14 of an instruction to present to the user the character strings describing the functions of the section information extraction modules SM, the section metadata generation control unit 14 instructs the user instruction receiving unit 18 to present, on a display device (not shown), a selection screen for selecting a section information extraction module SM.

The metadata storage unit 15 stores section metadata and is a storage device such as a semiconductor memory or a hard disk. The section metadata (M1, M2, ...) stored in the metadata storage unit 15 is generated when the section information acquisition processing unit 11 and the semantic information acquisition processing unit 13 write section information and semantic information to it.
The format of the section metadata stored here is not particularly limited; it is, for example, an XML (Extensible Markup Language) format such as MPEG-7.
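As an illustration of storing section metadata as XML, the sketch below serializes one record with Python's standard `xml.etree.ElementTree`. The element and attribute names (`SectionMetadata`, `Section`, `Semantic`, `Relation`) are invented for this example and are not the MPEG-7 schema; the description only names MPEG-7 as one possible format.

```python
import xml.etree.ElementTree as ET
from typing import Any, Dict


def to_xml(meta: Dict[str, Any]) -> str:
    """Serialize one section metadata record to an XML string.

    The element names are hypothetical and do not follow MPEG-7.
    """
    root = ET.Element("SectionMetadata", id=meta["id"])
    # "in" is a Python keyword, so the attributes are passed as a dict.
    ET.SubElement(root, "Section", {"in": meta["in"], "out": meta["out"]})
    for key, value in meta.get("semantics", []):
        ET.SubElement(root, "Semantic", key=key).text = value
    for relation_type, target in meta.get("relations", []):
        ET.SubElement(root, "Relation", type=relation_type, target=target)
    return ET.tostring(root, encoding="unicode")


# Section metadata M1 from FIG. 2(b).
m1 = {
    "id": "001",
    "in": "09:00:32-00",
    "out": "09:00:54-00",
    "semantics": [("event", "shot")],
    "relations": [("slow playback", "003")],
}
xml_text = to_xml(m1)
parsed = ET.fromstring(xml_text)  # round trip to confirm it is well-formed
```

A record that so far holds only section information serializes the same way with the `Semantic` and `Relation` elements simply absent, which fits the two-stage writing described above.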

The selected video playback unit 16 reads from the content storage device 3, and plays back, the video of the content corresponding to the section metadata that the user has selected from the section metadata (M1, M2, ...) via the user instruction receiving unit 18. For example, when the video (V1 etc.) stored in the content storage device 3 is MPEG-encoded, the selected video playback unit 16 functions as an MPEG decoder and plays back the video, displaying it on a display device (not shown). This allows the user to check whether the section of the selected section metadata has been segmented correctly.

The section metadata editing unit 17 presents the section metadata (M1, M2, ...) to the user via the user instruction receiving unit 18 and corrects that section metadata. For example, the section metadata editing unit 17 presents the contents of the section metadata (section information and semantic information) on the display screen via the user instruction receiving unit 18, and when the user corrects those contents using a keyboard or the like (not shown), it updates the contents of the metadata storage unit 15 on the basis of the correction.

The user instruction receiving unit 18 presents a screen for receiving instructions from the user on a display device (not shown) and receives the user's instructions via an input device such as a mouse or keyboard (not shown). The content of each instruction is notified to the section metadata generation control unit 14 as it is received, and an operation in accordance with the instruction is executed. Examples of the screens for receiving instructions from the user are described later.

The section metadata input/output unit 19 is an interface for inputting and outputting section metadata. It is, for example, a disk drive device that writes section metadata to and reads section metadata from a recording medium such as a DVD-RAM. Alternatively, it may be a communication board that transmits and receives section metadata over a communication line.
Because the section metadata (M1, M2, ...) has a common data format, section metadata generated externally can also be input and put to use. For example, section metadata to which another metadata generation apparatus has written only section information can be input, and the metadata generation apparatus 1 can add semantic information to that section metadata.

The configuration of the metadata generation apparatus 1 has been described above, but the present invention is not limited to this configuration. For example, the metadata generation apparatus 1 can be realized by having a general-purpose computer execute a program that operates the arithmetic unit and storage device in the computer. This program (the metadata generation program) can be distributed over a communication line or distributed on a recording medium such as a CD-ROM.

[Screen examples of metadata generation apparatus]
Next, examples of the screens displayed by the metadata generation apparatus 1 will be described.
(Section information extraction screens)
First, with reference to FIGS. 3 and 4 (and FIG. 1 as appropriate), examples of the screens for selecting a section information extraction module SM and extracting section information from the content will be described. FIG. 3 shows an example of a selection screen for selecting a section information extraction module. FIG. 4 shows an example of a parameter input screen for entering the parameters to be passed to the section information extraction module.

As shown in FIG. 3, the selection screen W1 for selecting a section information extraction module SM displays a list of the character strings describing the functions of the section information extraction modules SM and accepts a selection from the user.
In the metadata generation apparatus 1, when the section information extraction module selection unit 10 is to select a section information extraction module SM, it displays the selection screen W1 on the screen via the user instruction receiving unit 18. When the list button B on the selection screen W1 is clicked with the mouse, the screen lists the character strings, acquired by the section information extraction module selection unit 10 from the plurality of section information extraction modules SM, that describe the functions of the individual modules (SM1, SM2, SM3, ...). Here, the character string "Cheer segmentation module (SL1)" has been acquired from section information extraction module SM1 and "Video cut point segmentation module (SL2)" from section information extraction module SM2, and the strings SL1 and SL2 are shown in the list. This allows the user to choose which function is used to extract the section information.
When a section information extraction module SM is selected on the selection screen W1, the parameter input screen W2 of FIG. 4 is displayed.

As shown in FIG. 4, the input screen W2 for entering the values of the parameters required to extract section information provides input fields AR1 and AR2 for the parameters P1 and P2 and accepts input from the user.
In the metadata generation apparatus 1, the parameter setting unit 11a of the section information acquisition processing unit 11 acquires from the section information extraction unit (section information extraction module) 10a the parameters required to extract section information and displays the input screen W2 on the screen via the user instruction receiving unit 18. The input screen W2 shown here is in the state in which "Analysis target file name (P1)" and "Maximum entropy method model file name (P2)" have been notified as parameters by the section information extraction unit 10a. The user enters the value (file name) of each parameter in its input field AR1 or AR2 and clicks the execute button EX with the mouse, whereupon the extraction of section information is executed.

(Semantic information extraction screens)
Next, with reference to FIGS. 5 and 6 (and FIG. 1 as appropriate), examples of the screens for selecting a semantic information extraction module LM and extracting semantic information from the content will be described. FIG. 5 shows an example of a selection screen for selecting a semantic information extraction module. FIG. 6 shows an example of a parameter input screen for entering the parameters to be passed to the semantic information extraction module.

As shown in FIG. 5, the selection screen W3 for selecting a semantic information extraction module LM displays a list of the character strings describing the functions of the semantic information extraction modules LM and accepts a selection from the user.
In the metadata generation apparatus 1, when the semantic information extraction module selection unit 12 is to select a semantic information extraction module LM, it displays the selection screen W3 on the screen via the user instruction receiving unit 18. When the list button B on the selection screen W3 is clicked with the mouse, the screen lists the character strings, acquired by the semantic information extraction module selection unit 12 from the plurality of semantic information extraction modules LM, that describe the functions of the individual modules (LM1, LM2, LM3, ...). Here, the character string "Face recognition labeling module (LL1)" has been acquired from semantic information extraction module LM1 and "Comment analysis labeling module (LL2)" from semantic information extraction module LM2, and the strings LL1 and LL2 are shown in the list. This allows the user to choose which function is used to extract the semantic information.
When a semantic information extraction module LM is selected on the selection screen W3, the parameter input screen W4 of FIG. 6 is displayed.

As shown in FIG. 6, the input screen W4 for entering the values of the parameters required to extract semantic information provides input fields AR3 and AR4 for the parameters P3 and P4 and accepts input from the user.
In the metadata generation apparatus 1, the parameter setting unit 13a of the semantic information acquisition processing unit 13 acquires from the semantic information extraction unit (semantic information extraction module) 12a the parameters required to extract semantic information and displays the input screen W4 on the screen via the user instruction receiving unit 18. The input screen W4 shown here is in the state in which "Analysis target file name (P3)" and "Player name file name (P4)" have been notified as parameters by the semantic information extraction unit 12a. The user enters the value (file name) of each parameter in its input field AR3 or AR4 and clicks the execute button EX with the mouse, whereupon the extraction of semantic information is executed.

(Section metadata confirmation / editing screen)
Next, an example of a screen for confirming and editing the generated section metadata will be described with reference to FIG. 7 (and to FIG. 1 as appropriate). FIG. 7 shows an example of a confirmation / editing screen for confirming and editing section metadata.

As shown in FIG. 7, the confirmation / editing screen W5 consists mainly of a video playback area Va, a section information presentation area SI, a semantic information (meaning / content) presentation area LI1, and a semantic information (section relation) presentation area LI2. In this example, the screen additionally contains an information presentation area Sa, which presents information from the section information extraction module SM, and an information presentation area La, which presents information from the semantic information extraction module LM.
Because section metadata is uniquely determined by its section information, section metadata is selected here by selecting the section information displayed in the section information presentation area SI.

The video playback area Va is the area in which the video of the time section indicated by the section information (start time In and end time Out) of the selected section metadata is played back and displayed. In the metadata generation apparatus 1, the user instruction receiving means 18 detects that section metadata has been selected and notifies the selected video playback means 16, which then plays back the video of the corresponding time section.

The section information presentation area SI is the area that displays the identification information (identifier ID) and the section information (start time In, end time Out) extracted from the section metadata. Here, the frame image at the start time In (Fa1 etc.) is displayed alongside them. The user selects section metadata by using the mouse to select the desired section information Se in the section information presentation area SI.

The semantic information (meaning / content) presentation area LI1 is the area that displays the keys and values that make up the semantic information (meaning / content) extracted from the section metadata. When section metadata is selected in the section information presentation area SI, this area displays the semantic information (meaning / content) of that section metadata.

The semantic information (section relation) presentation area LI2 is the area that displays the types and targets that make up the semantic information (section relation) extracted from the section metadata. When section metadata is selected in the section information presentation area SI, this area displays the semantic information (section relation) of that section metadata.
The section information presentation area SI, the semantic information (meaning / content) presentation area LI1, and the semantic information (section relation) presentation area LI2 are displayed by the section metadata editing means 17. To edit, the user enters data from a keyboard or the like to change the section information and the semantic information.

The information presentation area Sa is the area that presents detailed information when such information can be acquired from the section information extraction module SM. If the extraction method, version, and so on of the section information extraction module SM can be acquired from that module as a character string, this area presents that character string. Alternatively, it may present section information other than that shown in the section information presentation area SI, such as the average color, maximum volume, or minimum volume.

The information presentation area La is the area that presents detailed information when such information can be acquired from the semantic information extraction module LM. If the extraction method, version, and so on of the semantic information extraction module LM can be acquired from that module as a character string, this area presents that character string. Alternatively, it may present semantic information other than that shown in the semantic information (meaning / content) presentation area LI1 and the semantic information (section relation) presentation area LI2, such as the full text of the announcer's comments in the section.
Visualizing the section metadata in this way makes its contents easy to confirm and edit.

[Operation of metadata generation apparatus]
Next, the operation of the metadata generation apparatus according to the present invention will be described with reference to FIGS. 8 to 10 (and to FIG. 1 as appropriate for the configuration). FIG. 8 is a flowchart showing the overall operation of the metadata generation apparatus. FIG. 9 is a flowchart showing its section information extraction operation. FIG. 10 is a flowchart showing its semantic information extraction operation.

(Overall operation)
First, the overall operation of the metadata generation apparatus 1 will be described with reference to FIG. 8. FIG. 8 also illustrates the contents of the section metadata generated in each operation.
The metadata generation apparatus 1 first extracts the start time and end time, which constitute the section information, from the content (video (V1 etc.), audio (A1 etc.), text information (T1 etc.)) stored in the content storage device 3 (step S10).
In step S10, for example, extracting section information from the audio (A1 etc.) by segmenting on cheers generates section metadata (M1₁, M2₁, M3₁) containing a start time and an end time, as shown in FIG. 8(a). The identification information of each piece of section metadata is omitted from the figure. The detailed operation of step S10 will be described later with reference to FIG. 9.

Next, the metadata generation apparatus 1 extracts, from the content (video (V1 etc.), audio (A1 etc.), text information (T1 etc.)) stored in the content storage device 3, the semantic information corresponding to the start time and end time in the section metadata (M1₁, M2₁, M3₁) (step S20).
In step S20, for example, extracting semantic information from the video (V1 etc.) by labeling based on face recognition generates section metadata (M1₂, M2₂, M3₂) in which a key (player) and a value (player A etc.) have been added to the section metadata (M1₁, M2₁, M3₁) of FIG. 8(a), as shown in FIG. 8(b).
Also in step S20, extracting semantic information from the text information (T1 etc.) by labeling based on the announcer's comments generates section metadata (M1₃, M2₃, M3₃) in which a key (event) and a value (shot etc.) have been added to the section metadata (M1₂, M2₂, M3₂) of FIG. 8(b), as shown in FIG. 8(c).

The metadata generation apparatus 1 then outputs the section metadata (M1₃, M2₃, M3₃) generated in step S20 (step S30).
In this way, the metadata generation apparatus 1 generates and outputs section metadata in two steps: extracting section information from the content (step S10) and extracting semantic information (step S20).
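The staged build-up of FIG. 8 can be sketched roughly as follows. This is only an illustration: the `SectionMetadata` class, the helper names, the time values, and the lookup tables are all hypothetical and do not appear in the specification; only the keys "player" and "event" and the values "player A" and "shot" mirror the example in the text.

```python
from dataclasses import dataclass, field


@dataclass
class SectionMetadata:
    """One section: a time interval plus key/value semantic labels (hypothetical layout)."""
    start: float  # start time; seconds are an assumption
    end: float    # end time
    labels: dict = field(default_factory=dict)


def extract_sections(intervals):
    # Step S10: one record per detected interval, with no labels yet (FIG. 8(a)).
    return [SectionMetadata(start, end) for start, end in intervals]


def add_labels(sections, key, lookup):
    # Step S20: add one key/value pair per section; `lookup` stands in for a real recogniser.
    for section in sections:
        value = lookup(section.start, section.end)
        if value is not None:
            section.labels[key] = value
    return sections


# Hypothetical detection results keyed by section start time.
cheers = [(10.0, 25.0), (40.0, 55.0), (90.0, 120.0)]
faces = {10.0: "player A", 40.0: "player B", 90.0: "player A"}
events = {10.0: "shot", 40.0: "pass", 90.0: "goal"}

sections = extract_sections(cheers)                               # FIG. 8(a)
add_labels(sections, "player", lambda start, end: faces.get(start))   # FIG. 8(b)
add_labels(sections, "event", lambda start, end: events.get(start))   # FIG. 8(c)

for section in sections:                                          # step S30: output
    print(section)
```

Each labeling pass only adds keys to records that already exist, which reflects the point that the time sections are fixed in step S10 and later passes enrich them.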

(Section information extraction operation)
Next, the section information extraction operation of the metadata generation apparatus 1 (step S10 in FIG. 8) will be described in detail with reference to FIG. 9.
First, the metadata generation apparatus 1 uses the section information extraction module selection means 10 to acquire, from the section information extraction modules SM, the character strings indicating their individual functions (step S100). Here, each section information extraction module SM returns its character string through the function IsSegmentModule().

Having acquired these character strings, the section information extraction module selection means 10 presents, via the user instruction receiving means 18, a selection screen (see FIG. 3) for selecting a section information extraction module SM (step S101). The user instruction receiving means 18 accepts the user's selection of a section information extraction module SM on the selection screen and notifies the section information extraction module selection means 10 of the selected module.

The metadata generation apparatus 1 then uses the section information extraction module selection means 10 to incorporate the section information extraction module SM reported by the user instruction receiving means 18 as the section information extraction means 10a. The section information extraction module SM that will actually extract the section information has thereby been selected (step S102).

Next, the metadata generation apparatus 1 uses the parameter setting means 11a of the section information acquisition processing means 11 to acquire the parameters (parameter items) needed to extract section information from the section information extraction means 10a (here, the section information extraction module SM1) (step S103). Here, the section information extraction module SM returns the parameter items through the function GetRequiredData().

Having acquired the parameters, the parameter setting means 11a presents, via the user instruction receiving means 18, an input screen (see FIG. 4) for entering the parameter values (step S104). The user instruction receiving means 18 accepts the parameter values entered by the user on the input screen and notifies the parameter setting means 11a of them.

The parameter setting means 11a then notifies the section information extraction means 10a (here, the section information extraction module SM1) of the parameter values, that is, sets them (step S105). Here, the section information extraction module SM sets the parameters through the function SetRequiredData().

The section information acquisition means 11b then instructs the section information extraction means 10a to extract section information, and the extraction is carried out (step S106). Here, the section information extraction module SM returns the extraction result through the function GetSegment(). The extracted section information is presented on the confirmation / editing screen (see FIG. 7) by the section metadata editing means 17.
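As a rough illustration of the call sequence in steps S100 to S106, the sketch below implements a toy module exposing the four function names given in the text. The specification names the functions but not their arguments or return types, so the signatures, the class name `CheerSegmentModule`, the parameter names `levels` and `threshold`, and the threshold-based detector are all assumptions.

```python
class CheerSegmentModule:
    """Hypothetical stand-in for a section information extraction module such as SM1."""

    def __init__(self):
        self._params = {}

    def IsSegmentModule(self):
        # Character string shown in the selection list (step S100).
        return "Cheer segmentation module"

    def GetRequiredData(self):
        # Parameter items the caller must supply (step S103); names are made up.
        return ["levels", "threshold"]

    def SetRequiredData(self, values):
        # Store the parameter values (step S105), rejecting incomplete input.
        missing = [name for name in self.GetRequiredData() if name not in values]
        if missing:
            raise ValueError(f"missing parameters: {missing}")
        self._params = dict(values)

    def GetSegment(self):
        # Toy detector (step S106): each maximal run of samples at or above the
        # threshold becomes one (start_index, end_index) pair, end exclusive.
        levels = self._params["levels"]
        threshold = self._params["threshold"]
        segments, start = [], None
        for index, level in enumerate(levels):
            if level >= threshold and start is None:
                start = index
            elif level < threshold and start is not None:
                segments.append((start, index))
                start = None
        if start is not None:
            segments.append((start, len(levels)))
        return segments


module = CheerSegmentModule()
name = module.IsSegmentModule()                      # S100: text for the selection screen
required = module.GetRequiredData()                  # S103: items for the input screen
module.SetRequiredData({"levels": [0, 5, 6, 1, 7, 8, 9, 0], "threshold": 5})  # S105
segments = module.GetSegment()                       # S106
print(name, required, segments)
```

Because the caller discovers the parameter items at run time through GetRequiredData(), the host needs no module-specific knowledge to build the input screen, which appears to be the motivation for this interface.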

At this stage, the user checks the contents of the section information. That is, the metadata generation apparatus 1 uses the user instruction receiving means 18 to determine whether the user has entered an instruction to redo the extraction of section information (step S107), to edit the section information (step S108), or to execute another section information extraction module SM (step S109).

If the user instructs the metadata generation apparatus 1 to redo the extraction of section information (Yes in step S107), it returns to step S103, accepts the parameter values again, and extracts the section information once more.
If the user instructs it to edit the section metadata (Yes in step S108), the section metadata editing means 17 edits the section metadata (step S110). After this editing, the process returns to step S107.

Furthermore, if the user instructs the metadata generation apparatus 1 to execute another section information extraction module SM (Yes in step S109), it returns to step S100 and repeats the operation from the selection of a section information extraction module SM.
The metadata generation apparatus 1 ends the section information extraction operation once the user has confirmed the contents of the section information (No in steps S107, S108, and S109).
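The branching in steps S107 to S110 amounts to a small decision function. The sketch below is one possible reading; it assumes the three questions are checked in step-number order, and the function names and the scripted session are hypothetical.

```python
def next_step(redo: bool, edit: bool, other_module: bool) -> str:
    """Return the step that follows the user's answers at S107 to S109."""
    if redo:            # S107 Yes: re-enter parameters and extract again
        return "S103"
    if edit:            # S108 Yes: edit the section metadata, then back to S107
        return "S110"
    if other_module:    # S109 Yes: choose another extraction module
        return "S100"
    return "END"        # all three answered No


def trace(answers):
    """Record the steps visited for a scripted sequence of user answers."""
    steps = []
    for redo, edit, other_module in answers:
        step = next_step(redo, edit, other_module)
        steps.append(step)
        if step == "END":
            break
    return steps


# Hypothetical session: edit once, redo the extraction once, then accept.
visited = trace([(False, True, False), (True, False, False), (False, False, False)])
print(visited)
```

The loop ends only when all three answers are No, which matches the termination condition stated in the text.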

(Semantic information extraction operation)
Next, the semantic information extraction operation of the metadata generation apparatus 1 (step S20 in FIG. 8) will be described in detail with reference to FIG. 10.
First, the metadata generation apparatus 1 uses the semantic information extraction module selection means 12 to acquire, from the semantic information extraction modules LM, the character strings indicating their individual functions (step S200). Here, each semantic information extraction module LM returns its character string through the function IsLabelModule().

Having acquired these character strings, the semantic information extraction module selection means 12 presents, via the user instruction receiving means 18, a selection screen (see FIG. 5) for selecting a semantic information extraction module LM (step S201). The user instruction receiving means 18 accepts the user's selection of a semantic information extraction module LM on the selection screen and notifies the semantic information extraction module selection means 12 of the selected module.

The metadata generation apparatus 1 then uses the semantic information extraction module selection means 12 to incorporate the semantic information extraction module LM reported by the user instruction receiving means 18 as the semantic information extraction means 12a. The semantic information extraction module LM that will actually extract the semantic information has thereby been selected (step S202).

Next, the metadata generation apparatus 1 uses the parameter setting means 13a of the semantic information acquisition processing means 13 to acquire the parameters (parameter items) needed to extract semantic information from the semantic information extraction means 12a (here, the semantic information extraction module LM1) (step S203). Here, the semantic information extraction module LM returns the parameter items through the function GetRequiredData().
Having acquired the parameters, the parameter setting means 13a presents, via the user instruction receiving means 18, an input screen (see FIG. 6) for entering the parameter values (step S204). The user instruction receiving means 18 accepts the parameter values entered by the user on the input screen and notifies the parameter setting means 13a of them.

The parameter setting means 13a then notifies the semantic information extraction means 12a (here, the semantic information extraction module LM1) of the parameter values, that is, sets them (step S205). Here, the semantic information extraction module LM sets the parameters through the function SetRequiredData().

The semantic information acquisition means 13b then instructs the semantic information extraction means 12a to extract semantic information, and the extraction is carried out (step S206). Here, the semantic information extraction module LM returns the extraction result for the semantic information (meaning / content) through the function GetLabel() and the extraction result for the semantic information (section relation) through the function GetRelation(). The extracted semantic information is presented on the confirmation / editing screen (see FIG. 7) by the section metadata editing means 17.
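To illustrate the two result functions, the sketch below shows a toy semantic module in which GetLabel() yields key/value pairs and GetRelation() yields type/target pairs, the two shapes the text describes. The argument lists, the class name `CommentLabelModule`, the in-memory `comments` parameter (the text passes file names instead), and the relation type "follows" are all assumptions introduced for the example.

```python
class CommentLabelModule:
    """Hypothetical stand-in for a semantic information extraction module."""

    def __init__(self):
        self._comments = []

    def IsLabelModule(self):
        # Character string shown in the selection list (step S200).
        return "Comment analysis labeling module"

    def GetRequiredData(self):
        # Parameter items (step S203); a single made-up item for this sketch.
        return ["comments"]

    def SetRequiredData(self, values):
        # Store timestamped comments as (time, text) pairs (step S205).
        self._comments = list(values["comments"])

    def GetLabel(self, start, end):
        # Meaning / content: (key, value) pairs for comments inside [start, end).
        return [("event", text) for time, text in self._comments if start <= time < end]

    def GetRelation(self, previous_id):
        # Section relation: (type, target) pairs; "follows" is an invented type.
        return [] if previous_id is None else [("follows", previous_id)]


module = CommentLabelModule()
module.SetRequiredData({"comments": [(12.0, "shot"), (47.5, "goal")]})
labels = module.GetLabel(10.0, 25.0)       # labels for the section from 10.0 to 25.0
relations = module.GetRelation("M1")       # relation of this section to section M1
print(labels, relations)
```

Restricting GetLabel() to the section's time interval reflects the statement that step S20 extracts the semantic information corresponding to each section's start and end times.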

At this stage, the user checks the contents of the semantic information. That is, the metadata generation apparatus 1 uses the user instruction receiving means 18 to determine whether the user has entered an instruction to redo the extraction of semantic information (step S207), to edit the semantic information (step S208), or to execute another semantic information extraction module LM (step S209).
If the user instructs the metadata generation apparatus 1 to redo the extraction of semantic information (Yes in step S207), it returns to step S203, accepts the parameter values again, and extracts the semantic information once more.

If the user instructs the metadata generation apparatus 1 to edit the semantic information (Yes in step S208), the section metadata editing means 17 edits the semantic information (step S210). After this editing, the process returns to step S207.
Furthermore, if the user instructs it to execute another semantic information extraction module LM (Yes in step S209), it returns to step S200 and repeats the operation from the selection of a semantic information extraction module LM.
The metadata generation apparatus 1 ends the semantic information extraction operation once the user has confirmed the contents of the semantic information (No in steps S207, S208, and S209).

The description so far has covered the operation in which the metadata generation apparatus 1 generates section metadata from content stored in advance in the content storage device 3 (offline operation). However, the present invention can also extract section metadata from content input as stream data. The operation in which the metadata generation apparatus 1 generates section metadata from content input as stream data (online operation) will now be described with reference to FIG. 11 (and to FIG. 1 as appropriate).

(Section metadata generation operation by online processing)
FIG. 11 is a flowchart showing the section metadata generation operation of the metadata generation apparatus under online processing.
First, the metadata generation apparatus 1 selects a section information extraction module SM using the section information extraction module selection means 10, and sets the parameters needed to extract section information using the parameter setting means 11a of the section information acquisition processing means 11 (step S40). Step S40 corresponds to steps S100 to S105 in FIG. 9.

The metadata generation apparatus 1 also selects a semantic information extraction module LM using the semantic information extraction module selection means 12, and sets the parameters needed to extract semantic information using the parameter setting means 13a of the semantic information acquisition processing means 13 (step S41). Step S41 corresponds to steps S200 to S205 in FIG. 10.
The metadata generation apparatus 1 then uses the section information acquisition means 11b of the section information acquisition processing means 11 to extract, via the section information extraction means 10a, one piece of section information (one segment) from the input content (step S42).

Next, the metadata generation apparatus 1 uses the semantic information acquisition means 13b of the semantic information acquisition processing means 13 to extract, via the semantic information extraction means 12a, semantic information from the content within the section (within one segment) indicated by the section information extracted in step S42 (step S43). The content data for the section indicated by the section information is temporarily held in storage means (not shown), and the semantic information extraction means 12a extracts the semantic information from the content data for that section.

If the section metadata is to be confirmed and edited section by section (Yes in step S44), the section metadata editing means 17 displays the section metadata extracted and generated in step S43 on the display screen, and confirmation and editing are performed (step S45).
Furthermore, if the section metadata is to be output section by section (Yes in step S46), the section metadata input / output means 19 outputs the section metadata for one segment (step S47).

It is then determined whether input of the content has finished (step S48). If it has not (No in step S48), the process returns to step S42 and the extraction of section information and semantic information continues.
If input of the content has finished (Yes in step S48), it is determined whether to confirm and edit the section metadata for all sections (step S49). If so (Yes in step S49), the section metadata editing means 17 displays a confirmation / editing screen such as that shown in FIG. 7, and confirmation and editing are performed (step S50).
The metadata generation apparatus 1 then outputs the generated section metadata (for multiple segments) using the section metadata input / output means 19 (step S51).
Through the above operation, the metadata generation apparatus 1 can generate and output section metadata from content by online processing.
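The per-segment loop of FIG. 11 could be sketched as below, leaving out the interactive confirmation steps (S44, S45, S49, S50). The stream format of (time, loudness, comment) tuples, the rule that a quiet sample closes a section, and every function name are assumptions made only to keep the example self-contained.

```python
def run_online(stream, extract_section, extract_labels, emit, per_section_output=True):
    """Process a stream one section at a time (steps S42 to S51, without the UI steps)."""
    all_metadata = []
    buffer = []                                  # temporary storage for the current section
    for sample in stream:                        # the loop ends when input ends (S48)
        buffer.append(sample)
        section = extract_section(buffer)        # S42: None until a whole section is seen
        if section is None:
            continue
        record = {"section": section, "labels": extract_labels(buffer)}  # S43
        if per_section_output:                   # S46
            emit([record])                       # S47: output one segment
        all_metadata.append(record)
        buffer = []                              # release the temporary storage
    emit(all_metadata)                           # S51: output all segments
    return all_metadata


def extract_section(buffer):
    # Toy rule: a sample with loudness 0 closes the section that began with the buffer.
    time, loudness, _ = buffer[-1]
    if loudness == 0 and len(buffer) > 1:
        return (buffer[0][0], time)
    return None


def extract_labels(buffer):
    # Toy rule: every non-empty comment in the buffered section becomes a label.
    return [comment for _, _, comment in buffer if comment]


emitted = []
stream = [(0, 3, ""), (1, 9, "shot"), (2, 0, ""), (3, 8, "goal"), (4, 0, "")]
result = run_online(stream, extract_section, extract_labels, emitted.append)
print(result)
```

Buffering only the current section mirrors the description that content data for the section is held temporarily while its semantic information is extracted.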

FIG. 1 is a block diagram showing the configuration of a metadata generation system that includes the metadata generation apparatus according to the present invention.
FIG. 2 is an explanatory diagram of the structure of section metadata, in which (a) shows the structure of section metadata and (b) shows an example of section metadata.
FIG. 3 shows an example of a selection screen for selecting a section information extraction module.
FIG. 4 shows an example of a parameter input screen for entering the parameters to be passed to a section information extraction module.
FIG. 5 shows an example of a selection screen for selecting a semantic information extraction module.
FIG. 6 shows an example of a parameter input screen for entering the parameters to be passed to a semantic information extraction module.
FIG. 7 shows an example of a confirmation / editing screen for confirming and editing section metadata.
FIG. 8 is a flowchart showing the overall operation of the metadata generation apparatus according to the present invention.
FIG. 9 is a flowchart showing the section information extraction operation of the metadata generation apparatus according to the present invention.
FIG. 10 is a flowchart showing the semantic information extraction operation of the metadata generation apparatus according to the present invention.
FIG. 11 is a flowchart showing the section metadata generation operation, by online processing, of the metadata generation apparatus according to the present invention.

Explanation of symbols

1 Metadata generation apparatus
10 Section information extraction module selection means (event selection means)
10a Section information extraction means
11 Section information acquisition processing means
11a Parameter setting means
11b Section information acquisition means
12 Semantic information extraction module selection means (type selection means)
12a Semantic information extraction means
13 Semantic information acquisition processing means
13a Parameter setting means
13b Semantic information acquisition means
14 Section metadata generation control means
15 Metadata storage means
16 Selected video reproduction means
17 Section metadata editing means
18 User instruction reception means
19 Section metadata input/output means
SM Section information extraction module
LM Semantic information extraction module
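
As an informal illustration only, and not part of the disclosed embodiment, the relationship among the section information extraction modules (SM), the semantic information extraction modules (LM), and the two-stage generation of section metadata can be sketched in Python as follows. Every name in the sketch (SectionMetadata, LabelRunModule, CaptionModule, generate_section_metadata) is hypothetical, and the representation of content as per-second audio labels and captions is an assumption made purely to keep the example small.

```python
from dataclasses import dataclass, field
from itertools import groupby


@dataclass
class SectionMetadata:
    start: int                                  # start time of the event section (seconds)
    end: int                                    # end time of the event section (seconds)
    semantics: dict = field(default_factory=dict)


class LabelRunModule:
    """Toy section information extraction module (SM): finds runs of one audio label."""

    def __init__(self, event):
        self.event = event
        self.label = f"{event} sections"        # string a selection screen could list
        self.parameter_items = ["min_length"]   # items a parameter input screen could show

    def extract(self, content, params):
        min_length = params.get("min_length", 1)
        sections, t = [], 0
        for label, run in groupby(content["audio"]):
            n = len(list(run))
            if label == self.event and n >= min_length:
                sections.append((t, t + n))
            t += n
        return sections


class CaptionModule:
    """Toy semantic information extraction module (LM): joins captions inside a section."""

    label = "caption text"
    kind = "caption"

    def extract(self, content, start, end, params):
        return " ".join(c for c in content["captions"][start:end] if c)


def generate_section_metadata(content, section_module, semantic_modules, params=None):
    params = params or {}
    # First section metadata: start and end times only.
    metadata = [SectionMetadata(s, e) for s, e in section_module.extract(content, params)]
    # Second section metadata: each selected semantic module adds one entry per section.
    for module in semantic_modules:
        for m in metadata:
            m.semantics[module.kind] = module.extract(content, m.start, m.end, params)
    return metadata


# Assumed toy content: one audio label and one caption string per second.
content = {
    "audio":    ["speech", "speech", "music", "music", "music", "speech"],
    "captions": ["hello",  "",       "la",    "la",    "",      "bye"],
}
section_modules = {m.event: m for m in (LabelRunModule("music"), LabelRunModule("speech"))}

# Selecting the "music" event picks the matching section module.
result = generate_section_metadata(
    content, section_modules["music"], [CaptionModule()], {"min_length": 2}
)
```

In this sketch, choosing a key of section_modules stands in for the event selection means, and the list of semantic modules passed in stands in for the type selection means. Passing more than one semantic module attaches several pieces of semantic information to the same section.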

Claims

1. A metadata generation apparatus that generates section metadata, which is information for each event section of a given event in content, in which each event section is associated with semantic information indicating what the content contains, the apparatus comprising:
a plurality of section information extraction means, a different one for each event, each detecting its event in the content and extracting at least a start time and an end time of the event as section information;
event selection means for selecting, when one of the plurality of events is designated as a selected event, the section information extraction means corresponding to the selected event;
section information acquisition processing means for acquiring section information of the selected event in the content from the section information extraction means selected by the event selection means, and generating first section metadata;
a plurality of semantic information extraction means, a different one for each type of semantic information, each analyzing the content and extracting the semantic information in association with the start time and the end time in the section information;
type selection means for selecting, when one of the types of semantic information is designated as a selected type, the semantic information extraction means corresponding to the selected type; and
semantic information acquisition processing means for acquiring the semantic information from the semantic information extraction means selected by the type selection means, and generating second section metadata in which the section information and the semantic information are associated with each other.

2. The metadata generation apparatus according to claim 1, wherein each section information extraction means holds in advance a character string indicating its function, and the event selection means acquires the character strings from the section information extraction means and displays them as a list on a screen of the display device.

3. The metadata generation apparatus according to claim 1 or 2, wherein each section information extraction means holds in advance the parameter items it uses to extract the section information, and the section information acquisition processing means acquires the parameter items from the section information extraction means and displays, on the display device, a parameter acquisition screen showing the parameter items.

4. The metadata generation apparatus according to claim 1, wherein each semantic information extraction means holds in advance a character string indicating its function, and the type selection means acquires the character strings from the semantic information extraction means and displays them as a list on a screen of the display device.

5. The metadata generation apparatus according to claim 1, wherein each semantic information extraction means holds in advance the parameter items it uses to extract the semantic information, and the semantic information acquisition processing means acquires the parameter items from the semantic information extraction means and displays, on the display device, a parameter acquisition screen showing the parameter items.

6. The metadata generation apparatus according to claim 1, wherein the semantic information acquisition processing means, each time it acquires semantic information, generates section metadata in which a plurality of pieces of the semantic information are associated with the section information.

7. A metadata generation program for generating section metadata, which is information for each event section in content, on the basis of a plurality of section information extraction modules, a different one for each event, each of which extracts from the content at least a start time and an end time of a given event as section information, and a plurality of semantic information extraction modules, a different one for each type of semantic information, each of which analyzes the content and extracts semantic information indicating what the content contains in association with time, the program causing a computer to function as:
event selection means for selecting, when one of the plurality of events is designated as a selected event, the section information extraction module corresponding to the selected event;
section information acquisition processing means for acquiring section information of the selected event in the content from the section information extraction module selected by the event selection means, and generating first section metadata;
type selection means for selecting, when one of the types of semantic information is designated as a selected type, the semantic information extraction module corresponding to the selected type; and
semantic information acquisition processing means for acquiring the semantic information from the semantic information extraction module selected by the type selection means, and generating second section metadata in which the section information and the semantic information are associated with each other.