JP4580345B2

JP4580345B2 - Content transmission device and content output device

Info

Publication number: JP4580345B2
Application number: JP2006000992A
Authority: JP
Inventors: 欣司松村; 浩行浜田; 武史木村; 謙二郎加井; 幹大上野; 香子有安; 誠沼田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2006-01-06
Filing date: 2006-01-06
Publication date: 2010-11-10
Anticipated expiration: 2026-01-06
Also published as: JP2007184740A

Description

本発明は、コンテンツを制作した制作者の意図を反映させて受信側で出力する当該コンテンツを送信するコンテンツ送信装置および当該コンテンツを出力するコンテンツ出力装置に関する。 The present invention relates to a content transmitting apparatus that transmits the content that is output on the receiving side reflecting the intention of the creator who created the content, and a content output apparatus that outputs the content.

従来、視聴者の個人情報であるユーザプロファイルを参照して、受信したコンテンツを当該視聴者に視聴可能な状態で提示する提示内容選択手法がある。例えば、視聴者が聴覚障害者であった場合、コンテンツの音声の代わりに、この音声を視認できる字幕として提示したり、視聴者が視覚障害者であった場合、コンテンツの映像を解説する解説音声を出力したりする。 2. Description of the Related Art Conventionally, there has been a presentation content selection method in which received content is presented to a viewer in a viewable state with reference to a user profile that is personal information of the viewer. For example, if the viewer is a hearing-impaired person, this audio will be presented as a visible subtitle instead of the content sound, or if the viewer is a visually impaired person, the commentary sound explaining the content video Is output.

或いは、視聴者が健常者であった場合でも、状況によっては、コンテンツの音声を出せない場合、コンテンツの音声が聞き取れない場合またはコンテンツの映像が見られない場合が想定され、そういった場合に、字幕や解説音声といったように、コンテンツの提示を変更する必要が生じる。つまり、様々な障害（人的な障害、環境による障害）等によって、コンテンツの視聴者が当該コンテンツの制作者の意図を十分に理解することができない場合に、代替の手段による伝達方法として、字幕や解説音声があった（例えば、特許文献１参照）。 Or, even if the viewer is a healthy person, depending on the situation, it is assumed that the audio of the content cannot be output, the audio of the content cannot be heard, or the video of the content cannot be seen. It is necessary to change the presentation of content, such as a commentary or commentary voice. In other words, when the content viewer cannot fully understand the intention of the content creator due to various disabilities (human disabilities, environmental disabilities), etc. And commentary audio (for example, see Patent Document 1).

この特許文献１に開示されている伝達方法は、色々な視聴者向けの字幕をコンテンツに多重化し、当該視聴者の好み（状況）に応じて表示可能にしたものであるが、例えば、視聴者の個人情報である視聴者プロファイルに応じて、当該字幕が表示されるものではなく、視聴者が選択しなければならなかった。
特開平１０−２０８４４６号公報 The transmission method disclosed in Patent Document 1 multiplexes subtitles for various viewers into content and enables display according to the viewer's preference (situation). The subtitles are not displayed according to the viewer profile which is personal information of the viewer, and the viewer has to select.
Japanese Patent Laid-Open No. 10-208446

しかしながら、従来の提示内容選択手法では、ユーザプロファイルから導き出される条件と、コンテンツの提示の仕方や提示内容とが１対１に固定して取り扱われていたので、コンテンツ制作者の意図した内容が十分に伝わる状況にも拘わらず、余計な代替の提示が行われたり、その逆に内容が伝わらない状況にも拘わらず、代替の提示が行われなかったりする場合が生じてしまうという問題がある。また、コンテンツを送信する送信側が望む提示の仕方と、コンテンツを受信する受信側が望む提示の仕方とがかみ合わない場合が生じてしまうという問題がある。さらに、従来の提示内容選択手法では、視聴者個人個人の身体的な能力や、状況に対応したコンテンツの多種多様の提示仕方が選択できなかったという問題がある。 However, in the conventional presentation content selection method, the condition derived from the user profile, the content presentation method and the presentation content are handled in a fixed one-to-one manner, so that the content intended by the content creator is sufficient. In spite of the situation transmitted to the user, there is a problem that an extra alternative is presented, or conversely, the alternative presentation is not performed despite the situation where the content is not transmitted. In addition, there is a problem that the presentation method desired by the transmission side that transmits the content and the presentation method desired by the reception side that receives the content may not be engaged. Further, the conventional presentation content selection method has a problem that it is impossible to select various presentation methods of content corresponding to the physical ability of the individual viewer and the situation.

また、コンテンツを送信する送信側において、統一的に記述されたコンテンツのメタデータと、コンテンツを受信する受信側において、統一的に記述されたユーザプロファイルとを元に、個々の視聴者に対応する代替提示を行うための一般的な仕組みが望まれていた。つまり、ユーザプロファイルの構成要素と、コンテンツのメタデータの表現手法とは、今後も新たなものが表れることが想定されるにも拘わらず、従来の提示内容選択手法や、既存の代替提示を行うためのシステムでは対応できないからである。 In addition, on the transmitting side that transmits the content, it corresponds to each viewer on the basis of the metadata of the content that is uniformly described and the user profile that is uniformly described on the receiving side that receives the content. A general mechanism for providing alternative presentations was desired. In other words, although the user profile components and the content metadata expression method are expected to show new things in the future, the conventional presentation content selection method and the existing alternative presentation method are used. This is because the system for this cannot cope.

具体的に述べると、提示内容選択手法や、既存の代替提示を行うためのシステムでは、コンテンツの音声の代替提示として、字幕を提示させることとなっているが、コンテンツのメタデータには、「字幕が音声の代替提示手段である」といった意味的な記述が含まれていない。また、同様に、コンテンツのメタデータには、「解説音声が映像の代替提示手段である」といった意味的な記述が含まれていない。 Specifically, in the presentation content selection method and the existing alternative presentation system, subtitles are presented as an alternative presentation of the audio of the content. Semantic description such as “subtitles are an alternative means for presenting audio” is not included. Similarly, the content metadata does not include a semantic description such as “explanatory audio is an alternative presentation means for video”.

このため、場合によっては、「意味的な記述」、すなわち、コンテンツを制作した制作者側の意図が受信側で反映されない場合が生じる。現在のところ、提示内容選択手法や既存の代替提示を行うためのシステムにおける代替提示手段は、コンテンツの送信側と、コンテンツの受信側とが暗黙のうちに共有しているだけであり、コンテンツを制作した制作者側（送信側）の意図を反映させた上で、提示側（受信側）で導出されたものではない。 For this reason, in some cases, a “semantic description”, that is, the intention of the producer who created the content may not be reflected on the receiving side. Currently, the alternative presentation means in the presentation content selection method and the existing alternative presentation system are only implicitly shared between the content transmission side and the content reception side, It is not derived on the presentation side (reception side) after reflecting the intention of the producer side (transmission side).

この結果、コンテンツの送信に関して新しいサービス、例えば、弱視者向けに表示画面上の文字を通常よりも大きくする拡大提示サービスを実現しようとした場合、既存の代替提示を行うためのシステムでは、コンテンツの送信側で「弱視者（視聴者）が判別できない小さな文字が表示画面上に表示されているときには、当該小さな文字を拡大して（拡大文字に置き換えて）表示する」といった記述をしても、コンテンツの受信側では、導出できるようにすることができない。 As a result, when trying to realize a new service for content transmission, for example, an expanded presentation service that makes characters on the display screen larger than usual for low vision viewers, the existing system for alternative presentation Even if a description such as “when a small character that cannot be identified by a low-sighted person (viewer) is displayed on the display screen on the transmission side, the small character is enlarged (replaced by an enlarged character)” is described. The content receiving side cannot be derived.

つまり、コンテンツの受信側では「意味的な記述」である「字幕の文字が小さくて読めない場合、大きな文字にする」と解釈することができない。このように、新しいサービスを実現しようとした場合、コンテンツを制作した制作者側（送信側）と、提示側（受信側）との双方で新たなシステム（ここでは、解釈の仕方を更新したもの）を導入しなければならないということになる。 In other words, the content receiving side cannot interpret it as a “semantic description” “if the subtitle text is too small to be read, it will be made larger”. In this way, when trying to realize a new service, a new system (in this case, an updated interpretation method) on both the producer side (sender side) who created the content and the presenter side (receiver side) ) Must be introduced.

そこで、本発明では、前記した問題を解決し、視聴者個人個人の身体的な能力や、状況に対応したコンテンツの多種多様の提示仕方を選択して行うことができるコンテンツ送信装置およびコンテンツ出力装置を提供することを目的とする。 Therefore, in the present invention, a content transmission apparatus and a content output apparatus that can solve the above-described problems and can select and perform various presentation methods of contents corresponding to physical abilities of individual viewers and situations. The purpose is to provide.

前記課題を解決するため、請求項１に記載のコンテンツ出力装置は、コンテンツと、このコンテンツの代わりに提示する代替提示コンテンツと、前記コンテンツの分割箇所を示す分割メタデータと、この分割メタデータに従って前記コンテンツおよび前記代替提示コンテンツを分割した構成部分ごとおよび前記代替提示コンテンツに受信側で必要となる提示装置および当該装置の機能並びに受信側の視聴者に必要となる視聴能力を定義した必要リストメタデータと、前記コンテンツの前記構成部分および前記代替提示コンテンツをグループ化するためのグループ化メタデータと、このグループ化メタデータの相関を予め設定した基準に従って付した前記構成部分ごとおよび前記代替提示コンテンツの相関メタデータと、前記コンテンツの経過時間に対応させて付した前記代替提示コンテンツの時間情報とを受信して、前記提示装置に前記コンテンツまたは前記代替提示コンテンツを出力するコンテンツ出力装置であって、プロファイル蓄積手段と、判定手段と、導出手段と、再構成手段と、出力手段と、を備える構成とした。 In order to solve the above-mentioned problem, the content output device according to claim 1, according to the content, the alternative presentation content to be presented instead of the content, the divided metadata indicating the divided portion of the content, and the divided metadata Necessary list meta which defines each of the divided parts of the content and the alternative presentation content and a presentation device required on the reception side for the alternative presentation content, a function of the device, and a viewing ability required for the viewer on the reception side Data, grouping metadata for grouping the component parts of the content and the alternative presentation content, and each of the component parts and the alternative presentation content assigned according to a preset criterion. Correlation metadata and the course of the content A content output device that receives time information of the alternative presentation content attached in correspondence with the content information and outputs the content or the alternative presentation content to the presentation device, and includes a profile storage unit, a determination unit, The derivation unit, the reconstruction unit, and the output unit are provided.

かかる構成によれば、コンテンツ出力装置は、受信手段によって、コンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報を受信する。また、コンテンツ出力装置は、プロファイル蓄積手段に、コンテンツまたは代替提示コンテンツを視聴する視聴者の視聴能力を定義した視聴者プロファイルと、コンテンツまたは代替提示コンテンツを提示可能な提示装置および当該装置の機能について記述した視聴環境プロファイルとを蓄積している。なお、視聴者の視聴能力とは、視覚聴覚等の身体的能力、一般教養や雑学等の知識、日本語以外の語を理解する理解力、笑いやすいや泣きやすい等のコンテンツを視聴した際の感性を含むものである。 According to such a configuration, the content output device receives the content, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information by the receiving unit. The content output device also has a viewer profile that defines the viewing ability of the viewer who views the content or alternative presentation content in the profile storage means, a presentation device that can present the content or alternative presentation content, and the function of the device The viewing environment profile described is stored. The viewing ability of viewers means physical ability such as visual hearing, knowledge of general education and trivia, comprehension ability to understand words other than Japanese, and content such as laughter and crying. It includes sensitivity.

そして、コンテンツ出力装置は、判定手段によって、必要リストメタデータと、プロファイル蓄積手段に蓄積されている視聴者プロファイルおよび視聴環境プロファイルとを比較して、分割メタデータに従ってコンテンツが分割された構成部分ごとに、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータで記述された条件を満たすか否かを判定する。さらに、コンテンツ出力装置は、導出手段によって、判定手段で判定した結果、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータで記述された条件を満たさなかった場合に、満たさなかった必要リストメタデータが付加されている構成部分について、視聴者プロファイル、視聴環境プロファイル、グループ化メタデータ、相関メタデータおよび時間情報に基づいて、視聴能力を満たすと共に、同一のグループ化メタデータが付加されているまたは所定範囲内の相関メタデータが付加されている代替提示コンテンツを導出する。 Then, the content output device compares the necessary list metadata with the viewer profile and the viewing environment profile stored in the profile storage unit by the determination unit, and each component part in which the content is divided according to the divided metadata. Then, it is determined whether or not the viewer profile and the viewing environment profile satisfy the conditions described in the necessary list metadata. Furthermore, the content output apparatus determines that the necessary list metadata that has not been satisfied when the viewer profile and the viewing environment profile do not satisfy the conditions described in the necessary list metadata as a result of the determination by the determination unit by the derivation unit. Regarding the added components, the viewing ability is satisfied and the same grouping metadata is added or predetermined based on the viewer profile, viewing environment profile, grouping metadata, correlation metadata and time information. The alternative presentation content to which the correlation metadata within the range is added is derived.

コンテンツ出力装置は、再構成手段によって、導出手段で導出された代替提示コンテンツと、コンテンツの構成部分とを再構成し、出力手段によって、再構成手段で再構成された再構成コンテンツを、提示装置に出力する。 The content output device reconstructs the alternative presentation content derived by the deriving unit and the component part of the content by the reconstructing unit, and presents the reconstructed content reconstructed by the reconstructing unit by the output unit. Output to.

請求項２に記載のコンテンツ出力装置は、コンテンツと、このコンテンツの代わりに提示する代替提示コンテンツと、前記コンテンツの分割箇所を示す分割メタデータと、この分割メタデータに従って前記コンテンツを分割した構成部分ごとおよび前記代替提示コンテンツに受信側で必要となる提示装置および当該装置の機能並びに受信側の視聴者に必要となる視聴能力を定義した必要リストメタデータと、前記コンテンツの前記構成部分および前記代替提示コンテンツをグループ化するためのグループ化メタデータと、このグループ化メタデータの相関を予め設定した基準に従って付した前記構成部分ごとおよび前記代替提示コンテンツの相関メタデータと、前記コンテンツの経過時間に対応させて付した前記代替提示コンテンツの時間情報とを受信して、前記提示装置に前記コンテンツまたは前記代替提示コンテンツを出力するコンテンツ出力装置であって、受信手段と、プロファイル蓄積手段と、判定手段と、導出手段と、変換手段と、再構成手段と、出力手段と、を備える構成とした。 The content output device according to claim 2, wherein the content, the alternative presentation content to be presented instead of the content, the divided metadata indicating the divided part of the content, and the component obtained by dividing the content according to the divided metadata And necessary list metadata defining the presentation device required on the receiving side and the function of the device and the viewing capability required for the viewer on the receiving side, and the constituent parts of the content and the alternative The grouping metadata for grouping the presented content, the correlation metadata of the component parts and the alternative presented content, which are assigned according to a preset criterion, and the elapsed time of the content Time information of the alternative presentation content attached correspondingly And outputting the content or the alternative presentation content to the presentation device, the reception means, the profile storage means, the determination means, the derivation means, the conversion means, and the reconfiguration Means and output means.

かかる構成によれば、コンテンツ出力装置は、受信手段によって、コンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報を受信する。また、コンテンツ出力装置は、プロファイル蓄積手段に、コンテンツまたは代替提示コンテンツを視聴する視聴者の視聴能力を定義した視聴者プロファイルと、コンテンツまたは代替提示コンテンツを提示可能な提示装置および当該装置の機能並びに変換可能な変換装置について記述した視聴環境プロファイルとを蓄積している。 According to such a configuration, the content output device receives the content, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information by the receiving unit. In addition, the content output device includes a viewer profile that defines the viewing ability of a viewer who views the content or alternative presentation content in the profile storage means, a presentation device that can present the content or alternative presentation content, the function of the device, A viewing environment profile describing a convertible conversion device is stored.

そして、コンテンツ出力装置は、判定手段によって、必要リストメタデータと、プロファイル蓄積手段に蓄積されている視聴者プロファイルおよび視聴環境プロファイルとを比較して、分割メタデータに従ってコンテンツが分割された構成部分ごとに、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータで記述された条件を満たすか否かを判定し、導出手段によって、判定手段で判定した結果、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータで記述された条件を満たさなかった場合に、満たさなかった必要リストメタデータが付加されている構成部分について、視聴者プロファイル、視聴環境プロファイル、グループ化メタデータ、相関メタデータおよび時間情報に基づいて、視聴能力を満たすと共に、同一のグループ化メタデータが付加されているまたは所定範囲内の相関メタデータが付加されている代替提示コンテンツを導出する。 Then, the content output device compares the necessary list metadata with the viewer profile and the viewing environment profile stored in the profile storage unit by the determination unit, and each component part in which the content is divided according to the divided metadata. In addition, it is determined whether or not the viewer profile and the viewing environment profile satisfy the conditions described in the required list metadata. As a result of the determination by the determination unit by the deriving unit, the viewer profile and the viewing environment profile are determined as necessary list metadata. Based on the viewer profile, viewing environment profile, grouping metadata, correlation metadata, and time information for the component with the required list metadata that was not met if the conditions described in the data were not met Satisfying the viewing ability Together, to derive the alternative presentation content correlated metadata is added or within a predetermined range the same grouping metadata is added.

そして、コンテンツ出力装置は、変換手段によって、判定手段で判定した結果、視聴環境プロファイルが必要リストメタデータを満たさなかった場合に、満たさなかった必要リストメタデータが付加されている構成部分について、視聴者プロファイルおよび視聴環境プロファイルに基づいて、同一のグループ化メタデータが付加されているまたは所定範囲内の相関メタデータが付加されている代替提示コンテンツを、変換装置で出力可能で且つ視聴能力を満たして視聴者に伝達可能な変換部分に変換する。なお、変換手段は、導出手段により、代替提示コンテンツが導出できない場合に用いてもよいし、代替提示コンテンツが導出できる場合であっても、判定手段で判定した結果によって、すぐに用いてもよい。そして、変換手段は、導出手段で導出された代替提示コンテンツが、付加された必要リストメタデータに、変換手段で変換する必要があると記述されていれば、当該コンテンツの構成部分を変換し、視聴者に伝達可能な変換部分に変換する。そして、コンテンツ出力装置は、再構成手段によって、導出手段で導出された代替提示コンテンツと、変換手段で変換した変換部分と、コンテンツの構成部分とを再構成し、出力手段によって、再構成手段で再構成された再構成コンテンツを、提示装置に出力する。 Then, as a result of the determination by the determining means by the converting means, the content output apparatus, when the viewing environment profile does not satisfy the required list metadata, for the component portion to which the required list metadata that is not satisfied is added. Alternative presentation content to which the same grouping metadata is added or correlation metadata within a predetermined range can be output by the conversion device based on the user profile and the viewing environment profile, and the viewing capability is satisfied To convert it into a conversion part that can be transmitted to the viewer. The conversion means may be used when the alternative presentation content cannot be derived by the derivation means, or may be used immediately depending on the result determined by the determination means even when the alternative presentation content can be derived. . And if the alternative presentation content derived by the derivation means is described as being necessary to be converted by the conversion means in the added required list metadata, the conversion means converts the constituent parts of the content, Convert to a conversion part that can be transmitted to the viewer. Then, the content output device reconstructs the alternative presentation content derived by the deriving unit, the converted part converted by the converting unit, and the component part of the content by the reconstructing unit, and the reconstructing unit by the output unit The reconstructed reconstructed content is output to the presentation device.

請求項３に記載のコンテンツ送信装置は、請求項１または２に記載のコンテンツ出力装置に、コンテンツと、このコンテンツの代わりに提示する代替提示コンテンツと、前記コンテンツの分割箇所を示す分割メタデータと、この分割メタデータに従って前記コンテンツを分割した構成部分ごとおよび前記代替提示コンテンツに受信側で必要となる提示装置および当該装置の機能並びに受信側の視聴者に必要となる視聴能力を定義した必要リストメタデータと、前記コンテンツの前記構成部分および前記代替提示コンテンツをグループ化するためのグループ化メタデータと、このグループ化メタデータの相関を予め設定した基準に従って付した前記構成部分ごとおよび前記代替提示コンテンツの相関メタデータと、前記コンテンツの経過時間に対応させて付した時間情報とを送信する送信手段を備える構成とした。 The content transmission device according to claim 3 is a content output device according to claim 1 or 2, wherein the content, the alternative presentation content to be presented instead of the content, and the divided metadata indicating the divided portion of the content, The required list that defines the presentation device required on the reception side for each component part obtained by dividing the content according to the division metadata and the substitute presentation content, the function of the device, and the viewing ability required for the viewer on the reception side Meta data, grouping metadata for grouping the component parts of the content and the alternative presentation content, and each of the component parts and the alternative presentations assigned with a correlation of the grouped metadata according to a preset criterion The correlation metadata of the content and the elapsed time of the content And configured to include a transmitting means for transmitting a response is not subjected time information.

かかる構成によれば、コンテンツ送信装置は、送信手段によって、コンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報を送信する。なお、ここで送信する際に、これらを多重化して送信してもよいし、受信側で対応関係が識別できる状態で、別々に送信してもよい。 According to such a configuration, the content transmission device transmits the content, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information by the transmission unit. In addition, when transmitting here, you may multiplex these and may transmit separately in the state which can identify a correspondence on the receiving side.

請求項１に記載の発明によれば、送信側から送信されたコンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報と、蓄積しておいた視聴者プロファイルおよび視聴環境プロファイルとを参照して、視聴者個人個人の身体的な能力や、状況に対応した、コンテンツ制作者の意図を可能な限り損なわずに伝達するための提示の仕方を導出して、コンテンツを提示することができる。 According to the first aspect of the present invention, the content transmitted from the transmission side, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information are stored. Referring to the viewer profile and the viewing environment profile, derive the presentation method to convey the physical ability of the individual viewer and the content creator's intention as much as possible without compromising the situation. Thus, the content can be presented.

請求項２に記載の発明によれば、送信側から送信されたコンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報と、蓄積しておいた視聴者プロファイルおよび視聴環境プロファイルとを参照して、視聴者個人個人の身体的な能力や、状況に対応した、コンテンツ制作者の意図を可能な限り損なわずに伝達するための提示の仕方を導出して、コンテンツを提示することができ、さらに、場合によっては、コンテンツの構成部分を伝達可能に変換した変換部分を当該コンテンツに付加することができる。 According to the second aspect of the present invention, the content transmitted from the transmission side, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information are stored. Referring to the viewer profile and the viewing environment profile, derive the presentation method to convey the physical ability of the individual viewer and the content creator's intention as much as possible without compromising the situation. Thus, the content can be presented, and in some cases, a converted part obtained by converting the constituent part of the content so as to be transmitted can be added to the content.

請求項３に記載の発明によれば、コンテンツの構成部分ごとに分割メタデータに従って分割し、分割した構成部分ごとおよび代替提示コンテンツに受信側で必要となる提示装置および当該装置の機能並びに受信側の視聴者に必要となる視聴能力を定義した必要リストメタデータと、コンテンツの構成部分および代替提示コンテンツをグループ化するためのグループ化メタデータと、このグループ化メタデータの相関を予め設定した基準に従って付した構成部分ごとおよび代替提示コンテンツの相関メタデータとを送信することで、受信側では、蓄積されている視聴者プロファイルと視聴環境プロファイルとを参照して、視聴者個人個人の身体的な能力や、状況に対応した、コンテンツ制作者の意図を可能な限り損なわずに伝達するための提示の仕方を導出して、コンテンツを提示することができる。 According to the third aspect of the present invention, each component of content is divided according to the divided metadata, and the presentation device required for the divided component and the alternative presentation content on the receiving side, the function of the device, and the receiving side Necessary list metadata that defines the viewing ability required for viewers, grouping metadata for grouping content components and alternative presentation content, and criteria that set the correlation between the grouping metadata in advance By transmitting the component parts attached in accordance with the above and the correlation metadata of the alternative presentation content, the receiving side refers to the stored viewer profile and viewing environment profile, and the physical body of the individual viewer A presentation to convey the ability and context of the content creator as much as possible Derive the way, it is possible to present the content.

次に、本発明の実施形態について、適宜、図面を参照しながら詳細に説明する。
図１はコンテンツ送信出力システムのブロック図である。図１に示すように、コンテンツ送信出力システムＳは、送信側において、コンテンツと、このコンテンツの代わりに提示する代替提示コンテンツと、様々なメタデータとを送信することで、受信側において、当該コンテンツおよび当該代替提示コンテンツを制作した制作者の意図を反映させると同時に、受信側の視聴者にあわせたコンテンツの種々の提示形態を実現することができるもので、コンテンツ送信装置１と、コンテンツ出力装置３とを備えている。なお、コンテンツ出力装置３には、コンテンツの映像や音声を出力（提示）する提示装置２が接続されている。 Next, embodiments of the present invention will be described in detail with reference to the drawings as appropriate.
FIG. 1 is a block diagram of a content transmission output system. As shown in FIG. 1, the content transmission output system S transmits content, alternative presentation content to be presented instead of this content, and various metadata on the transmission side, so that the content can be transmitted on the reception side. In addition to reflecting the intention of the creator who created the alternative presentation content, it is possible to realize various presentation forms of the content according to the viewer on the receiving side. 3 is provided. The content output device 3 is connected to a presentation device 2 that outputs (presents) video and audio of the content.

〈コンテンツ送信装置の構成〉
コンテンツ送信装置１は、コンテンツ（本来コンテンツ）と、代替提示コンテンツと、様々メタデータとを送信するもので、入力手段５と、送信手段７とを備えている。
入力手段５は、コンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを入力するものである。 <Configuration of content transmission device>
The content transmission device 1 transmits content (original content), alternative presentation content, and various metadata, and includes an input unit 5 and a transmission unit 7.
The input means 5 inputs content, alternative presentation content, divided metadata, necessary list metadata, grouping metadata, correlation metadata, and time information.

コンテンツは、いわゆるマルチメディアコンテンツであり、映像（動画）や音声に加え、テキスト、静止画、効果音、意味情報等の様々なデータから構成されているものである。また、このコンテンツは、分割メタデータで分割される複数の構成部分から成り立っている。 The content is so-called multimedia content, and is composed of various data such as text, still images, sound effects, and semantic information in addition to video (moving image) and audio. Further, this content is composed of a plurality of components divided by the divided metadata.

代替提示コンテンツは、コンテンツ（本来コンテンツ）を制作した制作者が本来想定している受信側における提示形態が実現できない場合に、当該コンテンツの代わりに提示するコンテンツである。つまり、受信側において、本来コンテンツがそのままの状態で提示されない場合であっても、制作者の意図を伝えられるようにしたコンテンツである。 The alternative presentation content is content that is presented in place of the content when the presentation form on the receiving side that is originally assumed by the creator who created the content (original content) cannot be realized. In other words, the content is such that the intention of the producer can be transmitted even if the content is not originally presented as it is on the receiving side.

例えば、本来コンテンツが映像と音声と字幕とからなる通常のニュース番組であり、受信側の視聴者が聴覚障害者であった場合に、代替提示コンテンツは、音声の代わりになるもので、制作者の意図である「各ニュースの詳細内容を伝える」ことを実現できるように、音声で読み上げられていることを、音声を読み上げているアナウンサーの近くに表示されるテキストデータに置き換えたものとなる。ここで、テキストデータが表示される位置を「アナウンサーの近く」としたのは、本来コンテンツの字幕とは異なり、より視聴者が見やすい位置といった意味合いである。さらに、テキストデータとして表示される代替提示コンテンツは、見やすさ、読みやすさといった観点から、字幕で表示される文字よりも大きな文字で表示されるようにすることが好ましい。 For example, if the original content is a normal news program consisting of video, audio, and subtitles, and the viewer on the receiving side is a hearing impaired person, the alternative presentation content is a substitute for audio, In order to realize the purpose of “to convey the detailed contents of each news”, the text-to-speech is replaced with text data displayed near the announcer who is reading the voice. Here, the position where the text data is displayed is “near the announcer”, which means that the position is easier for the viewer to see, unlike the subtitle of the original content. Furthermore, it is preferable that the alternative presentation content displayed as text data be displayed in characters larger than the characters displayed in subtitles from the viewpoints of ease of viewing and readability.

分割メタデータは、コンテンツを構成部分に分割する際の分割箇所を示すメタデータである。具体的には、分割メタデータは、コンテンツの内容により、分割する箇所を、時間的範囲、すなわち、映像コンテンツの画面上の座標や音声コンテンツの周波数帯域によって示したものである。 The division metadata is metadata indicating a division part when the content is divided into constituent parts. Specifically, the divided metadata indicates a portion to be divided according to the content contents by a temporal range, that is, a coordinate on the screen of the video content or a frequency band of the audio content.

例えば、コンテンツが野球中継番組である場合に、分割メタデータは、ピッチャーがピッチングしたシーン、バッターがヒッティングしたシーン、出塁したランナーが走塁したシーン等、各シーンの区切りとなる時刻を示したものとなる。 For example, when the content is a baseball broadcast program, the divided metadata indicates the time at which each scene is separated, such as a scene pitched by a pitcher, a scene hit by a batter, or a scene run by a runner who has come out. It will be a thing.

必要リストメタデータは、コンテンツ（本来コンテンツ）を制作した制作者の意図に従って、本来コンテンツが受信側で表示されるように、当該受信側で必要な提示装置およびこの提示装置の機能を列挙し、さらに、受信側の視聴者に必要となる視聴能力を定義したメタデータである。 Necessary list metadata enumerates presentation devices and functions of the presentation device that are necessary on the receiving side so that the original content is displayed on the receiving side according to the intention of the creator who created the content (original content). Furthermore, the metadata defines viewing ability required for the viewer on the receiving side.

例えば、必要リストメタデータは、コンテンツの提示が受信側の視聴者の視覚に訴えるものであれば、「映像ディスプレイ、視覚」と、聴覚に訴えるものであれば、「スピーカ、聴覚」と、リストアップしたものとなる。 For example, the required list metadata is “video display, visual” if the presentation of the content appeals to the viewer's visual perception, and “speaker, auditory” if the content presentation appeals to the hearing. It will be up.

視聴能力とは、視聴者の身体的能力、知識（知識量）、認識力、理解力、感性等を指し、これらを受信側の装置で処理可能なように、テキストデータ化または数値化したものである。 Viewing ability refers to the viewer's physical ability, knowledge (knowledge amount), cognitive ability, comprehension ability, sensitivity, etc., which are converted to text data or digitized so that they can be processed by the receiving device. It is.

例えば、必要リストメタデータに「視力」という身体的能力に関するテキストデータがリストアップされている場合、これから導出される条件は「視力のある人だけが、この構成部分に含まれる制作者の意図を理解できる」である。
また、必要リストメタデータに「日本語の聞き取り能力」という理解力に関するテキストデータがリストアップされている場合、これから導出される条件は「日本語の聞き取り能力がある人だけが、この構成部分に含まれる制作者の意図を理解できる」である。 For example, if text data related to physical ability called “sight” is listed in the required list metadata, the condition derived from this is “only the person with vision has the intention of the creator included in this component. I understand. "
In addition, if the required list metadata lists text data related to comprehension, “Japanese listening ability”, the condition derived from this is “Only those who have Japanese listening ability can include this component. I understand the intentions of the creators included. "

さらに、必要リストメタデータに「○×首相に関する知識」という知識に関するテキストデータがリストアップされている場合、これから導出される条件は「○×首相を知っている人だけが、この構成部分に含まれる制作者の意図を理解できる」である。
さらにまた、数値化されたデータの例として、必要リストメタデータに「視力０．５以上」という身体的能力に関するテキストデータおよび数値化データがリストアップされている場合、これから導出される条件は「視力０．５以上の人だけが、この構成部分に含まれる制作者の意図を理解できる」である。 In addition, if the required list metadata lists text data related to the knowledge "XX Prime Minister", the condition derived from this is that only those who know the XX Prime Minister are included in this component. Understand the intentions of the creators. "
Furthermore, as an example of the quantified data, when text data and quantified data related to physical ability “visual acuity 0.5 or more” are listed in the necessary list metadata, the condition derived from this is “ Only people with a visual acuity of 0.5 or more can understand the intentions of the producers included in this component. "

グループ化メタデータは、コンテンツが分割された構成部分を分類する際に付されるものである。つまり、同じグループに属する構成部分には同一のグループ化メタデータが、違うグループに属する構成部分には異なるグループ化メタデータが付されることになる。なお、この実施の形態では、コンテンツおよび代替提示コンテンツを制作した「制作者の意図」に基づいて、当該コンテンツの構成部分および当該代替提示コンテンツを分類し、当該分類に対応したグループ化メタデータが付されている。 The grouping metadata is attached when classifying the constituent parts into which the content is divided. That is, the same grouping metadata is attached to the components belonging to the same group, and different grouping metadata is assigned to the components belonging to different groups. In this embodiment, the constituent parts of the content and the alternative presentation content are classified based on the “creator's intention” that produced the content and the alternative presentation content, and grouping metadata corresponding to the classification is provided. It is attached.

例えば、コンテンツが野球中継番組である場合に、グループ化メタデータは、ピッチャーがピッチングしたシーンであれば、♯Ａ（「視聴者にボールを投げたことを理解させる」という制作者の意図に対応させて付けた記号）、バッターがヒッティングしたシーンであれば、♯Ｂ（「視聴者にボールを打ったことを理解させる」という制作者の意図に対応させて付けた記号）といったものになる。 For example, if the content is a baseball broadcast program and the grouping metadata is a scene pitched by the pitcher, it corresponds to the creator ’s intention of #A (“Let the viewer understand that the ball was thrown”) If the scene was hit by the batter, it would be #B (a symbol attached in response to the creator ’s intention to “make the viewer understand that the ball was hit”). .

相関メタデータは、構成部分ごとのグループ化メタデータの相関を、予め設定した基準に従って数値化した、または、ＩＤ化したものである。例えば、構成部分００１と構成部分００２とのグループ化メタデータが♯Ａ（「視聴者にボールを投げたことを理解させる」という制作者の意図に対応させて付けた記号）ということであれば、構成部分００１と構成部分００２との相関メタデータは、グループ化メタデータが同一であるので、高い値、例えば１００（相関度１００％）をとり、構成部分００３のグループ化メタデータが＃Ｂ（「視聴者にボールを打ったことを理解させるという制作者の意図に対応させて付けた記号」）ということであれば、構成部分００１と構成部分００３との相関メタデータは、グループ化メタデータが異なっているので、低い値、例えば、１０（相関度１０％）をとる。 The correlation metadata is obtained by converting the correlation of grouped metadata for each component into a numerical value or an ID according to a preset criterion. For example, if the grouping metadata of the constituent part 001 and the constituent part 002 is #A (a symbol given in correspondence with the creator's intention to “make the viewer understand that the ball has been thrown”). Since the correlation metadata between the component part 001 and the component part 002 has the same grouping metadata, the correlation metadata of the component part 003 takes a high value, for example, 100 (correlation degree 100%), and the grouping metadata of the component part 003 is #B. ("A symbol attached in correspondence with the creator's intention to make the viewer understand that the player has hit the ball"), the correlation metadata between the component part 001 and the component part 003 is the grouping metadata. Since the data is different, a low value, for example, 10 (correlation degree 10%) is taken.

また、グループ化メタデータに相関関係がある場合について説明すると、例えば、コンテンツが野球中継番組であり、コンテンツの構成部分００Ａが実況音声で、グループ化メタデータが＃Ｃ（「視聴者に試合の進行状況を理解させる」という制作者の意図に対応させて付けた記号）で、コンテンツの構成部分００Ｂがアウトカウントとストライク数とボール数とを示すＳＢＯの字幕スーパーであり、グループ化メタデータが♯Ｄ（「視聴者に試合の進行状況を理解させる」という制作者の意図に対応させて付けた記号）で、これらのグループ化メタデータ♯Ｃおよび♯Ｄの相関の相関メタデータが、相関（ＩＤ：♯Ｃ，ＩＤ：＃Ｄ，８０）として付されている場合に、これらグループ化メタデータ♯Ｃおよび♯Ｄのとは相関関係が高い（相関度８０、相似関係にある）となる。なお、この相関関係が高いと、受信側において、コンテンツの該当する構成部分の表示が不可の場合に、代替提示コンテンツとして、選択される優先順位が高くなることになる。 The case where the grouped metadata has a correlation will be described. For example, the content is a baseball broadcast program, the content component 00A is live audio, and the grouped metadata is #C (" The content component part 00B is a subtitle supermarket of SBO indicating the outcount, the number of strikes and the number of balls, and the grouping metadata is In #D (a symbol given in correspondence with the producer's intention to “make viewers understand the progress of the game”), the correlation metadata of the correlation between these grouped metadata #C and #D is correlated. (ID: #C, ID: #D, 80), the grouping metadata #C and #D have a high correlation (phase) Degree 80, in similar relationship) become. When this correlation is high, the priority order selected as the alternative presentation content is high when the receiving side cannot display the corresponding component part of the content.

時間情報は、コンテンツの経過時間に対応させて、代替提示コンテンツに付した時間に関する情報であり、実際の時間（実時間）に対して、相対的な時間を示すものである。つまり、コンテンツの時間に関する情報が送信した時刻、すなわち、タイムコード等で一義的に決定されるのに対し、代替提示コンテンツの時間情報は、当該コンテンツを視聴する際（蓄積した後であれば再生する際）にコンテンツの先頭を視聴し始めてから（再生してから）経過した時間に対応するように付されたものである。 The time information is information related to the time attached to the alternative presentation content in correspondence with the elapsed time of the content, and indicates a relative time with respect to the actual time (real time). In other words, the information regarding the time of the content is uniquely determined by the transmission time, that is, the time code, while the time information of the alternative presented content is reproduced when viewing the content (after being accumulated, When the head of the content is started to be viewed (reproduced), it is attached so as to correspond to the time elapsed.

送信手段７は、入力手段５に入力されたコンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを、コンテンツ出力装置３に送信するものである。 The transmission means 7 sends the content input to the input means 5, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information to the content output device. 3 is transmitted.

このコンテンツ送信装置１によれば、分割メタデータにより、コンテンツの構成部分ごとに分割し、必要リストメタデータにより、受信側で必要となる提示装置および当該提示装置の機能並びに受信側の視聴者に必要となる視聴能力を定義し、グループ化メタデータにより、構成部分および代替提示コンテンツをグループ化し、相関メタデータにより、これらのグループの相関を数値化しておくことで、受信側では、蓄積されている視聴者プロファイルと視聴環境プロファイルとを参照して、視聴者個人個人の身体的な能力や、状況に対応したコンテンツの多種多様の提示仕方を選択して行うことができる According to the content transmitting apparatus 1, the divided metadata is used to divide each component of the content, and the necessary list metadata is used to provide the presentation device required on the reception side, the function of the presentation device, and the viewer on the reception side. By defining the required viewing ability, grouping the component parts and alternative presentation content by grouping metadata, and quantifying the correlation of these groups by correlation metadata, it is accumulated on the receiving side. The viewer's individual profile and viewing environment profile can be referred to, and a variety of presentation methods can be selected according to the physical abilities of the individual viewer and the content corresponding to the situation.

〈コンテンツ出力装置の構成〉
コンテンツ出力装置３は、コンテンツ送信装置１から送信されたコンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを受信して、入力された視聴者プロファイルに応じて、接続される提示装置２に出力するもので、受信手段９と、コンテンツ・メタデータ蓄積手段１１と、プロファイル保持手段１３と、判定手段１５と、導出手段１７と、変換手段１９と、再構成手段２１と、出力手段２３とを備えている。 <Configuration of content output device>
The content output device 3 receives the content transmitted from the content transmitting device 1, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information. In response to the input viewer profile, the information is output to the connected presentation device 2. The receiving means 9, the content / metadata storage means 11, the profile holding means 13, the determination means 15, and the derivation Means 17, conversion means 19, reconstruction means 21, and output means 23 are provided.

受信手段９は、コンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを受信し、これらを判定手段１５およびコンテンツ・メタデータ蓄積手段１１に出力するものである。なお、コンテンツ出力装置３において、受信手段９で受信したコンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを判定手段１５に出力するだけでなく、コンテンツ・メタデータ蓄積手段１１に出力しているのは、リアルタイムでコンテンツまたは代替提示コンテンツを提示する際に、再構成手段２１で当該コンテンツまたは当該代替提示コンテンツを再構成する必要が生じた場合に備えるためである。 The receiving means 9 receives the content, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information, and these are received as the determination means 15 and the content This is output to the metadata storage means 11. In the content output device 3, the content received by the receiving means 9, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information are determined. 15, the content / metadata storage unit 11 outputs the content or the alternative presentation content in real time when the reconfiguration unit 21 regenerates the content or the alternative presentation content. This is to prepare for the case where it is necessary to configure.

コンテンツ・メタデータ蓄積手段１１は、受信手段９で受信されたコンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを蓄積するもので、一般的な記録媒体によって構成されている。このコンテンツ・メタデータ蓄積手段１１に蓄積されているコンテンツと、代替提示コンテンツと、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とは、必要に応じて再構成手段２１から読み出される。 The content / metadata storage unit 11 receives the content received by the reception unit 9, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information. It accumulates and is constituted by a general recording medium. The content stored in the content / metadata storage means 11, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information are necessary. In response, the data is read from the reconstruction means 21.

プロファイル保持手段１３は、コンテンツ出力装置３を利用する利用者（視聴者）が入力した視聴者プロファイルおよび視聴環境プロファイルを保持（蓄積）するもので、一般的な記録媒体によって構成されている。
視聴者プロファイルは、視聴者の個人情報（性別、年齢、職業、趣味・嗜好等）と、生活スタイル（平日および休日の視聴時間帯、集中度・状況等）と、視聴能力（身体的能力、知識（知識量）、認識力、理解力、感性等）とを定義したものである。なお、この視聴者プロファイルでは、これら個人情報、生活スタイルおよび視聴能力が当該装置３で処理可能なように、テキストデータ化または数値化されている。 The profile holding means 13 holds (accumulates) the viewer profile and the viewing environment profile input by the user (viewer) who uses the content output device 3, and is configured by a general recording medium.
The viewer profile includes the viewer's personal information (gender, age, occupation, hobbies / preferences, etc.), lifestyle (weekday and holiday viewing hours, concentration / situation, etc.), and viewing ability (physical ability, Knowledge (knowledge amount), cognitive ability, understanding ability, sensitivity, etc.). In this viewer profile, the personal information, lifestyle and viewing ability are converted into text data or digitized so that the apparatus 3 can process them.

個人情報における性別、年齢および職業とは、当該装置３を利用する視聴者が家族単位である場合には、当該家族を構成する視聴者各個人の性別、年齢および職業である。
個人情報における趣味・嗜好とは、映画鑑賞、音楽鑑賞、読書、スポーツ、料理、囲碁将棋、園芸、写真、語学学習、歴史学習といった趣味と、お酒、お茶、コーヒー、たばこといった嗜好（嗜好品）とに関する情報である。 The gender, age, and occupation in the personal information are the gender, age, and occupation of each individual viewer who constitutes the family when the viewer who uses the device 3 is a family unit.
Hobbies and preferences in personal information include movies, music appreciation, reading, sports, cooking, shogi shogi, gardening, photography, language learning, history learning, and liquor, tea, coffee, tobacco and other preferences ) And information.

生活スタイルにおける視聴時間帯とは、視聴者がコンテンツを視聴する主な時間帯に関する情報である。例えば、視聴者が学生や勤労者であれば、平日では、起床時間から登校（出勤）時間まで、且つ、帰宅時間から就寝時間までの時間帯の中で、休日では、起床時間から就寝時間までの時間帯の中で、曜日ごとのコンテンツを視聴する時間帯に関する情報である。また、視聴者が専業主婦であれば、起床時間から就寝時間までの時間帯の中で、曜日ごとのコンテンツを視聴する時間帯に関する情報である。 The viewing time zone in the lifestyle is information on the main time zone in which the viewer views the content. For example, if the viewer is a student or a worker, on weekdays, from wake-up time to school (commuting time), and in the time zone from return time to bedtime, on holidays, from wake-up time to bedtime In this time zone, it is information relating to the time zone for viewing content for each day of the week. In addition, if the viewer is a full-time housewife, the information is related to a time period during which content for each day of the week is viewed in the time period from wake-up time to bedtime.

生活スタイルにおける集中度・状況とは、コンテンツを視聴する際の視聴者の集中度合いや、コンテンツを視聴（提示）する際の周囲の状況を示す情報である。例えば、視聴者が集中してコンテンツを視聴する「集中視聴（通常視聴）」、視聴者が何か別のことをしながらコンテンツを視聴する「ながら視聴」、視聴者がコンテンツの映像を見られない状況にある場合の「映像不可視聴」、視聴者がコンテンツの音声を聞き取れない状況にある場合の「音声不可視聴」、視聴者が映像または音声以外の代替え提示を受けられない場合の「代替不可視聴」といったものである。 The degree of concentration / situation in the lifestyle is information indicating the degree of concentration of the viewer when viewing the content and the surrounding situation when viewing (presenting) the content. For example, “concentrated viewing (normal viewing)” where the viewer concentrates on viewing the content, “view while viewing” where the viewer views the content while doing something else, and the viewer can view the content video "No video viewing" when there is no video, "No audio viewing" when the viewer is unable to hear the audio of the content, "Alternative" when the viewer is unable to receive an alternative presentation other than video or audio It is something like “impossible viewing”.

視聴能力における身体的能力（知覚力）とは、視聴者が健常者である場合を除いて、視聴者に視覚障害や聴覚障害がある場合、コンテンツの通常の視聴が困難であることを示す情報である。
視聴能力における知識（知識量）とは、視聴者の一般的な教養レベルやコンテンツに関係する背景知識を示すものである。例えば、一般的な教養レベルを示すものとして、日本語の語彙に関しては、漢字検定で特定される級が想定される。 Physical ability (perceptual ability) in viewing ability is information indicating that normal viewing of content is difficult when the viewer has visual impairment or hearing impairment, except when the viewer is healthy. It is.
Knowledge (knowledge amount) in viewing ability indicates background knowledge related to the general level of education and content of the viewer. For example, a class specified by a kanji test is assumed for a Japanese vocabulary to indicate a general level of education.

視聴能力における認識力とは、視聴者が健常者である場合に、視聴者が日本語以外の言語、例えば、英語やドイツ語を理解できるか否かを示す情報であり、言語聞き取り能力（日本語や英語等）と、言語読み取り能力（日本語や英語等）とがある。 The recognition ability in viewing ability is information indicating whether or not the viewer can understand a language other than Japanese, for example, English or German, when the viewer is a healthy person. Language and English) and language reading ability (Japanese, English, etc.).

視聴能力における理解力とは、視聴者が健常者である場合に、視聴者がどれだけコンテンツの言語（日本語や英語等）や情景を理解できるか否かを示す情報である。また、視聴者が視覚障害者である場合に、点字が理解できるか否かや、視聴者が聴覚障害者である場合に、手話が理解できるか否かに関する情報である。 The comprehension in viewing ability is information indicating how much the viewer can understand the language (Japanese, English, etc.) and scene of the content when the viewer is a healthy person. Further, it is information regarding whether or not Braille can be understood when the viewer is visually impaired, and whether or not sign language can be understood when the viewer is hearing impaired.

視聴能力における感性とは、視聴者の感性を示すもので、感動しやすい、涙もろい、イライラしやすい等を数値化したものである。この数値は、例えば、図示を省略した表示装置に表示される、予め用意しておいたアンケート形式の入力シートに、視聴者が回答した結果に基づいて定められるものである。例えば、「恋愛ドラマで感動できる？」、「悲劇に共感できる？」、「長い解説に耐えられる？」という問いに対して、５段階（１〜５の数値）で自己評価することで得られる。また、この数値は、視聴者が視聴したコンテンツの視聴履歴を参照して、当該視聴履歴に含まれているコンテンツの選択傾向から推量して求める方法が想定される。 Sensitivity in viewing ability indicates a viewer's sensibility, and is a numerical value of being easily moved, tearful, easily frustrated, and the like. This numerical value is determined based on the result of the viewer answering a prepared questionnaire-type input sheet displayed on a display device (not shown). For example, it can be obtained by self-assessment in five stages (numerical values of 1 to 5) in response to questions such as “Can you be impressed by a love drama?”, “Can you sympathize with a tragedy”, or “Can you withstand long explanations?” . Further, it is assumed that this numerical value is obtained by referring to the viewing history of the content viewed by the viewer and estimating from the selection tendency of the content included in the viewing history.

すなわち、これらの感性は、コンテンツを視聴する上で視聴者の心理的な傾向を示すものであり、ここで定義される感性によっても、コンテンツの構成部分を、代替提示コンテンツに代えることができる。例えば、恋愛ドラマに感動できないという感性（「恋愛ドラマで感動できる？」という問いに対して回答した数値が低い）を持った視聴者の場合、当該視聴者を感動させるため（無理矢理感動してもらうため）に、より過剰な演出を行った代替提示コンテンツを用意しておいて、該当するコンテンツの構成部分を当該代替提示コンテンツに変更する。また、悲劇に共感できないという感性（「悲劇に共感できる？」という問いに対して回答した数値が低い）を持った視聴者の場合、当該視聴者を共感させるため（無理矢理共感してもらう或いは無理矢理悲しい気持ちになってもらうため）に、より過剰な演出を行った代替提示コンテンツを用意しておいて、該当するコンテンツの構成部分を当該代替提示コンテンツに変更する。さらに、長い解説に耐えられないという感性（「長い解説に耐えられる？」という問いに対して回答した数値が低い）を持った視聴者の場合、当該視聴者が視聴に耐えられるように、より簡素な解説を行った代替提示コンテンツを用意しておいて、該当するコンテンツの構成部分を当該代替提示コンテンツに変更する。
なお、これらの視聴能力における感性は、当該装置３の制御手段（図示せず）によって処理され、プロファイル保持手段１３に事前に保持される。 That is, these sensibilities indicate a viewer's psychological tendency in viewing the content, and the constituent parts of the content can be replaced with alternative presented content by the sensitivity defined here. For example, in the case of a viewer who has a sensitivity that cannot be touched by a love drama (the numerical value answered in response to the question “Can you touch the love drama?” Is low) For this reason, the alternative presentation content with a more excessive presentation is prepared, and the constituent part of the corresponding content is changed to the alternative presentation content. In addition, in the case of a viewer who has the sensitivity that the tragedy cannot be sympathized (the numerical value answered in response to the question “Can you sympathize with tragedy?”), In order to sympathize with the viewer In order to have a sad feeling), an alternative presentation content with a more excessive presentation is prepared, and the constituent part of the corresponding content is changed to the alternative presentation content. In addition, for viewers who have a sense of being unable to withstand long explanations (the number of respondents who answered “Can you withstand long explanations?” Is low) An alternative presentation content with a simple explanation is prepared, and the constituent part of the corresponding content is changed to the alternative presentation content.
Note that the sensibility in the viewing ability is processed by the control means (not shown) of the apparatus 3 and is held in advance in the profile holding means 13.

また、視聴環境プロファイルは、コンテンツまたは代替提示コンテンツを提示可能な提示装置および変換可能な変換装置について記述したものである。
コンテンツまたは代替提示コンテンツを提示可能な提示装置についての記述とは、当該装置３に接続されている提示装置２に関することであり、当該提示装置２の名称、機能、仕様に関する情報である。また、コンテンツまたは代替提示コンテンツを変換可能な変換装置についての記述とは、後記する変換手段１９の機能に関する情報である。 The viewing environment profile describes a presentation device capable of presenting content or alternative presentation content and a conversion device capable of conversion.
The description of the presentation device that can present content or alternative presentation content is related to the presentation device 2 connected to the device 3, and is information about the name, function, and specification of the presentation device 2. The description of the conversion device that can convert the content or the alternative presentation content is information regarding the function of the conversion means 19 described later.

判定手段１５は、受信手段９で受信された必要リストメタデータと、プロファイル保持手段１３に保持されている視聴者プロファイルおよび視聴環境プロファイルとを比較して、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータで記述された条件を満たすか否かを判定するものである。この判定手段１５によって、視聴環境プロファイルが必要リストメタデータで記述された条件を満たすと判定された場合、判定手段１５に入力されたコンテンツはそのまま出力手段２３に出力される。 The judging unit 15 compares the necessary list metadata received by the receiving unit 9 with the viewer profile and the viewing environment profile held in the profile holding unit 13, and the viewer profile and the viewing environment profile are necessary lists. It is determined whether or not the conditions described in the metadata are satisfied. When the determination unit 15 determines that the viewing environment profile satisfies the condition described in the necessary list metadata, the content input to the determination unit 15 is output to the output unit 23 as it is.

また、この判定手段１５によって、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータで記述された条件を満たすと判定されなかった場合、コンテンツ、代替提示コンテンツ、グループ化メタデータおよび相関メタデータを導出手段１７に出力する、または、コンテンツ、代替提示コンテンツ、視聴者プロファイルおよび視聴環境プロファイルを変換手段１９に出力する。 In addition, when the determination unit 15 does not determine that the viewer profile and the viewing environment profile satisfy the conditions described in the necessary list metadata, the content, alternative presentation content, grouping metadata, and correlation metadata are derived. The content, the alternative presentation content, the viewer profile, and the viewing environment profile are output to the conversion unit 19.

導出手段１７は、視聴者プロファイルおよび視聴環境プロファイルが必要リストメタデータを満たすと判定されなかった場合、コンテンツの構成部分の代わりに、代替提示コンテンツを、グループ化メタデータ、相関メタデータおよび時間情報に基づいて導出するものである。なお、この導出手段１７によって、代替提示コンテンツを導出する導出方法（導出の仕方）の詳細は後記することとする。 The deriving unit 17, when it is determined that the viewer profile and the viewing environment profile satisfy the necessary list metadata, substitutes the alternative presentation content for the grouping metadata, the correlation metadata, and the time information instead of the constituent parts of the content. Is derived on the basis of The details of the derivation method (derivation method) for deriving the alternative presentation content by the derivation means 17 will be described later.

変換手段１９は、導出手段１７で導出された代替提示コンテンツの必要リストメタデータに、当該代替提示コンテンツを変換するコンテンツ変換装置がリストアップされている場合に、当該代替提示コンテンツを視聴者に伝達可能な変換部分に変換するものである。なお、この変換手段１９によって、代替提示コンテンツを変換部分に変換する変換方法（変換の仕方）の詳細は後記することとする。
ここでは、導出手段１７と変換手段１９とを併用しているが、少なくとも導出手段１７を備えていればよい。 When the content conversion apparatus that converts the alternative presentation content is listed in the necessary list metadata of the alternative presentation content derived by the derivation unit 17, the conversion unit 19 transmits the alternative presentation content to the viewer. It converts to a possible conversion part. Note that details of the conversion method (conversion method) for converting the alternative presentation content into a converted part by the conversion means 19 will be described later.
Here, the derivation means 17 and the conversion means 19 are used together, but at least the derivation means 17 may be provided.

再構成手段２１は、導出手段１７で導出された代替提示コンテンツ、または変換手段１９で代替提示コンテンツが変換された変換部分と、コンテンツ・メタデータ蓄積手段１１で蓄積しておいたコンテンツとを再構成するものである。 The reconstructing means 21 reconstructs the alternative presentation content derived by the derivation means 17 or the converted portion obtained by converting the alternative presentation content by the conversion means 19 and the content accumulated by the content / metadata accumulation means 11. It constitutes.

出力手段２３は、再構成手段２１で再構成された再構成コンテンツを提示装置２に出力するものである。 The output unit 23 outputs the reconstructed content reconfigured by the reconfiguration unit 21 to the presentation device 2.

このコンテンツ出力装置３によれば、送信側から送信されたコンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報を受信手段９で受信し、判定手段１５、導出手段１７によって、プロファイル保持手段１３で保持（蓄積）しておいた視聴者プロファイルおよび視聴環境プロファイルを参照して、視聴者個人個人の身体的な能力や、状況に対応したコンテンツの多種多様の提示仕方を選択して行うことができ、さらに、場合によっては、変換手段１９によって、代替提示コンテンツを伝達可能に変換した変換部分を当該コンテンツに付加することができる。 According to the content output device 3, the reception means 9 receives the content, alternative presentation content, divided metadata, necessary list metadata, grouping metadata, correlation metadata, and time information transmitted from the transmission side, and determines By referring to the viewer profile and the viewing environment profile held (accumulated) by the profile holding unit 13 by the means 15 and the derivation unit 17, the physical ability of the individual viewer and the content corresponding to the situation A variety of presentation methods can be selected and performed. Further, in some cases, the conversion unit 19 can add a converted portion converted from the alternative presentation content to the content.

〈コンテンツ送信装置の動作〉
次に、コンテンツ送信装置１の動作について説明する。
コンテンツ送信装置１は、入力手段５により、コンテンツ（本来コンテンツ）と、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータとを入力する。そして、コンテンツ送信装置１は、送信手段７によって入力されたコンテンツ（本来コンテンツ）と、分割メタデータと、必要リストメタデータと、グループ化メタデータと、相関メタデータと、時間情報とを送信する。 <Operation of content transmission device>
Next, the operation of the content transmission device 1 will be described.
The content transmitting apparatus 1 inputs content (original content), divided metadata, necessary list metadata, grouping metadata, and correlation metadata by the input unit 5. Then, the content transmission device 1 transmits the content (original content) input by the transmission means 7, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information. .

〈コンテンツ出力装置の動作〉
次に、図２および図３に示すフローチャートを参照して、コンテンツ出力装置３の動作について説明する（適宜、図１参照）。
まず、コンテンツ出力装置３は、受信手段９によって、コンテンツ、代替提示コンテンツ、分割メタデータ、必要リストメタデータ、グループ化メタデータ、相関メタデータおよび時間情報を受信する（ステップＳ１）。 <Operation of content output device>
Next, the operation of the content output apparatus 3 will be described with reference to the flowcharts shown in FIGS. 2 and 3 (see FIG. 1 as appropriate).
First, the content output device 3 receives the content, the alternative presentation content, the divided metadata, the necessary list metadata, the grouping metadata, the correlation metadata, and the time information by the receiving unit 9 (step S1).

続いて、コンテンツ出力装置３は、判定手段１５によって、分割メタデータで分割された箇所（構成部分）ごとに、コンテンツの必要リストメタデータで記述された条件を、視聴者プロファイルおよび視聴環境プロファイルが満たしているか否かを判定する（ステップＳ２）。コンテンツ出力装置３は、必要リストメタデータを、視聴者プロファイルおよび視聴環境プロファイルが満たしていると判定した場合（ステップＳ２、Ｙｅｓ）には、代替提示する必要がないとして、コンテンツをそのまま出力手段２３に出力し、提示装置２に提示させる（ステップＳ３）。 Subsequently, the content output device 3 uses the determination unit 15 to determine the conditions described in the content required list metadata for each part (component) divided by the divided metadata, based on the conditions in the viewer profile and the viewing environment profile. It is determined whether or not the condition is satisfied (step S2). When the content output device 3 determines that the viewer list and the viewing environment profile satisfy the necessary list metadata (Yes in step S2), the content output device 3 outputs the content as it is, as it is not necessary to present the alternative list. To be presented on the presentation device 2 (step S3).

また、コンテンツ出力装置３は、必要リストメタデータを、視聴者プロファイルおよび視聴環境プロファイルが満たしていると判定しなかった場合（ステップＳ２、Ｎｏ）には、満たしていない箇所（構成部分）について、コンテンツの構成部分のグループ化メタデータと、同一のグループ化メタデータを持つコンテンツの構成部分があるか否かを判定する（ステップＳ４）。 In addition, when the content output device 3 does not determine that the required list metadata is satisfied by the viewer profile and the viewing environment profile (step S2, No), the content output device 3 It is determined whether there is a content component having the same grouping metadata as the grouping metadata of the content component (step S4).

コンテンツ出力装置３は、コンテンツの構成部分のグループ化メタデータと、同一のグループ化メタデータを持つコンテンツの構成部分があると判定した場合（ステップＳ４、Ｙｅｓ）には、代替提示する必要がないとして、コンテンツをそのまま出力手段２３に出力し、提示装置２に提示させる（ステップＳ３）。コンテンツ出力装置３は、コンテンツの構成部分のグループ化メタデータと、同一のグループ化メタデータをもつコンテンツの構成部分があると判定しなかった場合（ステップＳ４、Ｎｏ）満たしていない箇所について、コンテンツの構成部分の相関メタデータにより相関を持つコンテンツの構成部分があるか否かを判定する（ステップＳ５）。 If the content output device 3 determines that there is a content component having the same grouping metadata as the grouping metadata of the content component (step S4, Yes), there is no need to present an alternative. Then, the content is output as it is to the output means 23 and is presented on the presentation device 2 (step S3). If the content output device 3 does not determine that there is a content component having the same grouping metadata as the grouping metadata of the content component (step S4, No), the content output device 3 It is determined whether or not there is a content component having a correlation based on the correlation metadata of the component (step S5).

コンテンツ出力装置３は、コンテンツの構成部分の相関メタデータにより相関を持つコンテンツの構成部分があると判定した場合（ステップＳ５、Ｙｅｓ）には、代替提示する必要がないとして、コンテンツをそのまま出力手段２３に出力し、提示装置２に提示させる（ステップＳ３）。コンテンツ出力装置３は、コンテンツの構成部分の相関メタデータにより相関を持つコンテンツの構成部分があると判定しなかった場合（ステップＳ５、Ｎｏ）には、満たしていない箇所について、代替提示コンテンツのグループ化メタデータと、同一のグループメタデータを持つ代替提示コンテンツがあるか否かを判定する（ステップＳ６）。 If the content output device 3 determines that there is a content component having a correlation based on the correlation metadata of the content component (step S5, Yes), the content output device 3 outputs the content as it is, as it is not necessary to present an alternative. It outputs to 23 and makes the presentation apparatus 2 present (step S3). If the content output device 3 does not determine that there is a content component having a correlation based on the correlation metadata of the content component (step S5, No), the group of alternative presentation content for the unsatisfied part It is determined whether or not there is alternative presentation content having the same group metadata as the generalized metadata (step S6).

そして、コンテンツ出力装置３は、コンテンツの構成部分のグループ化メタデータと、同一のグループメタデータを持つ代替提示コンテンツがあると判定した場合（ステップＳ６、Ｙｅｓ）には、判定手段１６によって、代替提示コンテンツの必要リストメタデータを、視聴者プロファイルおよび視聴環境プロファイルが満たしているか否かを判定する（ステップＳ７）。そして、コンテンツ出力装置３は、代替提示コンテンツの必要リストメタデータを視聴者プロファイルおよび視聴環境プロファイルが満たしていると判定した場合（ステップＳ７、Ｙｅｓ）には、導出手段１７によって、代替提示を実行する（ステップＳ８）。 When the content output device 3 determines that there is alternative presentation content having the same group metadata as the grouping metadata of the constituent parts of the content (step S6, Yes), the determination unit 16 performs the replacement. It is determined whether the viewer list and the viewing environment profile satisfy the required list metadata of the presented content (step S7). When the content output device 3 determines that the viewer list and the viewing environment profile satisfy the required list metadata of the alternative presentation content (Yes in step S7), the content output device 3 executes the alternative presentation by the derivation unit 17. (Step S8).

また、コンテンツ出力装置３は、コンテンツの構成部分のグループ化メタデータと、同一のグループメタデータを持つ代替提示コンテンツがあると判定しなかった場合（ステップＳ６、Ｎｏ）並びに代替提示コンテンツの必要リストメタデータを視聴者プロファイルおよび視聴環境プロファイルが満たしていると判定しなかった場合（ステップＳ７、Ｎｏ）には、満たしていない箇所について、「包含される」相関を持つ代替提示コンテンツがあるか否かを判定する（ステップＳ９）。 In addition, the content output apparatus 3 does not determine that there is alternative presentation content having the grouping metadata of the content component and the same group metadata (No in step S6) and the necessary list of alternative presentation content If it is not determined that the viewer profile and the viewing environment profile satisfy the metadata (step S7, No), whether or not there is alternative presentation content having a “included” correlation for the unsatisfied portion Is determined (step S9).

また、コンテンツ出力装置３は、「包含される」相関を持つ代替提示コンテンツがあると判定されなかった場合（ステップＳ９、Ｎｏ）、満たしていない箇所について、「相似度」の相関を持つ代替提示コンテンツがあるか否かを判定する（ステップＳ１０）。
なお、コンテンツ出力装置３は、ステップＳ９にて、「包含される」相関を持つ代替提示コンテンツがあると判定された場合（ステップＳ９、Ｙｅｓ）には、判定手段１５によって、代替提示コンテンツの必要リストメタデータを、視聴者プロファイルおよび視聴環境プロファイルが満たしているか否かを判定し（ステップＳ１１）、満たしていれば（ステップＳ１１、Ｙｅｓ）ステップＳ８に進み、満たしていなければ（ステップＳ１１、Ｎｏ）、ステップＳ１０に進む。 In addition, when it is not determined that there is alternative presentation content having “included” correlation (No in step S9), the content output device 3 provides alternative presentation having “similarity” correlation for unsatisfied parts. It is determined whether there is content (step S10).
If the content output device 3 determines in step S9 that there is alternative presentation content having “included” correlation (Yes in step S9), the determination unit 15 needs the alternative presentation content. It is determined whether or not the list metadata satisfies the viewer profile and the viewing environment profile (step S11), and if satisfied (step S11, Yes), the process proceeds to step S8, and if not satisfied (step S11, No) ), Go to step S10.

また、コンテンツ出力装置３は、ステップ１０にて、「相似度」の相関を持つ代替提示コンテンツがあると判定された場合（ステップＳ１０、Ｙｅｓ）には、判定手段１５によって、代替提示コンテンツの必要リストメタデータを、視聴者プロファイルおよび視聴環境プロファイルが満たしているか否かを判定し（ステップＳ１２）、満たしていれば（ステップＳ１２、Ｙｅｓ）ステップＳ８に進み、満たしていなければ（ステップＳ１２、Ｎｏ）、ステップＳ１０に戻る。 If the content output device 3 determines in step 10 that there is an alternative presentation content having a correlation of “similarity” (step S10, Yes), the determination unit 15 needs the alternative presentation content. It is determined whether or not the list metadata satisfies the viewer profile and the viewing environment profile (step S12), and if satisfied (step S12, Yes), the process proceeds to step S8, and if not satisfied (step S12, No) ), The process returns to step S10.

そして、コンテンツ出力装置３は、「相似度」の相関を持つ代替提示コンテンツがあると判定されなかった（ステップＳ１０、Ｎｏ）には、代替提示不可能として、コンテンツをそのまま出力手段２３に出力し、提示装置２に提示させる（ステップＳ１３）。 Then, if it is not determined that there is alternative presentation content having a correlation of “similarity” (No in step S10), the content output device 3 outputs the content as it is to the output unit 23 as alternative presentation is impossible. Then, it is presented to the presentation device 2 (step S13).

〈視聴者プロファイルおよび視聴環境プロファイルの具体例〉
次に、図４を参照して、視聴者プロファイルおよび視聴環境プロファイルに含まれる情報（項目）の具体例について説明する。
図４に示したように、視聴者プロファイルには、知覚力と認識力と理解力とが定義されており、知覚力として、視覚、聴覚および触覚が定義されており、認識力として、言語聞き取り（日本語）および言語読み取り（日本語、英語）が定義されており、理解力として、言語理解（日本語、英語）および情景理解（ある映像や音声を受容したときに特定の状況等を想起する能力。例えば、「風鈴の映像」と「セミの鳴き声」とから構成されるシーンを受容したときに、「田舎の夏」を想起する能力）が定義されている。 <Specific examples of viewer profile and viewing environment profile>
Next, specific examples of information (items) included in the viewer profile and the viewing environment profile will be described with reference to FIG.
As shown in FIG. 4, the viewer profile defines perceptual power, cognitive power, and comprehension power. Visual, auditory, and tactile sensations are defined as the perceptual power. (Japanese) and language reading (Japanese, English) are defined, and as comprehension, language understanding (Japanese, English) and scene understanding (recalling a specific situation when a certain video or voice is received) For example, an ability to recall “summer in the country” when a scene composed of “wind chimes video” and “semi cry” is defined.

また、視聴環境プロファイルには、変換生成装置（変換手段１９）と提示装置（コンテンツ出力装置３に接続される提示装置２）との名称が記述されており、変換生成装置として、テキスト文音声合成装置、字幕スーパー装置およびテキスト文点字化装置が含まれており、提示装置として、映像ディスプレイ、スピーカ、点字ディスプレイおよび触覚ピンディスプレイが含まれている。 The viewing environment profile describes the names of the conversion generation device (conversion means 19) and the presentation device (presentation device 2 connected to the content output device 3). A device, a caption super device, and a text sentence Braille device are included, and a video display, a speaker, a Braille display, and a tactile pin display are included as presentation devices.

このような情報を含んだ、視聴者プロファイルおよび視聴環境プロファイルをプロファイル保持手段１３に保持しておくことで、コンテンツ出力装置３は、視聴者の視聴能力および視聴環境に対応したコンテンツの提示を実現することができる。 By holding the viewer profile and the viewing environment profile including such information in the profile holding unit 13, the content output device 3 realizes the presentation of content corresponding to the viewer's viewing ability and viewing environment. can do.

〈導出手段、変換手段による代替え提示の具体例〉
次に、図５〜図８（具体例１：ニュース番組）および図９〜図１３（具体例２：野球中継番組）を参照して、コンテンツ出力装置３における代替え提示の具体例について説明する。
図５は、映像ディスプレイに表示されているニュース番組の１シーンで、キャスターが原稿を読んでいるシーンを示したものである。図６は、図５に示したシーンに、代替え提示がなされた結果（代替提示コンテンツが付加された結果）を示したものである。図７は、図５に示したシーンに相当するコンテンツの構成部分および代替提示コンテンツと、付加されているメタデータ（多重されていないものも含む）とついて説明したものである。図８は、図７に示したメタデータが付加されるコンテンツの構成部分を定義する分割メタデータの例を示したものである。 <Specific examples of alternative presentation by deriving means and converting means>
Next, specific examples of alternative presentation in the content output apparatus 3 will be described with reference to FIGS. 5 to 8 (specific example 1: news program) and FIGS. 9 to 13 (specific example 2: baseball broadcast program).
FIG. 5 shows a scene in which a caster is reading a manuscript in one scene of a news program displayed on the video display. FIG. 6 shows the result of the alternative presentation (the result of adding the alternative presentation content) to the scene shown in FIG. FIG. 7 explains the constituent parts of the content corresponding to the scene shown in FIG. 5 and alternative presentation content, and added metadata (including non-multiplexed metadata). FIG. 8 shows an example of divided metadata that defines the constituent parts of the content to which the metadata shown in FIG. 7 is added.

これら図５および図６に示したように、ニュース番組において、キャスターが読み上げている音声「昨日の・・・について、・・・・・・であることがわかりました。」（図５参照）を、視聴者が理解できない場合（視聴者が聴覚障害者であり、視聴者プロファイルの視聴能力の聴覚がＮＧ）、または、音声提示が困難な状況にある場合（音声提示がＮＧ）である場合に、当該音声をテキストデータとして、映像ディスプレイの画面上のキャスターの近くに表示させている（図６参照）。 As shown in FIG. 5 and FIG. 6, in the news program, the voice read by the caster “I understand that yesterday's ...” (see FIG. 5). If the viewer cannot understand (the viewer is hearing-impaired and hearing of the viewing ability of the viewer profile is NG), or the voice presentation is difficult (sound presentation is NG) In addition, the voice is displayed as text data near a caster on the screen of the video display (see FIG. 6).

このような代替え提示を行うために、図７に示したコンテンツの構成部分および代替提示コンテンツとメタデータとが定義されており、この図７では、「コンテンツの構成部分および代替提示コンテンツの名称」（メタデータとして多重化されていない）と、コンテンツの構成部分か代替提示コンテンツかを示す「本来ｏｒ代替」と、コンテンツの構成部分か代替提示コンテンツについて、制作者が制作時にグループ化を行う際に考慮した意図である「この構成部分の持つ意図」と、「グループ化メタデータ」と、「必要リストメタデータ」との関係が定義されている。 In order to perform such alternative presentation, the content components shown in FIG. 7 and the alternative presentation content and metadata are defined. In FIG. 7, “names of content components and alternative presentation content” are defined. When the creator groups the “original or alternative” indicating whether it is a content component or alternative presentation content and the content component or alternative presentation content at the time of production (not multiplexed as metadata) The relationship between “the intention of this component”, “grouping metadata”, and “necessary list metadata” is defined.

この図７において、「コンテンツの構成部分および代替提示コンテンツの名称」が“アナ音声”は、コンテンツ（本来コンテンツ）の構成部分を示しており、このコンテンツの構成部分の「この構成部分の持つ意図」が“あるニュースに関する詳細内容を理解させる”であり、「グループ化メタデータ」として“＃Ａ”が定義されており、さらに、「必要リストメタデータ」（どの手段によって伝えられるか）として“スピーカ”が定義されている。 In FIG. 7, the “name of the content component and alternative presentation content” “ana voice” indicates the content (original content) component, and the “intention of this component” of the content component. "Is to understand the details of a certain news", "#A" is defined as "grouping metadata", and "required list metadata" (which means is used to convey) " "Speaker" is defined.

また、この図７において、例えば、「コンテンツの構成部分および代替提示コンテンツの名称」が“原稿文テキストデータ（代替用時系列テキスト）”は、代替提示コンテンツを示しており、この代替提示コンテンツの「この構成部分の持つ意図」が“あるニュースに関する詳細内容を理解させる”であり、「グループ化メタデータ」として“＃Ａ”が定義されており、さらに、「必要リストメタデータ」として、“時系列テキスト提示装置”、“日英翻訳装置”、“音声合成装置”、“スピーカ”が定義されている。なお、この「必要リストメタデータ」の一段目に記載してある“時系列テキスト提示装置”が提示装置を示しており、二段目および三段目に記載してある“日英翻訳装置”、“音声合成装置”が変換装置を示している。 In FIG. 7, for example, “original text data (alternative time-series text)” of “content component part and alternative presentation content name” indicates the alternative presentation content. “Intention of this component” is “Let me understand the detailed contents about a certain news”, “#A” is defined as “Grouping metadata”, and “Necessary list metadata” is “ “Time-series text presentation device”, “Japanese-English translation device”, “speech synthesizer”, and “speaker” are defined. The “time-series text presentation device” described in the first row of the “necessary list metadata” indicates the presentation device, and the “Japanese-English translation device” described in the second and third rows. "Speech synthesizer" indicates a conversion device.

すなわち、この代替提示コンテンツの“原稿文データ（代替用時系列テキスト）”では、必要リストメタデータの中に、３つの提示手法（提示の仕方）が定義されており、視聴者プロファイルおよび視聴環境プロファイルを参照した結果、必要リストメタデータの中で「時系列テキスト提示装置」が選択される場合には、導出手段１７によって、コンテンツの構成部分の代わりに代替提示コンテンツが導出され、さらに、変換手段１９に相当する日英翻訳装置や音声合成装置によって変換された上で提示されることになる。 That is, in the “original text data (substitution time-series text)” of the alternative presentation content, three presentation methods (presentation methods) are defined in the required list metadata, and the viewer profile and the viewing environment are defined. As a result of referring to the profile, when “time series text presentation device” is selected in the required list metadata, the alternative presentation content is derived by the deriving means 17 instead of the component part of the content, and further converted. It is presented after being converted by a Japanese-English translation device or a speech synthesizer corresponding to the means 19.

また、図８はコンテンツを構成部分に分割する分割メタデータの定義であり、例えば、図７に示した“アナ音声”の構成部分は時間位置として、開始時刻Ｘ〜終了時刻Ｙの間に、空間位置として、オーディオｃｈＡ、周波数帯域ＢＢ〜ＣＣの領域に付加されることとなる。また、例えば、“項目スーパー”の構成部分は、時間位置として、開始時刻Ｘ〜終了時刻Ｙの間に、空間位置として、ビデオ（映像上）の座標範囲ＤＤ〜ＥＥに付加されることとなる。 FIG. 8 is a definition of divided metadata that divides content into constituent parts. For example, the constituent part of “ana voice” shown in FIG. 7 is a time position between a start time X and an end time Y. As a spatial position, it is added to the area of audio chA and frequency bands BB to CC. Further, for example, the component part of “item super” is added as a time position between the start time X and the end time Y as a spatial position in the coordinate range DD to EE of the video (on the video). .

次に、図９〜図１３（具体例２：野球中継番組）を参照して、コンテンツ出力装置３における代替え提示の具体例について説明する。
図９は、野球中継番組において、通常の提示を行う場合と、音を聞けない人または聞いていない人向けの提示を行う場合と、字幕ディスプレイで見ている人向きの提示を行う場合とについて、時間経過に伴う制作者の意図と、映像、音声および字幕との関係について示した図である。図１０は、野球中継番組を表示する際に、視聴者（ここでは、視聴者Ａ、視聴者Ｂ）に応じて、提示の仕方を変更した例を示した図である。図１１は、野球中継番組のあるシーンに付加されているメタデータを一覧表にしたものである。図１２は、図１１に示したメタデータに従って、野球中継番組の代替え提示（表示画面が見られない場合、音声のみ）が行われた例について示した図である。図１３は、図１１に示したメタデータに従って、野球中継番組の代替え提示（表示画面が見られるが、ボールが判別できない程度に当該表示画面が小さい場合、音声のみ）が行われた例について示した図である。 Next, specific examples of alternative presentation in the content output device 3 will be described with reference to FIGS. 9 to 13 (specific example 2: baseball broadcast program).
FIG. 9 shows a case where a normal presentation is performed in a baseball broadcast program, a case where presentation is made for a person who cannot hear or hear a sound, and a case where presentation is made for a person watching on a caption display. FIG. 4 is a diagram showing a relationship between a producer's intention with time and video, audio, and subtitles. FIG. 10 is a diagram showing an example in which the way of presentation is changed according to the viewer (here, viewer A and viewer B) when displaying the baseball broadcast program. FIG. 11 shows a list of metadata added to a scene with a baseball broadcast program. FIG. 12 is a diagram showing an example in which an alternative presentation of a baseball broadcast program (only a voice is displayed when a display screen is not seen) is performed according to the metadata shown in FIG. FIG. 13 shows an example in which an alternative presentation of a baseball broadcast program (only the voice is displayed when the display screen is small enough to prevent the ball from being discriminated) according to the metadata shown in FIG. It is a figure.

図９に示したように、野球中継番組において、通常の提示を行う場合、映像と音声とが提示されることで、制作者の意図が伝えられ、音声内容を文字化した字幕は提示されない。なお、映像（中継映像）が表現している意図（制作者の意図）は「試合内容を伝える」ことであり、音声が表現している意図のうち、実況部分については「試合内容を伝える」ことであり、解説部分については「試合に関する解説を伝える」ことである。 As shown in FIG. 9, in the case of normal presentation in a baseball broadcast program, the intention of the producer is conveyed by the presentation of video and audio, and the subtitles in which the audio content is transcribed are not presented. In addition, the intention (creator's intention) expressed in the video (relay video) is “tell the game content”, and among the intentions expressed by the voice, the actual part is “tell the game content” As for the commentary part, it is “to convey commentary about the game”.

また、野球中継番組において、音を聞けない人または聞いていない人向けの提示を行う場合、映像と音声とが提示され、映像については制作者の意図が伝えられるが、音声については認識されない。また、音声内容を文字化した字幕については、実況部分については提示されず、解説部分には提示され、制作者の意図が伝えられる。 In addition, in a baseball broadcast program, when a presentation is made for a person who cannot hear sound or for a person who has not heard, a video and a voice are presented, and the intention of the producer is conveyed for the video, but the voice is not recognized. In addition, the subtitles with the audio content converted into text are not presented in the live part, but are presented in the commentary part to convey the intention of the producer.

さらに、野球中継番組において、字幕ディスプレイで見ている人向きの提示を行う場合、映像と音声とは共に提示されない。また、音声内容を文字化した字幕については、全て提示され、制作者の意図が伝えられる。 Furthermore, in the baseball broadcast program, when the presentation for the person watching on the caption display is performed, neither video nor audio is presented. In addition, all subtitles in which the audio content is converted to text are presented and the intention of the producer is conveyed.

ここで、コンテンツ出力装置３における代替え提示と、従来の提示手法とを比較すると、従来の提示手法では、表示画面を見れば分かること、例えば、音声の実況部分についても音声内容を文字化した字幕を表示しようとして煩雑になるか、逆に、音声の解説部分だけを字幕で表示されるように送信しておくと、当該字幕だけを見ている人には、試合内容が伝わらないということが生じていた。そこで、このコンテンツ出力装置３における代替え提示では、コンテンツ送信装置１にて、構成部分ごとにグループ化メタデータおよび必要リストメタデータを付随させておいて、映像からなる構成部分と同じ意図を持つ（つまり、同じグループ化メタデータを持つ）音声からなる構成部分については、認識されなくても代替え提示を行わないように、当該装置３が判定することで、様々な視聴状況において視聴者に応じた提示を行うことができる。 Here, when the alternative presentation in the content output device 3 is compared with the conventional presentation method, the conventional presentation method can be understood by looking at the display screen, for example, subtitles in which the audio content is also transcribed in the live part of the audio If you send only the commentary part of the audio to be displayed in subtitles, the content of the game will not be transmitted to those who are watching only the subtitles. It was happening. Therefore, in the alternative presentation in the content output device 3, the content transmitting device 1 attaches grouping metadata and necessary list metadata to each constituent part and has the same intention as the constituent part composed of video ( In other words, the device 3 determines not to make a substitute presentation even if it is not recognized with respect to a constituent part composed of audio (having the same grouping metadata), so that it can respond to the viewer in various viewing situations. You can make a presentation.

図１０に示したように、野球中継番組において、ピッチャーがボールを投げるシーンの映像の一部（領域が座標指定されている箇所）で、制作者の意図が「ＳＢＯ状況を伝える」である場合に、同じ意図を持つ代替提示コンテンツ（ＳＢＯ状況を示した拡大可能なデータ）を用意しておく。なお、ここで領域とは、ＳＢＯが表示されている領域を囲む座標である。そして、コンテンツ出力装置３における代替提示において、視聴者Ａには、本来コンテンツに含まれるＳＢＯが常に表示されているため代替用テキストデータを提示せず、視聴者Ｂには、必要なときに代替用テキストデータを提示している。 As shown in FIG. 10, in a baseball broadcast program, when the intention of the producer is “tell SBO status” in a part of the video of the scene where the pitcher throws the ball (the area is designated by coordinates) In addition, alternative presentation contents having the same intention (expandable data indicating the SBO status) are prepared. Here, the area is a coordinate surrounding the area where the SBO is displayed. In the alternative presentation in the content output device 3, the viewer A does not present alternative text data because the SBO originally included in the content is always displayed, and the viewer B substitutes when necessary. Text data is presented.

すなわち、本来映像（本来コンテンツ）中において、本来映像を拡大した結果、制作者の意図である「ＳＢＯ状況を伝える」部分が表示画面に収まりきらずに表示されないので、代替用テキストデータをスーパーする。また、本来映像（本来コンテンツ）中において、本来映像を拡大した結果、制作者の意図である「ＳＢＯ状況を伝える」部分が、表示画面に収まっている場合には、代替用テキストデータをスーパーしない。 That is, as a result of enlarging the original video in the original video (original content), the “informed SBO status” part intended by the producer is not displayed on the display screen, so the alternative text data is superposed. In addition, if the portion of the original video (original content) that is originally intended to expand the video and the “speaking SBO status” that is the intention of the producer is within the display screen, the alternative text data is not superposed. .

また、図１１に示したように、シーンに相当するコンテンツの構成部分および代替提示コンテンツと、付加されているメタデータ（多重されていないものも含む）とが定義されていることで、図１２また図１３に示した代替え提示が可能になっている。
つまり、図１２では、野球中継番組において、視聴者が表示画面を見られない際の代替え提示を実現した場合を示しており、図１３では、野球中継番組において、視聴者がバッターの状態やボールカウントを判別できない程、表示画面が小さい際の代替え提示を実現した場合を示している。 Also, as shown in FIG. 11, the constituent parts of content corresponding to a scene and alternative presentation content, and added metadata (including non-multiplexed metadata) are defined, so that FIG. Further, the alternative presentation shown in FIG. 13 is possible.
That is, FIG. 12 shows a case where an alternative presentation is realized when the viewer cannot see the display screen in the baseball relay program, and FIG. 13 shows that the viewer is in the batter state or the ball in the baseball relay program. This shows a case where an alternative presentation is realized when the display screen is so small that the count cannot be determined.

具体的に説明すると、図１１において、本来コンテンツの映像ディスプレイによる提示に関する構成部分（付加させる元）である「中継映像」（グループ化メタデータ＃Ａ）、「中継映像ピッチング」（グループ化メタデータ＃Ｂ）、「中継映像ヒッティング」（グループ化メタデータ＃Ｃ、＃Ｄ）、「中継映像ランナー１塁へ」（グループ化メタデータ＃Ｅ）、「ＳＢＯスーパー」（グループ化メタデータ＃Ｆ）が映像ディスプレイを提示装置２として必要とし、表示画面に表示されるものであるので、提示不可となる。 More specifically, in FIG. 11, “relay video” (grouped metadata #A) and “relay video pitching” (grouped metadata), which are constituent parts (sources to be added) originally related to presentation of content on the video display. #B), “relay video hitting” (grouped metadata #C, #D), “to relay video runner 1” (grouped metadata #E), “SBO super” (grouped metadata #F) ) Requires a video display as the presentation device 2 and is displayed on the display screen, and cannot be presented.

そして、これらの構成部分と同じ相関メタデータを持つ構成部分である「球場ＳＥ」（グループ化メタデータ＃Ａ、＃Ｃ）、「実況音声「さぁカウントはツースリー」」（グループ化メタデータ＃Ｇ）、「実況音声「投げた」」（グループ化メタデータ＃Ｂ）、「実況音声「打ちました！」」（グループ化メタデータ＃Ｃ）、「実況音声「レフト前ヒット」」（グループ化メタデータ＃Ｄ、＃Ｅ）がスピーカを提示装置２として必要とし、音声で出力されるものである。この中で、「球場ＳＥ」（グループ化メタデータ＃Ａ、＃Ｃ）は本来コンテンツであるので、代替え提示が必要となる構成部分は、「中継映像ピッチング」（グループ化メタデータ＃Ｂ）、「中継映像ヒッティング」（グループ化メタデータ＃Ｄ）、「中継映像ランナー１塁へ」（グループ化メタデータ＃Ｅ）である。 Then, “ball field SE” (grouped metadata #A, #C), which is a component having the same correlation metadata as these components, “live voice“ Now count is two-three ”” (grouped metadata #G ), “Live audio“ I threw ”” (Grouped metadata #B), “Live audio“ I hit it! ”(Grouped metadata #C),“ Live audio “Hit before left” ”(Grouped) Metadata #D and #E) require a speaker as the presentation device 2 and are output as audio. Among these, since “ballpark SE” (grouped metadata #A, #C) is originally content, the components that require alternative presentation are “relay video pitching” (grouped metadata #B), “Relay video hitting” (grouped metadata #D), “To relay video runner 1 塁” (grouped metadata #E).

これら代替え提示が必要な構成部分の中で、グループ化メタデータが＃Ｂ、＃Ｄ、＃Ｅのものについては、代替提示コンテンツである実況音声「投げた」、「打ちました」、「レフト前ヒット」があり、それぞれのグループ化メタデータが「投げたことを理解させる」（＃Ｂ）、「打球の方向を理解させる」（♯Ｄ）、「シングヒットであることを理解させる」（＃Ｅ）であるので、代替え提示が行える。 Among the components that require alternative presentation, those with grouping metadata of #B, #D, and #E are the actual presentation voices “throwed”, “hit”, “left” which are alternative presentation contents There is a “pre-hit”, and each grouping metadata “makes you understand that you threw” (#B), “makes you understand the direction of the hit ball” (#D), “makes you understand that it is a single hit” ( #E), an alternative presentation can be made.

つまり、本来コンテンツの構成部分「中継映像ピッチング」（グループ化メタデータ＃Ｂ）は、代替提示コンテンツの構成部分「実況音声「投げた」」（グループ化メタデータ＃Ｂ）に、本来コンテンツの構成部分「中継映像ヒッティング」（グループ化メタデータ＃Ｄ）は、代替提示コンテンツの構成部分「実況音声「レフト前ヒット」」（グループ化メタデータ＃Ｄ）に、本来コンテンツの構成部分「中継映像ランナー１塁へ」（グループ化メタデータ＃Ｅ）は、代替提示コンテンツの構成部分「実況音声「レフト前ヒット」」（グループ化メタデータ＃Ｅ）に代替え提示されることとなる。 That is, the original content component “relay video pitching” (grouped metadata #B) is replaced with the alternative presentation content component “live speech“ threw ”(grouped metadata #B). The part “relay video hitting” (grouped metadata #D) is included in the constituent part “relay video” of the original content in the constituent part “actual voice“ hit before left ”” (grouped metadata #D) of the alternative presentation content. “To Runner 1」 ”(grouped metadata #E) will be presented in place of the constituent part“ actual voice “hit before left” ”(grouped metadata #E) of the alternative presentation content.

また、代替え提示が必要な構成部分の中で、本来コンテンツの構成部分「ＳＢＯスーパー」（グループ化メタデータ＃Ｆ）については、図１０に示したように代替え提示が行えるが、ここでは、図１２に示したように、視聴者が表示画面を見られない状況にあるので、グループ化メタデータ＃Ｆと相関のあるグループ化メタデータ＃Ｇの代替提示コンテンツである「実況音声「さぁカウントはツースリー」」による代替え提示が行われることとなる。 Further, among the components that need to be replaced, the component “SBO Super” (grouped metadata #F) of the original content can be replaced as shown in FIG. As shown in FIG. 12, since the viewer cannot see the display screen, the “present condition voice”, which is an alternative presentation content of the grouping metadata #G correlated with the grouping metadata #F, An alternative presentation by “two three” ”will be performed.

また、図１３に示した、野球中継番組において、視聴者がバッターの状態やボールカウントを判別できない程、表示画面が小さい際の代替え提示では、図１１において、本来コンテンツの映像ディスプレイによる提示に関する構成部分（付加させる元）である「中継映像ヒッティング」（グループ化メタデータ＃Ｃ、＃Ｄ）、「中継映像ランナー１塁へ」（グループ化メタデータ＃Ｅ）、「ＳＢＯスーパー」（グループ化メタデータ＃Ｆ）が映像ディスプレイを提示装置２として必要とし、表示画面に表示されるものであるので、提示不可となる。 Further, in the alternative presentation when the display screen is so small that the viewer cannot determine the batter state and the ball count in the baseball broadcast program shown in FIG. “Relay video hitting” (grouped metadata #C, #D), “To relay video runner 1” (grouped metadata #E), “SBO super” (grouped) Since the metadata #F) requires a video display as the presentation device 2 and is displayed on the display screen, it cannot be presented.

そして、これらの構成部分と同じ相関メタデータを持つ構成部分である「球場ＳＥ」（グループ化メタデータ＃Ａ、＃Ｃ）、「実況音声「さぁカウントはツースリー」」（グループ化メタデータ＃Ｇ）、「実況音声「投げた」」（グループ化メタデータ＃Ｂ）、「実況音声「打ちました！」」（グループ化メタデータ＃Ｃ）、「実況音声「レフト前ヒット」」（グループ化メタデータ＃Ｄ、＃Ｅ）がスピーカを提示装置２として必要とし、音声で出力されるものである。この中で、「球場ＳＥ」（グループ化メタデータ＃Ａ、＃Ｃ）は本来コンテンツであるので、代替え提示が必要となる構成部分は、「中継映像ヒッティング」（グループ化メタデータ＃Ｄ）、「中継映像ランナー１塁へ」（グループ化メタデータ＃Ｅ）、「ＳＢＯスーパー」（グループ化メタデータ＃Ｆ）である。 Then, “ball field SE” (grouped metadata #A, #C), which is a component having the same correlation metadata as these components, “live voice“ Now count is two-three ”” (grouped metadata #G ), “Live audio“ I threw ”” (Grouped metadata #B), “Live audio“ I hit it! ”(Grouped metadata #C),“ Live audio “Hit before left” ”(Grouped) Metadata #D and #E) require a speaker as the presentation device 2 and are output as audio. Among them, “baseball field SE” (grouped metadata #A, #C) is inherently content, so the component that needs to be replaced is “relay video hitting” (grouped metadata #D). “To relay video runner 1” (grouped metadata #E) and “SBO super” (grouped metadata #F).

そこで、本来コンテンツの構成部分「中継映像ヒッティング」（グループ化メタデータ＃Ｄ）は、代替提示コンテンツの構成部分「実況音声「レフト前ヒット」」（グループ化メタデータ＃Ｄ）に、本来コンテンツの構成部分「中継映像ランナー１塁へ」（グループ化メタデータ＃Ｅ）は、代替提示コンテンツの構成部分「実況音声「レフト前ヒット」」（グループ化メタデータ＃Ｅ）に代替え提示されることとなる。 Therefore, the original content component “relay video hitting” (grouped metadata #D) is replaced with the original content “replacement voice“ pre-left hit ”” (grouped metadata #D). The component part “To the relay video runner 1” (grouped metadata #E) is to be presented in place of the component part “actual voice“ hit before left ”” (grouped metadata #E) of the alternative presentation content It becomes.

以上、本発明の実施形態について説明したが、本発明は前記実施形態には限定されない。例えば、本実施形態では、コンテンツ送信装置１、コンテンツ出力装置３として構成したが、これらの構成の処理を実現可能に、汎用的または特殊なコンピュータ言語で記述したコンテンツ送信プログラム、コンテンツ受信プログラムとして構成することも可能である。これらの場合、コンテンツ送信装置１、コンテンツ出力装置３と同様の効果を得ることができる。 As mentioned above, although embodiment of this invention was described, this invention is not limited to the said embodiment. For example, in the present embodiment, the content transmission device 1 and the content output device 3 are configured. However, the content transmission program and the content reception program described in a general-purpose or special computer language can be realized so that the processing of these configurations can be realized. It is also possible to do. In these cases, the same effects as those of the content transmission device 1 and the content output device 3 can be obtained.

本発明の実施形態に係るコンテンツ送信出力システムのブロック図である。It is a block diagram of the content transmission output system which concerns on embodiment of this invention. 図１に示したコンテンツ出力装置の動作を説明するためのフローチャートである（前半）。3 is a flowchart for explaining the operation of the content output apparatus shown in FIG. 1 (first half). 図１に示したコンテンツ出力装置の動作を説明するためのフローチャートである（後半）。3 is a flowchart for explaining the operation of the content output apparatus shown in FIG. 1 (second half). 視聴者プロファイルと視聴環境プロファイルとの具体例を示した図である。It is the figure which showed the specific example of the viewer profile and the viewing environment profile. 代替え提示の具体例（具体例１：ニュース番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 1: news program) of alternative presentation. 代替え提示の具体例（具体例１：ニュース番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 1: news program) of alternative presentation. 代替え提示の具体例（具体例１：ニュース番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 1: news program) of alternative presentation. 代替え提示の具体例（具体例１：ニュース番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 1: news program) of alternative presentation. 代替え提示の具体例（具体例２：野球中継番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 2: baseball broadcast program) of alternative presentation. 代替え提示の具体例（具体例２：野球中継番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 2: baseball broadcast program) of alternative presentation. 代替え提示の具体例（具体例２：野球中継番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 2: baseball broadcast program) of alternative presentation. 代替え提示の具体例（具体例２：野球中継番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 2: baseball broadcast program) of alternative presentation. 代替え提示の具体例（具体例２：野球中継番組）を説明するための図である。It is a figure for demonstrating the specific example (specific example 2: baseball broadcast program) of alternative presentation.

Explanation of symbols

１コンテンツ送信装置
２提示装置
３コンテンツ出力装置
５入力手段
７送信手段
９受信手段
１１コンテンツ・メタデータ蓄積手段
１３プロファイル保持手段
１５判定手段
１７導出手段
１９変換手段
２１再構成手段
２３出力手段 DESCRIPTION OF SYMBOLS 1 Content transmission apparatus 2 Presentation apparatus 3 Content output apparatus 5 Input means 7 Transmission means 9 Receiving means 11 Content metadata storage means 13 Profile holding means 15 Determination means 17 Derivation means 19 Conversion means 21 Reconfiguration means 23 Output means

Claims

Necessary on the receiving side for content, alternative presentation content to be presented in place of this content, division metadata indicating the division part of the content, and for each constituent part obtained by dividing the content according to the division metadata and for the alternative presentation content Required list metadata that defines the presentation device to be used, the function of the device, and the viewing ability required for the viewer on the receiving side, and grouping meta data for grouping the constituent parts of the content and the alternative presentation content Correlation metadata of each component part and the alternative presentation content provided with the correlation of the data and the grouped metadata in accordance with a preset criterion, and the time of the alternative presentation content attached corresponding to the elapsed time of the content Information and receiving the information on the presentation device. Ntsu or said a content output apparatus that outputs the alternate presentation content,
Receiving means for receiving the content, the alternative presentation content, the split metadata, the required list metadata, the grouping metadata, the correlation metadata, and the time information;
A viewer profile that defines the viewing ability of a viewer who views the content or the alternative presentation content, a presentation device that can present the content or the alternative presentation content, and a viewing environment profile that describes the function of the device are stored. Profile storage means to
The necessary list metadata is compared with the viewer profile and viewing environment profile stored in the profile storage means, and the viewer profile and Determining means for determining whether or not the viewing environment profile satisfies a condition described in the necessary list metadata;
As a result of the determination by the determination means, when the viewer profile and the viewing environment profile do not satisfy the conditions described in the necessary list metadata, the component to which the necessary list metadata that is not satisfied is added Based on the viewer profile, the grouping metadata, the correlation metadata and the time information, the viewing ability is satisfied, and the same grouping metadata is added or correlation metadata within a predetermined range Deriving means for deriving the alternative presentation content to which is added,
Reconstructing means for reconstructing the alternative presentation content derived by the deriving means and the constituent parts of the content;
Output means for outputting the reconstructed content reconstructed by the reconstructing means to the presentation device;
A content output apparatus comprising:

Necessary on the receiving side for content, alternative presentation content to be presented in place of this content, division metadata indicating the division part of the content, and for each constituent part obtained by dividing the content according to the division metadata and for the alternative presentation content Required list metadata that defines the presentation device to be used, the function of the device, and the viewing ability required for the viewer on the receiving side, and grouping meta data for grouping the constituent parts of the content and the alternative presentation content Data, correlation metadata of each of the component parts and the alternative presentation content provided according to a preset criterion for correlation of the grouped metadata, and time information attached corresponding to the elapsed time of the content. The content or the alternative presentation on the presentation device A content output apparatus for outputting content,
Receiving means for receiving the content, the alternative presentation content, the split metadata, the required list metadata, the grouping metadata, the correlation metadata, and the time information;
A viewer profile that defines a viewing ability of a viewer who views the content or the alternative presentation content, a presentation device that can present the content or the alternative presentation content, a function of the device, and a conversion device that can be converted are described. Profile storage means for storing a viewing environment profile;
The necessary list metadata is compared with the viewer profile and viewing environment profile stored in the profile storage means, and the viewer profile and Determining means for determining whether or not the viewing environment profile satisfies a condition described in the necessary list metadata;
As a result of the determination by the determination means, when the viewer profile and the viewing environment profile do not satisfy the conditions described in the necessary list metadata, the component to which the necessary list metadata that is not satisfied is added Based on the viewer profile, the grouping metadata, the correlation metadata and the time information, the viewing ability is satisfied, and the same grouping metadata is added or correlation metadata within a predetermined range Deriving means for deriving the alternative presentation content to which is added,
As a result of the determination by the determination means, when the viewing environment profile does not satisfy the necessary list metadata, the viewer profile and the viewing environment profile for the component to which the necessary list metadata that did not satisfy is added. The alternative presentation content to which the same grouping metadata is added or to which correlation metadata within a predetermined range is added can be output by the conversion device and the viewing capability is satisfied and the viewing is performed. Conversion means for converting into a conversion part that can be transmitted to a person,
Reconstructing means for reconstructing the converted part converted by the converting means, the alternative presentation content derived by the deriving means, and the constituent parts of the content;
Output means for outputting the reconstructed content reconstructed by the reconstructing means to the presentation device;
A content output apparatus comprising:

The content output apparatus according to claim 1 or 2, wherein the content, alternative presentation content to be presented instead of the content, division metadata indicating a division part of the content, and the content and the substitution according to the division metadata The required list metadata that defines the presentation device required on the receiving side for the constituent parts into which the presentation content is divided and the substitute presentation content, the function of the device, and the viewing ability required for the viewer on the receiving side, and the content Grouping metadata for grouping the component parts and the alternative presentation content, and the correlation metadata of the component parts and the alternative presentation content with the correlation of the grouping metadata attached according to a preset criterion , Corresponding to the elapsed time of the content Content transmitting apparatus comprising: a transmitting means for transmitting the the time information.