JP2018092324A

JP2018092324A - Content identification device and program

Info

Publication number: JP2018092324A
Application number: JP2016234338A
Authority: JP
Inventors: 周平田口; Shuhei Taguchi; 大礎遠藤; Hiroki Endo; 松村　欣司; Kinji Matsumura; 欣司松村
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2016-12-01
Filing date: 2016-12-01
Publication date: 2018-06-14

Abstract

PROBLEM TO BE SOLVED: To provide a content identification device and a program for identifying content which is identical to content that a user requests.SOLUTION: A content identification device obtains meta data which is data representing a feature of content accompanying the content for a plurality of pieces of content, and comprises meta data management means for accumulating the meta data while classifying the meta data by a type of the meta data, similarity calculation means for calculating similarity between pieces of the meta data for each type of the meta data accumulated, degree of equivalence calculation means for calculating a degree of equivalence between the content and another content other than the content on the basis of the similarity calculated by the similarity calculation means, and list creation means for creating a list of meta data including an acquiring method of content which is specified as content identical to the specified content on the basis of the degree of equivalence of pieces of the content calculated by the degree of equivalence calculation means.SELECTED DRAWING: Figure 1

Description

本発明は、同一又は同一とみなせるコンテンツを特定するための、コンテンツ同定装置及びプログラムに関する。 The present invention relates to a content identification device and a program for specifying content that can be regarded as identical or identical.

近年、様々な事業者が多様なコンテンツを提供するようになっている。また、これに伴いコンテンツの提供方法も多岐に渡っている。例えば、事業者毎に、地上波放送、衛星放送、及びネット配信といった提供方法が使い分けられている。このため、ユーザに提供されるコンテンツの数は膨大なものとなっている。 In recent years, various businesses have provided various contents. Along with this, there are various ways of providing content. For example, providing methods such as terrestrial broadcasting, satellite broadcasting, and Internet distribution are used for each business operator. For this reason, the number of contents provided to the user is enormous.

そこで、ユーザによるコンテンツの選択を容易とするための技術開発が進められている。このような技術の一例が特許文献１に開示されている。 Therefore, technology development for facilitating selection of content by the user is underway. An example of such a technique is disclosed in Patent Document 1.

特許文献１に開示の技術では、ユーザが録画した録画番組（すなわち、ユーザが興味を抱いた番組）がどのような番組であるかに基づいて、この番組に関連する他の番組を特定する。そして、この特定した他の番組についてもユーザが興味を抱くと考え、この特定した他の番組についての情報を、リコメンド情報としてテレビ画面に表示する。
ユーザは、このリコメンド情報を参照することにより、自身が興味を抱く番組を容易に選択することが可能となる。 In the technique disclosed in Patent Document 1, another program related to this program is specified based on what the recorded program (that is, the program in which the user is interested) recorded by the user is. And it thinks that a user is interested also about this specified other program, and displays information about this specified other program on the television screen as recommendation information.
The user can easily select a program that the user is interested in by referring to the recommendation information.

特開２０１６−１７８６８１号公報JP, 2006-178861, A

特許文献１に開示されているようなリコメンドの技術等を用いることにより、ユーザは、自身が興味を抱く番組を容易に選択することが可能となる。しかしながら、特許文献１に開示されているような技術では、以下に説明するような課題を解決することができなかった。この課題について説明をする。 By using a recommendation technique or the like as disclosed in Patent Document 1, the user can easily select a program in which the user is interested. However, the technique disclosed in Patent Document 1 cannot solve the problems described below. This problem will be described.

現在、同一のコンテンツが異なるメディアで配信される場合がある。例えば、或る番組が地上波で放送された後に、同じ番組がインターネットで配信されるような場合がある。この場合、ユーザは、所望のコンテンツを取得するために、テレビやパーソナルコンピュータやスマートフォンといった様々なデバイスやアプリを使い分ける必要がある。 Currently, the same content may be distributed on different media. For example, after a certain program is broadcast on the ground, the same program may be distributed over the Internet. In this case, the user needs to properly use various devices and applications such as a television, a personal computer, and a smartphone in order to acquire desired content.

しかし、ユーザが、同一のコンテンツが異なるメディアで配信されることを知らない場合がある。また、或るメディアでの取得方法が分かったとしても、その取得方法が、ユーザの状況やデバイスの機能にとって最適であるとは限らず、よりユーザに適した取得方法が他に存在することもあり得る。
そのため、ユーザが所望するコンテンツと同一のコンテンツが、他の方法でも取得できるような場合には、その取得方法のそれぞれをユーザに提示することが望まれる。 However, the user may not know that the same content is distributed on different media. Also, even if an acquisition method for a certain medium is known, the acquisition method is not necessarily optimal for the user's situation and device function, and there may be other acquisition methods more suitable for the user. possible.
Therefore, when the same content as the content desired by the user can be acquired by other methods, it is desired to present each of the acquisition methods to the user.

しかしながら、特許文献１に開示の技術は、ユーザが過去に録画した番組に関連する番組をユーザに提示するものであり、ユーザが所望する番組と同一の番組についての情報を提示する技術ではない。つまり、同一のコンテンツ同士を特定するための技術ではない。 However, the technique disclosed in Patent Literature 1 presents a program related to a program recorded by the user in the past to the user, and is not a technique for presenting information about the same program as the user desires. That is, it is not a technique for identifying the same content.

また、そもそも特許文献１に開示の技術は、現在利用しているテレビにて受信できる番組の中から選択した番組を提示するというものであり、他のデバイスで受信できる番組については考慮していない。つまり、現在利用しているテレビ以外のデバイスで受信できる番組については考慮してはいない。 In the first place, the technique disclosed in Patent Document 1 presents a program selected from programs that can be received by the currently used television, and does not consider programs that can be received by other devices. . In other words, no consideration is given to programs that can be received by devices other than the currently used television.

そのため、特許文献１に開示の技術等の一般的な技術では、同一のコンテンツが異なるメディアで配信されるような場合であっても、ユーザが所望するコンテンツと同一のコンテンツに関する情報を提示するようなことができなかった。 Therefore, in a general technique such as the technique disclosed in Patent Document 1, even when the same content is distributed on different media, information related to the same content as the content desired by the user is presented. I could not do anything.

そこで本発明は、例えばユーザが所望するコンテンツと同一のコンテンツに関する情報を提示するために、同一のコンテンツを特定することが可能な、コンテンツ同定装置及びプログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide a content identification device and a program capable of specifying the same content in order to present information related to the same content as the content desired by the user, for example.

本発明に係るコンテンツ同定装置は、複数のコンテンツごとに、当該コンテンツに付随したコンテンツの特徴を示すデータであるメタデータを所定のメタデータ取得方法に基づいて取得し、該取得したメタデータを、メタデータの種類ごとに分類して蓄積するメタデータ管理手段と、前記メタデータ管理手段が蓄積した当該メタデータの種類ごとに、当該メタデータ同士の類似度を算出する類似度算出手段と、前記類似度算出手段が算出した前記類似度に基づいて、前記コンテンツと前記コンテンツ以外の他コンテンツとの同一度を算出する同一度算出手段と、前記同一度算出手段が算出したコンテンツ同士の同一度に基づいて、指定されたコンテンツと同一のコンテンツとして特定されるコンテンツの取得方法を含むメタデータのリストを作成するリスト作成手段を備える。 The content identification device according to the present invention acquires, for each of a plurality of contents, metadata that is data indicating the characteristics of the content attached to the content based on a predetermined metadata acquisition method, and the acquired metadata is Metadata management means for classifying and storing each metadata type, similarity calculation means for calculating the similarity between the metadata for each metadata type stored by the metadata management means, and Based on the similarity calculated by the similarity calculation means, the sameness calculation means for calculating the sameness between the content and other content other than the content, and the sameness between the contents calculated by the identity calculation means Based on a list of metadata that includes content acquisition methods identified as the same content as the specified content It includes a list creation means for growth.

上記本発明に係るコンテンツ同定装置を、前記リスト作成手段は、コンテンツの指定を含むリスト要求をユーザから受け付けた場合に、前記同一度算出手段が算出したコンテンツ同士の同一度に基づいて、前記リスト要求により指定されたコンテンツと同一のコンテンツとして特定されるコンテンツの取得方法を含むメタデータのリストを作成し、当該作成したリストを、前記リスト要求の応答として前記ユーザに対して出力するコンテンツ同定装置としてもよい。 In the content identification device according to the present invention, when the list creation unit receives a list request including content specification from a user, the list is generated based on the identity of the content calculated by the identity calculation unit. A content identification device that creates a list of metadata including a content acquisition method specified as the same content as the content specified by the request, and outputs the created list to the user as a response to the list request It is good.

上記本発明に係るコンテンツ同定装置を、前記同一度算出手段は、前記メタデータの種類によって異なる重み付けを前記類似度算出手段が算出した各メタデータ同士の類似度に与え、前記同一度算出手段は、前記重み付けを与えられた各メタデータの類似度をコンテンツ単位で合算した値に基づいて、各コンテンツ同士の同一度を算出するコンテンツ同定装置としてもよい。 In the content identification apparatus according to the present invention, the degree-of-identity calculation unit assigns different weights depending on the type of the metadata to the degrees of similarity between the metadata calculated by the degree-of-similarity calculation unit, and the degree-of-identity calculation unit The content identification device may calculate the degree of identity between the contents based on a value obtained by adding the similarities of the metadata given the weights in units of contents.

上記本発明に係るコンテンツ同定装置を、前記同一度算出手段は、前記各コンテンツ同士の同一度を算出するための同一度算出部を複数備え、前記複数の同一度算出部は、当該複数の同一度算出部のそれぞれが、異なる算出方法で前記各コンテンツ同士の同一度を算出するコンテンツ同定装置としてもよい。 In the content identification device according to the present invention, the degree-of-identity calculation means includes a plurality of degree-of-identity calculation units for calculating the degree of identity between the contents. Each of the calculation units may be a content identification device that calculates the degree of identity between the contents by different calculation methods.

上記本発明に係るコンテンツ同定装置を、コンテンツ組み合わせ管理手段を更に備え、前記類似度算出手段は、前記メタデータ管理手段が管理するメタデータの種類の内の何れかの種類について各メタデータ同士の類似度を算出し、算出された類似度が所定の基準以上であった各メタデータに対応するコンテンツ同士の組み合わせを、前記コンテンツ組み合わせ管理手段に格納し、前記コンテンツ組み合わせ管理手段に格納されなかったコンテンツ同士の組み合わせについては、前記類似度算出手段による各メタデータ同士の類似度の算出と、前記同一度算出手段による各コンテンツ同士の同一度の算出と、を行わないコンテンツ同定装置としてもよい。 The content identification apparatus according to the present invention further includes content combination management means, and the similarity calculation means is configured to store each metadata between any of the metadata types managed by the metadata management means. The similarity is calculated, and the combination of contents corresponding to each metadata whose calculated similarity is equal to or higher than a predetermined standard is stored in the content combination management unit and not stored in the content combination management unit As for the combination of contents, a content identification apparatus that does not perform the calculation of the similarity between the metadata by the similarity calculation unit and the calculation of the degree of identity between the contents by the identity calculation unit.

本発明に係るプログラムは、コンピュータを、上記本発明に係るコンテンツ同定装置として機能させる。 The program according to the present invention causes a computer to function as the content identification device according to the present invention.

本発明にかかるコンテンツ同定装置及びプログラムによれば、例えばユーザが所望するコンテンツと同一のコンテンツに関する情報を提示するために、同一のコンテンツを特定することが可能となる。 According to the content identification device and the program according to the present invention, for example, in order to present information related to the same content as the content desired by the user, the same content can be specified.

本発明の第１の実施形態の基本的構成を表す機能ブロック図である。It is a functional block diagram showing the basic composition of the 1st embodiment of the present invention. 本発明の第１の実施形態の基本的動作を表すフローチャートである。It is a flowchart showing the basic operation | movement of the 1st Embodiment of this invention. 本発明の第２の実施形態の基本的構成を表す機能ブロック図である。It is a functional block diagram showing the basic composition of the 2nd Embodiment of this invention. 本発明の第２の実施形態の基本的動作を表すフローチャートである。It is a flowchart showing the basic operation | movement of the 2nd Embodiment of this invention. 本発明の第３の実施形態の基本的構成を表す機能ブロック図である。It is a functional block diagram showing the fundamental structure of the 3rd Embodiment of this invention. 本発明の実施例におけるデータの一例を表す図（１／４）である。It is a figure (1/4) showing an example of the data in the Example of this invention. 本発明の実施例におけるデータの一例を表す図（２／４）である。It is a figure (2/4) showing an example of the data in the Example of this invention. 本発明の実施例におけるデータの一例を表す図（３／４）である。It is a figure (3/4) showing an example of the data in the Example of this invention. 本発明の実施例におけるデータの一例を表す図（４／４）である。It is a figure (4/4) showing an example of the data in the Example of this invention.

まず、本発明の実施形態の概略を説明する。本発明の実施形態は、同一のコンテンツが異なるメディアで様々な事業者によって提供されるような場合に、各事業者が保有するコンテンツのメタデータを用いて、同一のコンテンツを同定するというものである。 First, an outline of an embodiment of the present invention will be described. In the embodiment of the present invention, when the same content is provided by various companies on different media, the same content is identified using the metadata of the contents held by each company. is there.

このように本実施形態では、同一のコンテンツを同定できることから、ユーザから所得したいコンテンツの要求があったときに、そのコンテンツと同一であると同定したコンテンツの取得方法を含むメタデータのリストを返すことができる。 As described above, in this embodiment, since the same content can be identified, when a request for content that the user wants to earn is received, a list of metadata including a method for acquiring the content that is identified as the content is returned. be able to.

つまり、ユーザからしてみると、自身が取得したいコンテンツに関する情報を本システムに入力するだけで、そのコンテンツの取得方法を含むメタデータのリストを取得することができる。ユーザはそのリストの中から、自らの状況やデバイス機能に適したコンテンツの取得方法を選択することができる。これにより、ユーザは選択した取得方法によりコンテンツを取得して、コンテンツを楽しむことができる。
以上が本実施形態の概略である。 That is, from the viewpoint of the user, it is possible to acquire a list of metadata including the content acquisition method simply by inputting information about the content that the user wants to acquire into the system. The user can select a content acquisition method suitable for his / her situation and device function from the list. Thus, the user can acquire the content by the selected acquisition method and enjoy the content.
The above is the outline of this embodiment.

次に、本発明の実施形態について図面を参照して詳細に説明する。なお、以下では３つの実施形態について説明をする。ここで、第１の実施形態は、基本的な実施形態である。また、第２の実施形態は、第１の実施形態の構成を簡略化した実施形態である。また、第３の実施形態は、第１の実施形態を変形し、計算量を削減することを可能とした実施形態である。 Next, embodiments of the present invention will be described in detail with reference to the drawings. In the following, three embodiments will be described. Here, the first embodiment is a basic embodiment. The second embodiment is an embodiment in which the configuration of the first embodiment is simplified. Further, the third embodiment is an embodiment in which the first embodiment can be modified to reduce the calculation amount.

＜第１の実施形態＞
図１に示すように、第１の実施形態であるコンテンツ同定装置１０１は、メタデータ管理手段１０、メタデータ類似度管理手段２０、コンテンツ同一度管理手段３０及びコンテンツ取得方法リスト作成手段４０とを含む。各装置は、例えばＬＡＮ（ｌｏｃａｌａｒｅａｎｅｔｗｏｒｋ）やインターネット等のネットワークに接続され、相互に通信可能な状態となっている。なお、以下の説明及び図中においては、データベースのことを、適宜「ＤＢ」と表記する。 <First Embodiment>
As shown in FIG. 1, the content identification apparatus 101 according to the first embodiment includes a metadata management unit 10, a metadata similarity management unit 20, a content identity management unit 30, and a content acquisition method list creation unit 40. Including. Each device is connected to a network such as a local area network (LAN) or the Internet, and can communicate with each other. In the following description and drawings, the database is appropriately expressed as “DB”.

メタデータ管理手段１０は、コンテンツを提供する事業者から、提供される各コンテンツに関する情報（以下、「メタデータ」と呼ぶ。）を取得して管理する装置である。 The metadata management means 10 is a device that acquires and manages information (hereinafter referred to as “metadata”) related to each provided content from a provider that provides the content.

メタデータ管理手段１０は、メタデータ整形部１１と、メタデータ管理ＤＢ１２とを備える。メタデータ整形部１１は、各事業者が利用する端末（図示省略）と通信を行うことによりメタデータを取得する。そして、メタデータ整形部１１は取得したメタデータを、メタデータの種類毎にメタデータ管理ＤＢ１２に格納する。メタデータ管理ＤＢ１２は、記憶装置であり、メタデータをメタデータの種類毎に記憶する。 The metadata management unit 10 includes a metadata shaping unit 11 and a metadata management DB 12. The metadata shaping unit 11 acquires metadata by communicating with a terminal (not shown) used by each business operator. The metadata shaping unit 11 stores the acquired metadata in the metadata management DB 12 for each type of metadata. The metadata management DB 12 is a storage device and stores metadata for each type of metadata.

また、メタデータ類似度管理手段２０は、メタデータ管理手段１０が管理するメタデータ同士の類似の度合いである「メタデータ類似度」を算出して管理する装置である。 The metadata similarity management unit 20 is a device that calculates and manages “metadata similarity”, which is the degree of similarity between metadata managed by the metadata management unit 10.

メタデータ類似度管理手段２０は、ｎ個（ｎは任意の自然数）のメタデータ類似度算出部２１と、メタデータ類似度管理ＤＢ２２とを備える。メタデータ類似度算出部２１は、メタデータの種類に応じた数だけ設けられる。そして、メタデータ類似度算出部２１のそれぞれは、同一の種類のメタデータを比較することにより、メタデータ類似度を算出し、算出したメタデータ類似度をメタデータ類似度管理ＤＢ２２に格納する。メタデータ類似度管理ＤＢ２２は、記憶装置であり、メタデータ類似度をメタデータの種類毎に記憶する。 The metadata similarity management unit 20 includes n (n is an arbitrary natural number) metadata similarity calculation units 21 and a metadata similarity management DB 22. The number of metadata similarity calculation units 21 is provided according to the type of metadata. Each of the metadata similarity calculation units 21 calculates the metadata similarity by comparing the same type of metadata, and stores the calculated metadata similarity in the metadata similarity management DB 22. The metadata similarity management DB 22 is a storage device, and stores the metadata similarity for each type of metadata.

更に、コンテンツ同一度管理手段３０は、メタデータ類似度管理手段２０が管理するメタデータ類似度に基づいて、メタデータに対応するコンテンツ同士の同一性の度合いである「コンテンツ同一度」を算出して管理する装置である。 Further, the content identity management means 30 calculates “content identity”, which is the degree of identity between contents corresponding to metadata, based on the metadata similarity managed by the metadata similarity management means 20. It is a device to manage.

コンテンツ同一度管理手段３０は、ｍ個（ｍは任意の自然数）のコンテンツ同一度算出部３１と、コンテンツ同一度管理ＤＢ３２とを備える。コンテンツ同一度算出部３１は、コンテンツ同一度の算出方法の数に応じた数だけ設けられる。そして、コンテンツ同一度算出部３１のそれぞれは、コンテンツ同一度算出部３１自身に対応する算出方法で、コンテンツ同一度を算出し、算出したコンテンツ同一度をコンテンツ同一度管理ＤＢ３２に格納する。コンテンツ同一度管理ＤＢ３２は、記憶装置であり、コンテンツ同一度算出部３１のそれぞれが算出したコンテンツ同一度を記憶する。 The content identity management unit 30 includes m (m is an arbitrary natural number) content identity calculation units 31 and a content identity management DB 32. The content identity calculation units 31 are provided in a number corresponding to the number of content identity calculation methods. Then, each of the content identity calculation units 31 calculates the content identity by a calculation method corresponding to the content identity calculation unit 31 itself, and stores the calculated content identity in the content identity management DB 32. The content identity management DB 32 is a storage device, and stores the content identity calculated by each content identity calculation unit 31.

更に、コンテンツ取得方法リスト作成手段４０は、コンテンツ取得方法リスト作成部４１を備える。コンテンツ取得方法リスト作成部４１は、コンテンツの利用を希望するユーザから、利用を希望するコンテンツと同一のコンテンツの取得方法を含むメタデータのリストの作成要求を受け付ける。この作成要求には、利用を希望するコンテンツの指定が含まれる。
この要求を受け付けたコンテンツ取得方法リスト作成部４１は、コンテンツ同一度管理手段３０が管理するコンテンツ同一度に基づいて、指定されたコンテンツと同一のコンテンツの取得方法を含むメタデータのリストを作成する。また、コンテンツ取得方法リスト作成部４１は、作成したリストをユーザに対して出力する。ここで、ユーザとは、コンテンツを利用するユーザであり、例えばコンテンツの１つであるテレビ番組の視聴を希望するユーザである。 Further, the content acquisition method list creation unit 40 includes a content acquisition method list creation unit 41. The content acquisition method list creation unit 41 accepts a metadata list creation request including a content acquisition method identical to the content desired to be used from a user who desires to use the content. This creation request includes specification of content desired to be used.
The content acquisition method list creation unit 41 that has received this request creates a metadata list that includes the same content acquisition method as the specified content, based on the content identity managed by the content identity management means 30. . In addition, the content acquisition method list creation unit 41 outputs the created list to the user. Here, the user is a user who uses content, for example, a user who wants to watch a television program that is one of the content.

コンテンツ取得方法リスト作成部４１は、ユーザが利用する端末（図示省略）と通信を行うことにより作成要求の受け付けや、リストの出力を行う。ユーザは、自身が利用する端末にてリストを参照することにより、自身の状況に適したコンテンツの取得方法を選択して、コンテンツを視聴することができる。 The content acquisition method list creation unit 41 receives a creation request and outputs a list by communicating with a terminal (not shown) used by the user. The user can view the content by selecting a content acquisition method suitable for his / her situation by referring to the list on the terminal used by the user.

コンテンツ同定装置１０１は、一般的なサーバ装置やパーソナルコンピュータに本実施形態特有のソフトウェアを組み込むことにより実現できる。 The content identification device 101 can be realized by incorporating software specific to the present embodiment into a general server device or personal computer.

より詳細に説明すると、コンテンツ同定装置１０１は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等の演算処理装置を備える。また、コンテンツ同定装置１０１は、各種のプログラムを格納したＨＤＤ（ｈａｒｄｄｉｓｋｄｒｉｖｅ）やＳＳＤ（ｓｏｌｉｄｓｔａｔｅｄｒｉｖｅ）等の補助記憶装置や、演算処理装置がプログラムを実行する上で一時的に必要とされるデータを格納するためのＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）といった主記憶装置を備える。 More specifically, the content identification device 101 includes an arithmetic processing device such as a CPU (Central Processing Unit). The content identification device 101 is temporarily required when an auxiliary storage device such as a hard disk drive (HDD) or a solid state drive (SSD) that stores various programs or an arithmetic processing device executes the program. A main memory such as a RAM (Random Access Memory) for storing data.

また、コンテンツ同定装置１０１において、演算処理装置が補助記憶装置から各種のプログラムを読み込み、読み込んだ各種のプログラムを主記憶装置に展開させながら、これら各種のプログラムに基づいた演算処理を行なう。
そして、この演算結果に基づいて、各装置のハードウェアを制御することにより、上述した各手段（メタデータ管理手段１０、メタデータ類似度管理手段２０、コンテンツ同一度管理手段３０及びコンテンツ取得方法リスト作成手段４０）を実現する。つまり、本実施形態は、ハードウェアとソフトウェアが協働することにより実現することができる。
なお、メタデータ管理手段１０、メタデータ類似度管理手段２０、コンテンツ同一度管理手段３０及びコンテンツ取得方法リスト作成手段４０のそれぞれを、別個の装置にて実現するようにしてもよい。そして、これらの手段を実現する各装置が協働することによりコンテンツ同定システムを構成するようにしてもよい。 In the content identification apparatus 101, the arithmetic processing device reads various programs from the auxiliary storage device, and performs arithmetic processing based on these various programs while developing the read various programs in the main storage device.
Then, by controlling the hardware of each device based on this calculation result, the above-described means (metadata management means 10, metadata similarity management means 20, content identity management means 30, and content acquisition method list) Creating means 40). That is, this embodiment can be realized by cooperation of hardware and software.
Note that each of the metadata management unit 10, the metadata similarity management unit 20, the content identity management unit 30, and the content acquisition method list creation unit 40 may be realized by separate devices. And a content identification system may be comprised by each apparatus which implement | achieves these means cooperating.

次に、図２のシーケンス図を参照して、本実施形態の動作について説明をする。なお、前提として、本例では、事業者Ａが提供する第１のコンテンツ（図中では、「丸付き数字の１」として示す。他のコンテンツも同様に図中では丸付き数字で示す。）と、事業者Ａが提供する第２のコンテンツと、事業者Ｃが提供する第４のコンテンツとが同一の映像及び同一の音声のコンテンツである。すなわち、同一のコンテンツであるとする。 Next, the operation of this embodiment will be described with reference to the sequence diagram of FIG. As a premise, in this example, the first content provided by the operator A (shown as “circled number 1” in the figure. Other contents are also shown in the figure by circled numbers). The second content provided by the operator A and the fourth content provided by the operator C are the same video and audio content. That is, it is assumed that the content is the same.

また、事業者Ｂが提供する第３のコンテンツが、これら第１のコンテンツ、第２のコンテンツ及び第４のコンテンツに関連するコンテンツであるとする。更に、事業者Ｃが提供する第５のコンテンツが、これら第１のコンテンツ、第２のコンテンツ及び第４のコンテンツと関連のないコンテンツであるとする。 In addition, it is assumed that the third content provided by the operator B is content related to the first content, the second content, and the fourth content. Furthermore, it is assumed that the fifth content provided by the operator C is content not related to the first content, the second content, and the fourth content.

まず、ステップＳ１において、メタデータ整形部１１が、各事業者が保有するコンテンツのメタデータを取得する。ここで、どのようなメタデータが含まれているのかは、コンテンツや事業者により異なる。そこで、メタデータ整形部１１は、各メタデータを比較しやすいように整形してメタデータ管理ＤＢ１２に登録する。より詳細には、１つのコンテンツに対応するメタデータに含まれる複数のメタデータを、１つ１つのメタデータに区分けして、同じ種類のメタデータ毎に分類する。 First, in step S <b> 1, the metadata shaping unit 11 acquires metadata of content held by each business operator. Here, what kind of metadata is included differs depending on the content and the business. Therefore, the metadata shaping unit 11 shapes each metadata so that it can be easily compared, and registers the metadata in the metadata management DB 12. More specifically, a plurality of metadata included in metadata corresponding to one content is classified into metadata one by one and classified for each metadata of the same type.

例えば、区分けしたメタデータがコンテンツの題名を表すメタデータであるならば“ｔｉｔｌｅ”というメタデータの種類に分類する。また、区分けしたメタデータが、コンテンツの内容の概要を表すメタデータならば“ｓｕｍｍａｒｙ”というメタデータの種類に分類する。
そして、ステップＳ２において、メタデータ整形部１１はこのように整形した、整形後のメタデータをメタデータ管理ＤＢ１２に格納する。 For example, if the classified metadata is metadata representing the title of the content, the metadata is classified into a metadata type “title”. In addition, if the classified metadata is metadata that represents an outline of the content, it is classified into a metadata type “summary”.
In step S2, the metadata shaping unit 11 stores the shaped metadata thus shaped in the metadata management DB 12.

ステップＳ３において、ｎ個のメタデータ類似度算出部２１のそれぞれ（図中では、メタデータ類似度算出部２１を１つのみ図示する。）は、メタデータ管理ＤＢ１２に対して問い合わせを行う。なお、図中ではステップＳ２の終了を契機として連続してステップＳ３が実行されるように記載している。しかしながらこれは例示に過ぎず、ステップＳ２の終了を契機として連続してステップＳ３が行われる構成としてもよく、ステップＳ２の終了を契機として連続してステップＳ３が行なわれない構成としてもよい。
例えば、メタデータ管理ＤＢ１２にメタデータが蓄積されたことを、メタデータ類似度算出部２１に通知する構成としてもよい。この場合には、この通知を契機にしてメタデータ類似度算出部２１がステップＳ３を行うので、ステップＳ２の終了を契機として連続してステップＳ３が実行される。
一方で、メタデータ類似度算出部２１が定期的にステップＳ３を行う構成や、ユーザからのコンテンツ取得方法リストの取得要求があった場合にステップＳ３を行う構成としてもよい。この場合には、ステップＳ２の終了を契機としてステップＳ３は実行されない。
つまり、図中の記載は図示のための便宜上のものであり、本実施形態の構成を限定するものではない。
ステップＳ４において、その返答として整形後のメタデータを取得する。そして、ｎ個のメタデータ類似度算出部２１のそれぞれは、取得した各メタデータ同士を比較し、その類似の度合いであるメタデータ類似度を算出してメタデータ類似度管理ＤＢ２２に登録する。なお、ｎ個のメタデータ類似度算出部２１は、メタデータ類似度を算出する対象とするメタデータの種類がそれぞれ異なる。 In step S3, each of the n metadata similarity calculators 21 (only one metadata similarity calculator 21 is shown in the figure) makes an inquiry to the metadata management DB 12. In the figure, it is described that step S3 is continuously executed with the end of step S2. However, this is merely an example, and step S3 may be continuously performed with the end of step S2 as an opportunity, or step S3 may not be continuously performed with the end of step S2.
For example, the metadata similarity calculation unit 21 may be notified that metadata has been accumulated in the metadata management DB 12. In this case, since the metadata similarity calculation unit 21 performs step S3 with this notification as a trigger, step S3 is continuously executed with the end of step S2.
On the other hand, a configuration in which the metadata similarity calculation unit 21 periodically performs step S3 or a configuration in which step S3 is performed when there is an acquisition request for a content acquisition method list from the user may be employed. In this case, step S3 is not executed with the end of step S2.
That is, the description in the drawing is for convenience of illustration, and does not limit the configuration of the present embodiment.
In step S4, the formatted metadata is acquired as the response. Then, each of the n metadata similarity calculation units 21 compares the acquired metadata with each other, calculates a metadata similarity that is a degree of similarity, and registers it in the metadata similarity management DB 22. Note that the n metadata similarity calculation units 21 have different types of metadata for which the metadata similarity is calculated.

例えば、メタデータ類似度算出部２１−２は、各コンテンツの“ｔｉｔｌｅ”という種類のメタデータについてメタデータ類似度を算出する。また、例えば、メタデータ類似度算出部２１−３は、各コンテンツの“ｓｕｍｍａｒｙ”という種類のメタデータについてメタデータ類似度を算出する。 For example, the metadata similarity calculation unit 21-2 calculates the metadata similarity for the type of metadata “title” of each content. Further, for example, the metadata similarity calculating unit 21-3 calculates the metadata similarity for the metadata of the type “summary” of each content.

次に、ステップＳ５において、ｎ個のメタデータ類似度算出部２１のそれぞれは、メタデータ管理ＤＢ１２に格納されているコンテンツの全組み合わせについてメタデータ類似度を算出し、算出結果をメタデータ類似度管理ＤＢ２２に格納する。格納方法は任意であるが、例えば、縦の列と横の行それぞれに各コンテンツの識別子を配置したテーブルを作成し、このテーブルの行と列が交差した部分に、対応したコンテンツのメタデータ同士のメタデータ類似度を格納する。なお、このような格納方法の詳細については、後述の実施例の説明の際に、図６−３を参照しながら説明をする。 Next, in step S5, each of the n metadata similarity calculation units 21 calculates the metadata similarity for all the combinations of contents stored in the metadata management DB 12, and the calculation result is the metadata similarity. Store in the management DB 22. The storage method is arbitrary, but for example, a table in which identifiers of contents are arranged in vertical columns and horizontal rows is created, and the corresponding content metadata is created at the intersection of the rows and columns of the table. Stores the metadata similarity of. Note that details of such a storage method will be described with reference to FIG.

ステップＳ６において、ｍ個のコンテンツ同一度算出部３１それぞれ（図中では、コンテンツ同一度算出部３１を１つのみ図示する。）は、メタデータ類似度管理ＤＢ２２に対して問い合わせを行う。また、ステップＳ７において、その返答としてｎ個のメタデータ類似度算出部２１がそれぞれ算出したメタデータ類似度を取得する。
なお、図中では便宜上ステップＳ５の終了を契機として連続してステップＳ６が実行されるように記載している。しかしながらこれは例示に過ぎず、ステップＳ５の終了を契機として連続してステップＳ６が行われる構成としてもよく、ステップＳ５の終了を契機として連続してステップＳ６が行なわれない構成としてもよい。
例えば、メタデータ類似度管理ＤＢにメタデータが蓄積されたことを、コンテンツ同一度算出部３１に通知する構成としてもよい。この場合には、この通知を契機にしてコンテンツ同一度算出部３１がステップＳ６を行うので、ステップＳ５の終了を契機として連続してステップＳ６が実行される。
一方で、コンテンツ同一度算出部３１が定期的にステップＳ６を行う構成や、ユーザからのコンテンツ取得方法リストの取得要求があった場合にステップＳ６を行う構成としてもよい。この場合には、ステップＳ５の終了を契機としてステップＳ６は実行されない。
つまり、図中の記載は図示のための便宜上のものであり、本実施形態の構成を限定するものではない。 In step S6, each of the m content identity calculation units 31 (only one content identity calculation unit 31 is illustrated in the drawing) makes an inquiry to the metadata similarity management DB 22. In step S7, the metadata similarity calculated by the n metadata similarity calculators 21 is acquired as a response.
In the drawing, for the sake of convenience, it is described that step S6 is continuously executed with the end of step S5 as a trigger. However, this is merely an example, and step S6 may be continuously performed with the end of step S5 as a trigger, or step S6 may not be continuously performed with the end of step S5 as a trigger.
For example, the content similarity calculation unit 31 may be notified that metadata has been accumulated in the metadata similarity management DB. In this case, since the content identity calculation unit 31 performs step S6 with this notification as a trigger, step S6 is continuously executed with the end of step S5 as a trigger.
On the other hand, a configuration in which the content identity calculation unit 31 periodically performs step S6 or a configuration in which step S6 is performed when there is a content acquisition method list acquisition request from the user may be employed. In this case, step S6 is not executed in response to the end of step S5.
That is, the description in the drawing is for convenience of illustration, and does not limit the configuration of the present embodiment.

そして、ｍ個のコンテンツ同一度算出部３１のそれぞれは、取得した各メタデータ類似度を比較し、比較したメタデータ類似度に対応するコンテンツ同士の同一性の度合いであるコンテンツ同一度を算出してコンテンツ同一度管理ＤＢ３２に登録する。なお、ｍ個のコンテンツ同一度算出部３１は、コンテンツ同一度の算出方法がそれぞれ異なる。 Then, each of the m content identity calculation units 31 compares the acquired metadata similarities, and calculates the content identity that is the degree of identity between the contents corresponding to the compared metadata similarities. Are registered in the content identity management DB 32. Note that the m content identity calculation units 31 have different content identity calculation methods.

例えば、コンテンツ同一度算出部３１−１は“ｔｉｔｌｅ”という種類のメタデータのメタデータ類似度に重きをおいてコンテンツ同一度を算出する。これに対し、コンテンツ同一度算出部３１−２は“ｓｕｍｍａｒｙ”という種類のメタデータのメタデータ類似度の重きをおいてコンテンツ同一度を算出する。また、コンテンツの映像と音声内容が完全に一致する時のみ同一度が高くなるようにメタデータ類似度の重みを設定する場合もあれば、それらに加えてハイライトやダイジェスト版についても同一度が高くなるようにメタデータ類似度に重みを設定する場合もある。 For example, the content identity calculation unit 31-1 calculates the content identity by placing emphasis on the metadata similarity of the type of metadata “title”. On the other hand, the content identity calculation unit 31-2 calculates the content identity with an emphasis on the metadata similarity of the metadata of the type “summary”. Also, in some cases, the metadata similarity weight is set so that the degree of identity increases only when the content video and audio content completely match. In addition, the same degree is also applied to the highlight and digest versions. In some cases, the metadata similarity is set so as to have a higher weight.

そして、ステップＳ８において、ｍ個のコンテンツ同一度算出部３１のそれぞれは、メタデータ類似度管理ＤＢ２２に格納されているメタデータ類似度に対応するコンテンツの全組み合わせについてコンテンツ類似度を算出し、算出結果をコンテンツ同一度管理ＤＢ３２に格納する。格納方法は任意であるが、例えば、縦の列と横の行それぞれに各コンテンツの識別子を配置したテーブルを作成し、このテーブルの行と列が交差した部分に、対応したコンテンツ同士のコンテンツ同一度を格納する。なお、このような格納方法の詳細については、後述の実施例の説明の際に、図６−４を参照しながら説明をする。 In step S8, each of the m content identity calculation units 31 calculates content similarity for all combinations of content corresponding to the metadata similarity stored in the metadata similarity management DB 22. The result is stored in the content identity management DB 32. Although the storage method is arbitrary, for example, a table is created in which identifiers for each content are arranged in vertical columns and horizontal rows, and the content of the corresponding content is shared at the intersection of the rows and columns of this table. Store once. The details of such a storage method will be described with reference to FIG. 6-4 in the description of an embodiment described later.

他方、ステップＳ９において、ユーザは、利用を希望するコンテンツと同一のコンテンツの取得方法を含むメタデータのリストの作成要求を、コンテンツ取得方法リスト作成部４１に送信する。この作成要求には、利用を希望するコンテンツの指定が含まれる。ここで、利用を希望するコンテンツとは、例えばユーザが視聴を希望する番組である。利用を希望するコンテンツの指定とは、例えば、図中に示すように、利用を希望するコンテンツを一意に特定するためのコンテンツ特定用情報を送信することであり、例えば、コンテンツの題名に対応するテキストデータや、コンテンツに対応するＵＲＬ等を送信することである。
なお、図中では便宜上ステップＳ８の終了を契機として連続してステップＳ９が実行されるように記載している。しかしながら、ユーザはステップＳ８が終了したタイミングとは関係なく、自身が所望するタイミングでステップＳ９を実行することができる。つまり、図中の記載は図示のための便宜上のものであり、本実施形態の構成を限定するものではない。 On the other hand, in step S <b> 9, the user transmits a metadata list creation request including the same content acquisition method as the content desired to be used to the content acquisition method list creation unit 41. This creation request includes specification of content desired to be used. Here, the content desired to be used is, for example, a program that the user desires to view. The designation of the content desired to be used is, for example, transmitting content specifying information for uniquely identifying the content desired to be used, as shown in the figure, and corresponds to, for example, the title of the content. Sending text data, URL corresponding to the content, and the like.
In the drawing, for the sake of convenience, it is described that step S9 is continuously executed in response to the end of step S8. However, the user can execute step S9 at a timing desired by the user regardless of the timing at which step S8 ends. That is, the description in the drawing is for convenience of illustration, and does not limit the configuration of the present embodiment.

また、ステップＳ１０において、コンテンツ取得方法リスト作成部４１は、メタデータ管理ＤＢ１２に対して、コンテンツ特定用情報を用いた問い合わせを行う。更に、ステップＳ１１において、コンテンツ特定用情報に対応する特定のコンテンツが何れのコンテンツであるかということを解決する。例えば、コンテンツ特定用情報に対応するコンテンツが第１のコンテンツであるならば、メタデータ管理ＤＢ１２に対して問い合わせを行うことにより、コンテンツ特定用情報に対応するコンテンツが第１のコンテンツであることを解決する。 In step S <b> 10, the content acquisition method list creation unit 41 makes an inquiry to the metadata management DB 12 using the content specifying information. Further, in step S11, it is resolved which content is the specific content corresponding to the content specifying information. For example, if the content corresponding to the content specifying information is the first content, it is determined that the content corresponding to the content specifying information is the first content by making an inquiry to the metadata management DB 12. Solve.

次に、ステップＳ１２において、コンテンツ取得方法リスト作成部４１は、コンテンツ同一度管理ＤＢ３２に対して問い合わせを行う。また、ステップＳ１３において、その返答として何れかのコンテンツ同一度算出部３１が算出したコンテンツ同一度を取得する。何れのコンテンツ同一度算出部３１の算出結果を用いるかは、ユーザの希望などを考慮して決定する。ユーザの希望とは、例えばユーザの求めるコンテンツの取得方法が見つからない場合に、類似するコンテンツが渡されることを希望するか否か、などが挙げられる。 Next, in step S12, the content acquisition method list creation unit 41 makes an inquiry to the content identity management DB 32. In step S13, the content identity calculated by any content identity calculator 31 is acquired as a response. Which content identity calculation unit 31 uses the calculation result is determined in consideration of the user's wishes and the like. The user's desire includes, for example, whether or not it is desired that similar content is passed when a method for acquiring the content desired by the user is not found.

そして、コンテンツ取得方法リスト作成部４１は、取得したコンテンツ同一度に基づいて、コンテンツ取得方法リストを生成する。例えば、コンテンツ同一度が高いコンテンツから順に並べることによりリストを作成する。
このリストは、各コンテンツを取得する方法を含むメタデータにより構成される。具体的には、コンテンツが例えばネット配信されるコンテンツであれば、このコンテンツを取得するためのＵＲＬをメタデータとしてリストに含める。また、ユーザがリストを参照した場合に、各ＵＲＬがどのような取得方法に対応するのかが分かるように、取得方法を表すテキストデータや、放送時刻等を表すテキストデータ、解像度や画面比率を表すテキストデータ等も、リストに含ませるようにしてもよい。
更に、コンテンツが例えば地上波で放送されるような場合であれば、ＵＲＬに代えて、地上波のチャンネルと放送時刻等をメタデータとしてリストに含ませるようにしてもよい。
なお、リストに含まれるコンテンツの数が膨大なものとなってしまうと、ユーザがリストを利用してコンテンツを選択することが困難となる。そこで、リストに含ませるコンテンツの数について閾値を設け、この閾値を超えるコンテンツのメタデータについては、コンテンツ取得方法リストに含ませないようにするとよい。 Then, the content acquisition method list creation unit 41 generates a content acquisition method list based on the acquired content identity. For example, a list is created by arranging content in descending order of content identity.
This list is composed of metadata including a method for acquiring each content. Specifically, if the content is content distributed on the Internet, for example, a URL for acquiring the content is included in the list as metadata. In addition, when the user refers to the list, the text data indicating the acquisition method, the text data indicating the broadcast time, the resolution, and the screen ratio are expressed so that each URL corresponds to what acquisition method. Text data or the like may also be included in the list.
Furthermore, if the content is broadcast on terrestrial waves, for example, the terrestrial channel and broadcast time may be included in the list as metadata instead of the URL.
If the number of contents included in the list becomes enormous, it becomes difficult for the user to select contents using the list. Therefore, it is preferable to set a threshold for the number of contents to be included in the list, and not to include content metadata exceeding the threshold in the content acquisition method list.

そして、ステップＳ１４において、コンテンツ取得方法リスト作成部４１は作成したコンテンツ取得方法リストを、ユーザに渡す。例えば、ユーザが利用する端末にコンテンツ取得方法リストを渡す。例えば、コンテンツ特定用情報に対応するコンテンツが第１のコンテンツであるならば、第１のコンテンツに加えて、第１のコンテンツと同一のコンテンツである、第２のコンテンツ及び第４のコンテンツを含んだコンテンツ取得方法リストを渡す。 In step S14, the content acquisition method list creation unit 41 passes the created content acquisition method list to the user. For example, the content acquisition method list is passed to the terminal used by the user. For example, if the content corresponding to the content specifying information is the first content, in addition to the first content, the second content and the fourth content that are the same content as the first content are included. Pass the content acquisition method list.

ユーザは、自身が利用する端末にてリストを参照することにより、コンテンツの取得方法を選択して、コンテンツを視聴することができる。 The user can select the content acquisition method and view the content by referring to the list at the terminal used by the user.

以上説明した本実施形態によれば、メタデータを利用することにより、ユーザが所望するコンテンツと同一のコンテンツを特定することができる。つまり、同一のコンテンツを同定することができる。
これにより、同一のコンテンツが複数のメディアで配信されるような場合に、各メディアにおいてコンテンツを取得することが可能となる。また、ユーザは、リストを参照して任意の取得方法を参照することができるので、自身が保有しているデバイスの種類や、コンテンツを参照したい時間帯等といった状況に応じた、適切な方法でコンテンツを取得することができる。 According to the present embodiment described above, the same content as the content desired by the user can be specified by using the metadata. That is, the same content can be identified.
Thereby, when the same content is distributed by a plurality of media, it is possible to acquire the content in each media. In addition, since the user can refer to the list and refer to any acquisition method, it is possible to use an appropriate method according to the situation such as the type of device that the user owns and the time period when content is desired to be referred to. Content can be acquired.

更に、本実施形態では、ｎ個のメタデータ類似度算出部２１を用意することにより、メタデータの種類毎にメタデータ類似度を算出できる。従って、メタデータの種類を考慮することなく、メタデータ類似度を算出する場合よりも、精度よくメタデータの類似度を算出することが可能となる。 Furthermore, in the present embodiment, by preparing n metadata similarity calculation units 21, the metadata similarity can be calculated for each type of metadata. Therefore, it is possible to calculate the similarity of metadata more accurately than in the case of calculating the metadata similarity without considering the type of metadata.

また、本実施形態では、ｍ個のコンテンツ同一度算出部３１を用意することにより、異なる算出方法によりコンテンツ同一度を算出することができる。そして、例えばユーザの希望等に応じて、何れの算出方法で算出したコンテンツ同一度を利用するのかを選択することができる。これにより、ユーザの希望により沿ったリストを作成することが可能となる。 In the present embodiment, by preparing m content identity calculation units 31, the content identity can be calculated by different calculation methods. Then, for example, according to the user's desire or the like, it is possible to select which calculation method is used to use the content identity. Thereby, it is possible to create a list according to the user's request.

＜第２の実施形態＞
次に、第２の実施形態の構成について、図３を参照して説明をする。なお、本実施形態の基本的な構成については、第１の実施形態と同じである。そこで、以下では特に本実施形態特有の点について詳細に説明をし、第１の実施形態と共通する部分については、重複する説明を省略する。 <Second Embodiment>
Next, the configuration of the second embodiment will be described with reference to FIG. Note that the basic configuration of the present embodiment is the same as that of the first embodiment. Therefore, in the following, the points peculiar to the present embodiment will be described in detail, and the overlapping description will be omitted for the parts common to the first embodiment.

図３に示すように、第２の実施形態であるコンテンツ同定装置１０２は、メタデータ管理手段１０、類似度・同一度管理手段５０及びコンテンツ取得方法リスト作成手段４０を含む。メタデータ管理手段１０及びコンテンツ取得方法リスト作成手段４０については、第１の実施形態と同様である。一方で、本実施形態では、第１の実施形態のメタデータ類似度管理手段２０及びコンテンツ同一度管理手段３０に代えて、類似度・同一度管理手段５０を備えている。 As shown in FIG. 3, the content identification apparatus 102 according to the second embodiment includes a metadata management unit 10, a similarity / identity management unit 50, and a content acquisition method list creation unit 40. The metadata management unit 10 and the content acquisition method list creation unit 40 are the same as those in the first embodiment. On the other hand, in the present embodiment, a similarity / identity management unit 50 is provided instead of the metadata similarity management unit 20 and the content identity management unit 30 of the first embodiment.

類似度・同一度管理手段５０は、メタデータ類似度管理手段２０及びコンテンツ同一度管理手段３０の機能を兼ね備えた装置である。類似度・同一度管理手段５０は、類似度・同一度算出部５１と、コンテンツ同一度管理ＤＢ５２を備える。 The similarity / identity management unit 50 is a device having the functions of the metadata similarity management unit 20 and the content identity management unit 30. The similarity / identity management means 50 includes a similarity / identity calculation unit 51 and a content identity management DB 52.

そして、類似度・同一度算出部５１は、メタデータ類似度管理ＤＢ２２に格納されている整形後のメタデータに基づいて、メタデータ類似度を算出すると共に、この算出したメタデータ類似度を用いて更に、コンテンツ同一度を算出する。また、類似度・同一度算出部５１は、この算出したコンテンツ同一度を、コンテンツ同一度管理ＤＢ５２に格納する。 Then, the similarity / identity calculation unit 51 calculates the metadata similarity based on the shaped metadata stored in the metadata similarity management DB 22 and uses the calculated metadata similarity. Further, the degree of content identity is calculated. Further, the similarity / identity calculation unit 51 stores the calculated content identity in the content identity management DB 52.

このような構成とすることにより、本実施形態では、全てのメタデータの組み合わせについてのメタデータ類似度を記憶しておく必要がなくなる。つまり、メタデータ類似度管理ＤＢ２２のようなデータベースを構築する必要がなくなる。従って、メタデータ類似度管理ＤＢ２２を構築するための記憶装置を用意する必要がなくなる。なお、類似度・同一度管理手段５０は、メタデータ管理手段１０、メタデータ類似度管理手段２０、コンテンツ同一度管理手段３０及びコンテンツ取得方法リスト作成手段４０と同様にコンテンツ同定装置１０２のハードウェアとソフトウェアが協働することにより実現される。また、第１の実施形態において説明したように、各手段を別個の装置で実現する場合には、類似度・同一度管理手段５０を、他の手段を実現する装置とは別個の装置により実現してもよい。 By adopting such a configuration, in this embodiment, it is not necessary to store metadata similarity for all metadata combinations. That is, it is not necessary to construct a database such as the metadata similarity management DB 22. Therefore, it is not necessary to prepare a storage device for constructing the metadata similarity management DB 22. Similarity / identity management means 50 is the same as the hardware of the content identification device 102, like the metadata management means 10, metadata similarity management means 20, content identity management means 30, and content acquisition method list creation means 40. This is realized through collaboration between software and software. Further, as described in the first embodiment, when each unit is realized by a separate device, the similarity / identity management unit 50 is realized by a device separate from a device that realizes other units. May be.

次に、本実施形態の動作について図４のシーケンス図を参照して説明をする。
まず、図４のステップＳ２１及びステップＳ２２の処理は、図２のステップＳ１及びステップＳ２の処理と同じなので説明を省略する。 Next, the operation of the present embodiment will be described with reference to the sequence diagram of FIG.
First, steps S21 and S22 in FIG. 4 are the same as steps S1 and S2 in FIG.

ステップＳ２３において、ｍ個の類似度・同一度算出部５１のそれぞれ（図中では、類似度・同一度算出部５１を１つのみ図示する。）は、メタデータ管理ＤＢ１２に対して問い合わせを行う。また、ステップＳ２４において、その返答として整形後のコンテンツを取得する。なお、第１の実施形態のステップＳ３と同様に、ステップＳ２２の終了を契機として連続してステップＳ２３が行われる構成としてもよく、ステップＳ２２の終了を契機として連続してステップＳ２３が行なわれない構成としてもよい。つまり、図中の記載は図示のための便宜上のものであり、本実施形態の構成を限定するものではない。 In step S23, each of the m similarity / identity calculating units 51 (only one similarity / identity calculating unit 51 is illustrated in the drawing) inquires of the metadata management DB 12. . In step S24, the formatted content is acquired as a response. Note that, similarly to step S3 of the first embodiment, step S23 may be continuously performed with the end of step S22 as a trigger, and step S23 is not continuously performed with the end of step S22 as a trigger. It is good also as a structure. That is, the description in the drawing is for convenience of illustration, and does not limit the configuration of the present embodiment.

そして、ｍ個の類似度・同一度算出部５１のそれぞれは、取得した各メタデータ同士を比較し、その類似の度合いであるメタデータ類似度を算出する。かかるメタデータの算出は全ての種類のメタデータについて行われる。そして、ｍ個の類似度・同一度算出部５１のそれぞれは、算出した各メタデータ類似度を比較し、比較したメタデータ類似度に対応するコンテンツ同士の同一性の度合いであるコンテンツ同一度を算出してコンテンツ同一度管理ＤＢ５２に登録する。なお、ｍ個の類似度・同一度算出部５１は、コンテンツ同一度の算出方法がそれぞれ異なる。 Then, each of the m similarity / identity calculation units 51 compares the acquired metadata with each other, and calculates a metadata similarity that is a degree of similarity. Such calculation of metadata is performed for all types of metadata. Then, each of the m similarity / identity calculation units 51 compares the calculated metadata similarities, and calculates the content identity that is the degree of identity between the contents corresponding to the compared metadata similarities. Calculate and register in the content identity management DB 52. The m similarity / identity calculation units 51 have different content identity calculation methods.

例えば、類似度・同一度算出部５１−１“ｔｉｔｌｅ”という種類のメタデータのメタデータ類似度に重きをおいてコンテンツ同一度を算出する。これに対し、類似度・同一度算出部５１−２の“ｓｕｍｍａｒｙ”という種類のメタデータのメタデータ類似度の重きをおいてコンテンツ同一度を算出する。 For example, the similarity / identity calculation unit 51-1 calculates the content identity by weighting the metadata similarity of the type of metadata “title”. On the other hand, the content similarity is calculated with emphasis on the metadata similarity of the metadata of the type “summary” in the similarity / identity calculation unit 51-2.

次に、ステップＳ２５において、ｍ個の類似度・同一度算出部５１−１のそれぞれは、コンテンツの全組み合わせについてコンテンツ類似度を算出し、算出結果をコンテンツ同一度管理ＤＢ５２に格納する。格納方法については、上述したコンテンツ同一度管理ＤＢ３２と同様にすればよい。 Next, in step S <b> 25, each of the m similarity / identity calculation units 51-1 calculates content similarity for all combinations of content, and stores the calculation result in the content identity management DB 52. The storage method may be the same as the content identity management DB 32 described above.

これにより、第１の実施形態のコンテンツ同一度管理ＤＢ３２と同内容のデータベースがコンテンツ同一度管理ＤＢ５２として構築される。その後の処理であるステップＳ２６からステップＳコンテンツ同一度算出部３１の処理は、図２のステップＳ９及びステップＳ１４の処理と同じなので説明を省略する。 Thereby, a database having the same content as the content identity management DB 32 of the first embodiment is constructed as the content identity management DB 52. The subsequent processing from step S26 to step S content identity calculation unit 31 is the same as the processing of step S9 and step S14 in FIG.

以上説明した本実施形態は、第１の実施形態と同様の効果を奏する。また、本実施形態では、或るコンテンツの組合せについてメタデータ類似度及びコンテンツ同一度を算出したならば、算出したメタデータ類似度を記憶しておく必要がない。そのため、メタデータ類似度管理ＤＢ２２のようなデータベースを構築する必要がない、という効果も奏する。 The present embodiment described above has the same effect as the first embodiment. In the present embodiment, if the metadata similarity and the content identity are calculated for a certain content combination, it is not necessary to store the calculated metadata similarity. Therefore, there is an effect that it is not necessary to construct a database like the metadata similarity management DB 22.

ただし、本実施形態のような構成にした場合には、第１の実施形態と比較して、システム全体で必要な計算量が増加する。例えば、第１の実施形態であれば、各コンテンツの組合せの、各種類のメタデータについて、メタデータ類似度を算出して保持しておけば、ｍ個のコンテンツ同一度算出部３１は、その算出結果を参照することができる。そのため、ｍ個のコンテンツ同一度算出部３１のそれぞれが、コンテンツ同一度を算出する都度、メタデータ類似度を算出する必要はない。 However, in the case of the configuration as in the present embodiment, the amount of calculation required for the entire system increases as compared with the first embodiment. For example, in the first embodiment, if the metadata similarity is calculated and held for each type of metadata of each content combination, the m content identity calculation units 31 The calculation result can be referred to. Therefore, it is not necessary for each of the m content identity calculation units 31 to calculate the metadata similarity each time the content identity is calculated.

一方で、本実施形態では、ｍ個のコンテンツ同一度算出部３１のそれぞれが、コンテンツ同一度を算出する都度、メタデータ類似度を算出する必要がある。そのため、本実施形態では、第１の実施形態と比較して、システム全体で必要な計算量が増加する。しかしながら、上述したようにメタデータ類似度管理ＤＢ２２のようなデータベースを構築する必要がなくなるという効果があるので、本実施形態を実装する環境に応じて、第１の実施形態と第２の実施形態の内の何れかを選択するようにすればよい。すなわち、記憶装置の容量に余裕があるならば第１の実施形態を実装するようにすればよく、演算処理能力に余裕があるならば第２の実施形態を実装するようにすればよい。 On the other hand, in this embodiment, each of the m content identity calculation units 31 needs to calculate the metadata similarity every time the content identity is calculated. Therefore, in the present embodiment, the amount of calculation required for the entire system increases compared to the first embodiment. However, since there is an effect that it is not necessary to construct a database such as the metadata similarity management DB 22 as described above, the first embodiment and the second embodiment according to the environment in which this embodiment is implemented. Any one of the above may be selected. That is, the first embodiment may be implemented if there is a margin in the capacity of the storage device, and the second embodiment may be implemented if there is a margin in the arithmetic processing capability.

＜第３の実施形態＞
次に、第３の実施形態について説明をする。なお、本実施形態の基本的な構成については、第１の実施形態と同じある。あるいは、本実施形態の基本的な構成を第１の実施形態ではなく、第２の実施形態と同様にしてもよい。
何れにせよ、以下では特に本実施形態特有の点について詳細に説明をし、第１の実施形態や第２の実施形態と共通する部分については、重複する説明を省略する。 <Third Embodiment>
Next, a third embodiment will be described. The basic configuration of this embodiment is the same as that of the first embodiment. Alternatively, the basic configuration of this embodiment may be the same as that of the second embodiment instead of the first embodiment.
In any case, in the following, the points specific to the present embodiment will be described in detail, and overlapping description of portions common to the first embodiment and the second embodiment will be omitted.

上述した各実施形態や、本実施形態では、各事業者が提供する、全てのコンテンツの組合せについて、メタデータ類似度や、コンテンツ同一度を算出して、データベースに格納していた。そのため、メタデータ類似度管理ＤＢ２２やコンテンツ同一度管理ＤＢ３２、あるいはコンテンツ同一度管理ＤＢ５２にて管理するデータ量はコンテンツ数の２乗に比例して増大していた。 In each of the above-described embodiments and this embodiment, the metadata similarity and the content identity are calculated and stored in the database for all content combinations provided by each business operator. For this reason, the amount of data managed by the metadata similarity management DB 22, the content identity management DB 32, or the content identity management DB 52 has increased in proportion to the square of the number of contents.

このように、データ量が増大すると検索対象が多くなるため、同一コンテンツを同定するために時間を要することとなる。また、当然のことながらデータ量が増大すると、より大きな保存領域が必要になる。 Thus, since the search target increases as the amount of data increases, it takes time to identify the same content. As a matter of course, when the amount of data increases, a larger storage area is required.

しかしながら、実際に異なるメディアで配信される同一のコンテンツは、数個程度である。例えば、或るコンテンツが、地上波で放送されると共に、幾つかのサービスで、ビデオオンデマンドで配信されるような場合に１つのコンテンツが３つ程度のメディアで配信される程度である。
そのため、例えば数千〜数万といったコンテンツの全ての組合せについて、メタデータ類似度やコンテンツ同一度を算出したとしても、その算出結果は無駄になる可能性の方が高い。 However, there are about several identical contents that are actually distributed on different media. For example, when a certain content is broadcast on terrestrial waves and distributed on video on demand with some services, one content is distributed on about three media.
Therefore, even if the metadata similarity and content identity are calculated for all combinations of content such as thousands to tens of thousands, for example, the calculation result is more likely to be wasted.

そこで、本実施形態は、所定の基準に基づいて、メタデータ類似度やコンテンツ同一度を算出する対象となるコンテンツの組合せを限定する。そして、これにより、各データベースにて管理するデータ量を削減し、同一コンテンツを迅速に同定できるようにすると共に、大きな保存領域を不要なものとする。 In view of this, the present embodiment limits combinations of content for which the metadata similarity and content identity are to be calculated based on a predetermined criterion. As a result, the amount of data managed in each database is reduced, the same content can be quickly identified, and a large storage area is not required.

今回は、所定の基準の一例として、メタデータ類似度算出部２１−２が各コンテンツの“ｔｉｔｌｅ”という種類のメタデータについてメタデータ類似度を算出し、このメタデータ類似度が閾値以上のコンテンツの組合せのみを対象として、更にメタデータ類似度やコンテンツ同一度を算出する。
つまり、“ｔｉｔｌｅ”という種類のメタデータのメタデータ類似度が閾値未満のコンテンツの組合せについては、“ｔｉｔｌｅ”という種類のメタデータのメタデータ類似度は算出するが、それ以外の種類のメタデータ類似度やコンテンツ同一度は算出しないようにする。 In this example, as an example of a predetermined standard, the metadata similarity calculation unit 21-2 calculates metadata similarity for the type of metadata “title” of each content, and content whose metadata similarity is equal to or greater than a threshold value. Further, the metadata similarity and content identity are calculated only for the combination.
That is, for a combination of contents whose metadata similarity of the type “title” is less than the threshold, the metadata similarity of the type of metadata “title” is calculated, but other types of metadata are calculated. Do not calculate similarity or content identity.

そのために、本実施形態では図５にコンテンツ同定装置１０３として示すように、コンテンツ組合せ管理ＤＢ６０を設ける。そして、まず、何れかのメタデータ類似度算出部２１が何れかの種類のメタデータについてのみメタデータ類似度を算出する。
例えば、メタデータ類似度算出部２１−２が各コンテンツの“ｔｉｔｌｅ”という種類のメタデータについてのみメタデータ類似度を算出する。そして、算出結果に基づいて、“ｔｉｔｌｅ”という種類のメタデータのメタデータ類似度が閾値以上のコンテンツの組合せのみをコンテンツ組合せ管理ＤＢ６０に格納する。
ただし、この方法はあくまで一例であり、コンテンツの組合せ管理ＤＢ６０に登録するコンテンツ組合せの決定方法はこの限りではない。例えば“ｔｉｔｌｅ”と“ｓｕｍｍａｒｙ”のどちらかの類似度がある閾値を上回るコンテンツの組み合わせのみを登録する方法を利用するようにしてもよい。他にも、例えば“ｔｉｔｌｅ”と“ｓｕｍｍａｒｙ”に重み付け値を与えて合算した値がある閾値を上回るコンテンツの組み合わせのみを登録する方法を利用するようにしてもよい。 For this purpose, in this embodiment, a content combination management DB 60 is provided as shown as the content identification device 103 in FIG. First, any metadata similarity calculation unit 21 calculates the metadata similarity only for any kind of metadata.
For example, the metadata similarity calculation unit 21-2 calculates the metadata similarity only for the metadata of the type “title” of each content. Then, based on the calculation result, only a combination of contents whose metadata similarity of the type of metadata “title” is equal to or greater than a threshold value is stored in the content combination management DB 60.
However, this method is merely an example, and the method for determining the content combination to be registered in the content combination management DB 60 is not limited to this. For example, a method of registering only a combination of contents whose similarity between “title” and “summary” exceeds a certain threshold value may be used. In addition, for example, a method of registering only a combination of contents in which a value obtained by adding weighting values to “title” and “summary” exceeds a certain threshold may be used.

そして、メタデータ類似度算出部２１−２以外のメタデータ類似度算出部２１は、コンテンツ組合せ管理ＤＢ６０に格納されているコンテンツの組合せについてのみを対象として、“ｔｉｔｌｅ”以外の種類のメタデータのメタデータ類似度を算出して、メタデータ類似度管理ＤＢ２２に格納する。また、ｍ個のコンテンツ同一度算出部３１は、メタデータ類似度管理ＤＢ２２に格納されているメタデータ類似度に基づいてコンテンツ同一度を算出する。つまり、メタデータ類似度算出部２１−２以外のメタデータ類似度算出部２１とｍ個のコンテンツ同一度算出部３１は、コンテンツ組合せ管理ＤＢ６０に格納されていないコンテンツの組合せについては算出を行わない。 Then, the metadata similarity calculation unit 21 other than the metadata similarity calculation unit 21-2 targets only the combination of contents stored in the content combination management DB 60, and stores metadata of types other than “title”. The metadata similarity is calculated and stored in the metadata similarity management DB 22. The m content identity calculation units 31 calculate the content identity based on the metadata similarity stored in the metadata similarity management DB 22. That is, the metadata similarity calculation unit 21 other than the metadata similarity calculation unit 21-2 and the m content identity calculation units 31 do not calculate a combination of contents that are not stored in the content combination management DB 60. .

以上説明した構成により、本実施形態では算出の対象とするコンテンツの組合せを限定することから、システム全体としての計算量を削減することができる。また、これに伴い各データベースにて管理するデータ量を削減される。そのため、同一コンテンツを迅速に同定できるようにすると共に、大きな保存領域を不要なものとすることが可能となる。 With the configuration described above, the present embodiment limits the combinations of contents to be calculated, so that the amount of calculation for the entire system can be reduced. As a result, the amount of data managed in each database is reduced. Therefore, it is possible to quickly identify the same content and to make a large storage area unnecessary.

なお、本実施形態では、図５に示すように、更に算出結果削減部７０を設けるようにしてもよい。算出結果削減部７０は、コンテンツ取得方法リストに含めるコンテンツの数を限定する部分である。上述の説明では、コンテンツ同一度が高いコンテンツから順に並べることによりリストを作成すると説明していた。しかし、このようにすると、リストに含まれるコンテンツ数が膨大となってしまう。一方で、上述したように、実際に同一のコンテンツは数個程度（例えば、３個）である。そのため、必要以上の数のコンテンツをリストに含ませることは好ましくない。 In the present embodiment, as shown in FIG. 5, a calculation result reduction unit 70 may be further provided. The calculation result reduction unit 70 is a part that limits the number of contents included in the content acquisition method list. In the above description, it has been described that a list is created by arranging contents in descending order of content identity. However, if this is done, the number of contents included in the list becomes enormous. On the other hand, as described above, there are actually several (for example, three) identical contents. Therefore, it is not preferable to include more contents than necessary in the list.

そこで、算出結果削減部７０により、所定個数以上のコンテンツはリストに含まれないようにする。例えば、或るコンテンツについて、コンテンツ同一度が例えば５位までのコンテンツのコンテンツ同一度についてはコンテンツ同一度管理ＤＢ３２に格納しておくが、６位以下のコンテンツのコンテンツ同一度については削除することとする。
このようにすることにより、コンテンツ取得方法リスト作成部４１が作成するコンテンツ取得方法リストに含まれるコンテンツの数を限定することが可能となる。 Therefore, the calculation result reduction unit 70 prevents a predetermined number of contents from being included in the list. For example, for a certain content, the content identity of the content whose content identity is up to 5th is stored in the content identity management DB 32, but the content identity of content of 6th or less is deleted. To do.
By doing so, it is possible to limit the number of contents included in the content acquisition method list created by the content acquisition method list creation unit 41.

＜実施例＞
以上３つの実施形態について説明をした。次に、これら実施形態を実施した場合のより具体的な例を実施例として説明する。今回は、第１の実施形態に対応する構成に基づいて実施例の説明をする。 <Example>
The three embodiments have been described above. Next, more specific examples when these embodiments are implemented will be described as examples. This time, an example will be described based on the configuration corresponding to the first embodiment.

図６−１に、事業者Ａ、事業者Ｂ、及び事業者Ｃが保有するコンテンツのメタデータの例を示す。これらの情報は、メタデータ整形部１１に入力される情報である。
図６−１に示す、“ｔｉｔｌｅ”が「＊＊物語（５）」、「連続テレビドラマ＊＊物語（５）」及び「第５回＊＊物語１０月１８日放送分」のコンテンツは映像・音声が同一のコンテンツとする。また、“ｔｉｔｌｅ”が「５分でわかる＊＊物語」は上記３コンテンツに関連するコンテンツであり、「正午のニュース１０月１８日放送分」は上記３コンテンツとは全く関係のないコンテンツであるとする。 FIG. 6A illustrates an example of content metadata held by the operator A, the operator B, and the operator C. These pieces of information are information input to the metadata shaping unit 11.
The content of “title” is “** Story (5)”, “Continuous TV Drama ** Story (5)” and “5th ** Story October 18 broadcast” shown in FIG.・ The audio content is the same. “Title” is “a story that can be understood in 5 minutes” is a content related to the above-mentioned three contents, and “noon news broadcast on October 18” is a content that has nothing to do with the above-mentioned three contents. And

コンテンツの内容が同一であっても、コンテンツのメタデータの種類は事業者や配信メディアにより異なる場合がある。例えば、“ｔｉｔｌｅ”が「＊＊物語（５）」のメタデータはｔｉｔｌｅ、ｕｒｌ、ｄａｔｅ、ｉｄ、ｄｉｓｔｒｉｂｕｔｏｒであり、「第５回＊＊物語１０月１８日放送分」のメタデータはｔｉｔｌｅ、ｕｒｌ、ｄｉｓｔｒｉｂｕｔｏｒである。また、“ｔｉｔｌｅ”のように、同一内容、同じ種類のメタデータであっても事業者やサービスごとにその内容が異なる場合もある。 Even if the content is the same, the type of content metadata may differ depending on the provider and the distribution media. For example, the metadata of “title” is “** Story (5)” is title, url, date, id, distributor, and the metadata of “5th ** Story October 18 broadcast” is title, url, distributor. Moreover, even if the metadata is the same content and the same type as “title”, the content may be different for each business operator or service.

なお、“ｔｉｔｌｅ”はコンテンツの題名に相当し、“ｕｒｌ”はコンテンツを取得するためのＵＲＬ（ＵｎｉｆｏｒｍＲｅｓｏｕｒｃｅＬｏｃａｔｏｒ）に相当し、“ｄａｔｅ”はコンテンツの放送日時に相当し、“ｉｄ”はコンテンツを識別するための識別子に相当し、“ｄｉｓｔｒｉｂｕｔｏｒ”はコンテンツを提供する事業者を識別するための識別子に相当する。これらのメタデータはあくまで例示であり、これら以外のメタデータが存在していてもよい。例えば、コンテンツの再生時間の長さであったりしてもよい。 Note that “title” corresponds to the title of the content, “url” corresponds to a URL (Uniform Resource Locator) for acquiring the content, “date” corresponds to the broadcast date of the content, and “id” corresponds to the content. “Distributor” corresponds to an identifier for identifying a provider that provides content. These metadata are merely examples, and metadata other than these may exist. For example, it may be the length of the content playback time.

なお、仮に全てのコンテンツにメタデータとして“ｉｄ”が含まれているならば、この“ｉｄ”を用いて同一のコンテンツを同定することができる。しかしながら、各コンテンツを提供する事業者は様々であり、必ずしも“ｉｄ”が含まれている訳ではない。また、仮に“ｉｄ”が含まれていたとしても事業者毎に異なる基準で“ｉｄ”を割り当てているような場合には、この“ｉｄ”を用いても同一のコンテンツは同定できない。 If “id” is included as metadata in all contents, the same contents can be identified using this “id”. However, there are various providers that provide each content, and “id” is not necessarily included. Even if “id” is included, if “id” is assigned based on a different standard for each business operator, the same content cannot be identified using this “id”.

そこで、各実施形態では、１つの種類のメタデータを利用するのみならず、複数の種類のメタデータを利用することにより、同一のコンテンツを同定する精度を高めている。 Therefore, in each embodiment, not only using one type of metadata but also using a plurality of types of metadata increases the accuracy of identifying the same content.

図６−２に、メタデータ管理ＤＢ１２に格納されている整形後のメタデータの例を示す。これは、図６−１に示したメタデータをメタデータ整形部１１が整形・分類したものである。 FIG. 6B shows an example of the metadata after shaping stored in the metadata management DB 12. This is obtained by the metadata shaping unit 11 shaping and classifying the metadata shown in FIG.

例えば、図６−２の“ｓｕｍｍａｒｙ”テーブルには、各コンテンツの“ｓｕｍｍａｒｙ”に相当するメタデータが登録される。また、「第５回＊＊物語１０月１８日放送分」は「第５回＊＊物語」は“ｔｉｔｌｅ”テーブルに、「１０月１８日放送分」は「２０１６１０１８」に整形されて“ｄａｔｅ”テーブルに登録される。 For example, metadata corresponding to “summary” of each content is registered in the “summary” table in FIG. Also, “5th ** Story October 18 Broadcast” is formatted as “title” for “5th ** Story” and “20161018” for “18th October Broadcast”. “Registered in the table.

図６−３に、メタデータ類似度管理ＤＢ２２に格納されているメタデータ類似度の例を示す。これは、ｎ個のメタデータ類似度管理ＤＢ２２それぞれが、図６−２に示した整形後のメタデータを参照して各メタデータ同士を比較し、そのメタデータ類似度を算出してメタデータ類似度管理ＤＢ２２に格納したものである。 FIG. 6C shows an example of the metadata similarity stored in the metadata similarity management DB 22. This is because each of the n metadata similarity management DBs 22 compares the metadata with reference to the shaped metadata shown in FIG. 6B, calculates the metadata similarity, and calculates the metadata. It is stored in the similarity management DB 22.

例えば、“ｔｉｔｌｅ”のメタデータ類似度であるメタデータ類似度Ｓ_２を参照すると「＊＊物語（５）」と「連続テレビドラマ＊＊物語（５）」のメタデータ類似度は「０．９」と高い値となり、「＊＊物語（５）」と「正午のニュース１０月１８日放送分」のメタデータ類似度は「０．１」と低い値となる。 For example, referring to the metadata similarity S ₂ that is the metadata similarity of “title”, the metadata similarity of “** story (5)” and “continuous television drama ** story (5)” is “0. The metadata similarity between “** Story (5)” and “Noon News October 18 broadcast” is a low value of “0.1”.

図６−４に、コンテンツ同一度管理ＤＢ３２に格納されているコンテンツ同一度の例を示す。これは、ｍ個のコンテンツ同一度算出部３１それぞれが、図６−３に示したメタデータ類似度を参照してコンテンツの全組み合わせについてコンテンツ同一度を算出してコンテンツ同一度管理ＤＢ３２に格納したものである。
例えば、コンテンツ同一度算出部３１−１は下記の［数式１］によりコンテンツ同一度Ｉ_１を算出している。 FIG. 6-4 shows an example of content identity stored in the content identity management DB 32. This is because each of the m content identity calculation units 31 calculates the content identity for all combinations of content with reference to the metadata similarity shown in FIG. 6-3 and stores it in the content identity management DB 32. Is.
For example, the content identity calculation unit 31-1 calculates the content identity I _{1 according} to the following [Equation 1].

［数式１］
Ｉ_１＝Ｓ_１＋１．５Ｓ_２＋０．５Ｓ_３＋０．３Ｓ_４ [Formula 1]
I ₁ = S ₁ + 1.5S ₂ + 0.5S ₃ + 0.3S ₄

これは、メタデータの種類毎に算出されたメタデータ類似度Ｓ_１〜Ｓ_４にそれぞれ重み付け値を与えて合算した値をコンテンツ同一度として算出していることを示す。なお、コンテンツ同一度算出部３１−２やコンテンツ同一度算出部３１−３等の他のコンテンツ同一度算出部３１は、メタデータ類似度Ｓ_１〜Ｓ_４に異なる重み付け値を与えてコンテンツ同一度を算出する。 This indicates that a value obtained by adding a weighting value to each of the metadata similarities S _{1 to} S ₄ calculated for each type of metadata is calculated as the content identity. Incidentally, other content identical calculation unit 31, such as contents same calculator 31-2 and content identical calculation unit 31-3, a content identical degree given different weighting value to the metadata similarity S ₁ to S ₄ Is calculated.

図６−１〜図６−４を参照したようにして、各データベースは構築される。
次に、その後のコンテンツ取得方法リストの作成について説明をする。
ユーザは、所望のコンテンツを一意に特定するための情報をコンテンツ取得方法リスト作成部４１に渡す。コンテンツ取得方法リスト作成部４１は、まずメタデータ類似度管理ＤＢ２２を参照して、渡されたコンテンツ情報から特定のコンテンツに解決する。例えば、ユーザがコンテンツを一意に特定するための情報として「＊＊物語（５）」と「１０月１８日放送」を入力したとすると、「コンテンツ取得方法リスト作成機能」は、メタデータ管理ＤＢを参照して、例えばｔｉｔｌｅ”が「＊＊物語（５）」のコンテンツに解決する。 Each database is constructed as described with reference to FIGS.
Next, the creation of the content acquisition method list thereafter will be described.
The user passes information for uniquely specifying the desired content to the content acquisition method list creation unit 41. First, the content acquisition method list creation unit 41 refers to the metadata similarity management DB 22 to resolve specific content from the passed content information. For example, if the user inputs “** Story (5)” and “October 18 broadcast” as information for uniquely specifying the content, the “content acquisition method list creation function” is the metadata management DB. For example, title "is resolved to the content of" ** Story (5) ".

次に、コンテンツ取得方法リスト作成部４１はコンテンツ同一度管理ＤＢ３２を参照して、そのコンテンツの視聴方法リストを作成し、ユーザに渡す。 Next, the content acquisition method list creation unit 41 creates a viewing method list for the content with reference to the content identity management DB 32 and passes it to the user.

例えば、コンテンツ取得方法リスト作成部４１が、コンテンツ同一度算出部３１−１の算出したコンテンツ同一度Ｉ_１を参照したとする。この場合、ユーザが希望するコンテンツは、コンテンツ同一度が高い方から順に“ｔｉｔｌｅ”が「＊＊物語（５）」、「連続テレビ小説＊＊物語（５）」及び「第５回＊＊物語１０月１８日放送分」であると判断され、その順位付けと取得方法のリストとして、これらのコンテンツを取得するためのＵＲＬである“ａｒｉｂ：／／ｌｌｌ”、“ｈｔｔｐ：／／ｍｍｍ”、“ｈｔｔｐ：／／ｏｏｏ”が返される。 For example, it is assumed that the content acquisition method list creation unit 41 refers to the content identity I ₁ calculated by the content identity calculation unit 31-1. In this case, the content desired by the user is “title” (“Story (5)”, “Continuous TV novel ** story (5)”) and “5th ** story” in descending order of content identity. “October 18 broadcast”, and as a list of ranking and acquisition methods, URLs for acquiring these contents are “arib: // lll”, “http: // mmm”, “Http: // oooo” is returned.

また、コンテンツ取得方法リスト作成部４１が、コンテンツ同一度算出部３１−２の算出したコンテンツ同一度Ｉ_２を参照したとする。この場合、ユーザが希望するコンテンツは、コンテンツ同一度が高い方から順に“ｔｉｔｌｅ”が「＊＊物語（５）」、「連続テレビ小説＊＊物語（５）」、「第５回＊＊物語１０月１８日放送分」、「５分でわかる＊＊物語」、「正午のニュース１０月１８日放送分」であると判断され、その順位付けと取得方法のリストとして、これらのコンテンツを取得するためのＵＲＬである“ａｒｉｂ：／／ｌｌｌ”、“ｈｔｔｐ：／／ｍｍｍ”、“ｈｔｔｐ：／／ｏｏｏ”、“ｈｔｔｐ：／／ｎｎｎ”、“ｈｔｔｐ：／／ｐｐｐ”が返される。 The content acquisition method list creation unit 41, and with reference to the content same degree I ₂ calculated content identical calculation unit 31-2. In this case, the contents desired by the user are “title” (“Story (5)”), “Continuous TV novel ** story (5)”, “5th ** story” in order from the highest content identity. “October 18th broadcast”, “5 minutes ** story”, “noon news October 18th broadcast”, and acquire these contents as a list of ranking and acquisition methods URLs “arib: /// llll”, “http: // mmm”, “http: // ooo”, “http: // nnn”, “http: // ppp” are returned.

ユーザは渡されたコンテンツ取得方法リストの中から、自身の状況やデバイスの機能に適した取得方法を選んでコンテンツを楽しむことができる。 The user can enjoy the content by selecting an acquisition method suitable for the user's situation and device function from the content acquisition method list that has been passed.

以上、３つの実施形態とその実施例について説明をした。これら、上述した各実施形態は、それぞれ本発明の好適な実施形態ではあるが、上記した各実施形態のみに本発明の範囲を限定するものではなく、本発明の要旨を逸脱しない範囲において種々の変更を施した形態での実施が可能である。例えば以下に説明するような変更を施すことが可能である。 The three embodiments and examples thereof have been described above. Each of the above-described embodiments is a preferred embodiment of the present invention. However, the scope of the present invention is not limited to only the above-described embodiments, and various modifications can be made without departing from the spirit of the present invention. Implementation in a modified form is possible. For example, it is possible to make changes as described below.

上述した各実施形態では、配信するメディアや配信時間は異なるが、映像及び音声の双方が同一であるコンテンツ同士を同一のコンテンツとしていた。この点、同一のコンテンツの概念を広げ、同一とみなせるコンテンツ同士も、同一のコンテンツ同士として取り扱うようにしてもよい。 In each of the above-described embodiments, the media to be delivered and the delivery time are different, but the content having the same video and audio is the same content. In this regard, the concept of the same content may be expanded, and content that can be regarded as the same may be handled as the same content.

例えば、映像の画質の違いが異なっていても同じコンテンツとしてもよい。例えば、いわゆる４Ｋ放送や８Ｋ放送用のコンテンツと、ネット配信用のコンテンツとは、同じ内容であっても画質が相違する。このような場合であっても、同一のコンテンツとみなすようにしてもよい。また、他にも、例えば、洋画等であって字幕が挿入されているものと、字幕が挿入されていないものも同一のコンテンツとみなすようにしてもよい。更には、例えば日本語により吹き替えが行われているものも同一のコンテンツとみなすようにしてもよい。
ただし、画質や字幕の有無の相違があれば、同一のコンテンツとはみなせないとしてもよい。 For example, the same content may be used even if the image quality differs. For example, content for so-called 4K broadcasting or 8K broadcasting and content for net distribution have different image quality even if they are the same content. Even in such a case, the content may be regarded as the same content. In addition, for example, a foreign film or the like in which subtitles are inserted may be regarded as the same content as that in which no subtitles are inserted. Furthermore, for example, content that is dubbed in Japanese may be regarded as the same content.
However, if there is a difference in image quality or presence / absence of subtitles, it may not be regarded as the same content.

また、他にも、例えば予めユーザが利用可能なデバイスの種類を登録しておき、この登録されたデバイスで取得できるコンテンツのみを、コンテンツ取得方法リストに含ませるようにしてもよい。 In addition, for example, device types that can be used by the user may be registered in advance, and only content that can be acquired by the registered device may be included in the content acquisition method list.

更に、他にも、例えば複数のコンピュータの組合せにより、実施形態中の１つの装置を実現してもよい。また、そうではなく、１つのコンピュータにより、実施形態中の複数の装置を実現してもよい。 In addition, for example, one device in the embodiment may be realized by a combination of a plurality of computers. Alternatively, the plurality of devices in the embodiments may be realized by a single computer.

更に、他にも、上述した各実施形態では、テレビ等で配信される番組をコンテンツとして取り扱ったが、それ以外のコンテンツを対象として上述した各実施形態を実現するようにしてもよい。 In addition, in each of the above-described embodiments, a program distributed on a television or the like is handled as content, but each of the above-described embodiments may be realized for other content.

なお、上記の各実施形態に含まれる装置のそれぞれは、ハードウェア、ソフトウェア又はこれらの組み合わせにより実現することができる。また、上記の各実施形態に含まれる装置のそれぞれが協働することにより行なわれるコンテンツ同定方法も、ハードウェア、ソフトウェア又はこれらの組み合わせにより実現することができる。ここで、ソフトウェアによって実現されるとは、コンピュータがプログラムを読み込んで実行することにより実現されることを意味する。 Note that each of the devices included in each of the above embodiments can be realized by hardware, software, or a combination thereof. Further, the content identification method performed by the cooperation of the devices included in each of the above embodiments can also be realized by hardware, software, or a combination thereof. Here, “realized by software” means realized by a computer reading and executing a program.

プログラムは、様々なタイプの非一時的なコンピュータ可読媒体（ｎｏｎ−ｔｒａｎｓｉｔｏｒｙｃｏｍｐｕｔｅｒｒｅａｄａｂｌｅｍｅｄｉｕｍ）を用いて格納され、コンピュータに供給することができる。非一時的なコンピュータ可読媒体は、様々なタイプの実体のある記録媒体（ｔａｎｇｉｂｌｅｓｔｏｒａｇｅｍｅｄｉｕｍ）を含む。非一時的なコンピュータ可読媒体の例は、磁気記録媒体（例えば、フレキシブルディスク、磁気テープ、ハードディスクドライブ）、光磁気記録媒体（例えば、光磁気ディスク）、ＣＤ−ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＣＤ−Ｒ、ＣＤ−Ｒ／Ｗ、半導体メモリ（例えば、マスクＲＯＭ、ＰＲＯＭ（ＰｒｏｇｒａｍｍａｂｌｅＲＯＭ）、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰＲＯＭ）、フラッシュＲＯＭ、ＲＡＭ（ｒａｎｄｏｍａｃｃｅｓｓｍｅｍｏｒｙ））を含む。また、プログラムは、様々なタイプの一時的なコンピュータ可読媒体（ｔｒａｎｓｉｔｏｒｙｃｏｍｐｕｔｅｒｒｅａｄａｂｌｅｍｅｄｉｕｍ）によってコンピュータに供給されてもよい。一時的なコンピュータ可読媒体の例は、電気信号、光信号、及び電磁波を含む。一時的なコンピュータ可読媒体は、電線及び光ファイバ等の有線通信路、又は無線通信路を介して、プログラムをコンピュータに供給できる。 The program can be stored and provided to a computer using various types of non-transitory computer readable media. Non-transitory computer readable media include various types of tangible storage media. Examples of non-transitory computer-readable media include magnetic recording media (for example, flexible disks, magnetic tapes, hard disk drives), magneto-optical recording media (for example, magneto-optical disks), CD-ROMs (Read Only Memory), CD- R, CD-R / W, and semiconductor memory (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, RAM (random access memory)). The program may also be supplied to the computer by various types of transitory computer readable media. Examples of transitory computer readable media include electrical signals, optical signals, and electromagnetic waves. The temporary computer-readable medium can supply the program to the computer via a wired communication path such as an electric wire and an optical fiber, or a wireless communication path.

１０メタデータ管理手段
１１メタデータ整形部
１２メタデータ管理ＤＢ
２０メタデータ類似度管理手段
２１メタデータ類似度算出部
２２メタデータ類似度管理ＤＢ
３０コンテンツ同一度管理手段
３１コンテンツ同一度算出部
３２コンテンツ同一度管理ＤＢ
４０コンテンツ取得方法リスト作成手段
４１コンテンツ取得方法リスト作成部
５０類似度・同一度管理手段
５１類似度・同一度算出部
５２コンテンツ同一度管理ＤＢ
６０コンテンツ組合せ管理ＤＢ
７０算出結果削減部
１０１、１０２、１０３コンテンツ同定装置 10 Metadata management means 11 Metadata shaping unit 12 Metadata management DB
20 Metadata Similarity Management Unit 21 Metadata Similarity Calculation Unit 22 Metadata Similarity Management DB
30 Content identity management means 31 Content identity calculation unit 32 Content identity management DB
40 Content acquisition method list creation unit 41 Content acquisition method list creation unit 50 Similarity / identity management unit 51 Similarity / identity calculation unit 52 Content identity management DB
60 Content combination management DB
70 Calculation Result Reduction Units 101, 102, 103 Content Identification Device

Claims

For each of a plurality of contents, metadata that is data indicating the characteristics of the contents attached to the contents is acquired based on a predetermined metadata acquisition method, and the acquired metadata is classified for each type of metadata. Metadata management means to be stored;
Similarity calculation means for calculating the similarity between the metadata for each type of the metadata accumulated by the metadata management means;
On the basis of the similarity calculated by the similarity calculation means, the degree of similarity calculation means for calculating the degree of identity between the content and other content other than the content;
And a list creation unit that creates a list of metadata including a content acquisition method that is specified as the same content as the specified content, based on the degree of identity between the content calculated by the identity level calculation unit. Content identification device.

The content identification device according to claim 1,
The list creation means includes:
When a list request including content specification is received from a user, the content specified as the same content as the content specified by the list request is based on the degree of identity between the content calculated by the identity level calculation means. A content identification device that creates a metadata list including an acquisition method and outputs the created list to the user as a response to the list request.

The content identification device according to claim 1 or 2,
The degree-of-identity calculation means gives different weights to the similarities between the metadata calculated by the similarity calculation means, depending on the type of the metadata,
The degree-of-identity calculation unit is a content identification device that calculates the degree of identity between contents based on a value obtained by adding the similarities of the weighted metadata in units of contents.

The content identification device according to any one of claims 1 to 3,
The degree-of-identity calculation means includes a plurality of degree-of-identity calculation units for calculating the degree of identity between the contents.
The plurality of identity calculation units is a content identification device in which each of the plurality of identity calculation units calculates the degree of identity between the contents by different calculation methods.

The content identification device according to any one of claims 1 to 4,
It further includes content combination management means,
The similarity calculation means calculates the similarity between the metadata for any of the types of metadata managed by the metadata management means, and the calculated similarity is equal to or greater than a predetermined reference. A combination of contents corresponding to each metadata is stored in the content combination management means,
For the combination of contents not stored in the content combination management means, the calculation of the similarity between the metadata by the similarity calculation means, the calculation of the degree of identity between the contents by the identity calculation means, Content identification device that does not perform

A program for causing a computer to function as the content identification device according to any one of claims 1 to 5.