JP2000276478A

JP2000276478A - Method and device for detecting time-series data and recording medium where program thereof is recorded

Info

Publication number: JP2000276478A
Application number: JP11079879A
Authority: JP
Inventors: Takashi Yamana; 岳志山名; Takashi Shirotani; 貴志城谷; Hisazumi Tsuchida; 尚純土田; Yasuhiro Hirano; 泰宏平野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1999-03-24
Filing date: 1999-03-24
Publication date: 2000-10-06

Abstract

PROBLEM TO BE SOLVED: To speedily and securely detection object time-series data to be detected by matching the electronic watermarks of input data and original data and determining matching data when similar data are found through a matching process between the input data and original data. SOLUTION: When a specific CM is retrieved from a broadcast signal like a CM, the original data of the CM is recorded previously in an original data storage part 141, electronic watermark information embedded in the original data of the CM is recorded in an electronic watermark storage part 142, and a retrieved broadcast radio wave is recorded in a broadcast signal storage part 143. Then the original data and broadcast contents are made into feature signals by a feature signal process part 131, and the similarities of all original data feature signals are calculated, clustered, and recorded in an original data similarity storage part 147. Then a matching process part 133 performs a matching processing between the feature signals of the broadcast signal and original data and collates the electronic watermark read out of the original data position in the broadcast signal with the electronic watermark of the original data when there are similar data to determine matching data.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は時系列データの中か
ら、あらかじめ登録された時系列データを的確に検出す
る時系列データ検出方法と装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a time series data detection method and apparatus for accurately detecting time series data registered in advance from time series data.

【０００２】[0002]

【従来の技術】入力された時系列データの中から、あら
かじめ登録された時系列データを的確に検出することが
求められる場合がある。例えば、放送の音響または映
像、もしくはその両方の信号の中から特定のコマーシャ
ルが放送された部分の検出を行ってコマーシャルが指定
通り放送されたかどうかの検証を自動的に行ったり、特
定の音楽や映像を検出して映像や音楽の著作物の利用を
自動的に監視したりする場合である。2. Description of the Related Art In some cases, it is required to accurately detect previously registered time-series data from input time-series data. For example, in a sound or video of a broadcast, or a signal of both, a part where a specific commercial is broadcast is detected to automatically verify whether the commercial is broadcast as specified, or a specific music or video is detected. This is a case where a video is detected and the use of a video or music work is automatically monitored.

【０００３】従来、時系列データから目的とする時系列
データを検出する方法としては、「電子透かし」を用い
て検出する方法と「マッチング」による方法とがあっ
た。Conventionally, methods for detecting target time-series data from time-series data include a method using "digital watermark" and a method using "matching".

【０００４】「電子透かし」による方法は、データの身
元認証やオリジナルの保証のための識別子（以後、ＩＤ
と呼ぶ）をデータに埋め込み、それを抽出することによ
り検出するものである。この方法は、原データに対する
ＩＤ埋め込み処理が必要であるが、ＩＤが確認される限
り正確な検出が可能である。[0004] The method using the "digital watermark" is an identifier (hereinafter referred to as ID) for authenticating the identity of the data or guaranteeing the original.
) Is embedded in the data and extracted to extract it. This method requires ID embedding processing for the original data, but accurate detection is possible as long as the ID is confirmed.

【０００５】また、「マッチング」による方法は、基本
的には、目的とする時系列データと検出対象である時系
列データ（音声や映像）の波形（あるいは特徴量）を比
較して行き、類似度が一定値（以後、この一定値をマッ
チングしきい値と呼ぶ）以上の場合、検出対象である時
系列データが存在するとしてその同一性を確認する方法
である。[0005] The method based on "matching" basically compares the waveform (or characteristic amount) of the target time-series data with the time-series data (audio or video) to be detected, and compares them. When the degree is equal to or more than a certain value (hereinafter, this certain value is referred to as a matching threshold value), it is a method of confirming the identity of time-series data to be detected as existing.

【０００６】[0006]

【発明が解決しようとする課題】しかし、電子透かしを
用いた検出システムの場合、電子透かしの出現場所が不
明なため、時系列データすべてにわたって電子透かしを
調べる必要があり、大量の時系列データからの検出の場
合、多くの処理時間を要するという問題がある。また、
マッチングを用いた検索システムの場合にも、時系列デ
ータのすべてにわたってマッチング処理を行う必要があ
る。但し、特願平１０−１５１７２３号に記載のような
アクティブ検索手法等が開発され、処理は高速化されて
おり実用性は向上しているが、本方法は、その処理方式
の限界から、似たような波形（特徴量）を持つデータを
見分けることが困難であるという問題点がある。例えば
ＣＭの場合、入力データの中に調べたいＣＭの原データ
の他に、調べたいＣＭの原データと比べてある部分が少
し変わっているＣＭの原データ、例えば、ほとんどのＣ
Ｍ内容は同じだが対象の視聴者に応じてある部分の言葉
を変えるとか、簡易版のＣＭとするため同じＣＭからあ
る部分のシーンを削除するとかのＣＭの原データも使用
されている場合があり、その場合マッチング処理では区
分することが難しく、全てを調べたいＣＭの原データと
して検出てしまう可能性がある。However, in the case of a detection system using a digital watermark, since the appearance position of the digital watermark is unknown, it is necessary to check the digital watermark over all the time-series data. There is a problem in that detection requires a lot of processing time. Also,
Even in the case of a search system using matching, it is necessary to perform matching processing on all time-series data. However, an active search method and the like described in Japanese Patent Application No. 10-151723 have been developed, and the processing has been speeded up and the practicability has been improved. However, this method is similar due to the limitations of the processing method. There is a problem that it is difficult to distinguish data having such a waveform (feature amount). For example, in the case of a CM, in addition to the original data of the CM to be examined in the input data, the original data of the CM in which a part is slightly different from the original data of the CM to be examined, for example, most C
In some cases, the original data of the CM is used, such as changing the words of a certain part according to the target audience, or deleting a certain part of the scene from the same CM to make it a simplified version of the CM. In such a case, it is difficult to perform the division by the matching process, and there is a possibility that the whole is detected as the original data of the CM to be examined.

【０００７】図７はＣＭ出稿確認を例としたマッチング
による検出方法の問題点を説明するための模式図であ
り、（ａ）は検出前の状態、、（ｂ）は検出後の状態を
示す。検出の対象となる入力データの集合７１の中に検
出されるべきＣＭの集合７２が存在する。マッチングの
結果楕円で示される集合がマッチングにより検出された
ＣＭ７３であるが、この中にはマッチングにより正しく
検出されたＣＭ７４とマッチングによって誤って検出さ
れたＣＭ７６とが存在する。一方検出されるべきＣＭ７
２から正しく検出されたＣＭ７４を除いた部分は検出さ
れなかったＣＭ７５として残ってしまう。FIGS. 7A and 7B are schematic diagrams for explaining a problem of a detection method by matching using a CM submission confirmation as an example. FIG. 7A shows a state before detection, and FIG. 7B shows a state after detection. . A set 72 of CMs to be detected exists in a set 71 of input data to be detected. The set represented by the ellipse as a result of the matching is the CM 73 detected by the matching, and the CM 73 includes the CM 74 correctly detected by the matching and the CM 76 erroneously detected by the matching. CM7 to be detected on the other hand
The portion excluding the CM 74 that has been correctly detected from No. 2 remains as a CM 75 that has not been detected.

【０００８】本発明は上述の両方式の問題点を解決する
ためのもので、その目的は時系列データのすべての領域
に対して、迅速にかつ確実に検出対象とする時系列デー
タを検出するための時系列データ検出方法と検出装置と
そのプログラムを記録した記録媒体を提供することにあ
る。An object of the present invention is to solve the above-mentioned problems of both types, and an object thereof is to quickly and surely detect time-series data to be detected in all regions of the time-series data. To provide a time-series data detection method and a detection device for the same and a recording medium on which the program thereof is recorded.

【０００９】[0009]

【課題を解決するための手段】本発明の時系列データの
検出方法は、検索しようとする時系列データ（原デー
タ）を入力された時系列データ（入力データ）から検出
する時系列データの検出方法であって、原データと入力
データには、原データを一意に判別可能な電子透かし情
報が埋め込まれている。検索しようとする原データを入
力し、特徴信号化処理して原データ特徴信号として格納
し、所定のしきい値を用いて全ての原データ特徴信号を
マッチング処理して類似度計算を行って原データ類似度
テーブルとして格納し、原データの電子透かし情報を入
力して格納して、必要な原データ関連資料を完成させ
る。入力データを入力し、特徴信号化処理して入力デー
タ特徴信号として格納して必要な入力データ関連資料を
完成させる。A method for detecting time-series data according to the present invention comprises detecting time-series data in which time-series data to be searched (original data) is detected from input time-series data (input data). In this method, digital watermark information capable of uniquely identifying the original data is embedded in the original data and the input data. The original data to be searched is input, converted into a feature signal, stored as an original data feature signal, and all original data feature signals are matched using a predetermined threshold to calculate similarity. The data is stored as a data similarity table, and digital watermark information of the original data is input and stored to complete necessary original data related data. Input data is input, processed into a feature signal, and stored as an input data feature signal to complete necessary input data related data.

【００１０】入力データ特徴信号と、原データ特徴信号
との間で、所定のしきい値を用いてマッチング処理を行
って入力データ内の原データのデータ数とデータ位置と
を検出し、原データ類似度テーブルのしきい値に対応す
るクラスタに類似データがある場合は、入力データ内の
検出された原データ位置に埋め込まれた電子透かしを読
み出して原データの電子透かし情報と照合して類似デー
タを排除し、入力データ内の原データのデータ数とデー
タ位置とを確定させる。A matching process is performed between the input data characteristic signal and the original data characteristic signal using a predetermined threshold to detect the number and position of the original data in the input data. If there is similar data in the cluster corresponding to the threshold value of the similarity table, the digital watermark embedded in the detected original data position in the input data is read out and compared with the digital watermark information of the original data to obtain similar data. Is eliminated, and the data number and data position of the original data in the input data are determined.

【００１１】さらに、入力データ関連資料に入力データ
における原データの予定位置関係を示すタイムスケジュ
ールを格納し、マッチング処理で検出された原データ数
がタイムスケジュ−ルから算定された入力データに含ま
れる予定の原データ数より少なければ、検出された原デ
ータ数がタイムスケジュ−ルから算定された入力データ
に含まれる予定の原データ数以上となるまで、マッチン
グしきい値を低い値に変更してマッチング処理を繰り返
し行って、入力データ内の原データのデータ数とデータ
位置とを検出してもよい。Further, a time schedule indicating the expected positional relationship of the original data in the input data is stored in the input data-related material, and the number of the original data detected by the matching process is included in the input data calculated from the time schedule. If the number of original data is less than the planned number of original data, the matching threshold is changed to a lower value until the detected number of original data is equal to or greater than the planned number of original data included in the input data calculated from the time schedule. The number of data and the data position of the original data in the input data may be detected by repeatedly performing the matching process.

【００１２】さらに、マッチング処理を行った結果とし
きい値のデータとを蓄積し分析することで、マッチング
処理に対応したマッチングしきい値を適切に設定し、入
力データ内の原データのデータ数とデータ位置とを検出
してもよい。Further, by accumulating and analyzing the result of performing the matching process and the data of the threshold value, a matching threshold value corresponding to the matching process is appropriately set, and the number of original data in the input data is reduced. The data position may be detected.

【００１３】本発明の時系列データ検出装置は、検索し
ようとする時系列データ（原データ）を入力された時系
列データ（入力データ）から検出する時系列データ検出
装置であって、原データ入力部と原データ格納部、電子
透かし入力部と電子透かし格納部、入力データ入力部と
入力データ格納部、タイムスケジュール入力部とタイム
スケジュール格納部、特徴信号化処理部、原データ特徴
信号格納部、入力データ特徴信号格納部、類似度計算
部、原データ類似度格納部、マッチング処理部、出現回
数判定部、類似度判定部、透かし抽出／判定部、判定結
果出力部、および制御部を備えている。A time-series data detecting apparatus according to the present invention is a time-series data detecting apparatus for detecting time-series data (original data) to be searched from input time-series data (input data). Unit and original data storage unit, digital watermark input unit and digital watermark storage unit, input data input unit and input data storage unit, time schedule input unit and time schedule storage unit, feature signal processing unit, original data feature signal storage unit, An input data feature signal storage unit, a similarity calculation unit, an original data similarity storage unit, a matching processing unit, an appearance frequency determination unit, a similarity determination unit, a watermark extraction / determination unit, a determination result output unit, and a control unit I have.

【００１４】原データ入力部は原データを入力して、原
データ格納部に格納し、電子透かしデータ入力部は原デ
ータに埋め込まれている電子透かしデータを入力して、
その埋め込み先データとともに電子透かしデータ格納部
に格納し、入力データ入力部は入力データを入力して、
入力データ格納部に格納し、タイムスケジュール入力部
は入力データのタイムスケジュールを入力して、タイム
スケジュール格納部に格納する。The original data input unit inputs the original data and stores it in the original data storage unit. The digital watermark data input unit inputs the digital watermark data embedded in the original data.
The data is stored in the digital watermark data storage together with the embedding destination data, and the input data input unit inputs the input data,
The data is stored in the input data storage unit, and the time schedule input unit inputs the time schedule of the input data and stores the input data in the time schedule storage unit.

【００１５】特徴信号化処理部は原データに特徴信号化
処理を行って原データ特徴信号を生成して原データ特徴
信号格納部に格納し、入力データに特徴信号化処理を行
って入力データ特徴信号を生成して入力データ特徴信号
格納部に格納し、類似度計算部は、所定のしきい値に基
づき、マッチング処理により原データ特徴信号間の類似
度を計算し、類似の特徴をもつ原データが同一クラスタ
になるようにクラスタリング処理を行い、そのクラスタ
情報とともに原データ類似度格納部へ格納する。The feature signal processing unit performs a feature signal processing on the original data to generate an original data feature signal, stores the signal in the original data feature signal storage unit, performs a feature signal processing on the input data, and executes the input data feature processing. A signal is generated and stored in an input data feature signal storage unit, and a similarity calculation unit calculates a similarity between the original data feature signals by a matching process based on a predetermined threshold value, and generates an original signal having a similar feature. The clustering process is performed so that the data becomes the same cluster, and the data is stored in the original data similarity storage unit together with the cluster information.

【００１６】マッチング処理部では、原データ特徴信号
と入力データ特徴信号とを所定のしきい値を用いてマッ
チング処理を行い、出現回数判定部では、原データのタ
イムスケジュール上での出現予定回数とマッチング処理
による原データの出現回数とを比較し、必要に応じマッ
チング処理部のしきい値を変更させ、類似度判定部で
は、最終的に選択されたしきい値に対応する原データの
クラスタリング情報を読み出し、そのクラスタに存在す
るデータが、原データ１個のみか、類似データが存在す
るかを判定しその結果を透かし抽出判定部に送出し、透
かし抽出判定部では、原データ１個のみの場合はマッチ
ング処理部のマッチング処理結果を判定結果出力部へ出
力し、類似データが存在する場合は、原データの電子透
かしを読み出し、入力データのマッチング処理結果の原
データの位置に埋め込まれた電子透かしを読み出して照
合し、同定された原データのマッチング回数と位置情報
を判定結果出力部に出力し、判定結果出力部は判定結果
を外部に出力し、制御部は各部の処理を制御して実行さ
せる。The matching processing unit performs a matching process between the original data characteristic signal and the input data characteristic signal using a predetermined threshold value, and the appearance number determination unit determines the expected number of appearances of the original data on the time schedule. Compare the number of appearances of the original data by the matching process, change the threshold value of the matching processing unit as necessary, and the similarity determination unit determines the clustering information of the original data corresponding to the finally selected threshold value. Is read, and it is determined whether there is only one original data or similar data exists in the cluster, and the result is sent to the watermark extraction determining unit. In this case, the matching processing result of the matching processing unit is output to the judgment result output unit. If similar data exists, the digital watermark of the original data is read out and input. The digital watermark embedded in the position of the original data of the data matching processing result is read and collated, and the number of times of matching of the identified original data and the position information are output to the judgment result output unit. The data is output to the outside, and the control unit controls and executes the processing of each unit.

【００１７】さらに記録媒体を備え、制御部の動作は、
その記録媒体に記録された時系列データ検出システム制
御プログラムにより制御できてもよい。Further, a recording medium is provided, and the operation of the control unit is as follows.
The control may be performed by a time-series data detection system control program recorded on the recording medium.

【００１８】本発明のプログラムを記録した記録媒体
は、検索しようとする時系列データ（原データ）を入力
された時系列データ（入力データ）から検出するための
制御プログラムを記録する。The recording medium on which the program of the present invention is recorded records a control program for detecting time-series data (original data) to be searched from input time-series data (input data).

【００１９】原データに対して一意なＩＤが電子透かし
情報として埋め込まれ、入力データ内のすべての原デー
タが電子透かし情報により識別可能な状態となってお
り、原データの集合に対して、すべての原データ間の類
似度を計算され、この類似度の計算の際には、マッチン
グの処理で用いられているのと同一の手法が用いられ、
計算後、類似度がクラスタリングしきい値以上のものに
ついて、類似データとみなされる。A unique ID is embedded in the original data as digital watermark information, and all the original data in the input data can be identified by the digital watermark information. The similarity between the original data is calculated. In calculating the similarity, the same method as that used in the matching process is used.
After the calculation, the data whose similarity is equal to or larger than the clustering threshold value is regarded as similar data.

【００２０】基本的な検索は、類似度をしきい値とする
マッチング手法により行い、類似データが存在する原デ
ータについてのみ電子透かしを用いた検証をすることと
なっている。この場合のしきい値は処理内容に応じ適応
的に変更可能となっている。適応的な変更とは、例え
ば、入力データ内の検出すべき時系列データ数があらか
じめわかっている場合には、その検出数以上の数の検出
があるまでしきい値を下げる方法であり、未検出の原デ
ータの発生を防止できる。The basic search is performed by a matching method using the similarity as a threshold value, and only original data having similar data is verified using a digital watermark. In this case, the threshold value can be adaptively changed according to the processing content. The adaptive change is a method in which, for example, when the number of time-series data to be detected in the input data is known in advance, the threshold is lowered until the number of detected data is equal to or larger than the detected number. Generation of original data for detection can be prevented.

【００２１】電子透かしの抽出の際には、走査すべき時
系列データ中の位置が、マッチング方法により導かれて
いることから、容易な検証が可能となる。At the time of extracting the digital watermark, the position in the time-series data to be scanned is derived by the matching method, so that the verification can be easily performed.

【００２２】これらの特徴によって、電子透かしのみを
用いた場合の透かし抽出時の冗長処理を防ぎつつ、マッ
チングのみを用いた場合には区別が困難な類似原データ
の区別を実現することができ、時系列データ中のデータ
検索が、比較的低コストで高速に実現可能となる。With these features, it is possible to prevent the redundant processing at the time of watermark extraction when only the digital watermark is used, and to distinguish the similar original data which is difficult to distinguish when only the matching is used. Data retrieval from time-series data can be realized at relatively low cost and at high speed.

【００２３】[0023]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面を参照して説明する。本発明の実施の形態は入力
データが放送信号の場合であるが入力信号は放送信号に
限られるものではない。図１は本発明の第１の実施の形
態の時系列データ検出装置のブロック構成図である。Next, embodiments of the present invention will be described with reference to the drawings. The embodiment of the present invention is a case where the input data is a broadcast signal, but the input signal is not limited to the broadcast signal. FIG. 1 is a block diagram of a time-series data detection device according to a first embodiment of the present invention.

【００２４】本発明の第１の実施の形態の時系列データ
検出装置１００は、使用される可能性のあるすべての原
データを入力する原データ入力部１１１と入力した原デ
ータを格納する原データ格納部１４１、原データに埋め
込まれている電子透かしデータを入力する電子透かしデ
ータ入力部１１２と入力した電子透かしデータをその埋
め込み先データとともに格納する電子透かしデータ格納
部１４２、放送信号を入力する放送信号入力部１１３と
入力した放送信号を格納する放送信号格納部１４３、放
送のタイムスケジュールを入力するタイムスケジュール
入力部１１４と入力したタイムスケジュールを格納する
タイムスケジュール格納部１４４、特徴信号化処理部１
３１、原データ特徴信号格納部１４５、放送特徴信号格
納部１４６、類似度計算部１３２、原データ類似度格納
部１４７、マッチング処理部１３３、出現回数判定部１
３４、類似度判定部１３５、透かし抽出／判定部１３
６、判定結果出力部１２１、および制御部１３８を備え
る。The time-series data detecting apparatus 100 according to the first embodiment of the present invention includes an original data input unit 111 for inputting all the original data that may be used and an original data for storing the input original data. A storage unit 141, a digital watermark data input unit 112 for inputting digital watermark data embedded in the original data, a digital watermark data storage unit 142 for storing the input digital watermark data together with the embedding destination data, a broadcast for inputting a broadcast signal A signal input unit 113 and a broadcast signal storage unit 143 for storing the input broadcast signal, a time schedule input unit 114 for inputting a broadcast time schedule, a time schedule storage unit 144 for storing the input time schedule, and a feature signal processing unit 1
31, an original data feature signal storage unit 145, a broadcast feature signal storage unit 146, a similarity calculation unit 132, an original data similarity storage unit 147, a matching processing unit 133, an appearance number determination unit 1
34, similarity determination unit 135, watermark extraction / determination unit 13
6, a determination result output unit 121 and a control unit 138.

【００２５】特徴信号化処理部１３１は原データ格納部
１４１から原データを取り込み特徴信号化処理を行って
原データ特徴信号を生成して原データ特徴信号格納部１
４５に格納し、放送信号格納部１４１から放送信号を取
り込み特徴信号化処理を行って放送特徴信号を生成して
放送特徴信号格納部１４６に格納する。The characteristic signal processing section 131 takes in the original data from the original data storage section 141 and performs a characteristic signal processing to generate an original data characteristic signal.
The broadcast feature signal is stored in the broadcast feature signal storage unit 146, the broadcast signal is taken in from the broadcast signal storage unit 141, the feature signal is converted into a feature signal, and the broadcast feature signal is generated.

【００２６】類似度計算部１３２は原データ特徴信号格
納部１４５から原データ特徴信号を読み出して、所定の
クラスタリングしきい値に基づき、マッチング処理によ
って格納された原データ間の類似度を計算し、類似の特
徴をもつ原データが同一クラスタになるようにクラスタ
リング処理を行い、そのクラスタ情報とともに原データ
類似度格納部１４７へ格納する。ただし、ここでのクラ
スタリングしきい値は、後述のマッチングで用いるマッ
チングしきい値と同じ値とし、マッチングしきい値を複
数個用いる場合は、それに応じたクラスタリングを行う
こととし、複数個のクラスタリング情報が記録される。The similarity calculation unit 132 reads the original data characteristic signal from the original data characteristic signal storage unit 145, and calculates the similarity between the original data stored by the matching process based on a predetermined clustering threshold. The clustering process is performed so that the original data having similar characteristics are in the same cluster, and the original data is stored in the original data similarity storage unit 147 together with the cluster information. However, the clustering threshold here is the same as the matching threshold used in the matching described later, and when a plurality of matching thresholds are used, the clustering is performed in accordance with the matching threshold. Is recorded.

【００２７】マッチング処理部１３３では、原データ特
徴信号格納部１４５から検索したい原データ特徴信号
を、放送特徴信号格納部１４６から検索の対象となる放
送特徴信号を読み出し、原データ特徴信号と放送特徴信
号とを所定のマッチングしきい値を用いてマッチング処
理をおこなう。マッチング処理は、上述のように基本的
には、目的とする時系列データ（原データ）と検出対象
である時系列データ（入力データ）の波形あるいは特徴
量を比較して行き、類似度が一定値（マッチングしきい
値）以上の場合、入力データに検出対象である原データ
が存在するとしてその同一性を確認する方法である。The matching processing unit 133 reads an original data characteristic signal to be searched from the original data characteristic signal storage unit 145, reads a broadcast characteristic signal to be searched from the broadcast characteristic signal storage unit 146, and outputs the original data characteristic signal and the broadcast characteristic. The signal and the signal are subjected to a matching process using a predetermined matching threshold. In the matching process, basically, as described above, the waveform or feature amount of the target time-series data (original data) and the time-series data (input data) to be detected are compared, and the similarity is constant. If the value is equal to or more than the value (matching threshold), it is assumed that the original data to be detected exists in the input data and the identity thereof is confirmed.

【００２８】出現回数判定部１３４では、タイムスケジ
ュール格納部１４４からタイムスケジュールを読み出し
て原データのスケジュール上での出現予定回数とマッチ
ング処理部１３３のマッチング処理による原データの出
現回数とを比較し、マッチング処理による原データの出
現回数がスケジュール上の出現回数より少ない場合はマ
ッチング処理部のマッチングしきい値を低い値に変更さ
せてマッチング処理を繰り返し、マッチング処理による
原データの出現回数がスケジュール上の出現予定回数以
上となった状態でそのマッチング回数と位置情報とを透
かし抽出判定部１３６に送出する。The appearance number determination unit 134 reads the time schedule from the time schedule storage unit 144 and compares the number of appearances of the original data on the schedule with the number of appearances of the original data by the matching processing of the matching processing unit 133. If the number of appearances of the original data by the matching process is less than the number of occurrences on the schedule, the matching threshold is changed to a low value in the matching processing unit, and the matching process is repeated. When the number of appearances is equal to or more than the number of appearances, the matching number and the position information are sent to the watermark extraction determination unit 136.

【００２９】類似度判定部１３５では、原データ類似度
格納部１４７より、最終的に選択されたマッチングしき
い値に対応する原データのクラスタリング情報を読み出
し、そのクラスタに存在するデータが、原データ１個の
みか、類似データが存在するかを判定し、その結果を透
かし抽出判定部１３６に送出する。The similarity determination unit 135 reads the clustering information of the original data corresponding to the finally selected matching threshold from the original data similarity storage unit 147, and determines the data existing in the cluster as the original data. It is determined whether there is only one or similar data, and the result is sent to the watermark extraction determination unit 136.

【００３０】透かし抽出判定部１３６では、原データ１
個のみの場合はマッチング処理部１３３のマッチング処
理結果を判定結果出力部１２１へ出力する。類似データ
が存在する場合は、電子透かし格納部１４２より原デー
タの電子透かしを読み出し、放送信号格納部１４３から
放送信号を読み込み、マッチング処理部１３３のマッチ
ング処理結果の原データ位置情報から、放送信号のその
位置に埋め込まれた電子透かしを読み出し、電子透かし
格納部１４２より読み出した原データの電子透かしと対
応させて原データの同定を行い、同定された原データの
マッチング回数と位置情報とを判定結果出力部１２１に
出力する。In the watermark extraction determining section 136, the original data 1
If there is only one, the matching processing result of the matching processing unit 133 is output to the determination result output unit 121. When similar data exists, the digital watermark of the original data is read from the digital watermark storage unit 142, the broadcast signal is read from the broadcast signal storage unit 143, and the broadcast signal is read from the original data position information of the matching processing result of the matching processing unit 133. Of the original data read from the digital watermark storage unit 142, and identifies the original data in correspondence with the digital watermark of the original data, and determines the number of times of matching of the identified original data and the position information. Output to the result output unit 121.

【００３１】判定結果出力部１２１は判定結果を外部に
出力する。制御部１３８は各部の処理を制御して実行さ
せる。The judgment result output section 121 outputs the judgment result to the outside. The control unit 138 controls and executes the processing of each unit.

【００３２】放送信号から任意の音楽を検出するように
入力データ内の原データの有無が不確定な検索のみを行
う場合は、タイムスケジュール格納部１４４と出現回数
判定部１３４とを省略してもよい。When performing only a search for which the presence / absence of the original data in the input data is uncertain so as to detect any music from the broadcast signal, the time schedule storage section 144 and the number-of-appearance determination section 134 may be omitted. Good.

【００３３】次に本発明の第１の実施の形態の時系列デ
ータの検出方法について説明する。図２は本発明の第１
の実施の形態の時系列データ検出方法の準備段階のフロ
ーチャートであり、（ａ）は原データ関連資料蓄積工程
のフローチャート、（ｂ）は放送信号入力工程のフロー
チャートである。図３は本発明の第１の実施の形態の時
系列データ検出方法の検出工程のフローチャートであ
る。Next, a method of detecting time-series data according to the first embodiment of the present invention will be described. FIG. 2 shows the first embodiment of the present invention.
5A is a flowchart of a preparation stage of the time-series data detection method according to the embodiment, FIG. 4A is a flowchart of an original data-related material accumulation step, and FIG. 4B is a flowchart of a broadcast signal input step. FIG. 3 is a flowchart of a detection process of the time-series data detection method according to the first embodiment of the present invention.

【００３４】先ず原データ関連資料蓄積工程について説
明する。検出の対象となる入力データに含まれる可能性
のある原データ、例えば原データがＣＭの場合、放送の
可能性のあるＣＭの原データを全て登録する。原データ
関連資料は検出したい原データに対して原データに類似
のデータを含めて検出の都度作成してもよいし、あらか
じめ放送の可能性のあるすべての原データ、例えばＣＭ
のすべての原データをデータベースとして作成してお
き、都度所望の原データに関する部分を読み出して利用
してもよい。First, the original data related material storing step will be described. If the original data that may be included in the input data to be detected, for example, the original data is a CM, all the original data of the CM that may be broadcast are registered. The original data-related materials may be created each time the original data to be detected is included, including data similar to the original data, or all the original data that may be broadcast, such as CM
May be created as a database, and a portion related to desired original data may be read and used each time.

【００３５】原データ関連資料としては、原データを特
徴信号化した原データ特徴信号、原データの電子透か
し、および登録されたすべての原データを複数の所定の
しきい値でマッチング処理を行ってクラスタリングを行
って作成した原データとしきい値による原データ類似度
テーブルを作成する。As the original data-related data, the original data characteristic signal obtained by converting the original data into a characteristic signal, the electronic watermark of the original data, and all registered original data are subjected to matching processing with a plurality of predetermined threshold values. An original data similarity table is created based on the original data created by performing clustering and a threshold.

【００３６】処理を開始すると（Ｓ１０１）、原データ
を入力して原データ格納部１４１に格納し（Ｓ１０
２）、入力した原データを特徴信号化処理して（Ｓ１０
３）、原データ特徴信号格納部１４５に格納し（Ｓ１０
４）、マッチング処理の際使用する複数のしきい値と同
じしきい値をクラスタリングしきい値とし、それぞれの
しきい値を用いて（Ｓ１０５）、検出処理と同じマッチ
ング処理方法で全ての原データ特徴信号をマッチング処
理して類似度計算を行い（Ｓ１０６）、全てのしきい値
について処理を繰り返し（Ｓ１０７）、原データとしき
い値による原データ類似度テーブルとして原データ類似
度格納部１４７に格納する（Ｓ１０８）。次に原データ
の電子透かし情報を入力し（Ｓ１０９）、電子透かし格
納部１４２に格納する（Ｓ１１０）。この処理を対象と
する全ての原データについて繰り返し（Ｓ１１１）、必
要な原データ関連資料を完成する（Ｓ１１２）。When the process is started (S101), the original data is input and stored in the original data storage unit 141 (S10).
2) The input original data is converted into a characteristic signal (S10).
3) and store it in the original data feature signal storage unit 145 (S10)
4) The same threshold value as the plurality of threshold values used in the matching process is set as a clustering threshold value, and all the original data are processed using the respective threshold values (S105) by the same matching processing method as the detection process. The similarity calculation is performed by matching the characteristic signals (S106), and the process is repeated for all thresholds (S107), and stored in the original data similarity storage unit 147 as an original data similarity table based on the original data and the thresholds. (S108). Next, digital watermark information of the original data is input (S109) and stored in the digital watermark storage unit 142 (S110). This processing is repeated for all the original data (S111), and necessary original data-related materials are completed (S112).

【００３７】次に放送信号入力工程について説明する。
処理を開始すると（Ｓ２０１）、放送信号を入力し（Ｓ
２０２）、入力した放送信号を放送信号格納部１４３に
格納し（Ｓ２０３）、入力した放送信号を特徴信号化処
理して放送特徴信号を生成し（Ｓ２０４）、放送特徴信
号格納部１４６に格納し（Ｓ２０５）、入力した放送信
号のタイムスケジュールを入力し（Ｓ２０６）、タイム
スケジュール格納部１４４に格納し（Ｓ２０７）、所望
の範囲を入力して（Ｓ２０８）、必要な放送信号関連資
料を完成する（Ｓ２０９）。ステップ２０６とステップ
２０７は省略する場合もある。Next, the broadcast signal input step will be described.
When the process starts (S201), a broadcast signal is input (S201).
202), the input broadcast signal is stored in the broadcast signal storage unit 143 (S203), the input broadcast signal is converted into a characteristic signal to generate a broadcast characteristic signal (S204), and stored in the broadcast characteristic signal storage unit 146. (S205), a time schedule of the input broadcast signal is input (S206), stored in the time schedule storage unit 144 (S207), and a desired range is input (S208) to complete necessary broadcast signal related materials. (S209). Steps 206 and 207 may be omitted in some cases.

【００３８】次に、時系列データ検出方法の検出工程に
ついて説明する。処理を開始すると（Ｓ３０１）、検出
の対象とする放送信号を選択し（Ｓ３０２）、対象とす
る放送信号の放送特徴信号を放送特徴信号格納部１４６
から読み出し（Ｓ３０３）、タイムスケジュ−ル格納部
１４４から検出の対象とする放送信号のタイムスケジュ
ールを読み出す（Ｓ３０４）。Next, the detection process of the time-series data detection method will be described. When the process is started (S301), a broadcast signal to be detected is selected (S302), and the broadcast characteristic signal of the target broadcast signal is stored in the broadcast characteristic signal storage unit 146.
(S303), and reads the time schedule of the broadcast signal to be detected from the time schedule storage section 144 (S304).

【００３９】次に検出すべき原データを選択し（Ｓ３０
５）、原データ特徴信号格納部１４５から選択した原デ
ータの原データ特徴信号を読み出し（Ｓ３０６）、電子
すかし格納部１４２から選択した原データの電子透かし
を読み出し（Ｓ３０７）、原データ類似度格納部１４７
から選択した原データの類似度テーブルを読み出し（Ｓ
３０８）、タイムスケジュ−ルから放送信号に含まれる
予定の原データ数を算定する（Ｓ３０９）。Next, original data to be detected is selected (S30).
5) Read the original data characteristic signal of the selected original data from the original data characteristic signal storage unit 145 (S306), read the digital watermark of the selected original data from the digital watermark storage unit 142 (S307), and determine the similarity of the original data. Storage unit 147
Read the similarity table of the original data selected from (S
308), the number of original data to be included in the broadcast signal is calculated from the time schedule (S309).

【００４０】次に複数のマッチングしきい値から所定の
マッチングしきい値を選択し（Ｓ３１０）、そのマッチ
ングしきい値で放送特徴信号と原データ特徴信号とのマ
ッチング処理を行い、マッチング数と位置とを検出する
（Ｓ３１１）。この場合できるだけ処理速度が向上した
マッチング処理方法を用いる。また、最初に選択するマ
ッチングしきい値は最も高いしきい値から始めることが
望ましいが、次回以降は前回の最終選択しきい値を用い
ることが望ましい。検出された原データ数がタイムスケ
ジュ−ルから算定された放送信号に含まれる原データ予
定数より少なければ（Ｓ３１２Ｎ）、ステップ３１０に
戻ってマッチングしきい値を低い値に変更してマッチン
グ処理を繰り返し、検出された原データ数がタイムスケ
ジュ−ルから算定された放送信号に含まれる原データ予
定数以上となれば（Ｓ３１２Ｙ）、ステップ３１３に進
む。Next, a predetermined matching threshold value is selected from a plurality of matching threshold values (S310), and a matching process between the broadcast feature signal and the original data feature signal is performed using the matching threshold value. Is detected (S311). In this case, a matching processing method whose processing speed is improved as much as possible is used. Also, it is preferable that the matching threshold value selected first starts from the highest threshold value, but it is preferable to use the last final selection threshold value from the next time onward. If the detected number of original data is smaller than the expected number of original data included in the broadcast signal calculated from the time schedule (S312N), the process returns to step 310 to change the matching threshold value to a lower value and perform the matching process. If the detected number of original data is equal to or larger than the expected number of original data included in the broadcast signal calculated from the time schedule (S312Y), the process proceeds to step 313.

【００４１】ステップ３１３ではステップ３０８で読み
出した原データの類似度テーブルから最終のマッチング
しきい値に対応するクラスタ内の類似データ数を読み取
り、類似データがクラスタ内になければ（３１３Ｙ）、
ステップ３１８に進んで検出されたマッチング数と位置
とをマッチングデータとして出力する（Ｓ３１８）。類
似データがあれば（３１３Ｎ）、放送信号格納部１４３
から放送信号を読み出し（Ｓ３１４）、検出された放送
信号内の原データの位置をマークし（Ｓ３１５）、予定
位置に埋め込まれた電子透かしを読み出し（Ｓ３１
６）、ステップ３０７で読み出した原データの電子透か
しと照合し、一致した位置に原データがあると同定して
（Ｓ３１７）、同定した原データ数とデータ位置とをマ
ッチングデータとして出力する（Ｓ３１８）。In step 313, the number of similar data in the cluster corresponding to the final matching threshold is read from the similarity table of the original data read in step 308, and if the similar data is not in the cluster (313Y),
Proceeding to step 318, the detected matching number and position are output as matching data (S318). If there is similar data (313N), the broadcast signal storage unit 143
(S314), marks the position of the original data in the detected broadcast signal (S315), and reads the digital watermark embedded in the scheduled position (S31).
6), collate with the digital watermark of the original data read in step 307, identify that there is original data at the matched position (S317), and output the identified number of original data and data position as matching data (S318). ).

【００４２】他の原データについても検出を継続するの
であればステップＳ３０５に戻って処理を繰り替えし
（Ｓ３１９Ｙ）、全ての原データの処理が終了すれば
（Ｓ３１９Ｎ）、ステップＳ３０２に戻って他の放送信
号の検出に移り（Ｓ３２０Ｙ）、全ての放送信号の検出
が終われば（Ｓ３２０Ｎ）、処理を終了する（Ｓ３２
１）。If detection of other original data is to be continued, the flow returns to step S305 to repeat the processing (S319Y). If processing of all the original data is completed (S319N), the flow returns to step S302 to return to other processing. The process shifts to the detection of the broadcast signal (S320Y), and when the detection of all the broadcast signals is completed (S320N), the process ends (S32).
1).

【００４３】本発明の時系列データ検出方法では、１）原データに対して一意なＩＤを電子透かし情報とし
て埋め込み、放送信号中のすべての原データを電子透か
し情報により識別可能な状態にし、２）原データの集合に対して、すべての原データ間の類
似度を計算し、この類似度の計算の際には、マッチング
の処理で用いられているのと同一の手法を用い、計算
後、類似度がクラスタリングしきい値以上のデータにつ
いては、類似データとみなし、３）基本的な検索は、類似度をしきい値とするマッチン
グ手法により行い、類似データが存在する原データにつ
いてのみ電子透かしを用いた検証をすることとし、４）マッチングの処理には処理速度が向上したものを用
い、例えば、特願平１０−１５１７２３号に記載のアク
ティブ検索方法を用い、５）しきい値は処理内容に応じ適応的に変更可能とし、
適応的な変更とは、例えば、検出すべき時系列データ数
があらかじめわかっている場合には、その検出予定数以
上の数の検出があるまでしきい値を下げる方法であり、６）また、検出を繰り返すことにより蓄積されたしきい
値の変更データを分析し、その結果を用いてしきい値の
調整を半自動的に行い、７）電子透かしの抽出の際には、走査すべき時系列デー
タ中の位置が、マッチング方法により導かれていること
から、容易な検証が可能となる、ことを特徴としてお
り、電子透かしのみを用いた場合の透かし抽出時の冗長
処理を防ぎつつ、マッチングのみを用いた場合には困難
な類似データとの区別を実現するような時系列データ中
のデータ検索が、比較的低コストで高速に実現可能とな
っている。In the time-series data detection method of the present invention, 1) a unique ID is embedded in the original data as digital watermark information so that all the original data in the broadcast signal can be identified by the digital watermark information; ) For the set of original data, calculate the similarity between all the original data, and in calculating this similarity, use the same method as that used in the matching process. Data whose similarity is equal to or higher than the clustering threshold is regarded as similar data. 3) A basic search is performed by a matching method using the similarity as a threshold, and digital watermarking is performed only on original data having similar data. 4) Matching processing with improved processing speed is used. For example, an active search method described in Japanese Patent Application No. 10-151723 is used. 5) The threshold value can be adaptively changed according to the processing content.
The adaptive change is, for example, a method of lowering a threshold value until the number of time-series data to be detected is known in advance if the number of detected time-series data is equal to or larger than the expected number of detections. Analyzing the threshold change data accumulated by repeating the detection, semi-automatically adjusting the threshold using the result, 7) When extracting the digital watermark, the time series to be scanned Since the position in the data is derived by the matching method, it can be easily verified, and it is possible to prevent the redundant processing at the time of extracting the watermark when only the digital watermark is used and to perform the matching only. In the case where is used, data retrieval in time-series data that can be distinguished from similar data, which is difficult, can be realized at relatively low cost and at high speed.

【００４４】次に、本発明の時系列データの検出方法の
具体的な実施例を、比較的高い効果で適用できると考え
られる２つの例について説明する。第１の例として特定
のコマーシャル（ＣＭ）のようなあらかじめ出現する原
データの予定数が分かっている放送信号から、特定の信
号部分であるＣＭを検索する場合について、第２の例と
して放送に用いられる可能性のあるオリジナルの音楽デ
ータなどの原データの出現する可能性が不明な放送信号
から、特定の信号であるオリジナルの音楽データなどを
検索する場合について説明する。図４は放送信号から特
定のＣＭを検索する流れを示す模式図であり、図５は放
送信号から音楽の原データを検索する流れを示す模式図
である。Next, two examples in which the specific embodiment of the method for detecting time-series data of the present invention can be applied with a relatively high effect will be described. As a first example, a case where a CM as a specific signal portion is searched from a broadcast signal such as a specific commercial (CM) in which the expected number of original data appearing in advance is known is described as a second example. A case will be described in which a specific signal, such as original music data, is searched for from a broadcast signal in which the possibility of occurrence of original data, such as original music data, which may be used, is unknown. FIG. 4 is a schematic diagram showing a flow of searching for a specific CM from a broadcast signal, and FIG. 5 is a schematic diagram showing a flow of searching for original music data from the broadcast signal.

【００４５】先ず第１の実施例について図１、図２、図
３、図４を参照して説明する。特定のコマーシャル（Ｃ
Ｍ）のようなあらかじめ出現する原データの予定数が分
かっている放送信号から、特定の信号部分であるＣＭを
検索する場合には、まず、ＣＭの原データ（放送に用い
られるオリジナルのデータ）は、あらかじめ原データ入
力部１１１を経て原データ格納部１４１へと記録され
る。ここで原データ格納部１４１に記録されるＣＭの原
データは、放送される可能性のあるＣＭすべてとする。First, the first embodiment will be described with reference to FIGS. 1, 2, 3 and 4. Certain commercials (C
When searching for a CM which is a specific signal portion from a broadcast signal such as M) in which the expected number of appearing original data is known, first, CM original data (original data used for broadcasting) Is recorded in the original data storage unit 141 via the original data input unit 111 in advance. Here, the original data of the CM recorded in the original data storage unit 141 is all CMs that may be broadcast.

【００４６】また、ＣＭの原データに対して埋め込まれ
ている電子透かし情報は、その埋め込み先とともに電子
透かし入力部１１２を経て、電子透かし格納部１４２へ
と記録される。The digital watermark information embedded in the original data of the CM is recorded in the digital watermark storage unit 142 via the digital watermark input unit 112 together with the embedding destination.

【００４７】そして、検索される放送電波は放送信号入
力部１１３である受信機により受信され、その放送内容
が放送信号格納部１４３へと記録される。Then, the searched broadcast wave is received by the receiver serving as the broadcast signal input unit 113, and the broadcast contents are recorded in the broadcast signal storage unit 143.

【００４８】また、ＣＭが放送される予定スケジュール
はタイムスケジュール入力部１１４を経て、タイムスケ
ジュール格納部１４４へと記録される。The schedule for broadcasting the CM is recorded in the time schedule storage section 144 via the time schedule input section 114.

【００４９】次に、記録されたＣＭの原データと放送内
容とは特徴信号化処理部１３１により特徴信号化され、
それぞれ、原データ特徴信号格納部１４５、放送特徴信
号格納部１４６へと記録される。ここで、ＣＭの原デー
タについては、原データ特徴信号を用いてＣＭの原デー
タ間の類似度を計算し、クラスタリングしきい値に基づ
き、似たような特徴をもつＣＭの原データが同一クラス
タになるようにクラスタリング処理を行い、そのクラス
タ情報とともに原データ類似度格納部１４７へと記録さ
れる。Next, the recorded CM original data and broadcast contents are converted into a characteristic signal by the characteristic signal processing section 131,
These are recorded in the original data characteristic signal storage unit 145 and the broadcast characteristic signal storage unit 146, respectively. Here, regarding the original data of the CM, the similarity between the original data of the CM is calculated using the original data feature signal, and based on the clustering threshold, the original data of the CM having similar characteristics are in the same cluster. Is performed, and is recorded in the original data similarity storage unit 147 together with the cluster information.

【００５０】ただし、ここでのクラスタリングしきい値
は、後のマッチングで用いるマッチングしきい値と同じ
値とし、マッチングしきい値を複数個用いる場合には、
それぞれに応じたクラスタリングを行うこととし、複数
個のクラスタリング情報を記録する。However, the clustering threshold here is set to the same value as the matching threshold used in the subsequent matching, and when a plurality of matching thresholds are used,
Clustering according to each is performed, and a plurality of pieces of clustering information are recorded.

【００５１】マッチング処理部１３３では、あらかじめ
選択されたマッチングしきい値を用いてＣＭの特徴信号
と放送内容の特徴信号との間のマッチング処理が施され
る。この処理後、ＣＭの原データのタイムスケジュール
上の出現予定回数と、マッチング処理によって検出され
た原データの出現回数とを比較し、マッチング処理結果
の出現回数がタイムスケジュール上の出現予定回数以上
であった場合には、その出現回数と位置によりＣＭの原
データの特定フェーズへ移る。The matching processing section 133 performs a matching process between the CM characteristic signal and the broadcast content characteristic signal by using a previously selected matching threshold value. After this processing, the number of appearances of the original data of the CM on the time schedule is compared with the number of appearances of the original data detected by the matching processing. If there is, the process proceeds to a specific phase of the original data of the CM according to the number of appearances and the position.

【００５２】マッチング処理結果の出現回数がタイムス
ケジュール上の出現予定回数を下回った場合には、マッ
チング処理部１３３におけるマッチングしきい値を低下
させ、再度マッチング処理を施す。この処理は、マッチ
ング処理結果における出現回数がタイムスケジュール上
の出現予定回数以上となるまで続けられ、タイムスケジ
ュール上の出現予定回数以上となったところで、その出
現回数と位置によりＣＭの原データの特定フェーズへ移
る。When the number of appearances of the matching processing result is less than the expected number of appearances in the time schedule, the matching threshold in the matching processing unit 133 is reduced, and the matching processing is performed again. This process is continued until the number of appearances in the result of the matching process becomes equal to or more than the number of appearances on the time schedule. When the number of occurrences exceeds the number of occurrences on the time schedule, the original data of the CM is identified by the number of occurrences and the position. Move to phase.

【００５３】ＣＭの原データの特定フェーズでは、原デ
ータ類似度格納部１４７に記録されているクラスタリン
グ情報をもとに、最終のマッチングしきい値におけるク
ラスタ内の原データの類似データの有無を検索する。調
べたいＣＭの原データと同一クラス内に存在するＣＭの
原データが、調べたいＣＭの原データのみであった場合
には電子透かし情報の抽出は行わないで、検索結果であ
る原データ数とデータ位置を最終結果とする。In the specific phase of the original data of the CM, the presence or absence of similar data of the original data in the cluster at the final matching threshold value is searched based on the clustering information recorded in the original data similarity storage unit 147. I do. If the original data of the CM existing in the same class as the original data of the CM to be examined is only the original data of the CM to be examined, the electronic watermark information is not extracted, and the number of the original data as the retrieval result is reduced. The data position is the final result.

【００５４】類似のデータがクラスタ内に存在する場合
は、放送信号格納部１４３から放送信号を読み出し、検
索結果である原データの位置に埋め込まれた電子透かし
を読み出し、電子透かし格納部１４２に格納された原デ
ータの電子透かしを読み出して照合し、同一であった位
置に原データが存在するとして、その原データ数とデー
タ位置を同定して最終結果とする。If similar data exists in the cluster, the broadcast signal is read from the broadcast signal storage unit 143, the digital watermark embedded at the position of the original data as the search result is read, and stored in the digital watermark storage unit 142. The digital watermark of the obtained original data is read out and collated, and assuming that the original data exists at the same position, the number of the original data and the data position are identified to obtain the final result.

【００５５】特定されたＣＭの原データについては、放
送信号中にＣＭの原データが存在したと判定され、その
位置が報告される。With respect to the specified original data of the CM, it is determined that the original data of the CM is present in the broadcast signal, and its position is reported.

【００５６】以上の処理を検索したいＣＭの原データご
とに行い、その処理結果を統合し、タイムスケジュール
との比較を行うことにより、ＣＭがタイムスケジュール
に従って適切に放送されたか否かを判断する。The above processing is performed for each original data of the CM to be searched, the processing results are integrated, and the result is compared with the time schedule to determine whether the CM has been properly broadcast according to the time schedule.

【００５７】次に第２の実施例について図１、図２、図
３、図５を参照して説明する。放送に用いられる可能性
のあるオリジナルの音楽データなどを、原データの出現
する可能性が不明な放送信号から検索する場合は、ま
ず、音楽の原データ（放送に用いられるオリジナルの音
楽データ）が、あらかじめ原データ入力部１１１を経て
原データ格納部１４１へと記録される。ここで、原デー
タ格納部１４１に記憶される音楽の原データは、使用さ
れる可能性のある音楽すべてとする。Next, a second embodiment will be described with reference to FIG. 1, FIG. 2, FIG. 3, and FIG. When searching for original music data that may be used for broadcasting from a broadcast signal for which the possibility that the original data appears is unknown, first, the original music data (original music data used for broadcasting) Are recorded in the original data storage unit 141 via the original data input unit 111 in advance. Here, the original music data stored in the original data storage unit 141 is all music that may be used.

【００５８】また、音楽の原データに対して埋め込まれ
ている電子透かし情報は、その埋め込み先とともに電子
透かし入力部１１２を経て、電子透かし格納部１４２へ
と記録される。The digital watermark information embedded in the original music data is recorded in the digital watermark storage unit 142 via the digital watermark input unit 112 together with the embedding destination.

【００５９】そして、検索される放送電波は放送信号入
力部１１３である受信機により受信され、その放送内容
は放送信号格納部１４３へと記録される。The broadcast wave to be searched is received by the receiver serving as the broadcast signal input unit 113, and the broadcast content is recorded in the broadcast signal storage unit 143.

【００６０】次に、記録された音楽の原データと放送内
容のデータは特徴信号化処理部１３１により特徴信号化
され、それぞれ、原データ特徴信号格納部１４５、放送
特徴信号格納部１４６へと記録される。Next, the recorded music original data and broadcast content data are converted into characteristic signals by the characteristic signal processing section 131, and are recorded in the original data characteristic signal storage section 145 and the broadcast characteristic signal storage section 146, respectively. Is done.

【００６１】ここで、音楽の原データ特徴信号を用い
て、音楽の原データ間の類似度を計算し、クラスタリン
グしきい値に基づき、似たような特徴を持つ音楽の原デ
ータが同一クラスタになるようにクラスタリング処理を
行い、そのクラスタ情報とともに原データ類似度格納部
１４７へと格納しておく。ただし、ここでのクラスタリ
ングしきい値は、後のマッチング処理で用いるマッチン
グしきい値と同じ値とする。Here, the similarity between the original music data is calculated using the original music data characteristic signal, and based on the clustering threshold, the original music data having similar characteristics belong to the same cluster. A clustering process is performed so that the original data similarity storage unit 147 stores the cluster information together with the cluster information. However, the clustering threshold here is the same value as the matching threshold used in the subsequent matching processing.

【００６２】マッチング処理部１３３では、所定のマッ
チングしきい値を用いて音楽の原データ特徴信号と放送
内容の特徴信号との間のマッチング処理が施され、その
出現回数と位置により音楽の原データの特定フェーズへ
移る。第１の実施例ではタイムスケジュールの出現予定
数とマッチング処理の出現数との対比を行ったが、第２
の実施例ではこの対比は行わず、所定のマッチングしき
い値によるマッチング結果により音楽の原データの出現
回数と位置を検索する。The matching processing unit 133 performs a matching process between the original music data characteristic signal and the broadcast content characteristic signal using a predetermined matching threshold value. Move to a specific phase. In the first embodiment, the number of appearances of the time schedule is compared with the number of occurrences of the matching process.
In this embodiment, this comparison is not performed, and the number of appearances and the position of the original music data are searched based on the matching result based on a predetermined matching threshold.

【００６３】音楽の原データの特定フェーズでは、原デ
ータ類似度格納部１４７に記録されているクラスタリン
グ情報をもとに、マッチングしきい値におけるクラスタ
内の原データの類似データの有無を検索する。調べたい
音楽の原データと同一クラス内に存在する音楽の原デー
タが、調べたい音楽の原データのみであった場合には電
子透かし情報の抽出は行わないで、検索結果である原デ
ータ数とデータ位置を最終結果とする。In the specific phase of the original music data, based on the clustering information recorded in the original data similarity storage section 147, the presence or absence of similar data of the original data in the cluster at the matching threshold is searched. If the original data of the music existing in the same class as the original data of the music to be examined is only the original data of the music to be examined, the digital watermark information is not extracted, and the number of the original data as the search result is The data position is the final result.

【００６４】類似の音楽のデータがクラスタ内に存在す
る場合は、放送信号格納部１４３から放送信号を読み出
し、検索結果である原データの位置に埋め込まれた電子
透かしを読み出し、電子透かし格納部１４２に格納され
た原データを読み出してその電子透かしと照合し、同一
であった位置に原データが存在するとして、その原デー
タ数とデータ位置を同定して最終結果とする。以上の処
理を、調べたい音楽のすべての原データについて行う。If similar music data exists in the cluster, the broadcast signal is read from the broadcast signal storage unit 143, the digital watermark embedded in the position of the original data as the search result is read, and the digital watermark storage unit 142 is read. Is read out and compared with the digital watermark, and assuming that the original data exists at the same position, the number of the original data and the data position are identified to obtain the final result. The above processing is performed for all original data of music to be examined.

【００６５】次に、本発明の第２の実施の形態の時系列
データの検出方法と時系列データの検出装置について図
面を参照して説明する。図６は本発明の第２の実施の形
態の時系列データの検出装置の模式的ブロック構成図で
ある。Next, a method for detecting time-series data and a device for detecting time-series data according to a second embodiment of the present invention will be described with reference to the drawings. FIG. 6 is a schematic block configuration diagram of a time-series data detection device according to the second embodiment of this invention.

【００６６】図６は、本発明の時系列データ検出装置２
００を、装置を構成するコンピュータとして示したもの
であり、コンピュータはモデム、キーボード、ポインテ
ィングデバイス等の入力装置２１０、モデム、プリン
タ、ディスプレイ等の出力装置２２０、データ処理装置
２３０、記憶部２４０および記録媒体２５０を備える。
記録媒体２５０には各部の動作を制御できる本発明の時
系列データ検出システム制御プログラムが記録されてお
り、ＦＤ，ＣＤ−ＲＯＭ、半導体メモリ等が用いられ
る。FIG. 6 shows a time-series data detecting device 2 according to the present invention.
00 is shown as a computer constituting the apparatus, and the computer is an input device 210 such as a modem, a keyboard, a pointing device, etc .; an output device 220 such as a modem, a printer, a display; a data processing device 230; A medium 250 is provided.
The recording medium 250 stores a time-series data detection system control program of the present invention capable of controlling the operation of each unit, and uses an FD, a CD-ROM, a semiconductor memory, or the like.

【００６７】時系列データ検出装置の構成や時系列デー
タ検出方法は第１の実施の形態と同じなので説明を省略
する。The configuration of the time-series data detecting device and the method for detecting the time-series data are the same as those in the first embodiment, and therefore the description thereof is omitted.

【００６８】入力された放送信号に含まれる時系列デー
タを検索する時系列データ検出システムの制御プログラ
ムは、記録媒体２５０からデータ処理装置２３０に読み
込まれデータ処理装置２３０の動作を制御する。データ
処理装置２３０は制御プログラムの制御により以下の処
理を実行する。The control program of the time series data detection system for searching for the time series data contained in the input broadcast signal is read from the recording medium 250 into the data processing device 230 and controls the operation of the data processing device 230. The data processing device 230 executes the following processing under the control of the control program.

【００６９】即ち、検索しようとする時系列データ（原
データ）を入力し特徴信号化処理して原データ特徴信号
を格納し、しきい値を用いて全ての原データ特徴信号を
マッチング処理して類似度計算を行って原データ類似度
テーブルとして格納し、原データの電子透かし情報を入
力して格納し、必要な原データ関連資料を完成する処理
と、放送信号を入力し特徴信号化処理して放送特徴信号
を格納し、放送信号のタイムスケジュールを入力して格
納し、必要な放送信号関連資料を完成する処理と、検索
の対象とする放送信号を選択し、放送信号の放送特徴信
号を読み出し、タイムスケジュールを読み出し、検索す
べき原データを選択し、原データ特徴信号を読み出し、
電子透かしを読み出し、原データの類似度テーブルを読
み出し、タイムスケジュ−ルから放送信号に含まれる予
定の原データ数を算定する処理と、マッチングしきい値
を用いて放送特徴信号と原データ特徴信号との間のマッ
チング処理を行い、検出された原データ数がタイムスケ
ジュ−ルから算定された放送信号に含まれる予定の原デ
ータ数より少なければ、検出された原データ数がタイム
スケジュ−ルから算定された放送信号に含まれる予定の
原データ数以上となるまでしきい値を低い値に変更して
マッチング処理を繰り返す処理と、マッチング処理を行
った結果としきい値のデータとを蓄積し分析すること
で、マッチング処理に対応したマッチングしきい値を適
切に設定し、入力データ内の原データのデータ数とデー
タ位置とを検出する処理と、最終のマッチングしきい値
に対応するクラスタ内に類似データがなければ検出され
た原データ数とデータ位置とをマッチングデータとして
出力し、類似データがあれば、放送信号内の原データの
検出された位置に埋め込まれた電子透かしを読み出して
原データの電子透かしと照合し、一致した位置に原デー
タがあると同定して原データ数とデータ位置とをマッチ
ングデータとして出力する処理と、を実行する。That is, the time-series data (original data) to be searched is input, converted into a characteristic signal, the original data characteristic signal is stored, and all the original data characteristic signals are matched using the threshold value. Perform similarity calculation and store as original data similarity table, input and store digital watermark information of original data, complete processing of necessary original data related materials, input broadcast signal and perform feature signal processing The broadcast feature signal is stored, the time schedule of the broadcast signal is input and stored, the process of completing the necessary broadcast signal related materials is performed, the broadcast signal to be searched is selected, and the broadcast feature signal of the broadcast signal is selected. Read, read the time schedule, select the original data to be searched, read the original data feature signal,
Processing for reading the digital watermark, reading the similarity table of the original data, calculating the number of original data to be included in the broadcast signal from the time schedule, and using the matching threshold value for the broadcast characteristic signal and the original data characteristic signal And if the number of detected original data is less than the number of original data to be included in the broadcast signal calculated from the time schedule, the detected number of original data is calculated from the time schedule. A process in which the threshold value is changed to a low value until the number of original data to be included in the calculated broadcast signal is equal to or more than a predetermined value, and the matching process is repeated. By doing so, the matching threshold value corresponding to the matching process is appropriately set, and the number of data and the data position of the original data in the input data are detected. If there is no similar data in the cluster corresponding to the final matching threshold, the number of detected original data and the data position are output as matching data, and if there is similar data, the original data in the broadcast signal is output. A process of reading out the digital watermark embedded at the detected position and comparing it with the digital watermark of the original data, identifying that there is original data at the matched position, and outputting the number of original data and the data position as matching data; Execute

【００７０】[0070]

【発明の効果】以上説明したように本発明は、時系列デ
ータのすべての領域に対して、迅速にかつ確実に検出対
象とする時系列データを検出できるという効果がある。
これは、マッチング処理のマッチングしきい値と対応さ
せて全ての原データ間の類似度を計算し、入力データと
原データとのマッチング処理のマッチングしきい値レベ
ルで類似データがあれば検出された入力データ内の原デ
ータ位置のみから読み出された電子透かしと原データの
電子透かしとの照合を行ってマッチングデータを確定さ
せるので、マッチング処理における不確実性を補い、電
子透かし処理における処理時間の短縮を行うことができ
るからである。As described above, the present invention has an effect that time-series data to be detected can be quickly and reliably detected in all regions of the time-series data.
That is, the similarity between all the original data is calculated in correspondence with the matching threshold value of the matching process, and if there is similar data at the matching threshold level of the matching process between the input data and the original data, it is detected. The matching between the digital watermark read from only the original data position in the input data and the digital watermark of the original data is performed to determine the matching data, thereby compensating for the uncertainty in the matching process and reducing the processing time in the digital watermarking process. This is because shortening can be performed.

[Brief description of the drawings]

【図１】図１は本発明の第１の実施の形態の時系列デー
タ検出装置のブロック構成図である。FIG. 1 is a block diagram of a time-series data detection device according to a first embodiment of the present invention.

【図２】本発明の第１の実施の形態の時系列データ検出
方法の準備段階のフローチャートである。（ａ）は原デ
ータ関連資料蓄積工程のフローチャートである。（ｂ）
は放送信号入力工程のフローチャートである。FIG. 2 is a flowchart of a preparation stage of the time-series data detection method according to the first embodiment of the present invention. (A) is a flowchart of an original data related material accumulation process. (B)
Is a flowchart of a broadcast signal input step.

【図３】本発明の第１の実施の形態の時系列データ検出
方法の検出工程のフローチャートである。FIG. 3 is a flowchart of a detection process of the time-series data detection method according to the first embodiment of the present invention.

【図４】放送信号から特定のＣＭを検索する流れを示す
模式図である。FIG. 4 is a schematic diagram showing a flow of searching for a specific CM from a broadcast signal.

【図５】放送信号から音楽の原データを検索する流れを
示す模式図である。FIG. 5 is a schematic diagram showing a flow of searching for original music data from a broadcast signal.

【図６】本発明の第２の実施の形態の時系列データの検
出装置の模式的ブロック構成図である。FIG. 6 is a schematic block diagram of a time-series data detection device according to a second embodiment of the present invention.

【図７】ＣＭ出稿確認を例としたマッチングによる検出
方法の問題点を説明するための模式図である。（ａ）は
検出前の状態を示す。（ｂ）は検出後の状態を示す。FIG. 7 is a schematic diagram for explaining a problem of a detection method by matching in which a CM submission confirmation is used as an example. (A) shows a state before detection. (B) shows the state after detection.

[Explanation of symbols]

７１入力データの集合７２検出すべきＣＭの集合７３マッチングにより検出されたＣＭ７４正しく検出されたＣＭ７５検出されなかったＣＭ７６誤って検出されたＣＭ１００、２００時系列データ検出装置１１１、２１１原データ入力部１１２、２１２電子透かし入力部１１３、２１３放送信号入力部１１４、２１４タイムスケジュール入力部１２１、２２１判定結果出力部１３１、２３１特徴信号化処理部１３２、２３２類似度計算部１３３、２３３マッチング処理部１３４、２３４出現回数判定部１３５、２３５類似度判定部１３６、２３６透かし抽出／判定部１３８、２３８制御部１４１、２４１原データ格納部１４２、２４２電子透かし格納部１４３、２４３放送信号格納部１４４、２４４タイムスケジュール格納部１４５、２４５原データ特徴信号格納部１４６、２４６放送特徴信号格納部１４７、２４７原データ類似度格納部２５０記録媒体Ｓ１０１〜Ｓ１１２、Ｓ２０１〜Ｓ２０９、Ｓ３０１〜
Ｓ３２１ステップ71 Set of input data 72 Set of CMs to be detected 73 CM detected by matching 74 CM detected correctly 75 CM not detected 76 CM detected erroneously 100, 200 Time-series data detection device 111, 211 Original Data input unit 112, 212 Digital watermark input unit 113, 213 Broadcast signal input unit 114, 214 Time schedule input unit 121, 221 Judgment result output unit 131, 231 Feature signal processing unit 132, 232 Similarity calculation unit 133, 233 Matching Processing units 134, 234 Appearance number determination unit 135, 235 Similarity determination unit 136, 236 Watermark extraction / determination unit 138, 238 Control unit 141, 241 Original data storage unit 142, 242 Digital watermark storage unit 143, 243 Broadcast signal storage unit 144, 244 Times Joule storage unit 145, 245 the original data, wherein the signal storage unit 146, 246 broadcast feature signal storage unit 147,247 raw data similarity storage unit 250 recording medium S101~S112, S201~S209, S301~
S321 Step

───────────────────────────────────────────────────── フロントページの続き (72)発明者土田尚純東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内 (72)発明者平野泰宏東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内Ｆターム(参考） 5B075 ND12 NK06 NK39 PQ02 PR06 QM01 QM08 5C063 AB03 AB07 CA23 CA34 DA07 5L096 BA18 DA01 GA51 HA08 JA03 JA11 MA07 9A001 EE03 FF03 HH35 KK43 KK60 LL03 ──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Naozumi Tsuchida 3-19-2 Nishi-Shinjuku, Shinjuku-ku, Tokyo Japan Telegraph and Telephone Corporation (72) Inventor Yasuhiro Hirano 3-192-1, Nishi-Shinjuku, Shinjuku-ku, Tokyo No. Nippon Telegraph and Telephone Corporation F term (reference) 5B075 ND12 NK06 NK39 PQ02 PR06 QM01 QM08 5C063 AB03 AB07 CA23 CA34 DA07 5L096 BA18 DA01 GA51 HA08 JA03 JA11 MA07 9A001 EE03 FF03 HH35 KK43 KK60 KK03

Claims

[Claims]

1. The time series data to be searched (hereinafter referred to as
A method for detecting time-series data, which detects original data) from input time-series data (hereinafter referred to as input data), wherein the original data and the input data can be uniquely identified. The original data to be searched is input, the feature signal is processed and stored as an original data feature signal, and all the original data feature signals are determined using a predetermined threshold. Perform matching processing, calculate similarity, store as original data similarity table, input and store digital watermark information of the original data, complete necessary original data related materials, and input the input data A feature signal is processed and stored as an input data feature signal to complete necessary input data related data. A predetermined operation is performed between the input data feature signal and the original data feature signal. Performing a matching process using a threshold to detect the number of data and the data position of the original data in the input data; and when there is similar data in a cluster corresponding to the threshold in the original data similarity table. Reads out the digital watermark embedded in the detected original data position in the input data, compares it with the digital watermark information of the original data, eliminates similar data, and removes the original data in the input data. A method for detecting time-series data, wherein the number of data and the data position are determined.

2. A time schedule indicating a planned positional relationship of the original data in the input data is stored in the input data-related material, and the number of original data detected in the matching process is calculated from the time schedule. If the number of original data to be included is smaller than the number of original data to be included in the input data, the matching is performed until the number of detected original data is equal to or greater than the number of original data to be included in the input data calculated from the time schedule. 2. The time-series data detection method according to claim 1, wherein a threshold value is changed to a low value, and a matching process is repeatedly performed to detect a data number and a data position of the original data in the input data.

Further, by accumulating and analyzing the result of the matching processing and the data of the threshold value,
The method of detecting time-series data according to claim 1, wherein the matching threshold value corresponding to the matching process is appropriately set, and the number of data and the data position of the original data in the input data are detected.

4. A time-series data detecting device for detecting time-series data (original data) to be searched from input time-series data (input data), comprising: an original data input unit, an original data storage unit, Watermark input unit and digital watermark storage unit, input data input unit and input data storage unit, time schedule input unit and time schedule storage unit, feature signal processing unit, original data feature signal storage unit, input data feature signal storage unit, and similar A degree calculation unit, an original data similarity storage unit, a matching processing unit, an appearance number determination unit, a similarity determination unit, a watermark extraction / determination unit, a determination result output unit, and a control unit. Inputting data, storing the data in the original data storage unit, and inputting the digital watermark data embedded in the original data into the digital watermark data input unit, and embedding the digital watermark data The input data input unit inputs the input data and stores it in the input data storage unit, and the time schedule input unit inputs the time schedule of the input data. The feature signal processing unit performs feature signal processing on the original data to generate an original data feature signal, stores the original data feature signal in the original data feature signal storage unit, and stores the input data. The input data feature signal is generated by performing a feature signal conversion process on the input data feature signal storage unit, and stored in the input data feature signal storage unit. Calculate the degree of similarity between them, and perform clustering processing so that the original data with similar characteristics are in the same cluster. The original data characteristic signal and the input data characteristic signal are stored in the original data similarity storage unit using a predetermined threshold, and the matching processing unit performs a matching process. The scheduled number of appearances on the time schedule of the original data is compared with the number of appearances of the original data by the matching process, and the threshold value of the matching processing unit is changed as necessary. The clustering information of the original data corresponding to the finally selected threshold value is read, and whether there is only one original data in the cluster is determined.
It is determined whether or not similar data exists, and the result is sent to the watermark extraction determining unit. In the watermark extraction determining unit, if there is only one original data, the matching processing result of the matching processing unit is output to the determination result output unit. And if similar data exists, read out the digital watermark of the original data, read out and verify the digital watermark embedded in the position of the original data as a result of the matching processing of the input data, Outputting the number of times of data matching and position information to the judgment result output unit, the judgment result output unit outputting the judgment result to the outside, and the control unit controlling and executing the processing of each unit. Sequence data detection device.

5. The time-series data detection device according to claim 4, further comprising a recording medium, wherein the operation of the control unit can be controlled by a time-series data detection system control program recorded on the recording medium.

6. A recording medium on which a control program for detecting time-series data (original data) to be searched from input time-series data (input data) is recorded, wherein the original data to be searched is recorded. Is input, processed as a feature signal, stored as an original data feature signal, and all original data feature signals are subjected to a matching process using a predetermined threshold to calculate similarity, and stored as an original data similarity table. Inputting and storing the digital watermark information of the original data; and inputting the input data, converting the input data into a characteristic signal, storing the input data as a characteristic signal, and the expected positional relationship of the original data in the input data. And storing a time schedule indicating the following, and performing a matching process between the input data feature signal and the original data feature signal using a predetermined threshold value. Performing the detection of the number of data and the data position of the original data in the input data, and the number of the original data detected in the matching process is to be included in the input data calculated from the time schedule. If the number of original data is less than the number of original data, the matching threshold is changed to a lower value until the number of detected original data is equal to or more than the number of original data to be included in the input data calculated from the time schedule. A procedure of repeatedly performing a matching process to detect a data number and a data position of the original data in the input data, and accumulating and analyzing a result of the matching process and data of the threshold value, The matching threshold value corresponding to the matching process is appropriately set, and the number of data and the data position of the original data in the input data are detected. If there is similar data in the cluster corresponding to the threshold value of the original data similarity table, the electronic watermark embedded in the detected original data position in the input data is read and the original data is read. A step of determining the number of data and the data position of the original data in the input data by excluding similar data by collating with the digital watermark information, and a machine-readable recording recording a program for executing Medium.