JP2010224481A

JP2010224481A - Device for detection of similar section

Info

Publication number: JP2010224481A
Application number: JP2009074539A
Authority: JP
Inventors: Yoshihiro Omori; 善啓大盛
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2009-03-25
Filing date: 2009-03-25
Publication date: 2010-10-07

Abstract

PROBLEM TO BE SOLVED: To provide a similar section detection technology for searching a similar position at high speed, and for suppressing excessive detection or short of search. SOLUTION: A feature value extraction unit 122 extracts a feature value string X from an image signal Vx for which a first input receiving unit 120 receives input, and extracts a feature value string Y from an image signal Vy for which a second input receiving unit 121 receives input. A similar position detection unit 123 time sequentially collates the feature value string X and the feature value string Y, and detects, as a similar position, a position where their similarity is larger than a search threshold θ stored in a threshold memory 124. A detection result output unit 125 binarizes a detection result r of the similar position detection unit 123 and outputs the binarized result r. A similar position distribution calculation unit 126 calculates a distribution of the number of the similar positions by using the detection result r. A threshold update section 127 updates the search threshold θ stored in a threshold memory 124, when a similar position distribution satisfies a predetermine distribution condition. COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、類似区間検出装置のしきい値の最適化に関する。 The present invention relates to optimization of a threshold value of a similar section detection device.

例えば、異なる日に放送された同じ番組の２つの映像の時系列信号から類似区間を検出することによって、映像にインデックスを付加して視聴を容易にする技術が提案されている（例えば特許文献１参照）。類似区間とは連続する類似位置の集合である。映像などの２つの時系列信号から類似区間を探索することを部分信号探索という。部分信号探索では、時系列信号Ｘ上に探索窓Ｗｘを設定すると共に時系列信号Ｙ上に探索窓Ｗｙを設定し、探索窓Ｗｘ内の信号と探索窓Ｗｙ内の信号とを照合する際に、探索窓Ｗｘと探索窓Ｗｙの間で類似度を算出し、類似度が探索しきい値θを上回る場合に、探索窓Ｗｘ内の信号と探索窓Ｗｙ内の信号とが類似すると判定して、類似区間を探索する方法がある。しかし、このような探索をしらみつぶしに行うと処理量が膨大になる。このため、近年では類似度が探索しきい値θを明らかに下回る位置での照合を省略して、探索を高速化する方法が提案されている（例えば非特許文献１参照）。この手法はＲＩＦＡＳ法と呼ばれる。ＲＩＦＡＳ法では、探索しきい値θが大きいほど高速に探索できるが、探索漏れが増えるというトレードオフの関係がある。 For example, a technique for facilitating viewing by adding an index to a video by detecting a similar section from time series signals of two videos of the same program broadcast on different days has been proposed (for example, Patent Document 1). reference). A similar section is a set of consecutive similar positions. Searching for a similar section from two time-series signals such as video is called partial signal search. In the partial signal search, when the search window Wx is set on the time series signal X and the search window Wy is set on the time series signal Y, the signal in the search window Wx and the signal in the search window Wy are collated. The similarity between the search window Wx and the search window Wy is calculated, and when the similarity exceeds the search threshold θ, it is determined that the signal in the search window Wx is similar to the signal in the search window Wy. There is a method for searching for similar sections. However, if such a search is exhausted, the amount of processing becomes enormous. For this reason, in recent years, a method has been proposed in which collation at a position where the similarity is clearly below the search threshold value θ is omitted to speed up the search (see, for example, Non-Patent Document 1). This method is called the RIFAS method. In the RIFAS method, the search can be performed faster as the search threshold θ is larger, but there is a trade-off relationship that the search omission is increased.

特許第４０２１５４５号公報Japanese Patent No. 40215545

西村拓一，ほか，”アクティブ探索法による時系列データ中の一致区間検出‐参照区間自由時系列アクティブ探索法‐，“信学論Ｄ−ＩＩ，Ｖｏｌ．Ｊ８４−Ｄ−ＩＩ，Ｎｏ．８，ｐｐ．１８２６−１８３７，Ａｕｇ．２００１．Takuichi Nishimura, et al., “Detection of Matching Intervals in Time Series Data by Active Search Method—Reference Section Free Time Series Active Search Method”, “Science Theory D-II, Vol. J84-D-II, No. 8, pp. 1826-1837, Aug. 2001.

探索漏れが起きない最大値に探索しきい値θを設定すれば最も高速に探索ができる。しかし、従来は、探索しきい値θを適切な値に設定することは困難であり、探索しきい値θを経験的に求めていた。このため、過剰検出や探索漏れが発生する恐れがあった。 If the search threshold θ is set to the maximum value at which no search omission occurs, the search can be performed at the highest speed. However, conventionally, it has been difficult to set the search threshold θ to an appropriate value, and the search threshold θ has been determined empirically. For this reason, there has been a risk of excessive detection and search omission.

本発明は、上記に鑑みてなされたものであって、類似位置を高速に探索可能であり、過剰検出や探索漏れを抑制可能な類似区間検出装置を提供することを目的とする。 The present invention has been made in view of the above, and an object of the present invention is to provide a similar section detection apparatus that can search for similar positions at high speed and suppress excessive detection and search omission.

上述した課題を解決し、目的を達成するために、本発明は、類似区間検出装置であって、第１の時系列信号の入力を受け付ける第１入力受付手段と、第２の時系列信号の入力を受け付ける第２入力受付手段と、前記第１の時系列信号から第１の特徴量を抽出し、前記第２の時系列信号から第２の特徴量を抽出する抽出手段と、第１のしきい値を記憶する記憶手段と、前記第１の特徴量及び前記第２の特徴量を時系列に照合して、その類似度が前記第１のしきい値以上の位置である類似位置を検出する検出手段と、前記類似位置の数の分布を算出する算出手段と、前記類似位置の数の分布が所定の分布条件を満たす場合に、検出される類似位置の数が減少するように前記第１のしきい値を更新する更新手段と、前記検出手段、前記算出手段及び前記更新手段を繰り返し動作させることにより、連続する前記類似位置の集合である類似区間を取得する制御手段とを備えることを特徴とする。 In order to solve the above-described problems and achieve the object, the present invention is a similar section detection device, comprising a first input receiving means for receiving an input of a first time-series signal, and a second time-series signal. A second input receiving means for receiving an input; an extracting means for extracting a first feature quantity from the first time-series signal; and extracting a second feature quantity from the second time-series signal; A storage unit for storing a threshold value, the first feature value and the second feature value are collated in time series, and a similarity position whose similarity is equal to or higher than the first threshold value is obtained. Detecting means for detecting, calculating means for calculating the distribution of the number of similar positions, and when the distribution of the number of similar positions satisfies a predetermined distribution condition, the number of detected similar positions is reduced. Updating means for updating the first threshold, the detecting means, the calculating means, and Serial By repeatedly operating the updating means, characterized in that it comprises a control means for obtaining a similarity section said a set of similar contiguous positions.

本発明によれば、類似位置を高速に探索可能であり、過剰検出や探索漏れを抑制可能になる。 According to the present invention, it is possible to search for a similar position at high speed, and it is possible to suppress excessive detection and search omission.

図１は、一実施の形態にかかる類似区間検出装置１００のハードウェア構成を例示する図である。FIG. 1 is a diagram illustrating a hardware configuration of a similar section detection device 100 according to an embodiment. 図２は、類似区間検出装置１００の機能的構成を例示する図である。FIG. 2 is a diagram illustrating a functional configuration of the similar section detection device 100. 図３は、類似区間検出装置１００の行う類似区間検出処理の手順を示すフローチャートである。FIG. 3 is a flowchart illustrating a procedure of similar section detection processing performed by the similar section detection apparatus 100. 図４は、探索しきい値θを最適化する原理を説明するための図である。FIG. 4 is a diagram for explaining the principle of optimizing the search threshold value θ.

以下に添付図面を参照して、この発明にかかる類似区間検出装置の最良な実施の形態を詳細に説明する。 Exemplary embodiments of a similar section detection device according to the present invention will be explained below in detail with reference to the accompanying drawings.

[第１の実施の形態]
（１）構成
図１の通り、類似区間検出装置１００は、各種データや各種プログラムを記憶するＲＯＭ（Read Only Memory）１０４やＲＡＭ（Random Access Memory）１０５等の主記憶部と、各種データや各種プログラムを記憶するＨＤＤ（Hard Disk Drive）やＣＤ（Compact Disk）ドライブ装置等の外部記憶部１０７と、これらを接続するバス１０８とを備えており、通常のコンピュータを利用したハードウェア構成となっている。また、類似区間検出装置１００には、情報を表示する表示部１０３と、ユーザの指示入力を受け付けるキーボードやマウス等の操作部１０２と、外部装置との通信を制御する通信部１０６とが有線又は無線により各々接続される。 [First embodiment]
(1) Configuration As shown in FIG. 1, the similar section detection device 100 includes a main storage unit such as a ROM (Read Only Memory) 104 and a RAM (Random Access Memory) 105 that stores various data and various programs, and various data and various types. It has an external storage unit 107 such as an HDD (Hard Disk Drive) or CD (Compact Disk) drive device for storing programs, and a bus 108 for connecting them, and has a hardware configuration using a normal computer. Yes. In addition, the similar section detection apparatus 100 includes a display unit 103 that displays information, an operation unit 102 such as a keyboard and a mouse that accepts user instruction input, and a communication unit 106 that controls communication with an external device. Each is connected by radio.

次に、このようなハードウェア構成において、類似区間検出装置１００のＣＰＵ１０１が記憶部や外部記憶部１０７に記憶された各種プログラムを実行することにより実現される各種機能について、図２を参照して説明する。類似区間検出装置１００は、第１入力受付部１２０と、第２入力受付部１２１と、特徴量抽出部１２２と、類似位置検出部１２３と、しきい値記憶部１２４と、検出結果出力部１２５と、類似位置分布算出部１２６と、しきい値更新部１２７とを有する。第１入力受付部１２０と、第２入力受付部１２１と、特徴量抽出部１２２と、類似位置検出部１２３と、検出結果出力部１２５と、類似位置分布算出部１２６と、しきい値更新部１２７とは、ＣＰＵ１０１のプログラム実行時にＲＡＭ１０５などの記憶部に生成されるものである。しきい値記憶部１２４は、ＨＤＤなどの外部記憶部１０７に生成される記憶領域である。 Next, in such a hardware configuration, various functions realized when the CPU 101 of the similar section detection device 100 executes various programs stored in the storage unit or the external storage unit 107 will be described with reference to FIG. explain. The similar section detection apparatus 100 includes a first input reception unit 120, a second input reception unit 121, a feature amount extraction unit 122, a similar position detection unit 123, a threshold value storage unit 124, and a detection result output unit 125. And a similar position distribution calculating unit 126 and a threshold value updating unit 127. First input reception unit 120, second input reception unit 121, feature amount extraction unit 122, similar position detection unit 123, detection result output unit 125, similar position distribution calculation unit 126, and threshold value update unit 127 is generated in the storage unit such as the RAM 105 when the CPU 101 executes the program. The threshold storage unit 124 is a storage area generated in the external storage unit 107 such as an HDD.

第１入力受付部１２０は、単位時間毎に撮像された映像を時系列で表す映像信号Ｖｘの入力を受け付ける。第２入力受付部１２１は、第１入力受付部１２０に入力される信号とは異なる映像信号Ｖｙの入力を受け付ける。しきい値記憶部１２４は、探索しきい値θ（０≦θ≦１）を記憶する。 The first input receiving unit 120 receives an input of a video signal Vx that represents a video captured every unit time in time series. The second input receiving unit 121 receives an input of a video signal Vy different from the signal input to the first input receiving unit 120. The threshold storage unit 124 stores the search threshold θ (0 ≦ θ ≦ 1).

特徴量抽出部１２２は、第１入力受付部１２０が入力を受け付けた映像信号Ｖｘから、単位時間毎の特徴量を抽出することにより、単位時間毎の特徴量を含む特徴量列Ｘを抽出すると共に、第２入力受付部１２１が入力を受け付けた映像信号Ｖｙから、単位時間毎の特徴量を抽出することにより、単位時間毎の特徴量を含む特徴量列Ｙを抽出する。映像信号から特徴量列を抽出する方法は、例えば以下の通りである。特徴量抽出部１２２は、単位時間に撮像された１枚のフレーム画像を９つの領域に分割し、各領域について平均輝度を求め、４段階（２ｂｉｔ）に量子化する。また、特徴量抽出部１２２は、これとは別に１つのフレーム画像全体の平均輝度も求め、１６段階（４ｂｉｔ）に量子化する。そして、特徴量抽出部１２２は、これらの平均輝度を連結して２２ｂｉｔの特徴量を抽出する。このようにして特徴量抽出部１２２は単位時間毎の特徴量を抽出し、これらを時系列で配列したものを特徴量列として抽出する。 The feature amount extraction unit 122 extracts the feature amount sequence X including the feature amount per unit time by extracting the feature amount per unit time from the video signal Vx received by the first input reception unit 120. At the same time, the feature amount sequence Y including the feature amount per unit time is extracted by extracting the feature amount per unit time from the video signal Vy received by the second input reception unit 121. A method for extracting a feature string from a video signal is, for example, as follows. The feature amount extraction unit 122 divides one frame image captured per unit time into nine regions, obtains an average luminance for each region, and quantizes it in four steps (2 bits). In addition to this, the feature quantity extraction unit 122 also obtains the average luminance of one entire frame image and quantizes it in 16 steps (4 bits). Then, the feature quantity extraction unit 122 concatenates these average luminances and extracts a 22-bit feature quantity. In this manner, the feature amount extraction unit 122 extracts feature amounts for each unit time, and extracts those obtained by arranging them in time series as a feature amount sequence.

類似位置検出部１２３は、特徴量列Ｘ及び特徴量列Ｙを時系列に照合して類似位置を探索し、その類似度がしきい値記憶部１２４に記憶された探索しきい値θより大きい位置を類似位置として検出する。具体的には、類似位置検出部１２３は、特徴量列Ｘ上に探索窓Ｗｘを設定すると共に、特徴量列Ｙ上に探索窓Ｗｙを設定し、探索窓Ｗｘ内の特徴量列と探索窓Ｗｙ内の特徴量列を照合して、映像信号Ｖｘ，Ｖｙにおける類似位置を探索する。探索窓とは、所定の時間の範囲であり、特徴量列上に設定されることにより、所定の時間の範囲内に対応する特徴量を含む特徴量列が照合の対象となる。このような探索窓Ｗｘ内の特徴量列と探索窓Ｗｙ内の特徴量列とを照合する際に、類似位置検出部１２３は、探索窓Ｗｘ,Ｗｙの間で特徴量列の類似度を算出し、当該類似度が、しきい値記憶部１２４に記憶された探索しきい値θより大きい場合に、探索窓Ｗｘの位置ｘ及び探索窓Ｗｙの位置ｙを表すｐ（ｘ，ｙ）を類似位置として検出する。そして類似位置検出部１２３は、検出した類似位置及びその類似度ｓを出力する。類似位置検出部１２３は、このような類似位置を探索する処理を、例えば、探索窓Ｗｙの位置ｙを０から１サンプルずつ順に進めて固定し、固定した位置ｙにおいて探索窓Ｗｘを自由に移動させて位置ｙ毎に繰り返し行う。 The similar position detection unit 123 searches the similar position by comparing the feature amount sequence X and the feature amount sequence Y in time series, and the similarity is larger than the search threshold θ stored in the threshold storage unit 124. The position is detected as a similar position. Specifically, the similar position detection unit 123 sets the search window Wx on the feature quantity sequence X and sets the search window Wy on the feature quantity sequence Y, and the feature quantity sequence and the search window in the search window Wx. The feature amount sequence in Wy is collated to search for similar positions in the video signals Vx and Vy. The search window is a predetermined time range, and is set on the feature amount sequence, so that a feature amount sequence including a corresponding feature amount within the predetermined time range is a target of collation. When matching such a feature quantity sequence in the search window Wx with a feature quantity sequence in the search window Wy, the similar position detection unit 123 calculates the similarity of the feature quantity sequence between the search windows Wx and Wy. When the similarity is larger than the search threshold value θ stored in the threshold value storage unit 124, the position x of the search window Wx and the p (x, y) representing the position y of the search window Wy are similar. Detect as position. Then, the similar position detection unit 123 outputs the detected similar position and the similarity s. The similar position detection unit 123, for example, fixes the search for the similar position by sequentially moving the position y of the search window Wy one sample at a time from 0, and freely moves the search window Wx at the fixed position y. And repeatedly for each position y.

探索窓Ｗｘ,Ｗｙの間の特徴量列の類似度を算出する方法は例えば以下の通りである。類似位置検出部１２３は、探索窓Ｗｘ内の特徴量列のヒストグラムＨｘと、探索窓Ｗｙ内の特徴量列のヒストグラムＨｙとを求め、ヒストグラムＨｘ,Ｈｙが重なる割合を表すヒストグラム重なり率を類似度として算出する。ヒストグラムＨｘ，Ｈｙを各々構成する各ビン（柱）の数をＭ，ヒストグラムの要素を各々ｈｘｉ，ｈｙｉとすると（１≦ｉ≦Ｍ）、ヒストグラム重なり率ｓ（Ｈｘ，Ｈｙ）は以下の式１により求めることができる。 The method for calculating the similarity of the feature amount sequence between the search windows Wx and Wy is, for example, as follows. The similar position detection unit 123 obtains the histogram Hx of the feature amount sequence in the search window Wx and the histogram Hy of the feature amount sequence in the search window Wy, and calculates the histogram overlap rate indicating the ratio of the histograms Hx and Hy as similarities. Calculate as If the number of bins (columns) constituting the histograms Hx and Hy is M and the elements of the histogram are hxi and hyi (1 ≦ i ≦ M), the histogram overlap rate s (Hx, Hy) is expressed by the following equation 1 It can ask for.

なお、本実施の形態においては、類似位置検出部１２３は、ＲＩＦＡＳ法に示されるように、類似度が探索しきい値θを明らかに下回る位置での照合を省略することによって、類似位置を高速に探索する。上述したＲＩＦＡＳ法では、探索空間上の１点で類似度ｓが求まると、類似度が探索しきい値θを超えない領域であり、即ち、探索をスキップ可能であるスキップ可能領域があることが示されている。ＲＩＦＡＳ法では、このスキップ可能領域で探索空間全体を敷き詰めて探索を進める。スキップ可能領域は４５度傾いた正方形の領域であり、対角線の長さを２ｗとするとｗは以下の式２で表すことができる。 In the present embodiment, as shown in the RIFAS method, the similar position detection unit 123 performs high-speed similar position detection by omitting collation at a position where the similarity is clearly below the search threshold value θ. To explore. In the RIFAS method described above, when the similarity s is obtained at one point in the search space, there is a region where the similarity does not exceed the search threshold θ, that is, there is a skippable region where the search can be skipped. It is shown. In the RIFAS method, the entire search space is laid out in this skippable area and the search proceeds. The skippable area is a square area inclined by 45 degrees, and w can be expressed by the following expression 2 when the length of the diagonal line is 2w.

この式２は、探索しきい値θが大きければ、スキップ可能領域も大きくなることを示している。よって、探索しきい値θが大きければ探索を高速化できる。一方大きくし過ぎると探索しきい値θ以下の類似位置は検出できなくなるから、探索漏れが発生する。本実施の形態においては、類似位置検出部１２３の機能により類似位置を高速に探索可能にする。また、以下で説明する検出結果出力部１２５、類似位置分布算出部１２６及びしきい値更新部１２７の各機能により、検出される類似位置の数が減少するように探索しきい値θを最適化することにより、過剰検出や探索漏れを抑制可能にする。 This equation 2 shows that if the search threshold θ is large, the skippable area is also large. Therefore, if the search threshold θ is large, the search can be speeded up. On the other hand, if it is too large, a similar position below the search threshold value θ cannot be detected, and a search omission occurs. In the present embodiment, the function of the similar position detection unit 123 enables a similar position to be searched at high speed. Further, the search threshold value θ is optimized so that the number of detected similar positions is reduced by the functions of the detection result output unit 125, the similar position distribution calculation unit 126, and the threshold value update unit 127 described below. By doing so, excessive detection and search omission can be suppressed.

検出結果出力部１２５は、類似位置検出部１２３の検出結果ｒを２値化して出力する。即ち、探索窓Ｗｘ内の特徴量列と探索窓Ｗｙ内の特徴量列との照合の結果、類似位置ｐ（ｘ，ｙ）が出力された場合、検出結果出力部１２５は、「ｒ＝１」として検出結果ｒを出力し、類似位置が出力されない場合、「ｒ＝０」として検出結果ｒを出力する。 The detection result output unit 125 binarizes and outputs the detection result r of the similar position detection unit 123. That is, when the similar position p (x, y) is output as a result of matching between the feature amount sequence in the search window Wx and the feature amount sequence in the search window Wy, the detection result output unit 125 displays “r = 1. The detection result r is output as “”, and when the similar position is not output, the detection result r is output as “r = 0”.

類似位置分布算出部１２６は、類似位置検出部１２３が出力する検出結果ｒを用いて、類似位置の数の分布を算出する。具体的には、類似位置分布算出部１２６は、例えば、探索窓Ｗｙの位置ｙ毎に、当該探索窓Ｗｙに対して各々設定される探索窓Ｗｘにおいて各々出力される検出結果ｒについて、当該値が１の個数を算出して、単位長さ辺りの類似位置の数の平均値Ｎを算出する。 The similar position distribution calculation unit 126 calculates the distribution of the number of similar positions using the detection result r output from the similar position detection unit 123. Specifically, the similar position distribution calculation unit 126, for example, for each detection result r output in the search window Wx set for the search window Wy for each position y of the search window Wy, The number of 1 is calculated, and the average value N of the number of similar positions around the unit length is calculated.

しきい値更新部１２７は、類似位置分布算出部１２６が算出した、類似位置の分布が所定の分布条件を満たす場合に、探索しきい値θを更新し、更新した値をしきい値記憶部１２４に記憶させる。所定の分布条件とは、例えば、類似位置分布算出部１２６が算出した平均値Ｎが更新しきい値α１を超えることである。更新しきい値α１は例えばＨＤＤなどの外部記憶部１０７に予め記憶されている。探索しきい値θの更新は、例えば、更新の都度、探索しきい値θに対して定数ｄを加えることにより行なう。この結果、更新の都度、探索しきい値θは定数ｄずつ増加することになる。定数ｄは例えばＨＤＤなどの外部記憶部１０７に予め記憶されている。このように更新された探索しきい値θを用いて類似位置検出部１２３が類似位置を探索すると、更新前の探索しきい値θを用いた場合と比べて、検出される類似位置の数は減少することになり、過剰検出が減少することになる。 The threshold update unit 127 updates the search threshold θ when the similar position distribution calculated by the similar position distribution calculation unit 126 satisfies a predetermined distribution condition, and the updated value is stored in the threshold storage unit. 124 is stored. The predetermined distribution condition is, for example, that the average value N calculated by the similar position distribution calculation unit 126 exceeds the update threshold value α1. The update threshold value α1 is stored in advance in the external storage unit 107 such as an HDD. The search threshold value θ is updated, for example, by adding a constant d to the search threshold value θ every time it is updated. As a result, each time the update is performed, the search threshold θ increases by a constant d. The constant d is stored in advance in the external storage unit 107 such as an HDD. When the similar position detection unit 123 searches for a similar position using the search threshold θ updated in this way, the number of detected similar positions is smaller than when the search threshold θ before the update is used. Will reduce and overdetection will decrease.

本実施の形態において、平均値Ｎが更新しきい値α１が超える場合に、過剰検出が発生していると判定するのは以下の理由からである。特徴量列ＸをＸ軸とし特徴量列ＹをＹ軸とする２次元のＸＹ空間において、類似区間は傾き１の線分として検出される。一方、過剰検出はＸ軸あるいはＹ軸に平行な線分として検出される。従って、類似位置検出部１２３が位置ｙ毎の単位長さ辺りの類似位置数の平均値Ｎを算出することでＸ軸に平行な線分を検出し、この平均値Ｎと更新しきい値α１とを比較することにより過剰検出が発生しているか否かを判定することができるからである。そして、本実施の形態においては、過剰検出が発生していると判定した場合には、しきい値更新部１２７が、過剰検出が減少するように、即ち、検出される類似位置の数が減少するように探索しきい値θを更新する。このように、類似位置分布算出部１２６が位置ｙ毎に算出した平均値Ｎが更新しきい値α１より大きくなる都度、しきい値更新部１２７が探索しきい値θを更新することにより、探索しきい値θを最適化し、最適化された探索しきい値θを用いて類似位置検出部１２３が類似位置の探索を行なう。このような処理を類似位置検出部１２３は類似位置の探索を行なう過程で過剰検出が発生しなくなるまで繰り返すことによって、探索しきい値θを最適化し、連続する類似位置の集合である類似区間を取得する。 In the present embodiment, when the average value N exceeds the update threshold value α1, it is determined that excessive detection has occurred for the following reason. In a two-dimensional XY space in which the feature quantity sequence X is the X axis and the feature quantity sequence Y is the Y axis, the similar section is detected as a line segment with a slope of 1. On the other hand, excessive detection is detected as a line segment parallel to the X axis or the Y axis. Therefore, the similar position detector 123 detects the line segment parallel to the X axis by calculating the average value N of the number of similar positions per unit length for each position y, and this average value N and the update threshold value α1. This is because it can be determined whether or not excessive detection has occurred. In this embodiment, when it is determined that excessive detection has occurred, the threshold update unit 127 reduces the excessive detection, that is, the number of detected similar positions decreases. The search threshold θ is updated so that In this way, whenever the average value N calculated by the similar position distribution calculation unit 126 for each position y becomes larger than the update threshold value α1, the threshold value update unit 127 updates the search threshold value θ so that the search is performed. The threshold value θ is optimized, and the similar position detection unit 123 searches for a similar position using the optimized search threshold value θ. The similar position detection unit 123 repeats such processing until the excessive detection does not occur in the process of searching for similar positions, thereby optimizing the search threshold θ, and selecting a similar section that is a set of consecutive similar positions. get.

（２）動作
次に、本実施の形態にかかる類似区間検出装置１００の行う類似区間検出処理の手順について図３を用いて説明する。ステップＳ１では、類似区間検出装置１００には、第１入力受付部１２０の機能により、映像信号Ｖｘの入力を受け付け、第２入力受付部１２１の機能により、映像信号Ｖｙの入力を受け付ける。そして、類似区間検出装置１００は、特徴量抽出部１２２の機能により、映像信号Ｖｘから特徴量列Ｘを抽出し、映像信号Ｖｙから特徴量列Ｙを抽出する。ステップＳ２では、類似区間検出装置１００は、類似位置検出部１２３の機能により、特徴量列Ｘ上に探索窓Ｗｘを設定すると共に、特徴量列Ｙ上に探索窓Ｗｙを設定し、まず、探索窓Ｗｙの位置ｙを０に初期化する。ステップＳ３では、類似区間検出装置１００は、類似位置検出部１２３の機能により、特徴量列ＸをＸ軸とし特徴量列ＹをＹ軸とした２次元のＸＹ空間上の全ての位置ｙ，ｘで類似位置の探索が終了したか否かを判断し、当該判断結果が肯定的である場合、処理を終了し、当該判断結果が否定的である場合、探索窓Ｗｙの位置ｙを正方向に１サンプル移動させ、ステップＳ４に進む。 (2) Operation Next, a procedure of similar section detection processing performed by the similar section detection apparatus 100 according to the present embodiment will be described with reference to FIG. In step S 1, the similar section detection device 100 receives an input of the video signal Vx by the function of the first input receiving unit 120 and receives an input of the video signal Vy by the function of the second input receiving unit 121. Then, the similar section detection device 100 extracts the feature amount sequence X from the video signal Vx and the feature amount sequence Y from the video signal Vy by the function of the feature amount extraction unit 122. In step S 2, the similar section detection device 100 sets a search window Wx on the feature quantity sequence X and sets a search window Wy on the feature quantity sequence Y by the function of the similar position detection unit 123. The position y of the window Wy is initialized to 0. In step S 3, the similar section detection device 100 uses the function of the similar position detection unit 123 to perform all the positions y, x in the two-dimensional XY space with the feature quantity sequence X as the X axis and the feature quantity sequence Y as the Y axis. To determine whether the search for similar positions is completed. If the determination result is affirmative, the process ends. If the determination result is negative, the position y of the search window Wy is set in the positive direction. Move one sample and go to step S4.

ステップＳ４では、類似区間検出装置１００は、探索窓Ｗｙの位置ｙを固定して、探索窓Ｗｘの位置ｘを自由に移動させ（例えば１サンプルずつ正方向に移動させる。あるいはＲＩＦＡＳ法によって探索が必要な位置に移動する。）、探索窓Ｗｘ内の特徴量列と探索窓Ｗｙ内の特徴量列とを照合して、類似位置を探索する。この照合の際に、類似区間検出装置１００は、探索窓Ｗｘ,Ｗｙの間で特徴量列の類似度を算出し、当該類似度が、しきい値記憶部１２４に記憶された探索しきい値θより大きいか否かを判定する。当該類似度が、しきい値記憶部１２４に記憶された探索しきい値θより大きい場合に、類似区間検出装置１００は、探索窓Ｗｘ,Ｗｙの位置ｐ（ｘ，ｙ）を類似位置として検出し、当該類似位置及びその類似度ｓを出力する。また、類似区間検出装置１００は、検出結果出力部１２５の機能により、類似位置の検出結果ｒを出力する。なお、上述したＲＩＦＡＳ法を用いて、ステップＳ４では、類似区間検出装置１００は、類似度が探索しきい値θを明らかに下回る位置（スキップ可能領域）での照合を省略する。 In step S4, the similar section detection apparatus 100 fixes the position y of the search window Wy, and freely moves the position x of the search window Wx (for example, moves it in the positive direction by one sample. Alternatively, the search is performed by the RIFAS method. It moves to the required position.), The feature amount sequence in the search window Wx and the feature amount sequence in the search window Wy are collated to search for a similar position. At the time of this collation, the similar section detection device 100 calculates the similarity of the feature amount sequence between the search windows Wx and Wy, and the similarity is stored in the threshold storage unit 124. It is determined whether it is greater than θ. When the similarity is larger than the search threshold θ stored in the threshold storage unit 124, the similar section detection device 100 detects the position p (x, y) of the search windows Wx, Wy as a similar position. Then, the similar position and the similarity s are output. Further, the similar section detection device 100 outputs the detection result r of the similar position by the function of the detection result output unit 125. Note that using the RIFAS method described above, in step S4, the similar section detection device 100 omits collation at a position (skipable area) where the similarity is clearly below the search threshold θ.

位置ｙに対する全ての位置ｘについて類似位置の探索が終了すると、ステップＳ５では、類似区間検出装置１００は、類似位置分布算出部１２６の機能により、位置ｙにおいて位置ｘ毎に各々検出された検出結果ｒについてその値が１の個数を算出して、単位長さ辺りの類似位置数の平均値Ｎを算出する。ステップＳ６では、類似区間検出装置１００は、しきい値更新部１２７の機能により、ステップＳ５で算出した平均値Ｎが更新しきい値α１を超えるか否かを判定する。平均値Ｎが更新しきい値α１を超える場合は、過剰検出が発生したということである。この場合、ステップＳ７では、類似区間検出装置１００は、しきい値記憶部１２４に記憶された探索しきい値θに対して定数ｄを加えて探索しきい値θを更新して、更新した値をしきい値記憶部１２４に記憶させる。その後ステップＳ３に戻る。 When the search for similar positions for all the positions x with respect to the position y is completed, in step S5, the similar section detection device 100 detects each of the positions x at the position y by the function of the similar position distribution calculation unit 126. The number of r having a value of 1 is calculated, and an average value N of the number of similar positions per unit length is calculated. In step S6, the similar section detection device 100 determines whether or not the average value N calculated in step S5 exceeds the update threshold value α1 by the function of the threshold value update unit 127. If the average value N exceeds the update threshold value α1, it means that over detection has occurred. In this case, in step S7, the similar section detection device 100 updates the search threshold θ by adding a constant d to the search threshold θ stored in the threshold storage unit 124, and updates the updated value. Is stored in the threshold storage unit 124. Thereafter, the process returns to step S3.

次に、本実施の形態で探索しきい値θを最適化する原理について図４を用いて説明する。符号３０１は、特徴量列ＸをＸ軸とし特徴量列ＹをＹ軸とした２次元のＸＹ空間において設定される探索窓Ｗｘ，Ｗｙ及び当該ＸＹ空間において類似位置検出部１２３が検出した類似位置を黒点の集合として表している。同図においては、黒点の集合は、傾き１の線分や、Ｘ軸に平行な線分や、Ｙ軸に平行な線分として表されている。符号３０２は、類似位置分布算出部１２６がＹ軸上の位置毎に算出した単位長さ当たりの類似位置の数についての推移を表している。符号３０３は、しきい値更新部１２７が更新する探索しきい値θの推移を表している。 Next, the principle of optimizing the search threshold θ in the present embodiment will be described with reference to FIG. Reference numeral 301 denotes search windows Wx and Wy set in a two-dimensional XY space having the feature quantity string X as the X axis and the feature quantity string Y as the Y axis, and similar positions detected by the similar position detection unit 123 in the XY space. Is represented as a set of sunspots. In the figure, a set of black dots is represented as a line segment having a slope of 1, a line segment parallel to the X axis, or a line segment parallel to the Y axis. Reference numeral 302 represents a transition of the number of similar positions per unit length calculated by the similar position distribution calculation unit 126 for each position on the Y axis. Reference numeral 303 represents a transition of the search threshold θ updated by the threshold update unit 127.

まず，過剰検出による類似位置と真の類似位置の違いを説明する。上述したように、類似区間検出装置１００は、探索窓Ｗｙの位置ｙを０から１サンプルずつ順に移動して固定し、固定した位置ｙにおいて探索窓Ｗｘを自由に移動させて類似位置を探索するという処理を位置ｙ毎に繰り返し行う。符号３０１に示されるように、真の類似位置は傾きが１の線分の周辺に分布する。一方、過剰検出による類似位置はＸ軸またはＹ軸に平行な線分の周辺に分布する。例えば、映像信号Ｖｙの表す映像に暗部がある場合、映像信号Ｖｘの表す映像にある暗部と類似するとしてこの暗部が類似位置として検出されることがある。また、映像信号Ｖｘにおいて暗部を有するフレーム画像が連続する場合は、Ｘ軸に平行な線分の周辺に類似位置が分布する傾向がある。暗部自体は厳密には一致するものではないため、過剰検出による暗部での類似度は、真の類似位置での類似度よりも低くなる。このため、探索しきい値を適切に設定すれば、真の類似位置と過剰検出による類似位置とを区別することができる。 First, the difference between the similar position and the true similar position due to excessive detection will be described. As described above, the similar section detection device 100 moves and fixes the position y of the search window Wy one sample at a time from 0, and searches the similar position by freely moving the search window Wx at the fixed position y. This process is repeated for each position y. As indicated by reference numeral 301, true similar positions are distributed around a line segment having an inclination of 1. On the other hand, similar positions due to excessive detection are distributed around a line segment parallel to the X axis or the Y axis. For example, if the video represented by the video signal Vy has a dark part, the dark part may be detected as a similar position as being similar to the dark part in the video represented by the video signal Vx. In addition, when frame images having dark portions are continuous in the video signal Vx, similar positions tend to be distributed around the line segment parallel to the X axis. Since the dark part itself does not exactly match, the similarity in the dark part due to overdetection is lower than the similarity in the true similar position. For this reason, if the search threshold value is appropriately set, the true similar position and the similar position due to excessive detection can be distinguished.

次に、類似位置の過剰検出と探索しきい値の最適化とについて説明する。まず、類似区間検出装置１００は、探索しきい値θを、当該探索しきい値の最適な値として想定される最適値θａよりも低い値θｏに設定して探索を始める。即ち、探索しきい値θの初期値としてθｏがしきい値記憶部１２４に予め記憶される。上述したように過剰検出による類似位置はＸ軸に平行な線分の周辺に分布するから、例えば探索窓Ｗｙの位置が０からｙ１まで進むと、符号３０２に示されるように、位置ｙ１でのＸ軸についての単位長さ当たりの類似位置の数が急激に増加する。この数の平均値をＮ（ｙ１）とすると、以下の式３が成立する場合に過剰検出が発生したと類似区間検出装置１００は判定する。α１は、上述した更新しきい値である。 Next, excessive detection of similar positions and search threshold optimization will be described. First, the similar section detection device 100 starts the search by setting the search threshold θ to a value θo lower than the optimum value θa assumed as the optimum value of the search threshold. That is, θo is stored in advance in the threshold storage unit 124 as an initial value of the search threshold θ. As described above, similar positions due to excessive detection are distributed around the line segment parallel to the X axis. Therefore, when the position of the search window Wy advances from 0 to y1, for example, as indicated by reference numeral 302, the position at the position y1 The number of similar positions per unit length about the X axis increases rapidly. When the average value of this number is N (y1), the similar section detection device 100 determines that excessive detection has occurred when the following expression 3 holds. α1 is the update threshold value described above.

そして、類似区間検出装置１００は、過剰検出が発生したと判定した場合、探索しきい値θを定数ｄだけ増加させることにより、探索しきい値を更新する。更新後の探索しきい値をθ’とすると、以下の式４が成り立つ。 If the similar section detection device 100 determines that excessive detection has occurred, the similar section detection device 100 updates the search threshold by increasing the search threshold θ by a constant d. Assuming that the updated search threshold is θ ′, the following Expression 4 is established.

類似区間検出装置１００は、更新後の探索しきい値θ’を新たに探索しきい値θとして用いて類似位置を探索する。探索しきい値θが大きくなると、検出される類似位置の数は減少する。しかし、更新された探索しきい値θが最適値よりも小さい場合、探索を進めるうちに、符号３０２に示すように、Ｘ軸の単位長さ当たりの類似位置の数が更新しきい値α１を超えることが繰り返し起こり得る。このため、このようなことが起こった場合に、類似区間検出装置１００は、探索しきい値θを式４により改めて更新することで、検出される類似位置の数が減少するように探索しきい値θを最適化する。このようにして、本実施の形態によれば、探索しきい値を自動的に最適値に設定することができ、その結果、過剰検出や探索漏れを抑制可能に、また類似位置を高速に探索することができる。 The similar section detection apparatus 100 searches for a similar position by using the updated search threshold value θ ′ as the search threshold value θ. As the search threshold θ increases, the number of detected similar positions decreases. However, when the updated search threshold value θ is smaller than the optimum value, as the search proceeds, the number of similar positions per unit length of the X axis becomes the update threshold value α1 as the reference numeral 302 indicates. It can happen repeatedly. Therefore, when such a situation occurs, the similar section detection apparatus 100 performs a search threshold so that the number of detected similar positions is reduced by updating the search threshold θ again using Equation 4. Optimize the value θ. In this way, according to the present embodiment, the search threshold can be automatically set to the optimum value, and as a result, excessive detection and search omission can be suppressed, and a similar position can be searched at high speed. can do.

[第２の実施の形態]
次に、類似区間検出装置の第２の実施の形態について説明する。なお、上述の第１の実施の形態と共通する部分については、同一の符号を使用して説明したり、説明を省略したりする。 [Second Embodiment]
Next, a second embodiment of the similar section detection device will be described. In addition, about the part which is common in the above-mentioned 1st Embodiment, it demonstrates using the same code | symbol or abbreviate | omits description.

（１）構成
本実施の形態においては、時系列信号として音声信号を取り扱い、２つの音声信号から類似区間を検出する構成について説明する。本実施の形態にかかる類似区間検出装置１００の機能的構成自体は図２に例示したものと略同様である。本実施の形態にかかる構成が、上述の第1の実施の形態と異なる点は以下の通りである。第１入力受付部１２０は、単位時間毎に音声信号の入力を受け付け、第２入力受付部１２１は、単位時間毎に音声信号の入力を受け付ける。特徴量抽出部１２２は、第１入力受付部１２０が入力を受け付けた音声信号から、単位時間毎の特徴量を抽出することにより、単位時間毎の特徴量を含む特徴量列を抽出すると共に、第２入力受付部１２１が入力を受け付けた音声信号から、単位時間毎の特徴量を抽出することにより、単位時間毎の特徴量を含む特徴量列を抽出する。音声信号号から特徴量列を抽出する方法は、例えば以下の通りである。なお、音声信号は例えば１１０２５［Ｈｚ］でサンプリングされた１６ｂｉｔのＰＣＭ信号とする。特徴量抽出部１２２は、２次ＩＩＲデジタルフィルタを有し、２次ＩＩＲデジタルフィルタを用いて２００［Ｈｚ］から３７００［Ｈｚ］をｌｏｇスケールで６等分した７つの周波数を中心とするバンドパスフィルタ群を構成する。２次ＩＩＲデジタルフィルタはＣＰＵ１０１がＲＯＭ１０４や外部記憶部１０７に記憶されたプログラムを実行することにより実現しても良いし、ＤＳＰや順序回路で構成しても良い。このバンドパスフィルタ群に、第１入力受付部１２０が入力を受け付けた音声信号及び第２入力受付部１２１が入力を受け付けた音声信号を入力し、特徴量抽出部１２２は、時間について一定間隔毎にパワーを算出する。チャンネル数は７だから一回のサンプリングで７個のパワーＰ（ｉ）（１≦ｉ≦７）が得られる。サンプリング間隔は２９．９７［Ｈｚ］とする。そして、特徴量抽出部１２２は、パワー値Ｐ（ｉ）を３つのしきい値を用いて２ｂｉｔにベクトル量子化してＢ（ｉ）を得る。特徴量抽出部１２２は、このベクトルをＢ（１）Ｂ（２）Ｂ（３）Ｂ（４）Ｂ（５）Ｂ（６）Ｂ（７）の順に連結して１４ｂｉｔの符号を生成し、これを音声信号の特徴量とする。なお、ベクトル量子化で用いるしきい値としては、入力される音声信号が表す音声に似た音声を分析して符号化の効率が最も良い値を求め、この値を固定値として用いれば良い。 (1) Configuration In the present embodiment, a configuration is described in which an audio signal is handled as a time series signal and a similar section is detected from two audio signals. The functional configuration itself of the similar section detection apparatus 100 according to the present embodiment is substantially the same as that illustrated in FIG. The configuration according to the present embodiment is different from the first embodiment described above as follows. The first input receiving unit 120 receives an audio signal input every unit time, and the second input receiving unit 121 receives an audio signal input every unit time. The feature amount extraction unit 122 extracts a feature amount sequence including a feature amount per unit time by extracting a feature amount per unit time from the audio signal received by the first input reception unit 120, and The feature amount sequence including the feature amount for each unit time is extracted by extracting the feature amount for each unit time from the audio signal received by the second input receiving unit 121. A method for extracting a feature string from an audio signal is, for example, as follows. Note that the audio signal is, for example, a 16-bit PCM signal sampled at 11025 [Hz]. The feature amount extraction unit 122 has a secondary IIR digital filter, and a band pass centered on seven frequencies obtained by dividing the logarithm from 200 [Hz] to 3700 [Hz] into 6 parts using the secondary IIR digital filter. A filter group is configured. The secondary IIR digital filter may be realized by the CPU 101 executing a program stored in the ROM 104 or the external storage unit 107, or may be configured by a DSP or a sequential circuit. The band pass filter group is input with the audio signal received by the first input receiving unit 120 and the audio signal received by the second input receiving unit 121, and the feature amount extracting unit 122 is arranged at regular intervals with respect to time. Calculate the power. Since the number of channels is 7, seven powers P (i) (1 ≦ i ≦ 7) can be obtained by one sampling. The sampling interval is 29.97 [Hz]. Then, the feature amount extraction unit 122 vector-quantizes the power value P (i) to 2 bits using three threshold values to obtain B (i). The feature amount extraction unit 122 generates a 14-bit code by concatenating the vectors in the order of B (1) B (2) B (3) B (4) B (5) B (6) B (7), This is defined as a feature amount of the audio signal. As a threshold used in vector quantization, a value similar to the sound represented by the input sound signal is analyzed to obtain a value with the highest coding efficiency, and this value may be used as a fixed value.

（２）動作
次に、本実施の形態にかかる類似区間検出装置１００の行う類似区間検出処理の手順について説明する。本実施の形態にかかる類似区間検出処理の手順自体は図３に示したものと略同様である。ステップＳ１では、類似区間検出装置１００には、第１入力受付部１２０の機能により、音声信号の入力を受け付け、第２入力受付部１２１の機能により、音声信号の入力を受け付ける。そして、類似区間検出装置１００は、特徴量抽出部１２２の機能により、入力を受け付けた２つの音声信号から各々特徴量列を抽出する。ステップＳ２以降は上述の第１の実施の形態と同様である。 (2) Operation Next, a procedure of similar section detection processing performed by the similar section detection apparatus 100 according to the present embodiment will be described. The procedure of the similar section detection process according to this embodiment is substantially the same as that shown in FIG. In step S 1, the similar section detection device 100 receives an input of an audio signal by the function of the first input receiving unit 120 and receives an input of an audio signal by the function of the second input receiving unit 121. Then, the similar section detection device 100 extracts a feature amount sequence from each of the two audio signals that have been input by the function of the feature amount extraction unit 122. Step S2 and subsequent steps are the same as those in the first embodiment described above.

以上のような構成によれば、音声信号について類似位置を探索する場合であっても、探索しきい値を自動的に最適値に設定することができ、その結果、過剰検出や探索漏れを抑制可能に、類似位置を高速に探索することができる。 According to the configuration as described above, even when searching for a similar position in the audio signal, the search threshold can be automatically set to the optimum value, and as a result, excessive detection and search omission are suppressed. It is possible to search for similar positions at high speed.

[変形例]
なお、本発明は前記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、前記実施形態に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態にわたる構成要素を適宜組み合わせてもよい。また、以下に例示するような種々の変形が可能である。 [Modification]
Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. Moreover, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, constituent elements over different embodiments may be appropriately combined. Further, various modifications as exemplified below are possible.

＜変形例１＞
上述した各実施の形態において、類似区間検出装置１００で実行される各種プログラムを、インターネット等のネットワークに接続されたコンピュータ上に格納し、ネットワーク経由でダウンロードさせることにより提供するように構成しても良い。また当該各種プログラムを、インストール可能な形式又は実行可能な形式のファイルでＣＤ−ＲＯＭ、フレキシブルディスク（ＦＤ）、ＣＤ−Ｒ、ＤＶＤ（Digital Versatile Disk）等のコンピュータで読み取り可能な記録媒体に記録してコンピュータプログラムプロダクトとして提供するように構成しても良い。 <Modification 1>
In each embodiment described above, various programs executed by the similar section detection device 100 may be stored on a computer connected to a network such as the Internet and provided by being downloaded via the network. good. The various programs are recorded in a computer-readable recording medium such as a CD-ROM, a flexible disk (FD), a CD-R, and a DVD (Digital Versatile Disk) in a file in an installable or executable format. The computer program product may be provided.

また、第１入力受付部１２０と、第２入力受付部１２１と、特徴量抽出部１２２と、類似位置検出部１２３と、検出結果出力部１２５と、類似位置分布算出部１２６と、しきい値更新部１２７とは、ＣＰＵ１０１のプログラム実行時にＲＡＭ１０５などの記憶部に生成されるものであるとしたが、これらのうち少なくとも１つをプロセッサなどのハードウェアにより構成しても良い。 Further, the first input receiving unit 120, the second input receiving unit 121, the feature amount extracting unit 122, the similar position detecting unit 123, the detection result output unit 125, the similar position distribution calculating unit 126, a threshold value The update unit 127 is generated in the storage unit such as the RAM 105 when the CPU 101 executes the program, but at least one of these may be configured by hardware such as a processor.

＜変形例２＞
上述した各実施の形態において、類似位置検出部１２３は、探索窓Ｗｙを位置ｙに固定して探索窓Ｗｘの位置ｘを自由に移動させて、類似位置の探索を行なったが、探索窓Ｗｘを位置ｘに固定して探索窓Ｗｙの位置ｙを自由に移動させて、類似位置の探索を行なっても良い。 <Modification 2>
In each of the embodiments described above, the similar position detection unit 123 searches the similar position by fixing the search window Wy to the position y and freely moving the position x of the search window Wx. May be fixed at position x and the position y of the search window Wy may be freely moved to search for similar positions.

また、上述した各実施の形態において、過剰検出が発生しているか否かを判定するための所定の分布条件は、探索窓Ｗｙを固定した場合の類似位置の数の平均値が更新しきい値α１を超えることであったが、過剰検出による類似位置がＸ軸またはＹ軸に平行な線分の周辺に分布する性質を利用した条件であればどのような分布条件であっても良い。 In each of the above-described embodiments, the predetermined distribution condition for determining whether or not excessive detection has occurred is that the average value of the number of similar positions when the search window Wy is fixed is the update threshold value. As long as α1 is exceeded, any distribution condition may be used as long as the condition utilizes the property that similar positions due to overdetection are distributed around a line segment parallel to the X axis or the Y axis.

例えば、類似位置の数の変化率が更新しきい値α２を超えることを所定の分布条件として、過剰検出を判定しても良い。例えば、探索窓Ｗｙが位置ｙ（ｉ）にある場合の類似位置の数をＮ（ｙ（ｉ））、位置ｙ（ｉ＋１）にある場合の類似位置の数をＮ（ｙ（ｉ＋１））とすると、類似位置の変化率Ａ（ｉ）は以下の式５により表すことができる。 For example, over-detection may be determined using a predetermined distribution condition that the rate of change in the number of similar positions exceeds the update threshold value α2. For example, the number of similar positions when the search window Wy is at the position y (i) is N (y (i)), and the number of similar positions when the search window Wy is at the position y (i + 1) is N (y (i + 1)). Then, the change rate A (i) of the similar position can be expressed by the following formula 5.

類似位置の変化率Ａ（ｉ）が予め決められた更新しきい値α２を超えて以下の式６に表される関係を満たす場合に所定の条件を満たすとして、過剰検出過剰検出が発生したと判定すれば良い。 It is assumed that over-detection and over-detection have occurred, assuming that a predetermined condition is satisfied when the rate of change A (i) of the similar position exceeds a predetermined update threshold value α2 and satisfies the relationship expressed by the following expression 6. Judgment is sufficient.

また、探索窓Ｗｙを固定した線分上だけで類似位置の数を計数するのではなく、特徴量列ＸをＸ軸とし特徴量列ＹをＹ軸とした２次元のＸＹ空間を１６等分して、等分された各領域内における類似位置の数Ｎが更新しきい値α３を超えることを所定の条件として、過剰検出を判定しても良い。 Also, instead of counting the number of similar positions only on the line segment with the search window Wy fixed, a two-dimensional XY space with the feature quantity sequence X as the X axis and the feature quantity sequence Y as the Y axis is divided into 16 equal parts. Then, overdetection may be determined on the condition that the number N of similar positions in each equally divided region exceeds the update threshold value α3.

また、特徴量列ＸをＸ軸とし特徴量列ＹをＹ軸とした２次元のＸＹ空間上で、上述した各実施の形態のように探索窓Ｗｙを少しずつ移動させながら探索を進める場合に、ハフ変換を用いてＸ軸またはＹ軸に平行な線分を検出し、その線分の長さが更新しきい値α４を超えることを所定の分布条件として、過剰検出を判定しても良い。 In the case where the search is advanced while moving the search window Wy little by little in the two-dimensional XY space having the feature quantity row X as the X axis and the feature quantity row Y as the Y axis as in the above-described embodiments. Further, it is also possible to detect a line segment parallel to the X-axis or the Y-axis using the Hough transform, and to determine over-detection with a predetermined distribution condition that the length of the line segment exceeds the update threshold value α4. .

また、上述した各実施の形態のように探索窓Ｗｙを少しずつ移動させながら類似位置の数を記憶しておいて、類似位置の数の推移に基づいて所定の分布条件を設定して、過剰検出を判定しても良い。例えば、類似位置の数の推移から分散を求めて、この分散が更新しきい値α５を下回る場合には、類似位置が極端に密に分布していることを表すから、この場合に所定の分布条件を満たすとして、過剰検出が発生したと判定する。 Also, as in the above-described embodiments, the number of similar positions is stored while moving the search window Wy little by little, and a predetermined distribution condition is set based on the transition of the number of similar positions, and excessive Detection may be determined. For example, when the variance is obtained from the transition of the number of similar positions and this variance is below the update threshold value α5, it indicates that the similar positions are extremely densely distributed. If the condition is satisfied, it is determined that excessive detection has occurred.

＜変形例３＞
上述した第１の実施の形態において、ＲＩＦＡＳ法ではスキップ可能領域で評価した領域を谷の位置の列Ｕで管理するが、この谷Ｕの数が更新しきい値α６を超えるか否かにより、過剰検出を判定しても良い。ＲＩＦＡＳ法で類似位置を探索する原理を以下に簡単に説明して、この手法で用いられる谷がどのようなものであるかを説明する。ＲＩＦＡＳ法では、特徴量列ＸをＸ軸とし特徴量列ＹをＹ軸とした２次元のＸＹ空間（探索空間という）においてある一点の位置で類似度が求められた場合に、その位置の周囲に探索しきい値θを上回り得ない領域があることを示し、この領域での照合を省略することによって探索を高速化した。この領域がスキップ可能領域である。スキップ可能領域は類似度を求めた位置を中心とする４５度傾いた正方形であり、正方形の対角線の長さは類似度から算出できる。ＲＩＦＡＳ法では部分信号探索問題を２次元の探索空間をスキップ可能領域か類似位置かで敷き詰める問題として定式化した。まず探索窓Ｗｙを固定した状態で探索窓Ｗｘを走査して全ての位置についてスキップ可能領域か類似位置であるかの評価をすると、探索窓Ｗｙを１サンプル移動させる処理を繰り返す。この際にスキップ可能領域か類似位置として評価した領域と未評価の領域との境界線の形状を記憶する必要がある。ＲＩＦＡＳ法ではスキップ可能領域が重なる場合にはその交点を記憶し、類似位置である場合にはその位置を記憶する。これらの位置は他のスキップ可能領域内の位置と比較して、Ｙ軸について小さい位置にあって谷のように見える。そこでこれらの位置を谷と呼ぶ。類似位置が少ない場合にはスキップ可能領域が大きくなるから谷の数も少なくなる。逆に類似位置が多い場合にはスキップ可能領域が小さくなるから谷の数も多くなる。即ち、谷の数は、類似位置の数の増加に応じて増加し、類似位置の数の減少に応じて減少する。この性質を利用して、類似位置の数を直接計数するのではなく、谷の数を計数し、谷の数が更新しきい値α６を超えることを所定の条件として、過剰検出を判定しても良い。 <Modification 3>
In the first embodiment described above, in the RIFAS method, the region evaluated as the skippable region is managed by the column U of the valley position. Depending on whether or not the number of the valleys U exceeds the update threshold value α6, Excess detection may be determined. The principle of searching for a similar position by the RIFAS method will be briefly described below to explain what kind of valley is used in this method. In the RIFAS method, when similarity is obtained at a certain position in a two-dimensional XY space (referred to as a search space) in which the feature quantity sequence X is the X axis and the feature quantity sequence Y is the Y axis, Indicates that there is a region that cannot exceed the search threshold θ, and the search is speeded up by omitting verification in this region. This area is a skippable area. The skippable area is a square inclined by 45 degrees with the position where the similarity is obtained as the center, and the diagonal length of the square can be calculated from the similarity. In the RIFAS method, the partial signal search problem is formulated as a problem in which a two-dimensional search space is spread by skippable areas or similar positions. First, when the search window Wx is scanned and the search window Wx is scanned to evaluate whether all the positions are skippable areas or similar positions, the process of moving the search window Wy by one sample is repeated. At this time, it is necessary to store the shape of the boundary line between the skippable region or the region evaluated as a similar position and the unevaluated region. In the RIFAS method, when the skippable areas overlap, the intersection is stored, and when it is a similar position, the position is stored. These positions are smaller on the Y axis than the positions in other skippable areas and look like valleys. Therefore, these positions are called valleys. When the number of similar positions is small, the skippable area is large and the number of valleys is also small. Conversely, when there are many similar positions, the skippable area is small, and the number of valleys is large. That is, the number of valleys increases as the number of similar positions increases, and decreases as the number of similar positions decreases. Using this property, instead of directly counting the number of similar positions, the number of valleys is counted, and overdetection is determined on the condition that the number of valleys exceeds the update threshold value α6. Also good.

以上のような構成によれば、探索の過程で谷の数は分かっているので、類似位置の数を計数する仕組みを探索処理に新たに加えなくても良い。 According to the above configuration, since the number of valleys is known during the search process, a mechanism for counting the number of similar positions may not be newly added to the search process.

その他、類似位置検出部１２３が類似位置を検出する際に用いるパラメータであって類似位置の数の増加に応じて増加するパラメータ値が、更新しきい値α７を超えることを所定の分布条件として、過剰検出を判定しても良い。 In addition, as a predetermined distribution condition, a parameter value that is used when the similar position detection unit 123 detects a similar position and that increases as the number of similar positions increases exceeds the update threshold value α7. Excess detection may be determined.

なお、更新しきい値を超えるか否かを判定するための対象の値は、上述した類似位置の数、類似位置の数の変化率、ハフ変換によって求めた線分の長さ及びパラメータのうち少なくとも１つの最大値、最小値、平均値及び分散のうち少なくとも１つであれば良い。 The target value for determining whether or not the update threshold is exceeded is the number of similar positions, the rate of change of the number of similar positions, the length of a line segment obtained by Hough transform, and parameters. It may be at least one of at least one maximum value, minimum value, average value, and variance.

＜変形例４＞
上述した第１の実施の形態において、探索しきい値θの更新は、探索しきい値θに対して定数ｄを加えることにより行なったが、検出される類似位置の数が減少するように探索しきい値θを更新するのであればこれに限らない。例えば、１以上の定数ｅを探索しきい値θで割った値を新たな探索しきい値としても良い。更新後の探索しきい値θ’は以下の式７により表わすことができる。 <Modification 4>
In the first embodiment described above, the search threshold value θ is updated by adding a constant d to the search threshold value θ, but the search is performed so that the number of detected similar positions decreases. The present invention is not limited to this as long as the threshold value θ is updated. For example, a value obtained by dividing one or more constants e by the search threshold θ may be used as a new search threshold. The updated search threshold value θ ′ can be expressed by the following Equation 7.

探索しきい値とその最適値とが大きく離れている場合には、探索しきい値を更新するための更新量は大きい方が好ましく、探索しきい値とその最適値とがあまり離れていない場合には、更新量は小さい方が好ましい。本変形例の構成では、「θ≦１」なので更新量が徐々に減少する。よって，定数を加えて探索しきい値を更新するのと比較すると探索しきい値を最適値により近付けることができる。 When the search threshold is far away from its optimal value, it is preferable that the update amount for updating the search threshold is large, and when the search threshold is not far from the optimal value. Therefore, it is preferable that the update amount is small. In the configuration of this modification, “θ ≦ 1”, so the update amount gradually decreases. Therefore, the search threshold can be brought closer to the optimum value as compared with updating the search threshold by adding a constant.

１００類似区間検出装置
１０１ＣＰＵ
１０２操作部
１０３表示部
１０４ＲＯＭ
１０５ＲＡＭ
１０６通信部
１０７外部記憶部
１０８バス
１０８しきい値更新部
１２０第１映像入力受付部
１２１第２映像入力受付部
１２２特徴量抽出部
１２３類似位置検出部
１２４しきい値記憶部
１２５検出結果出力部
１２６類似位置分布算出部
１２７しきい値更新部 100 Similar Section Detection Device 101 CPU
102 Operation unit 103 Display unit 104 ROM
105 RAM
106 communication unit 107 external storage unit 108 bus 108 threshold update unit 120 first video input reception unit 121 second video input reception unit 122 feature amount extraction unit 123 similar position detection unit 124 threshold storage unit 125 detection result output unit 126 Similar position distribution calculation unit 127 Threshold update unit

Claims

First input receiving means for receiving an input of a first time-series signal;
Second input receiving means for receiving an input of the second time-series signal;
Extraction means for extracting a first feature quantity from the first time series signal and extracting a second feature quantity from the second time series signal;
Storage means for storing a first threshold;
Detecting means for collating the first feature quantity and the second feature quantity in time series and detecting a similar position whose similarity is equal to or higher than the first threshold value;
Calculating means for calculating a distribution of the number of similar positions;
Updating means for updating the first threshold value so that the number of detected similar positions decreases when the distribution of the number of similar positions satisfies a predetermined distribution condition;
A similar section detection apparatus comprising: a control section that acquires a similar section that is a set of similar positions by repeatedly operating the detection section, the calculation section, and the update section.

The predetermined distribution condition includes a two-dimensional space having the number of similar positions, the rate of change of the number of similar positions, the first feature quantity as a first axis, and the second feature quantity as a second axis. It is set based on at least one of the lengths of the line segments parallel to the first axis or the second axis obtained by straight line detection above.
The updating means includes
Determining means for determining whether or not the predetermined distribution condition is satisfied;
The similar section detection device according to claim 1, further comprising a threshold update unit configured to update the first threshold when the predetermined distribution condition is satisfied.

The predetermined distribution condition is that the number of similar positions exceeds a second threshold, the rate of change of the number of similar positions exceeds a third threshold, and the length of the line segment is the first. At least one of exceeding the threshold of 4;
The similar section detection device according to claim 2, wherein the determination unit determines whether or not the predetermined distribution condition is satisfied.

The predetermined distribution condition can be used to detect a similar position, and a parameter that increases with an increase in the number of the similar positions can be omitted, and verification of the first feature quantity and the second feature quantity can be omitted. It is set based on at least one of the number of valleys managing the area,
The updating means includes
Determining means for determining whether or not the predetermined distribution condition is satisfied;
The similar section detection device according to claim 1, further comprising a threshold update unit configured to update the first threshold when the predetermined distribution condition is satisfied.

The predetermined distribution condition is at least one of the parameter exceeding a fifth threshold and the number of valleys exceeding a sixth threshold;
The similar section detection apparatus according to claim 4, wherein the determination unit determines whether or not the predetermined distribution condition is satisfied.

The updating unit adds a constant to the first threshold value or newly sets a value inversely proportional to the first threshold value when the distribution of the number of similar positions satisfies a predetermined distribution condition. The similar section detection device according to claim 1, wherein the first threshold value is updated by setting the first threshold value.