JP2007157064A

JP2007157064A - Signal search unit and its method

Info

Publication number: JP2007157064A
Application number: JP2005355040A
Authority: JP
Inventors: Yoshihiro Omori; 善啓大盛
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2005-12-08
Filing date: 2005-12-08
Publication date: 2007-06-21
Anticipated expiration: 2025-12-08
Also published as: JP4469785B2

Abstract

<P>PROBLEM TO BE SOLVED: To widen a range not to be searched than conventionally, by defining all the range from a current position to a skip candidate position as the range not to be searched. <P>SOLUTION: A signal search unit comprises: a reference signal characteristic quantity calculation section 104 for calculating a characteristic quantity by inputting a reference signal 101 which is a search key; an input signal characteristic quantity calculation section 105 for calculating the characteristic quantity of an input signal 102 to be searched for; a similarity degree calculation section 106 for calculating the degree of similarity between the characteristic quantity of the reference signal 101 and that of the input signal 102; a correlation skippable width calculation section 107 for calculating the maximum value for skipping based on the degree of the similarity; a skip width calculation section 108 for determining the width for actual skipping based on the skippable width; and a backtracking determination section 109 for determining whether it is necessary to backtrack based on the degree of the similarity of a search window of a skip destination. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、長時間の信号系列の中から、予め登録した信号と類似した部分を探す信号探索装置及びその方法に関する。 The present invention relates to a signal search apparatus and method for searching for a portion similar to a pre-registered signal from a long-time signal sequence.

特許文献１に、ヒストグラムインタセクションによって算出した類似度から検索不要な範囲を算出して、その範囲の処理をスキップして音声信号探索を高速化する時系列アクティブ探索法を開示している。 Patent Document 1 discloses a time-series active search method in which a search unnecessary range is calculated from the similarity calculated by histogram intersection, and processing of the range is skipped to speed up speech signal search.

特許文献２、非特許文献１に、時系列アクティブ探索法を発展させて、現在の探索位置の類似度から算出された検索不要範囲と、暫定のスキップ位置の類似度から算出された後ろ向きの検索不要範囲とが重なる場合に、現在の探索位置から暫定のスキップ位置の全範囲を検索不要にして高速化するオ−バ−スキピングによる時系列アクティブ探索法を開示している。
特許第３０６５３１４号公報の数１と数２特許第３４７４１３１号公報の図３柏野邦夫，村瀬洋，「オーバースキッピングによる時系列アクティブ探索法の高速化」，日本音響学会講演論文集，ｐｐ．４４５−４４６，１９９９． The time-series active search method is developed in Patent Document 2 and Non-Patent Document 1, and the search unnecessary range calculated from the similarity of the current search position and the backward search calculated from the similarity of the temporary skip position A time-series active search method by overskiing is disclosed in which the entire range from the current search position to the tentative skip position is not required to be searched when the unnecessary range overlaps.
Number 1 and number 2 of Japanese Patent No. 30653314 FIG. 3 of Japanese Patent No. 3474131 Kunio Kanno and Hiroshi Murase, “Acceleration of time-series active search method by overskiping”, Proceedings of Acoustical Society of Japan, pp. 445-446, 1999.

特許文献２、非特許文献１による方法は、現在位置の類似度算出とスキップ候補位置の類似度算出を独立して行い、類似度の相関関係を考慮していなかった。そのため、両方の類似度を考慮すれば探索不要であるはずの範囲を探索してしまい十分に高速化できない問題があった。 The methods according to Patent Literature 2 and Non-Patent Literature 1 independently calculate the similarity of the current position and the similarity of the skip candidate position, and do not consider the correlation of the similarity. Therefore, if both similarities are taken into account, there is a problem that a range that should not be searched is searched and the speed cannot be sufficiently increased.

本発明は、上記問題点を解決するためになされたものであって、従来よりも信号の探索不要範囲を広げることができる信号探索装置及びその方法を提供する。 The present invention has been made to solve the above-described problems, and provides a signal search apparatus and method capable of extending the signal search unnecessary range as compared with the conventional technique.

本発明の信号探索装置は、入力信号列上をスキップしながら参照信号列と類似する信号列を探索する信号探索装置において、前記入力信号列上の現在の探索位置に探索窓を設定する現在位置設定手段と、前記入力信号列上の次の探索位置の候補であるスキップ候補位置に探索窓を設定するスキップ候補位置設定手段と、前記現在位置の探索窓に含まれる信号列と前記参照信号列との類似度と、前記スキップ候補位置の探索窓に含まれる信号列と前記参照信号列の類似度とをそれぞれ算出する類似度算出手段と、前記現在位置の探索窓の末尾の位置と前記スキップ候補位置の探索窓の先頭の位置までの長さに基づいて、前記現在位置から前記スキップ候補位置までの範囲と前記参照信号列との類似度の最大値を算出する最大値算出手段と、前記最大値が閾値を超えない場合には、前記スキップ候補位置に次の現在位置を設定するスキップ幅算出手段と、を具備することを特徴とする信号探索装置である。 The signal search device of the present invention is a signal search device for searching a signal sequence similar to a reference signal sequence while skipping the input signal sequence, and a current position for setting a search window at the current search position on the input signal sequence Setting means; skip candidate position setting means for setting a search window at a skip candidate position that is a candidate for a next search position on the input signal string; a signal string included in the search window at the current position; and the reference signal string Similarity calculating means for calculating the similarity between the signal sequence included in the search window of the skip candidate position and the similarity of the reference signal sequence, the end position of the search window at the current position, and the skip A maximum value calculating means for calculating a maximum value of the similarity between the range from the current position to the skip candidate position and the reference signal sequence based on the length of the candidate position to the top position of the search window; If a large value does not exceed the threshold value is a signal search apparatus characterized by comprising, a skip width calculation means for setting a next current location to the deletion candidate position.

本発明によれば、従来よりも信号の探索不要範囲を広げて高速な信号探索を実現できる。 According to the present invention, it is possible to realize a high-speed signal search by expanding the signal search unnecessary range as compared with the conventional case.

以下、本発明の実施形態の信号探索装置１００について説明する。 Hereinafter, the signal search apparatus 100 according to the embodiment of the present invention will be described.

（第１の実施形態）
第１の実施形態の信号探索装置１００について図１から図５に基づいて説明する。本実施形態では、テレビ番組を録画した２時間の映像から１５秒間のＣＭ映像を検索する例を説明する。なお、参照信号と入力信号は、１秒間に３０枚のフレーム画像が含まれた映像信号である。 (First embodiment)
A signal search device 100 according to a first embodiment will be described with reference to FIGS. In the present embodiment, an example in which a 15-second CM video is searched from a 2-hour video recording a television program will be described. The reference signal and the input signal are video signals including 30 frame images per second.

（１）信号探索装置１００の構成
図１は、本実施形態に係わる信号探索装置１００を示すブロック図である。 (1) Configuration of Signal Search Device 100 FIG. 1 is a block diagram showing a signal search device 100 according to the present embodiment.

信号探索装置１００は、検索キーである参照信号１０１を入力して特徴量を算出する参照信号特徴量算出部１０４と、検索対象である入力信号１０２の特徴量を算出する入力信号特徴量算出部１０５と、参照信号１０１の特徴量と入力信号１０２の特徴量との類似度を算出する類似度算出部１０６と、類似度に基づいてスキップ可能な最大値を算出する相関スキップ可能幅算出部１０７と、スキップ可能幅に基づいて実際にスキップする幅を決定するスキップ幅算出部１０８と、スキップした先の探索窓の類似度から後戻りする必要があるかどうかを判定するバックトラック判定部１０９と、から構成される。 The signal search apparatus 100 receives a reference signal 101 as a search key and calculates a feature value by a reference signal feature value calculation unit 104 and an input signal feature value calculation unit that calculates a feature value of the input signal 102 to be searched. 105, a similarity calculation unit 106 that calculates the similarity between the feature amount of the reference signal 101 and the feature amount of the input signal 102, and a correlation skippable width calculation unit 107 that calculates a skippable maximum value based on the similarity. A skip width calculation unit 108 that determines a width that is actually skipped based on the skippable width, a backtrack determination unit 109 that determines whether it is necessary to go back from the similarity of the skipped search window, Consists of

信号探索装置１００は、例えば、汎用のコンピュータ装置を基本ハードウェアとして用いることで実現することが可能である。 The signal search apparatus 100 can be realized by using, for example, a general-purpose computer apparatus as basic hardware.

例えば、参照信号特徴量算出部１０４、入力信号特徴量算出部１０５、類似度算出部１０６、相関スキップ可能幅算出部１０７、スキップ幅算出部１０８、及び、バックトラック判定部１０９は、上記のコンピュータ装置に搭載されたプロセッサにプログラムを実行させることにより実現することができる。 For example, the reference signal feature value calculation unit 104, the input signal feature value calculation unit 105, the similarity calculation unit 106, the correlation skippable width calculation unit 107, the skip width calculation unit 108, and the backtrack determination unit 109 are the above computers. This can be realized by causing a processor mounted on the apparatus to execute a program.

このとき、信号探索装置１００は、上記のプログラムをコンピュータ装置に予めインストールすることで実現してもよいし、ＣＤ−ＲＯＭなどの記憶媒体に記憶して、あるいはネットワークを介して上記のプログラムを配布して、このプログラムをコンピュータ装置に適宜インストールすることで実現してもよい。また、Ｂ及びＣは、上記のコンピュータ装置に内蔵あるいは外付けされたメモリ、ハードディスクもしくはＣＤ−Ｒ、ＣＤ−ＲＷ、ＤＶＤ−ＲＡＭ、ＤＶＤ−Ｒなどの記憶媒体などを適宜利用して実現することができる。 At this time, the signal search apparatus 100 may be realized by installing the above program in a computer device in advance, or may be stored in a storage medium such as a CD-ROM or distributed through the network. Then, this program may be realized by appropriately installing it in a computer device. Further, B and C are realized by appropriately using a memory, a hard disk or a storage medium such as a CD-R, a CD-RW, a DVD-RAM, a DVD-R, etc., incorporated in or externally attached to the computer device. Can do.

なお、特許文献２ではスキップ可能幅をそのままスキップ幅としたが、本実施形態では最もバックトラックが少ないスキップ幅をスキップ可能幅から算出するように、相関スキップ可能幅算出部１０７とスキップ幅算出部１０８を具備する。また、本実施形態は、特許文献２とスキップ可能幅の算出方法が異なる。 Although the skippable width is directly used as the skip width in Patent Document 2, the correlation skippable width calculation unit 107 and the skip width calculation unit 107 calculate the skip width with the smallest backtrack from the skippable width in this embodiment. 108 is provided. Also, this embodiment is different from Patent Document 2 in the method for calculating the skippable width.

（２）信号探索装置１００の動作
次に、図１及び図２に示すフローチャートを用いて、信号探索装置１００の動作について説明する。 (2) Operation of Signal Search Device 100 Next, the operation of the signal search device 100 will be described using the flowcharts shown in FIGS. 1 and 2.

（２−１）入力信号１０２における現在位置の類似度の算出
まず、参照信号１０１と入力信号１０２を取得する（ステップＳ２００）。 (2-1) Calculation of similarity of current position in input signal 102 First, the reference signal 101 and the input signal 102 are acquired (step S200).

参照信号１０１の特徴量を参照信号特徴量算出部１０４で算出する（ステップＳ２０１）。参照信号１０１はここでは１５秒間のＣＭ映像とする。特徴量は参照信号１０１の映像を構成する各フレーム画面について、画面を１６分割して輝度の平均を２５６ビン（すなわち、２５６区間）で算出する。特徴量として１６次元のベクトルが４５０個得られる。さらに、これらの特徴量から２４ビンのヒストグラムを算出する。 The reference signal feature value calculation unit 104 calculates the feature value of the reference signal 101 (step S201). Here, the reference signal 101 is a CM image of 15 seconds. For each frame screen constituting the video of the reference signal 101, the feature amount is calculated by dividing the screen into 16 and averaging the luminance in 256 bins (that is, 256 sections). 450 16-dimensional vectors are obtained as feature quantities. Further, a 24-bin histogram is calculated from these feature amounts.

入力信号１０２は参照信号１０１と同様に映像であり、テレビ番組を録画した２時間の映像とする。入力信号１０２中に現在位置を入力信号１０２の先頭に設定して初期化する（ステップＳ２０２）。本実施形態では、現在位置を入力信号１０２中の先頭のフレームから末尾のフレームまで少しずつずらしながら参照信号１０１との類似度を算出して、類似度が予め与えられた閾値以上の場合に、参照信号１０１と同一の信号であると検出する。 The input signal 102 is a video, similar to the reference signal 101, and is a 2-hour video recording a television program. The current position is set to the head of the input signal 102 in the input signal 102 and is initialized (step S202). In the present embodiment, the similarity with the reference signal 101 is calculated while gradually shifting the current position from the first frame to the last frame in the input signal 102, and when the similarity is equal to or greater than a predetermined threshold value, It is detected that the signal is the same as the reference signal 101.

現在位置（すなわち、現在のフレーム位置）が入力信号１０２の末尾かどうかを調べ（ステップＳ２０３）、末尾のフレームの場合には探索処理を終了する。そうでない場合には現在位置の特徴量を入力信号特徴量算出部１０５で算出し、参照信号１０１との類似度を類似度算出部１０６で算出する（ステップＳ２０４）。現在位置の特徴量は、参照信号１０１の特徴量の算出と同様に、現在位置から参照信号１０１と同じ長さ、すなわち１５秒の探索窓を設定し、この中に含まれるフレーム画像について特徴量を算出し、ヒストグラムを算出する。 It is checked whether or not the current position (that is, the current frame position) is the end of the input signal 102 (step S203), and in the case of the end frame, the search process is terminated. Otherwise, the feature quantity at the current position is calculated by the input signal feature quantity calculator 105, and the similarity with the reference signal 101 is calculated by the similarity calculator 106 (step S204). Similar to the calculation of the feature value of the reference signal 101, the feature value of the current position is set to the same length as the reference signal 101 from the current position, that is, a search window of 15 seconds, and the feature value of the frame image included therein And a histogram is calculated.

参照信号１０１のヒストグラムと現在位置の探索窓に含まれる信号との類似度は（以後、現在位置の類似度と呼ぶ）、特許文献１に示された式（下記で示す式（２）参照）と同様に、ヒストグラムの各ビンについて最小値を取り、それらを合計したものをヒストグラムの総度数で割って正規化したヒストグラムインタセクションと呼ばれる距離尺度で算出する。ヒストグラムインタセクションは、信号が類似していれば類似しているほど大きくなる性質を持つ。 The similarity between the histogram of the reference signal 101 and the signal included in the current position search window (hereinafter referred to as the current position similarity) is the expression shown in Patent Document 1 (see Expression (2) below). Similarly, the minimum value is taken for each bin of the histogram, and the sum is calculated by a distance measure called a histogram intersection obtained by dividing the sum by the total frequency of the histogram. The histogram intersection has the property that the more similar the signals, the larger the similar.

現在位置の類似度が予め与えられた閾値以上かどうかを調べ（ステップＳ２０５）、閾値以上の場合には、現在位置の探索窓に含まれる信号と参照信号１０１とは一致しているとみなして、例えば現在位置の数値を画面上に表示して探索結果を出力する（ステップＳ２０６）。そうでない場合には、ステップＳ２０７に進んで現在位置を前方にずらす。 It is checked whether or not the similarity of the current position is greater than or equal to a predetermined threshold value (step S205). If the similarity is greater than or equal to the threshold value, the signal included in the current position search window and the reference signal 101 are regarded as matching. For example, the numerical value of the current position is displayed on the screen, and the search result is output (step S206). If not, the process proceeds to step S207 to shift the current position forward.

（２−２）スキップ可能幅の算出
現在位置は１つずつ進めるのではなく（すなわち、１フレームずつ進めるのでなく）、現在位置の他にスキップ候補位置（すなわち、スキップする候補のフレーム位置）を設定し、現在位置の類似度とスキップ候補位置の類似度とから探索不要な範囲を算出して、この範囲の信号照合を省略してスキップすることによって高速化する。本実施形態における探索不要範囲（すなわち、スキップ可能幅）の算出方法は従来方法と異なり、本実施形態の最も特徴とする点である。 (2-2) Calculation of skippable width The current position is not advanced one by one (that is, not advanced one frame at a time), but the skip candidate position (that is, the frame position of the candidate to be skipped) is added in addition to the current position. The speed is increased by calculating a range that does not require a search from the similarity of the current position and the similarity of the skip candidate position, and skipping the signal collation in this range. Unlike the conventional method, the calculation method of the search unnecessary range (that is, skippable width) in the present embodiment is the most characteristic point of the present embodiment.

図３は、スキップ幅を算出する原理を説明するための図であり、図３−３００は特許文献１の説明図、図３−３１０は特許文献２の説明図、図３−３２０は本実施形態の説明図である。 FIG. 3 is a diagram for explaining the principle of calculating the skip width. FIG. 3-300 is an explanatory diagram of Patent Document 1, FIG. 3-310 is an explanatory diagram of Patent Document 2, and FIG. It is explanatory drawing of a form.

（２−３）特許文献１におけるスキップ可能幅の算出
図３−３００は特許文献１のスキップ原理を示す。 (2-3) Calculation of Skippable Width in Patent Document 1 FIG. 3-300 shows the skip principle of Patent Document 1.

特許文献１では、探索窓３０１の類似度がＳｏとすると、探索窓を前方にずらした部分が仮に全て一致していたとしても、類似度Ｓｏとずれによる類似度の増加分との合計が、閾値θを越えることがない。さらに説明すると、閾値θと等しい類似度を持つということは、探索窓３０２と入力信号とが完全に一致した部分、すなわち、１５秒のＣＭ映像を発見できた位置であり、探索を行っている過程でこの閾値θを超えることがない。そのため、この考え方を利用して、この間の照合を省略する。このスキップ幅ｗ１は式（１）で与えられる。 In Patent Document 1, if the similarity of the search window 301 is So, even if all the parts of the search window shifted forward match, the sum of the similarity So and the increase in similarity due to the shift is The threshold value θ is not exceeded. To explain further, having a similarity equal to the threshold value θ is a portion where the search window 302 and the input signal completely coincide, that is, a position where a 15-second CM image can be found, and the search is performed. This threshold value θ is not exceeded in the process. For this reason, this concept is used to omit collation during this period. This skip width w1 is given by equation (1).

ｗ１＝ｆｌｏｏｒ（Ｌ（θ−Ｓｏ））＋１Ｓｏ＜θ
ｗ１＝１それ以外
・・・（１）

但し、Ｌは探索窓の幅（探索窓のフレーム数）、関数ｆｌｏｏｒは小数以下の切り捨てを表す関数、Ｓｏは現在位置の類似度（現在のフレーム位置における類似度）、θは閾値の類似度を示す。なお、θは、完全一致した場合は１であるが、外乱やデジタル化の過程で完全一致ではなくなるので０＜θ＜１である。例えば、θ＝０．８である。
w1 = floor (L (θ−So)) + 1 So <θ
w1 = 1 otherwise
... (1)

Where L is the width of the search window (the number of frames in the search window), the function floor is a function that represents truncation of decimals, So is the similarity of the current position (similarity at the current frame position), and θ is the similarity of the threshold Indicates. Note that θ is 1 when it completely matches, but 0 <θ <1 because it does not completely match in the course of disturbance or digitization. For example, θ = 0.8.

（２−４）特許文献２におけるスキップ可能幅
（２−４−１）図３−３１０に基づく説明
図３−３１０は特許文献２のスキップ原理を示す。 (2-4) Skippable Width in Patent Document 2 (2-4-1) Description Based on FIG. 3-310 FIG. 3-310 shows the skip principle of Patent Document 2.

特許文献２では、特許文献１と同様に現在位置の探索窓３１１のスキップ可能幅を探索窓３１２のように算出する。 In Patent Document 2, as in Patent Document 1, the skippable width of the search window 311 at the current position is calculated as in the search window 312.

次に、スキップ候補位置の探索窓３１４について後方のスキップ幅を同様に探索窓３１４のように算出し、これらのスキップ幅が重なる場合には現在位置からスキップ候補位置までの照合を省略する。スキップ幅が重ならなかった場合には、スキップ候補位置を後方に戻すバックトラックを行い、再度判定する。 Next, the backward skip width of the search window 314 for the skip candidate position is similarly calculated as in the search window 314, and when these skip widths overlap, collation from the current position to the skip candidate position is omitted. If the skip widths do not overlap, backtracking is performed to return the skip candidate position backward, and the determination is again made.

スキップ幅が重なるかどうかの判定は式（２）で与えられる。 The determination of whether or not the skip widths overlap is given by equation (2).

Ｔｐ−Ｔｏ＜ｆｌｏｏｒ（Ｌ（２θ−Ｓｏ−Ｓｐ））・・・（２）

但し、Ｔｏは現在位置、Ｔｐはスキップ候補位置、Ｓｐはスキップ候補位置の類似度を示す。よってスキップ幅の最大値ｗ２は式（３）となる。
Tp−To <floor (L (2θ−So−Sp)) (2)

Where To is the current position, Tp is the skip candidate position, and Sp is the similarity of the skip candidate position. Therefore, the maximum value w2 of the skip width is expressed by Expression (3).

ｗ２＝ｆｌｏｏｒ（Ｌ（２θ−Ｓｏ−Ｓｐ））・・・（３）

（２−４−２）図３−３３０に基づく説明
次に、特許文献２について、図３−３３０を用いて説明する。この図は、図３−３１０に示した図と同じ原理を示している。
w2 = floor (L (2θ−So−Sp)) (3)

(2-4-2) Description Based on FIG. 3-330 Next, Patent Document 2 will be described with reference to FIG. This figure shows the same principle as that shown in FIG.

特許文献２では、まず現在位置の時刻Ｔｏ、類似度Ｓｏの点３３１と、スキップ候補位置の時刻Ｔｐの類似度Ｓｐの点３３２を得る。探索窓の幅をＬとすると類似度の変化率は１／Ｌを超えない。そのため、点３３１からの類似度の最大値は直線３４１のように表せ、類似度がθに等しくなる点３３３が式（１）に示したスキップ位置である。この点３３３から最も急激に類似度が低下すると直線３４２のように表せる。この直線３４２がスキップ候補位置の時刻Ｔｐでの点３３４の類似度が点３３２の類似度を超える場合には、ＴｏからＴｐまでの間には類似度がθを超えていかなったことが分かる。 In Patent Document 2, first, a point 331 having a current position time To and a similarity So, and a similarity Sp point 332 having a time Tp at a skip candidate position are obtained. When the width of the search window is L, the similarity change rate does not exceed 1 / L. Therefore, the maximum value of the similarity from the point 331 can be expressed as a straight line 341, and the point 333 where the similarity is equal to θ is the skip position shown in the equation (1). When the similarity decreases most rapidly from this point 333, it can be expressed as a straight line 342. When the similarity of the point 334 at the time Tp at which the straight line 342 is a skip candidate position exceeds the similarity of the point 332, it can be seen that the similarity does not exceed θ between To and Tp.

これを別の方法で説明すると、類似度の変化率は探索窓を後方に移動させても１／Ｌを超えないから、スキップ候補位置の類似度を表す点３３２からの後方の類似度の最大値は直線３４３のように表せ、類似度がθに等しくなる点３３５までは後方にスキップ可能である。点３３５が点３３３よりも後方にあれば、現在位置の探索窓による前方のスキップ幅と、スキップ候補位置の探索窓による後方のスキップ幅が重なり、現在位置からスキップ候補位置までの全区間で照合を省略できる。この説明は図３−３１０に示した原理と同一である。 To explain this in another way, since the rate of change in similarity does not exceed 1 / L even when the search window is moved backward, the maximum of the similarity in the rear from the point 332 representing the similarity of the skip candidate position The value can be expressed as a straight line 343 and can be skipped backward to a point 335 where the similarity is equal to θ. If the point 335 is behind the point 333, the forward skip width of the current position search window overlaps the backward skip width of the skip candidate position search window, and collation is performed in all sections from the current position to the skip candidate position. Can be omitted. This description is the same as the principle shown in FIG.

（２−５）本実施形態の技術
（２−５−１）図３−３２０に基づく説明
図３−３２０に本実施形態の原理を示す。現在位置の類似度とスキップ候補位置の類似度との相関を考慮することにより、特許文献２より広いスキップ可能幅を算出できる点に特徴がある。 (2-5) Technology of this Embodiment (2-5-1) Description Based on FIG. 3-320 FIG. 3-320 shows the principle of this embodiment. It is characterized in that a wider skippable width than that of Patent Document 2 can be calculated by considering the correlation between the similarity of the current position and the similarity of the skip candidate positions.

図３−３１０において、バックトラックを行い、スキップ候補位置の探索窓３１２のスキップ可能幅を算出する際に、探索窓３１１の類似度はＳｏを超えない。そのため、探索窓３１４は探索窓３１３までずらせないという制約がある。しかし、特許文献２ではスキップ幅を前方と後方とで独立して算出したため、上記の制約が全く考慮されていない。 In FIG. 3-310, when backtracking is performed and the skippable width of the search window 312 at the skip candidate position is calculated, the similarity of the search window 311 does not exceed So. Therefore, there is a restriction that the search window 314 cannot be shifted to the search window 313. However, in Patent Document 2, since the skip width is calculated independently for the front and the rear, the above restriction is not taken into consideration at all.

そこで、本実施形態では、現在位置の類似度とスキップ候補位置の類似度を同時に考慮する。現在位置の探索窓３２１と、スキップ候補位置の探索窓３２２との長さをｄと定義する。すなわち、現在位置の探索窓の末尾のフレーム位置とスキップ候補位置の探索窓の先頭のフレーム位置までのフレーム数をｄと定義する。 Therefore, in the present embodiment, the similarity at the current position and the similarity at the skip candidate position are considered simultaneously. The lengths of the current position search window 321 and the skip candidate position search window 322 are defined as d. That is, the number of frames from the last frame position of the current position search window to the first frame position of the skip candidate position search window is defined as d.

探索窓３２１と探索窓３２２の間の長さｄが全て類似度上昇に寄与したとしても、探索窓３２３の類似度は、現在位置の探索窓３２１の類似度Ｓｏ、スキップ候補位置の探索窓３２２の類似度Ｓｑ、及び、ｄ／Ｌの類似度の和を超えない。よって、この合計の類似度が閾値θ（０＜＝θ＜＝１）を超えなければ現在位置とスキップ候補位置の間の照合を省略する。 Even if the length d between the search window 321 and the search window 322 all contributes to the increase in similarity, the similarity of the search window 323 is similar to the similarity So of the search window 321 at the current position and the search window 322 of skip candidate positions. The similarity Sq and the sum of d / L similarities are not exceeded. Therefore, if the total similarity does not exceed the threshold θ (0 <= θ <= 1), the collation between the current position and the skip candidate position is omitted.

この判定は式（４）で与えられる。 This determination is given by equation (4).

Ｔｑ−Ｔｏ＜ｆｌｏｏｒ（Ｌ（１＋θ−Ｓｏ−Ｓｑ））・・・（４）

ここで、式（４）について導出過程について説明する。
Tq−To <floor (L (1 + θ−So−Sq)) (4)

Here, the derivation process for Equation (4) will be described.

まず、現在位置の探索窓３２１の末尾の位置Ｔｏとスキップ候補位置の探索窓３２２の先頭の位置Ｔｑまでの長さｄは、ｄ＝Ｔｑ−Ｔｏとなる。 First, the length d from the end position To of the current position search window 321 to the start position Tq of the skip candidate position search window 322 is d = Tq−To.

次に、現在位置からスキップ候補位置までの範囲と参照信号との類似度の最大値Ｓを算出すると下記のようになる。 Next, the maximum value S of the similarity between the range from the current position to the skip candidate position and the reference signal is calculated as follows.

類似度の最大値Ｓ＝ｄ／Ｌ＋Ｓｏ＋Ｓｑ
＝（Ｔｑ−Ｔｏ）／Ｌ−１＋Ｓｏ＋Ｓｑ

そして、最大値Ｓ＜閾値θのときにスキップ候補位置に次の現在位置を設定できる。
Maximum similarity S = d / L + So + Sq
= (Tq-To) / L-1 + So + Sq

When the maximum value S <threshold value θ, the next current position can be set as the skip candidate position.

すなわち、

Ｓ＝（Ｔｑ−Ｔｏ）／Ｌ−１＋Ｓｏ＋Ｓｑ＜θ

である。これを変形すると、

ｄ＝Ｔｑ−Ｔｏ＜Ｌ（１＋θ−Ｓｏ−Ｓｑ）

となる。長さｄはサンプリングして整数のため、右辺も整数化する必要があるため、関数ｆｌｏｏｒを用いて式（４）のようになる。 That is,

S = (Tq−To) / L−1 + So + Sq <θ

It is. If this is transformed,

d = Tq−To <L (1 + θ−So−Sq)

It becomes. Since the length d is an integer obtained by sampling, it is necessary to convert the right side into an integer, so that the function floor is used to obtain the equation (4).

したがって、本実施形態におけるスキップ可能幅の最大値ｗ３は、

ｗ３＝ｆｌｏｏｒ（Ｌ（１＋θ−Ｓｏ−Ｓｑ））・・・（５）

となる。 Therefore, the maximum value w3 of the skippable width in this embodiment is

w3 = floor (L (1 + θ−So−Sq)) (5)

It becomes.

式（１）、式（３）、式（５）の大小関係は式（６）で与えられる。但し、等号はθ＝１の場合にのみ成り立つ。よって、本実施形態のスキップ可能幅ｗ３が最も大きい。 The magnitude relationship between the expressions (1), (3), and (5) is given by the expression (6). However, the equal sign holds only when θ = 1. Therefore, the skippable width w3 of this embodiment is the largest.

ｗ１＜ｗ２＜ｗ３・・・（６）

また、本実施形態と特許文献２は、スキップ位置、すなわちスキップ候補位置の設定方法も異なる。特許文献２ではスキップ候補位置を式（３）によって求めた最大スキップ幅に設定したが、本実施形態では最大スキップ幅より少し手前にスキップ候補位置を設定して、バックトラックの危険性を低下させる。例えば、スキップ候補位置の類似度が正規分布に従うとして、式（５）の判定に成功する確率の分布関数とスキップ幅との積が最大になる位置をスキップ幅に定める。あるいは、スキップ候補位置の類似度の算出時に類似度を控えめに算出して同様の効果を得る。
w1 <w2 <w3 (6)

Also, the present embodiment and Patent Document 2 are different in the skip position, that is, the skip candidate position setting method. In Patent Document 2, the skip candidate position is set to the maximum skip width obtained by Expression (3), but in this embodiment, the skip candidate position is set slightly before the maximum skip width to reduce the risk of backtracking. . For example, assuming that the similarity of skip candidate positions follows a normal distribution, a position where the product of the distribution function of the probability of success in the determination of Expression (5) and the skip width is maximized is determined as the skip width. Alternatively, a similar effect is obtained by conservatively calculating the similarity when calculating the similarity of the skip candidate position.

（２−５−２）図３−３１０に基づく説明
次に、図３−３３０に示す図を用いて本実施形態を説明する。 (2-5-2) Description Based on FIG. 3-310 Next, the present embodiment will be described with reference to FIG.

点３３１から点３３３までは特許文献２と同様であるが、式（４）が等号で最大スキップ幅の場合には、既に説明したように探索窓３２１と探索窓３２２の相関関係によって、類似度はθのまま上昇しない。その後、探索窓３２３の位置が点３３６まで来ると、最大類似度は直線３４４のように変化率１／Ｌで低下する。点３３７で表されたスキップ候補位置Ｔｑの類似度Ｓｑが、Ｔｑにおける直線３４４の最大類似度を超えない場合には、ＴｏからＴｑまでの全範囲で類似度がθを超えないと保障できる。なお、ＴｏからＴｑまでの全範囲で類似度がθを超えないとは、その範囲内で一致した映像がないことを意味するのでその範囲はスキップをしてもよいことになる。スキップ候補位置の類似度は、特許文献２と同様に、直線３４４と類似度推定直線または曲線との交点として算出してもよい。 Points 331 to 333 are the same as in Patent Document 2, but when Equation (4) is the equal sign and has the maximum skip width, similar to the above, depending on the correlation between the search window 321 and the search window 322 as described above. The degree does not increase with θ. Thereafter, when the position of the search window 323 reaches the point 336, the maximum similarity decreases at a rate of change 1 / L as a straight line 344. When the similarity Sq of the skip candidate position Tq represented by the point 337 does not exceed the maximum similarity of the straight line 344 at Tq, it can be ensured that the similarity does not exceed θ in the entire range from To to Tq. Note that if the similarity does not exceed θ in the entire range from To to Tq, it means that there is no matching video in the range, so that range may be skipped. Similar to Patent Literature 2, the similarity of the skip candidate position may be calculated as an intersection of the straight line 344 and the similarity estimation straight line or curve.

（２−６）スキップ候補位置の特徴量の算出
以上に説明したように、本実施形態では、スキップ候補位置の類似度を例えば特許文献２と同様にこれまでの探索窓の履歴から線形予測するなどして類似度を推定し（ステップＳ２０７）、相関スキップ可能幅算出部１０７で最大スキップ幅を式（５）により算出し（ステップＳ２０８）、バックトラックの危険性を低下させるように、最大スキップ幅からスキップ幅をスキップ幅１０８で算出してスキップ候補位置を設定する（ステップＳ２０９）。 (2-6) Calculation of Feature Value of Skip Candidate Position As described above, in this embodiment, the similarity of skip candidate positions is linearly predicted from the history of search windows so far, for example, as in Patent Document 2. Thus, the similarity is estimated (step S207), the correlation skippable width calculation unit 107 calculates the maximum skip width by the equation (5) (step S208), and the maximum skip is performed so as to reduce the risk of backtracking. The skip candidate position is set by calculating the skip width from the width by the skip width 108 (step S209).

入力信号特徴量算出部１０５でスキップ候補位置の特徴量を算出し、類似度算出部１０６で参照信号との類似度を算出する（ステップＳ２１０）。 The input signal feature value calculation unit 105 calculates the feature value of the skip candidate position, and the similarity calculation unit 106 calculates the similarity to the reference signal (step S210).

バックトラック判定部１０９で式（４）を用いてバックトラックが必要かどうかを判定し（ステップＳ２１１）、必要ならばスキップ候補位置を後戻りして再設定してステップＳ２１０に戻り再度判定する。そうでない場合は、現在位置とスキップ候補位置との間で類似度が閾値を超えないことを保証できるから、スキップ候補位置を現在位置に置き換えて設定して現在位置を進め（ステップＳ２１２）、ステップＳ２０５に戻って新たな現在位置について同様の照合を繰り返す。 The backtrack determination unit 109 determines whether the backtrack is necessary using the equation (4) (step S211). If necessary, the skip candidate position is set back and reset, and the process returns to step S210 to determine again. Otherwise, since it can be ensured that the similarity does not exceed the threshold between the current position and the skip candidate position, the skip position is replaced with the current position and the current position is advanced (step S212). Returning to S205, the same collation is repeated for the new current position.

以上のように現在位置を進めていき、現在位置が入力信号１０２の末尾まで照合が進んだら処理を終了する。 As described above, the current position is advanced, and when the current position reaches the end of the input signal 102, the process ends.

（３）効果
図４は本実施形態と背景技術の実行の差の一例を示した図である。 (3) Effect FIG. 4 is a diagram showing an example of a difference in execution between the present embodiment and the background art.

グラフ４００、グラフ４０１、グラフ４０２は、それぞれ特許文献１、特許文献２、本実施形態による実行の様子の一例を示す。 A graph 400, a graph 401, and a graph 402 show an example of the state of execution according to Patent Document 1, Patent Document 2, and this embodiment, respectively.

横軸は入力信号１０２内の時間を、縦軸は類似度を表し、棒グラフは照合した箇所にのみ表示している。本実施形態によるものが最もスキップ幅が大きいことが分かる。 The horizontal axis represents the time in the input signal 102, the vertical axis represents the similarity, and the bar graph is displayed only at the collated portion. It can be seen that the one according to this embodiment has the largest skip width.

図５は本実施形態と背景技術との照合回数の差の一例を示した図である。 FIG. 5 is a diagram showing an example of the difference in the number of comparisons between the present embodiment and the background art.

４０枚のフレーム画像から成る参照画像１０１を６０００枚のフレーム画像から成る入力映像１０２から探索した結果である。 This is a result of searching a reference image 101 made up of 40 frame images from an input video 102 made up of 6000 frame images.

棒グラフは照合回数を表し、折れ線グラフはバックトラック回数を表す。棒グラフ５００と点５０４は全探索、棒グラフ５０１と点５０５は特許文献１、棒グラフ５０２と５０６は特許文献２、棒グラフ５０３と点５０７は本実施形態による照合回数とバックトラック回数を表す。このように本実施形態は大幅に照合回数を削減できる。バックトラックの回数も激増はしておらず、全体として処理時間を低減できることが分かる。 The bar graph represents the number of collations, and the line graph represents the number of backtracks. A bar graph 500 and a point 504 indicate a full search, a bar graph 501 and a point 505 indicate Patent Document 1, a bar graph 502 and 506 indicate a Patent Document 2, and a bar graph 503 and a point 507 indicate the number of comparisons and the number of backtracks according to this embodiment. As described above, this embodiment can greatly reduce the number of collations. It can be seen that the number of backtracks has not increased dramatically, and the overall processing time can be reduced.

このように、本実施形態に係わる信号探索装置１００によれば、現在位置とスキップ候補位置の相互関係を考慮してスキップ幅を算出することにより、より広くスキップできる。 Thus, according to the signal search device 100 according to the present embodiment, it is possible to skip more widely by calculating the skip width in consideration of the mutual relationship between the current position and the skip candidate position.

（４）変更例
なお、本発明は上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態にわたる構成要素を適宜組み合わせてもよい。 (4) Modifications Note that the present invention is not limited to the above-described embodiments as they are, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. In addition, various inventions can be formed by appropriately combining a plurality of components disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, constituent elements over different embodiments may be appropriately combined.

例えば、本実施形態では映像の探索に用いたが、あらゆる信号探索、例えば、音声信号探索、ＤＮＡ信号の探索などに適用できる。 For example, although the present embodiment is used for video search, it can be applied to any signal search, for example, audio signal search, DNA signal search, and the like.

本実施形態に係わる信号探索装置１００の構成を示すブロック図である。It is a block diagram which shows the structure of the signal search apparatus 100 concerning this embodiment. 本実施形態の動作を示すフローチャートである。It is a flowchart which shows operation | movement of this embodiment. 本実施形態においてスキップ幅を算出する原理を説明するための図である。It is a figure for demonstrating the principle which calculates a skip width | variety in this embodiment. 本実施形態と従来手法との実行の差の一例を示した図である。It is the figure which showed an example of the difference of execution with this embodiment and the conventional method. 本実施形態と従来手法との照合回数の差の一例を示した図である。It is the figure which showed an example of the difference in the frequency | count of collation with this embodiment and the conventional method.

Explanation of symbols

１０１参照信号
１０２入力信号
１０４参照信号特徴量算出部
１０５入力信号特徴量算出部
１０６類似度算出部
１０７相関スキップ可能幅算出部
１０８スキップ幅算出部
１０９バックトラック判定部 101 Reference signal 102 Input signal 104 Reference signal feature value calculation unit 105 Input signal feature value calculation unit 106 Similarity calculation unit 107 Correlation skippable width calculation unit 108 Skip width calculation unit 109 Backtrack determination unit

Claims

In a signal search device that searches for a signal sequence similar to a reference signal sequence while skipping over the input signal sequence,
Current position setting means for setting a search window at the current search position on the input signal sequence;
Skip candidate position setting means for setting a search window at a skip candidate position that is a candidate for the next search position on the input signal sequence;
Similarity calculation for calculating the similarity between the signal sequence included in the current position search window and the reference signal sequence, and the similarity between the signal sequence included in the skip candidate position search window and the reference signal sequence, respectively. Means,
Based on the length from the end position of the search window of the current position to the start position of the search window of the skip candidate position, the similarity between the range from the current position to the skip candidate position and the reference signal sequence A maximum value calculating means for calculating the maximum value of
If the maximum value does not exceed a threshold value, a skip width calculating means for setting a next current position as the skip candidate position;
A signal search apparatus comprising:

The maximum value calculating means includes
(1) The product of the similarity of the current position and the length of the search window,
(2) The product of the similarity of the skip candidate position and the length of the search window,
(3) a length from the end of the search window at the current position to the start of the search window at the skip candidate position;
The signal search apparatus according to claim 1, wherein a sum of these (1), (2), and (3) is set as the maximum value.

The maximum value calculating means calculates the maximum value S,

S = d / L + So + Sq = (Tq−To) / L−1 + So + Sq

Sought by
The skip width calculation means sets a next current position to the skip candidate position when S <θ,
Where Tq is the skip candidate position on the input signal sequence, To is the current position on the input signal sequence, and d is the end position of the search window for the current position and the start position of the search window for the skip candidate position. , Θ is the threshold (0 <= θ <= 1), So is the similarity of the current position (0 <= So <= 1), and Sq is the similarity of the skip candidate position (0 <= Sq <= 1) The signal search device according to claim 1, wherein:

The signal search apparatus according to claim 1, wherein the skip width calculation unit reverts the skip candidate position to the current position side when the maximum value exceeds a threshold value.

The signal search apparatus according to claim 1, wherein the search is performed for the entire range of the input signal sequence.

In a signal search method for searching for a signal sequence similar to a reference signal sequence while skipping over the input signal sequence,
Set a search window at the current search position on the input signal sequence;
Set a search window at a skip candidate position that is a candidate for the next search position on the input signal sequence,
Calculating the similarity between the signal sequence included in the search window at the current position and the reference signal sequence, and the similarity between the signal sequence included in the search window at the skip candidate position and the reference signal sequence,
Based on the length from the end position of the search window for the current position to the start position of the search window for the skip candidate position, the similarity between the range from the current position to the skip candidate position and the reference signal sequence The maximum value of
If the maximum value does not exceed a threshold value, a next current position is set as the skip candidate position.

When calculating the maximum value,
(1) The product of the similarity of the current position and the length of the search window,
(2) The product of the similarity of the skip candidate position and the length of the search window,
(3) a length from the end of the search window at the current position to the start of the search window at the skip candidate position;
The signal search method according to claim 6, wherein a sum of these (1), (2), and (3) is set as the maximum value.

When calculating the maximum value, the maximum value S is

S = d / L + So + Sq = (Tq−To) / L−1 + So + Sq

Sought by
The next current position is set to the skip candidate position when S <θ.
Where Tq is the skip candidate position on the input signal sequence, To is the current position on the input signal sequence, and d is the end position of the search window for the current position and the start position of the search window for the skip candidate position. , Θ is the threshold (0 <= θ <= 1), So is the similarity of the current position (0 <= So <= 1), and Sq is the similarity of the skip candidate position (0 <= Sq The signal search method according to claim 6, wherein <= 1).

The signal search method according to claim 6, wherein, when the maximum value exceeds a threshold value, the skip candidate position is returned to the current position side.

The signal search method according to claim 6, wherein the search is performed for the entire range of the input signal sequence.

In a signal search program for searching a signal sequence similar to a reference signal sequence by a computer while skipping over the input signal sequence,
A current position setting function for setting a search window at the current search position on the input signal sequence;
A skip candidate position setting function for setting a search window at a skip candidate position that is a candidate for the next search position on the input signal sequence;
Similarity calculation for calculating the similarity between the signal sequence included in the current position search window and the reference signal sequence, and the similarity between the signal sequence included in the skip candidate position search window and the reference signal sequence, respectively. Function and
Based on the length from the end position of the search window for the current position to the start position of the search window for the skip candidate position, the similarity between the range from the current position to the skip candidate position and the reference signal sequence Maximum value calculation function to calculate the maximum value of
When the maximum value does not exceed a threshold value, a skip width calculation function for setting a next current position as the skip candidate position;
The signal search program characterized by comprising.

The maximum value calculation function is:
(1) The product of the similarity of the current position and the length of the search window,
(2) The product of the similarity of the skip candidate position and the length of the search window,
(3) a length from the end of the search window at the current position to the start of the search window at the skip candidate position;
The signal search program according to claim 1, wherein a sum of these (1), (2), and (3) is set as the maximum value.

The maximum similarity calculation function calculates the maximum value S as

S = d / L + So + Sq = (Tq−To) / L−1 + So + Sq

Sought by
The skip width calculation function sets a next current position to the skip candidate position when S <θ,
Where Tq is the skip candidate position on the input signal sequence, To is the current position on the input signal sequence, and d is the end position of the search window for the current position and the start position of the search window for the skip candidate position. , Θ is the threshold (0 <= θ <= 1), So is the similarity of the current position (0 <= So <= 1), and Sq is the similarity of the skip candidate position (0 <= Sq The signal search program according to claim 11, wherein <= 1).

The signal search program according to claim 1, wherein the skip width calculation function causes the skip candidate position to be returned to the current position side when the maximum value exceeds a threshold value.

The signal search program according to claim 11, wherein the search is performed for the entire range of the input signal sequence.