JP2008145504A

JP2008145504A - Signal feature extracting method, signal feature extracting device, signal feature extracting program, and recording medium thereof

Info

Publication number: JP2008145504A
Application number: JP2006329443A
Authority: JP
Inventors: Takayuki Kurozumi; 隆行黒住; Shogo Kimura; 昭悟木村; Hidenao Nagano; 秀尚永野; Kunio Kayano; 邦夫柏野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2006-12-06
Filing date: 2006-12-06
Publication date: 2008-06-26
Anticipated expiration: 2026-12-06
Also published as: JP4762870B2

Abstract

<P>PROBLEM TO BE SOLVED: To perform signal feature extraction which is robuster than before by selecting a feature based upon a standard which is robuster than before and providing an area which is not a target of feature calculation. <P>SOLUTION: In a signal feature extraction device 1, a mask storage unit 20 stores a plurality of masks previously given identifiable mask numbers. An input unit 10 inputs a signal. A basic feature extracting unit 11 extracts basic feature information comprising a vector from the input signal. An in-mask maximum/minimum calculation unit 12 reads a mask out of the mask storage unit 20 and calculates a maximum value and a minimum value from a partial area, where the maximum value is calculated, and a partial area, where the minimum value is calculated, determined by the read-mask according to the extracted basic feature information. A difference calculation unit 13 calculates the difference between the calculated maximum value and minimum value. A mask selection vector quantizing unit 14 determines the mask number of a mask having the largest calculated differencde as a quantized value of an extraction target area of the basic feature information. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、例えば、音響信号検出や映像信号検出において、信号を特定するために利用される特徴を信号の中から抽出する信号特徴抽出方法、信号特徴抽出装置、信号特徴抽出プログラム、及びその記録媒体に関する。 The present invention provides, for example, a signal feature extraction method, a signal feature extraction apparatus, a signal feature extraction program, and a recording thereof for extracting a feature used for specifying a signal from a signal in audio signal detection and video signal detection. It relates to the medium.

従来、信号特徴抽出方法に関しては、ＬＢＧアルゴリズムに基づくベクトル量子化のように、ベクトルのクラスタリングに基づく方法が知られている。また、特許文献１に記載の方法のように、信号の特徴的な箇所のみを抽出する方法なども知られている。
国際公開第２００６／００６５２８号パンフレット Conventionally, as a signal feature extraction method, a method based on vector clustering, such as vector quantization based on an LBG algorithm, is known. Further, a method of extracting only a characteristic part of a signal, such as the method described in Patent Document 1, is also known.
International Publication No. 2006/006528 Pamphlet

しかしながら、ベクトルのクラスタリングに基づく方法では、特徴の変動に影響されやすいという問題があった。また、特許文献１に記載の方法では、表現できる符号が少ないという問題があった。また、ベクトルのクラスタリングに基づく方法や特許文献１に記載の方法では、サンプリングのタイミングによって特徴ベクトルの値が変動するために安定した特徴を得るのが難しいという問題があった。 However, the method based on vector clustering has a problem that it is easily influenced by feature variations. Further, the method described in Patent Document 1 has a problem that there are few codes that can be expressed. In addition, the method based on vector clustering and the method described in Patent Document 1 have a problem that it is difficult to obtain a stable feature because the value of the feature vector varies depending on the sampling timing.

本発明は、上記問題を解決すべくなされたもので、その目的は、従来よりも頑健な基準で特徴を選択するとともに、特徴の算出対象としない領域を設けることで、従来よりも頑健な信号特徴抽出を可能とする信号特徴抽出方法、信号特徴抽出装置、信号特徴抽出プログラム、及びその記録媒体を提供することにある。 The present invention has been made to solve the above-described problems, and its purpose is to select a feature based on a criterion that is more robust than in the past and to provide a signal that is more robust than in the past by providing an area that is not subject to feature calculation. It is an object to provide a signal feature extraction method, a signal feature extraction device, a signal feature extraction program, and a recording medium thereof that enable feature extraction.

上記問題を解決するために、本発明は、入力される信号から特定の信号の抽出に利用される特徴を抽出する信号特徴抽出方法であって、予め識別可能な識別情報が付与される複数のマスクをマスク記憶手段に記憶させる記憶過程と、信号を入力する入力過程と、入力した信号から、ベクトルからなる基本特徴情報を抽出する基本特徴抽出過程と、前記マスク記憶手段からマスクを読み出し、抽出した基本特徴情報の前記読み出したマスクにより定められる最大値を算出する部分領域と最小値を算出する部分領域とからそれぞれ最大値と最小値を算出するマスク内最大最小計算過程と、算出した最小値と最大値の差分を算出する差分計算過程と、算出した差分の最大のマスクの識別情報を前記基本特徴情報の抽出対象領域の量子化値とし、前記信号についての量子化値を含む情報を当該信号の特徴として出力するマスク選択ベクトル量子化過程と、を含むことを特徴とする信号特徴抽出方法である。 In order to solve the above problem, the present invention is a signal feature extraction method for extracting a feature used for extraction of a specific signal from an input signal, and a plurality of identification information that can be identified in advance is provided. A storage process for storing a mask in a mask storage means, an input process for inputting a signal, a basic feature extraction process for extracting basic feature information consisting of a vector from the input signal, and reading and extracting the mask from the mask storage means In-mask maximum / minimum calculation process for calculating the maximum value and the minimum value from the partial area for calculating the maximum value determined by the read mask and the partial area for calculating the minimum value, respectively, and the calculated minimum value And the difference calculation process for calculating the difference between the maximum value and the identification information of the maximum mask of the calculated difference as the quantization value of the extraction target region of the basic feature information, A signal feature extraction method which comprises a mask selection vector quantization process to output as a feature of the signal information including the quantized values for items.

また、本発明は、上記に記載の発明において、前記マスク記憶手段に記憶させるマスクは、最大値を算出する部分領域と、最小値を算出する部分領域と、前記最大値及び前記最小値の算出の対象としない部分領域が組み合わされることを特徴とする。 Further, the present invention is the above-described invention, wherein the mask stored in the mask storage means is a partial area for calculating a maximum value, a partial area for calculating a minimum value, and calculation of the maximum value and the minimum value. It is characterized in that partial areas not subject to the above are combined.

また、本発明は、上記に記載の発明において、前記最大値を算出する部分領域と前記最小値を算出する部分領域とが隣接しないように、前記算出の対象としない部分領域と前記最大値を算出する部分領域と前記最小値を算出する部分領域とが組み合わされることを特徴とする。 Further, the present invention is the invention described in the above, wherein the partial area not to be calculated and the maximum value are set so that the partial area for calculating the maximum value and the partial area for calculating the minimum value are not adjacent to each other. The partial area to be calculated and the partial area to calculate the minimum value are combined.

また、本発明は、上記に記載の発明において、前記基本特徴情報及び前記マスクが２次元平面における四角形を単位とする複数の部分領域で示されている場合、前記最大値を算出する部分領域と前記最小値を算出する部分領域とが、前記四角形の辺についての少なくとも４近傍にて隣接しないように、前記算出の対象としない部分領域と前記最大値を算出する部分領域と前記最小値を算出する部分領域とが組み合わされることを特徴とする。 Further, in the present invention described above, when the basic feature information and the mask are indicated by a plurality of partial areas whose unit is a quadrangle in a two-dimensional plane, the present invention provides a partial area for calculating the maximum value; The partial area that is not subject to the calculation, the partial area that calculates the maximum value, and the minimum value are calculated so that the partial area that calculates the minimum value is not adjacent in at least four neighborhoods of the sides of the rectangle. And a partial region to be combined.

また、本発明は、上記に記載の発明において、前記基本特徴情報及び前記マスクが３次元空間における直方体を単位とする複数の部分領域で示されている場合、前記最大値を算出する部分領域と前記最小値を算出する部分領域とが、前記直方体の面についての少なくとも６近傍にて隣接しないように、前記算出の対象としない部分領域と前記最大値を算出する部分領域と前記最小値を算出する部分領域とが組み合わされることを特徴とする。 Further, in the present invention described above, when the basic feature information and the mask are represented by a plurality of partial areas having a rectangular parallelepiped as a unit in a three-dimensional space, the partial area for calculating the maximum value is The partial area not to be calculated, the partial area for calculating the maximum value, and the minimum value are calculated so that the partial area for calculating the minimum value is not adjacent at least in the vicinity of 6 with respect to the surface of the rectangular parallelepiped. And a partial region to be combined.

また、本発明は、入力される信号から特定の信号の抽出に利用される特徴を抽出する信号特徴抽出装置であって、予め識別可能な識別情報が付与される複数のマスクを記憶するマスク記憶手段と、信号を入力する入力手段と、前記入力手段が入力する信号から、ベクトルからなる基本特徴情報を抽出する基本特徴抽出手段と、前記マスク記憶手段からマスクを読み出し、前記基本特徴抽出手段が抽出する基本特徴情報の前記読み出したマスクにより定められる最大値を算出する部分領域と最小値を算出する部分領域とからそれぞれ最大値と最小値を算出するマスク内最大最小計算手段と、前記マスク内最大最小計算手段が算出する最小値と最大値の差分を算出する差分計算手段と、前記差分計算手段が算出する差分の最大のマスクの識別情報を前記基本特徴情報の抽出対象領域の量子化値とし、前記信号についての量子化値を含む情報を当該信号の特徴として出力するマスク選択ベクトル量子化手段と、を備えたことを特徴とする信号特徴抽出装置である。 In addition, the present invention is a signal feature extraction device that extracts a feature used for extracting a specific signal from an input signal, and stores a plurality of masks to which identification information that can be identified in advance is provided. Means, input means for inputting a signal, basic feature extraction means for extracting basic feature information consisting of a vector from a signal input by the input means, a mask is read from the mask storage means, and the basic feature extraction means is In-mask maximum / minimum calculation means for calculating a maximum value and a minimum value respectively from a partial area for calculating a maximum value determined by the read mask and a partial area for calculating a minimum value of basic feature information to be extracted; Difference calculation means for calculating the difference between the minimum value and the maximum value calculated by the maximum / minimum calculation means, and identification information of the maximum mask of the difference calculated by the difference calculation means A signal feature comprising: a mask selection vector quantization unit configured to output information including a quantization value of the signal as a quantization value of the extraction target region of the basic feature information as a feature of the signal. It is an extraction device.

また、本発明は、入力される信号から特定の信号の抽出に利用される特徴を抽出する信号特徴抽出装置のコンピュータに、予め識別可能な識別情報が付与される複数のマスクをマスク記憶手段に記憶させる記憶過程と、信号を入力する入力過程と、入力した信号から、ベクトルからなる基本特徴情報を抽出する基本特徴抽出過程と、前記マスク記憶手段からマスクを読み出し、抽出した基本特徴情報の前記読み出したマスクにより定められる最大値を算出する部分領域と最小値を算出する部分領域とからそれぞれ最大値と最小値を算出するマスク内最大最小計算過程と、算出した最小値と最大値の差分を算出する差分計算過程と、算出した差分の最大のマスクの識別情報を前記基本特徴情報の抽出対象領域の量子化値とし、前記信号についての量子化値を含む情報を当該信号の特徴として出力するマスク選択ベクトル量子化過程と、を実行させるための信号特徴抽出プログラムである。 Further, according to the present invention, a mask of a plurality of masks to which identification information that can be identified in advance is added to a computer of a signal feature extraction apparatus that extracts a feature used for extracting a specific signal from an input signal. A storage process to be stored; an input process to input a signal; a basic feature extraction process to extract basic feature information including a vector from the input signal; and a mask read out from the mask storage means, and the extracted basic feature information The maximum / minimum calculation process in the mask for calculating the maximum value and the minimum value respectively from the partial area for calculating the maximum value determined by the read mask and the partial area for calculating the minimum value, and the difference between the calculated minimum value and maximum value The difference calculation process to be calculated and the identification information of the maximum mask of the calculated difference as the quantization value of the extraction target area of the basic feature information, and the signal Information including the quantized value is a signal feature extraction program for executing a mask selection vector quantization process to output as a feature of the signal.

また、本発明は、入力される信号から特定の信号の抽出に利用される特徴を抽出する信号特徴抽出装置のコンピュータに、予め識別可能な識別情報が付与される複数のマスクをマスク記憶手段に記憶させる記憶過程と、信号を入力する入力過程と、入力される信号から、ベクトルからなる基本特徴情報を抽出する基本特徴抽出過程と、前記マスク記憶手段からマスクを読み出し、抽出した基本特徴情報の前記読み出したマスクにより定められる最大値を算出する部分領域と最小値を算出する部分領域とからそれぞれ最大値と最小値を算出するマスク内最大最小計算過程と、算出した最小値と最大値の差分を算出する差分計算過程と、算出した差分の最大のマスクの識別情報を前記基本特徴情報の抽出対象領域の量子化値とし、前記信号についての量子化値を含む情報を当該信号の特徴として出力するマスク選択ベクトル量子化過程と、を実行させるための信号特徴抽出プログラムを記録したコンピュータ読み取り可能な記録媒体である。 Further, according to the present invention, a mask of a plurality of masks to which identification information that can be identified in advance is added to a computer of a signal feature extraction apparatus that extracts a feature used for extracting a specific signal from an input signal. A storage process for storing, an input process for inputting a signal, a basic feature extraction process for extracting basic feature information consisting of vectors from the input signal, a mask read out from the mask storage means, and the extracted basic feature information Maximum / minimum calculation process in the mask for calculating the maximum value and the minimum value from the partial area for calculating the maximum value determined by the read mask and the partial area for calculating the minimum value, and the difference between the calculated minimum value and maximum value The difference calculation process for calculating the difference and the identification information of the largest mask of the calculated difference as the quantized value of the extraction target area of the basic feature information, and the signal The information including the quantized value is a computer-readable recording medium recording a signal feature extraction program for executing the mask selection vector quantization process to output as a feature, the said signal.

この発明によれば、信号特徴抽出方法は、予め識別可能な識別情報が付与される複数のマスクをマスク記憶手段に記憶させておき、入力される信号から、ベクトルからなる基本特徴情報を抽出し、マスク記憶手段からマスクを読み出し、抽出した基本特徴情報の読み出したマスクにより定められる最大値を算出する部分領域と最小値を算出する部分領域とからそれぞれ最大値と最小値を算出し、算出した最小値と最大値の差分を算出し、算出した差分の最大のマスクの識別情報を基本特徴情報の抽出対象領域の量子化値とし、信号についての量子化値を含む情報を当該信号の特徴として出力することとした。これにより、マスク内における最大値を算出する部分領域と最小値を算出する部分領域とを定めることで、コーナーのような信号内の特徴間の差分の大きい箇所について、頑健な特徴抽出を行なうことが可能となる。 According to this invention, the signal feature extraction method stores a plurality of masks to which identification information that can be identified in advance is stored in the mask storage means, and extracts basic feature information including vectors from the input signal. The mask is read from the mask storage means, and the maximum value and the minimum value are calculated from the partial area for calculating the maximum value and the partial area for calculating the minimum value determined by the read mask of the extracted basic feature information. The difference between the minimum value and the maximum value is calculated, the identification information of the mask with the maximum calculated difference is set as the quantization value of the extraction target area of the basic feature information, and the information including the quantization value of the signal is set as the feature of the signal I decided to output it. This makes it possible to perform robust feature extraction at a point where a difference between features in a signal such as a corner is large by defining a partial region for calculating the maximum value and a partial region for calculating a minimum value in the mask. Is possible.

また、この発明によれば、マスク記憶手段に記憶させるマスクは、最大値を算出する部分領域と、最小値を算出する部分領域と、最大値及び最小値の算出の対象としない部分領域が組み合わされることとした。これにより、コーナーのような信号内の特徴間の差分の大きい箇所のサンプリングのタイミングのずれによる、値の揺らぎの影響を軽減して特徴抽出を行なうことが可能となる。 According to the present invention, the mask stored in the mask storage means is a combination of a partial area for calculating the maximum value, a partial area for calculating the minimum value, and a partial area for which the maximum value and the minimum value are not calculated. It was decided that As a result, it is possible to perform feature extraction while reducing the influence of value fluctuation caused by a sampling timing shift at a location where a difference between features in a signal such as a corner is large.

また、この発明によれば、最大値を算出する部分領域と最小値を算出する部分領域とが隣接しないように、算出の対象としない部分領域と最大値を算出する部分領域と最小値を算出する部分領域とが組み合わされることとした。これにより、コーナーのような信号内の特徴間の差分の大きい箇所のサンプリングのタイミングのずれによる、値の揺らぎの影響を軽減して特徴抽出を行なうことが可能となる。 Further, according to the present invention, the partial area that is not subject to calculation, the partial area that calculates the maximum value, and the minimum value are calculated so that the partial area that calculates the maximum value and the partial area that calculates the minimum value are not adjacent to each other. To be combined with the partial area. As a result, it is possible to perform feature extraction while reducing the influence of value fluctuation caused by a sampling timing shift at a location where a difference between features in a signal such as a corner is large.

また、この発明によれば、基本特徴情報及びマスクが２次元平面における四角形を単位とする複数の部分領域で示されている場合、最大値を算出する部分領域と最小値を算出する部分領域とが、四角形の辺についての少なくとも４近傍にて隣接しないように、算出の対象としない部分領域と最大値を算出する部分領域と最小値を算出する部分領域とが組み合わされることとした。これにより、音響信号のような時間方向と周波数方向のパワー値で示される基本特徴情報において、コーナーのような信号内の特徴間の差分の大きい箇所のサンプリングのタイミングのずれによる、値の揺らぎの影響を軽減して特徴抽出を行なうことが可能となる。 Further, according to the present invention, when the basic feature information and the mask are indicated by a plurality of partial areas each having a quadrangle in the two-dimensional plane, the partial area for calculating the maximum value and the partial area for calculating the minimum value However, the partial area that is not subject to calculation, the partial area that calculates the maximum value, and the partial area that calculates the minimum value are combined so that they are not adjacent in the vicinity of at least four sides of the quadrangular side. As a result, in the basic feature information indicated by the power values in the time direction and the frequency direction such as acoustic signals, the fluctuation of the value due to the sampling timing shift at the location where the difference between the features in the signal such as the corner is large. It is possible to perform feature extraction while reducing the influence.

また、この発明によれば、基本特徴情報及びマスクが３次元空間における直方体を単位とする複数の部分領域で示されている場合、最大値を算出する部分領域と最小値を算出する部分領域とが、直方体の面についての少なくとも６近傍にて隣接しないように、算出の対象としない部分領域と最大値を算出する部分領域と最小値を算出する部分領域とが組み合わされることとした。これにより、映像信号のような時間方向と縦横軸の周波数方向のパワー値で示される基本特徴情報において、コーナーのような信号内の特徴間の差分の大きい箇所のサンプリングのタイミングのずれによる、値の揺らぎの影響を軽減して特徴抽出を行なうことが可能となる。 Further, according to the present invention, when the basic feature information and the mask are indicated by a plurality of partial areas having a rectangular parallelepiped as a unit in the three-dimensional space, the partial area for calculating the maximum value and the partial area for calculating the minimum value However, the partial area that is not subject to calculation, the partial area that calculates the maximum value, and the partial area that calculates the minimum value are combined so that they are not adjacent in the vicinity of at least 6 in the plane of the rectangular parallelepiped. As a result, in the basic feature information indicated by the power values in the time direction and the frequency direction of the vertical and horizontal axes such as video signals, the value due to the sampling timing shift at the point where the difference between the features in the signal such as the corner is large. It is possible to perform feature extraction while reducing the influence of fluctuations.

以下、本発明の実施形態を図面を参照して説明する。
本実施形態は、例えば、ＦＭ（Frequency Modulation）放送の音響信号やテレビの映像信号に含まれるＣＭ（Commercial Message）や番組、楽曲の信号の断片をキーとして、膨大な音響や映像のデータベースの中から同一の信号を高速に検出し、信号の箇所を検出し、同一のコンテンツを特定する信号特徴抽出方法の実施に関するものである。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
In the present embodiment, for example, in an enormous audio and video database, CM (Commercial Message), programs, and music signal fragments included in FM (Frequency Modulation) broadcast audio signals and television video signals are used as keys. The present invention relates to the implementation of a signal feature extraction method that detects the same signal at high speed, detects the location of the signal, and identifies the same content.

図１は、本実施形態による信号特徴抽出装置１を示す概略ブロック図である。
信号特徴抽出装置１は、入力部１０と、基本特徴抽出部１１と、マスク内最大最小計算部１２と、差分計算部１３と、マスク選択ベクトル量子化部１４と、マスク記憶部２０とを備えている。 FIG. 1 is a schematic block diagram showing a signal feature extraction apparatus 1 according to the present embodiment.
The signal feature extraction apparatus 1 includes an input unit 10, a basic feature extraction unit 11, an in-mask maximum / minimum calculation unit 12, a difference calculation unit 13, a mask selection vector quantization unit 14, and a mask storage unit 20. ing.

入力部１０は、例えば、音響装置や映像装置などの出力端子に接続され、特徴抽出する信号が入力されると、基本特徴抽出部１１に当該信号を入力する。基本特徴抽出部１１は、入力される信号から、ベクトルで示される基本特徴情報を抽出する。マスク記憶部２０は、基本特徴情報の中において、マスク内最大最小計算部１２が最大値と最小値を算出する領域を特定する複数のマスクを予め記憶する。 For example, the input unit 10 is connected to an output terminal of an audio device, a video device, or the like, and inputs a signal for feature extraction to the basic feature extraction unit 11. The basic feature extraction unit 11 extracts basic feature information indicated by a vector from the input signal. In the basic feature information, the mask storage unit 20 stores in advance a plurality of masks that specify areas in which the maximum / minimum calculation unit 12 in the mask calculates the maximum value and the minimum value.

図２は、マスク記憶部２０に記憶されるマスクの例を示した図であり、各マスクには予めマスク番号が付与されており、各マスクにおいて、右上がり斜め線の領域は、マスク内最大最小計算部１２が最大値を算出する対象の領域であることを示しており、右下がり斜め線の領域は、マスク内最大最小計算部１２が最小値を算出する対象の領域であることを示している。 FIG. 2 is a diagram showing an example of a mask stored in the mask storage unit 20, where a mask number is assigned in advance to each mask, and in each mask, the area of the diagonal line rising to the right is the maximum in the mask. This indicates that the minimum calculation unit 12 is a target region for calculating the maximum value, and a diagonally downward slanting line region indicates that the maximum / minimum calculation unit 12 in the mask is a target region for calculating the minimum value. ing.

マスク内最大最小計算部１２は、マスク記憶部２０からマスクを読み出し、読み出したマスクごとに特定される基本特徴情報の部分領域、すなわち最大値を算出する対象の領域と最小値を算出する対象の領域から、それぞれ最大値と最小値を算出する。差分計算部１３は、マスク内最大最小計算部１２がマスクごとに算出した最小値と最大値との差分、例えば、最小値から最大値を差し引いた値を算出する。 The in-mask maximum / minimum calculation unit 12 reads out the mask from the mask storage unit 20, and specifies the partial region of the basic feature information specified for each read mask, that is, the target region for calculating the maximum value and the target for calculating the minimum value. The maximum value and the minimum value are calculated from the area. The difference calculation unit 13 calculates a difference between the minimum value and the maximum value calculated for each mask by the in-mask maximum / minimum calculation unit 12, for example, a value obtained by subtracting the maximum value from the minimum value.

マスク選択ベクトル量子化部１４は、差分計算部１３が算出したマスクごとの最小値と最大値との差分の中で差分が最大のマスクの番号を、当該マスクにより特定された部分領域、すなわち注目箇所の量子化値として設定し、当該量子化値を特徴情報として出力する。 The mask selection vector quantization unit 14 determines the number of the mask with the largest difference among the differences between the minimum value and the maximum value for each mask calculated by the difference calculation unit 13, that is, the partial region identified by the mask, that is, attention It is set as a quantized value of a location, and the quantized value is output as feature information.

（信号特徴抽出装置の動作）
次に、図３のフローチャートを参照しつつ、信号特徴抽出装置１により行なわれる信号特徴抽出方法について説明する。なお、以下の説明では、目的となる信号の一例として、音響信号を適用した場合について説明を行なう。また、以下の説明において、ステップＳ６の比較にて用いられる差分最大値とマスク番号とを対応付けて記憶する変数と、ベクトル量子化値を記憶する変数とが記憶領域に予め定められており、それぞれの初期値として、「ＮＵＬＬ」が設定されているものとする。 (Operation of signal feature extraction device)
Next, a signal feature extraction method performed by the signal feature extraction apparatus 1 will be described with reference to the flowchart of FIG. In the following description, a case where an acoustic signal is applied will be described as an example of a target signal. In the following description, a variable for storing the maximum difference value used in the comparison in step S6 in association with the mask number and a variable for storing the vector quantization value are determined in advance in the storage area. Assume that “NULL” is set as the initial value of each.

まず、基本特徴抽出部１１は、入力部１０から入力される音響信号から基本特徴情報を抽出する。具体的には、音響信号が周波数Ｆ[Ｈｚ]で標本化されている場合、当該音響信号のＰポイントをフーリエ変換し、０〜Ｆ’[Ｈｚ]の各帯域でのパワーの値からなるＰ’次元のベクトル、すなわち特徴ベクトルを基本特徴情報として抽出する。そして、基本特徴抽出部１１は、この特徴ベクトルを、Ｓ秒ごとに抽出する（ステップＳ１）。 First, the basic feature extraction unit 11 extracts basic feature information from the acoustic signal input from the input unit 10. Specifically, when the acoustic signal is sampled at the frequency F [Hz], the P point of the acoustic signal is subjected to Fourier transform, and P including the power value in each band of 0 to F ′ [Hz]. 'A dimensional vector, that is, a feature vector is extracted as basic feature information. Then, the basic feature extraction unit 11 extracts this feature vector every S seconds (step S1).

例えば、音響信号が周波数８０００Ｈｚで標本化された８１９２ポイント（約１秒）からなる場合、これをフーリエ変換することにより、０〜４０００Ｈｚの各帯域でのパワー値からなる４０９６個の値で１フレームが表現され、このフレームを０．１秒ごとに抽出することになる。 For example, when an acoustic signal is composed of 8192 points (about 1 second) sampled at a frequency of 8000 Hz, this is Fourier-transformed to produce one frame with 4096 values consisting of power values in each band of 0 to 4000 Hz. This frame is extracted every 0.1 second.

フーリエ変換後の音響信号がＳ秒ごとに抽出された場合、基本特徴情報は、図４に示すような時間周波数空間上のパワー値として表されることになる。図４において、濃い網掛け及び薄い網掛けのない白色領域と、薄い網掛けの領域と、濃い網掛け領域とは、それぞれパワー値が異なることを示している。このとき、基本特徴抽出部１１は、図４に示すように時間周波数空間を四角形を単位とする升目上に分割し、分割領域ごとのパワー値の平均値を算出する。図５は、図４の基本特徴情報に対して升目ごとのパワー値の平均値を算出した結果を示した図である。 When the acoustic signal after Fourier transform is extracted every S seconds, the basic feature information is represented as a power value in a time-frequency space as shown in FIG. In FIG. 4, it is shown that the power value is different between the white area without dark shading and light shading, the thin shading area, and the dark shading area. At this time, as shown in FIG. 4, the basic feature extraction unit 11 divides the time-frequency space into cells with a square as a unit, and calculates an average value of power values for each divided region. FIG. 5 is a diagram illustrating a result of calculating an average value of power values for each cell with respect to the basic feature information of FIG.

次に、マスク内最大最小計算部１２は、基本特徴抽出部１１から図５に示されるような、パワー値で示された基本特徴情報が入力されると、入力される基本特徴情報において、最大値及び最小値の算出の対象とされる注目箇所の領域が基本特徴情報に存在するか否かを判定する（ステップＳ２）。例えば、図４では、５×５の注目箇所の領域を時間方向及び周波数方向に１升目ごとにずらして最大値及び最小値の算出を行なうことになる。このとき、入力された基本特徴情報において、算出の対象となる５×５の注目箇所の領域が基本特徴情報の領域に重なる場合には、注目箇所の領域が基本特徴情報に存在するとして判定し（ステップＳ２：ＹＥＳ）、基本特徴情報の領域にて存在する全ての注目箇所の領域において最大値及び最小値の算出を行なった場合には、注目箇所の領域が基本特徴情報に存在しないとして判定することになる（ステップＳ２：ＮＯ）。 Next, when the basic feature information indicated by the power value as shown in FIG. 5 is input from the basic feature extraction unit 11 as shown in FIG. It is determined whether or not the region of the target location for which the value and the minimum value are to be calculated exists in the basic feature information (step S2). For example, in FIG. 4, the maximum value and the minimum value are calculated by shifting the 5 × 5 region of interest region by one square in the time direction and the frequency direction. At this time, in the input basic feature information, if the 5 × 5 region of interest to be calculated overlaps the region of the basic feature information, it is determined that the region of interest exists in the basic feature information. (Step S <b> 2: YES), when the maximum value and the minimum value are calculated in all the regions of interest existing in the basic feature information region, it is determined that the region of interest does not exist in the basic feature information. (Step S2: NO).

次に、マスク内最大最小計算部１２は、注目箇所の領域が存在すると判定した場合、マスク記憶部２０に、次のマスクが存在するか否かを判定する（ステップＳ３）。例えば、マスク記憶部２０に図２に示される８個のマスクが存在する場合、１つの注目箇所の領域に対して８個のマスクを１つずつマスク記憶部２０から読み出すことになる。このとき、１つの注目箇所の領域に対して７個目までのマスクを読み出している場合には、次のマスクは存在するものとして判定し（ステップＳ３：ＹＥＳ）、８個のマスクを全て読み出した場合には、次のマスクが存在しないものとして判定することになる（ステップＳ３：ＮＯ）。 Next, if the in-mask maximum / minimum calculation unit 12 determines that the region of interest exists, it determines whether or not the next mask exists in the mask storage unit 20 (step S3). For example, if there are eight masks shown in FIG. 2 in the mask storage unit 20, eight masks are read from the mask storage unit 20 one by one for one region of interest. At this time, if up to the seventh mask is read out for one region of interest, it is determined that the next mask exists (step S3: YES), and all the eight masks are read out. If it is determined, it is determined that the next mask does not exist (step S3: NO).

マスク内最大最小計算部１２は、例えば、最初のマスクである図２のマスク番号１のマスクを読み出した場合（ステップＳ３：ＹＥＳ）、図５に示すように注目箇所の領域に、読み出したマスクを配置する。図６は、図５の５×５の注目箇所の領域を拡大して示した図である。マスク内最大最小計算部１２は、上述したように右上がり斜め線の領域は、最大値を算出する対象の領域であることを示しており、右下がり斜め線の領域は、マスク内最大最小計算部１２が最小値を算出する対象の領域であることを示している。図６の例では、マスク内最大最小計算部１２は、最大値として０．７を算出し、最小値として３．９を算出することになる。 For example, when the mask maximum / minimum calculation unit 12 reads the mask of mask number 1 in FIG. 2 as the first mask (step S3: YES), the mask read out in the region of interest as shown in FIG. Place. FIG. 6 is an enlarged view of the 5 × 5 region of interest in FIG. As described above, the mask maximum / minimum calculation unit 12 indicates that a region with a diagonal line rising to the right is a target region for calculating a maximum value, and a region with a diagonal line to the right is a maximum / minimum calculation within a mask. It shows that the part 12 is a target area for calculating the minimum value. In the example of FIG. 6, the in-mask maximum / minimum calculation unit 12 calculates 0.7 as the maximum value and calculates 3.9 as the minimum value.

次に、差分計算部１３は、マスク内最大最小計算部１２が算出した最小値と最大値との差分を算出し、算出した差分と、マスク内最大最小計算部１２が最大値と最小値の算出に用いたマスク番号とをマスク選択ベクトル量子化部１４に入力する（ステップＳ５）。図６の例では、３．９−０．７＝３．２を算出し、マスク番号としては、最初に読み出された「１」を選択し、算出した差分「３．２」とマスク番号「１」をマスク選択ベクトル量子化部１４に入力する。 Next, the difference calculation unit 13 calculates a difference between the minimum value and the maximum value calculated by the in-mask maximum / minimum calculation unit 12, and the calculated difference and the in-mask maximum / minimum calculation unit 12 calculate the difference between the maximum value and the minimum value. The mask number used for the calculation is input to the mask selection vector quantization unit 14 (step S5). In the example of FIG. 6, 3.9−0.7 = 3.2 is calculated, “1” read first is selected as the mask number, and the calculated difference “3.2” and the mask number are selected. “1” is input to the mask selection vector quantization unit 14.

次に、マスク選択ベクトル量子化部１４は、差分計算部１３から入力される差分の値と、差分最大値の変数に記憶されている値を比較し、差分計算部１３から入力される差分の値が、差分最大値の変数に記憶されている値を超過しているか否かを判定する（ステップＳ６）。ここで、差分最大値の変数に「ＮＵＬＬ」が記憶されている場合には、差分計算部１３から入力される差分の値が、差分最大値の変数に記憶されている値を超過しているものとして判定する。すなわち、当該判定処理において、図６の例における最初の判定の際には、最大差分値の変数には、入力される差分の値「３．２」が記憶され、マスク番号の変数には入力されるマスク番号の「１」が記憶される。 Next, the mask selection vector quantization unit 14 compares the difference value input from the difference calculation unit 13 with the value stored in the variable of the maximum difference value, and calculates the difference input from the difference calculation unit 13. It is determined whether or not the value exceeds the value stored in the variable of the maximum difference value (step S6). Here, when “NULL” is stored in the variable of the maximum difference value, the value of the difference input from the difference calculation unit 13 exceeds the value stored in the variable of the maximum difference value. Judge as something. That is, in the determination process, in the first determination in the example of FIG. 6, the input difference value “3.2” is stored in the variable of the maximum difference value, and the variable of the mask number is input. The mask number “1” is stored.

一方、差分計算部１３から入力される差分の値が、差分最大値の変数に記憶されている値を超過していないと判定した場合、ステップＳ３に戻り、次のマスクにてステップＳ４以降の処理が繰り返されることになる。 On the other hand, if it is determined that the difference value input from the difference calculation unit 13 does not exceed the value stored in the variable of the maximum difference value, the process returns to step S3, and the subsequent masks after step S4. The process will be repeated.

差分計算部１３から入力される差分の値が、差分最大値の変数に記憶されている値を超過しているものとして判定した場合（ステップＳ６：ＹＥＳ）、マスク選択ベクトル量子化部１４は、ベクトル量子化値の変数に、マスク番号の変数に記憶される値を記憶させて、マスク量子化値を更新する（ステップＳ７）。前述した算出した差分「３．２」とマスク番号「１」がそれぞれ最大差分値の変数とマスク番号の変数に記憶されている場合には、ベクトル量子化値として「１」を記憶させる。 When it is determined that the difference value input from the difference calculation unit 13 exceeds the value stored in the variable of the maximum difference value (step S6: YES), the mask selection vector quantization unit 14 The value stored in the mask number variable is stored in the vector quantization value variable, and the mask quantization value is updated (step S7). When the calculated difference “3.2” and the mask number “1” are stored in the variable of the maximum difference value and the variable of the mask number, respectively, “1” is stored as the vector quantization value.

ここで、図４に示す基本特徴情報の中から注目箇所として選択されている領域に示される矢印の方向のコーナー（角）を検出する場合に用いられる図２に示したマスクの構成について説明する。図２に示すマスクは、マスク番号１のマスクを時計周りに９０度ずつ回転させたマスク番号２、３、４のマスクと、マスク番号１の最大値の算出対象領域と、最小値の算出対象領域を反転したマスク番号５のマスクと、マスク番号５のマスクを時計周りに９０度ずつ回転させたマスク番号６、７、８のマスクである。このマスクの特徴としては、最大値算出対象領域と最小値算出対象領域の間に、算出対象としない領域を設けることで、四角形の辺における８近傍にて最大値算出対象領域と最小値算出対象領域とが隣接しないようにされており、図４に示すコーナーのような特徴的なサンプリングのタイミングのずれによる、パワー値の揺らぎの影響を軽減することにある。 Here, the configuration of the mask shown in FIG. 2 used when detecting the corner (corner) in the direction of the arrow shown in the region selected as the target location from the basic feature information shown in FIG. 4 will be described. . The mask shown in FIG. 2 includes masks of mask numbers 2, 3, and 4 obtained by rotating the mask of mask number 1 by 90 degrees clockwise, a calculation target area of the maximum value of mask number 1, and a calculation target of the minimum value. The mask of mask number 5, which is an inverted region, and the masks of mask numbers 6, 7, 8 obtained by rotating the mask of mask number 5 by 90 degrees clockwise. As a feature of this mask, by providing an area that is not to be calculated between the maximum value calculation target area and the minimum value calculation target area, the maximum value calculation target area and the minimum value calculation target in the vicinity of 8 in a quadrangular side. This is to reduce the influence of fluctuations in the power value due to characteristic sampling timing shifts such as the corners shown in FIG.

なお、コーナーを検出するような場合には、最大値算出対象領域または最小値算出対象領域を凹形状または凸形状とし、注目箇所の中心が最大値算出対象領域または最小値算出対象領域のいずれかに属するようにマスクを設定することで良好な結果を得ることができる。 When detecting a corner, the maximum value calculation target area or the minimum value calculation target area is a concave shape or a convex shape, and the center of the attention point is either the maximum value calculation target area or the minimum value calculation target area. Good results can be obtained by setting the mask so as to belong to.

図７は、図５に示した注目領域に対して、図２に示したマスク番号１から８のマスクを適用し、ステップＳ３からステップＳ７の処理が行なわれた場合のそれぞれの結果を示した図である。このとき、差分値の最大値は、マスク番号が「１」の場合における「３．２」であり、最後にステップＳ７が行なわれた後のベクトル量子化値の変数の値は「１」が記憶されることになり、マスク選択ベクトル量子化部１４は、図８に示すように注目箇所の中心の座標に「１」を設定する。このようなベクトル量子化値が、当該基本特徴情報において算出されたものが当該信号の特徴として出力されることになる。 FIG. 7 shows the respective results when the masks of mask numbers 1 to 8 shown in FIG. 2 are applied to the region of interest shown in FIG. 5 and the processing from step S3 to step S7 is performed. FIG. At this time, the maximum value of the difference value is “3.2” when the mask number is “1”, and the variable value of the vector quantization value after the last step S7 is “1”. As a result, the mask selection vector quantization unit 14 sets “1” to the coordinates of the center of the point of interest as shown in FIG. The vector quantization value calculated in the basic feature information is output as the feature of the signal.

図９は、上記の実施形態の構成における信号特徴抽出装置１の効果を示すための図である。図９の例では、注目箇所における中心が共通となる５×５の注目箇所と３×３の注目箇所において、図２に示した５×５のマスクを用いた場合と、最大値算出対象領域と最小値算出対象領域との間に算出対象としない領域を有しない２つの５×５のマスクを用いた場合と、最大値算出対象領域と最小値算出対象領域との間に算出対象としない領域を有しない３×３のマスクを用いた場合におけるそれぞれの最大差分値の算出結果が示されている。図２に示すマスクを用いた場合の最大差分値は、前述したように３．２として算出される。最大値算出対象領域と最小値算出対象領域との間に算出対象としない領域を有しない２つの５×５のマスクを用いた場合には、それぞれ０．３と２．０の最大差分値が算出され、最大値算出対象領域と最小値算出対象領域との間に算出対象としない領域を有しない３×３のマスクを用いた場合には、−０．６の最大差分値が算出される。 FIG. 9 is a diagram for illustrating the effect of the signal feature extraction apparatus 1 in the configuration of the above embodiment. In the example of FIG. 9, the maximum value calculation target region is obtained when the 5 × 5 mask shown in FIG. 2 is used at the 5 × 5 attention point and the 3 × 3 attention point that have the same center at the attention point. When using two 5 × 5 masks that do not have a non-calculation area between the minimum value calculation target area and the minimum value calculation target area, and not between the maximum value calculation target area and the minimum value calculation target area The calculation results of the respective maximum difference values when a 3 × 3 mask having no area is used are shown. The maximum difference value when the mask shown in FIG. 2 is used is calculated as 3.2 as described above. When two 5 × 5 masks having no non-calculation area between the maximum value calculation target area and the minimum value calculation target area are used, the maximum difference values of 0.3 and 2.0 are respectively obtained. When a 3 × 3 mask that is calculated and does not have a non-calculation area between the maximum value calculation target area and the minimum value calculation target area is used, a maximum difference value of −0.6 is calculated. .

この結果から、マスクの大きさについては、３×３のマスクに比べて５×５のマスクを用いる方が、差分値が大きくなり、当該値によりコーナーの存在を示す強度を強くできることがわかる。また、算出対象としない領域を有しない５×５のマスクと３×３のマスクでは、最小値と最大値の差分値が小さくなり、当該値によりコーナーの存在を示す強度が弱くなっている。これにより、算出対象としない領域を有するマスクを用いる方が、算出対象としない領域を有しないマスクを用いた場合よりもコーナーの存在をより強い強度で表現することができ、さらに、安定してコーナーを検出することが可能になることを示すことができる。 From this result, it can be seen that the mask size is larger when the 5 × 5 mask is used than when the 3 × 3 mask is used, and the intensity indicating the presence of corners can be increased by the value. Further, in a 5 × 5 mask and a 3 × 3 mask that do not have a region not to be calculated, the difference value between the minimum value and the maximum value becomes small, and the intensity indicating the presence of a corner is weakened by the value. As a result, the use of a mask having a region that is not a calculation target can express the presence of a corner with a stronger intensity than when a mask that does not have a calculation target is used, and more stably. It can be shown that a corner can be detected.

また、このことから、本実施形態の信号特徴抽出装置１を用いることで、特徴の算出対象としない領域を定めることで、信号内の特徴間の差分の大きい箇所を利用して、頑健な特徴の検出を可能にすることが可能となり、前述した従来技術に比べてより頑健な基準で特徴を抽出することが可能となる。 In addition, from this, by using the signal feature extraction apparatus 1 of the present embodiment, by defining a region that is not a feature calculation target, a robust feature can be used by using a location where the difference between features in the signal is large. Can be detected, and features can be extracted based on criteria more robust than the above-described conventional technique.

なお、上記の実施形態の構成において、ベクトル量子化値の全てを利用する必要はなく、注目箇所の周辺領域において最小値と最大値の差分値の絶対値の大きい値の上位の値だけを選択、すなわちスパースな状態で特徴を抽出するようにすることも可能である。 In the configuration of the above embodiment, it is not necessary to use all of the vector quantization values, and only the upper value of the larger absolute value of the difference value between the minimum value and the maximum value is selected in the peripheral region of the target location. In other words, it is possible to extract features in a sparse state.

上述した実施形態における図２に示したマスクは、４方向のコーナーを検出するための形状を有しているが、例えば、８方向のコーナーを検出する場合には、図１０に示すような形状のマスクを８個追加して適用するようにしてもよい。 The mask shown in FIG. 2 in the embodiment described above has a shape for detecting corners in four directions. For example, when detecting a corner in eight directions, the shape shown in FIG. The eight masks may be added and applied.

また、図２に示したマスクは、前述したように８近傍にて最大値算出対象領域と最小値算出対象領域とが隣接しないように配置されているが、図１１に示すように４近傍で隣接しないように配置するようにしてもよい。 The mask shown in FIG. 2 is arranged so that the maximum value calculation target area and the minimum value calculation target area are not adjacent in the vicinity of 8 as described above, but in the vicinity of 4 as shown in FIG. You may make it arrange | position so that it may not adjoin.

また、上記の実施形態では、目的信号を音響信号を例として説明を行なったが、本発明は、上記の実施形態の構成に限られず、例えば、目的信号を映像信号の縮小画像を用いるようにしてもよい。この場合、例えば、映像信号をテレビの放送信号としたとき、テレビの放送信号から１５秒程度の映像信号を探索する構成とするときには、まず、テレビの放送信号の１フレームの画像を横にＸ等分割し、縦にＹ等分割して、合計Ｘ×Ｙ個の領域を設ける。そして、各領域内でＲＧＢそれぞれについて画素値を平均する。そして、各領域のＲＧＢそれぞれの平均画素値からなる３×Ｘ×Ｙ個の値から構成される画像を１フレームあたり縮小画像とする。例えば、１フレームの画像を横に１６等分割し、縦に１２等分割して１９２個の領域を設け、各領域内でＲＧＢそれぞれについて画素値を平均して１フレーム分の縮小画像を生成した場合、当該縮小画像は５７６個の値を含むことになる。 In the above embodiment, the target signal is described by taking the acoustic signal as an example. However, the present invention is not limited to the configuration of the above embodiment, and for example, the target signal may be a reduced image of the video signal. May be. In this case, for example, when the video signal is a television broadcast signal, when a video signal of about 15 seconds is searched from the television broadcast signal, first, an image of one frame of the television broadcast signal is X Equally divided and vertically divided into Y, for a total of X × Y areas. Then, the pixel values are averaged for each of RGB in each region. Then, an image composed of 3 × X × Y values composed of the average pixel values of RGB in each region is set as a reduced image per frame. For example, an image of one frame is divided into 16 equal parts horizontally and divided into 12 equal parts vertically to provide 192 areas, and pixel values are averaged for each of RGB in each area to generate a reduced image for one frame. In this case, the reduced image includes 576 values.

映像信号の各フレームごとに、当該縮小画像を生成し、生成した縮小画像をＲＧＢごとに時間方向に重ねたものを基本特徴情報として、上述した特徴抽出の方法を適用することができる。このとき、マスクとしては、３次元に拡張したものが利用されることになる。例えば、図１２は、１つの３次元のマスクを示した図であり、マスク１−１からマスク１−５を時間軸方向に重ねることで３次元のマスクが構成されることになる。 The feature extraction method described above can be applied by generating the reduced image for each frame of the video signal and using the generated reduced image superimposed in the time direction for each RGB as basic feature information. At this time, the mask expanded in three dimensions is used. For example, FIG. 12 is a diagram showing one three-dimensional mask, and a three-dimensional mask is configured by overlapping masks 1-1 to 1-5 in the time axis direction.

図１３は、２次元の縮小画像が時間軸方向に重ねたものを３次元空間に配置され、注目箇所にマスク１−１からマスク１−５が重ねられた３次元のマスクが配置された状態を示した図である。このマスクを、前述した実施形態の構成を同じく、ＲＧＢごとの基本特徴情報において、縦軸、横軸、時間軸方向にずらして、最小値と最大値とを算出し、最小値と最大値の差分に基づいてベクトル量子化値を注目箇所ごとに算出することで特徴を抽出することが可能となる。 FIG. 13 shows a state in which two-dimensional reduced images superimposed in the time axis direction are arranged in a three-dimensional space, and a three-dimensional mask in which masks 1-1 to 1-5 are overlaid is placed at a point of interest. FIG. Similarly to the configuration of the above-described embodiment, the mask is shifted in the vertical axis, horizontal axis, and time axis directions in the basic feature information for each RGB to calculate the minimum value and the maximum value. A feature can be extracted by calculating a vector quantization value for each point of interest based on the difference.

なお、図１２に示すマスク１−１からマスク１−５で構成されるマスクでは、２６近傍で隣接最大値算出対象領域と最小値算出対象領域とが隣接しないように配置されているが、３次元のマスクにおいても、２６方向を検出するマスクへ拡張することもでき、また、６近傍で隣接しないマスクとすることもできる。また、さらに、３次元以上の多次元のマスクへ拡張して適用することも可能である。 In addition, in the mask constituted by the mask 1-1 to the mask 1-5 shown in FIG. 12, the adjacent maximum value calculation target region and the minimum value calculation target region are arranged in the vicinity of 26 so as not to be adjacent to each other. The dimension mask can also be extended to a mask that detects 26 directions, or can be a mask that is not adjacent in the vicinity of 6. Further, it can be applied to a three-dimensional or more multi-dimensional mask.

また、図３に示す各ステップを実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、また、図１に示す信号特徴抽出装置の機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、また、図１に示す信号特徴抽出装置における処理部の機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより、信号特徴抽出処理を行なってもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものであってもよい。 Also, a program for realizing each step shown in FIG. 3 is recorded on a computer-readable recording medium, and a program for realizing the function of the signal feature extraction device shown in FIG. A program for recording on a medium and for realizing the function of the processing unit in the signal feature extraction apparatus shown in FIG. 1 is recorded on a computer-readable recording medium, and the program recorded on the recording medium is stored in a computer system. The signal feature extraction processing may be performed by reading and executing. Here, the “computer system” may include an OS and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、フラッシュメモリ等の書き込み可能な不揮発性メモリ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW system is used. The “computer-readable recording medium” means a flexible disk, a magneto-optical disk, a ROM, a writable nonvolatile memory such as a flash memory, a portable medium such as a CD-ROM, a hard disk built in a computer system, etc. This is a storage device.

さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムが送信された場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリ（例えばＤＲＡＭ（Dynamic Random Access Memory））のように、一定時間プログラムを保持しているものも含むものとする。また、上記プログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク（通信網）や電話回線等の通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。また、上記プログラムは、前述した機能の一部を実現するためのものであっても良い。さらに、前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であっても良い。 Further, the “computer-readable recording medium” means a volatile memory (for example, DRAM (Dynamic DRAM) in a computer system that becomes a server or a client when a program is transmitted through a network such as the Internet or a communication line such as a telephone line. Random Access Memory)), etc., which hold programs for a certain period of time. The program may be transmitted from a computer system storing the program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in the transmission medium. Here, the “transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line. The program may be for realizing a part of the functions described above. Furthermore, what can implement | achieve the function mentioned above in combination with the program already recorded on the computer system, and what is called a difference file (difference program) may be sufficient.

本実施形態による信号特徴抽出装置の内部構成を示したブロック図である。It is the block diagram which showed the internal structure of the signal feature extraction apparatus by this embodiment. 同実施形態におけるマスク記憶部に記憶されるマスクを示した図である。It is the figure which showed the mask memorize | stored in the mask memory | storage part in the same embodiment. 同実施形態における信号特徴抽出装置の処理を示したフローチャートである。It is the flowchart which showed the process of the signal feature extraction apparatus in the embodiment. 同実施形態における入力される信号の一例を示した図である。It is the figure which showed an example of the signal input in the same embodiment. 同実施形態における基本特徴抽出部から出力される基本特徴情報及び当該基本特徴情報にマスクが適用された例を示した図である。It is the figure which showed the example by which the mask was applied to the basic feature information output from the basic feature extraction part in the same embodiment, and the said basic feature information. 同実施形態におけるマスク内最大最小計算部による最大値と最小値の算出対象を示した図である。It is the figure which showed the calculation object of the maximum value by the maximum / minimum calculation part in a mask in the same embodiment, and a minimum value. 同実施形態におけるマスク内最大最小計算部の処理を説明するための図である。It is a figure for demonstrating the process of the maximum / minimum calculation part in a mask in the embodiment. 同実施形態におけるマスク選択ベクトル量子化部による量子化が行なわれた結果を示した図である。It is the figure which showed the result as which the quantization by the mask selection vector quantization part in the same embodiment was performed. 同実施形態における信号特徴抽出方法による処理結果と他のマスクを適用した場合の比較を示した図である。It is the figure which showed the comparison at the time of applying the processing result by the signal feature extraction method in the embodiment, and another mask. 同実施形態における信号特徴抽出装置に適用できる他のマスクの例（その１）を示した図である。It is the figure which showed the example (the 1) of the other mask applicable to the signal feature extraction apparatus in the embodiment. 同実施形態における信号特徴抽出装置に適用できる他のマスクの例（その２）を示した図である。It is the figure which showed the example (the 2) of the other mask applicable to the signal feature extraction apparatus in the embodiment. 同実施形態における信号特徴抽出装置に映像信号を入力した場合に適用するマスクの例を示した図である。It is the figure which showed the example of the mask applied when a video signal is input into the signal feature extraction apparatus in the embodiment. 同実施形態における信号特徴抽出装置に映像信号を入力した場合のマスクの適用例を示した図である。It is the figure which showed the example of application of the mask at the time of inputting a video signal into the signal feature extraction device in the embodiment.

Explanation of symbols

１信号特徴抽出装置
１０入力部
１１基本特徴抽出部
１２マスク内最大最小計算部
１３差分計算部
１４マスク選択ベクトル量子化部
２０マスク記憶部 DESCRIPTION OF SYMBOLS 1 Signal feature extraction apparatus 10 Input part 11 Basic feature extraction part 12 Maximum / minimum calculation part in mask 13 Difference calculation part 14 Mask selection vector quantization part 20 Mask memory | storage part

Claims

A signal feature extraction method for extracting features used for extracting a specific signal from an input signal,
A storage process of storing a plurality of masks to which identification information that can be identified in advance is stored in a mask storage unit;
An input process for inputting a signal;
A basic feature extraction process for extracting basic feature information consisting of vectors from the input signal;
In the mask for calculating the maximum value and the minimum value from the partial area for calculating the maximum value determined by the read mask and the partial area for calculating the minimum value of the basic feature information extracted from the mask storage means. Maximum and minimum calculation process;
A difference calculation process for calculating a difference between the calculated minimum value and maximum value;
A mask selection vector quantization process for outputting the identification information of the mask with the largest calculated difference as a quantization value of the extraction target region of the basic feature information, and outputting information including the quantization value of the signal as a feature of the signal; ,
A signal feature extraction method characterized by comprising:

The mask stored in the mask storage means is
The signal feature extraction according to claim 1, wherein the partial area for calculating the maximum value, the partial area for calculating the minimum value, and the partial area for which the maximum value and the minimum value are not calculated are combined. Method.

The partial area not to be calculated, the partial area for calculating the maximum value, and the partial area for calculating the minimum value so that the partial area for calculating the maximum value and the partial area for calculating the minimum value are not adjacent to each other. The signal feature extraction method according to claim 2, wherein

When the basic feature information and the mask are represented by a plurality of partial areas each having a quadrangle on a two-dimensional plane, the partial area for calculating the maximum value and the partial area for calculating the minimum value are the quadrangle. The partial area not to be calculated, the partial area for calculating the maximum value, and the partial area for calculating the minimum value are combined so as not to be adjacent in at least four neighborhoods of the side of Item 3. The signal feature extraction method according to Item 2.

When the basic feature information and the mask are indicated by a plurality of partial areas having a rectangular parallelepiped as a unit in a three-dimensional space, the partial area for calculating the maximum value and the partial area for calculating the minimum value are the rectangular parallelepiped. The partial area not to be calculated, the partial area for calculating the maximum value, and the partial area for calculating the minimum value are combined so as not to be adjacent at least in the vicinity of 6 of the surface. Item 3. The signal feature extraction method according to Item 2.

A signal feature extraction device for extracting features used for extraction of a specific signal from an input signal,
Mask storage means for storing a plurality of masks to which identification information that can be identified in advance is provided;
An input means for inputting a signal;
Basic feature extraction means for extracting basic feature information consisting of vectors from a signal input by the input means;
A mask is read from the mask storage means, and a maximum value and a minimum value are respectively obtained from a partial area for calculating a maximum value determined by the read mask and a partial area for calculating a minimum value of basic feature information extracted by the basic feature extraction means. In-mask maximum / minimum calculation means for calculating a value;
A difference calculating means for calculating a difference between a minimum value and a maximum value calculated by the maximum / minimum calculating means in the mask;
The mask selection for outputting the information including the quantization value of the signal as the feature of the signal, using the identification information of the mask with the maximum difference calculated by the difference calculation means as the quantization value of the extraction target region of the basic feature information Vector quantization means;
A signal feature extraction apparatus comprising:

In the computer of the signal feature extraction device for extracting features used for extraction of a specific signal from the input signal,
A storage process of storing a plurality of masks to which identification information that can be identified in advance is stored in a mask storage unit;
An input process for inputting a signal;
A basic feature extraction process for extracting basic feature information consisting of vectors from the input signal;
In the mask for calculating the maximum value and the minimum value from the partial area for calculating the maximum value determined by the read mask and the partial area for calculating the minimum value of the basic feature information extracted from the mask storage means. Maximum and minimum calculation process;
A difference calculation process for calculating a difference between the calculated minimum value and maximum value;
A mask selection vector quantization process for outputting the identification information of the mask with the largest calculated difference as a quantization value of the extraction target region of the basic feature information, and outputting information including the quantization value of the signal as a feature of the signal; ,
Signal feature extraction program for executing

In the computer of the signal feature extraction device for extracting features used for extraction of a specific signal from the input signal,
A storage process of storing a plurality of masks to which identification information that can be identified in advance is stored in a mask storage unit;
An input process for inputting a signal;
A basic feature extraction process for extracting basic feature information consisting of vectors from an input signal;
In the mask for calculating the maximum value and the minimum value from the partial area for calculating the maximum value determined by the read mask and the partial area for calculating the minimum value of the basic feature information extracted from the mask storage means. Maximum and minimum calculation process;
A difference calculation process for calculating a difference between the calculated minimum value and maximum value;
A mask selection vector quantization process for outputting the identification information of the mask with the largest calculated difference as a quantization value of the extraction target region of the basic feature information, and outputting information including the quantization value of the signal as a feature of the signal; ,
The computer-readable recording medium which recorded the signal feature extraction program for performing this.