JP2011022621A

JP2011022621A - Pattern matching device and method

Info

Publication number: JP2011022621A
Application number: JP2010248442A
Authority: JP
Inventors: Tomonari Kakino; 友成柿野; Jian Luan; ルアン・ジアン; Jie Hao; ハオ・ジエ
Original assignee: Toshiba Corp; Toshiba TEC Corp
Current assignee: Toshiba Corp; Toshiba TEC Corp
Priority date: 2006-10-20
Filing date: 2010-11-05
Publication date: 2011-02-03
Anticipated expiration: 2027-03-19
Also published as: CN101165679A; CN100552664C; JP2008102482A; JP5188563B2

Abstract

<P>PROBLEM TO BE SOLVED: To reduce a matching error occurrence rate, even when a standard pattern is compressed. <P>SOLUTION: A pattern matching device 10 compresses the standard pattern by unifying elements similar to an adjoining feature element, into one feature element, in a pattern compression section 12, for each feature element of B1, B2 and B3 for composing the standard pattern. In a compression information creating section 13, a sequence of a compression ratio for each feature element of a compressed compression standard pattern, is created as compression information. The compressed compression standard pattern is stored in a storing section 14, by relating it to the compression information created for the compression standard pattern. A distance between the restored compressed standard pattern and an input pattern is calculated by a recurrence formula of dynamic programming, which has the compression information created for the compression standard pattern, as a variable. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、音声，文字，図形等の入力パターンが、予め想定されている標準パターンのうちのいずれであるかを判定するパターン認識分野で利用されるパターンマッチング装置及び方法に関する。 The present invention relates to a pattern matching apparatus and method used in the pattern recognition field for determining which input pattern of speech, characters, graphics, etc. is one of standard patterns assumed in advance.

従来のパターン認識分野では、パターンマッチング方式として動的計画法（Dynamic programming : ＤＰ）が広く活用されている（例えば、特許文献１参照）。 In the conventional pattern recognition field, dynamic programming (DP) is widely used as a pattern matching method (see, for example, Patent Document 1).

音声のパターン認識分野を例に、動的計画法の原理について説明する。
音声パターンＡは、次の（１）式のように表現される。

The principle of dynamic programming will be described using the speech pattern recognition field as an example.
The voice pattern A is expressed as the following equation (1).

（１）式において、ｉ｛ｉ＝１，２，…，Ｉ｝は時間を示し、ａは音声パターンＡの時間ｉにおける特徴要素を意味している。 In the formula (1), i {i = 1, 2,..., I} indicates time, and a indicates a feature element of the voice pattern A at time i.

そこで、各種の単語毎に、音声パターンＡと同様の、特徴要素の時系列で表現される標準パターンＢを用意しておく。この標準パターンＢは、次の（２）式のように表現される。

Therefore, a standard pattern B expressed in a time series of feature elements similar to the voice pattern A is prepared for each of various words. This standard pattern B is expressed as the following equation (2).

（２）式において、ｊ｛ｊ＝１，２，…，Ｊ｝は時間を示し、ｂは音声パターンＢの時間ｊにおける特徴要素を意味している。 In equation (2), j {j = 1, 2,..., J} indicates time, and b indicates a feature element at time j of the voice pattern B.

パターンマッチング装置では、入力された音声パターンＡを、各種単語の標準パターンＢとそれぞれ比較してパターン間の距離を求める。そして、最小距離の標準パターンを認識結果として出力する。 In the pattern matching apparatus, the input speech pattern A is compared with the standard pattern B of various words, and the distance between the patterns is obtained. Then, the standard pattern with the minimum distance is output as a recognition result.

実際の音声パターンでは、発話速度の変動に起因して時間軸歪が生じる。動的計画法は、この時間軸歪を整合する手法として極めて有効である。 In an actual voice pattern, time axis distortion occurs due to fluctuations in the speech rate. Dynamic programming is extremely effective as a technique for matching this time-axis distortion.

動的計画法では、次の（３）式に示された漸化式を繰返し演算する。

In dynamic programming, the recurrence formula shown in the following formula (3) is repeatedly calculated.

（３）式において、ｄ（ｉ，ｊ）は、音声パターンＡの特徴要素ａ_ｉと標準パターンＢの特徴要素ｂ_ｊとの要素間距離である。ｇ（ｉ，ｊ）は、音声パターンＡと標準パターンＢとの要素間累積距離である。 In the equation (3), d (i, j) is an inter-element distance between the feature element a _i of the voice pattern A and the feature element b _j of the standard pattern B. g (i, j) is an inter-element cumulative distance between the voice pattern A and the standard pattern B.

パターンマッチング装置では、上記漸化式（３）を繰り返し計算する。そして、図９に示すように、Ａ，Ｂ平面でａ_ｉとｂ_ｊとを最適に対応付ける経路（ＤＰ−pass）を求める。 The pattern matching apparatus repeatedly calculates the recurrence formula (3). Then, as shown in FIG. 9, a path (DP-pass) that optimally associates a _i and b _j in the A and B planes is obtained.

上記漸化式（３）の最上段の式は、図１０に示すＡ，Ｂ平面上の任意の点（ｉ，ｊ）に対し、下方に隣接する点（ｉ，ｊ−１）との関係を規定している。同漸化式（３）の中段の式は、点（ｉ，ｊ）に対し、左斜め下に隣接する点（ｉ−１，ｊ−１）との関係を規定している。同漸化式（３）の最下段の式は、点（ｉ，ｊ）に対し、左側に隣接する点（ｉ−１，ｊ）との関係を規定している。 The uppermost expression of the recurrence formula (3) is the relationship between an arbitrary point (i, j) on the A and B planes shown in FIG. Is stipulated. The middle formula of the recurrence formula (3) defines the relationship between the point (i, j) and the point (i-1, j-1) adjacent to the lower left side. The lowermost expression of the recurrence expression (3) defines the relationship between the point (i, j) and the point (i-1, j) adjacent to the left side.

なお、この漸化式（３）から最上段の式を省略することで傾斜制限を加える場合もある。この場合は、任意の点（ｉ，ｊ）に対して図１１に示すような関係が規定される。傾斜制限は極端な伸縮を制限するために設けられる。 In some cases, the tilt restriction may be added by omitting the uppermost expression from the recurrence formula (3). In this case, a relationship as shown in FIG. 11 is defined for an arbitrary point (i, j). The tilt limit is provided to limit extreme expansion and contraction.

ところで一般に、パターンマッチングに必要な標準パターンは多数用意される。このため、標準パターンを圧縮することによって、標準パターンを記憶する領域の効率化を図ることが考えられている。標準パターンを圧縮する方法としては、次の方法がある。すなわち、標準パターンを構成する各特徴要素について、それぞれ隣接する特徴要素との差をとる。そして、その差があらかじめ設定された閾値より小さい場合は、隣接した特長要素が近似していると判定する。近似しているものがあった場合には、それらを１つの特徴要素に統合する。かくして、標準パターンが圧縮される。 In general, a large number of standard patterns necessary for pattern matching are prepared. For this reason, it is considered to improve the efficiency of the area for storing the standard pattern by compressing the standard pattern. There are the following methods for compressing the standard pattern. That is, for each feature element constituting the standard pattern, a difference from an adjacent feature element is taken. If the difference is smaller than a preset threshold value, it is determined that adjacent feature elements are approximate. If there is an approximation, they are integrated into one feature element. Thus, the standard pattern is compressed.

特開昭５０-９６１０４号公報JP 50-96104 A

しかしながら、上述した方法で標準パターンを圧縮した場合、パターン認識に有益な時間情報が消失してしまう。すなわち、隣接する特徴要素が１つに統合されるので、各要素がそれぞれ有していた時間情報の一部が消失する。標準パターンから時間情報の一部が失われると、一般に、マッチングエラー（matching error）が発生し易くなる。特に、ノイズ（noise）の混入により部分的に近似した特徴パターンが数多く認識された場合や、傾斜制限が設けられた場合には、マッチングエラーが顕著に見られる。 However, when the standard pattern is compressed by the method described above, time information useful for pattern recognition is lost. That is, since adjacent feature elements are integrated into one, a part of the time information that each element has disappears. If part of the time information is lost from the standard pattern, a matching error generally tends to occur. In particular, when many feature patterns that are partially approximated due to noise are recognized or when tilt restriction is provided, a matching error is noticeable.

本発明は、このような事情に基づいてなされたもので、その目的とするところは、標準パターンを圧縮した場合でもマッチングエラーの発生率を低減することができるパターンマッチング装置及びその方法を提供しようとするものである。 The present invention has been made based on such circumstances, and an object of the present invention is to provide a pattern matching apparatus and method that can reduce the occurrence rate of matching errors even when a standard pattern is compressed. It is what.

本発明のパターンマッチング装置は、標準パターンを構成する各特徴要素について、隣接する特徴要素と近似しているものを１つの特徴要素に統合することによって標準パターンを圧縮する。また、圧縮された圧縮標準パターンの特徴要素毎の圧縮比の系列を圧縮情報として生成する。そして、圧縮された圧縮標準パターンを、当該圧縮標準パターンに対して生成された圧縮情報と関連付けて記憶する。そして、記憶された圧縮標準パターンと入力パターンとのパターン間距離を、当該圧縮標準パターンに対して生成された圧縮情報を変数として持つ動的計画法の漸化式によって算出する。 The pattern matching apparatus of the present invention compresses a standard pattern by integrating, for each feature element constituting the standard pattern, one that approximates an adjacent feature element into one feature element. In addition, a series of compression ratios for each feature element of the compressed compression standard pattern is generated as compression information. Then, the compressed standard pattern is stored in association with the compression information generated for the compressed standard pattern. Then, the inter-pattern distance between the stored compressed standard pattern and the input pattern is calculated by a recursive formula of dynamic programming having as a variable the compression information generated for the compressed standard pattern.

かかる手段を講じた本発明によれば、標準パターンを圧縮した場合でもマッチングエラーの発生率を低減することができるパターンマッチング装置及びその方法を提供できる。 According to the present invention in which such measures are taken, it is possible to provide a pattern matching apparatus and method that can reduce the occurrence rate of matching errors even when a standard pattern is compressed.

本発明の実施の形態であるパターンマッチング装置の構成を示すブロック図。The block diagram which shows the structure of the pattern matching apparatus which is embodiment of this invention. 同パターンマッチング装置で行われる標準パターン圧縮処理を説明するための図。The figure for demonstrating the standard pattern compression process performed with the pattern matching apparatus. 同パターンマッチング装置の記憶部に記憶されるデータ構造を示す模式図。The schematic diagram which shows the data structure memorize | stored in the memory | storage part of the pattern matching apparatus. 第１の実施の形態におけるパターンマッチング処理部の要部構成を示すブロック図。The block diagram which shows the principal part structure of the pattern matching process part in 1st Embodiment. 第２の実施の形態におけるパターンマッチング処理部の要部構成を示すブロック図。The block diagram which shows the principal part structure of the pattern matching process part in 2nd Embodiment. 第２の実施の形態のパターンマッチング処理部で計算される第１の漸化式を説明するための図。The figure for demonstrating the 1st recurrence formula calculated in the pattern matching process part of 2nd Embodiment. 第２の実施の形態のパターンマッチング処理部で計算される第２の漸化式を説明するための図。The figure for demonstrating the 2nd recurrence formula calculated in the pattern matching process part of 2nd Embodiment. 第２の実施の形態のパターンマッチング処理部で計算される第３の漸化式を説明するための図。The figure for demonstrating the 3rd recurrence formula calculated in the pattern matching process part of 2nd Embodiment. 従来の動的計画法を説明するための図。The figure for demonstrating the conventional dynamic programming. 従来の動的計画法で計算される漸化式を説明するための図。The figure for demonstrating the recurrence formula calculated by the conventional dynamic programming. 従来の動的計画法で計算される漸化式の他の例を説明するための図。The figure for demonstrating the other example of the recurrence formula calculated by the conventional dynamic programming.

以下、本発明を音声のパターン認識分野に適用した実施の形態について、図面を用いて説明する。はじめに、第１の実施の形態について、図１〜図４を用いて説明する。 Embodiments in which the present invention is applied to the field of speech pattern recognition will be described below with reference to the drawings. First, a first embodiment will be described with reference to FIGS.

図１は本実施の形態に係るパターンマッチング装置１０の要部構成を示すブロック図である。パターンマッチング装置１０は、音声分析部１１，パターン圧縮部１２、圧縮情報生成部１３、記憶部１４及びパターンマッチング処理部１５を備えている。 FIG. 1 is a block diagram showing a main configuration of a pattern matching apparatus 10 according to the present embodiment. The pattern matching device 10 includes a voice analysis unit 11, a pattern compression unit 12, a compression information generation unit 13, a storage unit 14, and a pattern matching processing unit 15.

音声分析部１１には、音声信号Ｍが入力される。音声信号Ｍは、マイクロフォンを通して入力された音声から変換された電気信号である。音声分析部１１は、入力された音声信号Ｍを分析する。そして、入力音声の特徴要素ａを時系列で抽出して、前記（１）式で示される音声パターンＡを生成する。音声パターンＡは、パターンマッチング処理部１５に出力される。 A voice signal M is input to the voice analysis unit 11. The sound signal M is an electric signal converted from sound input through a microphone. The voice analysis unit 11 analyzes the input voice signal M. Then, the feature element a of the input voice is extracted in time series to generate the voice pattern A represented by the above equation (1). The voice pattern A is output to the pattern matching processing unit 15.

パターン圧縮部１２には、複数の標準パターンＢ１，Ｂ２，Ｂ３，…が入力される。各標準パターンＢ１，Ｂ２，Ｂ３，…は、各種の単語毎に予め用意されたものである。これらの標準パターンＢ１，Ｂ２，Ｂ３，…は圧縮されていない。パターン圧縮部１２は、入力された各標準パターンＢ１，Ｂ２，Ｂ３，…をそれぞれ時間方向に圧縮する。 The pattern compression unit 12 receives a plurality of standard patterns B1, B2, B3,. Each standard pattern B1, B2, B3,... Is prepared in advance for each type of word. These standard patterns B1, B2, B3,... Are not compressed. The pattern compression unit 12 compresses each input standard pattern B1, B2, B3,.

その圧縮方法について、図２を用いて説明する。同図において、ｂ_１，ｂ_２，ｂ_３，…，ｂ_９は任意の標準パターンＢｘの各特長要素を示している。標準パターンＢｘは、各特長要素ｂ_１，ｂ_２，ｂ_３，…，ｂ_９がその順番に時系列に並べられたものである。 The compression method will be described with reference to FIG. In the figure, b ₁ , b ₂ , b ₃ ,..., B ₉ indicate characteristic elements of an arbitrary standard pattern Bx. In the standard pattern Bx, the feature elements b ₁ , b ₂ , b ₃ ,..., B ₉ are arranged in time series in that order.

パターン圧縮部１２では、標準パターンＢｘを構成する各特長要素ｂ_１，ｂ_２，ｂ_３，…，ｂ_９について、それぞれ隣接した特徴要素との距離を求める。そして、距離が予め設定された閾値以下であった場合に、これら隣接した複数の特徴要素を一つの平均特徴要素ｂ′_ｋ（ｋ＝１，２，３，…）に置き換える。この処理により、局所的に複数の特徴要素が平均特徴要素に圧縮される。標準パターンＢｘの全体にこの処理を施すことにより、圧縮標準パターンＢ′ｘが生成される。 The pattern compressing unit 12 obtains distances between adjacent feature elements for the feature elements b ₁ , b ₂ , b ₃ ,..., B ₉ constituting the standard pattern Bx. When the distance is equal to or less than a preset threshold value, the plurality of adjacent feature elements are replaced with one average feature element b ′ _k (k = 1, 2, 3,...). By this processing, a plurality of feature elements are locally compressed into average feature elements. By applying this process to the entire standard pattern Bx, a compressed standard pattern B′x is generated.

図２の例の場合、隣接する特徴要素ｂ_２とｂ_３との距離が閾値以下であり、これら特徴要素ｂ_２，ｂ_３が平均特徴要素ｂ′_２｛ｂ′_２＝（ｂ_２＋ｂ_３）／２｝に置き換えられている。また、隣接する特徴要素ｂ_４，ｂ_５及びｂ_６の距離が閾値以下であり、これらの特徴要素ｂ_４，ｂ_５及びｂ_６が平均特徴要素ｂ′_３｛ｂ′_３＝（ｂ_４＋ｂ_５＋ｂ_６）／３｝に置き換えられている。また、隣接する特徴要素ｂ_７とｂ_８との距離が閾値以下であり、これら特徴要素ｂ_７，ｂ_８が平均特徴要素ｂ′_４｛ｂ′_４＝（ｂ_７＋ｂ_８）／２｝に置き換えられている。かくして、標準パターンＢｘ（＝ｂ_１，ｂ_２，ｂ_３，…，ｂ_９）が圧縮標準パターンＢ′ｘ（＝ｂ′_１，ｂ′_２，ｂ′_３，ｂ′_４，ｂ′_５）に圧縮される。 In the case of the example in FIG. 2, the distance between adjacent feature elements b ₂ and b ₃ is equal to or less than a threshold value, and these feature elements b ₂ and b ₃ are average feature elements b ′ ₂ {b ′ ₂ = (b ₂ + b ₃ ) / 2}. Further, the distance between adjacent feature elements b ₄ , b ₅ and b ₆ is less than or equal to the threshold value, and these feature elements b ₄ , b ₅ and b ₆ are average feature elements b ′ ₃ {b ′ ₃ = (b ₄ + b ₅ + b ₆ ) / 3}. Further, the distance between the adjacent feature elements b ₇ and b ₈ is equal to or smaller than the threshold value, and these feature elements b ₇ and b ₈ become the average feature element b ′ ₄ {b ′ ₄ = (b ₇ + b ₈ ) / 2}. Has been replaced. Thus, the standard pattern Bx (= b ₁ , b ₂ , b ₃ ,..., B ₉ ) is the compressed standard pattern B′x (= b ′ ₁ , b ′ ₂ , b ′ ₃ , b ′ ₄ , b ′ ₅ ). Is compressed.

圧縮情報生成部１３は、前記パターン圧縮部１２により圧縮された圧縮標準パターンＢ′ｘの特徴要素ｂ′_１，ｂ′_２，ｂ′_３，ｂ′_４，ｂ′_５毎の圧縮比ｎ１，ｎ２，ｎ３，ｎ４．ｎ５の系列を圧縮情報Ｎｘとして生成する。本実施の形態では、図２に示すように、圧縮標準パターンＢ′ｘの各特徴要素ｂ′_１，ｂ′_２，ｂ′_３，ｂ′_４，ｂ′_５が、それぞれ元の標準パターンＢｘの何個分の特徴要素を代表しているのかを示す値を圧縮比ｎ_１，ｎ_２，ｎ_３，ｎ_４．ｎ_５と定義している。 The compression information generation unit 13 includes a compression ratio n1, for each of the characteristic elements b ′ ₁ , b ′ ₂ , b ′ ₃ , b ′ ₄ , b ′ _{5 of} the compression standard pattern B′x compressed by the pattern compression unit 12. n2, n3, n4. An n5 series is generated as compressed information Nx. In the present embodiment, as shown in FIG. 2, each of the characteristic elements b ′ ₁ , b ′ ₂ , b ′ ₃ , b ′ ₄ , b ′ ₅ of the compressed standard pattern B′x is converted into the original standard pattern Bx. Of the number of feature elements representing the compression ratios n ₁ , n ₂ , n ₃ , n ₄ . n ₅ to be defined.

すなわち、圧縮標準パターンＢ′ｘの特徴要素ｂ′_１は、標準パターンＢｘの特徴要素ｂ_１だけを代表しているので、圧縮比ｎ１は“１”である。同様に、特徴要素ｂ′_２は、特徴要素ｂ_２とｂ_３とを代表しているので、圧縮比ｎ２は“２”である。特徴要素ｂ′_３は、特徴要素ｂ_４とｂ_５とｂ_６とを代表しているので、圧縮比ｎ３は“３”である。特徴要素ｂ′_４は、特徴要素ｂ_７とｂ_８とを代表しているので、圧縮比ｎ４は“２”である。特徴要素ｂ′_５は、特徴要素ｂ_９だけを代表しているので、圧縮比ｎ５は“１”である。 That is, since the feature element b ′ ₁ of the compressed standard pattern B′x represents only the feature element b ₁ of the standard pattern Bx, the compression ratio n1 is “1”. Similarly, since the characteristic element b ′ ₂ represents the characteristic elements b ₂ and b ₃ , the compression ratio n2 is “2”. Since the characteristic element b ′ ₃ represents the characteristic elements b ₄ , b _5, and b ₆ , the compression ratio n 3 is “3”. Since the characteristic element b ′ ₄ represents the characteristic elements b ₇ and b ₈ , the compression ratio n4 is “2”. Since the characteristic element b ′ ₅ represents only the characteristic element b ₉ , the compression ratio n5 is “1”.

かくして、圧縮標準パターンＢ′ｘに対する圧縮情報はＮｘは、“１，２，３，１，２”となる。 Thus, the compression information for the compression standard pattern B′x is Nx “1, 2, 3, 1, 2”.

記憶部１４は、図３に示すように、パターン圧縮部１２で圧縮された圧縮標準パターンＢ′ｘを、当該圧縮標準パターンに対して圧縮情報生成部１３で生成された圧縮情報Ｎｘと関連付けて記憶する。 As shown in FIG. 3, the storage unit 14 associates the compression standard pattern B′x compressed by the pattern compression unit 12 with the compression information Nx generated by the compression information generation unit 13 for the compression standard pattern. Remember.

パターンマッチング処理部１５は、図４に示すように、復元部２１と、平滑化処理部２２と、距離計算部２３とを備えている。 As shown in FIG. 4, the pattern matching processing unit 15 includes a restoration unit 21, a smoothing processing unit 22, and a distance calculation unit 23.

復元部２１は、記憶部１４に記憶されている圧縮標準パターンＢ′ｘを、当該圧縮標準パターンＢ′ｘと関連付けて記憶されている圧縮情報Ｎｘに基づき伸長して、標準パターンＢ_ｘ１に復元する。例えば図２の例の場合、圧縮標準パターンＢ′ｘ（ｂ′_１，ｂ′_２，ｂ′_３，ｂ′_４，ｂ′_５）に関連する圧縮情報Ｎｘ（ｎ_１，ｎ_２，ｎ_３，ｎ_４．ｎ_５）は、“１，２，３，１，２”である。 Restoration unit 21 restores the compressed standard pattern B'x stored in the storage unit 14, and extends on the basis of the compression information Nx stored in association with the compression standard pattern B'x, the reference pattern B _x1 To do. For example, in the case of the example of FIG. 2, the compression information Nx (n ₁ , n ₂ , n ₃ ) related to the compression standard pattern B′x (b ′ ₁ , b ′ ₂ , b ′ ₃ , b ′ ₄ , b ′ ₅ ). , N ₄ .n ₅ ) is “1, 2, 3, 1, 2”.

したがって、特徴要素ｂ′_１は、そのままとなる。特徴要素ｂ′_２は、隣接する２つの特徴要素ｂ′_２，ｂ′_２となる。特徴要素ｂ′_３は、隣接する３つの特徴要素ｂ′_３，ｂ′_３，ｂ′_３となる。特徴要素ｂ′_４は、隣接する２つの特徴要素ｂ′_４，ｂ′_４となる。特徴要素ｂ′_５は、そのままとなる。かくして、圧縮標準パターンＢ′ｘは、標準パターンＢ_ｘ１（ｂ′_１，ｂ′_２，ｂ′_２，ｂ′_３，ｂ′_３，ｂ′_３，ｂ′_４，ｂ′_４，ｂ′_５）に復元される。 Therefore, the characteristic element b ′ ₁ remains as it is. The feature element b ′ ₂ becomes _two adjacent feature elements b ′ ₂ and b ′ ₂ . The feature element b ′ ₃ becomes _three adjacent feature elements b ′ ₃ , b ′ ₃ , and b ′ ₃ . The feature element b ′ ₄ becomes two adjacent feature elements b ′ ₄ and b ′ ₄ . The characteristic element b ′ ₅ remains as it is. Thus, the compressed standard pattern B′x is the standard pattern B _x1 (b ′ ₁ , b ′ ₂ , b ′ ₂ , b ′ ₃ , b ′ ₃ , b ′ ₃ , b ′ ₄ , b ′ ₄ , b ′ ₅ ) Is restored.

平滑化処理部２２は、復元部２１で復元された標準パターンＢ_ｘ１に対して低域通過フィルタによる平滑化処理を行う。復元された標準パターンＢ_ｘ１には、圧縮によるノイズが発生している。平滑化処理を施すことによって、この種のノイズを除去できる。 The smoothing processing unit 22 performs a smoothing process using a low-pass filter on the standard pattern B _x1 restored by the restoration unit 21. In the restored standard pattern B _x1 , noise due to compression is generated. This kind of noise can be removed by performing the smoothing process.

距離計算部２３は、音声分析部１１を介して入力された音声パターンＡと、平滑化処理部２２にて平滑化処理された標準パターンＢ_ｘ１とのパターン間距離Ｇｘを、周知の動的計画法により算出する。例えば、前記（３）式に示された漸化式の繰返し演算によってパターン間距離Ｇｘを算出する。 The distance calculation unit 23 calculates the inter-pattern distance Gx between the speech pattern A input via the speech analysis unit 11 and the standard pattern B _x1 smoothed by the smoothing processing unit 22, using a well-known dynamic plan Calculated by the method. For example, the inter-pattern distance Gx is calculated by repetitive calculation of the recurrence formula shown in the formula (3).

パターンマッチング処理部１５では、全ての圧縮標準パターンＢ′ｘについて、前記復元部２１、平滑化処理部２２及び距離計算部２３での処理を繰返し実行する。そして、算出されたパターン間距離Ｇｘが最小となる標準パターンＢｘ_１を求めて、音声パターンＡの認識結果Ｇとして出力する。 The pattern matching processing unit 15 repeatedly executes the processing in the restoration unit 21, the smoothing processing unit 22, and the distance calculation unit 23 for all the compressed standard patterns B′x. Then, the standard pattern Bx ₁ that minimizes the calculated inter-pattern distance Gx is obtained and output as the recognition result G of the voice pattern A.

このように、本実施の形態のパターンマッチング装置１０においては、標準パターンＢｘを圧縮して圧縮標準パターンＢ′ｘを生成する際に、その圧縮標準パターンＢ′ｘの特徴要素毎の圧縮比の系列を圧縮情報Ｎｘとして生成している。そして、圧縮標準パターンＢ′ｘを、当該圧縮標準パターンＢ′ｘに対して生成された圧縮情報Ｎｘと関連付けて記憶部１４で記憶している。 As described above, in the pattern matching apparatus 10 of the present embodiment, when the standard pattern Bx is compressed to generate the compressed standard pattern B′x, the compression ratio for each feature element of the compressed standard pattern B′x is changed. A series is generated as compressed information Nx. The compression standard pattern B′x is stored in the storage unit 14 in association with the compression information Nx generated for the compression standard pattern B′x.

音声信号Ｍが入力されると、音声分析部１１により入力音声の特徴要素ａが時系列で抽出されて、音声パターンＡが生成される。音声パターンＡは、パターンマッチング処理部１５に出力される。パターンマッチング処理部１５では、記憶部１４に記憶された全ての圧縮標準パターンＢ′ｘに対して、以下のパターンマッチング処理が実行される。 When the voice signal M is input, the voice analysis unit 11 extracts the feature element a of the input voice in time series, and the voice pattern A is generated. The voice pattern A is output to the pattern matching processing unit 15. The pattern matching processing unit 15 performs the following pattern matching processing on all the compressed standard patterns B′x stored in the storage unit 14.

先ず、記憶部１４から任意の圧縮標準パターンＢ′ｘと、それに関連する圧縮情報Ｎｘとが読み出される。そして、圧縮標準パターンＢ′ｘが圧縮情報Ｎｘに基づき伸長されて、標準パターンＢ_ｘ１に復元される。次に、復元された標準パターンＢ_ｘ１に対して平滑化処理が行われる。しかる後、入力された音声パターンＡと平滑化処理が施された標準パターンＢ_ｘ１とのパターン間距離Ｇｘが動的計画法により算出される。 First, an arbitrary compressed standard pattern B′x and related compression information Nx are read from the storage unit 14. Then, the compressed standard pattern B′x is expanded based on the compression information Nx, and restored to the standard pattern B _x1 . Next, a smoothing process is performed on the restored standard pattern B _x1 . Thereafter, the inter-pattern distance Gx between the input speech pattern A and the smoothed standard pattern B _x1 is calculated by dynamic programming.

こうして、パターンマッチング処理部１５では、圧縮標準パターンＢ′ｘ毎に、音声パターンＡとのパターン間距離Ｇｘが算出される。そして、パターン間距離Ｇｘが最小となる標準パターンＢｘ_１が音声パターンＡの認識結果Ｇとして出力される。 Thus, the pattern matching processing unit 15 calculates the inter-pattern distance Gx with the sound pattern A for each compressed standard pattern B′x. Then, the standard pattern Bx ₁ that minimizes the inter-pattern distance Gx is output as the recognition result G of the voice pattern A.

このように、圧縮された標準パターンＢ′ｘをそのまま音声パターンＡとのパターン間距離の演算に用いるのではなく、圧縮情報Ｎｘで標準パターンＢ_ｘ１に復元してから用いている。したがって、標準パターンＢｘの圧縮により失われた時間情報が加味されるので、マッチングエラーの発生率が低減される。 Thus, the compressed standard pattern B′x is not used as it is for the calculation of the inter-pattern distance with the speech pattern A, but is used after being restored to the standard pattern B _x1 with the compression information Nx. Therefore, since the time information lost due to the compression of the standard pattern Bx is taken into account, the occurrence rate of matching errors is reduced.

次に、第２の実施の形態について説明する。第２の実施の形態は、パターンマッチング処理部１５のみが第１の実施の形態と異なる。第２の実施の形態におけるパターンマッチング処理部１５の要部構成を図５のブロック図で示す。 Next, a second embodiment will be described. The second embodiment is different from the first embodiment only in the pattern matching processing unit 15. The principal part structure of the pattern matching process part 15 in 2nd Embodiment is shown with the block diagram of FIG.

パターンマッチング処理部１５は、漸化式設定部３１と距離計算部３２とを備えている。距離計算部３２は、音声パターンＡと圧縮標準パターンＢ′ｘとのパターン間距離Ｇｘを、漸化式設定部３１に設定された漸化式の繰返し演算によって算出する。この際、圧縮標準パターンＢ′ｘと関連付けられて記憶されている圧縮情報Ｎｘを用いて漸化式の計算を行う。 The pattern matching processing unit 15 includes a recurrence formula setting unit 31 and a distance calculation unit 32. The distance calculation unit 32 calculates the inter-pattern distance Gx between the voice pattern A and the compressed standard pattern B′x by repetitive calculation of the recurrence formula set in the recurrence formula setting unit 31. At this time, the recurrence formula is calculated using the compression information Nx stored in association with the compression standard pattern B′x.

漸化式設定部３１には、次の（４）式に示された漸化式が設定されている。

The recurrence formula shown in the following formula (4) is set in the recurrence formula setting unit 31.

（４）式において、ｄ（ｉ，ｊ）は、音声パターンＡの特徴要素ａ_ｉと圧縮標準パターンＢ′ｘの特徴要素ｂ′_ｊとの要素間距離である。ｇ（ｉ，ｊ）は、音声パターンＡと圧縮標準パターンＢ′ｘとの要素間累積距離である。ｎ_ｊ−１は圧縮情報Ｎｘの要素（圧縮比）である。 In the equation (4), d (i, j) is an inter-element distance between the feature element a _i of the voice pattern A and the feature element b ′ _j of the compressed standard pattern B′x. g (i, j) is an inter-element cumulative distance between the voice pattern A and the compressed standard pattern B′x. n _j−1 is an element (compression ratio) of the compression information Nx.

上記漸化式（４）の最上段の式は、図６に示すＡ，Ｂ平面上の任意の点（ｉ，ｊ）に対し、左側に隣接する点（ｉ−１，ｊ）との関係を規定している。 The uppermost expression of the recurrence formula (4) is the relationship between an arbitrary point (i, j) on the A and B planes shown in FIG. Is stipulated.

同漸化式（４）の最下段の式は、圧縮情報要素ｎ_ｊ−１が“１”の場合である。この場合は、点（ｉ，ｊ）に対し、左斜め下に隣接する点（ｉ−１，ｊ−１）との関係を規定している。 The lowest equation of the recurrence equation (4) is when the compression information element n _j−1 is “1”. In this case, the relationship between the point (i, j-1) and the point (i-1, j-1) adjacent to the lower left is defined.

同漸化式（４）の中段の式は、圧縮情報要素ｎ_ｊ−１が“１”より大きい場合である。この場合は、点（ｉ，ｊ）に対し、左斜め下に隣接する点（ｉ−１，ｊ−１）から、さらに特徴要素ｂ′_ｊ−１に対応する圧縮情報要素ｎ_ｊ−１に従い制限経路長が伸長された点（ｉ−ｎ_ｊ−１，ｊ−１）との関係を規定している。 The middle formula of the recurrence formula (4) is when the compressed information element n _j−1 is larger than “1”. In this case, with respect to the point (i, j), the point (i−1, j−1) adjacent to the lower left diagonally further follows the compression information element n _j−1 corresponding to the feature element b ′ _j−1. It defines the relationship with the point (in _j-1 , j-1) where the restricted path length is extended.

このような漸化式（４）の演算を繰返し行うことによって、圧縮標準パターンＢ′ｘの特徴要素ｂ′_ｊ−１は、入力パターンＡのｎ_ｊ−１個の要素ａ_ｉとの対応が課せられる。したがって、標準パターンＢｘの圧縮により失われた時間情報が加味されるので、マッチングエラーの発生率が低減される。 By repeatedly performing the calculation of the recurrence formula (4), the feature element b ′ _j−1 of the compressed standard pattern B′x can correspond to the n _j−1 elements a _i of the input pattern A. Imposed. Therefore, since the time information lost due to the compression of the standard pattern Bx is taken into account, the occurrence rate of matching errors is reduced.

同様な効果が得られる漸化式は、上記（４）式に限定されるものではない。例えば、図７に示すように、制限経路長の伸長を、特徴要素ｂ′_ｊに対して行う下記（５）式の漸化式を漸化式設定部３１に設定してもよい。

The recurrence formula that provides the same effect is not limited to the above formula (4). For example, as shown in FIG. 7, the recurrence formula of the following formula (5) for extending the restricted path length with respect to the feature element b ′ _j may be set in the recurrence formula setting unit 31.

あるいは、図８に示すように、制限経路長の伸長を、特徴要素ｂ′_ｊとｂ′_ｊ−１との両方に対して行う下記（６）式の漸化式を漸化式設定部３１に設定してもよい。

Alternatively, as shown in FIG. 8, the recurrence formula of the following formula (6) for extending the restricted path length for both the feature elements b ′ _j and b ′ _j−1 is the recurrence formula setting unit 31. May be set.

なお、前記各実施の形態では、標準パターンＢｘを圧縮する際に、近似した複数の特徴要素の平均をとるようにしたが、近似した特徴要素の１つを代表として選択するようにしてもよい。また、コードブックによるクラスタリング手法を用いることも可能である。 In each of the above embodiments, when compressing the standard pattern Bx, an average of a plurality of approximate feature elements is taken. However, one of the approximate feature elements may be selected as a representative. . It is also possible to use a codebook clustering technique.

また、前記実施の形態では、圧縮情報ｎＸの要素（圧縮比）ｎ_ｊをそのまま用いているが、所定の倍率、例えば０．８倍の値を用いてパターン間距離を計算するようにしてもよい。 In the above embodiment, the element (compression ratio) n _j of the compression information nX is used as it is, but the inter-pattern distance may be calculated using a predetermined magnification, for example, a value of 0.8. Good.

また、前記実施の形態では、音声のパターン認識分野に適用した場合を示したが、本発明は、文字，図形等のパターン認識分野にも同様に適用できるものである。この他、本発明の要旨を逸脱しない範囲で種々変形実施可能であるのは勿論である。 In the above-described embodiment, the case where the present invention is applied to the voice pattern recognition field has been described. However, the present invention can be similarly applied to the pattern recognition field such as characters and figures. Of course, various modifications can be made without departing from the scope of the present invention.

１０…パターンマッチング装置、１１…音声分析部、１２…パターン圧縮部、１３…圧縮情報生成部、１４…記憶部、１５…パターンマッチング処理部。 DESCRIPTION OF SYMBOLS 10 ... Pattern matching apparatus, 11 ... Speech analysis part, 12 ... Pattern compression part, 13 ... Compression information generation part, 14 ... Memory | storage part, 15 ... Pattern matching process part

Claims

In the pattern matching device that calculates the inter-pattern distance between the standard pattern and the input pattern indicated by the time series of each feature element, and outputs the inter-pattern distance as a recognition result,
Pattern compressing means for compressing the standard pattern by integrating, for each feature element constituting the standard pattern, one that approximates an adjacent feature element into one feature element;
Compression information generating means for generating, as compressed information, a series of compression ratios for each feature element of the compressed standard pattern compressed by the pattern compressing means;
Storage means for storing the compression standard pattern compressed by the pattern compression means in association with the compression information generated by the compression information generation means for the compression standard pattern;
The distance between patterns of the compression standard pattern stored by the storage unit and the input pattern is a dynamic programming method having as a variable the compression information generated by the compression information generation unit for the compression standard pattern. A distance calculating means for calculating by a recurrence formula;
A pattern matching apparatus comprising:

The recurrence formula used in the distance calculation means is
The inter-element distance between the feature element a _{i of the} input pattern and the feature element b _j of the compressed standard pattern is d (i, j), and the cumulative inter-element distance between the input pattern and the compressed standard pattern is g (i , J), and compression information that is a sequence of the compression ratio n is n _j ,

The pattern matching apparatus according to claim 1, wherein

The pattern matching apparatus according to claim 1, wherein

The pattern matching apparatus according to claim 1, wherein

A pattern matching method in a pattern matching device that calculates a distance between patterns of a standard pattern and an input pattern that are each shown in time series of feature elements, and outputs the distance between the patterns as a recognition result,
For each feature element constituting the standard pattern, a compression step of compressing the standard pattern by integrating one feature element that approximates an adjacent feature element;
A generation step of generating, as compression information, a series of compression ratios for each feature element of the compression standard pattern compressed by the pattern compression unit;
Recursion of dynamic programming having as a variable the inter-pattern distance between the compressed standard pattern compressed in the compression step and the input pattern, with the compression information generated by the generation step for the compressed standard pattern as a variable A calculation step calculated by an equation;
A pattern matching method comprising: