JPH0574836B2

Info

Publication number: JPH0574836B2
Application number: JP60117135A
Authority: JP
Inventors: Shichiro Tsuruta
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1985-05-30
Filing date: 1985-05-30
Publication date: 1993-10-19
Also published as: JPS61275800A

Description

[Detailed Description of the Invention] (Field of Industrial Application) The present invention relates to a pattern recognition device.

(Prior Art and Its Problems) A pattern recognition device takes a pattern expressed as a time series of feature vectors, such as speech, computes its similarity to standard patterns by a comparison operation (matching), and thereby recognizes the pattern. A speech recognition device, for example, registers the speech patterns to be recognized in advance as standard patterns. During recognition, it compares the unknown input speech pattern with each standard pattern, examines the degree of agreement, selects the best-matching standard pattern, and makes the recognition decision. In this pattern matching approach it is important to use a measure that is stable against variations inherent in the patterns, such as the non-uniform expansion and contraction of the time axis caused by changes in speaking rate. This point is discussed in detail in the Acoustical Society of Japan Speech Research Group document S73-22 (issued December 1973), "Comparison of various DP matching methods in speech recognition" (hereinafter, document (1)). As an efficient and accurate way to realize such a measure, the DP matching method has been proposed; it uses dynamic programming (hereinafter, the DP method) to compute an inter-pattern distance in which time-axis variation is normalized.

This matching method presupposes that the patterns to be matched are time-series vectors sampled at a fixed period. The storage needed for each pattern therefore corresponds to the full length of that pattern. A word speech recognition device must store one pattern per word, which requires an enormous storage capacity. The computation time also grows in proportion to the pattern length, so high-speed computation is required.

Speech patterns, on the other hand, show little temporal change in frequency structure in steady portions such as vowels, and large change in transitional portions such as vowel-to-consonant transitions. A device that exploits this property to compress each pattern and reduce storage is known (document (2), Japanese Unexamined Patent Publication No. S58-137899). The pattern compression method of document (2) is outlined below.
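As a rough illustration of the DP matching idea described above, the following Python sketch computes a time-normalized distance between two feature-vector sequences. The function name `dp_matching_distance`, the Euclidean local distance, the three-way step pattern, and the normalization by the summed lengths are all assumptions chosen for illustration; the source does not specify them.

```python
import math


def dp_matching_distance(a, b):
    """Time-normalized distance between two sequences of feature vectors.

    Assumptions (not taken from the source): Euclidean local distance,
    steps from (i-1, j), (i, j-1) and (i-1, j-1), and normalization by
    len(a) + len(b).
    """
    I, J = len(a), len(b)
    inf = float("inf")
    # g[i][j] = minimum accumulated distance aligning a[:i] with b[:j]
    g = [[inf] * (J + 1) for _ in range(I + 1)]
    g[0][0] = 0.0
    for i in range(1, I + 1):
        for j in range(1, J + 1):
            d = math.dist(a[i - 1], b[j - 1])
            g[i][j] = d + min(g[i - 1][j], g[i][j - 1], g[i - 1][j - 1])
    return g[I][J] / (I + J)
```

With this sketch, a pattern and a copy of it spoken at half speed (every frame repeated twice) have distance 0, which is the kind of insensitivity to time-axis stretching the passage asks for. Both the table size and the running time grow with the pattern lengths, which is the cost the compression methods below try to reduce.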

A speech pattern is obtained by sampling the output of a frequency analyzer or the like at a fixed period, and is expressed as a time series of feature vectors

$$B = b_1, b_2, \ldots, b_j, \ldots, b_J \tag{1}$$

where $b_j$ is the feature vector at time $j$. As shown in Fig. 1(a), pattern $B$ of equation (1) is divided into $K$ sections by $K+1$ breakpoints $l_1, \ldots, l_{K+1}$, and the sequence of central vectors $b'_1, b'_2, \ldots, b'_K$ of the sections is taken as the compressed pattern $C$. The central vector $b'_k$ is the vector at the central position of the $k$-th section $(l_k, l_{k+1})$, extracted as the representative vector of that section; that is, $b'_k = b_{(l_k + l_{k+1})/2}$.

Writing the magnitude of the vector error between the representative vector $b'_k$ and a vector $b_j$ contained in the section $(l_k, l_{k+1})$ as $\|b'_k - b_j\|$, the breakpoint sequence $\{l_k\}$ is chosen optimally so that the vector error over the sections is minimized. That is, the optimal breakpoint sequence $\{l_k\}$ is obtained by solving the minimization problem

$$T = \min_{\{l_k\}} \sum_{k=1}^{K+1} \sum_{j=l_k+1}^{l_{k+1}} \|b'_k - b_j\| \tag{2}$$

Equation (2) is evaluated by dynamic programming as follows.

Starting from the initial value

$$T(0, 0) = 0 \tag{3}$$

the recurrence

$$T(m, k) = \min_{m-\delta \le l < m} \left[ T(l-1, k-1) + \sum_{j=l}^{m} \|b_{(l+m)/2} - b_j\| \right] \tag{4}$$

is evaluated successively for $m = 1, \ldots, J$ and $k = 1, \ldots, K+1$, within the range

$$m - \delta \le l \le m - 1 \tag{5}$$

This yields, as the optimal breakpoint sequence, the sequence $\{l_k\}$ that divides the pattern into $K$ sections such that the error

$$d(b'_k, b_j) = \sum_{j=l_k+1}^{l_{k+1}} \|b'_k - b_j\|$$

between the central vector $b'_k$ of each section $(l_k, l_{k+1})$ and the vectors $b_j$ that are dropped is minimized over all sections. Once the optimal breakpoint sequence $\{l_k\}$ is found, the central vector $c_k = b_{(l_k + l_{k+1})/2}$ between each pair of breakpoints is obtained, giving the optimally compressed pattern $C = c_1, c_2, \ldots, c_k, \ldots, c_K$. Each vector $c_k$ of the compressed pattern $C$ serves as the representative vector of its section, and recognition is performed by pattern matching against an input pattern supplied later. Specifically, the redundant vectors that were thinned out of the compressed pattern are interpolated from the extracted representative vectors, so that the pattern is restored on the time axis it had before compression, and recognition is then carried out by the DP matching method.

Because the compressed pattern obtained by this method is the closest approximation to the pattern before compression, patterns can be compressed without lowering the recognition rate.

However, obtaining a compressed pattern by this method requires evaluating the recurrence given by equations (3) to (5). When, for example, both the input pattern and the standard pattern are compressed before recognition, a high-speed arithmetic circuit is required. The circuit that performs the compression is therefore not necessarily inexpensive.

(Object of the Invention) The object of the present invention is to eliminate these drawbacks of the prior art and to provide a pattern recognition device that performs pattern compression with a very small amount of computation and substantially reduces both storage capacity and computation.

(Structure of the Invention) According to the present invention, there is provided a pattern recognition device comprising: a pattern memory that holds a standard pattern expressed as a sequence of feature vectors $\{b_j\}$ $(j = 1, 2, \ldots, J)$; a compression processing unit that calculates the pattern change amount $D_j$ $(j = 1, \ldots, J)$ of the vector sequence $\{b_j\}$ stored in the pattern memory over the entire pattern, divides the vector sequence $\{b_j\}$ into $K$ sections so that the total change amount $D_J$ of the pattern change amount $D_j$ is divided into $K$ equal parts, where the number of divisions $K$ is determined from the pattern length $J$, determines the resulting breakpoint sequence $\{l_k\}$, reads from the pattern memory the vector $b'_k$ at the central position of each section defined by $\{l_k\}$, and outputs these vectors as the representative vectors of the sections in the form of a compressed pattern $C = c_1, c_2, \ldots, c_K$; a compressed pattern memory that stores the compressed pattern $C$; and a recognition processing unit that compares an arbitrary input pattern with the compressed pattern.

Further according to the present invention, there is provided a pattern recognition device comprising: a pattern memory that holds a standard pattern expressed as a sequence of feature vectors $\{b_j\}$ $(j = 1, 2, \ldots, J)$; a compression processing unit that calculates the pattern change amount $D_j$ $(j = 1, 2, \ldots, J)$ of the vector sequence $\{b_j\}$ stored in the pattern memory over the entire pattern, divides the vector sequence $\{b_j\}$ into $K$ sections so that the total pattern change amount $D_J$ is divided into $K$ equal parts, where the number of divisions $K$ is determined from the total change amount $D_J$ of the pattern change amount $D_j$, determines the resulting breakpoint sequence $\{l_k\}$, reads from the pattern memory the vector $b'_k$ at the central position of each section defined by $\{l_k\}$, and outputs these vectors as the representative vectors of the sections in the form of a compressed pattern $C = c_1, c_2, \ldots, c_K$; a compressed pattern memory that stores the compressed pattern $C$; and a recognition processing unit that compares an arbitrary input pattern with the compressed pattern.

(Detailed Description of the Structure) The present invention is described in detail below with reference to the drawings. The input pattern $A$ is expressed as a sequence of feature vectors representing the frequency structure of the input:

$$A = a_1, a_2, \ldots, a_i, \ldots, a_I \tag{6}$$

The standard pattern $B$ to be compared is written

$$B = b_1, b_2, \ldots, b_j, \ldots, b_J \tag{7}$$

In the present invention, as shown in Fig. 1, the standard pattern $B$ is divided into $K$ sections by $K+1$ breakpoints $l_1, l_2, \ldots, l_k, \ldots, l_{K+1}$, and the sequence of vectors $b'_1, b'_2, \ldots, b'_K$ taken from the sections $(l_k, l_{k+1})$ is used as the compressed pattern $C$. The vector $b'_k$ of each section $(l_k, l_{k+1})$ is the vector at the position midway between breakpoints $l_k$ and $l_{k+1}$, namely $b_{(l_k + l_{k+1})/2}$, extracted as the representative vector of that section.

The breakpoint sequence $\{l_k\}$ is determined as follows. As shown in Fig. 1(b), with the inter-vector distance written as $d_j = \|b_j - b_{j-1}\|$, the pattern change amount $D_j$ of standard pattern $B$ is the sum of these distances over the pattern up to $j$:

$$D_j = \sum_{m=1}^{j} d_m \tag{8}$$

The breakpoints $l_1, \ldots, l_{K+1}$ are determined by dividing standard pattern $B$ into $K$ sections so that the total change amount $D_J$ of the pattern change amount $D_j$ over the entire pattern is divided into $K$ equal parts. That is, letting $\Delta D$ be the equally divided change amount, then, as shown in Fig. 1(b), the points at which the change amount $D_j$, measured from the start of the vector sequence $\{b_j\}$, exceeds successive multiples of $\Delta D$ are taken as $l_2, l_3, \ldots, l_k, \ldots, l_{K+1}$. The breakpoint sequence is written

$$\{l_k\} = l_1, l_2, \ldots, l_k, \ldots, l_{K+1} \tag{9}$$

Writing the central vector $b'_k$ between breakpoints as $c_k$, the compressed pattern $C$ is

$$C = c_1, c_2, \ldots, c_k, \ldots, c_K \tag{10}$$

By setting $K$ smaller than $J$, the compressed pattern $C$ can be stored in a smaller memory than standard pattern $B$. This is the principle of the present invention.
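The principle above can be sketched in a few lines of Python. This is only an illustrative reading, not the patented circuit: the function name `compress_pattern` is hypothetical, indices are 0-based, the change amount at the first frame is taken as 0 (the text does not define $b_0$), the last breakpoint is forced to the final frame, the midpoint index is rounded down, and the norm is assumed Euclidean.

```python
import math


def compress_pattern(b, K):
    """Pick K representative vectors by equal division of cumulative change.

    b: list of feature vectors (sequences of floats), K: number of sections.
    Returns (breaks, reps) with K + 1 breakpoints and K representatives.
    """
    J = len(b)
    # D[j]: cumulative inter-vector distance up to frame j (D[0] = 0, assumed)
    D = [0.0] * J
    for j in range(1, J):
        D[j] = D[j - 1] + math.dist(b[j], b[j - 1])
    delta = D[-1] / K  # equally divided change amount
    # First breakpoint is the first frame; the next ones are the first
    # frames at which D exceeds k * delta; the last is the final frame.
    breaks = [0]
    j = 0
    for k in range(1, K):
        while j < J - 1 and D[j] <= k * delta:
            j += 1
        breaks.append(j)
    breaks.append(J - 1)
    # Representative of each section: the vector midway between breakpoints.
    reps = [b[(breaks[k] + breaks[k + 1]) // 2] for k in range(K)]
    return breaks, reps


# A steady stretch followed by a rapid transition (hypothetical data).
frames = [(0.0,), (0.0,), (0.0,), (0.0,), (1.0,), (2.0,), (3.0,), (4.0,)]
breaks, reps = compress_pattern(frames, 4)
print(breaks, reps)
```

In this toy input the four steady frames end up represented by a single vector while the transition receives the other three, and the whole procedure is one pass to accumulate distances plus one pass to locate breakpoints, with no recurrence to minimize.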

(Embodiments) Fig. 2 is a block diagram of a first embodiment of the present invention, which operates on the principle described above. Fig. 3 shows the compression processing unit 40 in detail. A speech signal input from a microphone or the like 10 is converted by the speech analysis unit 20 into a time-series pattern of feature vectors representing the frequency structure, as in equations (6) and (7). This pattern is stored in the speech pattern memory 30. When a standard pattern is being created from the input speech, the speech pattern memory 30 holds the standard pattern $B$. The compression processing unit 40 reads the vectors $b_j$ and $b_{j-1}$ from the speech pattern memory 30, computes the inter-vector distance $d_j = \|b_j - b_{j-1}\|$ in the distance calculation unit 41, and accumulates it in the accumulation unit 42 to obtain the pattern change amount $D_j$ of equation (8). Repeating this for $j$ from 1 to $J$ gives the pattern change amount $D_j$ $(j = 1, \ldots, J)$ over the entire pattern, which is stored in the pattern change amount memory 43. Meanwhile, the control unit 70 determines the compression ratio on the basis of the standard pattern length $J$.

Next, in the compression processing section shown in Fig. 3, the compression ratio determination circuit 71 determines the number of compressed vectors $K$ using the constant $C_J$. Once $K$ is determined, the section change amount determination circuit 72 divides the total change amount $D_J$ over the entire pattern into $K$ equal parts to obtain the equally divided change amount $\Delta D$. With $\Delta D$ as the reference, $k$ is stepped from 1 to $K$, and the multiplication circuit 73 computes $k\Delta D$ and outputs it to the comparison unit 47 of the compression processing unit. The comparison unit 47 steps $j$ upward from 1, compares the pattern change amount $D_j$ with $k\Delta D$, and takes the point at which $D_j$ exceeds $k\Delta D$ as the breakpoint $j = l_k$. Each time a breakpoint is determined, $k$ is incremented by 1, and processing continues up to $k = K$, which determines the breakpoint sequence $\{l_k\}_{k=1,\ldots,K}$. From this breakpoint sequence, the central vector between breakpoints of the standard pattern is determined as the representative vector $b'_k$ and stored in the compressed pattern memory 50 as $\{c_k\}$. Each vector $c_k$ of the compressed pattern serves as the representative vector of its section, and the recognition processing unit 60 performs a well-known pattern matching method between the compressed pattern and an input pattern $A$ supplied later.

In the first embodiment, the number of compressed vectors is determined from the pattern length $J$ of the pattern to be compressed. With this method, $K$ is fixed by the pattern length $J$ alone, regardless of the total change amount of the pattern. Consequently, little compression is achieved for so-called flat patterns, whose total change amount is small. The principle of a second embodiment that improves on this point is explained with reference to Fig. 4.

Figs. 4(a) and 4(b) illustrate the pattern compression method of the first embodiment, and Figs. 4(c) and 4(d) illustrate that of the second embodiment. In the first invention, the number of breakpoints $K$ is determined by dividing the pattern length $J$ by a constant $C_J$ that sets the compression ratio: $K = J / C_J$. $K$ is thus determined uniformly from the pattern length $J$. Fig. 4(a) shows the case $K = 6$. Fig. 4(c) shows the compression of a pattern $B_2$ of the same length $J$ as pattern $B_1$ of Fig. 4(a); the dotted line shows the result of compression according to the first embodiment. Figs. 4(b) and 4(d) show the total change amounts $D_{1J}$ and $D_{2J}$ of patterns $B_1$ and $B_2$. For the same compression ratio constant $C_J$, the equally divided change amounts $\Delta D_1$ and $\Delta D_2$ depend on the amount of variation in each pattern. Here pattern $B_2$ varies more than pattern $B_1$, so $\Delta D_2 > \Delta D_1$. With the compression method of the first embodiment, therefore, how closely the compressed pattern approximates the original depends strongly on the amount of variation within the pattern.

The second embodiment realizes a compression method that does not depend on the amount of variation within the pattern. In Figs. 4(c) and 4(d), if the equally divided change amount $\Delta D$ is given as a constant value $C_D$ and the number of compressed vectors is determined as $K = D_{2J} / C_D$, then $\Delta D$ becomes independent of the pattern length $J$. That is, if the equally divided change amount $\Delta D$ $(= D_J / K)$ is fixed at the constant $C_D$ and the breakpoint sequence $\{l_k\}$ is determined by the method of the first invention, a compressed pattern with a constant degree of approximation is obtained regardless of the amount of pattern variation.
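The difference between the two ways of choosing $K$ can be seen with a small numerical sketch. The helper names, the rounding rules, and every numeric value below are hypothetical; the source gives only the relations $K = J / C_J$ and $K = D_J / C_D$.

```python
def k_from_length(J, C_J):
    """First embodiment: K = J / C_J (rounding down is an assumption)."""
    return max(1, J // C_J)


def k_from_total_change(D_J, C_D):
    """Second embodiment: K = D_J / C_D (rounding to nearest is an assumption)."""
    return max(1, round(D_J / C_D))


# Hypothetical values: two patterns of equal length, B2 varying more than B1.
J, C_J, C_D = 60, 10, 2.0
D_1J, D_2J = 12.0, 30.0

# First embodiment: same K for both, so the change per section differs.
K1 = K2 = k_from_length(J, C_J)
print(K1, K2, D_1J / K1, D_2J / K2)

# Second embodiment: K follows the total change, so the change per
# section equals C_D for both patterns.
K1 = k_from_total_change(D_1J, C_D)
K2 = k_from_total_change(D_2J, C_D)
print(K1, K2, D_1J / K1, D_2J / K2)
```

Under these made-up numbers the first rule assigns six sections to both patterns, while the second assigns more sections to the pattern with the larger total change, keeping the change covered by each representative vector the same.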

The second embodiment is described below. Fig. 5 shows the compression processing unit 40 and control unit 70 according to the present invention in detail. The other parts function as in Fig. 2 for the first invention. Suppose the speech pattern memory 30 holds the standard pattern $B$. As in the first embodiment, the pattern change amount $D_j$ of equation (8) is calculated and stored in the pattern change amount memory 43. In the control unit 70, the compressed vector number determination circuit 74 determines the number of compressed vectors $K$ by dividing the total pattern change amount $D_J$ by the constant $C_D$, which sets the equally divided change amount. Once $K$ is determined, the subsequent processing is exactly the same as in the first embodiment: the breakpoint sequence $\{l_k\}_{k=1,\ldots,K}$ and the compressed pattern $\{c_k\}$ are stored in the compressed pattern memory 50, and the recognition processing unit 60 performs pattern matching.

(Effects of the Invention) In these embodiments, a standard pattern can be compressed into representative vectors with a very small amount of computation, so both memory and computation are substantially reduced compared with the conventional method. The error introduced by this compression is small and does not lower the recognition rate. The same configuration can be applied to feature vector sequences other than speech patterns, likewise reducing memory and computation substantially without lowering the recognition rate.

[Brief explanation of drawings]

Fig. 1 is a diagram explaining the principle of the compression processing of the present invention; Fig. 2 is a block diagram of the first embodiment of the present invention; Fig. 3 is a configuration diagram of the compression processing unit and control unit; Fig. 4 is a diagram explaining the principle of the second embodiment of the present invention; and Fig. 5 is a diagram showing an example configuration of the compression processing unit and control unit in the second embodiment of the present invention.

In the figures: 10, input terminal; 20, speech analysis unit; 30, speech pattern memory; 40, compression processing unit; 50, compressed pattern memory; 60, recognition processing unit; 70, control unit; 41, distance calculation unit; 42, accumulation unit; 43, pattern change amount memory; 44, comparison unit; 71, compression ratio determination circuit; 72, section change amount determination circuit; 73, multiplication circuit; 74, compressed vector number determination circuit.

Claims

[Claims]

1. A pattern recognition device comprising: a pattern memory that holds a standard pattern expressed as a sequence of feature vectors $\{b_j\}$ $(j = 1, 2, \ldots, J)$; a compression processing unit that calculates the pattern change amount $D_j$ $(j = 1, \ldots, J)$ of the vector sequence $\{b_j\}$ stored in the pattern memory over the entire pattern, divides the vector sequence $\{b_j\}$ into $K$ sections so that the total change amount $D_J$ of the pattern change amount $D_j$ is divided into $K$ equal parts, where the number of divisions $K$ is determined from the pattern length $J$, determines the breakpoint sequence $\{l_k\}$ obtained by this operation, reads from the pattern memory the vector $b'_k$ at the central position of each section defined by $\{l_k\}$, and outputs a compressed pattern $C = c_1, c_2, \ldots, c_K$ consisting of these vectors as the representative vectors of the sections; a compressed pattern memory that stores the compressed pattern $C$; and a recognition processing unit that compares an arbitrary input pattern with the compressed pattern.

2. A pattern recognition device comprising: a pattern memory that holds a standard pattern expressed as a sequence of feature vectors $\{b_j\}$ $(j = 1, 2, \ldots, J)$; a compression processing unit that calculates the pattern change amount $D_j$ $(j = 1, 2, \ldots, J)$ of the vector sequence $\{b_j\}$ stored in the pattern memory over the entire pattern, divides the vector sequence $\{b_j\}$ into $K$ sections so that the total pattern change amount $D_J$ is divided into $K$ equal parts, where the number of divisions $K$ is determined from the total change amount $D_J$ of the pattern change amount $D_j$, determines the breakpoint sequence $\{l_k\}$ obtained by this operation, reads from the pattern memory the vector $b'_k$ at the central position of each section defined by $\{l_k\}$, and outputs a compressed pattern $C = c_1, c_2, \ldots, c_K$ consisting of these vectors as the representative vectors of the sections; a compressed pattern memory that stores the compressed pattern $C$; and a recognition processing unit that compares an arbitrary input pattern with the compressed pattern.