JPS6258520B2

Info

Publication number: JPS6258520B2
Application number: JP2873882A
Authority: JP
Inventors: Taizo Iijima; Masato Akagi
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1982-02-26
Filing date: 1982-02-26
Publication date: 1987-12-07
Also published as: JPS5911000A

Description

[Detailed Description of the Invention]

[Technical Field of the Invention]

The present invention relates to a signal waveform recognition method. In the underlying signal waveform recognition device, L filters whose impulse responses are the linear prediction coefficients obtained from L kinds of standard waveform sequences are arranged in parallel; when an unknown signal waveform sequence is input, the index of the filter whose output has the smallest sum of squares is taken as the category of the input sequence. To prevent misrecognition caused by fluctuations in the filter outputs, between one and K mutually orthonormal sequences are prepared in advance for each of the L filter outputs, and the components in the directions indicated by those sequences are removed from the filter output, which reduces the fluctuation added to the filter output.

[Technical background of the invention]

There are various signal waveform recognition methods, one of which is called the residual waveform power method. Because the present invention extends and improves that method, the residual waveform power method is described first.

Let the signal waveform sequence $\{x_n\}$ be the impulse response of a filter whose transfer function has the poles $\{\alpha_n\}_{n=1}^{P}$. Writing the transfer function as $G(z)$, the source gives $G(z)$ as equation (1) and a consequence for $\{x_n\}$ as equation (2); neither equation is reproduced in this text.
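The equations for this step did not survive in the text. As a hedged sketch only, assuming the usual all-pole form with unit gain and distinct poles (an assumption, not taken from the source), the transfer function and its impulse response can be written as:

```latex
G(z) = \prod_{n=1}^{P} \frac{1}{1 - \alpha_n z^{-1}},
\qquad
x_n = \sum_{k=1}^{P} c_k\, \alpha_k^{\,n} \quad (n \ge 0),
\qquad
c_k = \prod_{j \ne k} \frac{1}{1 - \alpha_j / \alpha_k}.
```

For $P = 1$ this reduces to $x_n = \alpha_1^{\,n}$, a single decaying (or oscillating) mode.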

The filter that whitens the signal waveform sequence $\{x_n\}$ is the filter $A(z)$ whose characteristic is the inverse of $G(z)$ (equation (3), not reproduced in this text). Here $\{a_k\}_{k=1}^{P}$ are the linear prediction coefficients. Passing the signal waveform sequence $\{x_n\}$ through the filter $A(z)$ gives equation (4) (not reproduced). Because an actual waveform contains a disturbance term, the relation is written with a residual $\{e_n\}$ as equation (5) (not reproduced).
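The whitening step can be illustrated with a minimal sketch. It assumes the sign convention $A(z) = 1 + a_1 z^{-1} + \dots + a_P z^{-P}$ and zero initial conditions; both are assumptions, since the source equations are not reproduced. The function name `inverse_filter` is hypothetical.

```python
def inverse_filter(x, a):
    """Residual e[n] = x[n] + sum_k a[k-1] * x[n-k], with x[n] = 0 for n < 0.

    Assumed sign convention: A(z) = 1 + a_1 z^-1 + ... + a_P z^-P.
    """
    e = []
    for n in range(len(x)):
        acc = x[n]
        for k, a_k in enumerate(a, start=1):
            if n - k >= 0:
                acc += a_k * x[n - k]
        e.append(acc)
    return e


# Impulse response of G(z) = 1 / (1 - 0.5 z^-1) is x[n] = 0.5**n,
# so the matching inverse filter has a_1 = -0.5.
x = [0.5 ** n for n in range(6)]
e = inverse_filter(x, [-0.5])
```

With the matching coefficients, the residual of a noise-free impulse response collapses to a single unit sample followed by zeros, which is the sense in which the filter whitens the sequence.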

The linear prediction coefficients $\{a_k\}$ are chosen in the first place to minimize the sum of squares of the residual, so the sum of squares of the output of the inverse filter $A(z)$ is minimal for the waveform from which the coefficients were derived. The residual waveform power method applies this principle.
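As a sketch of what choosing the coefficients "to minimize the sum of squares" amounts to, assume the sign convention $e_n = x_n + \sum_k a_k x_{n-k}$ (an assumption; the source equations are not reproduced). The coefficients then solve the usual least-squares normal equations:

```latex
J(a_1,\dots,a_P) = \sum_n \Bigl( x_n + \sum_{k=1}^{P} a_k x_{n-k} \Bigr)^{2},
\qquad
\frac{\partial J}{\partial a_j} = 0
\;\Longrightarrow\;
\sum_{k=1}^{P} a_k \sum_n x_{n-k}\, x_{n-j} = -\sum_n x_n\, x_{n-j},
\quad j = 1,\dots,P.
```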

Suppose there are L categories of signal waveform sequences. For each category $l$, linear prediction coefficients $\{a_k^{(l)}\}_{k=1}^{P}$, $l = 1, \dots, L$, are determined and an inverse filter $A^{(l)}(z)$ is built. Suppose the signal waveform sequence $\{x_n^{(l)}\}$ is the output of the filter $G^{(l)}(z) = 1/A^{(l)}(z)$. If the output of the filter $A^{(l)}(z)$ is $\{e_n^{(l)}\}$, equation (6) holds; if another filter $A^{(l')}(z)$ is used, its output $\{e_n^{(l')}\}$ satisfies equation (7) (neither equation is reproduced in this text). Between $e_n^{(l)}$ and $e_n^{(l')}$ there is the inequality (8), under which the matched filter gives the smaller sum of squares, so the index of the inverse filter with the smallest output can be taken as the category to which the signal waveform sequence belongs.
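The decision rule of the residual waveform power method can be sketched as follows. The function names, the dictionary of filters, and the two first-order example categories are hypothetical; the sign convention $e_n = x_n + \sum_k a_k x_{n-k}$ is an assumption.

```python
def residual_power(x, a):
    """Sum of squares of the inverse-filter output for coefficients a.

    Assumed convention: e[n] = x[n] + sum_k a[k-1] * x[n-k], x[n] = 0 for n < 0.
    """
    total = 0.0
    for n in range(len(x)):
        e_n = x[n] + sum(a_k * x[n - k]
                         for k, a_k in enumerate(a, start=1) if n - k >= 0)
        total += e_n * e_n
    return total


def classify_by_residual_power(x, filters):
    """Return the key of the inverse filter with the smallest residual power."""
    return min(filters, key=lambda label: residual_power(x, filters[label]))


# Two hypothetical first-order categories with poles at 0.5 and -0.5.
filters = {"A": [-0.5], "B": [0.5]}
x = [0.5 ** n for n in range(8)]      # waveform generated by category "A"
label = classify_by_residual_power(x, filters)
```

Because the coefficients are fixed in advance, recognition only needs L filtering passes and a minimum search, which is the speed advantage the text attributes to the method.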

[Problems with background technology]

Because the linear prediction coefficients are prepared in advance, the above method needs no procedure for computing them during recognition, which gives it the advantage of fast recognition processing.

However, if the generating filter $G^{(l)}(z)$ of a signal waveform sequence fluctuates, the waveform $\{x_n^{(l)}\}$ fluctuates as well. Evaluating equations (6) and (7) for the fluctuated waveform $\{\tilde{x}_n^{(l)}\}$ gives equation (9), but the inequality (10) that corresponds to (8) does not necessarily hold (equations (9) and (10) are not reproduced in this text). If the inequality is reversed, misrecognition results.

[Purpose of the invention]

An object of the present invention is to realize a recognition device that remedies this drawback of the residual waveform power method while retaining its advantages.

[Summary and effects of the invention]

The present invention is a recognition method in which an input signal waveform sequence of unknown class is passed through filters whose impulse response waveforms are the linear prediction coefficients of L kinds of standard signal waveform sequences of known class to obtain residual waveform sequences, and the class to which the input signal waveform sequence belongs is determined from the sum-of-squares output of each residual waveform sequence. The method is characterized in that one or more auxiliary waveform sequences, set in advance for each of the L classes and used to remove the fluctuation component of the input signal waveform sequence, are generated; the inner product of each auxiliary waveform sequence with the residual waveform sequence is obtained; the sum of squares of these inner products is subtracted from the sum-of-squares output of the residual waveform sequence; and the result of the subtraction is used to determine the class to which the input signal waveform sequence belongs.

Accordingly, in a speech recognition system in which speech waveforms are supplied as the input signal waveform sequence, the influence of individual differences is removed and highly accurate identification becomes possible, particularly for unspecified speakers.

The method is not limited to speech waveforms: when identifying input signal waveforms of various kinds, it provides identification that is robust to deformation and fluctuation of the waveform.

[Embodiments of the invention]

Before the concrete configuration of the device of the present invention is explained, the identification method that the device achieves is described.

Suppose that the vector $\mathbf{e}_i^{(l)} = (e_{i1}, e_{i2}, \dots, e_{iN})^{T}$, formed from the residual waveform sequence obtained when a signal waveform sequence generated from category $l$ is passed through its inverse filter $A^{(l)}(z)$, can be Fourier-expanded in orthonormal vectors $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{N}$. Consider how to choose the $\boldsymbol{\mu}_k^{(l)}$. Let the matrix formed from the vectors $\mathbf{e}_i^{(l)}$ be

$E_i^{(l)} = (\mathbf{e}_i^{(l)}\ \mathbf{e}_{i+1}^{(l)}\ \cdots\ \mathbf{e}_{i+N-1}^{(l)})$ (12)

Then the covariance matrix $V_i^{(l)}$ is

$V_i^{(l)} = E_i^{(l)T} E_i^{(l)}$ (13)

Averaging the covariance matrices $V_i^{(l)}$ over $I$ values of $i$ (equation (14), not reproduced in this text) gives $V^{(l)}$, the mean of the P-th order covariance. Here $I$ is an arbitrary large number.
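The averaging step can be sketched as below. The indexing is an assumption: each frame $\mathbf{e}_i$ is taken to be the length-N window of the residual starting at sample $i$, so that $E_i$ has entries `e[i + r + c]`. The source does not state how $e_{in}$ maps to samples, and the function name `averaged_covariance` is hypothetical.

```python
def averaged_covariance(e, N, I):
    """Average V_i = E_i^T E_i over i = 0..I-1.

    Assumption: frame e_i is the length-N window e[i:i+N], so
    E_i = (e_i, e_{i+1}, ..., e_{i+N-1}) has entries E_i[r][c] = e[i+r+c].
    """
    if len(e) < I + 2 * N - 2:
        raise ValueError("residual too short for the requested N and I")
    V = [[0.0] * N for _ in range(N)]
    for i in range(I):
        for p in range(N):
            for q in range(N):
                V[p][q] += sum(e[i + r + p] * e[i + r + q] for r in range(N))
    return [[v / I for v in row] for row in V]


e = [1.0, -1.0] * 6          # toy alternating residual, length 12
V = averaged_covariance(e, N=2, I=4)
```

For the alternating toy residual, neighbouring samples are perfectly anti-correlated, which shows up as negative off-diagonal entries in `V`.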

Solving the eigenvalue problem of $V^{(l)}$,

$V^{(l)} \boldsymbol{\mu}_k^{(l)} = \lambda_k^{(l)} \boldsymbol{\mu}_k^{(l)}, \quad \|\boldsymbol{\mu}_k^{(l)}\| = 1, \quad k = 1, 2, \dots, N$ (15)

each eigenvalue $\lambda_k^{(l)}$ represents the variance in the direction of $\boldsymbol{\mu}_k^{(l)}$. The K eigenvectors with the largest eigenvalues are therefore adopted. Setting

$d_k^{(l)} = (\mathbf{e}^{(l)}, \boldsymbol{\mu}_k^{(l)}), \quad k = 1, 2, \dots, K$ (16)

any residual sequence vector of category $l$ can be Fourier-expanded as

$\mathbf{e}^{(l)} = \boldsymbol{\mu}^{(l)} \mathbf{d}^{(l)} + O$ (17)

where $\boldsymbol{\mu}^{(l)} = (\boldsymbol{\mu}_1^{(l)}\ \boldsymbol{\mu}_2^{(l)}\ \cdots\ \boldsymbol{\mu}_K^{(l)})$ and $\mathbf{d}^{(l)} = (d_1^{(l)}\ d_2^{(l)}\ \cdots\ d_K^{(l)})^{T}$.
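Selecting the K directions of largest variance can be sketched with power iteration and deflation. The solver is an illustrative choice that is not specified in the source, and the function name `top_k_eigenvectors` is hypothetical. It assumes a symmetric positive semidefinite matrix with well-separated leading eigenvalues.

```python
import math


def top_k_eigenvectors(V, K, iterations=500):
    """Return [(eigenvalue, unit eigenvector), ...] for the K largest eigenvalues.

    Power iteration with deflation; V must be symmetric positive semidefinite.
    The solver is an illustrative choice, not taken from the source.
    """
    n = len(V)
    A = [row[:] for row in V]                       # working copy, deflated below
    pairs = []
    for _k in range(K):
        v = [1.0 / (j + 1) for j in range(n)]       # deterministic start vector
        start_norm = math.sqrt(sum(t * t for t in v))
        v = [t / start_norm for t in v]
        for _it in range(iterations):
            w = [sum(A[r][c] * v[c] for c in range(n)) for r in range(n)]
            norm = math.sqrt(sum(t * t for t in w))
            if norm == 0.0:                          # nothing left to extract
                break
            v = [t / norm for t in w]
        # Rayleigh quotient gives the eigenvalue for the unit vector v.
        lam = sum(v[r] * sum(A[r][c] * v[c] for c in range(n)) for r in range(n))
        pairs.append((lam, v))
        for r in range(n):                           # deflate: A -= lam * v v^T
            for c in range(n):
                A[r][c] -= lam * v[r] * v[c]
    return pairs


V = [[2.0, 0.0], [0.0, 1.0]]
(lam1, mu1), (lam2, mu2) = top_k_eigenvectors(V, K=2)
```

In practice a library eigensolver for symmetric matrices would normally replace this loop; the sketch only shows that the vectors come out ordered by decreasing eigenvalue and mutually orthonormal.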

From equation (15), N vectors $\boldsymbol{\mu}_k^{(l)}$ can formally be obtained, but if the order of the prediction filter is P, at most P vectors $\boldsymbol{\mu}_k^{(l)}$ can be computed.

Because $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{K}$ were chosen in decreasing order of variance, if the term $\boldsymbol{\mu}^{(l)} \mathbf{d}^{(l)}$ in equation (17) can be removed, the fluctuation of the sequence $\{e_n^{(l)}\}$ can be expected to become small.

If the components in the directions defined by $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{K}$ are removed from $\mathbf{e}^{(l)}$, the resulting vector $\boldsymbol{\varepsilon}^{(l)}$ has polarity with respect to the directions $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{K}$.

Computing

$\boldsymbol{\varepsilon}^{(l)} = \mathbf{e}^{(l)} - \boldsymbol{\mu}^{(l)} \mathbf{d}^{(l)}$ (18)

gives a vector $\boldsymbol{\varepsilon}^{(l)}$ with the above property, and in this sense $\boldsymbol{\varepsilon}_i^{(l)}$ is called the polarity error.

Judging by the squared norm of $\boldsymbol{\varepsilon}^{(l)}$ gives recognition that is more robust to fluctuation than judging by the squared norm of $\mathbf{e}^{(l)}$. Computing the squared norm of $\boldsymbol{\varepsilon}^{(l)}$, and using $(\boldsymbol{\mu}_k, \boldsymbol{\mu}_{k'}) = \delta_{kk'}$, the following equation holds:

$\|\boldsymbol{\varepsilon}^{(l)}\|^{2} = \|\mathbf{e}^{(l)}\|^{2} - \sum_{k=1}^{K} (\mathbf{e}^{(l)}, \boldsymbol{\mu}_k^{(l)})^{2}$ (19)
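The step from the polarity error to its squared norm can be spelled out as follows. This is a sketch that writes the polarity error as $\boldsymbol{\varepsilon}$ and the orthonormal vectors as $\boldsymbol{\mu}_k$; those symbol choices, and the exact form of the polarity error, are assumptions reconstructed from equations (16) and (17) and from the wording of the claim, with the category index $(l)$ dropped for brevity.

```latex
\begin{aligned}
\boldsymbol{\varepsilon} &= \mathbf{e} - \sum_{k=1}^{K} d_k \boldsymbol{\mu}_k,
  \qquad d_k = (\mathbf{e}, \boldsymbol{\mu}_k), \\
\|\boldsymbol{\varepsilon}\|^{2}
  &= \|\mathbf{e}\|^{2} - 2\sum_{k=1}^{K} d_k\,(\mathbf{e}, \boldsymbol{\mu}_k)
     + \sum_{k=1}^{K}\sum_{k'=1}^{K} d_k d_{k'}\,(\boldsymbol{\mu}_k, \boldsymbol{\mu}_{k'}) \\
  &= \|\mathbf{e}\|^{2} - 2\sum_{k=1}^{K} d_k^{2} + \sum_{k=1}^{K} d_k^{2}
   = \|\mathbf{e}\|^{2} - \sum_{k=1}^{K} (\mathbf{e}, \boldsymbol{\mu}_k)^{2}.
\end{aligned}
```

The double sum collapses to a single sum only because the $\boldsymbol{\mu}_k$ are orthonormal, which is why the auxiliary sequences must be prepared as an orthonormal set.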

The recognition method based on equation (19) is called the polarity error identification method. It resembles the residual waveform power method in that the prediction coefficients are prepared in advance, and it inherits that method's advantage. In addition, between one and K orthonormal vectors $\{\boldsymbol{\mu}_k^{(l)}\}$ are prepared in advance, and the squares of their inner products with the residual sequence are removed from the squared norm of the residual sequence. This increases the tolerance in the directions in which the variance is statistically large and makes the method robust to fluctuation.

A concrete embodiment of the device is described below with reference to the drawings. Figure 1 shows the overall processing.

The signal waveform sequence $\{x_n\}$ is sent to the polarity-error sum-of-squares calculation circuits 11, ..., 1l, ..., 1L, one for each of the L categories. Calculation circuit 1l evaluates equation (19) and outputs the squared norm of the polarity error, $\|\boldsymbol{\varepsilon}^{(l)}\|^{2}$. The following minimum value detection circuit 2 detects the $l$ that gives the minimum and outputs that $l$ as the recognition result.

Figure 2 is a concrete block diagram of circuit 1l, one of the polarity-error sum-of-squares calculation circuits shown in Figure 1.

The signal waveform sequence $x_n$ is sent to the inverse filter $A^{(l)}(z)$ 31, which outputs the residual waveform sequence $e_n^{(l)}$. Circuit 32 computes the squared norm of the residual waveform sequence (the residual waveform power method uses only this circuit); its output is $\|\mathbf{e}^{(l)}\|^{2}$. Circuit 33 computes the components of $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{K}$ contained in the squared norm of the residual waveform sequence, and is the part newly added by the present invention. The output of circuit 33 is subtracted from the output of circuit 32, and the result is the squared norm of the polarity error, $\|\boldsymbol{\varepsilon}^{(l)}\|^{2}$.

Figure 3 shows in detail circuit 33 of Figure 2, which computes the components of $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{K}$ contained in the squared norm of the residual waveform sequence.

Circuits 411, ..., 41k, ..., 41K compute the components of $\{\boldsymbol{\mu}_k^{(l)}\}_{k=1}^{K}$ contained in $\mathbf{e}$, and circuits 421, ..., 42k, ..., 42K compute the squares of those values. Finally, circuit 43 computes the total sum.
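The data flow of Figures 2 and 3 can be sketched in a few lines. The function name `polarity_error_energy` and the example vectors are hypothetical; the sketch assumes the stored vectors are orthonormal and have the same length as the residual vector.

```python
def polarity_error_energy(e, basis):
    """Squared norm of e minus the squared inner products with each basis vector.

    `basis` is assumed to hold orthonormal vectors of the same length as e.
    """
    energy = sum(t * t for t in e)                          # circuit 32
    projections = [sum(a * b for a, b in zip(e, mu))        # circuits 411..41K
                   for mu in basis]
    removed = sum(d * d for d in projections)               # circuits 421..42K, 43
    return energy - removed                                 # final subtraction


# Hypothetical residual and a single variation direction along the first axis.
e = [3.0, 4.0]
basis = [[1.0, 0.0]]
score = polarity_error_energy(e, basis)
```

With an empty `basis` the score is just the residual power, so the residual waveform power method is the special case with no auxiliary sequences. Recognition then picks the category whose score is smallest.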

For example, in a speech recognition system that identifies vowels, the calculation circuits 11, ..., 1l, ..., 1L of Figure 1 correspond to the individual vowels.

The input signal waveform sequence $x_n$ is a digital signal sequence $\{x_n\}_{n=n_0}^{n_1}$ obtained by sampling the input speech signal at time intervals of $\Delta t$ with an A/D converter or the like (not shown).

Furthermore, the auxiliary waveform sequences $\boldsymbol{\mu}_k^{(l)}$ are determined for each vowel by collecting in advance speech uttered by a large number of people and applying equation (15) and the related equations to it. Their values are stored in advance in a storage device (not shown) and supplied to circuit 33.

[Modified example of the invention]

(1) The present invention is not limited to identifying speech waveforms; it can also be used to identify electrocardiogram waveforms and the like. Besides such one-dimensional waveforms, n-dimensional waveforms can be handled by using n-dimensional filters. With n = 2, for example, signal waveforms obtained from planar images can be identified.

(2) The above embodiment is built entirely from digital circuits, but at least part of it can instead be realized by program control on a microcomputer or the like, or built from analog arithmetic circuits. In that case the product-sum circuits, sum-of-squares circuits, and so on can use elements capable of inner product operations, such as elements based on optical filters or surface acoustic wave devices.

(3) The number K of variation directions may differ from class to class.

[Brief explanation of the drawing]

Figure 1 shows an embodiment of the present invention, and Figures 2 and 3 show configuration examples of individual parts of that embodiment.

11, ..., 1l, ..., 1L: calculation circuits; 2: minimum value detection circuit; 31: inverse filter; 32: sum-of-squares circuit; 33: inner product circuit.

Claims

[Claims]

1) A signal waveform recognition method in which an input signal waveform sequence of unknown class is passed through filters whose impulse response waveforms are the linear prediction coefficients of L kinds of standard signal waveform sequences of known class to obtain residual waveform sequences, and the class to which the input signal waveform sequence belongs is determined from the sum-of-squares output of each residual waveform sequence, characterized in that: one or more auxiliary waveform sequences, set in advance for each of the L classes and used to remove the fluctuation component of the input signal waveform sequence, are generated; the inner product of each auxiliary waveform sequence with the residual waveform sequence is obtained; the sum of squares of the inner products is subtracted from the sum-of-squares output of the residual waveform sequence; and the result of the subtraction is used to determine the class to which the input signal waveform sequence belongs.