JPH0219478B2

JPH0219478B2 -

Info

Publication number: JPH0219478B2
Application number: JP58069475A
Authority: JP
Inventors: Masao Watari
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1983-04-20
Filing date: 1983-04-20
Publication date: 1990-05-01
Also published as: JPS59195287A

Description

【発明の詳細な説明】本発明は簡易形音声分析装置の改良に関する。[Detailed description of the invention] The present invention relates to an improvement in a simplified speech analysis device.

通常、音声認識装置では、音声波形を分析し、
その分析出力である特徴パラメータの時系列とあ
らかじめ記憶されているパターンとの間で識別計
算を行い、認識結果を求めている。 Normally, speech recognition devices analyze speech waveforms and
Discrimination calculations are performed between the time series of feature parameters that are the output of the analysis and pre-stored patterns to obtain recognition results.

従来、この音声認識装置に使用されている音声
分析としてバンドパスフイム分析やケプストラム
分析や変形ケプストラム分析があつた。 Conventionally, speech analyzes used in speech recognition devices include bandpass film analysis, cepstrum analysis, and modified cepstrum analysis.

音声波は声帯の振動によつて励振された声道か
らの放射出力であると考えることができ、音声信
号Ｇ（ｔ）は(1)式のように声道のインパルス応答
Ｒ（ｔ）と音源波形Ｓ（ｔ）の畳み込みで表わされ
る。 A speech wave can be considered to be a radiation output from the vocal tract excited by the vibration of the vocal cords, and the speech signal G(t) is expressed as the impulse response R(t) of the vocal tract as shown in equation (1). It is expressed by convolution of the sound source waveform S(t).

Ｇ（ｔ）＝Ｒ（ｔ）＊Ｓ（ｔ） ……(1) ＊は畳み込み操作 (1) 式をフーリエ変換すると G_f（ｗ）＝R_f（ｗ）×S_f（ｗ） ……(2) となる。音源特性S_f（ｗ）は周期的な線スペクト
ラムであり、声道特性R_f（ｗ）は、音声スペクト
ルG_f（ｗ）のエンベロープである。このエンベロ
ープを得る方法として一定値以上の帯域幅を持つ
たバンドパスフイルタを音声帯域内に複数個並べ
るバンドパスフイルタ分析がある。一定値以上の
帯域幅を持つことにより音源特性である線スペク
トルの影響を弱くし複数個並べることにより、エ
ンベロープの全体の特性すなわち声道特性を得る
ことができていた。 G (t) = R (t) * S (t) ... (1) * is the convolution operation (1) When the formula is Fourier transformed, G _f (w) = R _f (w) × S _f (w) ... (2) becomes. The sound source characteristic S _f (w) is a periodic line spectrum, and the vocal tract characteristic R _f (w) is an envelope of the voice spectrum G _f (w). As a method for obtaining this envelope, there is a bandpass filter analysis in which a plurality of bandpass filters having a bandwidth of a certain value or more are arranged in the audio band. By having a bandwidth above a certain value, the influence of the line spectrum, which is the sound source characteristic, is weakened, and by arranging multiple lines, it was possible to obtain the overall characteristics of the envelope, that is, the vocal tract characteristics.

ところで、より精密な声道特性を得るために
は、バンドパスフイルタの帯域幅を細くする必要
があるが、細くすると音源スペクトルである線ス
ペクトルの影響が大きく表われてくる。このため
バンドパスフイルタの帯域幅はある程度以上細く
することはできず、バンドパスフイルタ分析では
より精密な声道特性を得ることはできなかつた。
一方、声道特性と音源特性を分離し、より精密な
声道特性を求める方法としてケプストラム分析が
ある。ケプストラム分析では、(2)式をさらにlog
変換し、 log｜G_f（ｗ）｜＝log｜R_f（ｗ）｜＋log｜S_f（ｗ）｜ ……(3) 次に逆フーリエ変換によつてケプストラムを得
る。 By the way, in order to obtain more precise vocal tract characteristics, it is necessary to narrow the bandwidth of the bandpass filter, but when it is narrowed, the influence of the line spectrum, which is the sound source spectrum, becomes greater. For this reason, the bandwidth of the bandpass filter cannot be narrowed beyond a certain level, and more precise vocal tract characteristics cannot be obtained by bandpass filter analysis.
On the other hand, cepstral analysis is a method for separating vocal tract characteristics and sound source characteristics and obtaining more precise vocal tract characteristics. In cepstral analysis, equation (2) is further transformed into log
_Then , the cepstrum _is obtained by inverse _Fourier transformation.

G_c（ｑ）＝R_c（ｑ）＋S_c（ｑ） ……(4) この(4)式のように、スペクトラム領域での積が
ケプストラム領域では和となる。音源スペクトル
S_f（ｗ）である周期的な線スペクトルのケプスト
ラムS_c（ｑ）は音源の周期T_pの近傍にのみ表われ
る。一方、声道スペクトルR_f（ｗ）はG_f（ｗ）の
エンベロープとして表われ、そのケプストラム
R_c（ｑ）は低ケフレンシ部へ表われる。すなわ
ち、音声信号をケプストラム分析し、ケフレンシ
の低い成分に音源特性より分離された声道特性を
得ることができる。 G _c (q) = R _c (q) + S _c (q) ... (4) As shown in equation (4), the product in the spectral domain becomes the sum in the cepstrum domain. sound source spectrum
The cepstrum S _c (q) of the periodic line spectrum S _f (w) appears only in the vicinity of the period T _p of the sound source. On the other hand, the vocal tract spectrum R _f (w) appears as an envelope of G _f (w), and its cepstrum
R _c (q) appears in the low quefrency region. That is, by performing cepstral analysis on the audio signal, it is possible to obtain vocal tract characteristics that are separated from the sound source characteristics into components with low quefrency.

さらに特願昭56−069031号明細書（特開昭57―
185098号公報に記載されているように音声のスペ
クトルの中より帯域内の周波数成分のみを切出し
ゼロ周波数までシフトする切出し部を用けること
により、伝送路の帯域外の特性の影響を除去する
ことができる。また、周波数の高域部を圧縮する
写像関数による周波数軸のスケール変換例えば
logスケール変換、Melスケール変換などを行う
スケール変換部を用けることにより、高域より低
域へ重みが置かれたすなわち人間の聴覚特性に近
い特性を持つた変形ケプストラムが得られる。前
記スケール変換は第１図に示すような写像関数
S_n＝Ｍ（S_e）により、伝送路の帯域内のみのデー
タをlogスケールまたはMelスケールへ並び換え
ることである。すなわちS_e番目のスペクトルを
S_n番目の変形スペクトルとすることである。 Furthermore, the specification of Japanese Patent Application No. 56-069031
As described in Publication No. 185098, by using an extraction section that extracts only the frequency components within the band from the audio spectrum and shifts them to zero frequency, the influence of characteristics outside the band of the transmission path can be removed. Can be done. In addition, scale conversion of the frequency axis using a mapping function that compresses the high frequency part, for example,
By using a scale conversion section that performs log scale conversion, Mel scale conversion, etc., a modified cepstrum can be obtained in which weight is placed more on low frequencies than on high frequencies, that is, having characteristics close to human auditory characteristics. The scale conversion is performed using a mapping function as shown in Figure 1.
This is to rearrange the data only within the band of the transmission path into log scale or Mel scale using S _n =M (S _e ). That is, the S _eth spectrum is
_S is the nth deformed spectrum.

しかしながら、前記のバンドパスフイルタ分析
やケプストラム分析や変形ケプストラム分析は、
フーリエ変換を基本としており三角関数系との乗
算を必要とし装置が大型となる欠点があつた。 However, the above-mentioned bandpass filter analysis, cepstrum analysis, and modified cepstrum analysis
It is based on Fourier transform and requires multiplication with a trigonometric function system, which has the disadvantage of increasing the size of the device.

一方、フーリエ変換の近似的な変換であるウオ
ルシユ変換は±１の２値の直交関数系による変換
であるため、ウオルシユスペクトルは加減算のみ
で求めることができる。このウオルシユ変換を用
いることにより特開昭57−700号公報に記載され
ているように小型の疑似バンドパスフイルタを実
現することが知られている。しかし疑似バンドパ
スフイルタではより細かで精密な声道特性を得ら
れない欠点があつた。 On the other hand, since the Walsh transform, which is an approximation of the Fourier transform, is a transform using a binary orthogonal function system of ±1, the Walsh spectrum can be obtained only by addition and subtraction. It is known that by using this Walsh transformation, a small-sized pseudo bandpass filter can be realized as described in Japanese Patent Application Laid-Open No. 57-700. However, the pseudo bandpass filter had the disadvantage that finer and more precise vocal tract characteristics could not be obtained.

本発明の目的は、ケプストラム分析におけるフ
ーリエ変換と逆フーリエ変換を共にウオルシユ変
換へ置き換えることにより、ケプストラムの近似
を得る小型の装置、すなわち、より細かで精密な
声道特性が得られかつ小型の音声分析装置を提供
することにある。 The purpose of the present invention is to provide a compact device for approximating the cepstrum by replacing both the Fourier transform and inverse Fourier transform in cepstrum analysis with the Walsh transform. Our objective is to provide analytical equipment.

本発明による音声分析装置は、入力信号のウオ
ルシユ変換を行う第１のウオルシユ変換部と、前
記ウオルシユ変換部の出力よりウオルシユパワー
スペクトルを求めそのlog変換を行うlog変換部
と、ウオルシユ交番数軸のスケール変換を行う写
像関数S_n＝Ｍ（S_e）により前記log変換部の出力
を変形交番数軸へ写像するスケール変換部と、前
記スケール変換部の出力のウオルシユ変換を行う
第２のウオルシユ変換部を有している。 The speech analysis device according to the present invention includes a first Walsh transformer that performs a Walsh transform on an input signal, a log transformer that obtains a Walsh power spectrum from the output of the Walsh transformer and performs a log transform, and a Walsh alternating number axis a scale conversion unit that maps the output of the log conversion unit onto a modified alternating number axis using a mapping function S _n =M (S _e ) that performs scale conversion; and a second scale conversion unit that performs a Walsh conversion of the output of the scale conversion unit. It has a conversion section.

次に本発明で使用するウオルシユ変換について
説明する。ウオルシユ変換は(5)式で定義される
が、Ｗ＝H_i・Ｘ ……(5) ただしＷはウオルシユスペクトルＸは入
力列ベクトル Hiはウオルシユ変換行列であり Hi＝Hi−１○×H₁ H₁＝（¹ ¹ _1-1）にて求められる。 Next, the Walsh transform used in the present invention will be explained. The Walsh transform is defined by equation (5), W=H _i・X ...(5) where W is the Walsh spectrum ₁ H ₁ = ( ¹ ¹ _1-1 ).

（○×はクロネツカ積）前記(5)式を高速に計算する方法である高速ウオ
ルシユ変換（FWTと略称する）が、昭和51年12
月電子通信学会論文誌Vol59−A.No.12の第1134頁
より第1135頁までの「フーリエ変換とウオルシユ
変換に関する−検討」に記載されている。(○× is Kronetska product) The fast Walsh transform (abbreviated as FWT), which is a method for quickly calculating the above equation (5), was developed in December 1975.
It is described in "Study on Fourier transform and Walsh transform" from pages 1134 to 1135 of the Journal of the Institute of Electronics and Communication Engineers Vol. 59-A. No. 12.

入力時系列を逆２進順に並べたベクトルをX.
ウオルシユスペクトルをＷ、変換行列をＣとすれ
ば、 FWTは、Ｗ＝Ｃ・Ｘ＝G_o・G_2-1…G₁・Ｘ ……(6) とｎ回の行列の積として表現できる。ここで各G_i
は(7)，(8)，(9)式より決定される。 The vector of input time series arranged in reverse binary order is X.
If the Walsh spectrum is W and the transformation matrix is C, FWT can be expressed as the product of W=C・X =G _o・G _2-1 ...G ₁・X ...(6) and n times of matrices. . Here each G _i
is determined from equations (7), (8), and (9).

ただしI_iは2ⁱ行2ⁱ列の単位行列であり、diag（）
は括弧内を対角要素とする対角行列でである。こ
こで逆２進順とは自然数を２進表現し、その桁を
逆転させた数を考え、その数の順序に並べること
であり、ｎ＝３の場合Ｘ＝
（X₀X₄X₂X₆X₁X₅X₃X₇）となる。さらに各G_iは、となる。 However, I _i is an identity matrix with 2 ⁱ rows and 2 ⁱ columns, and diag( )
is a diagonal matrix with diagonal elements in parentheses. Here, reverse binary order means to express a natural number in binary, consider a number with its digits reversed, and arrange it in the order of that number. If n = 3, then X =
(X ₀ X ₄ X ₂ X ₆ X ₁ X ₅ X ₃ X ₇ ). Furthermore, each G _i is becomes.

前記の行列G_iの各行ともゼロでない要素は２つ
のみであり高速フーリエ変換で用いられるバタフ
ライ演算と同形の演算を表現している。ただし、
その要素は±１のみであるため加減算のみで実行
できる。またウオルシユスペクトルはＷ＝
（W₀W₂W₄W₆W₇W₅W₃W₁）として求められる。 Each row of the matrix G _i has only two non-zero elements, and represents an operation isomorphic to the butterfly operation used in fast Fourier transform. however,
Since the element is only ±1, it can be executed only by addition and subtraction. Also, the Walsh spectrum is W=
(W ₀ W ₂ W ₄ W ₆ W ₇ W ₅ W ₃ W ₁ ).

本発明の音声分析装置は、ケプストラム分析に
おけるフーリエ変換および逆フーリエ変換へ置き
換えることにより、加減算器による簡単な演算器
で構成できる利点を持つている。さらにウオルシ
ユ変換を用いた疑似バンドパスフイルタ分析装置
に比較し、より細かで精密な声道特性が得られる
利点を持つている。 The speech analysis device of the present invention has the advantage that it can be configured with a simple arithmetic unit using an adder/subtractor by replacing the Fourier transform and inverse Fourier transform in cepstrum analysis. Furthermore, compared to pseudo-bandpass filter analyzers that use Walsh transform, it has the advantage of providing more detailed and precise vocal tract characteristics.

次に本発明の装置の具体的な構成を図面を参照
しながら説明する。 Next, the specific configuration of the apparatus of the present invention will be explained with reference to the drawings.

本発明の第１の実施例は第２図に示すように、
第１バツフアメモリ部第１ウオルシユ演算部２、
第１ウオルシユ変換制御部３、log変換部４、ス
ケール変換部５、第２バツフアメモリ部６、第２
ウオルシユ演算部７、第２ウオルシユ変換制御部
８より構成される。始めに入力時系列データが第
１バツフアメモリ部１へ入力され一時記憶され
る。記憶された後、第４図に示したｎ＝４の場合
の計算の流れ図に従つた第１ウオルシユ変換制御
部３の制御信号により、第１段より第ｎ段まで計
算が進められる。第ｉ段の処理は、第４図に示し
た第ｉ段の2^n-1個のバタフライ演算を実行するこ
とであり、(7)式のG_iの行列を乗ずることを意味し
ている。 The first embodiment of the present invention is as shown in FIG.
first buffer memory unit first Walsh calculation unit 2;
The first Walsh conversion control section 3, the log conversion section 4, the scale conversion section 5, the second buffer memory section 6, the second
It is composed of a Walsh calculation section 7 and a second Walsh conversion control section 8. First, input time series data is input to the first buffer memory section 1 and temporarily stored. After being stored, the calculation is advanced from the first stage to the nth stage by the control signal of the first Walsh transformation control unit 3 according to the calculation flowchart for n=4 shown in FIG. The processing at the i-th stage is to execute the 2 ^n-1 butterfly operations of the i-th stage shown in FIG. 4, and means to multiply by the matrix of G _i in equation (7).

バタフライ演算は Y_a＝X_a＋X_b Y_b＝X_a−X_b ……(10) または Y_a＝X_a−X_b Y_b＝X_a＋X_b ……(11) であり、第３図に示す第１ウオルシユ演算部２に
て求められる。バタフライ演算では始めにX_a，
X_bが第１バツフアメモリ部１より読み出され、
レジスタ２０１，２０２へそれぞれ格納される。
加算器２１１と減算器２１２にて(10)または(11)式の
加減算が求められ、その結果であるY_a，Y_bは第
１バツフアメモリ部１のX_a，X_bが記憶されてい
た場所へ書かれる。すなわち(10)式を選択した場
合、加算器２１１の出力は第１バツフアメモリ部
１のX_aが記憶されていた場所へ書かれ、減算器
２１２の出力はX_bの場所へ書かれる。また(11)式
を選択した場合は、加算器２１１の出力はX_bの
場所へ書かれ、減算器２１２の出力はX_aの場所
へ書かれる。この(10)または(11)の選択は第１ウオル
シユ変換制御部３によつて行われる。最終段であ
る第ｎ段まで前記処理が終了すると、第１バツフ
アメモリ１にウオルシユスペクトラムが得られ
る。 The butterfly operation is Y _a =X _a +X _b Y _b =X _a −X _b ...(10) or Y _a =X _a −X _b Y _b =X _a +X _b ...(11), and Fig. It is determined by the first Walsh calculation unit 2 shown in FIG. In the butterfly operation, first X _a ,
X _b is read from the first buffer memory section 1,
The data are stored in registers 201 and 202, respectively.
The adder 211 and the subtracter 212 calculate the addition and subtraction of equation (10) or (11), and the results Y _a and Y _b are stored in the first buffer memory section 1 where X _a and X _b were stored. will be written to. That is, when formula (10) is selected, the output of the adder 211 is written to the location where X _a of the first buffer memory section 1 was stored, and the output of the subtracter 212 is written to the location of X _b . When formula (11) is selected, the output of the adder 211 is written to the location X _b , and the output of the subtracter 212 is written to the location X _a . This selection of (10) or (11) is performed by the first Walsh conversion control section 3. When the processing is completed up to the n-th stage, which is the final stage, a Walsh spectrum is obtained in the first buffer memory 1.

ウオルシユ変換が終了した後、第５図に示す
log変換部４とスケール変換部５によつて、logパ
ワーウオルシユスペクトルが求められ、スケール
変換を行う写像関数S_n＝Ｍ（S_l）により変形交番
数軸へ写像を行う。すなわち、スケール変換制御
部５１は第６図に示すタイムチヤートに従つた制
御信号を発し、始めに第１バツフアメモリ部１よ
り信号ａ１に従つてウオルシユスペクトルの偶数
項W_2iと奇数項W_2i+1を順次読み出し、log変換部
４の乗算器４１で２乗され加算器４２とアキユム
レータ４３を用いてパワーウオルシユスペクトル
（P_i＝W² _2i＋W_2i+ ² ₁）が求められ、づついてlog変換
部４にてlog変換され、信号ａ２をアドレスとし
て指示された写像関数値Ｍ(i)を写像関数テーブル
メモリ部５２より読み出し、その出力MM（ｉ）
を第２バツフアメモリ部６のアドレス信号ａ３と
してlogパワーウオルシユスペクトル（logP_i）を
第２バツフアメモリ部６のＭ（ｉ）番地へ格納す
る。 After the Walsh transformation is completed, the result is shown in Figure 5.
A log power Walsh spectrum is obtained by the log conversion unit 4 and the scale conversion unit 5, and is mapped onto the modified alternating number axis using a mapping function S _n =M (S _l ) that performs scale conversion. That is, the scale conversion control section 51 issues a control signal according to the time chart shown _in FIG _{. 1} are sequentially read out, squared by the multiplier 41 of the log conversion unit 4, and the power wash spectrum (P _i = W ² _2i + W _2i+ ² ₁ ) is obtained using the adder 42 and the accumulator 43, and then log conversion is performed. The mapping function value M(i) which is log-converted in the unit 4 and designated using the signal a2 as an address is read out from the mapping function table memory unit 52, and its output MM(i)
The log power wash spectrum (logP _i ) is stored at address M(i) of the second buffer memory section 6 with the address signal a3 of the second buffer memory section 6 being set as the address signal a3 of the second buffer memory section 6.

第２のウオルシユ変換は第１ウオルシユ変換と
同様に動作し第２バツフアメモリ部６、第２ウオ
ルシユ変換部７、第２ウオルシユ変換制御部８に
て実行される。なお、ウオルシユ変換において
は、正変換行列Ｃと逆変換行列C^-1とは、同一の
形（定数倍の違いを無視して）をしているので、
第２ウオルシユ変換部７に逆ウオルシユ変換を行
なわせても、最終結果は同一となる。ところで、
通常音声認識ではケプストラムの低次の項のみ使
用するため、第２のウオルシユ変換は低次の項の
み計算すればよい。そのため第２のウオルシユ変
換は(5)式の変換行列の低次の項のみ、すなわち(12)
式の交番数の小さいｋについて W_k＝_N-1 〓^l=0 H_klX_l ……(12) ただし、Ｎ＝2ⁿ のみ計算すればよい。 The second Walsh transformation operates in the same manner as the first Walsh transformation, and is executed by the second buffer memory section 6, the second Walsh transformation section 7, and the second Walsh transformation control section 8. In addition, in the Walsh transformation, the forward transformation matrix C and the inverse transformation matrix C ^-1 have the same form (ignoring the difference in constant multiplication), so
Even if the second Walsh transform unit 7 performs the inverse Walsh transform, the final result will be the same. by the way,
Normally, in speech recognition, only the low-order terms of the cepstrum are used, so the second Walsh transform only needs to calculate the low-order terms. Therefore, the second Walsh transformation requires only the low-order terms of the transformation matrix in equation (5), that is, (12)
For k where the number of alternations in the equation is small, W _k = _N-1 〓 ^l=0 H _kl X _l ...(12) However, only N=2 ⁿ needs to be calculated.

本発明の第２の実施例は、第２のウオルシユ変
換を(12)式にて求める装置であり、第１の実施例に
おける第２ウオルシユ変換制御部８、第２ウオル
シユ演算部７を第７図に示す構成へ変更したもの
である。第２ウオルシユ変換制御部７は第８図に
示すタイムチヤートに従つた制御信号を発し、信
号Cl７によつてアキユムレータ７２をクリヤし、
信号ｂ１に従つて第２バツフアメモリ部６より変
形logパワーウオルシユスペクトルX_l＝logP_M(i)を
読み出し、ウオルシユ変換行列H_klに従つた＋１
または−１の信号b₂により加減算器７１はアキユ
ムレータ７２との間で加算または減算を行う。す
なわち信号ｂ２が＋１の場合はACC＋X_l→ACC
を行い、信号ｂ２が−１の場合はACC→X_l→
ACCを行う。信号ｂ１がＮ−１となつた時アキ
ユムレータ７２へウオルシユ変換値W_kすなわち
疑似ケプストラムが得られる。 The second embodiment of the present invention is an apparatus that calculates the second Walsh transformation using equation (12), and the second Walsh transformation control section 8 and the second Walsh calculation section 7 in the first embodiment are replaced by a seventh The configuration has been changed to the one shown in the figure. The second Walsh conversion control section 7 issues a control signal according to the time chart shown in FIG. 8, clears the accumulator 72 with the signal Cl7,
According to the signal b1, the modified log power Walsh spectrum X _l =logP _M(i) is read out from the second buffer memory section 6, and it is +1 according to the Walsh transformation matrix H _kl .
Alternatively, the adder/subtracter 71 performs addition or subtraction with the accumulator 72 in response to the -1 signal _b2 . In other words, if signal b2 is +1, ACC+X _l →ACC
and if signal b2 is -1, ACC→X _l →
Do ACC. When the signal b1 becomes N-1, a Walsh transform value _Wk , that is, a pseudo cepstrum is obtained in the accumulator 72.

以上本発明を実施例に基づき説明したが、これ
らの記載は本発明の範囲を限定するものではな
い。特に本発明の実施例ではFWTのアルゴリズ
ムとして(6)式に示すように入力時系を逆２進順に
並べG₁よりG_oまで順次積を取り求めていたが、
（13）式に示すような正順序の時系列X′をG_o ^T、
よりG₁ ^Tまで順次積を取りその結果として逆２進
順のウオルシユスペクトルW′を得る方法も採用
できることは明白である。 Although the present invention has been described above based on Examples, these descriptions do not limit the scope of the present invention. In particular, in the embodiment of the present invention, as an FWT algorithm, the input time series is arranged in reverse binary order as shown in equation (6), and the products are sequentially obtained from G ₁ to G _o .
Let G _o ^T be the time series X′ in the normal order as shown in equation (13),
It is clear that it is also possible to take the products sequentially up to G ₁ ^T and obtain the Walsh spectrum W' in reverse binary order as a result.

W′＝G₁ ^T・G₂ ^T…G_o ^T・X′ ……（13）また、パワースペクトルをP_i＝W² _2i，W² _2i+1とし
て求めているが、乗算器を必要としているためP_i
＝｜W_2i｜＋｜W_2i+1｜のように絶対値の和として
パワースペクトルを近似的に求める方法も採用で
きることは明白である。 W′=G ₁ ^T・G ₂ ^T …G _o ^T・X′ …(13) Also, the power spectrum is obtained as P _i =W ² _2i , W ² _2i+1 , but it requires a multiplier. Because there is P _i
It is obvious that a method of approximately obtaining the power spectrum as a sum of absolute values, such as =|W _2i |+|W _2i+1 |, can also be adopted.

[Brief explanation of drawings]

第１図はスケール変換を示す図であり、第２図
は本発明の第１の実施例のブロツク図であり、第
３図は第１ウオルシユ演算部２のブロツク図であ
り、第４図は第１ウオルシユ変換の計算の流れ図
であり、第５図はlog変換部４とスケール変換部
５のブロツク図であり、第６図はスケール変換の
タイムチヤートであり、第７図は第２の実施例に
おける第２ウオルシユ演算部７のブロツク図であ
り、第８図は第２ウオルシユ変換のタイムチヤー
トである。図において、１は第１バツフアメモリ部、２は
第１ウオルシユ演算部、３は第１ウオルシユ変換
制御部、４はlog変換部、５はスケール変換部、
６は第２バツフアメモリ部、７は第２ウオルシユ
演算部、８は第２ウオルシユ変換制御部２０１，
２０２はレジスタ、２１１は加算器、２１２は減
算器である。第５図において４１は乗算器、４２
は加算器、４３はアキユムレータ、４４はlog変
換器、５１はスケール変換制御部、５２は写像関
数テーブルメモリ、７１は加減算器、７２はアキ
ユムレータである。 FIG. 1 is a diagram showing scale conversion, FIG. 2 is a block diagram of the first embodiment of the present invention, FIG. 3 is a block diagram of the first wallet calculation section 2, and FIG. FIG. 5 is a block diagram of the log conversion section 4 and scale conversion section 5, FIG. 6 is a time chart of the scale conversion, and FIG. 7 is a flowchart of the calculation of the first Walsh transformation. 8 is a block diagram of the second Walsh calculation unit 7 in the example, and FIG. 8 is a time chart of the second Walsh transformation. In the figure, 1 is a first buffer memory section, 2 is a first Walsh calculation section, 3 is a first Walsh conversion control section, 4 is a log conversion section, 5 is a scale conversion section,
6 is a second buffer memory unit, 7 is a second wallet calculation unit, 8 is a second wallet conversion control unit 201,
202 is a register, 211 is an adder, and 212 is a subtracter. In FIG. 5, 41 is a multiplier, 42
is an adder, 43 is an accumulator, 44 is a log converter, 51 is a scale conversion control section, 52 is a mapping function table memory, 71 is an adder/subtractor, and 72 is an accumulator.

Claims

[Claims]

1. A first Walsh transform unit that performs a Walsh transform of an input signal, a log transform unit that obtains a Walsh power spectrum from the output of the Walsh transform unit and performs a log transform thereof, and a mapping function S that performs a scale transform of a Walsh alternating number axis. By _n = M(S _e ), the above
A speech analysis device comprising: a scale converter that maps the output of the log converter onto a modified alternating number axis; and a second Walsh converter that performs Walsh transform of the output of the scale converter.