JPS6073600A - Voice recognition equipment - Google Patents

Voice recognition equipment

Info

Publication number
JPS6073600A
JPS6073600A JP58181135A JP18113583A JPS6073600A JP S6073600 A JPS6073600 A JP S6073600A JP 58181135 A JP58181135 A JP 58181135A JP 18113583 A JP18113583 A JP 18113583A JP S6073600 A JPS6073600 A JP S6073600A
Authority
JP
Japan
Prior art keywords
correlation
section
voice recognition
recognition equipment
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58181135A
Other languages
Japanese (ja)
Inventor
和弘 金子
角石 光夫
誠治 加藤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP58181135A priority Critical patent/JPS6073600A/en
Publication of JPS6073600A publication Critical patent/JPS6073600A/en
Pending legal-status Critical Current

Links

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。
(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】 ta> 発明の技術分野 本発明は音声認識装置に係り、特に入力音声信号の特徴
パラメータを効率よく且つ精度よく抽出することが可能
となる音声認識装置に関する。
DETAILED DESCRIPTION OF THE INVENTION Technical Field of the Invention The present invention relates to a speech recognition device, and more particularly to a speech recognition device that is capable of efficiently and accurately extracting characteristic parameters of an input speech signal.

(b)従来技術と問題点 音声認識装置は、例えば第1図に示す如くマイクロホン
1等より入力される音声信号の特徴を抽出する特徴抽出
部2.この抽出部2より得られた特徴をパラメータとし
て登録する音声パターンメモリ5.このメモリ5に格納
された特徴パラメータを基に、入力される音声を認識し
てその認識結果を出力する認識部4を備える。
(b) Prior Art and Problems A speech recognition device includes a feature extraction section 2 which extracts the features of a speech signal inputted from a microphone 1 or the like, as shown in FIG. 1, for example. A voice pattern memory 5 in which the features obtained from the extractor 2 are registered as parameters. A recognition unit 4 is provided that recognizes input speech based on the feature parameters stored in the memory 5 and outputs the recognition result.

ここで特徴抽出部2の構成として、それぞれ異なる中心
周一波数をを有する複数の帯域濾波器を用いる方法、及
びこれら帯域濾波器に代えて相関器を用いる方法がある
。しかるに前者の方法は、回路規模の増大ないしは演算
量の増大を招く懸念があり、後者の方法に比べてやや不
利である。
Here, as the configuration of the feature extracting section 2, there are a method of using a plurality of bandpass filters each having a different center frequency and wave number, and a method of using a correlator in place of these bandpass filters. However, the former method is somewhat disadvantageous compared to the latter method, as there is a concern that the circuit scale will increase or the amount of calculations will increase.

一方後者の方法は、回路規模或いは演算量の点で有利で
はあるが、相関をとる時間を長くするとこの出力特性が
変化する懸念がある。即ち、自己相関関数をY、入力さ
れる音声波形を5in(2πft十〇)、基準関数を5
in2ycfot及びCO32πrot、波形のサンプ
リングに使う時間を八Tとすると次式で定義される。
On the other hand, although the latter method is advantageous in terms of circuit scale or amount of calculation, there is a concern that the output characteristics may change if the correlation is taken for a long time. That is, the autocorrelation function is Y, the input audio waveform is 5 inches (2πft 〇), and the reference function is 5
In2ycfot and CO32πrot are defined by the following equation, assuming that the time used for waveform sampling is 8T.

相関をとる時間(フレーム幅)はT=△TXNである。The time (frame width) for taking the correlation is T=ΔTXN.

第3図に上記(1)式の特性を示す。つまり、この特性
で音声波形を変換し特徴ノ々ラメータを得ることになる
。相関をとる時間を長くすると、変換された特徴パラメ
ータはより正確になる。しかし相関をとる時間Nを2N
、3N ・−mNとしてし)りと出力特性が第4図に示
すように変化してしまうという欠点がある。
FIG. 3 shows the characteristics of equation (1) above. In other words, the voice waveform is converted using this characteristic to obtain the characteristic parameter. The longer the correlation time, the more accurate the transformed feature parameters will be. However, the time N for taking the correlation is 2N
, 3N .-mN), the output characteristic changes as shown in FIG. 4, which is a drawback.

(C) 発明の目的 本発明の目的は、特徴抽出部におし)で相関器を用いる
方法において、入力音声信号の相関をとる時間を変えて
も出力特性が変化しないようにして正確な特徴パラメー
タに変換する相関関数を提イ共し、その結果、音声認識
性能を向上させること力(可能となる音声l織装置を提
供するにある。
(C) Object of the Invention The object of the present invention is to obtain accurate features in a method using a correlator in the feature extracting section so that the output characteristics do not change even if the time for correlating the input audio signal is changed. The object of the present invention is to provide a speech recognition device that can improve speech recognition performance by providing a correlation function that converts the parameters into parameters.

(d) 発明の構成 上記目的を一達成するため本発明においては、上述した
(1)式における単位(△TXNの時間に相当)の相関
を複数回繰返し、それぞれの相関結果を累積加算した結
果を用いて特徴抽出を行うように構成したものである。
(d) Structure of the Invention In order to achieve the above object, the present invention repeats the correlation of the unit (corresponding to the time of △TXN) in the above-mentioned formula (1) multiple times, and calculates the result of cumulatively adding the respective correlation results. It is configured to perform feature extraction using .

以下実施例を用いて本発明を詳述する。The present invention will be explained in detail below using Examples.

(e) 発明の実施例 第2図は本発明の一実施例構成を示すエルプリズム図で
あり、第1図に示した抽出部2における相関器の構成を
示すものである。図中、21.22は乗算部、23.2
4は2乗演算部、25は加算部、26は加算累積部を示
す。また、Ys (△T)、Yc(△T)はそれぞれ上
記(1)式で示した基準関数5in2yrfQ△T、c
os2yr’fO△Tを表わす。
(e) Embodiment of the Invention FIG. 2 is an L prism diagram showing the configuration of an embodiment of the present invention, and shows the configuration of the correlator in the extraction section 2 shown in FIG. In the figure, 21.22 is a multiplication section, 23.2
4 is a square calculation section, 25 is an addition section, and 26 is an addition accumulation section. Furthermore, Ys (△T) and Yc (△T) are the reference functions 5in2yrfQ△T and c shown in equation (1) above, respectively.
represents os2yr'fO△T.

即ち本実施例では、入力音声信号INを乗算部21.2
2にてサンプリング周期△Tで基準関数との積をとり、
また2乗演算部23.24によってそれぞれの積の2乗
演算を行う。これにより単位フレームの相関Y′をとる
。この単位の相関出力Y′は加算累積部26によって逐
次加算、累積される。尚、Tは遅延回路であり累積結果
を次のフレームの相関出力が得られるまでバッファリン
グするものである。
That is, in this embodiment, the input audio signal IN is multiplied by the multiplier 21.2.
In step 2, take the product with the reference function at the sampling period △T,
Further, the square calculation units 23 and 24 perform a square calculation of each product. This calculates the correlation Y' of the unit frame. This unit of correlation output Y' is successively added and accumulated by the addition/accumulation section 26. Note that T is a delay circuit that buffers the cumulative result until the correlation output of the next frame is obtained.

このように単位フレーム(ΔTXN時間)の相関を複数
回(例えばm回)行った結果が、相関器゛ 数Yの最終
的な出力0tJTとして、図示しない特徴パラメータ変
換部に供給されることになる。この出力0tJTを上記
(1)式と対応して示すと下式の、 (2πfΔT−1
十〇) Y2=cos2if、OΔT−i−sin(2πfへT
−i十〇) 従って、例えば相関出力を、繰返した回数分(m+1)
で平均をとることによって、第3図に示したと同様の出
力特性を得ることができる。
In this way, the result of performing correlation for a unit frame (ΔTXN time) multiple times (for example, m times) is supplied to the feature parameter conversion unit (not shown) as the final output 0tJT of the number Y of correlators. . If this output 0tJT is shown in correspondence with the above equation (1), the following equation is expressed as (2πfΔT−1
10) Y2=cos2if, OΔT-i-sin (T to 2πf
-i 〇) Therefore, for example, the correlation output is repeated for the number of times (m+1)
By averaging the output characteristics as shown in FIG. 3, it is possible to obtain an output characteristic similar to that shown in FIG.

しかも相関をとった期間は、時系列的に巾広くとった事
となり、特徴パラメータを精度よく抽出できるものであ
る。
Furthermore, the period during which the correlation was taken is spread over a wide time series, allowing the characteristic parameters to be extracted with high accuracy.

(f) 発明の効果 以上詳述した如(本発明によれば、特徴抽出部における
相関器の出力特性を変化させることな(、相関時間を実
質的に長くとることができ、音声波形をより正確な特徴
パラメータに変換でき、ひいては認識率を大巾に向上さ
せることが可能となる。
(f) Effects of the Invention As detailed above (according to the present invention, the correlation time can be substantially lengthened without changing the output characteristics of the correlator in the feature extraction section), and the speech waveform can be further improved. This can be converted into accurate feature parameters, which in turn makes it possible to greatly improve the recognition rate.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は音−m織装置の構成例を示す図、第2図は本発
明の一実施例を示す図、第3図及び第4図は相関器の出
力特性を示す図である。2は特徴抽出部、21.22は
乗算部、26は加算累積部をそれぞれ示す。また第3′
@、第4図において横軸は周波数、縦軸は減衰量を表わ
す。
FIG. 1 is a diagram showing an example of the configuration of a sound-m weaving device, FIG. 2 is a diagram showing an embodiment of the present invention, and FIGS. 3 and 4 are diagrams showing output characteristics of a correlator. Reference numeral 2 indicates a feature extraction section, 21 and 22 a multiplication section, and 26 an addition accumulation section. Also the 3rd
@, In FIG. 4, the horizontal axis represents frequency, and the vertical axis represents attenuation amount.

Claims (1)

【特許請求の範囲】[Claims] 相関をとることにより該音声信号波形の特徴を抽出する
特徴抽出部を備えた装置において、前記入力音声信号に
おける各々時間的に異なる信号波形部分のそれぞれと前
記基準信号との相関をとる相関部と、該相関部から得ら
れる演算結果を加算累することを特徴とする音声認識装
置。
The apparatus includes a feature extraction section that extracts features of the audio signal waveform by making a correlation, the correlation section that takes a correlation between each of the temporally different signal waveform portions of the input audio signal and the reference signal; , a speech recognition device characterized in that the calculation results obtained from the correlation section are summed.
JP58181135A 1983-09-29 1983-09-29 Voice recognition equipment Pending JPS6073600A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58181135A JPS6073600A (en) 1983-09-29 1983-09-29 Voice recognition equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58181135A JPS6073600A (en) 1983-09-29 1983-09-29 Voice recognition equipment

Publications (1)

Publication Number Publication Date
JPS6073600A true JPS6073600A (en) 1985-04-25

Family

ID=16095488

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58181135A Pending JPS6073600A (en) 1983-09-29 1983-09-29 Voice recognition equipment

Country Status (1)

Country Link
JP (1) JPS6073600A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS433672Y1 (en) * 1965-08-06 1968-02-15
JPS6228234U (en) * 1985-08-01 1987-02-20

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS433672Y1 (en) * 1965-08-06 1968-02-15
JPS6228234U (en) * 1985-08-01 1987-02-20

Similar Documents

Publication Publication Date Title
CN100356445C (en) Method and apparatus for separating sound-source signal and method and device for detecting pitch
CA1164569A (en) System for extraction of pole/zero parameter values
US4076960A (en) CCD speech processor
JPH0465392B2 (en)
JPS6356560B2 (en)
JPS6073600A (en) Voice recognition equipment
JPH08328593A (en) Spectrum analysis method
JPS6152478B2 (en)
JPH05265490A (en) Pitch period extracting device
JPS59120B2 (en) Voice analysis processing method
JPH0461359B2 (en)
JPH0122638B2 (en)
JP3258318B2 (en) Voice-controlled electronic musical instruments
JPS59164599A (en) Correlator
JPH04342298A (en) Momentary pitch analysis method and sound/silence discriminating method
SU860276A1 (en) Method of detecting phase modulated signals
JPH0344318B2 (en)
RU2003105498A (en) METHOD FOR IDENTIFICATION OF INFORMATIVE PARAMETERS OF ST-SEGMENT OF ELECTROCARDIO SIGNAL AND DEVICE FOR ITS IMPLEMENTATION
SU955045A1 (en) Experimental correlated random number generator
JPH0377520B2 (en)
JPH0670753B2 (en) Polar zero analyzer
JPS60158500A (en) Pitch extractor
JPH0311479B2 (en)
JPS607499A (en) Pitch extraction circuit
JPH02244200A (en) Pitch detecting circuit for processing voice signal