JPS58143396A - Voice recognition unit - Google Patents

Voice recognition unit

Info

Publication number
JPS58143396A
JPS58143396A JP57025701A JP2570182A JPS58143396A JP S58143396 A JPS58143396 A JP S58143396A JP 57025701 A JP57025701 A JP 57025701A JP 2570182 A JP2570182 A JP 2570182A JP S58143396 A JPS58143396 A JP S58143396A
Authority
JP
Japan
Prior art keywords
comparison
section
standard
recognition
voice recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP57025701A
Other languages
Japanese (ja)
Inventor
弘之 金田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Electric Co Ltd filed Critical Nippon Electric Co Ltd
Priority to JP57025701A priority Critical patent/JPS58143396A/en
Publication of JPS58143396A publication Critical patent/JPS58143396A/en
Pending legal-status Critical Current

Links

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。
(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】 本発明は音声認識装置に関テるものである。[Detailed description of the invention] The present invention relates to a speech recognition device.

従来の音声認識装置は、人力音声を分析して得られる1
つの特徴(パラメータ)についてのみ標準パラメータと
比較を行なうか、あるいは複数の特徴について順次直列
的に比較を行なっていく方式であった。このため、認識
率が低くかつ認識に多大な時間を歓した。
Conventional speech recognition devices are based on 1, which is obtained by analyzing human speech.
The method used was to compare only one feature (parameter) with standard parameters, or to compare multiple features sequentially in series. For this reason, the recognition rate was low and recognition took a lot of time.

本発明の目的は高速認識を行な9裟に’に提供すること
にあり、複数の特徴を抽出し、その各々についての比較
動作を並列に行なうようにしたものである。更に、比較
結果は判定部によシ重みつきで判定され、標準パラメー
タと最も近似したものを選び出すようにして、VA認識
率を著しく低減している。
An object of the present invention is to perform high-speed recognition and provide a wide range of features, by extracting a plurality of features and performing comparison operations for each feature in parallel. Furthermore, the comparison results are weighted and determined by the determination section, and the one most similar to the standard parameters is selected, thereby significantly reducing the VA recognition rate.

以下、図面を用いて本発明の一8!尻例を1l−1=細
に説明する。
Hereinafter, part 8 of the present invention will be explained using drawings. An example of 1l-1 will be explained in detail.

第1図は本発明の一実施例發部のブロック図であり、こ
こでは3種の%g、(−Tif−声パラメータ)につい
て比較を行なう例を提示している。10は音声入力端子
、20は%像抽出部、30は比較部(イ)、31は比較
部(ロ)、32は比較部(ハ)、40は標準特徴(イ)
メモリ、41は標準時gL(ロ)メモリ、42は標準%
徴(ハ)メモリ、50は判定部、60は認識結果出力端
子である。
FIG. 1 is a block diagram of an embodiment of the present invention. Here, an example is presented in which three types of %g and (-Tif-voice parameters) are compared. 10 is an audio input terminal, 20 is a % image extraction section, 30 is a comparison section (a), 31 is a comparison section (b), 32 is a comparison section (c), 40 is a standard feature (a)
Memory, 41 is standard time gL (b) memory, 42 is standard %
50 is a determination unit, and 60 is a recognition result output terminal.

音声入力端子10より入力された音声からは、特徴抽出
部20によ938M類の%徴が抽出される。
From the voice input from the voice input terminal 10, the feature extraction unit 20 extracts 938M class % features.

ここで抽出される%徴とは、例えば周波数スペクトルバ
タンイあるいは零交差数口あるいはバワ−変化成分ハ等
であわ、いずれも1認識率位だけの集合を示す。%徴抽
出部20で得られた特徴は同様にして前もって得られて
いる標準特徴10メモリ40、標準特徴(ロ)メモリ4
1、標準特徴(ハ)メモリ42内のそれぞれの内容と対
応する比較部イ30、比較部ロ31.比較部ハ32で比
較される。比較においては、各%徴が1個あるいは2個
あるいは3個等任意の個数の認識結果を夫々優先順位を
つけて出力するように設定されている。判定部50では
、比較部30〜32にそれぞれ重みづけし、かつ各比較
部よシ複数個gi*結果が得られる場合にはその順位を
も考慮し、最終的に1個の認識結果を求め、認識結果出
力端子60に出力する。
The % characteristics extracted here include, for example, a frequency spectrum, a number of zero crossings, a power change component, etc., and each represents a set of only one recognition rate. The features obtained by the % feature extraction unit 20 are stored in the standard feature 10 memory 40 and the standard feature (b) memory 4, which have been previously obtained in the same way.
1. Standard features (c) Comparison section A 30 and comparison section B 31 corresponding to the respective contents in the memory 42. Comparison is made in comparison section C32. In the comparison, it is set to output an arbitrary number of recognition results such as 1, 2, or 3 for each % feature, with respective priorities assigned. The determination unit 50 weights each of the comparison units 30 to 32, and if a plurality of gi* results are obtained from each comparison unit, the ranking thereof is also considered, and finally one recognition result is determined. , is output to the recognition result output terminal 60.

−例としてrAJからrJJまでの音声のうち任意の1
音声が入力された場合を仮定する。比較部AではrA」
、rJ、rcJの順に比較部BではrBJ、rcj、l
AJのII&に、また比較部Cでは「B」。
- For example, any one of the voices from rAJ to rJJ
Assume that voice is input. rA in comparison part A.”
, rJ, rcJ in the order of comparison part B, rBJ, rcj, l
AJ's II&, and comparison section C is "B".

FD」、「c」の順に認識したとする。判定部では比較
部の重みを比較部イ、比較部口、比較部ハに対しそれぞ
れ1.2.3とつけるとする。判定部では各比較部の認
識結果に対し、順位の逆数に各比較部につけられた重み
を来した値の総和を求め その最大値をとる音声を最終
的な認識結果として出力する。上記例では、 rAJについ”’CIXl+−x2−5−3 rBJについて −Xl+1x2+1x3一旦2 rcJについて ↓X1+’X2+’X3=−Z3  
 2   3   3 「1)」について 1×3一旦 2 となり、最大値旦をとるrBJが認識結果として出力さ
れる。これら各比較動作は並列して行なわれるように、
各比較部へ供給される制御クロックを共通に与えている
ため、高速でこれを実行できる。
FD" and "c" are recognized in this order. In the determination section, weights of the comparison section are given as 1.2.3 for comparison section A, comparison section 7, and comparison section C, respectively. The determination section calculates the sum of the reciprocals of the ranks and the weights assigned to each comparison section for the recognition results of each comparison section, and outputs the voice that takes the maximum value as the final recognition result. In the above example, for rAJ "'CIXl+-x2-5-3 for rBJ -Xl+1x2+1x3 once 2 for rcJ ↓X1+'X2+'X3=-Z3
2 3 3 Regarding "1)", 1×3 becomes 2 once, and rBJ, which takes the maximum value 1, is output as a recognition result. These comparison operations are performed in parallel.
Since a common control clock is supplied to each comparing section, this can be executed at high speed.

以上説明したように、入力音声を複数の特、gl、につ
いて、並列に標準特徴と比較し、各比較結果に重みをつ
けて判定することによシ、従来の1%徴についてのみ比
較するものに比べてvAb率が著しく低減でき、また直
列的に複数の特徴について比較するものに比べ並列に同
時に比較できるため認識処理に嶽する時間も短縮される
という効果がある。
As explained above, by comparing input speech with standard features in parallel for multiple features, gl, and making decisions by weighting each comparison result, the conventional method that compares only the 1% feature This method has the effect that the vAb rate can be significantly reduced compared to the previous method, and the time required for recognition processing can be shortened since it can be compared simultaneously in parallel compared to a method that compares a plurality of features serially.

同第1図の丈施例では、特徴抽出部20で各音声入力ご
とに3柚の特徴を抽出しているが、線形予測係数尋特徴
抽出が難解な場合には、各特徴ごとに専用の抽出部を設
けてもよい。
In the example shown in Fig. 1, the feature extractor 20 extracts three features for each voice input, but if the linear prediction coefficient feature extraction is difficult to understand, a dedicated An extraction section may also be provided.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例の安部フロック図である。 10−・・・・・・音声入力端イ、20・・・・・・特
徴抽出部、30・・・・・・比較部(イ)、31・・・
・・・比較部(ロ)、32・・・・・・比較部0.40
・・・・・・標準%徴(イ)メモリ、41・・・・・・
標準%徴(ロ)メモリ、42・・・・・・標準特徴(9
メモリ、50・・・・・・判定部、60・・・・・・認
識結果出力端子。 5−
FIG. 1 is an Abe block diagram of an embodiment of the present invention. 10-...Audio input end a, 20...feature extraction section, 30...comparison section (a), 31...
...Comparison part (b), 32...Comparison part 0.40
...Standard percentage (a) Memory, 41...
Standard % characteristics (b) Memory, 42...Standard characteristics (9
Memory, 50... Judgment unit, 60... Recognition result output terminal. 5-

Claims (1)

【特許請求の範囲】[Claims] 入力音声の有する異なる特徴パラメータについて標準パ
ラメータと比較を行なう比較部と、該比較部の出力を重
みつきで総合判定する判定部とを有することを特徴とす
るi−声認識装置。
An i-voice recognition device comprising: a comparison section that compares different characteristic parameters of input speech with standard parameters; and a judgment section that makes a weighted comprehensive judgment on the output of the comparison section.
JP57025701A 1982-02-19 1982-02-19 Voice recognition unit Pending JPS58143396A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57025701A JPS58143396A (en) 1982-02-19 1982-02-19 Voice recognition unit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57025701A JPS58143396A (en) 1982-02-19 1982-02-19 Voice recognition unit

Publications (1)

Publication Number Publication Date
JPS58143396A true JPS58143396A (en) 1983-08-25

Family

ID=12173082

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57025701A Pending JPS58143396A (en) 1982-02-19 1982-02-19 Voice recognition unit

Country Status (1)

Country Link
JP (1) JPS58143396A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4664775A (en) * 1982-07-31 1987-05-12 Toa Nenryo Kogyo Kabushiki Kaisha Method for manufacturing low pour point petroleum product with zeolite TSZ
JPS6318400A (en) * 1986-07-10 1988-01-26 株式会社デンソー Voice recognition/processing controller
JPH08266747A (en) * 1995-03-31 1996-10-15 Matsushita Electric Ind Co Ltd Voice recognition device, reacting device, reaction selecting device, and reactive toy using those devices
WO2000022607A1 (en) * 1998-10-09 2000-04-20 Sony Corporation Learning device and method, recognizing device and method, and recording medium

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4664775A (en) * 1982-07-31 1987-05-12 Toa Nenryo Kogyo Kabushiki Kaisha Method for manufacturing low pour point petroleum product with zeolite TSZ
JPS6318400A (en) * 1986-07-10 1988-01-26 株式会社デンソー Voice recognition/processing controller
JPH08266747A (en) * 1995-03-31 1996-10-15 Matsushita Electric Ind Co Ltd Voice recognition device, reacting device, reaction selecting device, and reactive toy using those devices
WO2000022607A1 (en) * 1998-10-09 2000-04-20 Sony Corporation Learning device and method, recognizing device and method, and recording medium
US6449591B1 (en) 1998-10-09 2002-09-10 Sony Corporation Learning apparatus, learning method, recognition apparatus, recognition method, and recording medium
KR100729316B1 (en) * 1998-10-09 2007-06-19 소니 가부시끼 가이샤 Learning device and method, recognizing device and method, and recording medium

Similar Documents

Publication Publication Date Title
NL9220005A (en) SWITCH FOR LOOKING FOR A MAXIMUM VALUE.
CN108877783A (en) The method and apparatus for determining the audio types of audio data
JPH02238588A (en) Recognizing device
CN109635755A (en) Face extraction method, apparatus and storage medium
CN107729491A (en) Improve the method, apparatus and equipment of the accuracy rate of topic answer search
JPS58143396A (en) Voice recognition unit
CN108614810A (en) Complain hot spot automatic identifying method and device
JPS6297000A (en) Analysus of sound
CN110390344B (en) Alternative frame updating method and device
CN110874567B (en) Color value judging method and device, electronic equipment and storage medium
JPS61148497A (en) Standard pattern generator
JP3352144B2 (en) Voice recognition device
SU618759A1 (en) Image recognition device
CN111739042B (en) Complex background power line extraction method based on digital image features
JPH0321948B2 (en)
JPS58208795A (en) Rejection system for voice recognition
CN113674755A (en) Voice processing method, device, electronic equipment and medium
JPH04175885A (en) Picture feature selecting system for generalized hough transformation
JPH06167997A (en) Speech recognizing device
CN113434731A (en) Music video genre classification method and device, computer equipment and storage medium
JPH04151700A (en) Pattern matching control circuit
WO2002067245A1 (en) Speaker verification
JPH0115079B2 (en)
JPS63298624A (en) Comparing device
JPS62201498A (en) Voice recognition