JPS58192095A - Voice recognition equipment - Google Patents

Voice recognition equipment

Info

Publication number
JPS58192095A
Authority
JP
Japan
Prior art keywords
voice
presence signal
value
time
pattern
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP57074931A
Other languages
Japanese (ja)
Other versions
JPH0376473B2 (en)
Inventor
宏樹 大西
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanyo Electric Co Ltd
Sanyo Denki Co Ltd
Original Assignee
Sanyo Electric Co Ltd
Sanyo Denki Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanyo Electric Co Ltd, Sanyo Denki Co Ltd
Priority to JP57074931A
Publication of JPS58192095A
Publication of JPH0376473B2
Legal status: Granted

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention] The present invention relates to a speech recognition device.

A speech recognition device of this kind extracts feature parameters, for example a time series of frequency spectrum values, from an input signal containing a speech signal. It also detects the time region of the speech signal within this input signal, creates a speech pattern consisting of the time series of frequency spectrum values contained in that time region, and performs pattern recognition on it. To recognize input speech reliably, the device must therefore create a speech pattern that correctly represents the features of the input speech, and for this it must detect the time region of the speech signal accurately.

A conventional speech recognition device extracts the power value of the input signal. As shown in Fig. 1(a), it takes as the start point ts of the time region T of the speech signal the time at which the power value S exceeds a second specific value TH2, which is set higher than the noise signal level. It takes as the end point te of the time region the time at which the power value S has remained below the second specific value TH2 for a fixed time longer than the pauses that occur within speech, for example 200 msec. In addition, it requires that within the time region T the power value S exceed a first specific value TH1 (> TH2), which is set lower than the peak power value of speech. When the noise signal level is low, the time region T of the input speech can therefore be detected fairly accurately.

When ambient noise is loud, however, the level of the noise signal input together with the speech rises, so the second specific value TH2 must be set higher, as shown in Fig. 1(b). As a result, the time region T of the speech signal is not detected accurately, and the detected time region T' is shorter than the actual one.
2'l: It was necessary to set it to a large value, and therefore the time mji&T of audio No. 1 was not accurate.

Because of this problem, a conventional speech recognition device cannot obtain an accurate speech pattern representing the features of the speech, which leads to misrecognition of the input speech. The present invention was made to eliminate this drawback. It provides a speech recognition device equipped with means for removing the adverse effects of the noise signal and obtaining an accurate speech pattern.

Fig. 2 shows a block diagram of one embodiment of the speech recognition device of the present invention, and Fig. 3 shows a timing diagram of its operation. In these figures, (1) is a microphone for inputting speech, (2) is a microphone amplifier that amplifies the input signal, including the speech signal, from the microphone (1), and (3) is a parameter extraction circuit that extracts feature parameters X, for example a time series Xt of frequency spectrum values, from the input signal supplied by the microphone amplifier (2). (4) is a parameter buffer memory that stores the feature parameters Xt from the parameter extraction circuit (3) in time-series order. (5) is a power extraction circuit that extracts a time series St of the power value S of the input signal from the microphone amplifier (2), for example by calculating the sum of the frequency spectrum values over the entire frequency band.

(6) is a first comparison circuit. It compares the power value St from the power extraction circuit (5) with a first specific value TH1, like that shown in Fig. 1, and outputs a first voice presence signal V1 when St > TH1. (7) is a second comparison circuit. It compares the power value St from the power extraction circuit (5) with a second specific value TH2, like that shown in Fig. 1, and outputs a second voice presence signal V2 when St > TH2. (8) is a difference circuit that calculates the difference value ΔXt (= Xt − X(t−1)) of the parameters Xt from the parameter extraction circuit (3). (9) is a third comparison circuit. It compares the difference value ΔXt from the difference circuit (8) with a third specific value TH3 and outputs a third voice presence signal V3 when ΔXt > TH3.

This third specific value TH3 is set lower than the rate of change ΔXt of the feature parameters Xt that is characteristic of a speech signal, and higher than the rate of change ΔXt of the feature parameters Xt of ambient noise, which varies little. (10) is a timer circuit built from a counter. It receives the signal V2+V3, the logical OR (11) of the third voice presence signal V3 from the third comparison circuit (9) and the second voice presence signal V2 from the second comparison circuit (7), and generates a new fourth voice presence signal V4 for as long as V2+V3 is present and until 200 msec have elapsed after it ends. (12) is a first voice presence signal buffer memory that stores the time series V1t of the first voice presence signal V1 obtained from the first comparison circuit (6). (13) is a fourth voice presence signal buffer memory that stores the time series V4t of the fourth voice presence signal V4 obtained from the timer circuit (10). The signals V1t and V4t in the two buffer memories (12) and (13) are associated with the feature parameters Xt in the parameter buffer memory (4).

(14) is a readout control circuit for reading out the feature parameters Xt stored in the parameter buffer memory (4). Based on the first and fourth voice presence signal buffer memories (12) and (13), it takes as the start point of the speech time region T the onset ts of the fourth voice presence signal V4 immediately preceding the time t's at which the first voice presence signal V1 occurs. It takes as the end point of the time region T the time 200 msec before the end t''e of the fourth voice presence signal V4 that immediately follows the time t'e at which the first voice presence signal V1 ends, that is, the end te of the third voice presence signal V3. It then reads out the time series Xts to Xte of the feature parameters Xt in the parameter buffer memory (4) contained in this time region T: ts to te. (15) is an input speech pattern memory, which stores the time series Xts to Xte of the feature parameters Xt read out from the parameter buffer memory (4) as the input speech pattern.

(16) is a pattern recognition circuit. It compares the input speech pattern in the input speech pattern memory (15) with a large number of registered speech patterns stored in advance and selects the most similar registered speech pattern, whereby the input speech is recognized.

In a speech recognition device configured in this way, as shown in Fig. 3, the time region T of the speech signal contains the time regions T1 and T2 of two signals. The first voice presence signal V1 indicates, as in the conventional example of Fig. 1, that the power value S has a peak characteristic of a speech signal. The second voice presence signal V2 indicates that a speech signal power value S above the noise signal level has been obtained. The device differs from the conventional one in that the start and end points of the time region T are determined from the start and end points of the third voice presence signal V3, which indicates that the rate of change ΔX of the feature parameters X exceeds the rate of change of the feature parameters of a noise signal. Furthermore, within this time region, an interruption of the voice presence signal V3 lasting less than 200 msec is treated as a pause within the speech signal.

Accordingly, even when this speech recognition device is used where the noise signal level is high, as shown in Fig. 4, it suffices to set the second specific value TH2 to a value TH2' above that noise signal level. The start point ts and end point te of the speech time region T are derived very accurately from the third voice presence signal V3, which does not depend on the noise signal level. The input speech pattern memory (15) therefore holds a speech pattern made up of the feature parameters Xt contained in the accurate speech time region T, and the pattern recognition circuit (16) recognizes the input speech reliably.

Fig. 5 shows the main part of another embodiment of the speech recognition device of the present invention. This embodiment includes a noise parameter memory (17). The average value of the noise parameters X't in the noise parameter memory (17) is subtracted from each feature parameter Xt, which still contains a noise component, of the input speech pattern in the input speech pattern memory (15). The input speech pattern is thereby reconstructed with the noise component removed and stored in a new input speech pattern memory (15'). The feature parameters X't of the noise signal stored in the noise parameter memory (17) are the feature parameters Xt read out from the parameter buffer memory (4) by the readout control circuit (14) for the time region in which neither the first voice presence signal V1 nor the fourth voice presence signal V4 appears.
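The noise compensation of this second embodiment can be sketched as a per-band mean subtraction. The function name and data layout are assumptions; the text does not say whether negative results are clipped, so this sketch leaves them as they are.

```python
def subtract_noise(frames, v1, v4, ts, te):
    """Return the frames ts..te with the average noise spectrum subtracted.

    frames -- all buffered feature frames Xt (list of lists, assumed layout)
    v1, v4 -- per-frame flags of the first and fourth voice presence signals
    ts, te -- inclusive start and end frame indices of the speech region T
    """
    # Noise frames X't: those where neither V1 nor V4 is present.
    noise = [x for x, a, b in zip(frames, v1, v4) if not a and not b]
    speech = frames[ts:te + 1]
    if not noise:
        return [list(frame) for frame in speech]   # nothing to compensate with

    # Average the noise parameters band by band.
    mean = [sum(band) / len(noise) for band in zip(*noise)]

    # Subtract the average noise from every frame of the input pattern.
    return [[x - m for x, m in zip(frame, mean)] for frame in speech]
```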

In an embodiment configured in this way, the noise component is removed from the input speech pattern stored in the new input speech pattern memory (15'), so pattern recognition in the pattern recognition circuit (16), that is, speech recognition, is performed accurately.

As is clear from the above description, the speech recognition device of the present invention determines the start of the speech time region T from the onset of the third voice presence signal, which is produced by comparing the time difference of the speech feature parameter time series with a fixed value. An accurate input start time is therefore obtained for the input speech. Likewise, it determines the end of the speech time region T from the end of the third voice presence signal, so an accurate input end time is obtained. An accurate speech pattern can then be created from the feature parameters contained in this time region T. Moreover, even if the third voice presence signal is interrupted for less than a fixed time, the device detects a single continuous speech time region, which avoids splitting the speech at its pauses.

Furthermore, the speech pattern is created from a time series of feature parameters in which the speech feature parameters have been compensated using the noise feature parameters. Pattern recognition is thus performed on a speech pattern free of the adverse effects of noise signals such as ambient noise, so the risk of misrecognition is reduced and a substantial improvement in the recognition rate can be expected.

[Brief Description of the Drawings]

Figs. 1(a) and 1(b) are timing diagrams showing the operation of a conventional speech recognition device. Fig. 2 is a block diagram showing the configuration of one embodiment of the speech recognition device of the present invention. Figs. 3 and 4 are timing diagrams showing the operation of the device of the present invention. Fig. 5 is a block diagram showing the main part of another embodiment of the device of the present invention.

(3): parameter extraction circuit
(4): parameter buffer memory
(5): power extraction circuit
(6): first comparison circuit
(7): second comparison circuit
(8): difference circuit
(9): third comparison circuit
(10): timer circuit
(12): first voice presence signal buffer memory
(13): fourth voice presence signal buffer memory
(14): readout control circuit
(15), (15'): input speech pattern memory
(16): pattern recognition circuit
(17): noise parameter memory

Applicant: Sanyo Electric Co., Ltd.
Agent: Patent Attorney Shizuo Sano

Claims (1)

[Claims]

1) A speech recognition device that performs pattern recognition on a speech pattern consisting of a time series of speech feature parameters, comprising: power extraction means for extracting the power of the input speech; a difference circuit for calculating difference values of the time series of speech feature parameters; a first comparison circuit that outputs a first voice presence signal during the period in which the power value from the power extraction means exceeds a first specific value; a second comparison circuit that outputs a second voice presence signal during the period in which the power value from the power extraction means exceeds a second specific value lower than the first specific value; and a third comparison circuit that outputs a third voice presence signal during the period in which the difference value from the difference circuit exceeds a third specific value; wherein, based on the voice presence signals from these comparison circuits, the speech pattern is created from the time series of speech feature parameters from the onset of the third voice presence signal immediately preceding the onset of a second voice presence signal that contains at least one first voice presence signal.

2) The speech recognition device according to claim 1, wherein, based on the voice presence signals from the comparison circuits, the speech pattern is created from the time series of speech feature parameters up to the end of the third voice presence signal immediately following the end of the second voice presence signal that contains at least one first voice presence signal.

3) The speech recognition device according to claim 1 or 2, wherein timer means is provided for the third comparison circuit, and the timer means makes the third voice presence signal continuous only when the time from the end of the third voice presence signal from the third comparison circuit to its next onset is less than a fixed time.

4) The speech recognition device according to claim 1, 2, or 3, wherein noise feature parameters are obtained during a period in which none of the voice presence signals from the comparison circuits is obtained, and the speech pattern is created by compensating the values of the speech feature parameters with the values of the noise feature parameters.

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57074931A JPS58192095A (en) 1982-05-04 1982-05-04 Voice recognition equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57074931A JPS58192095A (en) 1982-05-04 1982-05-04 Voice recognition equipment

Publications (2)

Publication Number Publication Date
JPS58192095A true JPS58192095A (en) 1983-11-09
JPH0376473B2 JPH0376473B2 (en) 1991-12-05

Family

ID=13561587

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57074931A Granted JPS58192095A (en) 1982-05-04 1982-05-04 Voice recognition equipment

Country Status (1)

Country Link
JP (1) JPS58192095A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6267598A (en) * 1985-09-20 1987-03-27 株式会社リコー Voice section detection system

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS56135898A (en) * 1980-03-26 1981-10-23 Sanyo Electric Co Voice recognition device

Also Published As

Publication number Publication date
JPH0376473B2 (en) 1991-12-05
