JPH04181998A - Device and method for speech recognition - Google Patents

Device and method for speech recognition

Info

Publication number
JPH04181998A
JPH04181998A JP2311825A JP31182590A JPH04181998A JP H04181998 A JPH04181998 A JP H04181998A JP 2311825 A JP2311825 A JP 2311825A JP 31182590 A JP31182590 A JP 31182590A JP H04181998 A JPH04181998 A JP H04181998A
Authority
JP
Japan
Prior art keywords
standard pattern
speech
numeric data
input
standard
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2311825A
Other languages
Japanese (ja)
Inventor
Yoshihiro Akai
赤井 善裕
Kazuo Ishii
和夫 石井
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Engineering Ltd
Original Assignee
NEC Engineering Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Engineering Ltd filed Critical NEC Engineering Ltd
Priority to JP2311825A priority Critical patent/JPH04181998A/en
Publication of JPH04181998A publication Critical patent/JPH04181998A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To recognize the break of the head or tail of a voiced word and the mixture of a circumferential noise into the voiced word by comparing numeric data on an input speech signal with stored numeric data on standard patterns and recognizing the speech. CONSTITUTION:A speech input part 11 amplifies the input speech signal and generate numeric data by an A/D converter, etc., and a speech output part 12 converts the standard patterns stored in a storage part 14 into a speech signal by a D/A converter, etc., and outputs it. Then a control recognition part 13 controls respective functions and compares the numeric data inputted by the speech input part 11 with the standard patterns stored in the storage part 14 to perform a recognizing process. Further, when a standard pattern is registered, the control recognition part 13 stores the voice signal made into the numeric data by the voice input part 11 in the storage part 14 as the standard pattern with the command of an operation command part 15 according to a program in the storage part 14 and the voice output part 12 makes the standard pattern into a speech signal. Consequently, the break of the head or tail of the registered standard pattern, the mixture of the circumferential noise, etc., can be confirmed auditorily.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は音声認識装置および方法、とくに、特定話者方
式の音声認識装置および方法に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a speech recognition device and method, and particularly to a speaker-specific speech recognition device and method.

〔従来の技術〕[Conventional technology]

従来の特定話者方式の音声認識装置における標準パター
ンの登録は、第2図に示す操作指令部24の指令および
記憶部23のプログラムにより制御指令部22か、音声
入力部21で数値データ化された音声信号を記憶部23
に記憶するのみである。
Registration of a standard pattern in a conventional speaker-specific speech recognition device is performed by converting the standard pattern into numerical data by the control command unit 22 or the voice input unit 21 according to the commands from the operation command unit 24 and the program in the storage unit 23 shown in FIG. The recorded audio signal is stored in the storage unit 23.
It is only stored in .

〔発明が解決しようとする課題〕[Problem to be solved by the invention]

上述した従来の音声認識装置の標準パターンの登録方法
では装置内に入力された音声がどの様に記憶されたかが
わからず認識率に直接影響する標準パターンの良悪の判
定を発声を行った状況を判断し、個人の経験で行なうし
かないという問題がある。
With the standard pattern registration method of the conventional speech recognition device described above, it is not known how the input voice was stored in the device, so it is difficult to judge whether the standard pattern is good or bad, which directly affects the recognition rate. The problem is that you have no choice but to make a judgment and use your own experience.

つまり、発声語の語頭4語尾が切れていたり、発声語に
周囲の雑音が混じっている等の確認ができないどう欠点
があった。
In other words, it has the disadvantage that it is not possible to confirm whether the first four or last words of a spoken word are cut off, or whether surrounding noise is mixed in with the spoken word.

〔課題を解決するための手段〕[Means to solve the problem]

本発明の音声認識装置は、 (a)入力された音声信号を数値データ化する手段と、 (b)この数値データを標準パターンとして記憶する手
段と、 (c)記憶されている標準パターンの数値データを音声
信号化する手段と、 (d)入力された音声信号の数値データと記憶されてい
る標準パターンの数値データとを比較し、認識する手段
と、 (e)認識結果を装置外へ出力する手段、とを含んで構
成される。
The speech recognition device of the present invention includes (a) means for converting an input speech signal into numerical data, (b) means for storing this numerical data as a standard pattern, and (c) numerical values of the stored standard pattern. (d) A means for comparing and recognizing the numerical data of the input audio signal with the numerical data of the stored standard pattern; (e) Outputting the recognition result to the outside of the device. and a means for doing so.

〔実施例二。[Example 2.

以下本発明につき図面を参照して説明する、第1図は本
発明の一実施例を示すフロック図である。
The present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the present invention.

第1区に示す音声認識装置は音声入力部11゜音声出力
部12.制御認識部13.記憶部14゜操作指令部15
を具備する。
The voice recognition device shown in the first section includes a voice input section 11, a voice output section 12. Control recognition unit 13. Storage section 14゜operation command section 15
Equipped with.

音声入力部11は図示しないマイクロホンや無線入力装
置等から入力される音声信号を増幅し、A/D変換器等
により数値データ化を行なうものである。
The audio input section 11 amplifies an audio signal input from a microphone, wireless input device, etc. (not shown), and converts it into numerical data using an A/D converter or the like.

音声出力部12は図示しないイヤホン、スピーカーおよ
び無線出力装置に対し記憶部14に記憶されている標準
パターンをり、・A変換器等により音声信号化し出力を
行なうものである。
The audio output unit 12 converts the standard pattern stored in the storage unit 14 into an audio signal using an A converter or the like and outputs the standard pattern to earphones, speakers, and wireless output devices (not shown).

制御認識部13は各機能の制御および音声入力部11て
入力した数値データと記憶部14に言己憶されている標
準パターンとを比較し、認識処理を行なうものである。
The control recognition section 13 controls each function and compares the numerical data inputted through the voice input section 11 with a standard pattern stored in the storage section 14 to perform recognition processing.

記憶部14はプログラム、データおよび標準パターンを
9己憶するものである。
The storage unit 14 stores nine programs, data, and standard patterns.

操作指令部15はキーボードデイスプレィあるいはホス
トコンピュータ等外部接続機器との通信を行なうもので
ある。
The operation command unit 15 is for communicating with externally connected equipment such as a keyboard display or a host computer.

標準パターンの登録時には操作指令部15の指令および
記憶部14のプログラムにより制御認識部13か、音声
入力部11で数値データ化された音声信号を記憶部14
に標準パターンとして記憶し標準パターンを音声出力部
12で音声信号化する。
When registering a standard pattern, the control recognition section 13 or the voice input section 11 sends an audio signal converted into numerical data to the storage section 14 according to a command from the operation command section 15 and a program stored in the storage section 14.
The standard pattern is stored as a standard pattern in the audio output section 12 and converted into an audio signal.

以上説明した実施例では標準パターンの登録時のm能で
あるが、認識動作中に入力された音声を本機能により音
声出力しても良い。
In the embodiment described above, this function is used when registering a standard pattern, but the voice input during the recognition operation may be output as a voice using this function.

さらに標準パターンの音声出力機能は認識結果確認用音
声応答機能と兼ね合わせても良い。
Furthermore, the standard pattern audio output function may be combined with the recognition result confirmation audio response function.

〔発明の効果〕〔Effect of the invention〕

以上説明したように本発明は、登録した標準パターンに
語頭語尾の途切れ、周囲騒音の混己り等がないかを使用
者の聴覚で確認することができることにより、正しい標
準パターンの作成が容易となり、特定話者方式の音声認
識装置の認識率を向上させる効果がある。
As explained above, the present invention facilitates the creation of correct standard patterns by allowing the user to visually confirm whether the registered standard patterns are free from interruptions at the beginning or end of words, or if there is any interference with ambient noise. This has the effect of improving the recognition rate of a speaker-specific speech recognition device.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例を示すブロック図、第2図は
従来の一例を示すフロック図である。 ]1・・・音声入力部、12・・・音声出力部、13・
・・制御認識部、14・・・記憶部、15・・・操作指
令部、21・・音声入力部、22・・・制御認識部、2
3・・記憶部、24・・・操作指令部。
FIG. 1 is a block diagram showing an embodiment of the present invention, and FIG. 2 is a block diagram showing a conventional example. ]1... Audio input section, 12... Audio output section, 13.
...Control recognition unit, 14...Storage unit, 15...Operation command unit, 21...Voice input unit, 22...Control recognition unit, 2
3...Storage unit, 24...Operation command unit.

Claims (1)

【特許請求の範囲】 1、装置使用者の声による認識対象語(以後標準パター
ンと称す)を予め登録しておく特定話者方式の音声認識
装置において、入力された音声信号を数値データ化する
手段と、この数値データを標準パターンとして記憶する
手段と、記憶されている標準パターンの数値データを音
声信号化する手段と、入力された音声信号の数値データ
と記憶されている標準パターンの数値データとを比較し
認識する手段と、認識結果を装置外へ出力する手段とを
具備する音声認識装置。 2、装置使用者の声による認識対象語(以後標準パター
ンと称す)を予め登録しておく特定話者方式の音声認識
方法において、入力された音声信号を数値データ化する
手順と、この数値データを標準パターンとして記憶する
手順と、記憶されている標準パターンの数値データを音
声信号化する手順と、入力された音声信号の数値データ
と記憶されている標準パターンの数値データとを比較し
認識する手順と、認識結果を装置外へ出力する手順とを
具備する音声認識方法。
[Claims] 1. In a speaker-specific speech recognition device in which words to be recognized (hereinafter referred to as standard patterns) in the voice of the device user are registered in advance, an input speech signal is converted into numerical data. means for storing the numerical data as a standard pattern; means for converting the stored numerical data of the standard pattern into an audio signal; numerical data of the input audio signal and stored numerical data of the standard pattern. A speech recognition device comprising: a means for comparing and recognizing the results; and a means for outputting the recognition result to the outside of the device. 2. In a speaker-specific speech recognition method in which words to be recognized by the device user's voice (hereinafter referred to as standard patterns) are registered in advance, the procedure for converting input speech signals into numerical data and this numerical data A procedure for storing the numeric data of the memorized standard pattern as a standard pattern, a procedure for converting the numeric data of the memorized standard pattern into an audio signal, and comparing and recognizing the numeric data of the input audio signal with the numeric data of the memorized standard pattern. A speech recognition method comprising a procedure and a procedure for outputting a recognition result outside the device.
JP2311825A 1990-11-16 1990-11-16 Device and method for speech recognition Pending JPH04181998A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2311825A JPH04181998A (en) 1990-11-16 1990-11-16 Device and method for speech recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2311825A JPH04181998A (en) 1990-11-16 1990-11-16 Device and method for speech recognition

Publications (1)

Publication Number Publication Date
JPH04181998A true JPH04181998A (en) 1992-06-29

Family

ID=18021854

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2311825A Pending JPH04181998A (en) 1990-11-16 1990-11-16 Device and method for speech recognition

Country Status (1)

Country Link
JP (1) JPH04181998A (en)

Similar Documents

Publication Publication Date Title
US5983186A (en) Voice-activated interactive speech recognition device and method
JP3968133B2 (en) Speech recognition dialogue processing method and speech recognition dialogue apparatus
JP4837917B2 (en) Device control based on voice
JPH04204700A (en) Speech recognition device
US20070047708A1 (en) Voice call reply using voice recognition and text to speech
JP2019184809A (en) Voice recognition device and voice recognition method
EP1185976A1 (en) Speech recognition device with reference transformation means
JPS6126677B2 (en)
JPH04181998A (en) Device and method for speech recognition
JP2007286376A (en) Voice guide system
JPS63149699A (en) Voice input/output device
JP3846500B2 (en) Speech recognition dialogue apparatus and speech recognition dialogue processing method
JPS6126678B2 (en)
JP2020085942A (en) Information processing apparatus, information processing method, and program
JP2017092725A (en) Robot unit having vocalization function, vocalization control method and program
JPS61239358A (en) Documentation system by voice input
JP2005148764A (en) Method and device for speech recognition interaction
JP2975808B2 (en) Voice recognition device
JPH039400A (en) Voice recognizer
JP2744039B2 (en) Voice recognition device
JPS59201141A (en) Input device of sound information
JP2014021425A (en) Speech recognition system and integrated circuit device
JPS59176791A (en) Voice registration system
JPH0488399A (en) Voice recognizer
JPH03122696A (en) Voice recognizing device