JPH04136900A

JPH04136900A - Voice recognition processing method for voice input/output device

Info

Publication number: JPH04136900A
Application number: JP2257282A
Authority: JP
Inventors: Toru Miyamae (宮前 徹); Waichiro Tsujita (辻田 和一郎)
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1990-09-28
Filing date: 1990-09-28
Publication date: 1992-05-11

Abstract

PURPOSE: To select a recognized word matching an input voice without re-inputting the voice, by adding +1 to the contents of the misrecognition count memory corresponding to a recognition code number when that number is selected and determined as a 2nd or subsequent candidate. CONSTITUTION: The voice input person confirms that the spoken word is displayed on a display 8 as the 2nd candidate, and then presses a cursor key 9b once. A CPU 1 reads the contents of a register R2 and moves the cursor to below the 2nd candidate. When a confirmation key 9a is pressed after the movement of the cursor to the 2nd candidate has been confirmed, +1 is added to the contents of the misrecognition count memory M1 corresponding to the recognition code number. When no displayed recognized word matches, the remaining recognized words are displayed on the display 8 in descending order of misrecognition count. Consequently, the recognized word matching the input voice can be selected without re-inputting the voice.

Description

DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a voice input/output device, and particularly to a voice recognition processing method.

(Prior Art) In general, a voice input/output device is roughly divided into voice recognition processing means for recognizing an input voice signal, output processing means for producing output based on the recognition result, and control processing means for controlling the voice recognition processing means and the output processing means. The voice recognition processing means converts the input voice signal into a voice pattern using a known technique such as linear predictive coding analysis, matches it against N standard voice patterns registered in advance in a standard voice pattern memory using a known technique such as dynamic programming, and selects the recognition code number corresponding to a standard voice pattern with a high degree of similarity. Based on the selected recognition code number, the control processing means outputs data and control signals to the output processing means, for example a word processor, a voice printer, a speech synthesis translation device, guidance, or the control of electrical equipment. Based on the input data and control signals, the output processing means displays characters, prints, produces sound, operates a machine tool, or switches on electrical equipment.

However, since speech includes unstable elements, the voice recognition processing means does not always select the correct recognition code number for the input voice signal and may select an incorrect one. The voice input person must therefore check whether the selected recognition code number is correct. The conventional checking method was to select n (n < N) recognition code numbers in descending order of similarity to the input voice signal, display the recognized words corresponding to those recognition code numbers on the display means, and have the voice input person decide.

(Problems to be Solved by the Invention) In the conventional voice recognition processing method for a voice input/output device, if the recognized word for the input voice is not among the n recognized words selected as candidates in descending order of similarity, the voice must be input again, and this must be repeated until the word is selected as a candidate. This makes the device difficult for the voice input person to use.

An object of the present invention is to provide a voice recognition processing method for a voice input/output device with which, when the recognized word for the input voice is not among the n candidate recognized words selected by voice recognition processing, a recognized word matching the input voice can be selected without re-inputting the voice.

(Means for Solving the Problems) To achieve the above object, in the voice recognition processing method for a voice input/output device of the present invention, a misrecognition count memory is provided for each of the recognition code numbers assigned to the N standard voice patterns. When the recognition code number of the standard voice pattern corresponding to the input voice signal is not determined as the first of the n candidates but is selected and determined as the second or a subsequent candidate, or when it is selected and determined after recognition code numbers have been offered as candidates, in descending order of misrecognition count, from among the (N-n) recognition code numbers that remain after excluding the n selected ones, +1 is added to the contents of the misrecognition count memory corresponding to the input voice signal.

(Function) In the voice recognition processing method for a voice input/output device configured as described above, the input voice is converted into a voice pattern and compared in turn with the N standard voice patterns, and the recognition code numbers corresponding to n standard voice patterns are selected as candidates in descending order of similarity. The n recognized words are then displayed on the display means. The voice input person looks at the n displayed recognized words and, if the recognized word matching the input voice is the second or a later candidate, performs a selection operation and confirms it. On selection and confirmation, +1 is added to the contents of the misrecognition count memory corresponding to the recognition code number of the input voice. If none of the n displayed recognized words matches, recognized words are selected as candidates from among the (N-n) recognized words, excluding the n already selected as candidates, in descending order of misrecognition count, and are displayed. On selection and confirmation, +1 is added to the contents of the misrecognition count memory for the recognition code number of the input voice.
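The counting rule described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `confirm_choice`, the dictionary `miscounts`, and the list `candidates` are hypothetical names introduced here, and the patent itself describes dedicated memories and an adder rather than software data structures.

```python
from typing import Dict, List


def confirm_choice(chosen_code: int, candidates: List[int],
                   miscounts: Dict[int, int]) -> None:
    """Apply the counting rule when the user confirms a recognized word.

    candidates holds the n recognition code numbers in descending order
    of similarity (hypothetical representation). The count is incremented
    unless the confirmed word was the first candidate, which also covers
    a word confirmed from the fallback list outside the n candidates.
    """
    if not candidates or candidates[0] != chosen_code:
        miscounts[chosen_code] = miscounts.get(chosen_code, 0) + 1


# Example: "coffee" (code 1) was recognized only as the second candidate.
counts = {1: 0, 2: 0, 4: 0}
confirm_choice(1, [4, 1, 2], counts)   # second candidate: count goes up
confirm_choice(4, [4, 1, 2], counts)   # first candidate: count unchanged
```

A word confirmed as the first candidate leaves every count untouched, so the counts accumulate only for words the recognizer tends to rank too low.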

Therefore, when the recognized word for the input voice is not among the n selected recognized words, a recognized word matching the input voice can be selected without re-inputting the voice.

(Example) An embodiment of the present invention will be described with reference to the drawings. Elements common to the drawings are given the same reference numerals.

FIG. 1 is a block diagram of the configuration of an embodiment of the present invention. A main memory 2 (hereinafter memory 2) and misrecognition count processing means 3 are connected to a central processing unit 1 (hereinafter CPU 1) by bus lines 12 and 13, respectively. A voice recognition LSI 6, a standard voice pattern memory 7, a display 8, an operation section 9, a speech synthesis processing section 10, and a speech synthesis data memory 11 are also connected to the CPU 1 by a bus line 14. The misrecognition count processing means 3 consists of N misrecognition count memories M1 to MN and an adder 4, and each misrecognition count memory is connected to the adder 4 by a bus line 15. Each misrecognition count memory and the adder 4 are connected to the CPU 1 by the bus line 13 mentioned above. The voice recognition LSI 6 and a microphone 5 are connected by a line 16. The memory 2 stores a control program for controlling the voice input/output device and the display data of N recognized words associated with recognition code numbers 1 to N.

For example, if 16 bytes are used per word and the display data of 64 words are stored, a data storage area of 16 × 64 = 1024 bytes = 1 kB is allocated. The display data of the recognized words "coffee", "kocha" (black tea), "cocoa", "cola", ..., "beer", "whisky" are stored in that data storage area in association with recognition code numbers 1 to 64. The misrecognition count memories M1 to M64 are given addresses obtained by adding a fixed value X to the recognition code numbers 1 to 64. Thus, for example, the misrecognition count memory M1 has the address "X + 1" and stores the misrecognition count of the recognized word "coffee". The standard voice pattern memory 7 stores the 64 recognized words, spoken by the voice input person, as standard voice patterns. The recognized words are input in the order "coffee", "kocha", "cocoa", "cola", ..., "beer", "whisky". As they are input, the CPU 1 assigns recognition code numbers to the standard voice patterns in order and registers them in the standard voice pattern memory 7. The speech synthesis data memory 11 stores, for example, speech synthesis data of English conversational sentences in association with the recognition code numbers. The operation section 9 has a confirmation key 9a and a cursor key 9b.
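The memory arithmetic in this paragraph can be checked with a short Python sketch. The sizes (16 bytes per word, 64 words) and the "X + code number" addressing come from the text; the function names, the example value chosen for X, and the assumption that display data are laid out contiguously in code-number order are hypothetical.

```python
WORD_BYTES = 16   # bytes of display data per recognized word (from the text)
NUM_WORDS = 64    # number of recognized words (from the text)

# Size of the display data storage area: 16 x 64 = 1024 bytes = 1 kB.
DISPLAY_AREA_BYTES = WORD_BYTES * NUM_WORDS


def display_offset(code: int) -> int:
    """Byte offset of a word's display data, ASSUMING contiguous storage
    in recognition code number order (the text does not state the layout)."""
    if not 1 <= code <= NUM_WORDS:
        raise ValueError("recognition code number out of range")
    return (code - 1) * WORD_BYTES


def miscount_address(code: int, x: int) -> int:
    """Address of misrecognition count memory M<code>: fixed value X plus
    the recognition code number, as described in the text."""
    if not 1 <= code <= NUM_WORDS:
        raise ValueError("recognition code number out of range")
    return x + code


X = 0x2000  # hypothetical base value; the text only calls it "X"
address_m1 = miscount_address(1, X)   # address "X + 1" for "coffee"
```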

Next, the operation will be described with additional reference to FIG. 2.

FIG. 2 is an operation flowchart of the embodiment.

Suppose the voice input person, wishing to order coffee from a waiter at a cafe abroad, says "coffee" into the microphone 5 in step S1. Voice recognition processing is performed in step S2. The voice recognition LSI 6 converts the voice signal for "coffee" into a voice pattern, compares it in turn with the 64 standard voice patterns stored in the standard voice pattern memory 7, and selects n recognition code numbers in descending order of similarity, for example the three numbers "1", "4", and "2" for the recognized words "coffee", "cola", and "kocha". In step S3, the recognized words "coffee", "cola", and "kocha" are displayed on the display 8. That is, the CPU 1 reads from the voice recognition LSI 6 the recognition code number with the highest similarity among those selected, compares it with the recognition code numbers 1 to 64 stored in the memory 2, and displays the display data of the matching recognized word, "coffee", on the display 8 as the first candidate. "Cola" and "kocha" are displayed in the same way as the second and third candidates.
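The candidate selection of step S2 can be illustrated with the following Python sketch. It is not the method used inside the voice recognition LSI 6, which the text does not detail; it only assumes that each recognition code number has a numeric similarity score, and the scores and the name `top_candidates` are invented for illustration.

```python
import heapq
from typing import Dict, List


def top_candidates(similarity: Dict[int, float], n: int = 3) -> List[int]:
    """Return the n recognition code numbers with the highest similarity,
    ordered from most to least similar."""
    return heapq.nlargest(n, similarity, key=similarity.get)


# Hypothetical scores for codes 1 (coffee), 2 (kocha), 3 (cocoa), 4 (cola).
scores = {1: 0.9, 2: 0.7, 3: 0.5, 4: 0.8}
candidates = top_candidates(scores)   # first, second, third candidate
```

With these made-up scores the candidates come out as codes 1, 4, and 2, which matches the "coffee", "cola", "kocha" ordering used in the example above.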

At this time, the CPU 1 stores the recognition code numbers "1", "4", and "2" of the candidates at predetermined addresses. It also stores the position just below the first character of each displayed recognized word, as cursor display position data, in registers R1, R2, and R3 in the CPU 1. That is, the cursor display position below the first character of the first candidate "coffee" is stored in register R1, and the cursor display positions below the first characters of "cola" and "kocha" are stored in registers R2 and R3. The current cursor position data is stored in register R4.

In step S4, the voice input person confirms that the spoken word "coffee" is displayed on the display 8 as the first candidate, and in step S5 presses the confirmation key 9a of the operation section 9.

The CPU 1 compares the contents of register R4 with those of register R1, detects from the match that the first candidate has been selected, and outputs the recognition code number "1" of the first candidate, stored at a predetermined address in the memory 2, to the speech synthesis processing section 10.
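The way the cursor position identifies the chosen candidate can be sketched in Python as below. The register roles (R1 to R3 hold the cursor positions of the three candidates, R4 holds the current cursor position) follow the text; the concrete position values and the function name `selected_code` are hypothetical.

```python
from typing import List, Optional


def selected_code(r4: int, position_registers: List[int],
                  candidate_codes: List[int]) -> Optional[int]:
    """Compare the current cursor position (R4) with the stored candidate
    positions (R1, R2, R3) in turn and return the recognition code number
    of the matching candidate, or None if nothing matches."""
    for position, code in zip(position_registers, candidate_codes):
        if r4 == position:
            return code
    return None


# Hypothetical column positions for the three displayed words.
r1, r2, r3 = 0, 8, 16
codes = [1, 4, 2]          # first, second, third candidate

r4 = r1                    # cursor initially under the first candidate
first = selected_code(r4, [r1, r2, r3], codes)

r4 = r2                    # one press of the cursor key copies R2 into R4
second = selected_code(r4, [r1, r2, r3], codes)
```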

The relationship between the predetermined addresses in the memory 2 and registers R1, R2, R3, and R4 is as follows. The contents of register R4 are compared in turn with the contents of registers R1, R2, and R3, and when the contents of register R4 match those of R1, R2, or R3, the recognition code number of the first, second, or third candidate, respectively, is read from the predetermined address in the memory 2. In steps S6 and S7, the speech synthesis processing section 10 reads the speech synthesis data for the input recognition code number "1" from the speech synthesis data memory 11, synthesizes speech, and outputs, for example, "A cup of coffee please" from a speaker or the like. Now suppose that the voice recognition processing of step S2 selects, for the input voice "coffee", the recognition code numbers "4", "1", and "2" for "cola", "coffee", and "kocha" in descending order of similarity. In step S3, the display 8 then shows "cola", "coffee", and "kocha" as the first, second, and third candidates.

In step S4, the voice input person confirms that the spoken word "coffee" is displayed on the display 8 as the second candidate, and in step S8 presses the cursor key 9b of the operation section once. The CPU 1 reads the contents of register R2, transfers them to register R4, and moves the cursor to below the first character of the second candidate, "coffee". In step S9, the voice input person confirms that the cursor has moved to the second candidate, and in step S10 presses the confirmation key 9a. In step S11, the CPU 1 performs misrecognition count processing. That is, because the contents of register R4 match those of register R2, the CPU 1 reads the recognition code number "1" from the predetermined address in the memory 2 and adds the fixed value X to it to form an address. It then reads the contents of the misrecognition count memory M1, which corresponds to address "X + 1", into the adder 4, adds +1, and writes the result back to the misrecognition count memory M1. Finally, it outputs the recognition code number "1" to the speech synthesis processing section 10. Processing then ends after steps S6 and S7. If the input voice is recognized as the third candidate, steps S12 and S13 are added to the steps described above.
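Step S11 is a read, add, and write-back on the memory cell at address X + code number. A minimal Python sketch follows, modelling the misrecognition count memories as a dictionary keyed by address; the dictionary, the example value of X, and the name `record_misrecognition` are assumptions made for illustration, since the embodiment uses hardware memories and the adder 4.

```python
from typing import Dict


def record_misrecognition(memory: Dict[int, int], code: int, x: int) -> int:
    """Model of step S11: form the address X + code, read the stored count,
    add 1 (the role of adder 4), and write the result back."""
    address = x + code
    count = memory.get(address, 0)   # read the count memory contents
    count += 1                       # adder adds +1
    memory[address] = count          # write back to the same memory
    return count


X = 0x2000                           # hypothetical fixed value X
miscount_memory = {X + 1: 0}         # M1, for recognition code number 1
new_count = record_misrecognition(miscount_memory, 1, X)
```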

Now suppose that, as a result of voice recognition processing for the input voice "coffee", the candidates "cola", "kocha", and "cocoa" are displayed in that order on the display 8 in step S4. In this case the cursor key 9b is pressed three times. On the first press of the cursor key 9b the CPU 1 completes steps S8 and S9, on the second press it passes through steps S12 and S13, and on the third press it moves to step S15. In step S15, the recognized words other than "cola", "kocha", and "cocoa" are displayed on the display 8 in descending order of misrecognition count. That is, the contents of the misrecognition count memories are read out, sorted in descending order of misrecognition count using register R1, and stored in the data area of the memory 2. Duplicate misrecognition count values are then removed by comparison processing, producing an array of misrecognition count values in the data area of the memory 2.

Next, the largest misrecognition count value is transferred to register R1 and compared with the contents of the misrecognition count memories M1 to M64. Register R2 is used as a counter for this purpose and is initialized to "0". It is incremented by 1 each time a comparison is performed, so the counter value corresponds to the recognition code number. The comparison yields the recognition code number at which the contents of register R1 match the contents of one of the misrecognition count memories M1 to M64. The recognition code number thus obtained is compared with the recognition code numbers "4", "2", and "3" of "cola", "kocha", and "cocoa" stored at the predetermined addresses in the memory 2, and only recognition code numbers that match none of them are stored in the data area of the memory 2. In the same way, the next largest misrecognition count value is transferred from the misrecognition count value array to register R1, and the recognition code number corresponding to that value is obtained. It is compared with the recognition code numbers "4", "2", and "3" stored at the predetermined addresses in the memory 2, matching numbers are excluded, and the remaining recognition code numbers are transferred to the memory 2 and stored after the recognition code number corresponding to the largest misrecognition count value. Repeating this builds, in the memory 2, a recognition code number array corresponding to the misrecognition count value array. The CPU 1 reads from the memory 2 the display data of the recognized word corresponding to the recognition code number at the lowest index of the recognition code number array and displays it on the display 8. If this is the correct word in step S16, processing moves to step S10. If it is not the correct word in step S16, processing returns to step S14. When the cursor key 9b is pressed in step S14, the CPU 1 in step S15 reads from the memory 2 the display data of the recognized word corresponding to the recognition code number at the next lowest index of the recognition code number array and displays it on the display 8. Processing then ends after the steps described above.
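The construction of the recognition code number array in step S15 can be sketched in Python as follows. The sketch mirrors the described procedure (distinct count values in descending order, then a scan of M1 to MN for each value, skipping the candidates already shown) but uses ordinary lists instead of registers R1 and R2; the function name `fallback_order` and the sample counts are hypothetical.

```python
from typing import List


def fallback_order(miscounts: List[int], shown: List[int]) -> List[int]:
    """Build the recognition code number array used after all n displayed
    candidates have been rejected.

    miscounts[i] is the content of misrecognition count memory M(i+1).
    shown holds the n code numbers already displayed as candidates.
    """
    # Misrecognition count value array: distinct values, largest first.
    distinct_values = sorted(set(miscounts), reverse=True)

    order: List[int] = []
    for value in distinct_values:
        # Scan M1..MN; the running index plays the role of the counter
        # that corresponds to the recognition code number.
        for code, count in enumerate(miscounts, start=1):
            if count == value and code not in shown:
                order.append(code)
    return order


# Hypothetical counts for codes 1..6; codes 4, 2, 3 were already displayed.
order = fallback_order([0, 1, 0, 2, 5, 1], shown=[4, 2, 3])
```

Under this reading, words with equal counts appear in ascending code-number order, because each scan of the count memories runs from M1 upward.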

In this embodiment, a speech synthesis processing section is provided as the output processing means after voice recognition processing, but a word processor, a voice printer, a control section for electrical equipment, or the like may be provided instead. Also, in this embodiment misrecognition count memories and an adder are provided as the misrecognition count processing means, but the CPU may be used as the adder and a data area of the main memory may be used as the misrecognition count memories.

(Effects of the Invention) Since the present invention is configured as described above, it provides the following effect.

A misrecognition count memory is provided for each of the recognition code numbers assigned to the N standard voice patterns. When the recognition code number of the standard voice pattern corresponding to the input voice signal is not determined as the first of the n candidates but is selected and determined as the second or a subsequent candidate, or when it is selected and determined after recognition code numbers have been offered as candidates, in descending order of misrecognition count, from among the (N-n) recognition code numbers that remain after excluding the n selected ones, +1 is added to the contents of the misrecognition count memory corresponding to the input voice signal. As a result, when the recognized word for the input voice is not among the n selected recognized words, a recognized word matching the input voice can be selected without re-inputting the voice.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of the configuration of an embodiment of the present invention, and FIG. 2 is an operation flowchart of the embodiment. 1 ... CPU, 2 ... memory, 3 ... misrecognition count processing means, 4 ... adder, 6 ... voice recognition LSI, 7 ... standard voice pattern memory, 8 ... display, 9 ... operation section, 9a ... confirmation key, 9b ... cursor key, 10 ... speech synthesis processing section, 11 ... speech synthesis data memory.

Claims

[Claims] A voice recognition processing method for a voice input/output device in which an input voice signal is converted into a voice pattern, n (n < N) recognition code numbers assigned to standard voice patterns are selected from among N standard voice patterns, and one of them is determined, characterized in that a misrecognition count memory is provided for each of the recognition code numbers assigned to the N standard voice patterns, and in that, when the recognition code number of the standard voice pattern corresponding to the input voice signal is not determined as the first of the n candidates but is selected and determined as the second or a subsequent candidate, or when it is selected and determined after recognition code numbers have been offered as candidates, in descending order of misrecognition count, from among the (N-n) recognition code numbers excluding the n selected recognition code numbers, +1 is added to the contents of the misrecognition count memory corresponding to the input voice signal.