JPS63300297A

JPS63300297A - Voice recognition equipment

Info

Publication number: JPS63300297A
Application number: JP62133357A
Authority: JP
Inventors: 康弘小森
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1987-05-30
Filing date: 1987-05-30
Publication date: 1988-12-07

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention] [Industrial Field of Application] The present invention relates to a speech recognition device.

[Prior Art] In a conventional speech recognition word processor, for example, all input speech information is treated as word recognition candidates.

[Problems to be Solved by the Invention] Consequently, in such a speech recognition word processor, one that can handle foreign sounds could, for a speech input such as /fail/ ("file"), confuse "入る" (hairu, "to enter") and "ファイル" (fairu, "file") in the recognition result. Moreover, for every speech input, words requiring kanji conversion, foreign loanwords, ordinary Japanese written in hiragana, new words, onomatopoeia, and so on were all treated as candidate words, so a great deal of processing was wasted.

An object of the present invention is to provide a speech recognition device that eliminates the above drawbacks, speeds up recognition processing by selecting recognition candidates, and also makes it possible to input new words and onomatopoeic words.

[Means for Solving the Problems] The present invention comprises: speech recognition means that recognizes speech by matching input speech information against at least one set of standard pattern information; word matching means that obtains word recognition information by matching the output information of the speech recognition means against dictionary information as necessary; input word designation means for designating the input word; and selection means that, based on the designation made by the designation means, selects the standard pattern information to be applied in the speech recognition means and, as necessary, selects the dictionary information to be applied in the word matching means.

[Operation] According to the present invention, based on the designation made by the input word designation means, the standard pattern information to be applied in the speech recognition means is selected and, as necessary, the dictionary information to be applied in the word matching means is selected.

[Embodiments] In the present invention, the input speech may be either monosyllables or continuous words. Likewise, the standard patterns may be monosyllables, VCV units, phonemes, or other units. The following embodiment shows an example in which input speech is recognized by phoneme.

FIG. 1 is a block diagram of one embodiment of the present invention applied to a speech recognition word processor. 1 is an input speech signal, which is input to the phoneme recognition circuit 2.

3 is an input word designation signal, which the keyboard outputs when an input word designation key is pressed. 4 is a first switch which, based on the input word designation signal 3, selects whether the output signal of the foreign-sound standard pattern circuit 6 is supplied to the phoneme recognition circuit 2 in addition to the output signal of the standard pattern circuit 5. 7 is a word matching circuit, which matches the output signal of the phoneme recognition circuit 2 against the output of each dictionary described below, recognizes words in that signal, and outputs a recognition result 12. 8 is a second switch which, based on the input word designation signal 3, selects whether the output of any one of the kanji dictionary 9 (a dictionary of Japanese words written with kanji), the Japanese hiragana dictionary 10, and the foreign word dictionary 11 (words normally written in katakana) is supplied to the word matching circuit 7.

Next, the switching operations of the first switch 4 and the second switch 8 in response to the input word designation signal 3 will be described.

First, when the input word designation signal 3 designates a foreign word (concretely, for example, when katakana input is designated on the keyboard), the first switch 4 closes and selects the foreign-sound standard pattern circuit 6. At the same time, in the second switch 8, switch 2-3 closes to select the foreign word dictionary 11, and the other switches 2-1 and 2-2 open.

When the input word designation signal 3 designates kanji conversion, the first switch 4 opens. In the second switch 8, switch 2-1 closes to select the kanji dictionary 9 (the other switches 2-2 and 2-3 open). When the input word designation signal 3 designates hiragana, the first switch 4 opens and switch 2-2 of the second switch 8 closes to select the Japanese hiragana dictionary 10 (the other switches 2-1 and 2-3 open). When the input word designation signal 3 combines a no-conversion designation with a hiragana or katakana designation, the first switch 4 closes to select the foreign-sound standard pattern circuit 6, all switches of the second switch 8 open, and no dictionary is used. In this case, the word matching circuit converts the signal from the phoneme recognition circuit 2 directly into characters, hiragana as hiragana and katakana as katakana, and outputs the recognition result 12.
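The switching behavior described above amounts to a small lookup from the designated input mode to a pair of settings: whether the foreign-sound standard patterns are added (first switch 4), and which dictionary, if any, is passed to the word matching circuit (second switch 8). The following is a minimal sketch of that mapping only. The names `InputMode`, `Selection`, `SWITCH_TABLE`, and `select` are hypothetical and do not appear in the patent; the dictionary labels simply echo the reference numerals.

```python
from dataclasses import dataclass
from enum import Enum, auto
from typing import Optional


class InputMode(Enum):
    """Hypothetical names for the designations carried by signal 3."""
    FOREIGN_WORD = auto()      # e.g. katakana input designated on the keyboard
    KANJI_CONVERSION = auto()
    HIRAGANA = auto()
    NO_CONVERSION = auto()     # hiragana or katakana with no conversion


@dataclass(frozen=True)
class Selection:
    # True when first switch 4 is closed, adding foreign-sound patterns (circuit 6)
    use_foreign_sound_patterns: bool
    # Dictionary passed through second switch 8, or None when all switches are open
    dictionary: Optional[str]


# One row per case described in the text for the first embodiment.
SWITCH_TABLE = {
    InputMode.FOREIGN_WORD: Selection(True, "foreign_word_dictionary_11"),
    InputMode.KANJI_CONVERSION: Selection(False, "kanji_dictionary_9"),
    InputMode.HIRAGANA: Selection(False, "hiragana_dictionary_10"),
    InputMode.NO_CONVERSION: Selection(True, None),
}


def select(mode: InputMode) -> Selection:
    """Return the switch settings for a designated input mode."""
    return SWITCH_TABLE[mode]
```

Written this way, the point of the design is visible at a glance: only the foreign-word and no-conversion modes widen the phoneme inventory, and only the no-conversion mode bypasses the dictionaries entirely.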

FIG. 2 is a block diagram of another embodiment of the present invention applied to a speech recognition word processor. As shown in the figure, the phoneme recognition circuit 21 matches the input speech information 1 against the normal standard patterns and foreign-sound patterns 56. The phoneme selection circuit 41 then selects phonemes from the output of circuit 21 based on the input word designation signal 3. The word matching circuit 71 matches the output of circuit 41 against the kanji, hiragana, and foreign word dictionary 91, and the word selection circuit 81 selects words from the output of the matching circuit 71 based on the input word designation signal 3 so as to obtain the relevant word. Highly efficient speech recognition is possible with this configuration as well.
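The second embodiment recognizes against everything first and filters afterwards, once at the phoneme level and once at the word level. The sketch below illustrates that ordering on toy data. It rests on several assumptions that are not in the patent: the romanized phoneme symbols, the two-entry dictionary, the string mode labels, and the rule that foreign-sound candidates are kept only when a foreign word is designated (chosen by analogy with the first embodiment). All function and type names are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class PhonemeCandidate(NamedTuple):
    symbol: str
    is_foreign_sound: bool  # True if it came from the foreign-sound patterns


class Entry(NamedTuple):
    phonemes: Tuple[str, ...]
    surface: str
    category: str  # "kanji", "hiragana", or "foreign"


# Toy stand-in for the combined dictionary 91 (illustrative entries only).
COMBINED_DICTIONARY = [
    Entry(("h", "a", "i", "r", "u"), "入る", "kanji"),
    Entry(("f", "a", "i", "r", "u"), "ファイル", "foreign"),
]


def select_phonemes(slots: List[List[PhonemeCandidate]], mode: str) -> List[List[str]]:
    """Phoneme selection (circuit 41): drop foreign sounds unless mode allows them."""
    allow_foreign = mode == "foreign"  # assumption, see lead-in
    return [
        [c.symbol for c in slot if allow_foreign or not c.is_foreign_sound]
        for slot in slots
    ]


def match_words(symbol_slots: List[List[str]]) -> List[Entry]:
    """Word matching (circuit 71): keep entries whose phonemes fit every slot."""
    return [
        e for e in COMBINED_DICTIONARY
        if len(e.phonemes) == len(symbol_slots)
        and all(p in slot for p, slot in zip(e.phonemes, symbol_slots))
    ]


def select_words(entries: List[Entry], mode: str) -> List[str]:
    """Word selection (circuit 81): keep only words of the designated category."""
    return [e.surface for e in entries if e.category == mode]


def recognize(slots: List[List[PhonemeCandidate]], mode: str) -> List[str]:
    return select_words(match_words(select_phonemes(slots, mode)), mode)


# An ambiguous first phoneme ("h" vs. foreign "f") followed by four clear ones.
SLOTS = [
    [PhonemeCandidate("h", False), PhonemeCandidate("f", True)],
    [PhonemeCandidate("a", False)],
    [PhonemeCandidate("i", False)],
    [PhonemeCandidate("r", False)],
    [PhonemeCandidate("u", False)],
]
```

With these toy inputs, `recognize(SLOTS, "foreign")` yields only "ファイル" and `recognize(SLOTS, "kanji")` yields only "入る", which is the confusion the designation signal is meant to resolve.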

[Effects of the Invention] As explained above, the present invention makes it possible to eliminate confusion between words and, by designating the dictionary to be used, to speed up processing. In addition, it makes it possible to input words that are not registered in a dictionary.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of one embodiment of the present invention, and FIG. 2 is a block diagram of another embodiment of the same. 4: first switch; 8: second switch.

Claims

[Scope of Claims]

1) A speech recognition device comprising: speech recognition means that recognizes speech by matching input speech information against at least one set of standard pattern information; word matching means that obtains word recognition information by matching the output information of the speech recognition means against dictionary information as necessary; input word designation means for designating the input word; and selection means that, based on the designation made by the designation means, selects the standard pattern information to be applied in the speech recognition means and, as necessary, selects the dictionary information to be applied in the word matching means.

2) The speech recognition device according to claim 1, wherein, when the input word designation means designates a foreign word, the selection means selects normal standard pattern information and foreign-sound standard pattern information as the applied standard pattern information, and selects foreign word dictionary information as the applied dictionary information.

3) The speech recognition device according to claim 1, wherein, when the input word designation means designates kanji conversion, the selection means selects normal standard pattern information as the applied standard pattern information, and selects kanji dictionary information as the applied dictionary information.

4) The speech recognition device according to claim 1, wherein, when the input word designation means designates hiragana, the selection means selects normal standard pattern information as the applied standard pattern information, and selects Japanese hiragana dictionary information as the applied dictionary information.

5) The speech recognition device according to claim 1, wherein, when the input word designation means designates hiragana with no conversion or katakana with no conversion, the selection means selects normal standard pattern information and foreign-sound standard pattern information as the applied standard pattern information, and selects no dictionary information.