JPS59184398A - Voice recognition equipment - Google Patents
Voice recognition equipmentInfo
- Publication number
- JPS59184398A JPS59184398A JP58058871A JP5887183A JPS59184398A JP S59184398 A JPS59184398 A JP S59184398A JP 58058871 A JP58058871 A JP 58058871A JP 5887183 A JP5887183 A JP 5887183A JP S59184398 A JPS59184398 A JP S59184398A
- Authority
- JP
- Japan
- Prior art keywords
- section
- voice recognition
- recognition
- recognition equipment
- present
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.
Description
【発明の詳細な説明】 本発明は音声認識装置に関する。[Detailed description of the invention] The present invention relates to a speech recognition device.
近年、コンビーータや各種制御装置等にオケル入力装置
として音声認識装置が本格的実用期を迎えるに至ってい
る。人間の話す言葉をそのまま認識できる音声認識装置
は利用のための特別な8+1練もいらず視線や手足が拘
束されないなど数々のオU点があることは既に周知の通
夛でおるが、現在本格的実用期に入っているいわゆる特
定話者用の単語音声認識装置においてはそのような利点
のある一方で、認識が単語あるいは単語列を単位として
行われるため、定形叙述文などの認識にはそのままでは
適用できないという欠点がある。In recent years, voice recognition devices have come into full-scale practical use as input devices for converters and various control devices. It is already well known that voice recognition devices that can recognize the words spoken by humans have many merits, such as no special 8+1 training is required to use them, and the eyes and hands and feet are not restricted. While word speech recognition devices for so-called specific speakers, which are now in practical use, have such advantages, recognition is performed in units of words or word strings, so they are not suitable for recognition of fixed descriptive sentences, etc. The disadvantage is that it cannot be applied.
これに対し、日本では表音文字により言語体系が構成さ
れていることを利用した単音節認識装置が出現している
。これによれば単音節を複数個発声することで任意の叙
述文を構成することができるが、現在のところこのよう
な単音節認識装置の認識率即ち発声された音声をどれだ
け正確に認識できるかの割合は単語音声認識装置のそれ
に比べわずかに及ばないという状態にある。また、この
ような方法は日本語のように表音文字によ多言語体系が
構成されている言語にしか適用できないという欠点を有
している。このような現状の一方で、実用という面から
は頻出する叙述文はかなシ限定されておシ、典型的ない
くつかの定形叙述文形を想定すればその殆どが包含でき
る。これは日本語に限らず他の言語でも適用可能な論理
であシ、単音節認識装置によらずとも殆どの定形叙述文
が、限られた数の単語の組み合わせで構成可能であるこ
とを意味する。In contrast, in Japan, a monosyllable recognition device has appeared that takes advantage of the fact that the language system is composed of phonetic characters. According to this, it is possible to construct any descriptive sentence by uttering multiple monosyllables, but at present the recognition rate of such monosyllable recognition devices, that is, how accurately the uttered sounds can be recognized. This ratio is slightly lower than that of word speech recognition devices. Furthermore, this method has the disadvantage that it can only be applied to languages such as Japanese, which have a multilingual system composed of phonetic characters. On the other hand, from a practical point of view, frequently occurring descriptive sentences are limited to short sentences, and most of them can be included by assuming a few typical fixed descriptive sentence forms. This is a logic that can be applied not only to Japanese but also to other languages, and it means that most fixed-form descriptive sentences can be composed of a limited number of combinations of words without using a monosyllable recognition device. do.
本発明の目的は、任意の言語の文法に従って定形叙述文
を認識する音声認識装置を提供することである。An object of the present invention is to provide a speech recognition device that recognizes fixed predicate sentences according to the grammar of any language.
本発明によれば文法規則にしたがって文法的に有シ得な
い大形が発生しないように逐次認識対象語を予測選別し
、高い精度で定形叙述文を認識できる音声認識装置が得
られる。According to the present invention, it is possible to obtain a speech recognition device that can predict and select words to be recognized sequentially according to grammatical rules so that grammatically impossible large words do not occur, and can recognize fixed-form descriptive sentences with high accuracy.
以下本発明の一実施例の図を用いて本発明の詳細な説明
する。図面は本発明装置の一実施例を示したものであり
、本図において100は人間の音声を電気信号に変換す
るピックアップ部、200はピックアップ部lOOで得
られた電気信号を周波数分析する周波数分析部、300
は周波数分析部200 の出力を標本化・量子化する標
本化・量゛さ 子化部であjo、400
は標本化・量子化された音□
声をあらかじめ格納されている音声パターン即ち標準パ
ターンと比較し、その結果を判定する判定部、500は
該言語の文法規則にしたがった定形文形をあらかじめ記
憶しておく定形大形記憶部、600は標準パターン記憶
部である。The present invention will be described in detail below using the drawings of one embodiment of the present invention. The drawing shows one embodiment of the device of the present invention. In the drawing, 100 is a pickup section that converts human voice into an electrical signal, and 200 is a frequency analyzer that analyzes the frequency of the electrical signal obtained by the pickup section lOO. Department, 300
400 is a sampling/quantization unit that samples and quantizes the output of the frequency analysis unit 200;
500 is a judgment unit that compares the sampled and quantized sound □ voice with a pre-stored speech pattern, that is, a standard pattern, and judges the result; The regular large storage section 600 is a standard pattern storage section.
動作を説明すると、発声者の音声はピックアップ部10
0で電気信号に変換されて周波数分析部200へ入力さ
れここで周波数帯域毎のパワースペクトルに分割される
。この段階では電気信号は未だアナログ量であるが、通
常の場合は続く標本化・量子化部300においてこれら
パワースペクトルは標本化及び量子化をうけてディジタ
ル量になる。判定部400は予め定形大形記憶部500
を参照して次の発声でどのような種類の単語が認識され
るのが妥当かを調べ、標本化・量子化部300の出力と
比較されるべき標準パターンを標準パターン記憶部60
0から読み出し比較判定の演算を行う。このようにして
一度に認識対象とする標準パターンを限定することで、
認識スピードの向上と認識率の向上の両方の利点を得る
ことが可能となる。また判定部400 ではこのように
して得られた認識結果を任意の装置に出力する機能も有
する。定形大形記憶部500にあらかじめ記憶されるべ
き文法規則としては、例えば英語では基本文型S(王語
)+■(述語)+0(目的語)というような単語レベル
で展開されたものが考えられよう。To explain the operation, the voice of the speaker is picked up by the pickup section 10.
0, it is converted into an electrical signal and input to the frequency analysis section 200, where it is divided into power spectra for each frequency band. At this stage, the electrical signals are still analog quantities, but normally in the subsequent sampling/quantization section 300, these power spectra are subjected to sampling and quantization to become digital quantities. The determination unit 400 stores in advance a fixed large storage unit 500.
, the standard pattern to be compared with the output of the sampling/quantization unit 300 is stored in the standard pattern storage unit 60.
Read from 0 and perform computation for comparison and determination. By limiting the standard patterns to be recognized at one time in this way,
It is possible to obtain the advantages of both improved recognition speed and improved recognition rate. The determination unit 400 also has a function of outputting the recognition results obtained in this manner to an arbitrary device. Examples of grammatical rules that should be stored in advance in the fixed form large storage unit 500 include those developed at the word level, such as the basic sentence pattern S (King language) + ■ (Predicate) + 0 (Object) in English. Good morning.
本説明では、周波数分析部200 と標本化・量子化部
300で音声をディジタル化して認識動作を行うと説明
したが、マツチングのアルゴリズムによっては必ずしも
このような方法によらずともよく、そのような場合にお
いても本発明の意図するところはいささかも損われるも
のではない。また本説明ではピックアップ部100 を
用いて音声を収録すると説明したが、これにはマイクロ
フォンの他テープレコーダ等の装置を利用することも可
能である。In this explanation, it has been explained that the frequency analysis section 200 and the sampling/quantization section 300 digitize the audio and perform the recognition operation, but depending on the matching algorithm, this method may not necessarily be used. Even in such cases, the intent of the present invention is not impaired in the slightest. Furthermore, in this description, it has been explained that the pickup unit 100 is used to record audio, but it is also possible to use a device such as a tape recorder in addition to a microphone.
以上説明したように、本発明は任意の言語に固有の文法
規則を利用することで、文法的な誤シのない定形叙述文
を高い精度で一識するのにきわめて有効である。As explained above, the present invention is extremely effective in identifying fixed-form descriptive sentences without grammatical errors with high accuracy by using grammatical rules specific to any language.
図面は本発明の一実施例を示す構成図である。
図において、100・旧・・ピックアップ部、200・
・・・・・周波数分析部、30o・・・・・・標本化・
量子化部、400・・・・・・判定部、500・・・・
・・定形大形記憶部、600・・・・・・標準パターン
記憶部である。The drawing is a configuration diagram showing an embodiment of the present invention. In the figure, 100. Old pickup section, 200.
...Frequency analysis section, 30o...Sampling...
Quantization section, 400... Judgment section, 500...
. . . Regular large storage section, 600 . . . Standard pattern storage section.
Claims (1)
、所定の文法にしたがって逐次認識対象単語を予測選別
して定形叙述文を認識することを特徴とする音声認識装
置。1. A speech recognition device that recognizes a string of uttered input words, wherein the speech recognition device recognizes fixed-form descriptive sentences by sequentially predicting and selecting words to be recognized according to a predetermined grammar.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP58058871A JPS59184398A (en) | 1983-04-04 | 1983-04-04 | Voice recognition equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP58058871A JPS59184398A (en) | 1983-04-04 | 1983-04-04 | Voice recognition equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
JPS59184398A true JPS59184398A (en) | 1984-10-19 |
Family
ID=13096804
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP58058871A Pending JPS59184398A (en) | 1983-04-04 | 1983-04-04 | Voice recognition equipment |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS59184398A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003097497A (en) * | 2001-09-20 | 2003-04-03 | Taikisha Ltd | Ventilating fan improving structure |
-
1983
- 1983-04-04 JP JP58058871A patent/JPS59184398A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003097497A (en) * | 2001-09-20 | 2003-04-03 | Taikisha Ltd | Ventilating fan improving structure |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4720863A (en) | Method and apparatus for text-independent speaker recognition | |
US4284846A (en) | System and method for sound recognition | |
JPH09500223A (en) | Multilingual speech recognition system | |
Christiansen et al. | Detecting and locating key words in continuous speech using linear predictive coding | |
JPH0990974A (en) | Signal processor | |
JP2895493B2 (en) | Speaker identification device and method | |
US7788096B2 (en) | Method and apparatus for generating decision tree questions for speech processing | |
JPH11175082A (en) | Voice interaction device and voice synthesizing method for voice interaction | |
JP2003036097A (en) | Device and method for detecting and retrieving information | |
WO1983002190A1 (en) | A system and method for recognizing speech | |
JPS59184398A (en) | Voice recognition equipment | |
JPH07191696A (en) | Speech recognition device | |
RU2119196C1 (en) | Method and system for lexical interpretation of fused speech | |
JPS59184399A (en) | Voice recognition equipment | |
Olson et al. | Speech processing techniques and applications | |
KR20080065775A (en) | Phonation visualization system using lip language education | |
US20210225366A1 (en) | Speech recognition system with fine-grained decoding | |
JPS613241A (en) | Speech recognition system | |
Sakai et al. | Phonetic typewriter | |
US5899974A (en) | Compressing speech into a digital format | |
JPS58107598A (en) | Voice recognition equipment | |
JPS59224900A (en) | Voice recognition system | |
Desai et al. | Development of a personalized integrated voice recognition and synthesis system | |
JPS6227798A (en) | Voice recognition equipment | |
JPS62174800A (en) | Example pronunciation output unit for foreign language vowel |