JPH0375799A

JPH0375799A - Voice recognizing system

Info

Publication number: JPH0375799A
Application number: JP1213302A
Authority: JP
Inventors: Hiromi Shibuya; 渋谷　浩洋; Yasutomo Onishi; 大西　康友
Original assignee: Matsushita Refrigeration Co
Current assignee: Panasonic Holdings Corp
Priority date: 1989-08-18
Filing date: 1989-08-18
Publication date: 1991-03-29

Abstract

PURPOSE:To improve a voice recognition rate by providing a standard pattern memory means, voice analyzing means, standard pattern selecting means, means for sequentially selecting the standard pattern selecting means, comparing means, and utterance inducing means. CONSTITUTION:The utterance inducing means 5 induces utterance with 'Which do you select? and waits for the utterance for the flavor name from a customer when coins are charged into a coil receiving means 7. The standard pattern nearest the inputted voice patterns is selected from the standard patterns stored in the standard pattern memory means 3 by the standard pattern selecting means A 9 or standard pattern selecting means B 10, by which the flavor name is recognized. The flavor name is recognized by the means 5 and an answer is confirmed. A control means 6 pours drinks into a cup and ejects the cup by using a drink ejecting means 8. The standard pattern nearest the voice pattern extracted by the voice analyzing means is selected by the comparing means in such a manner and, therefore, the voice recognition system having the high recognition rate is obtd.

Description

【発明の詳細な説明】産業上の利用分野本発明は、特定話者及び不特定話者が入力した単語音声
を認識しその音声により数々の処理を行なうための音声
認識システムに間し、特に、不特定話者に間するもので
ある。DETAILED DESCRIPTION OF THE INVENTION Field of Industrial Application The present invention relates to a speech recognition system for recognizing word speech input by specific speakers and unspecified speakers and performing various processes using the speech. , for unspecified speakers.

従来の技術従来、カップ飲料等の自動販売機（以後、簡単にカップ
自販機と称する）を始めとする自販機用音声認識システ
ムは、第４図に示すように、まず、利用者がマイクロホ
ン１により入力した音声を音声分析手段２により分析し
て音声パターンを抽出する０分析には帯域通過フィルタ
ー群を使ったＢＰＦ（Ｂａｎｄ　　Ｐａｔｈ　　Ｆｌｌ
ｔｅｒ）分析結果を時間軸と周波数軸で標本化し、強度
をデジタル処理する手法を用いる。標準パターン記憶手
段８には、同様の方法により抽出した多数の不特定話者
が発声した複数の離散単語の音声パターンを標準パター
ンとして記憶しである。ただし、ここで標準パターンと
して記憶されている単語は、カップ自販機で販売するフ
レーバー（コーヒージュース等飲料の品名）の呼称とい
くつかの返答単語（はい、いいえ、ホット、アイス等）
である。2. Description of the Related Art Conventionally, a voice recognition system for a vending machine such as a vending machine for cup beverages (hereinafter simply referred to as a cup vending machine) first inputs information from a user through a microphone 1, as shown in FIG. The voice analyzed by the voice analysis means 2 is analyzed to extract a voice pattern using a BPF (Band Path Full) using a group of band pass filters.
ter) A method is used to sample the analysis results on the time and frequency axes and digitally process the intensity. The standard pattern storage means 8 stores, as standard patterns, audio patterns of a plurality of discrete words uttered by a large number of unspecified speakers, which are extracted by a similar method. However, the words stored as standard patterns are the name of the flavor sold at the cup vending machine (product name of coffee juice, etc.) and some response words (yes, no, hot, ice cream, etc.)
It is.

そして、標準パターン選出手段４で、標準パターンの中
から入カバターンに最も近い標準パターンをＤＰ（Ｄｙ
ｎａｍｉｃ　　Ｐｒｏｇｒａｍｉｎｇ）マツチング法に
より選び出す、ＤＰマツチング法とは動的計画法と訳さ
れ、１９５７年に米国のＢｅｌｌｍａｎが提案した数理
計画法の一手法で、多段決定過程の最適化に適用される
。その手法は、各段である決定（制御）を行なって状態
を変換させながら、目的に達するまでの過程での良さ／
悪さを評価する間数を最大／最小とするというものであ
る。また、認識システムが特定話者に対応する場合は、
標準パターン記憶手段３に特定話者が発声した認識単語
の音声パターンを登録し、一方不特定話者に対応する場
合は、不特定多数の話者が発声した認識単語の音声パタ
ーンの内、代表パターンのいくつかを登録する０発声誘
導手段５は、音声合成手段により構成され、後述する制
御手段６に応じて、利用者の発声を促すために音声によ
る発声を促す。ただし、フレーバー塩は、カップ自販機
前面のパネル板等に明記してあり、利用者はその中から
好みのフレーバー塩を１つ選んで発声するものである。Then, the standard pattern selection means 4 selects the standard pattern closest to the input cover turn from among the standard patterns as DP (Dy
The DP matching method, which is translated as dynamic programming, is a method of mathematical programming proposed by Bellman in the United States in 1957, and is applied to the optimization of multi-stage decision processes. The method involves making certain decisions (controls) at each stage to transform the state, and evaluating the quality and quality of the process until reaching the goal.
The idea is to maximize/minimize the number of times for evaluating badness. Also, if the recognition system corresponds to a specific speaker,
The voice pattern of the recognized word uttered by a specific speaker is registered in the standard pattern storage means 3, and when dealing with unspecified speakers, a representative voice pattern of the recognized word uttered by an unspecified number of speakers is registered. The speech guiding means 5, which registers some of the patterns, is constituted by a speech synthesizing means, and prompts the user to make vocalizations in accordance with the control means 6, which will be described later. However, the flavor salts are clearly marked on the panel board on the front of the cup vending machine, and the user selects one flavor salt of their choice and speaks it out loud.

制御手段６は、処理に応じて発声誘導手段５に誘導音声
の発声を指示し、また、標準パターン選出手段４により
ＤＰマツチング法を用いで選出された標準パターンと入
カバターンとの距離がリジェクトしきい値より小さけれ
ば入カバターンを認識したと判断すると共に、認識結果
により以後のカップ自販機の動作を制御し、リジェクト
しきい値より大きければリジェクトと判断して、再び標
準パターン選出手段４に選出された標準パターンが入力
されるのを待つ、リジェクトしきい値は小さいほど認識
率は上がるが、リジェクト回数も増えるため、ある程度
高い値に設定しなければならない、また、７はコインの
受取りと釣銭の払い戻しを行なうコイン受取り手段、８
は選択されたフレーバーをカップに注ぎ搬出する飲料搬
出手段である０次に、第５図に、従来の自販機用音声認
識システムの音声認識可能期間を示す。The control means 6 instructs the voice guidance means 5 to utter the guidance voice according to the process, and also determines whether the distance between the standard pattern selected by the standard pattern selection means 4 using the DP matching method and the input cover turn is rejected. If it is smaller than the threshold, it is determined that the input pattern has been recognized, and the subsequent operation of the cup vending machine is controlled based on the recognition result, and if it is larger than the reject threshold, it is determined to be rejected, and the pattern is selected again by the standard pattern selection means 4. The smaller the rejection threshold is, the higher the recognition rate will be, but the number of rejections will also increase, so it must be set to a reasonably high value. Coin receiving means for refund, 8
is a beverage dispensing means for pouring the selected flavor into a cup and discharging it. FIG. 5 shows the voice recognition possible period of the conventional voice recognition system for vending machines.

ｔｌは音声終了確認期間、ｔ２はパターン選出処理期間
である。第５図より、ｔｌ及びｔ２の期間は、音声認識
が不可能であることがわかる。tl is a voice end confirmation period, and t2 is a pattern selection processing period. From FIG. 5, it can be seen that speech recognition is impossible during periods tl and t2.

発明が解決しようとする課題しかしながら、上記のような方法では、ｔｌ及びｔ２の
期間は、発声者の音声を認識できない、また、標準パタ
ーン選出手段により選出された標準パターンの内最初に
リジエクトレきい値を満足した標準パターンを入カバタ
ーンとして認識するため、認識率が低下するという欠点
を有していた。Problems to be Solved by the Invention However, in the above method, the voice of the speaker cannot be recognized during the periods tl and t2. Since the standard pattern that satisfies the above criteria is recognized as the input pattern, it has the disadvantage that the recognition rate decreases.

本発明は上記従来のＨ題を解決するもので、音声認識不
可能である期間ｔ２をなくし、また、標準パターン選出
手段により選出された標準パターンの内、入カバターン
との距離が最も近いものを選択することにより、認識率
の高い音声認識システムを提供することを目的とする。The present invention solves the above-mentioned conventional problem H, eliminates the period t2 during which voice recognition is not possible, and selects the standard pattern that is closest to the input cover pattern among the standard patterns selected by the standard pattern selection means. The purpose is to provide a speech recognition system with a high recognition rate by selecting the following.

課題を解決するための手段この目的を達成するためは本発明の音声認識システムは
、複数の離散単語音声の標準パターン群を記憶した標準
パターン記憶手段と、発声者の音声を分析し音声パター
ンを抽出する音声分析手段と、前記音声分析手段により
抽出した音声パターンに最も近い標準パターンを前記標
準パターン群から選出する複数の標準パターン選出手段
と、前記複数の標準パターン選出手段中のいずれかの標
準パターン選出手段を順番に選択する選択手段と、選択
された標準パターン選出手段が選出した標準パターンの
内、前記音声分析手段により・抽出した音声パターンに
最も近い標準パターンを選択する比較手段と、発声者に
単語を発声するように誘導する発声誘導手段とからなる
構成を有している。Means for Solving the Problems In order to achieve this object, the speech recognition system of the present invention includes a standard pattern storage means that stores a group of standard patterns of a plurality of discrete word sounds, and a standard pattern storage means that analyzes the speech of a speaker and generates speech patterns. a speech analysis means for extracting, a plurality of standard pattern selection means for selecting from the standard pattern group a standard pattern closest to the speech pattern extracted by the speech analysis means, and any standard among the plurality of standard pattern selection means. a selection means for sequentially selecting pattern selection means; a comparison means for selecting a standard pattern closest to the speech pattern extracted by the speech analysis means from among the standard patterns selected by the selected standard pattern selection means; and utterance guidance means for guiding the person to utter the words.

作用この構成によって、複数の標準パターン選出手段を持つ
ことにより、１つの標準パターン選出手段がパターン選
出処理期間にある時は、他の標準パターン選出手段がパ
ターン選出処理を行なうことにより、音声認識不可能で
ある期間ｔ２をなくし、また、比較手段が複数の標準パ
ターン選出手段により選出された標準パターンの内、入
カバターンとの距離が最も近いものを選択することによ
り、認識率の高い音声認識システムを実現できることと
なる。Effect With this configuration, by having a plurality of standard pattern selection means, when one standard pattern selection means is in the pattern selection processing period, the other standard pattern selection means performs pattern selection processing, thereby preventing voice recognition failure. A speech recognition system with a high recognition rate is achieved by eliminating the possible period t2 and by selecting the standard pattern closest to the input cover pattern among the standard patterns selected by a plurality of standard pattern selection means. This means that it can be realized.

実施例以下本発明の一実施例について、図面を参照しながら説
明する。EXAMPLE An example of the present invention will be described below with reference to the drawings.

本実施例は、不特定話者に対する音声認識システムをカ
ップ自販機に適応したものである。ただし、構成要件中
、従来例と同構成のものは、同番号を付し、説明を割愛
する。第１図は、本発明の実施例における音声認識シス
テムの機能ブロック図を示すものである。９．１０はそ
れぞれ、標準パターン選出手段Ａ、標準パターン選出手
段Ｂであり、標準パターンの中から入カバターンに最も
近い標準バタ、−ンをＤＰ（Ｄｙｎａｍｉｃ　　Ｐｒｏ
ｇｒａｍｌｎｇ）マツチング法により選び出し音声を認
識するものである。１１は選択手段であり、前記複数の
標準パターン選出手段９．１０のいずれかの標準パター
ン選出手段を選択するものである。In this embodiment, a voice recognition system for unspecified speakers is applied to a cup vending machine. However, among the structural requirements, those having the same configuration as the conventional example are given the same numbers and explanations are omitted. FIG. 1 shows a functional block diagram of a speech recognition system in an embodiment of the present invention. 9.10 are standard pattern selection means A and standard pattern selection means B, respectively, which select the standard pattern closest to the input cover turn from among the standard patterns as DP (Dynamic Pro).
This method recognizes selected speech using a matching method (gramlng). Reference numeral 11 denotes a selection means, which selects one of the plurality of standard pattern selection means 9 and 10.

１２は比較手段であり、前記複数の標準パターン選出手
段９，１０が選出した標準パターンの内、入カバターン
との距離が最も近いものを選択するものである。第２図
に、本発明の実施例における自販機用音声認識システム
の音声認識可能期間を示すｍ　　ｊｌは音声終了確認期
間、ｔ２、ｔ２°はパターン選出処理期間である。第２
図に示すように、前記選択手段１１は前記標準パターン
選出手段Ａ９がパターン選出処理期間に入るまでは前記
標準パターン選択手段１１を選択し、その後は、前記標
準パターン選出手段ＢＩＯを選択する。そして、前記標
準パターン選出手段ＢＩＯがパターン選出処理期間に入
ると再び前記標準パターン選出手段Ａ９を選択し、以後
この一連の動作を繰り返す。Reference numeral 12 denotes a comparison means, which selects the standard pattern that is closest in distance to the input cover turn from among the standard patterns selected by the plurality of standard pattern selection means 9 and 10. FIG. 2 shows the voice recognition possible period of the voice recognition system for vending machines according to the embodiment of the present invention. m jl is a voice end confirmation period, and t2 and t2° are pattern selection processing periods. Second
As shown in the figure, the selection means 11 selects the standard pattern selection means 11 until the standard pattern selection means A9 enters the pattern selection processing period, and thereafter selects the standard pattern selection means BIO. Then, when the standard pattern selection means BIO enters the pattern selection processing period, it selects the standard pattern selection means A9 again, and thereafter repeats this series of operations.

したがって、例えば発声者が”え−と（音声入力期間）
、コーヒー（音声入力期間）”と発声した場合、従来の
自販機用音声認識システムでは、′コーヒー　という音
声は前記標準パターン選出手段Ａ９のパターン選出処理
期間（”え−と″という音声の処理期間）に発声される
ため認識不可能であったが、本発明の実施例では前記ｍ
準パターン選出手段ＢＩＯによって、認識されることと
なる。Therefore, for example, if the speaker says, ``Um (voice input period)''
, coffee (voice input period)'', in the conventional voice recognition system for vending machines, the voice ``coffee'' is the pattern selection processing period of the standard pattern selection means A9 (the processing period for the voice ``um''). However, in the embodiment of the present invention, the m
It will be recognized by the quasi-pattern selection means BIO.

次に、表１、表２を用いて前記比較手段１２の動作につ
いて説明する。Next, the operation of the comparison means 12 will be explained using Tables 1 and 2.

表１表２表１は、標準パターン選出手段が選出した標準パターン
及び入カバターンとの距離を示している。Table 1 Table 2 Table 1 shows the standard patterns selected by the standard pattern selection means and the distances from the input cover turns.

表１より、標準パターン選出手段Ａ、Ｂが選出した標準
パターンはそれぞれａ、ｂであり、選出した標準パター
ンと入カバターンとの距離はそれぞれＬ１ｅＬ２である
ことがわかる０表２は、Ｌ１ｔＬ２の大小関係と、前記
比較手段１２が選択する標準パターンを示している。表
２より、前記比較手段１２はＬｌ＜Ｌ２、Ｌ、＝Ｌ２の
ときは標準パターンａを選択し、　Ｌｌ＜Ｌ２のときは
標準パターンｂを選択することがわかる。したがって、
前記複数の標準パターン選出手段９．１０が選出した標
準パターンの内、前記音声分析手段２により抽出した音
声パターンに最も近い標準パターンを選択することとな
る。なお、本実施例では前記標準パターン選出手段９，
１０により選出された標準パターンが２つの場合につい
て説明したが９選出された標準パターンが２つよりも多
い場合も同様にして入力音声パターンに最も近い標準パ
ターンが選択されるのは言うまでもない。From Table 1, it can be seen that the standard patterns selected by standard pattern selection means A and B are a and b, respectively, and the distances between the selected standard patterns and the incoming cover turns are L1eL2, respectively.Table 2 shows the magnitude of L1tL2. The relationship and the standard pattern selected by the comparing means 12 are shown. From Table 2, it can be seen that the comparison means 12 selects the standard pattern a when Ll<L2, L,=L2, and selects the standard pattern b when Ll<L2. therefore,
Among the standard patterns selected by the plurality of standard pattern selection means 9.10, the standard pattern closest to the voice pattern extracted by the voice analysis means 2 is selected. In this embodiment, the standard pattern selection means 9,
Although the case where there are two standard patterns selected in 10 has been described, it goes without saying that even if there are more than two standard patterns selected in 9, the standard pattern closest to the input audio pattern is selected in the same way.

以上のように構成されたカップ自販機用音声認識システ
ムについて、第３図のフローチャートを用いてその販売
動作を説明する。第３図において、まず、ステップ２０
１で、前記コイン受取手段７にコインが投入されたか否
かを判定し、コインが投入されればステップ２０２に進
む、ステップ２０２では、前記発声誘導手段５により”
いらつしゃいませ、何になさいますか”と誘導し、客か
らのフレーバー名の発声を待゛つ。そして、ステップ２
０８で、前記標準パターン選出手段Ａ９あるいは前記標
準パターン選出手段ＢＩＯにより、前記標準パターン記
憶手段３に記憶されている標準パターンから、入力され
た音声パターンに最も近い標準パターンを選出してフレ
ーバー名を認識する。ステップ２０４では、ステップ２
０３での認識結果が適当か否かを判定し、リジェクトの
場合はステップ２０５へ進み、発声誘導手段５により”
もう−度お答え下さい”と誘導して２０３へ戻る。一方
、リジェクトでない場合はステップ２０６へ進む。The vending operation of the voice recognition system for a cup vending machine configured as described above will be explained using the flowchart shown in FIG. In FIG. 3, first, step 20
1, it is determined whether a coin has been inserted into the coin receiving means 7, and if a coin has been inserted, the process proceeds to step 202.In step 202, the voice guidance means 5 says "
"Welcome, what would you like?" and waits for the customer to say the flavor name. Step 2
In step 08, the standard pattern selection means A9 or the standard pattern selection means BIO selects the standard pattern closest to the input voice pattern from among the standard patterns stored in the standard pattern storage means 3 and creates a flavor name. recognize. In step 204, step 2
It is determined whether the recognition result in step 03 is appropriate or not, and if it is rejected, the process proceeds to step 205, where the voice guidance means 5 says "
Please answer again" and return to step 203. On the other hand, if the answer is not rejected, proceed to step 206.

ステップ２０６では、ステップ２０３で認識したフレー
バーにより以降の動作を分岐するものであるが、本実施
例においてはコーヒーを認識したものとし、他のフレー
バー名を認識した場合の動作についてはコーヒーの場合
と同様であるため説明を割愛する０次にステップ２０？
では、発声誘導手段５により”コーヒーですね”と確認
し、客の返答を待つ、そして、２０８で、フレーバー名
と同様の方法で、はいかいいえの返答を認識する。In step 206, the subsequent operation is branched depending on the flavor recognized in step 203, but in this embodiment, it is assumed that coffee has been recognized, and the operation when another flavor name is recognized is the same as in the case of coffee. Since it is similar, the explanation will be omitted. 0 Next step 20?
Then, the voice guidance means 5 confirms that "it's coffee," and waits for the customer's response.Then, in step 208, a yes or no response is recognized in the same manner as for the flavor name.

ステップ２０９では、ステップ２０８での認識結果が適
当か否かを判定し、リジェクトの場合はステップ２０７
へ戻り、そうでない場合はステップ２１０へ進む、ステ
ップ２１０では、ステップ２０８で認識した返答がはい
の場合はステップ２１１へ進み、いいえの場合はステッ
プ２０５へ戻る。In step 209, it is determined whether the recognition result in step 208 is appropriate or not, and in the case of rejection, step 207
Otherwise, the process proceeds to step 210. In step 210, if the answer recognized in step 208 is yes, the process proceeds to step 211, and if no, the process returns to step 205.

ステップ２１１では、制御手段６が、コーヒーを前記飲
料搬出手段８を使ってカップに注ぎ搬出する。そして、
ステップ２１２で、釣り銭がある場合は、コイン受取手
段７により釣り銭を払い戻し、最後に、ステップ２１３
で発声誘導手段５により”ありがとうございました”と
発声して一連の動作を終了する。In step 211, the control means 6 pours and transports coffee into a cup using the beverage transport means 8. and,
In step 212, if there is change, the change is refunded by the coin receiving means 7, and finally, in step 213
Then, the voice guidance means 5 utters "Thank you very much" and the series of operations ends.

以上のように本実施例によれば、複数の標準パターン選
出手段をもつことにより一つの標準パターン選出手段が
パターン選出処理期間に入ると、他の標準パターン選出
手段が音声分析結果取り込み期間に入るため、従来は音
声認識が不可能であった期間においても音声認識が可能
となり、例えば発声者がフレーバー選択時に迷っている
時に、′え−と、（フレーバー名）”と発声しても認識
できる確率が高くなる。また、複数の標準パターン選出
手段が選出した標準パターンの内、音声分析手段により
抽出した音声パターンに最も近い標準パターンを選択す
る。このため、音声認識システムの認識率が向上すると
共に、リジェクトの回数も減少し発声者がスムーズに対
話ができることとなるなどその効果は大である。As described above, according to this embodiment, by having a plurality of standard pattern selection means, when one standard pattern selection means enters the pattern selection processing period, the other standard pattern selection means enters the speech analysis result acquisition period. Therefore, voice recognition is now possible even during periods when voice recognition was previously impossible. For example, when a speaker is unsure about choosing a flavor, it is possible to recognize the voice by saying, ``Um, (flavor name).'' The probability increases. Also, among the standard patterns selected by the plurality of standard pattern selection means, the standard pattern closest to the speech pattern extracted by the speech analysis means is selected. Therefore, the recognition rate of the speech recognition system improves. At the same time, the number of rejections is reduced, and the speaker can have a smooth conversation, which is a great effect.

発明の効果以上のように本発明の音声認識システムは、複数の離散
単語音声の標準パターン群を記憶した標準パターン記憶
手段と、発声者の音声を分析し音声パターンを抽出する
音声分析手段と、前記音声分析手段により抽出した音声
パターンに最も近い標準パターンを前記標準パターン群
から選出する複数の標準パターン選出手段と、前記複数
の標準パターン選出手段中のいずれかの標準パターン選
出手段を順番に選択する選択手段と、選択された標準パ
ターン選出手段が選出した標準パターンの内、前記音声
分析手段により抽出した音声パターンに最も近い標準パ
ターンを選択する比較手段と、発声者に単語を発声する
ように誘導する発声誘導手段とを設けることにより、一
つの標準パターン選出手段がパターン選出処理期間にあ
る時は、他の標準パターン選出手段が音声分析結果取り
込み期間に入るため従来は音声認識が不可能であった期
間においても音声認識が可能となり、また、複数の標準
パターン選出手段が選出した標準パターンの内、音声分
析手段により抽出した音声パターンに最も近い標準パタ
ーンを比較手段により選択するため、認識率の高い音声
認識システムを実現することができることとなる。Effects of the Invention As described above, the speech recognition system of the present invention includes: a standard pattern storage means that stores a group of standard patterns of a plurality of discrete word sounds; a speech analysis means that analyzes a speaker's speech and extracts a speech pattern; a plurality of standard pattern selection means for selecting a standard pattern closest to the voice pattern extracted by the voice analysis means from the standard pattern group; and one of the standard pattern selection means among the plurality of standard pattern selection means is selected in order. a selection means for selecting a standard pattern that is closest to the speech pattern extracted by the speech analysis means from among the standard patterns selected by the selected standard pattern selection means; By providing a voice guidance means for guiding, when one standard pattern selection means is in the pattern selection processing period, the other standard pattern selection means enters the speech analysis result acquisition period, which conventionally made speech recognition impossible. Speech recognition is now possible even during a certain period of time, and since the comparison means selects the standard pattern closest to the speech pattern extracted by the speech analysis means from among the standard patterns selected by the plurality of standard pattern selection means, the recognition rate increases. This makes it possible to realize a speech recognition system with high performance.

[Brief explanation of drawings]

第１図は本発明の一実施例における音声認識システムの
機能ブロック図、第２図は本発明の実施例における音声
認識システムの音声認識可能期間の説明図、第３図は本
発明の実施例における音声認識システムの動作例を示す
フローチャート、第４図は従来の音声認識システムの機
能ブロック図、第５図は従来の音声認識システムの音声
認識可能期間の説明図である。２・・・音声分析手段、８・　・標準パターン記憶手段
、５・　・発声誘導手段、９・　・標準パターン選出手
段Ａ、１０・　・標準パターン選出手段Ｂ１１１・　・
選択手段、１２・　・比較手段。FIG. 1 is a functional block diagram of a speech recognition system according to an embodiment of the present invention, FIG. 2 is an explanatory diagram of the speech recognition possible period of the speech recognition system according to an embodiment of the present invention, and FIG. 3 is an embodiment of the present invention. FIG. 4 is a functional block diagram of the conventional speech recognition system, and FIG. 5 is an explanatory diagram of the speech recognition possible period of the conventional speech recognition system. 2. Voice analysis means, 8. Standard pattern storage means, 5. Vocal guidance means, 9. Standard pattern selection means A, 10. Standard pattern selection means B111.
Selection means, 12. - Comparison means.

Claims

[Claims]

a standard pattern storage means that stores a group of standard patterns of a plurality of discrete word sounds, a voice analysis means that analyzes the voice of a speaker and extracts a voice pattern, and a standard pattern that is closest to the voice pattern extracted by the voice analysis means. a plurality of standard pattern selection means for selecting from the standard pattern group; a selection means for sequentially selecting one of the standard pattern selection means from the plurality of standard pattern selection means; A speech recognition system comprising a comparing means for selecting, from standard patterns, a standard pattern closest to the speech pattern extracted by the speech analyzing means, and a speech guiding means for guiding a speaker to pronounce a word.