JP2002196786A

JP2002196786A - Speech recognition device

Info

Publication number: JP2002196786A
Application number: JP2000394845A
Authority: JP
Inventors: Koji Nagao; 浩治永尾; Yuji Kishimoto; 雄治岸本
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2000-12-26
Filing date: 2000-12-26
Publication date: 2002-07-12

Abstract

PROBLEM TO BE SOLVED: To provide a speech recognition device with high recognition rate. SOLUTION: In the speech recognition device which recognizes the voice of a user, the device has a learning means to perform learning on the basis of the voice input by the user, and a dictionary to be corrected for the specific speaker based on the learned result by the learning means, the learning means collates the inputted voice with the dictionary, and the learned result is returned to the initial value side on the basis of this collated result only by the prescribed amount.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は車載装置に関する
もので、特にその学習機能に係わるものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a vehicle-mounted device, and more particularly to a learning function of the device.

【０００２】[0002]

【従来の技術】近年、車両にはカーナビゲーションシス
テムに代表されるように様々な装置が搭載され、運転中
の運転者が、それらの装置を安全に操作するための入力
装置として音声認識装置が開発された。2. Description of the Related Art In recent years, various devices are mounted on a vehicle as typified by a car navigation system, and a voice recognition device is used as an input device for a driver during driving to operate the devices safely. It has been developed.

【０００３】このような従来の音声認識装置として、例
えば特開平６−１３０９８５号公報に示されるような装
置が知られている。特定話者用として用いられる従来の
音声認識装置は、不特定話者用に用意された辞書を特定
話者用に修正して認識率を向上するために、利用者の音
声を入力してこれを学習する必要がある。[0003] As such a conventional speech recognition apparatus, for example, an apparatus disclosed in Japanese Patent Application Laid-Open No. H6-130985 is known. Conventional speech recognition devices used for specific speakers input a user's voice in order to improve the recognition rate by modifying the dictionary prepared for unspecified speakers for specific speakers. Need to learn.

【０００４】例えば、図１は一般的な音声認識装置の構
成を示すブロック図であり、辞書１、学習部２、認識部
３、学習認識制御部４を備える。このような一般的な音
声認識装置は、まず学習認識制御部４を学習モードにし
ておき、利用者が例えば予め決められている複数の単語
を順次音声入力し、学習認識制御部４がこれを学習部２
に送出する。これにより学習部２は、辞書１を順次特定
話者用に修正する。このようにして学習終了後、学習認
識制御部４を認識モードにする。そして未知の入力音声
を認識部３に送出し、認識部３はこれを特定話者用に修
正された辞書を参照して認識結果を出力する。[0004] For example, FIG. 1 is a block diagram showing a configuration of a general speech recognition apparatus, and includes a dictionary 1, a learning unit 2, a recognition unit 3, and a learning recognition control unit 4. In such a general speech recognition device, first, the learning recognition control unit 4 is set to a learning mode, and a user sequentially inputs a plurality of predetermined words, for example, and the learning recognition control unit 4 outputs the words. Learning part 2
To send to. Thereby, the learning unit 2 sequentially corrects the dictionary 1 for a specific speaker. After the learning is completed, the learning recognition control unit 4 is set to the recognition mode. Then, the unknown input voice is sent to the recognition unit 3, and the recognition unit 3 outputs the recognition result by referring to the dictionary corrected for the specific speaker.

【０００５】また、学習モードだけでなく、通常使用時
にも同様の学習によって、常時少しずつ辞書を特定話者
用に修正していく学習を行う音声認識装置も知られてい
る。There is also known a speech recognition apparatus that performs learning in which the dictionary is constantly modified little by little by the same learning during normal use as well as in the learning mode.

【０００６】[0006]

【発明が解決しようとする課題】上記のような従来の音
声認識装置は、特定話者用として学習が終了した後は、
その特定話者用に修正された辞書を参照して入力音声を
認識するため、学習モードにて学習したときと同じ話し
方であれば、認識率は向上するが、逆に、少しでも話し
方を変わると、参照する辞書と異なるため入力音声の認
識率が低下するという問題があった。The above-mentioned conventional speech recognition apparatus, after learning for a specific speaker is completed,
Since the input speech is recognized by referring to the dictionary modified for the specific speaker, the recognition rate improves if the same speech style is used when learning in the learning mode, but on the contrary, the speech style changes a little. Therefore, there is a problem that the recognition rate of the input voice is reduced because the dictionary is different from the dictionary to be referred to.

【０００７】この発明は、このような問題を解決するた
めになされたものであり、認識率の高い音声認識装置を
提供することを目的とする。The present invention has been made to solve such a problem, and has as its object to provide a speech recognition device having a high recognition rate.

【０００８】[0008]

【課題を解決するための手段】この発明に係る音声認識
装置は、利用者の音声を認識する音声認識装置におい
て、利用者の音声入力に基づき学習を行う学習手段と、
この学習手段による学習結果に基づき特定話者用に修正
される辞書を備え、上記学習手段は入力された音声と上
記辞書とを照合し、この照合結果に基づいて上記学習結
果を所定量だけ初期値側に戻すものである。A speech recognition apparatus according to the present invention is a speech recognition apparatus for recognizing a user's voice, wherein learning means for learning based on a user's voice input;
A dictionary that is modified for a specific speaker based on the learning result of the learning means; the learning means collates the input voice with the dictionary; and initializes the learning result by a predetermined amount based on the collation result. It returns to the value side.

【０００９】また、学習手段は、音声の入力回数が所定
値以上で、かつ入力された音声と辞書とを照合して両者
の一致回数が所定値以下のとき、学習結果を所定量だけ
初期値側に戻すものである。The learning means compares the input voice with the dictionary and the dictionary with the input voice and the number of matches is equal to or less than a predetermined value. To return to the side.

【００１０】また、学習手段は、学習結果を初期値側に
戻す所定量を、音声入力回数と一致回数の割合に応じて
変更するものである。The learning means changes the predetermined amount for returning the learning result to the initial value side in accordance with the ratio between the number of times of voice input and the number of times of matching.

【００１１】さらに、学習手段は、学習結果に基づいて
学習進行度合を演算し、この学習進行度合を表示するも
のである。Further, the learning means calculates a learning progress degree based on the learning result and displays the learning progress degree.

【００１２】さらにまた、学習手段は、学習進行度合が
所定値以下まで下がった場合には、警報を行うものであ
る。Further, the learning means gives an alarm when the degree of progress of the learning falls below a predetermined value.

【００１３】[0013]

【発明の実施の形態】実施の形態１．以下、この発明の
一実施形態を説明する。構成は上述した一般的な音声認
識装置と同様であり、詳細な説明は省略する。この実施
の形態１の動作を図２のフローチャートに従って説明す
る。また、音声認識装置の学習処理については、一般的
に知られているものであり詳細な説明は省略する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiment 1 Hereinafter, an embodiment of the present invention will be described. The configuration is the same as that of the above-described general speech recognition device, and a detailed description is omitted. The operation of the first embodiment will be described with reference to the flowchart of FIG. Further, the learning process of the speech recognition device is generally known, and a detailed description thereof will be omitted.

【００１４】ここで、この図２の処理は、一定時間ごと
に実行されるものとする。まず、ステップＳ１１にてカ
ウンタｔに１加算する。このカウンタｔはこの処理の実
行回数、すなわち経過時間をカウントするものである。
ステップＳ１２にて入力音声の有無を判定し、入力音声
がなければ後述するステップＳ１７へ進む。一方、入力
音声がある場合はステップＳ１３へ進み、既に学習処理
によって得られた学習結果に基づいて特定話者用に修正
された辞書１の内容と入力された音声とを比較し、ステ
ップＳ１４にて両者が一致するか否かを判定する。ここ
で一致とは、入力された音声と辞書の内容が完全に一致
するか否かにより判定してもよく、また両者の一致度合
が所定以上であれば一致と判定してもよい。Here, it is assumed that the process of FIG. 2 is executed at regular intervals. First, 1 is added to the counter t in step S11. The counter t counts the number of executions of this process, that is, the elapsed time.
In step S12, the presence or absence of an input voice is determined. If there is no input voice, the process proceeds to step S17 described later. On the other hand, if there is an input voice, the process proceeds to step S13, where the contents of the dictionary 1 corrected for the specific speaker based on the learning result already obtained by the learning process are compared with the input voice, and the process proceeds to step S14. To determine whether they match. Here, the match may be determined based on whether or not the input voice and the contents of the dictionary completely match, or may be determined to match if the degree of matching between the two is equal to or greater than a predetermined value.

【００１５】ステップＳ１４の判定にて、両者が一致し
ていればステップＳ１５にてカウンタＣに１加算し、両
者が一致していなければステップＳ１６にてカウンタＣ
から１減算する。このステップＳ１４、Ｓ１５、Ｓ１６
の処理は、入力音声と辞書の内容が一致した回数を計数
するものである。If it is determined in step S14 that they match, one is added to the counter C in step S15, and if they do not match, the counter C is added in step S16.
Is subtracted by 1. Steps S14, S15, S16
Is to count the number of times the input voice matches the contents of the dictionary.

【００１６】次に、ステップＳ１７にてカウンタｔが所
定値以上か否か、すなわち所定時間経過しているか否か
を判定し、経過していなければ処理を終了し、経過して
いれば、ステップＳ１８へ進む。ステップＳ１８ではカ
ウンタＣが所定値以下であるか否かを判定し、カウンタ
Ｃが所定値以下であれば、辞書の内容と入力音声が一致
する回数が少なく、利用者の話し方がばらついていると
判断してステップＳ１９へ進み、学習結果を所定量だけ
初期値側に戻す処理を実行する。一方、ステップＳ１８
にてカウンタＣが所定値以上であれば、辞書の内容と入
力音声が一致する回数が多く、利用者の話し方が一定し
ていると判断して学習結果を初期側へ戻す処理は実行し
ない。いずれの場合も最後にステップＳ２０へ進み、カ
ウンタｔ、カウンタＣをリセットして処理を終了する。Next, in step S17, it is determined whether or not the counter t is equal to or more than a predetermined value, that is, whether or not a predetermined time has elapsed. If not, the process is terminated. Proceed to S18. In step S18, it is determined whether or not the counter C is equal to or less than a predetermined value. The process proceeds to step S19, in which a process of returning the learning result to the initial value side by a predetermined amount is executed. On the other hand, step S18
If the counter C is equal to or more than the predetermined value, the number of times that the contents of the dictionary match the input voice is large, and it is determined that the user's speaking style is constant, and the process of returning the learning result to the initial side is not executed. In any case, the process finally proceeds to step S20, where the counter t and the counter C are reset and the process ends.

【００１７】以上のような処理を実行することによっ
て、所定時間内に辞書の内容と一致する音声を所定回数
以上入力しなければ、学習結果が初期値側へ戻っていく
こととなるため、利用者は学習結果が消去されることを
防ごうとする意識が働き、意識的に一定した話し方にて
音声を入力するので、音声認識装置としては入力音声が
一定するため認識率が向上する。By executing the above-mentioned processing, the learning result returns to the initial value side unless a voice that matches the contents of the dictionary is input a predetermined number of times or more within a predetermined time. Since the consciousness of the user works to prevent the learning result from being erased and the voice is input in a consciously constant manner of speech, the input rate of the voice recognition device is constant, so that the recognition rate is improved.

【００１８】実施の形態２．次に、この発明の他の実施
形態について説明する。上記実施の形態１では、所定時
間内に、辞書の内容と適合する音声を所定回数以上入力
しなければ、学習結果が初期値側へ戻っていくため、無
入力の状態が長時間経過する場合にも同様に学習結果が
初期値側へ戻っていく。所定入力回数毎に辞書の内容と
一致する音声の入力回数を判定して学習結果を初期値側
へ戻す処理を行うようにすることによって、これを防止
することができる。Embodiment 2 FIG. Next, another embodiment of the present invention will be described. In the first embodiment, the learning result returns to the initial value side unless a voice matching the contents of the dictionary is input a predetermined number of times or more within a predetermined time. Similarly, the learning result returns to the initial value side. This can be prevented by performing the process of determining the number of times of inputting the voice that matches the contents of the dictionary for each predetermined number of inputs and returning the learning result to the initial value side.

【００１９】具体的には、図３のフローチャートに示す
ように、まず、ステップＳ２１にて入力音声を待ち受
け、入力音声があると、ステップＳ２２へ進み、カウン
タｎに１加算する。このカウンタｎは音声が入力された
回数をカウントするものである。次にステップＳ２３へ
進み、ステップＳ２４にて両者が一致するか否かを判定
する。ここで一致とは、上記実施の形態１と同様に、入
力された音声と辞書の内容が完全に一致するか否かによ
り判定してもよく、また両者の一致度合が所定以上であ
れば一致と判定してもよい。More specifically, as shown in the flowchart of FIG. 3, first, in step S21, an input voice is awaited. If there is an input voice, the process proceeds to step S22, and 1 is added to the counter n. This counter n counts the number of times that a voice is input. Next, the process proceeds to step S23, and in step S24, it is determined whether or not both match. Here, the match may be determined based on whether or not the input speech and the contents of the dictionary completely match, as in the first embodiment. May be determined.

【００２０】ステップＳ２４の判定にて、両者が一致し
ていればステップＳ２５にてカウンタＣに１加算し、両
者が一致していなければステップＳ２６にてカウンタＣ
から１減算する。このステップＳ２４、Ｓ２５、Ｓ２６
の処理は、入力音声と辞書の内容が一致した回数を計数
するものである。If it is determined in step S24 that they match, one is added to the counter C in step S25, and if they do not match, the counter C is added in step S26.
Is subtracted by 1. Steps S24, S25, S26
Is to count the number of times the input voice matches the contents of the dictionary.

【００２１】次に、ステップＳ２７にてカウンタｎが所
定値以上か否か、すなわち音声が所定回数以上入力され
たか否かを判定し、入力されていなければステップＳ２
１へ戻り、次の入力音声を待ち受ける。一方、所定回数
以上入力されていれば、ステップＳ２８へ進む。ステッ
プＳ２８ではカウンタＣが所定値以下であるか否かを判
定し、カウンタＣが所定値以下であれば、辞書の内容と
入力音声が一致する回数が少なく、利用者の話し方がば
らついていると判断してステップＳ２９へ進み、学習結
果を所定量だけ初期値側に戻す処理を実行する。一方、
ステップＳ２８にてカウンタＣが所定値以上であれば、
辞書の内容と入力音声が一致する回数が多く、利用者の
話し方が一定していると判断して学習結果を初期側へ戻
す処理は実行しない。いずれの場合も最後にステップＳ
３０へ進み、カウンタｎ、カウンタＣをリセットしてス
テップ２１へ戻り次の入力音声を待ち受ける。Next, in step S27, it is determined whether or not the counter n is equal to or more than a predetermined value, that is, whether or not the voice has been input a predetermined number of times.
The process returns to 1 and waits for the next input voice. On the other hand, if the number has been input more than the predetermined number, the process proceeds to step S28. In step S28, it is determined whether or not the value of the counter C is equal to or less than a predetermined value. The process proceeds to step S29, in which a process of returning the learning result to the initial value side by a predetermined amount is executed. on the other hand,
If the counter C is equal to or more than the predetermined value in step S28,
The process of returning the learning result to the initial side by judging that the contents of the dictionary match the input voice many times and that the user's speech style is constant is not executed. In any case, the last step S
Proceeding to 30, the counter n and counter C are reset, and the process returns to step 21 to wait for the next input voice.

【００２２】以上のような処理を実行することによっ
て、実際に音声が所定回数入力されたときに、そのうち
辞書の内容と一致する音声が所定回数以上入力されてい
なければ、学習結果が初期値側へ戻っていくこととなる
ため、利用者は学習結果が消去されることを防ごうとす
る意識が働き、意識的に一定した話し方にて音声を入力
するので、音声認識装置としては入力音声が一定するた
め認識率が向上する。また、無入力状態が長時間継続す
るような場合に学習結果が初期値側へ戻っていくことを
防止することができ、実際に入力された音声と辞書との
不一致回数、すなわち、利用者の話し方のばらつきに応
じて学習内容を初期値側へ戻すことができる。By executing the above-described processing, when a voice is actually input a predetermined number of times, if a voice that matches the contents of the dictionary is not input a predetermined number of times or more, the learning result is reduced to the initial value. Because the user is conscious of trying to prevent the learning result from being erased and inputs speech in a consciously constant manner of speech, the input speech is used as a speech recognition device. Since it is constant, the recognition rate is improved. Further, it is possible to prevent the learning result from returning to the initial value side when the non-input state continues for a long time, and the number of mismatches between the actually input voice and the dictionary, that is, the user's The learning content can be returned to the initial value side according to the variation in the way of speaking.

【００２３】以上説明した実施の形態１および２ではい
ずれも、利用者の話し方がばらつく場合には所定量だけ
学習結果を初期値側へ戻すように処理を行ったが、この
所定量を話し方のばらつき度合に応じて変更するように
してもよい。すなわち、上記実施の形態２において、音
声が入力された回数ｎと入力された音声と辞書の内容が
一致した回数Ｃの割合に応じて所定量を変更して学習結
果を初期値側へ戻すようにしてもよい。これによって、
話し方を一定させようとする利用者の意識がより一層強
く働き、より認識率を向上することができる。In each of Embodiments 1 and 2 described above, when the user's way of speaking varies, the processing is performed so that the learning result is returned to the initial value side by a predetermined amount. It may be changed according to the degree of variation. That is, in the second embodiment, the learning result is returned to the initial value side by changing the predetermined amount according to the ratio of the number n of times the voice is input and the number C of times when the input voice matches the contents of the dictionary. It may be. by this,
The consciousness of the user who tries to keep the speech style constant works more strongly, and the recognition rate can be further improved.

【００２４】また、学習の進行度合（後退度合）を、イ
ンパネやカーナビゲーション用モニタなどに表示するよ
うに構成してもよい。これによって、学習結果が初期値
側に戻っていくことを利用者が確認でき、話し方を一定
させようとする利用者の意識がより一層強く働き、より
認識率を向上することができる。The degree of progress of learning (degree of retreat) may be displayed on an instrument panel or a car navigation monitor. As a result, the user can confirm that the learning result returns to the initial value side, and the user's consciousness for stabilizing the way of speaking works more strongly, and the recognition rate can be further improved.

【００２５】さらに、学習の進行度合（後退度合）が所
定値以下となった場合には、利用者に対して警報を行う
ようにしてもよい。これによって、学習結果がある程度
初期値側へ戻ってしまった場合に、利用者に再度学習処
理が必要であることを認識させることができ、認識率が
低下したまま装置を使用しつづけることなく、再度学習
処理を行い、認識率を向上させて使用することができ
る。Furthermore, when the degree of progress of learning (degree of retreat) becomes equal to or less than a predetermined value, a warning may be issued to the user. Thereby, when the learning result has returned to the initial value to some extent, it is possible to make the user recognize that the learning process is necessary again, and without continuing to use the device while the recognition rate is lowered, The learning process can be performed again to improve the recognition rate before use.

【００２６】[0026]

【発明の効果】この発明に係る音声認識装置は、利用者
の音声を認識する音声認識装置において、利用者の音声
入力に基づき学習を行う学習手段と、この学習手段によ
る学習結果に基づき特定話者用に修正される辞書を備
え、上記学習手段は入力された音声と上記辞書とを照合
し、この照合結果に基づいて上記学習結果を所定量だけ
初期値側に戻すものであり、学習結果が初期値側へ戻っ
ていくこととなるため、利用者は学習結果が消去される
ことを防ごうとする意識が働き、意識的に一定した話し
方にて音声を入力するので、音声認識装置としては入力
音声が一定するため認識率が向上する。The speech recognition apparatus according to the present invention is a speech recognition apparatus for recognizing a user's voice, comprising: learning means for learning based on a user's voice input; and a specific speech based on a learning result by the learning means. The learning means compares input speech with the dictionary, and returns the learning result to the initial value side by a predetermined amount based on the result of the comparison. Will return to the initial value side, and the user will work to prevent the learning result from being erased, and will input speech in a consciously constant manner of speech. Since the input voice is constant, the recognition rate is improved.

【００２７】また、学習手段は、音声の入力回数が所定
値以上で、かつ入力された音声と辞書とを照合して両者
の一致回数が所定値以下のとき、学習結果を所定量だけ
初期値側に戻すものであり、実際に音声が所定回数入力
されたときに、そのうち辞書の内容と一致する音声が所
定回数以上入力されていなければ、学習結果が初期値側
へ戻っていくこととなるため、利用者は学習結果が消去
されることを防ごうとする意識が働き、意識的に一定し
た話し方にて音声を入力するので、音声認識装置として
は入力音声が一定するため認識率が向上する。また、無
入力状態が長時間継続するような場合に学習結果が初期
値側へ戻っていくことを防止することができ、実際に入
力された音声と辞書との不一致回数、すなわち、利用者
の話し方のばらつきに応じて学習内容を初期値側へ戻す
ことができる。Further, the learning means compares the input voice with the dictionary and compares the input voice with the dictionary and determines whether the number of matches between the voice and the dictionary is equal to or less than a predetermined value. When the voice is actually input a predetermined number of times, if the voice that matches the contents of the dictionary has not been input more than a predetermined number of times, the learning result will return to the initial value side Therefore, the user works to prevent the learning result from being erased, and inputs the voice in a consciously constant manner of speech.As a speech recognition device, the input voice is constant and the recognition rate is improved. I do. Further, it is possible to prevent the learning result from returning to the initial value side when the non-input state continues for a long time, and the number of mismatches between the actually input voice and the dictionary, that is, the user's The learning content can be returned to the initial value side according to the variation in the way of speaking.

【００２８】また、学習手段は、学習結果を初期値側に
戻す所定量を、音声入力回数と一致回数の割合に応じて
変更するものであり、話し方を一定させようとする利用
者の意識がより一層強く働き、より認識率を向上するこ
とができる。The learning means changes the predetermined amount for returning the learning result to the initial value side in accordance with the ratio between the number of times of voice input and the number of times of matching. It works more strongly and can improve the recognition rate.

【００２９】さらに、学習手段は、学習結果に基づいて
学習進行度合を演算し、この学習進行度合を表示するも
のであり、学習結果が初期値側に戻っていくことを利用
者が確認でき、話し方を一定させようとする利用者の意
識がより一層強く働き、より認識率を向上することがで
きる。Further, the learning means calculates a learning progress degree based on the learning result and displays the learning progress degree. The user can confirm that the learning result returns to the initial value side. The consciousness of the user who tries to keep the speech style constant works more strongly, and the recognition rate can be further improved.

【００３０】さらにまた、学習手段は、学習進行度合が
所定値以下まで下がった場合には、警報を行うものであ
り、学習結果がある程度初期値側へ戻ってしまった場合
に、利用者に再度学習処理が必要であることを認識させ
ることができ、認識率が低下したまま装置を使用しつづ
けることなく、再度学習処理を行い、認識率を向上させ
て使用することができる。Further, the learning means gives an alarm when the learning progress degree falls below a predetermined value. When the learning result has returned to the initial value side to some extent, the user is asked again. It is possible to recognize that the learning process is necessary, and it is possible to perform the learning process again without continuing to use the device while the recognition rate is lowered, and to use the device with an improved recognition rate.

[Brief description of the drawings]

【図１】一般的な音声認識装置の構成を示すブロック
図である。FIG. 1 is a block diagram illustrating a configuration of a general voice recognition device.

【図２】実施の形態１の動作を示すフローチャートで
ある。FIG. 2 is a flowchart showing the operation of the first embodiment.

【図３】実施の形態２の動作を示すフローチャートで
ある。FIG. 3 is a flowchart showing an operation of the second embodiment.

[Explanation of symbols]

１辞書２学習部３認識部４学習認識制御部 Reference Signs List 1 dictionary 2 learning unit 3 recognition unit 4 learning recognition control unit

Claims

[Claims]

1. A speech recognition apparatus for recognizing a user's voice, comprising: a learning unit for learning based on a user's voice input; and a dictionary modified for a specific speaker based on a learning result by the learning unit. A speech recognition apparatus, wherein the learning unit collates the input speech with the dictionary and returns the learning result to an initial value by a predetermined amount based on the collation result.

2. The learning means according to claim 1, wherein the number of times of voice input is equal to or more than a predetermined value, and the input voice is matched with the dictionary and the number of matches between the two is equal to or less than a predetermined value. The voice recognition device according to claim 1, wherein the voice recognition device is returned to a side.

3. The speech recognition apparatus according to claim 2, wherein the learning means changes a predetermined amount for returning the learning result to the initial value side in accordance with a ratio between the number of times of voice input and the number of times of matching.

4. The speech recognition apparatus according to claim 1, wherein the learning means calculates a learning progress degree based on a learning result and displays the learning progress degree.

5. The speech recognition apparatus according to claim 4, wherein the learning means issues an alarm when the learning progress degree falls below a predetermined value.