JP2002333898A

JP2002333898A - Sound-recognizing system for electronic pet

Info

Publication number: JP2002333898A
Application number: JP2001136758A
Authority: JP
Inventors: Yutaka Saito; 裕斉藤; Seiichi Ito; 成一伊藤
Original assignee: VIVARIUM Inc
Current assignee: VIVARIUM Inc
Priority date: 2001-05-07
Filing date: 2001-05-07
Publication date: 2002-11-22

Abstract

PROBLEM TO BE SOLVED: To perform efficient sound recognition without misrecognitions especially by comparing the difference in the recognition rates of respective candidates. SOLUTION: In the sound recognizing system for electronic pet, using an electronic pet such as human faced fish 'Seaman', in terminal equipment for storing the program of the electronic pet, in the case of conversation between the 'Seaman' and an operator, the voice of the answer of the operator is recognized and a plurality of candidates having different recognition rates are presented. When there is a difference of recognition rates more than a prescribed value between first and second candidates, the first candidate is defined as being recognized result. When there is no difference of recognition rates more than the prescribed value between the first and second candidates, confirmation processing is applied on the first candidate, the misrecognition is eliminated, and efficient sound recognition is enabled.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、電子ペットの音声
認識システム、及びその方法に関する。[0001] 1. Field of the Invention [0002] The present invention relates to an electronic pet voice recognition system and method.

【０００２】[0002]

【従来の技術】近年のエレクトロニクス技術の進歩に伴
い、操作者の個性に関する情報を知識として蓄積し、該
蓄積した各種データに基づいてその操作者に合致した対
応を実現する電子ペットプログラムがある。例えば、水
槽の中を泳ぎ回る電子ペットと会話し、操作者とコニュ
ケーションをとるシステムなどである。2. Description of the Related Art With the recent advance in electronics technology, there is an electronic pet program that accumulates information regarding the personality of an operator as knowledge and realizes a response suitable for the operator based on the accumulated various data. For example, there is a system that talks with an electronic pet swimming in a water tank and communicates with an operator.

【０００３】[0003]

【発明が解決しようとする課題】このような電子ペット
とのコミニュケーションシステムにおいて、電子ペット
とユーザとの会話によってコミニュケーションが進行す
る。この際、音声認識エンジンや音声合成エンジンを使
用して電子ペット及びユーザの音声認識、及び音声合成
が行われる。In such a communication system with an electronic pet, communication proceeds by conversation between the electronic pet and a user. At this time, voice recognition and voice synthesis of the electronic pet and the user are performed using a voice recognition engine and a voice synthesis engine.

【０００４】しかしながら、従来の音声認識において
は、必ずしも確実な認識を行っているとは言えなかっ
た。例えば、人面魚「シーマン」の質問に対する回答を
認識する場合、最も認識率の高い認識結果を回答として
いた。このため、それほど認識率自体が高くない場合で
も回答であると判断し、誤認識を行う危険があった。However, in the conventional speech recognition, it cannot be said that reliable recognition is always performed. For example, when recognizing the answer to the question of the mermaid "Seaman", the recognition result with the highest recognition rate was used as the answer. For this reason, even when the recognition rate itself is not so high, it is determined that the answer is given, and there is a risk of erroneous recognition.

【０００５】また、必ず認識結果をユーザに確認する処
理を含めるシステムもある。しかし、この場合には極め
て効率の悪い音声認識方法となる。本発明は上記課題に
鑑み、認識率の差を比較することによってより正確な音
声認識を行うことを目的とするものである。[0005] Some systems always include a process for confirming the recognition result to the user. However, this is a very inefficient speech recognition method. The present invention has been made in view of the above problems, and has as its object to perform more accurate voice recognition by comparing differences in recognition rates.

【０００６】[0006]

【課題を解決するための手段】上記課題は請求項１記載
の発明によれば、複数の端末機器と、ネットワーク回線
網を介して接続されデータ通信可能に接続されたサーバ
コンピュータとを備えた電子ペットの音声認識システム
であって、電子ペットの質問に対して認識率の異なる複
数の候補を出力する音声認識手段と、該音声認識によっ
て得られる第１の候補と第２の候補間の認識率の差を比
較する比較手段と、該比較手段による比較結果が所定値
を越えるとき、前記第１の候補を前記音声認識の結果と
認定する認定手段と、前記比較手段による比較結果が所
定値を越えないとき、前記第１の候補に対する確認処理
を行う確認手段とを有する電子ペットの音声認識システ
ムを提供することによって達成できる。According to the first aspect of the present invention, there is provided an electronic apparatus comprising: a plurality of terminal devices; and a server computer connected via a network and connected to enable data communication. A voice recognition system for a pet, comprising: voice recognition means for outputting a plurality of candidates having different recognition rates for an electronic pet question; and a recognition rate between a first candidate and a second candidate obtained by the voice recognition. Comparing means for comparing the difference between the first candidate and the speech recognition result when the comparison result by the comparing means exceeds a predetermined value. If not, it can be achieved by providing a voice recognition system for an electronic pet having a confirmation means for performing a confirmation process on the first candidate.

【０００７】このように構成することによって、第１候
補と第２候補に大きな認識率の差があるとき、第１候補
を正しい回答であると判断して音声認識し、効率良い音
声認識処理を行うものである。With this configuration, when there is a large difference between the first candidate and the second candidate in the recognition rate, the first candidate is determined to be a correct answer and speech recognition is performed, and efficient speech recognition processing is performed. Is what you do.

【０００８】請求項２の記載は、上記請求項１記載の発
明において、前記第１の候補に対する確認処理におい
て、肯定的な回答を受けた場合、該第１候補を前記音声
認識の結果と認定する構成である。According to a second aspect of the present invention, in the invention according to the first aspect, when a positive answer is received in the confirmation processing for the first candidate, the first candidate is recognized as the result of the speech recognition. It is a configuration to do.

【０００９】このように構成することによって、前記第
１候補を音声認識の結果と判断することができ、誤認識
を防止することができる。請求項３の記載は、上記請求
項１記載の発明において、前記第１の候補に対する確認
処理において、否定的な応答を受けた場合、更に前記第
２候補に対する確認処理を行う構成である。With this configuration, the first candidate can be determined as a result of speech recognition, and erroneous recognition can be prevented. According to a third aspect of the present invention, in the invention according to the first aspect, in the confirmation processing for the first candidate, if a negative response is received, the confirmation processing for the second candidate is further performed.

【００１０】このように構成することによって、第２候
補以下についても効率良く、且つ誤認識を行うことな
く、音声認識処理を行うことが可能となる。請求項４の
記載は、上記請求項３の記載において、前記第２の候補
に対する確認処理において、肯定的な応答を受けた場
合、該第２の候補を前記音声認識の結果と認定する構成
である。[0010] With this configuration, it is possible to efficiently perform the speech recognition processing for the second candidate and the subsequent candidates without erroneous recognition. According to a fourth aspect of the present invention, in the configuration of the third aspect, when a positive response is received in the confirmation processing for the second candidate, the second candidate is recognized as the result of the speech recognition. is there.

【００１１】このように構成することによっても、第２
候補以下についても効率良く、且つ誤認識を行うことな
く、音声認識処理を行うことが可能となる。請求項５の
記載は、上記請求項３の記載において、前記第２の候補
に対する確認処理において、否定的な応答を受けた場
合、第３の候補に対する確認処理を行う構成である。With this configuration, the second
Speech recognition processing can be performed efficiently for candidates and below without erroneous recognition. According to a fifth aspect of the present invention, in the third aspect, when a negative response is received in the confirmation processing for the second candidate, the confirmation processing for the third candidate is performed.

【００１２】請求項６の記載は、上記請求項１乃至５の
記載において、前記第１の候補の認識率は一定値以上で
ある。このように構成することにより、例えば第１の候
補が低い認識率であり、しかも第２の候補との間に所定
値以上の認識率の差がある場合、低い認識率の第１候補
が回答であると判断されることを防止する。According to a sixth aspect of the present invention, in the first to fifth aspects, the recognition rate of the first candidate is a certain value or more. With such a configuration, for example, when the first candidate has a low recognition rate and a difference between the second candidate and the second candidate is equal to or more than a predetermined value, the first candidate having the low recognition rate Is prevented from being determined.

【００１３】上記課題は請求項７記載の発明によれば、
複数の端末機器と、ネットワーク回線網を介して接続さ
れデータ通信可能に接続されたサーバコンピュータとを
備えた電子ペットの音声認識方法であって、電子ペット
の質問に対する回答を音声認識する処理と、該音声認識
によって複数の認識候補が得られた場合、第１の候補と
第２の候補間の認識率の差を比較する比較処理と、該処
理による比較結果が所定値を越えるとき、前記第１の候
補を前記音声認識の結果と認定する認定処理と、前記処
理による比較結果が所定値を越えないとき、前記第１の
候補に対する確認処理を行う確認処理とを行う電子ペッ
トの音声認識方法を提供することによって達成できる。[0013] The above object is attained according to a seventh aspect of the present invention.
A plurality of terminal devices, a voice recognition method of an electronic pet including a server computer connected via a network network and communicably connected, the processing for voice recognition of the answer to the question of the electronic pet, When a plurality of recognition candidates are obtained by the voice recognition, a comparison process of comparing the difference in the recognition rate between the first candidate and the second candidate, and when the comparison result by the process exceeds a predetermined value, A voice recognition method for an electronic pet, comprising: a certification process for recognizing a first candidate as a result of the voice recognition; and a confirmation process for performing a confirmation process on the first candidate when a comparison result by the process does not exceed a predetermined value. Can be achieved by providing

【００１４】本発明は方法の発明であり、このように構
成することによっても、第１候補と第２候補に大きな認
識率の差があるとき、第１候補を正しい回答であると判
断し、音声認識することができる。The present invention is an invention of a method. With such a configuration, when there is a large difference between the first candidate and the second candidate in the recognition rate, the first candidate is determined to be a correct answer, Can recognize voice.

【００１５】請求項８の記載は、上記請求項７記載の発
明において、前記第１の候補の認識率は一定値以上であ
る。上記課題は請求項９記載の発明によれば、複数の端
末機器と、ネットワーク回線網を介して接続されデータ
通信可能に接続されたサーバコンピュータとを備えた電
子ペットの音声認識プログラムであって、電子ペットの
質問に対する回答を音声認識する機能と、該音声認識に
よって複数の認識候補が得られた場合、第１の候補と第
２の候補間の認識率の差を比較する比較機能と、該比較
機能による比較結果が所定値を越えるとき、前記第１の
候補を前記音声認識の結果と認定する認定機能と、前記
機能による比較結果が所定値を越えないとき、前記第１
の候補に対する確認処理を行う確認機能とを有する電子
ペットの音声認識プログラムを提供することによって達
成できる。According to an eighth aspect of the present invention, in the above-mentioned seventh aspect, the recognition rate of the first candidate is equal to or more than a predetermined value. According to the ninth aspect of the present invention, there is provided a voice recognition program for an electronic pet, comprising: a plurality of terminal devices; and a server computer connected via a network and connected to enable data communication. A function of recognizing the answer to the question of the electronic pet by voice, and a comparing function of comparing a difference in recognition rate between the first candidate and the second candidate when a plurality of recognition candidates are obtained by the voice recognition; A certifying function for certifying the first candidate as a result of the speech recognition when the comparison result by the comparison function exceeds a predetermined value; and a certifying function when the comparison result by the function does not exceed a predetermined value.
This can be achieved by providing a voice recognition program for an electronic pet having a confirmation function of performing a confirmation process for a candidate.

【００１６】本発明はプログラムの発明であり、このよ
うに構成することによっても、第１候補と第２候補に大
きな認識率の差があるとき、第１候補を正しい回答であ
ると判断し、音声認識することができる。The present invention is an invention of a program. With this configuration, when there is a large difference in recognition rate between the first candidate and the second candidate, the first candidate is determined to be a correct answer, Can recognize voice.

【００１７】請求項１０の記載は、上記請求項９記載の
発明において、前記第１の候補の認識率は一定値以上で
ある。According to a tenth aspect, in the invention of the ninth aspect, the recognition rate of the first candidate is equal to or more than a predetermined value.

【００１８】[0018]

【発明の実施の形態】以下、本発明の実施形態を図面に
基づいて説明する。図１は、本発明の電子ペットを利用
する音声認識システムのシステム構成図である。本例の
システムは、同図に示すように、利用者が所有する複数
のプラットホームであるパーソナルコンピュータ１ａ、
及び１ｂと、コンピュータゲーム機６と、携帯電話７
と、電子ペットである人面魚「シーマン」の提供サービ
スを行うサービス事業者が所有するサーバコンピュータ
３とから構成されている。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a system configuration diagram of a voice recognition system using an electronic pet according to the present invention. As shown in FIG. 1, the system of the present embodiment includes a plurality of personal computers 1a,
And 1b, a computer game machine 6, and a mobile phone 7
And a server computer 3 owned by a service provider that provides a service for providing a mermaid "Seaman" as an electronic pet.

【００１９】上記パーソナルコンピュータ１ａ、１ｂ、
並びにゲーム機６は各通信装置２ａ、２ｂ、及び５を介
してそれぞれインターネット２１に接続され、また携帯
電話７は図示しない中継局や中央制御局を介して、イン
ターネット２１に接続されている。The personal computers 1a, 1b,
The game machine 6 is connected to the Internet 21 via each of the communication devices 2a, 2b, and 5, and the mobile phone 7 is connected to the Internet 21 via a not-shown relay station or central control station.

【００２０】また、インターネット２１に接続されたパ
ーソナルコンピュータは１ａ、１ｂで示すが、上記２台
のコンピュータ以外に、多数のコンピュータがインター
ネット２１に接続されている。尚、パーソナルコンピュ
ータ１ａは、例えばＯＳとしてウインドウズを使用し、
パーソナルコンピュータ１ｂは、ＯＳとして例えばマッ
キントッシュを使用する。Although personal computers connected to the Internet 21 are indicated by 1a and 1b, many computers are connected to the Internet 21 in addition to the above two computers. The personal computer 1a uses, for example, Windows as the OS,
The personal computer 1b uses, for example, a Macintosh as the OS.

【００２１】先ず、本例において受信者並びに発信者が
使用するパ−ソナルコンピュータ１の構成を図２に示
す。尚、パーソナルコンピュータ１の構成説明におい
て、代表してパーソナルコンピュータ１ａの例で説明す
る。First, the configuration of the personal computer 1 used by the receiver and the sender in this embodiment is shown in FIG. In the description of the configuration of the personal computer 1, the personal computer 1a will be representatively described.

【００２２】図２に示すように、パーソナルコンピュー
タ１ａの内部には、データの送受を行うデータバス１０
が配設され、このデータバス１０に中央処理装置（以
下、ＣＰＵで示す）１１や、ＲＡＭ１２、リアルタイム
クロック（ＲＴＣ）１７等が接続されている。As shown in FIG. 2, a data bus 10 for transmitting and receiving data is provided inside the personal computer 1a.
The data bus 10 is connected to a central processing unit (hereinafter, referred to as a CPU) 11, a RAM 12, a real-time clock (RTC) 17, and the like.

【００２３】ＣＰＵ１１は後述する利用者への質問の出
題や回答の受付を音声合成により表示装置に表示される
電子ペットの言葉として出力する出題処理や、利用者の
回答を音声認識して予め登録されている複数の回答候補
から該当する項目を選出登録して利用者の個性情報とし
て蓄積する回答受付処理等を実施する。この処理の際、
ワークエリアとしてＲＡＭ１２を使用する。The CPU 11 outputs a question to the user and accepts an answer, which will be described later, as words of an electronic pet displayed on the display device by speech synthesis, and recognizes and pre-registers the answer of the user by voice recognition. A response receiving process for selecting and registering a corresponding item from a plurality of answer candidates set as above and storing the selected item as personality information of the user is performed. During this process,
The RAM 12 is used as a work area.

【００２４】また、リアルタイムクロック（ＲＴＣ）１
７は、例えばデータ更新の日時情報に使用される現在の
時刻情報や、任意の年月日の曜日等のカレンダー情報を
出力する。また、入力装置１６はキーボードやマウス等
であり、キーボードやマウス等の操作情報をＣＰＵ１１
に通知する。A real-time clock (RTC) 1
Reference numeral 7 outputs, for example, current time information used for date and time information of data update, and calendar information such as an arbitrary day of the week. The input device 16 is a keyboard, a mouse, and the like.
Notify.

【００２５】表示装置１４はＣＲＴ又はＬＣＤ等のディ
スプレイであり、後述するシーマンの表示を行う。ま
た、通信インターフェイス１３は前述の通信装置２（２
ａ）に接続され、通信装置２（２ａ）及びインターネッ
ト２１を介してサーバコンピュータ３との間でデータの
送受信を行う。The display device 14 is a display such as a CRT or an LCD, and displays a Seaman described later. The communication interface 13 is connected to the communication device 2 (2
a) and transmits / receives data to / from the server computer 3 via the communication device 2 (2a) and the Internet 21.

【００２６】また、音声入出力装置１８には外部機器で
あるスピーカ１９やマイク２０に接続され、Ａ／Ｄ・Ｄ
／Ａコンバータを有する。音声入出力装置１８は、後述
する音声合成プログラムにより生成された音声データを
アナログの音声と変換（Ｄ／Ａ変換）し、上記スピーカ
１９に出力すると共に、上記マイク２０から入力された
音声をデジタルデータに変換（Ａ／Ｄ変換）して出力す
る。The audio input / output device 18 is connected to a speaker 19 and a microphone 20 as external devices, and A / D / D
/ A converter. The voice input / output device 18 converts (D / A converts) voice data generated by a voice synthesis program to be described later into analog voice, outputs the voice data to the speaker 19, and converts the voice input from the microphone 20 into a digital voice. The data is converted (A / D converted) and output.

【００２７】記憶装置１５は、磁気ディスクや光磁気デ
ィスクから成り、上記ＣＰＵ１１の制御に従ってデータ
やプログラムの書き込み、読み出し処理が行われる。こ
の記憶装置１５には前述の表示装置１４に表示される電
子ペットである人面魚「シーマン」の画像や動作処理等
が記述された電子ペットプログラムや、前記人面魚「シ
ーマン」の声をテキストデータに基づいて音声出力する
ための音声合成プログラムや、発信者或いは受信者の入
力音声をテキストデータに変換するための音声認識プロ
グラム等が記憶されている。The storage device 15 is composed of a magnetic disk or a magneto-optical disk. Data and programs are written and read under the control of the CPU 11. The storage device 15 stores an electronic pet program in which an image and an operation process of a mermaid fish "Seaman", which is an electronic pet displayed on the display device 14, and a voice of the mermaid fish "Seaman" are stored. A speech synthesis program for outputting speech based on text data, a speech recognition program for converting input speech of a sender or a recipient into text data, and the like are stored.

【００２８】また、記憶装置１５には人面魚「シーマ
ン」がアプリケーションにおいて使用し、理解する単語
群が「辞書」として登録されている。図３はこの辞書の
構成を示す図であり、カテゴリー毎に登録されている。In the storage device 15, a group of words used and understood by the mermaid "Seaman" in the application is registered as a "dictionary". FIG. 3 is a diagram showing the structure of this dictionary, which is registered for each category.

【００２９】例えば、項目番号１には人面魚「シーマ
ン」の質問「おまえは男」、及び当該質問に対する回答
ａ、ｂが登録されている。また、項目番号２には人面魚
「シーマン」の質問「おまえの歳は」、及び当該質問に
対する回答ａ、ｂ・・・が登録されている。以下、同図
に示す通りであり、人面魚「シーマン」の質問、及びそ
の回答が複数単語辞書として登録されている。そして、
後述する認識処理の際、ユーザの回答に対して各単語毎
に比較処理を行い、各単語毎に認識率を出力する。For example, in item number 1, the question "You are a man" of the mermaid "Seaman" and the answers a and b to the question are registered. In item number 2, a question "Your age" of the mermaid "Seaman" and answers a, b,... To the question are registered. Hereinafter, as shown in the figure, the question of the mermaid "Seaman" and its answer are registered as a multi-word dictionary. And
At the time of a recognition process described later, a comparison process is performed for each word for the user's answer, and a recognition rate is output for each word.

【００３０】上記構成の音声認識システムにおいて、以
下に処理動作を説明する。図４は本例の処理動作を説明
するフローチャートである。先ず、人面魚「シーマンか
らの質問が行われる（ステップ（以下、ＳＴで示す）１
がＹＥＳ）。この質問は前述のデータベースに記憶され
たデータ順に行われ、この質問は上記パーソナルコンピ
ュータ１ａのスピーカ１９ａから流れる（ＳＴ２）。例
えば、人面魚「シーマン」が行う質問が、前述の図３に
示す項目番号２の場合、「おまえの歳は」の質問であ
る。この質問はスピーカ１９ａから流れ、ユーザは質問
を理解する。The processing operation of the above-structured speech recognition system will be described below. FIG. 4 is a flowchart illustrating the processing operation of this example. First, a question is asked from the mermaid "Seaman" (step (hereinafter referred to as ST) 1
Is YES). This question is made in the order of the data stored in the above-mentioned database, and this question flows from the speaker 19a of the personal computer 1a (ST2). For example, if the question asked by the mermaid "Seaman" is the item number 2 shown in FIG. This question flows from the speaker 19a, and the user understands the question.

【００３１】次に、ユーザは上記質問に答えて、マイク
２０ａに向かって回答を行う（ＳＴ３がＹＥＳ）。ＣＰ
Ｕ１１は上記回答から音声認識を行い、複数の候補を選
択する（ＳＴ４）。例えば、ユーザの回答が「２７歳」
である場合、図５に示す候補が選択される。すなわち、
この場合の回答は上記項目番号２に対応する回答ａ、
ｂ、ｃ、・・・の中から選択され、各単語に対する音声
比較、例えば積分値の比較等から認識率の高い順に出力
される。例えば、図５に示すように、第１候補として
「じゅうななさい」、第２候補として「にじゅうななさ
い」、第３候補として「ごじゅうななさい」、・・・が
出力される。Next, the user answers the above-mentioned question and answers the microphone 20a (YES in ST3). CP
U11 performs voice recognition from the above answer and selects a plurality of candidates (ST4). For example, the user's answer is “27 years old”
, The candidate shown in FIG. 5 is selected. That is,
The answer in this case is answer a corresponding to item number 2 above,
are selected from b, c,..., and are output in descending order of recognition rate based on a speech comparison for each word, for example, a comparison of integration values. For example, as shown in FIG. 5, "Juneha" is output as a first candidate, "Junea" as a second candidate, "Junea" as a third candidate, and so on.

【００３２】また、同時に各候補の認識率も表示され
る。例えば、第１候補が６０％であり、第２候補が５５
％であり、第３候補が４５％である。次に、ＣＰＵ１１
は上記第１候補と第２候補の認識率の差がｎポイント以
下か判断する（ＳＴ５）。このｎポイントは予め設定さ
れており、例えば経験上誤認識を起こさない値である。
ここで、例えば上記ｎポイントが「２０」に設定されて
いれば、上記図５の例ではＹＥＳである。At the same time, the recognition rate of each candidate is also displayed. For example, the first candidate is 60% and the second candidate is 55%.
%, And the third candidate is 45%. Next, the CPU 11
Determines whether the difference between the recognition rates of the first candidate and the second candidate is n points or less (ST5). The n points are set in advance, and are values that do not cause erroneous recognition, for example, through experience.
Here, for example, if the n point is set to “20”, the result is YES in the example of FIG.

【００３３】このように、第１候補と第２候補の認識率
の差がｎポイント以下であれば（ＳＴ５がＹＥＳ）、第
１候補をユーザに提示する（ＳＴ６）。すなわち、この
場合認識結果が確実ではないので、ユーザに確認を促
す。例えば、上記例では「今、１７歳って言った」とい
う確認を促す（図６参照）。一方、前述の判断（ＳＴ
５）において、第１候補と第２候補との差がｎポイント
以上であれば（ＳＴ５がＮＯ）、第１候補に確定し、回
答取得を行うと共に、上記確認処理を行うことなく、例
えば人面魚「シーマン」はスピーカ１９ａから「１７歳
か」と言う（図７参照）。As described above, if the difference between the recognition rates of the first candidate and the second candidate is n points or less (YES in ST5), the first candidate is presented to the user (ST6). That is, in this case, since the recognition result is not reliable, the user is urged to confirm. For example, in the above example, the user is prompted to confirm that he is now 17 years old (see FIG. 6). On the other hand, the aforementioned judgment (ST
In 5), if the difference between the first candidate and the second candidate is n points or more (NO in ST5), the first candidate is determined, the answer is obtained, and the above-described confirmation processing is not performed. The face fish “Seaman” says “17 years old” from the speaker 19a (see FIG. 7).

【００３４】次に、上記第１候補の確認処理（ＳＴ６）
を行い、スピーカ１９ａからの報音と図６に示す表示を
行った結果、ユーザから肯定的な回答があれば、この場
合にも第１候補を回答として確定する（ＳＴ８がＹＥ
Ｓ、ＳＴ９）。例えば、ユーザがマイク２０ａに向かっ
て「はい」、「うん」、「そう」等の答えを返してきた
場合、第１候補を回答として確定する。Next, the first candidate confirmation processing (ST6)
As a result of performing the notification from the speaker 19a and the display shown in FIG. 6, if there is a positive answer from the user, also in this case, the first candidate is determined as the answer (ST8 is YE).
S, ST9). For example, when the user returns an answer such as “Yes”, “Yes”, “Yes”, etc., toward the microphone 20a, the first candidate is determined as the answer.

【００３５】一方、ユーザが「ちがう」、「いいえ」等
の否定的な回答を行った場合、人面魚「シーマン」は次
の候補があるか判断し、第２候補がある場合、第２候補
を提示する（ＳＴ１０がＹＥＳ、ＳＴ１１）。例えば、
図５に示す例の場合、第２候補である「にじゅうななさ
い」の提示を行う。すなわち、人面魚「シーマン」は
「じゃ、２７歳って言った」という質問をする。On the other hand, if the user makes a negative answer such as "No" or "No", the mermaid "Seaman" determines whether there is a next candidate, and if there is a second candidate, The candidates are presented (YES in ST10, ST11). For example,
In the case of the example shown in FIG. 5, the second candidate “Ninare” is presented. In other words, the mermaid "Seaman" asks a question, "Well, I said 27 years old."

【００３６】以下、同様に処理を行い、第２候補以降の
提示に対して肯定的な回答があれば、当該回答を回答取
得とする。例えば、第２候補の２７歳の提示に対してユ
ーザから肯定的な答えがあれば、ユーザの年齢は２７歳
であると分かり、第３候補の５７歳の提示に対して肯定
的な答えがあれば、ユーザの年齢は５７歳であると分か
る。Thereafter, the same processing is performed, and if there is a positive answer to the presentation of the second and subsequent candidates, the answer is regarded as answer acquisition. For example, if the user has a positive answer to the second candidate presentation at the age of 27, the user is known to be 27 years old, and a positive answer is to the third candidate presentation at the age of 57. If so, it can be understood that the age of the user is 57 years old.

【００３７】以上の処理を繰り返すことによって、人面
魚「シーマン」はユーザの正確な年齢を知ることができ
る。尚、全ての候補がユーザによって否定された場合、
最初の質問に会話を戻す。By repeating the above processing, the mermaid "Seaman" can know the correct age of the user. If all candidates are denied by the user,
Return the conversation to the first question.

【００３８】以上のように制御することによって、人面
魚「シーマン」の質問に対して音声認識の認識率を出力
し、該認識率に基づいて音声認識処理を進め、ストレス
のないユーザとの対話を実現するものである。By performing the above control, the recognition rate of speech recognition is output in response to the question of the mermaid "Seaman", and the speech recognition process is performed based on the recognition rate, and the user can be stress-free. A dialogue is realized.

【００３９】尚、人面魚「シーマン」の質問はユーザに
対する年齢に対する質問であったが、年齢以外の質問で
あっても同様の処理によって、ユーザからの正確な情報
を得ることができる。Although the question about the mermaid "Seaman" was a question about the age for the user, accurate information from the user can be obtained by the same processing for a question other than the age.

【００４０】また、第１候補の認識率が一定値以下の場
合、例え第１候補と第２候補間に所定値以上の認識率の
差があったとしても、音声認識結果とすることなく、か
かる場合には両候補について確認処理を行う等の手続き
を行い、誤認識のないシステムとする。When the recognition rate of the first candidate is equal to or less than a predetermined value, even if there is a difference in the recognition rate between the first candidate and the second candidate equal to or more than a predetermined value, the recognition result is not obtained as a speech recognition result. In such a case, a procedure such as performing a confirmation process for both candidates is performed to provide a system that does not cause erroneous recognition.

【００４１】尚、上記処理において、使用する端末機器
はパーソナルコンピュータ１ａであったが、他のパーソ
ナルコンピュータ１ｂであってもよく、又はゲーム機６
や携帯電話７であっても良い。In the above processing, the terminal device used is the personal computer 1a, but may be another personal computer 1b or the game machine 6b.
Or the mobile phone 7.

【００４２】また、パーソナルコンピュータ等はサーバ
に接続されていない状態であってもよく、対応する機能
を回路やプログラムによって単独に保有する構成であれ
ばよい。The personal computer or the like may not be connected to the server, and may have a configuration in which the corresponding function is independently held by a circuit or a program.

【００４３】[0043]

【発明の効果】以上説明したように、本発明の電子ペッ
トの音声認識システムによれば、誤認識を無くすと共
に、効率良い音声認識を可能とするものである。As described above, according to the electronic pet voice recognition system of the present invention, erroneous recognition is eliminated and efficient voice recognition is enabled.

【００４４】また、第１候補の認識率が一定値以下の場
合、両候補について確認処理を行う等の処理により、よ
り正確な音声認識システムとすることができる。Further, when the recognition rate of the first candidate is equal to or less than a certain value, a more accurate voice recognition system can be realized by performing processing such as confirmation processing for both candidates.

[Brief description of the drawings]

【図１】本発明の電子ペットの音声認識システムのシス
テム図である。FIG. 1 is a system diagram of an electronic pet voice recognition system of the present invention.

【図２】本発明の実施例において用いた利用者が所有す
るコンピュータをす示すブロック図である。FIG. 2 is a block diagram showing a computer owned by a user used in the embodiment of the present invention.

【図３】記憶装置に登録されたデータベースの構成を示
す図である。FIG. 3 is a diagram showing a configuration of a database registered in a storage device.

【図４】本例における処理動作を説明するフローチャー
トである。FIG. 4 is a flowchart illustrating a processing operation in the present example.

【図５】第１候補、第２候補、第３候補の認識率を示す
図である。FIG. 5 is a diagram showing recognition rates of a first candidate, a second candidate, and a third candidate.

【図６】人面魚「シーマン」の発音状態を表示する図で
ある。FIG. 6 is a diagram showing a sounding state of a mermaid “Seaman”.

【図７】人面魚「シーマン」の発音状態を表示する図で
ある。FIG. 7 is a diagram showing a sounding state of a mermaid “Seaman”;

[Explanation of symbols]

１パーソナルコンピュータ（利用者）２通信装置３サーバコンピュータ４通信装置５通信装置６コンピュータゲーム機７携帯電話８テレビ９マイク１０データバス１１中央精算処理装置（ＣＰＵ）１２ＲＡＭ１３通信インターフェイス１４表示装置１５記憶装置１９スピーカ２０マイク２１インターネット網 DESCRIPTION OF SYMBOLS 1 Personal computer (user) 2 Communication device 3 Server computer 4 Communication device 5 Communication device 6 Computer game machine 7 Cellular phone 8 Television 9 Microphone 10 Data bus 11 Central payment processing device (CPU) 12 RAM 13 Communication interface 14 Display device 15 Storage device 19 Speaker 20 Microphone 21 Internet network

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 3/00 ５７１Ｕ５６１ＥＦターム(参考） 2C001 AA00 AA11 BA00 BA06 BA07 CA00 CA07 CB01 CB04 CB08 CC01 CC08 5D015 KK02 LL04 LL05 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 3/00 571U 561E F-term (Reference) 2C001 AA00 AA11 BA00 BA06 BA07 CA00 CA07 CB01 CB04 CB08 CC01 CC08 5D015 KK02 LL04 LL05

Claims

[Claims]

1. An electronic pet voice recognition system comprising a plurality of terminal devices and a server computer connected via a network and connected to enable data communication, wherein the voice recognition system recognizes a question from the electronic pet. Speech recognition means for outputting a plurality of candidates having different rates, comparison means for comparing the difference in recognition rate between a first candidate and a second candidate obtained by the speech recognition, and a comparison result by the comparison means being a predetermined value When the value exceeds the value, the certifying means for certifying the first candidate as the result of the speech recognition, and when the comparison result by the comparing means does not exceed a predetermined value,
And a confirmation means for performing a confirmation process on the first candidate.

2. The electronic pet according to claim 1, wherein in the confirmation processing for the first candidate, when a positive response is received, the second candidate is recognized as a result of the voice recognition. Voice recognition system.

3. The voice recognition system for an electronic pet according to claim 1, wherein in the confirmation processing for the first candidate, if a negative response is received, the confirmation processing for the second candidate is further performed. .

4. The electronic pet according to claim 3, wherein in the confirmation processing for the second candidate, if a positive response is received, the second candidate is recognized as the result of the voice recognition. Voice recognition system.

5. The voice recognition system for an electronic pet according to claim 3, wherein a confirmation process is performed on the third candidate when a negative response is received in the confirmation process on the second candidate.

6. The voice recognition system for an electronic pet according to claim 1, wherein the recognition rate of the first candidate is equal to or higher than a predetermined value.

7. A method for recognizing a voice of an electronic pet, comprising a plurality of terminal devices and a server computer connected via a network and connected to enable data communication, wherein a response to an electronic pet question is voiced. Recognition processing; and when a plurality of recognition candidates are obtained by the voice recognition, a comparison processing of comparing a difference in recognition rate between the first candidate and the second candidate; When exceeding, the first
A certification process for recognizing the candidate as a result of the speech recognition; and a confirmation process for performing a confirmation process on the first candidate when a comparison result by the process does not exceed a predetermined value. Pet voice recognition method.

8. The method according to claim 7, wherein the recognition rate of the first candidate is equal to or higher than a predetermined value.

9. A voice recognition program for an electronic pet, comprising: a plurality of terminal devices; and a server computer connected via a network and connected to enable data communication, wherein a voice response to a question of the electronic pet is provided. A recognition function; a plurality of recognition candidates obtained by the voice recognition; a comparison function of comparing a difference in recognition rate between the first candidate and the second candidate; and a comparison result obtained by the comparison function being a predetermined value. A certification function for certifying the first candidate as a result of the speech recognition when the number exceeds the threshold, and a confirmation function for performing a confirmation process on the first candidate when a comparison result by the function does not exceed a predetermined value. A voice recognition program for an electronic pet, comprising:

10. The computer-readable storage medium according to claim 9, wherein a recognition rate of the first candidate is equal to or greater than a predetermined value.