JP2001216131A

JP2001216131A - Information processor, its method and program storage medium

Info

Publication number: JP2001216131A
Application number: JP2000027889A
Authority: JP
Inventors: Satoshi Fujimura; 聡藤村; Yasuhiko Kato; 靖彦加藤; Shuji Yonekura; 修二米倉; Takashi Sasai; 崇司笹井
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2000-02-04
Filing date: 2000-02-04
Publication date: 2001-08-10

Abstract

PROBLEM TO BE SOLVED: To allow a user to recognize which application receives a voice recognition result to be sent. SOLUTION: When an application for voice recognition is started and the display of a window for the application is instructed as minimum display, a small display window 241 for the application corresponds to voice recognition and is displayed in the vicinity of an activated electronic pet window 191. The window 241 is displayed only during the execution of voice recognition.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は情報処理装置および
方法、並びにプログラム格納媒体に関し、特に、音声認
識を行う装置に用いて好適な情報処理装置および方法、
並びにプログラム格納媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information processing apparatus and method, and a program storage medium, and more particularly, to an information processing apparatus and method suitable for an apparatus for performing voice recognition.
And a program storage medium.

【０００２】[0002]

【従来の技術】音声を認識し、認識した音声に対応し
て、所定の処理を実行するパーソナルコンピュータなど
の情報処理装置が普及しつつある。例えば、パーソナル
コンピュータにおいて、音声認識が実行される場合、音
声認識用アプリケーションが起動され、その起動された
ことを示すウィンドウなどがディスプレイ表示される。
そのウィンドウには、使用者が音声として発話した言葉
を認識した結果などが表示される。2. Description of the Related Art Information processing apparatuses, such as personal computers, which recognize voice and execute predetermined processing in response to the recognized voice, are becoming widespread. For example, when voice recognition is executed in a personal computer, a voice recognition application is activated, and a window or the like indicating that the application is activated is displayed on a display.
The window displays the result of recognition of words spoken by the user as speech.

【０００３】[0003]

【発明が解決しようとする課題】上述した音声認識が実
行されている状態において表示されるウィンドは、小型
表示され、タスクバーなどに格納される場合がある。そ
のような小型表示され、かつ、複数の音声認識対応のア
プリケーションが実行されている場合、使用者は、入力
した音声の認識結果が、どのアプリケーションに対して
送られたのかがわかりづらいといった課題があった。A window displayed while the above-described speech recognition is being executed is sometimes displayed in a small size and stored in a task bar or the like. When such a small-sized display and a plurality of applications that support voice recognition are executed, the user has a problem that it is difficult to recognize to which application the recognition result of the input voice was sent. there were.

【０００４】また、音声認識をさせる場合、使用者は、
予め決められたコマンドを発話する必要がある。そのコ
マンドは、アプリケーションにより異なり、また、その
数も多いため、使用者は覚えきれず、使いこなせないた
め、使い勝手が悪くなるといった課題があった。[0004] In the case of voice recognition, the user must
It is necessary to speak a predetermined command. The commands differ depending on the application and the number of commands is large, so that the user cannot remember them and cannot use them easily, so that there is a problem that usability is deteriorated.

【０００５】本発明はこのような状況に鑑みてなされた
ものであり、ウィンドウが小型表示されている時には、
音声の認識結果が送られるアプリケーションの近傍にウ
ィンドウを表示し、また、コマンドの一覧を音声により
呼び出せるようにすることにより、使用者にとって使い
勝手の良い音声認識を実現させることを目的とする。The present invention has been made in view of such circumstances, and when a window is displayed in a small size,
It is an object of the present invention to realize a user-friendly voice recognition by displaying a window near an application to which a voice recognition result is sent and enabling a list of commands to be called up by voice.

【０００６】[0006]

【課題を解決するための手段】請求項１に記載の情報処
理装置は、音声を認識する状態が指示されているか否か
を判断する第１の判断手段と、第１の判断手段により音
声を認識する状態が指示されていると判断された場合、
音声認識の結果に対応して所定の処理を実行するプログ
ラムが起動され、かつ、アクティブな状態になっている
か否かを判断する第２の判断手段と、第２の判断手段に
より音声認識の結果に対応して所定の処理を実行するプ
ログラムが起動され、かつ、アクティブな状態になって
いると判断された場合、音声認識が指示されている状態
を示す第１のウィンドウを、プログラムに対応するる第
２のウィンドウの近傍、または、重なる位置に表示され
るように表示を制御する第１の表示制御手段と、第２の
判断手段により音声認識の結果に対応して所定の処理を
実行するプログラムは起動されていない、または、起動
されてはいるがアクティブな状態ではないと判断された
場合、第１のウィンドウが予め定められた所定の位置に
表示されるように表示を制御する第２の表示制御手段と
を含むことを特徴とする。According to a first aspect of the present invention, there is provided an information processing apparatus comprising: first determining means for determining whether a voice recognition state is instructed; If it is determined that the recognition state is indicated,
A second determining means for determining whether or not a program for executing a predetermined process according to the result of the voice recognition is activated and in an active state; and a result of the voice recognition by the second determining means. When it is determined that the program for executing the predetermined process is activated and is in the active state in response to the above, the first window indicating the state in which the voice recognition is instructed is set to correspond to the program. First display control means for controlling display so as to be displayed near or overlapping the second window, and predetermined processing is executed by the second determination means in accordance with the result of voice recognition. If it is determined that the program has not been started or has been started but is not active, the first window is displayed at a predetermined position. Characterized in that it comprises a second display control means for controlling indicates.

【０００７】音声認識した結果が、所定の処理を実行さ
せるためのコマンドの一覧の表示を指示するものであっ
た場合、コマンドの一覧が表示されるように表示を制御
する第３の表示制御手段をさらに含むようにすることも
できる。If the result of the speech recognition indicates that a command list for executing a predetermined process is to be displayed, a third display control means for controlling the display so that the command list is displayed. May be further included.

【０００８】請求項３に記載の情報処理方法は、音声を
認識する状態が指示されているか否かを判断する第１の
判断ステップと、第１の判断ステップの処理で音声を認
識する状態が指示されていると判断された場合、音声認
識の結果に対応して所定の処理を実行するプログラムが
起動され、かつ、アクティブな状態になっているか否か
を判断する第２の判断ステップと、第２の判断ステップ
の処理で音声認識の結果に対応して所定の処理を実行す
るプログラムが起動され、かつ、アクティブな状態にな
っていると判断された場合、音声認識が指示されている
状態を示す第１のウィンドウを、プログラムに対応する
第２のウィンドウの近傍、または、重なる位置に表示さ
れるように表示を制御する第１の表示制御ステップと、
第２の判断ステップの処理で音声認識の結果に対応して
所定の処理を実行するプログラムは起動されていない、
または、起動されてはいるがアクティブな状態ではない
と判断された場合、第１のウィンドウが予め定められた
所定の位置に表示されるように表示を制御する第２の表
示制御ステップとを含むことを特徴とする。According to a third aspect of the present invention, there is provided an information processing method comprising: a first determining step of determining whether a state of recognizing a voice is instructed; If it is determined that the instruction has been given, a second determination step of determining whether or not a program for executing a predetermined process according to the result of the voice recognition is activated and in an active state; If a program for executing a predetermined process corresponding to the result of speech recognition is started in the process of the second determination step and it is determined that the program is in an active state, a state in which speech recognition is instructed A first display control step of controlling the display so that a first window indicating the above is displayed near or overlapping with the second window corresponding to the program;
In the processing of the second determination step, the program for executing the predetermined processing corresponding to the result of the voice recognition has not been started,
Or a second display control step of controlling display so that the first window is displayed at a predetermined position when it is determined that the first window is activated but not in an active state. It is characterized by the following.

【０００９】請求項４に記載のプログラム格納媒体のプ
ログラムは、音声を認識する状態が指示されているか否
かを判断する第１の判断ステップと、第１の判断ステッ
プの処理で音声を認識する状態が指示されていると判断
された場合、音声認識の結果に対応して所定の処理を実
行するプログラムが起動され、かつ、アクティブな状態
になっているか否かを判断する第２の判断ステップと、
第２の判断ステップの処理で音声認識の結果に対応して
所定の処理を実行するプログラムが起動され、かつ、ア
クティブな状態になっていると判断された場合、音声認
識が指示されている状態を示す第１のウィンドウを、プ
ログラムに対応する第２のウィンドウの近傍、または、
重なる位置に表示されるように表示を制御する第１の表
示制御ステップと、第２の判断ステップの処理で音声認
識の結果に対応して所定の処理を実行するプログラムは
起動されていない、または、起動されてはいるがアクテ
ィブな状態ではないと判断された場合、第１のウィンド
ウが予め定められた所定の位置に表示されるように表示
を制御する第２の表示制御ステップとを含むことを特徴
とする。According to a fourth aspect of the present invention, a program in a program storage medium recognizes a voice by a first determining step of determining whether a voice recognition state is instructed, and a process of the first determining step. If it is determined that the state has been instructed, a second determination step of activating a program for executing a predetermined process in accordance with the result of voice recognition and determining whether the state is active When,
If a program for executing a predetermined process corresponding to the result of speech recognition is started in the process of the second determination step and it is determined that the program is in an active state, a state in which speech recognition is instructed Is displayed in the vicinity of the second window corresponding to the program, or
A first display control step of controlling display so as to be displayed at an overlapping position and a program for executing predetermined processing corresponding to a result of voice recognition in the processing of the second determination step have not been started, or A second display control step of controlling the display so that the first window is displayed at a predetermined position when it is determined that the first window is activated but not active. It is characterized by.

【００１０】請求項１に記載の情報処理装置、請求項３
に記載の情報処理方法、および請求項４に記載のプログ
ラム格納媒体においては、音声認識の結果に対応して所
定の処理を実行するプログラムが起動され、かつ、アク
ティブな状態になっていると判断された場合、音声認識
が指示されている状態を示す第１のウィンドウが、プロ
グラムに対応する表示される第２のウィンドウの近傍、
または、重なる位置に表示されるように表示が制御さ
れ、音声認識の結果に対応して所定の処理を実行するプ
ログラムは起動されていない、または、起動されてはい
るがアクティブな状態ではないと判断された場合、第１
のウィンドウが予め定められた所定の位置に表示される
ように表示が制御される。[0010] The information processing apparatus according to claim 1, claim 3,
In the information processing method described in the above, and the program storage medium described in the claim 4, it is determined that the program for executing the predetermined processing in response to the result of the voice recognition is activated and is in an active state. In this case, the first window indicating the state where the voice recognition is instructed is located near the second window displayed corresponding to the program,
Or, the display is controlled to be displayed at an overlapping position, and a program for executing a predetermined process corresponding to a result of voice recognition is not activated, or is activated but is not in an active state. If determined, the first
The display is controlled so that this window is displayed at a predetermined position.

【００１１】[0011]

【発明の実施の形態】以下、本発明に係る情報処理装置
の一実施の形態を図面を参照して説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of an information processing apparatus according to the present invention will be described below with reference to the drawings.

【００１２】図１乃至図６は、本発明を適用した携帯型
パーソナルコンピュータの構成例を表している。このパ
ーソナルコンピュータ１は、ミニノート型のパーソナル
コンピュータとされ、基本的に、本体２と、本体２に対
して開閉自在とされている表示部３により構成されてい
る。図１は、表示部３を本体２に対して開いた状態を示
す外観斜視図、図２は、図１の平面図、図３は、表示部
３を本体２に対して閉塞した状態を示す左側側面図、図
４は、表示部３を本体２に対して１８０度開いた状態を
示す右側側面図、図５は、図３の正面図、図６は、図４
の底面図である。1 to 6 show examples of the configuration of a portable personal computer to which the present invention is applied. The personal computer 1 is a mini-notebook type personal computer, and basically includes a main body 2 and a display unit 3 which can be opened and closed with respect to the main body 2. 1 is an external perspective view showing a state in which the display unit 3 is opened with respect to the main body 2, FIG. 2 is a plan view of FIG. 1, and FIG. 3 shows a state in which the display unit 3 is closed with respect to the main body 2. 4 is a right side view showing the display unit 3 opened 180 degrees with respect to the main body 2, FIG. 5 is a front view of FIG. 3, and FIG.
FIG.

【００１３】本体２には、各種の文字や記号などを入力
するとき操作されるキーボード４、マウスカーソルを移
動させるときなどに操作されるスティック式ポインティ
ングデバイス５が、その上面に設けられている。また、
本体２の上面には、音を出力するスピーカ８と、表示部
３に設けられているCCDビデオカメラ２３で撮像すると
き操作されるシャッタボタン１０がさらに設けられてい
る。The main body 2 is provided with a keyboard 4 operated when inputting various characters and symbols, and a stick-type pointing device 5 operated when moving a mouse cursor or the like. Also,
On the upper surface of the main body 2, there are further provided a speaker 8 for outputting sound and a shutter button 10 operated when capturing an image with the CCD video camera 23 provided on the display unit 3.

【００１４】表示部３の上端部には、ツメ１３が設けら
れており、図３に示すように、表示部３を本体２に対し
て閉塞した状態において、ツメ１３に対向する位置にお
ける本体２には、ツメ１３が嵌合する孔部６が設けられ
ている。本体２の前面には、スライドレバー７が前面に
平行に移動可能に設けられており、スライドレバー７は
孔部６に嵌合したツメ１３と係合してロックし、またロ
ック解除することができるようになっている。ロックを
解除することにより、表示部３を本体２に対して回動す
ることができる。ツメ１３の隣りには、マイクロホン２
４が取り付けられている。このマイクロホン２４は、図
６にも示すように、背面からの音も収音できるようにな
されている。A claw 13 is provided at an upper end portion of the display unit 3. As shown in FIG. 3, when the display unit 3 is closed with respect to the main body 2, Is provided with a hole 6 into which the claw 13 is fitted. A slide lever 7 is provided on the front surface of the main body 2 so as to be movable in parallel with the front surface. The slide lever 7 engages with a claw 13 fitted in the hole 6 to lock and unlock. I can do it. By releasing the lock, the display unit 3 can be rotated with respect to the main body 2. Microphone 2 next to claw 13
4 is attached. As shown in FIG. 6, the microphone 24 can collect sound from the back.

【００１５】本体２の正面にはまた、プログラマブルパ
ワーキー（PPK）９が設けられている。本体２の右側面
には、図４に示すように、排気孔１１が設けられてお
り、本体２の前面下部には、図５に示すように、吸気孔
１４が設けられている。さらに、排気孔１１の右側に
は、PCMCIA（Personal Computer Memory Card Internat
ional Association）カード（ＰＣカード）を挿入する
ためのスロット１２が設けられている。A programmable power key (PPK) 9 is also provided on the front of the main body 2. As shown in FIG. 4, an exhaust hole 11 is provided on the right side surface of the main body 2, and an intake hole 14 is provided at a lower part of the front surface of the main body 2 as shown in FIG. 5. Further, a PCMCIA (Personal Computer Memory Card Internat) is provided on the right side of the exhaust hole 11.
A slot 12 for inserting an ional association) card (PC card) is provided.

【００１６】表示部３の正面には、画像を表示するLCD
（Liquid Crystal Display）２１が設けられており、そ
の上端部には、撮像部２２が、表示部３に対して回動自
在に設けられている。すなわち、この撮像部２２は、LC
D２１と同一の方向と、その逆の方向（背面の方向）と
の間の１８０度の範囲の任意の位置に回動することがで
きるようになされている。撮像部２２には、CCDビデオ
カメラ２３が取り付けられている。An LCD for displaying an image is provided on the front of the display unit 3.
(Liquid Crystal Display) 21 is provided, and an imaging unit 22 is provided at the upper end thereof so as to be rotatable with respect to the display unit 3. That is, this imaging unit 22
It can rotate to any position within a range of 180 degrees between the same direction as D21 and the opposite direction (backward direction). A CCD video camera 23 is attached to the imaging unit 22.

【００１７】表示部３の下側の本体側には、電源ランプ
PL、電池ランプBL、メッセージランプML、その他のLED
よりなるランプが設けられている。なお、図３に示す符
号４０は、本体２の左側面に設けられた電源スイッチで
あり、図５に示す符号２５は、CCDビデオカメラ２３の
フォーカスを調整する調整リングである。さらに、図６
に示す符号２６は、本体２内に増設メモリを取り付ける
ための開口部を被覆する蓋であり、符号４１は、蓋２６
のロックツメを外すためのピンを挿入する小孔である。A power lamp is provided on the lower body side of the display unit 3.
PL, battery lamp BL, message lamp ML, other LEDs
Is provided. Reference numeral 40 shown in FIG. 3 is a power switch provided on the left side surface of the main body 2, and reference numeral 25 shown in FIG. 5 is an adjustment ring for adjusting the focus of the CCD video camera 23. Further, FIG.
Reference numeral 26 denotes a lid for covering an opening for mounting an additional memory in the main body 2, and reference numeral 41 denotes a lid 26.
This is a small hole for inserting a pin for removing the lock claw.

【００１８】図７は、パーソナルコンピュータ１の内部
の構成を表している。内部バス５１には、図７に示すよ
うに、CPU（Central Processing Unit）５２、必要に応
じて挿入されるＰＣカード５３、RAM（Random Access M
emory）５４、およびグラフィックチップ８１が接続さ
れている。この内部バス５１は、外部バス５５に接続さ
れており、外部バス５５には、ハードディスクドライブ
（HDD）５６、Ｉ／Ｏ（入出力）コントローラ５７、キ
ーボードコントローラ５８、スティック式ポインティン
グデバイスコントローラ５９、サウンドチップ６０、LC
Dコントローラ８３、モデム５０などが接続されてい
る。FIG. 7 shows the internal configuration of the personal computer 1. As shown in FIG. 7, a CPU (Central Processing Unit) 52, a PC card 53 inserted as needed, and a RAM (Random Access M
emory) 54 and a graphic chip 81 are connected. The internal bus 51 is connected to an external bus 55. The external bus 55 includes a hard disk drive (HDD) 56, an I / O (input / output) controller 57, a keyboard controller 58, a stick pointing device controller 59, and a sound. Chip 60, LC
The D controller 83, the modem 50 and the like are connected.

【００１９】CPU５２は、各機能を統括するコントロー
ラであり、ＰＣカード５３は、オプションの機能を付加
するとき適宜装着される。The CPU 52 is a controller that controls each function, and the PC card 53 is appropriately mounted when an optional function is added.

【００２０】RAM５４の中には、起動が完了した時点に
おいて、電子メールプログラム（アプリケーションプロ
グラム）５４Ａ、オートパイロットプログラム（アプリ
ケーションプログラム）５４Ｂ、そしてＯＳ（基本プロ
グラム）５４Ｃが、HDD５６から転送され、記憶され
る。In the RAM 54, an e-mail program (application program) 54A, an auto-pilot program (application program) 54B, and an OS (basic program) 54C are transferred from the HDD 56 and stored at the time of completion of the activation. You.

【００２１】電子メールプログラム５４Ａは、電話回線
のような通信回線などからネットワーク経由で通信文を
授受するプログラムである。電子メールプログラム５４
Ａは、特定機能としての着信メール取得機能を有してい
る。この着信メール取得機能は、メールサーバ９３に対
して、そのメールボックス９３Ａ内に自分（利用者）宛
のメールが着信しているかどうかを確認して、自分宛の
メールがあれば取得する処理を実行する。The electronic mail program 54A is a program for sending and receiving messages via a network from a communication line such as a telephone line. E-mail program 54
A has an incoming mail acquisition function as a specific function. This incoming mail acquisition function checks the mail server 93 to see if mail addressed to the user (user) has arrived in the mailbox 93A, and if there is mail addressed to the user, obtains the mail. Execute.

【００２２】オートパイロットプログラム５４Ｂは、予
め設定された複数の処理（またはプログラム）などを、
予め設定された順序で順次起動して、処理するプログラ
ムである。The auto-pilot program 54B includes a plurality of preset processes (or programs).
This is a program that is started and processed sequentially in a preset order.

【００２３】ＯＳ５４Ｃは、Windows９８（商標）に代
表される、コンピュータの基本的な動作を制御するもの
である。The OS 54C controls the basic operation of a computer typified by Windows 98 (trademark).

【００２４】一方、外部バス５５側のハードディスクド
ライブ（HDD）５６には、電子メールプログラム５６
Ａ、オートパイロットプログラム５６Ｂ、ＯＳ５６Ｃが
記憶されている。ハードディスクドライブ５６内のＯＳ
５６Ｃ、オートパイロットプログラム５６Ｂ、および電
子メールプログラム５６Ａは、起動（ブートアップ）処
理の過程で、RAM５４内に順次転送され、格納される。On the other hand, a hard disk drive (HDD) 56 on the external bus 55 side has an electronic mail program 56
A, an autopilot program 56B, and an OS 56C are stored. OS in the hard disk drive 56
56C, the auto-pilot program 56B, and the e-mail program 56A are sequentially transferred and stored in the RAM 54 in the course of the startup (boot-up) process.

【００２５】Ｉ／Ｏコントローラ５７は、マイクロコン
トローラ６１を有し、このマイクロコントローラ６１に
は、Ｉ／Ｏインタフェース６２が設けられている。この
マイクロコントローラ６１は、Ｉ／Ｏインタフェース６
２、CPU６３、RAM６４、ROM６９が相互に接続されて構
成されている。このRAM６４は、キー入力ステイタスレ
ジスタ６５、LED（発光ダイオード）制御レジスタ６
６、設定時刻レジスタ６７、レジスタ６８を有してい
る。設定時刻レジスタ６７は、ユーザが予め設定した時
刻（起動条件）になると起動シーケンス制御部７６の動
作を開始させる際に利用される。レジスタ６８は、予め
設定された操作キーの組み合わせ（起動条件）と、起動
すべきアプリケーションプログラムの対応を記憶するも
ので、その記憶された操作キーの組み合わせがユーザに
より入力されると、その記憶されたアプリケーションプ
ログラム（例えば電子メール）が起動されることにな
る。The I / O controller 57 has a microcontroller 61, and the microcontroller 61 is provided with an I / O interface 62. This microcontroller 61 includes an I / O interface 6
2, the CPU 63, the RAM 64, and the ROM 69 are connected to each other. The RAM 64 includes a key input status register 65 and an LED (light emitting diode) control register 6.
6, a set time register 67 and a register 68 are provided. The set time register 67 is used to start the operation of the start-up sequence control unit 76 at a time (start-up condition) set by the user in advance. The register 68 stores a correspondence between a preset operation key combination (start condition) and an application program to be started. When the stored operation key combination is input by the user, the register 68 stores the correspondence. The activated application program (for example, e-mail) is activated.

【００２６】キー入力ステイタスレジスタ６５は、ワン
タッチ操作用のプログラマブルパワーキー（PPK）９が
押されると、操作キーフラグが格納されるようになって
いる。LED制御レジスタ６６は、レジスタ６８に記憶さ
れたアプリケーションプログラム（電子メール）の立ち
上げ状態を表示するメッセージランプMLの点灯を制御す
るものである。設定時刻レジスタ６７は、所定の時刻を
任意に設定することができるものである。When the programmable power key (PPK) 9 for one-touch operation is pressed, the key input status register 65 stores an operation key flag. The LED control register 66 controls the lighting of a message lamp ML that indicates the activation state of the application program (e-mail) stored in the register 68. The set time register 67 can arbitrarily set a predetermined time.

【００２７】なお、このマイクロコントローラ６１に
は、バックアップ用のバッテリ７４が接続されており、
各レジスタ６５，６６，６７の値は、本体２の電源がオ
フとされている状態においても保持されるようになって
いる。A backup battery 74 is connected to the microcontroller 61.
The values of the registers 65, 66, and 67 are retained even when the power of the main unit 2 is turned off.

【００２８】マイクロコントローラ６１内のROM６９の
中には、ウェイクアッププログラム７０、キー入力監視
プログラム７１、LED制御プログラム７２が予め格納さ
れている。このROM６９は、例えばEEPROM（electricall
y erasable and programmable read only memory）で構
成されている。このEEPROMはフラッシュメモリとも呼ば
れている。さらにマイクロコントローラ６１には、常時
現在時刻をカウントするRTC（Real-Time Clock）７５が
接続されている。The ROM 69 in the microcontroller 61 stores a wake-up program 70, a key input monitoring program 71, and an LED control program 72 in advance. The ROM 69 is, for example, an EEPROM (electricall
y erasable and programmable read only memory). This EEPROM is also called a flash memory. Further, an RTC (Real-Time Clock) 75 that constantly counts the current time is connected to the microcontroller 61.

【００２９】ROM６９の中のウェイクアッププログラム
７０は、RTC７５から供給される現在時刻データに基づ
いて、設定時刻レジスタ６７に予め設定された時刻にな
ったかどうかをチェックして、設定された時刻になる
と、所定の処理（またはプログラム）などの起動をする
プログラムである。キー入力監視プログラム７１は、PP
K９が利用者により押されたかどうかを常時監視するプ
ログラムである。LED制御プログラム７２は、メッセー
ジランプMLの点灯を制御するプログラムである。The wake-up program 70 in the ROM 69 checks, based on the current time data supplied from the RTC 75, whether or not the time set in the set time register 67 has been reached. , A program for starting a predetermined process (or a program). The key input monitoring program 71
This is a program for constantly monitoring whether or not K9 has been pressed by the user. The LED control program 72 is a program for controlling lighting of the message lamp ML.

【００３０】ROM６９には、さらにBIOS（Basic Input O
utput System）７３が書き込まれている。このBIOS７３
は、電源投入時にＯＳ５６Ｃを起動したり、起動した
後、各種アプリケーションソフトウェアと周辺機器（デ
ィスプレイ、キーボード、ハードディスクドライブな
ど）の間でデータを授受する等の機能を有する。The ROM 69 further includes a BIOS (Basic Input O
utput System) 73 is written. This BIOS 73
Has a function of activating the OS 56C when the power is turned on, and transmitting and receiving data between various application software and peripheral devices (display, keyboard, hard disk drive, etc.) after the activation.

【００３１】外部バス５５に接続されているキーボード
コントローラ５８は、キーボード４からの入力をコント
ロールする。スティック式ポインティングデバイスコン
トローラ５９は、スティック式ポインティングデバイス
５の入力を制御する。A keyboard controller 58 connected to the external bus 55 controls an input from the keyboard 4. The stick pointing device controller 59 controls an input of the stick pointing device 5.

【００３２】サウンドチップ６０は、マイクロホン２４
からの入力を取り込み、あるいは内蔵スピーカ８に対し
て音声信号を供給する。The sound chip 60 includes the microphone 24
Or an audio signal is supplied to the built-in speaker 8.

【００３３】モデム５０は、公衆電話回線９０、インタ
ーネットサービスプロバイダ９１を介して、インターネ
ットなどの通信ネットワーク９２やメールサーバ９３な
どに接続することができる。The modem 50 can be connected to a communication network 92 such as the Internet, a mail server 93, and the like via a public telephone line 90 and an Internet service provider 91.

【００３４】内部バス５１に接続されているグラフィッ
クチップ８１には、CCDビデオカメラ２３で取り込んだ
画像データが、処理部８２で処理された後、ＺＶ（Ｚｏ
ｏｍｅｄＶｉｄｅｏ）ポートを介して入力されるよう
になされている。グラフィックチップ８１は、処理部８
２を介してCCDビデオカメラ２３より入力されたビデオ
データを、内蔵するVRAM８１に記憶し、適宜、これを読
み出して、LCDコントローラ８３に出力する。LCDコント
ローラ８３は、グラフィックチップ８１より供給された
画像データをLCD２１に出力し、表示させる。バックラ
イト８４は、LCD２１を後方から照明するようになされ
ている。After the image data captured by the CCD video camera 23 is processed by the processing unit 82, the graphics chip 81 connected to the internal bus 51
omed Video) port. The graphic chip 81 includes the processing unit 8
Video data input from the CCD video camera 23 via the VRAM 2 is stored in the built-in VRAM 81, read out as appropriate, and output to the LCD controller 83. The LCD controller 83 outputs the image data supplied from the graphic chip 81 to the LCD 21 for display. The backlight 84 illuminates the LCD 21 from behind.

【００３５】電源スイッチ４０は、電源をオンまたはオ
フするとき操作される。半押しスイッチ８５は、シャッ
タボタン１０が半押し状態にされたときオンされ、全押
しスイッチ８６は、シャッタボタン１０が全押し状態に
されたときオンされる。反転スイッチ８７は、撮像部２
２が１８０度回転されたとき（CCDビデオカメラ２３がL
CD２１の反対側を撮像する方向に回転されたとき）、オ
ンされるようになされている。The power switch 40 is operated when the power is turned on or off. The half-press switch 85 is turned on when the shutter button 10 is half-pressed, and the full-press switch 86 is turned on when the shutter button 10 is fully pressed. The reversing switch 87 is connected to the imaging unit 2
2 is rotated 180 degrees (CCD video camera 23
It is turned on when it is rotated in the direction of imaging the opposite side of the CD 21).

【００３６】ドライブ８８は、外部バス５５に接続され
ている。ドライブ８８は、磁気ディスク３５１（フロッ
ピディスクを含む）、光ディスク３５２（CD-ROM(Compa
ct Disc-Read Only Memory)、DVD(Digital Versatile D
isc)を含む）、光磁気ディスク３５３（ＭＤ(Mini-Dis
c)を含む）、または半導体メモリ３５４などが装着さ
れ、装着された磁気ディスク３５１、光ディスク３５
２、光磁気ディスク３５３、または半導体メモリ３５４
などに記録されているプログラムまたはデータを、外部
バス５５または内部バス５１を介して、HDD５６またはR
AM５４に供給する。The drive 88 is connected to the external bus 55. The drive 88 includes a magnetic disk 351 (including a floppy disk) and an optical disk 352 (CD-ROM (Compa
ct Disc-Read Only Memory), DVD (Digital Versatile D)
isc)), magneto-optical disk 353 (MD (Mini-Dis
c)) or a semiconductor memory 354 or the like, and the mounted magnetic disk 351 and optical disk 35
2. Magneto-optical disk 353 or semiconductor memory 354
Program or data recorded in the HDD 56 or the R via the external bus 55 or the internal bus 51.
Supply to AM54.

【００３７】ドライブ８８は、外部バス５５または内部
バス５１を介して、モデム５０、HDD５６、またはRAM５
４から供給されたプログラムまたはデータなどを、装着
された磁気ディスク３５１、光ディスク３５２、光磁気
ディスク３５３、または半導体メモリ３５４などに記録
させる。The drive 88 is connected to the modem 50, the HDD 56, or the RAM 5 via the external bus 55 or the internal bus 51.
4 is recorded on the mounted magnetic disk 351, optical disk 352, magneto-optical disk 353, semiconductor memory 354, or the like.

【００３８】図８は、音声認識に係るプログラムをパー
ソナルコンピュータ１が起動させたときの、所定のプロ
グラムによる機能ブロックを示す図である。音声認識エ
ンジン１０１は、読み仮名辞書データベース１１１に予
め記憶されている漢字に対する読み、またはエンジン用
認識単語・文法データベース１１２に予め記憶されてい
る認識単語、若しくは文法を基に、マイクロホン２４か
ら入力された使用者の音声に対応するデータを入力し、
使用者が発話した言葉に対応するテキストなどの所定の
方式のデータを生成して、音声コマンダ１０２に供給す
る。FIG. 8 is a diagram showing functional blocks according to a predetermined program when the personal computer 1 starts a program relating to voice recognition. The speech recognition engine 101 is input from the microphone 24 based on the reading of the kanji stored in the reading kana dictionary database 111 in advance, or the recognition word or grammar stored in the engine recognition word / grammar database 112 in advance. Enter the data corresponding to the user's voice
The data in a predetermined format such as text corresponding to the words spoken by the user is generated and supplied to the voice commander 102.

【００３９】音声認識エンジン１０１は、音声コマンダ
１０２から認識単語、若しくは文法などのデータを受信
して、読み仮名辞書データベース１１１またはエンジン
用認識単語・文法データベース１１２に記憶させる。The voice recognition engine 101 receives data such as a recognized word or grammar from the voice commander 102 and stores the data in the reading kana dictionary database 111 or the recognized word / grammar database 112 for the engine.

【００４０】音声コマンダ１０２は、使用者が発話した
所定の言葉に対応する単語（テキストなど）などのデー
タが音声認識エンジン１０１から供給されたとき、静止
画撮影プログラム１０３、静止画閲覧プログラム１０
４、若しくは電子ペットプログラム１０５を起動させ、
または静止画撮影プログラム１０３、静止画閲覧プログ
ラム１０４、若しくは電子ペットプログラム１０５に所
定コマンド（使用者が発話した言葉に対応する）を送信
する。When data such as a word (text or the like) corresponding to a predetermined word spoken by the user is supplied from the voice recognition engine 101, the voice commander 102 executes the still image photographing program 103 and the still image browsing program 10
4 or start the electronic pet program 105,
Alternatively, a predetermined command (corresponding to a word spoken by the user) is transmitted to the still image photographing program 103, the still image browsing program 104, or the electronic pet program 105.

【００４１】音声コマンダ１０２は、使用者が発話した
他の所定の言葉に対応する単語（テキストなど）などの
データが音声認識エンジン１０１から供給されたとき、
ランチャ設定データベース１１３に記憶されている起動
に関する設定に基づき、電子メールプログラム５４Ａ、
ワードプロセッサプログラム１０６、または表計算プロ
グラム１０７を起動させ、電子メールプログラム５４Ａ
にメールアドレスなどの所定のデータを供給する。When the voice commander 102 receives data such as a word (text or the like) corresponding to another predetermined word spoken by the user from the voice recognition engine 101,
The e-mail program 54A, based on the setting related to the activation stored in the launcher setting database 113,
Activate the word processor program 106 or the spreadsheet program 107 and execute the e-mail program 54A.
Is supplied with predetermined data such as a mail address.

【００４２】また、音声コマンダ１０２は、グラフィカ
ルなユーザインターフェースを有し、使用者により、グ
ラフィカルなユーザインターフェースを介して種々の設
定がなされ、使用者により設定された内容を分類して、
アプリケーションプログラム（電子メールプログラム５
４Ａ、ワードプロセッサプログラム１０６、または表計
算プログラム１０７）の起動に関する設定をランチャ設
定データベース１１３に、漢字の読み、または静止画撮
影プログラム１０３、静止画閲覧プログラム１０４、若
しくは電子ペットプログラム１０５のコマンドなどに関
する設定を辞書設定データベース１１４に、音声認識す
る単語または文法に関する設定を認識単語・文法データ
ベース１１５にそれぞれ記憶させる。The voice commander 102 has a graphical user interface. Various settings are made by the user through the graphical user interface, and the contents set by the user are classified.
Application program (E-mail program 5
4A, the settings related to the activation of the word processor program 106 or the spreadsheet program 107) in the launcher setting database 113, the settings related to reading of kanji, or commands of the still image photographing program 103, the still image browsing program 104, or the electronic pet program 105 Is stored in the dictionary setting database 114, and the setting relating to the word or grammar to be recognized is stored in the recognition word / grammar database 115, respectively.

【００４３】音声コマンダ１０２は、所定のタイミング
で、例えば、音声認識エンジン１０１に音声を認識させ
るとき、認識単語・文法データベース１１５に記憶して
いる認識単語のデータおよび文法のデータを、音声認識
エンジン１０１に送信する。When the voice commander 102 causes the voice recognition engine 101 to recognize a voice at a predetermined timing, for example, the voice commander 102 converts the recognition word data and the grammar data stored in the recognition word / grammar database 115 into the voice recognition engine 101. Send to 101.

【００４４】音声認識エンジン１０１は、ＯＳ５４Ｃを
起動するとき入力される使用者を判別するデータに基づ
いて、その使用者用の読み仮名辞書データベース１１１
およびエンジン用認識単語・文法データベース１１２を
利用する。音声コマンダ１０２は、ＯＳ５４Ｃを起動す
るとき入力される使用者を判別するデータに基づいて、
その使用者用のランチャ設定データベース１１３、辞書
設定データベース１１４、および認識単語・文法データ
ベース１１５を利用する。The speech recognition engine 101 reads the kana dictionary database 111 for the user based on the data inputted when the OS 54C is started to determine the user.
And the engine recognition word / grammar database 112 is used. The voice commander 102 determines the user based on data input when the OS 54C is started,
The launcher setting database 113, dictionary setting database 114, and recognized word / grammar database 115 for the user are used.

【００４５】仮名辞書データベース１１１、エンジン用
認識単語・文法データベース１１２、ランチャ設定デー
タベース１１３、辞書設定データベース１１４、および
認識単語・文法データベース１１５は、パーソナルコン
ピュータ１の使用者毎に生成され、HDD５６に記録され
る。A kana dictionary database 111, an engine recognition word / grammar database 112, a launcher setting database 113, a dictionary setting database 114, and a recognition word / grammar database 115 are generated for each user of the personal computer 1 and recorded on the HDD 56. Is done.

【００４６】静止画撮影プログラム１０３は、CCDビデ
オカメラ２３から入力された画像を、シャッタボタン１
０などの操作に対応した信号に基づき、静止画像のデー
タを生成して、所定のファイルとしてHDD５６に記録す
る。The still image shooting program 103 converts the image input from the CCD video camera 23 into the shutter button 1
Based on a signal corresponding to an operation such as 0, data of a still image is generated and recorded in the HDD 56 as a predetermined file.

【００４７】静止画閲覧プログラム１０４は、静止画撮
影プログラム１０３が記録させた静止画像のファイルを
選択し、または使用者に選択させ、選択された静止画像
をLCD２１に表示させる。電子ペットプログラム１０５
は、LCD２１に仮想的なペットを表示させ、使用者の操
作に対応して、仮想的なペットに指示などを与える。The still image browsing program 104 selects a file of the still image recorded by the still image photographing program 103 or allows the user to select the file, and causes the LCD 21 to display the selected still image. Electronic pet program 105
Displays a virtual pet on the LCD 21 and gives an instruction to the virtual pet in response to a user operation.

【００４８】ワードプロセッサプログラム１０６は、文
字または図形などから成る文書を編集するためのプログ
ラムである。表計算プログラム１０７は、所定の形式の
表に配置された数値に所定の演算を実行する、または配
置された数値に対応するグラフを描写するなどの機能を
有する。The word processor program 106 is a program for editing a document composed of characters or figures. The spreadsheet program 107 has a function of executing a predetermined operation on numerical values arranged in a table of a predetermined format, or drawing a graph corresponding to the arranged numerical values.

【００４９】図９は、音声コマンダ１０２のより詳細な
機能を説明する図である。ＵＩ（ユーザインターフェー
ス）処理部１２３は、アプリケーション通信部１２１、
エンジン通信部１２２、音声ランチャ制御部１２４、ユ
ーザ辞書制御部１２５、または認識テスト処理部１２６
から所定のデータを入力するとともに、キーボード４ま
たはスティック式ポインティングデバイス５などから所
定の信号を入力して、マイクロフォン２４を介して入力
された音声の大きさまたは音声認識の結果などを、所定
のウィンドウに表示させる。ＵＩ処理部１２３は、所定
のプログラムを起動させるとき、アプリケーション通信
部１２１、または音声ランチャ制御部１２４から入力さ
れたデータを基に、所定の画像をLCD２１に表示させ
る。FIG. 9 is a diagram for explaining more detailed functions of the voice commander 102. The UI (user interface) processing unit 123 includes an application communication unit 121,
Engine communication unit 122, voice launcher control unit 124, user dictionary control unit 125, or recognition test processing unit 126
And a predetermined signal from the keyboard 4 or the stick-type pointing device 5 and the like, and the loudness of the voice input through the microphone 24 or the result of voice recognition is displayed in a predetermined window. To be displayed. When activating a predetermined program, the UI processing unit 123 displays a predetermined image on the LCD 21 based on data input from the application communication unit 121 or the audio launcher control unit 124.

【００５０】ＵＩ処理部１２３は、キーボード４または
ステッィク式ポインティングデバイス５などの操作に対
応した信号を基に、ＵＩ処理部１２３自身の状態を変化
させ、所定のデータをアプリケーション通信部１２１、
エンジン通信部１２２、音声ランチャ制御部１２４、ユ
ーザ辞書制御部１２５、または認識テスト処理部１２６
に供給する。The UI processing unit 123 changes the state of the UI processing unit 123 itself on the basis of a signal corresponding to the operation of the keyboard 4 or the stick type pointing device 5, and transmits predetermined data to the application communication unit 121.
Engine communication unit 122, voice launcher control unit 124, user dictionary control unit 125, or recognition test processing unit 126
To supply.

【００５１】また、ＵＩ処理部１２３は、静止画撮影プ
ログラム１０３、静止画閲覧プログラム１０４、および
電子ペットプログラム１０５の状態、並びにエンジン通
信部１２２を介して音声認識エンジン１０１から供給さ
れた、使用者が発話した所定の言葉に対応する所定のテ
キストなどのデータを基に、アプリケーション通信部１
２１または音声ランチャ制御部１２４に、コマンドを送
信または所定のプログラムの起動をさせるか否かを決定
し、アプリケーション通信部１２１または音声ランチャ
制御部１２４にコマンドを送信させ、または所定のプロ
グラムの起動させる。Further, the UI processing unit 123 controls the state of the still image photographing program 103, the still image browsing program 104, and the electronic pet program 105, and the user supplied from the voice recognition engine 101 via the engine communication unit 122. Based on data such as a predetermined text corresponding to a predetermined word spoken by the application communication unit 1
21 or the voice launcher control unit 124 determines whether to transmit a command or activate a predetermined program, and causes the application communication unit 121 or the voice launcher control unit 124 to transmit the command or activate a predetermined program. .

【００５２】アプリケーション通信部１２１は、静止画
撮影プログラム１０３、静止画閲覧プログラム１０４、
または電子ペットプログラム１０５を起動させ、起動し
ている静止画撮影プログラム１０３、静止画閲覧プログ
ラム１０４、または電子ペットプログラム１０５と通信
を行い、静止画撮影プログラム１０３、静止画閲覧プロ
グラム１０４、または電子ペットプログラム１０５から
それぞれの状態を示すデータを受信する。The application communication unit 121 includes a still image photographing program 103, a still image browsing program 104,
Alternatively, the electronic pet program 105 is activated, and communicates with the activated still image photographing program 103, still image browsing program 104, or electronic pet program 105, and the still image photographing program 103, the still image browsing program 104, or the electronic pet is activated. Data indicating each state is received from the program 105.

【００５３】アプリケーション通信部１２１は、静止画
撮影プログラム１０３、静止画閲覧プログラム１０４、
および電子ペットプログラム１０５の状態を示すデータ
などをエンジン通信部１２２またはＵＩ処理部１２３に
供給するとともに、エンジン通信部１２２またはＵＩ処
理部１２３から、使用者が発話した所定の言葉に対応す
る所定のテキストなどのデータ、または使用者のキーボ
ード４などへの操作に対応するデータなどを受信する。The application communication unit 121 includes a still image photographing program 103, a still image browsing program 104,
In addition to supplying data indicating the state of the electronic pet program 105 to the engine communication unit 122 or the UI processing unit 123, the engine communication unit 122 or the UI processing unit 123 outputs predetermined data corresponding to a predetermined word spoken by the user. It receives data such as text, data corresponding to a user's operation on the keyboard 4, and the like.

【００５４】また、アプリケーション通信部１２１は、
静止画撮影プログラム１０３、静止画閲覧プログラム１
０４、および電子ペットプログラム１０５の状態、並び
にエンジン通信部１２２を介して音声認識エンジン１０
１から供給された、使用者が発話した所定の言葉に対応
する所定のテキストなどのデータを基に、静止画撮影プ
ログラム１０３、静止画閲覧プログラム１０４、若しく
は電子ペットプログラム１０５のいずれかを起動させ、
または静止画撮影プログラム１０３、静止画閲覧プログ
ラム１０４、若しくは電子ペットプログラム１０５のい
ずれかに所定のコマンドを供給する。Also, the application communication unit 121
Still image shooting program 103, still image viewing program 1
04, the state of the electronic pet program 105, and the voice recognition engine 10 via the engine communication unit 122.
On the basis of data such as a predetermined text corresponding to a predetermined word spoken by the user supplied from 1, one of the still image photographing program 103, the still image browsing program 104, and the electronic pet program 105 is activated. ,
Alternatively, a predetermined command is supplied to any one of the still image photographing program 103, the still image browsing program 104, and the electronic pet program 105.

【００５５】静止画撮影プログラム１０３、静止画閲覧
プログラム１０４、および電子ペットプログラム１０５
のいずれもが、フォーカスがあてられていないとき（い
ずれもアクティブでないとき）、音声コマンダ１０２
は、静止画撮影プログラム１０３、静止画閲覧プログラ
ム１０４、または電子ペットプログラム１０５のいずれ
かを対象としたコマンドを実行できない。The still image photographing program 103, the still image browsing program 104, and the electronic pet program 105
Are not focused (when neither is active), the voice commander 102
Cannot execute a command for any one of the still image shooting program 103, the still image browsing program 104, and the electronic pet program 105.

【００５６】静止画撮影プログラム１０３、静止画閲覧
プログラム１０４、および電子ペットプログラム１０５
のいずれかが、フォーカスがあてられているとき（いず
れかがアクティブであるとき）、音声コマンダ１０２
は、アクティブである、静止画撮影プログラム１０３、
静止画閲覧プログラム１０４、または電子ペットプログ
ラム１０５のいずれかを対象としたコマンドを実行する
ことができる。A still image photographing program 103, a still image browsing program 104, and an electronic pet program 105
Are focused (when either is active), the voice commander 102
Is an active still image shooting program 103,
A command for either the still image browsing program 104 or the electronic pet program 105 can be executed.

【００５７】このような静止画撮影プログラム１０３、
静止画閲覧プログラム１０４、または電子ペットプログ
ラム１０５のいずれかの特定のプログラムを対象とした
コマンドをローカルなコマンドと称する。Such a still image photographing program 103,
A command for a specific one of the still image browsing program 104 and the electronic pet program 105 is referred to as a local command.

【００５８】なお、音声コマンダ１０２がローカルなコ
マンドを送信するプログラムを特定する方法は、フォー
カスに限らず、他の状態またはデータを参照するように
してもよい。The method by which the voice commander 102 specifies a program for transmitting a local command is not limited to the focus, but may refer to another state or data.

【００５９】エンジン通信部１２２は、所定の方式を基
づいて、認識単語・文法データベース１１５から認識単
語のデータおよび文法のデータを読み出して、そのデー
タを音声認識エンジン１０１に送信するとともに、音声
認識エンジン１０１から供給された使用者が発話した所
定の言葉に対応する所定のテキストなどのデータを受信
する。The engine communication unit 122 reads the recognized word data and the grammatical data from the recognized word / grammar database 115 based on a predetermined method, transmits the data to the voice recognition engine 101, and transmits the data to the voice recognition engine 101. Data such as a predetermined text corresponding to a predetermined word spoken by the user supplied from 101 is received.

【００６０】エンジン通信部１２２は、例えば、図１０
に例を示すMicrosoft Speech API（商標）（以下、SAPI
と称する）に規定された方式で、音声認識エンジン１０
１に認識単語・文法データベース１１５に記憶されてい
る認識単語のデータおよび文法のデータを送信する。図
１０に示すデータの例には、音声認識の対象が<Global>
および<SVCommand>から構成され、<Global>が更に(Chan
geWin)，(VoiceCommand)から構成され、<SVCommand>が
「ヘルプ」、「前へ」などのコマンドの他、<SendMail>
で表されるメールのコマンドも含むことが記述されてい
る。また、図１０に示すデータの例には、「ヘルプ」と
いうコマンドのコード番号が１０２であり、「パパ」と
いう読みを有する単語に「daddy@test.company.co.jp」
という文字列が関連していることなどが示されている。The engine communication unit 122 is, for example, as shown in FIG.
Microsoft Speech API (trademark) (hereafter, SAPI
The speech recognition engine 10
1, the data of the recognized word and the data of the grammar stored in the recognized word / grammar database 115 are transmitted. In the data example shown in FIG. 10, the target of speech recognition is <Global>
And <SVCommand>, and <Global> is further (Chan
geWin), (VoiceCommand), where <SVCommand> is a command such as "Help" or "Previous", and <SendMail>
It is described that it also includes the mail command represented by. In the example of the data shown in FIG. 10, the code number of the command “help” is 102, and the word having the pronunciation “dad” is “daddy@test.company.co.jp”.
It is shown that the character string is related.

【００６１】音声認識エンジン１０１は、エンジン通信
部１２２から受信したデータを、所定の方式のデータに
変換して、読み仮名辞書データベース１１１またはエン
ジン用認識単語・文法データベース１１２に記憶させ、
読み仮名辞書データベース１１１またはエンジン用認識
単語・文法データベース１１２に記憶しているデータに
基づき、音声認識の処理を実行する。The speech recognition engine 101 converts the data received from the engine communication unit 122 into data of a predetermined method, and stores the data in the reading kana dictionary database 111 or the recognized word / grammar database 112 for the engine.
Based on the data stored in the reading kana dictionary database 111 or the engine recognition word / grammar database 112, a speech recognition process is executed.

【００６２】音声認識エンジン１０１は、エンジン通信
部１２２に、使用者が発話した所定の言葉に対応する、
コード番号（例えば、１０２など）、認識した単語また
は文（例えば、”パパにメール”など）、および認識し
た単語に関連する文字列（例えば、”daddy@test.compa
ny.co.jp”）のデータを送信する。The speech recognition engine 101 causes the engine communication unit 122 to correspond to a predetermined word spoken by the user.
A code number (eg, 102), a recognized word or sentence (eg, “Email Dad”), and a character string associated with the recognized word (eg, “daddy@test.compa”)
ny.co.jp ”).

【００６３】例えば、使用者がマイクロフォン２４に向
かって「パパにメール」という音声を入力して、音声認
識エンジン１０１が正しく音声を認識したとき、音声認
識エンジン１０１は、7fffffff（１６進数）、”パパに
メール”、および”daddy@test.company.co.jp”をエン
ジン通信部１２２に送信する。For example, when the user inputs a voice of “mail to dad” into the microphone 24 and the voice recognition engine 101 correctly recognizes the voice, the voice recognition engine 101 outputs 7fffffff (hexadecimal), “ "E-mail to dad" and "daddy@test.company.co.jp" are transmitted to engine communication unit 122.

【００６４】エンジン通信部１２２は、音声認識エンジ
ン１０１から受信したデータを基に、受信したデータを
アプリケーション通信部１２１、ＵＩ処理部１２３、音
声ランチャ制御部１２４、ユーザ辞書制御部１２５、ま
たは認識テスト処理部１２６のいずれに送信するかを判
断し、その判断に基づいて、音声認識エンジン１０１か
ら受信したデータを所定の方式に変換して、選択された
アプリケーション通信部１２１、ＵＩ処理部１２３、音
声ランチャ制御部１２４、ユーザ辞書制御部１２５、ま
たは認識テスト処理部１２６のいずれかに変換したデー
タを供給する。The engine communication unit 122 converts the received data based on the data received from the speech recognition engine 101 into an application communication unit 121, a UI processing unit 123, a speech launcher control unit 124, a user dictionary control unit 125, or a recognition test. It determines which of the processing units 126 to transmit, and based on the determination, converts the data received from the speech recognition engine 101 into a predetermined method, and selects the selected application communication unit 121, UI processing unit 123, The converted data is supplied to any one of the launcher control unit 124, the user dictionary control unit 125, and the recognition test processing unit 126.

【００６５】音声ランチャ制御部１２４は、グラフィカ
ルなユーザインターフェースを表示させて使用者により
入力された、アプリケーションプログラム（電子メール
プログラム５４Ａ、ワードプロセッサプログラム１０
６、または表計算プログラム１０７）の起動に関する設
定をランチャ設定データベース１１３に保存させるとと
もに、その設定に基づき、認識単語・文法データベース
１１５に記憶されている音声認識する単語または文法に
関する設定を更新させる。The voice launcher control unit 124 displays an application program (e-mail program 54A, word processor program 10
6, or the setting relating to the activation of the spreadsheet program 107) is stored in the launcher setting database 113, and the setting relating to the speech recognition word or grammar stored in the recognition word / grammar database 115 is updated based on the setting.

【００６６】音声ランチャ制御部１２４は、エンジン通
信部１２２からランチャに関するデータを受信したと
き、ランチャ設定データベース１１３に記憶されている
起動に関する設定に基づき、電子メールプログラム５４
Ａ、ワードプロセッサプログラム１０６、または表計算
プログラム１０７のいずれかを起動させ、電子メールプ
ログラム５４Ａにメールアドレスなどを供給する。When the data related to the launcher is received from the engine communication unit 122, the voice launcher control unit 124 executes the e-mail program 54 based on the setting related to the activation stored in the launcher setting database 113.
A, activates any one of the word processor program 106 and the spreadsheet program 107 and supplies a mail address and the like to the e-mail program 54A.

【００６７】音声コマンダ１０２は、フォーカスの状態
にかかわらず（いずれのプログラムがアクティブであっ
ても）、電子メールプログラム５４Ａ、ワードプロセッ
サプログラム１０６、または表計算プログラム１０７の
いずれかを起動させるコマンドを実行することができ
る。The voice commander 102 executes a command to activate any one of the electronic mail program 54A, the word processor program 106, and the spreadsheet program 107, regardless of the focus state (no matter which program is active). be able to.

【００６８】このような、フォーカスの状態などにかか
わらず、常に実行することができる、例えば、電子メー
ルプログラム５４Ａ、ワードプロセッサプログラム１０
６、または表計算プログラム１０７のいずれかを起動さ
せるコマンドをグローバルなコマンドと称する。Regardless of the focus state, the program can be always executed. For example, the electronic mail program 54A, the word processor program 10
6 or a command for activating one of the spreadsheet programs 107 is referred to as a global command.

【００６９】ユーザ辞書制御部１２５は、グラフィカル
なユーザインターフェースを表示させ使用者により入力
された、認識する音声に関する設定を辞書設定データベ
ース１１４に記憶させるとともに、その設定に基づき、
認識単語・文法データベース１１５に記憶されている音
声認識する単語または文法に関する設定を更新させる。The user dictionary control unit 125 displays a graphical user interface, stores the setting relating to the recognized voice input by the user in the dictionary setting database 114, and based on the setting,
The setting related to the word or grammar for speech recognition stored in the recognition word / grammar database 115 is updated.

【００７０】認識テスト処理部１２６は、使用者により
テストを実行する旨がユーザ辞書制御部１２５に入力さ
れたとき、グラフィカルなユーザインターフェースを表
示させて、辞書設定データベース１１４に記憶され、選
択されている所定の１の単語と、エンジン通信部１２２
を介して、音声認識エンジン１０１から供給された、音
声を認識した結果を示す単語とが一致するか否かを判定
し、その判定の結果を表示する。When the user inputs a command to execute a test to the user dictionary control unit 125, the recognition test processing unit 126 displays a graphical user interface, and stores the graphical user interface in the dictionary setting database 114. The predetermined one word and the engine communication unit 122
And determines whether the word supplied from the voice recognition engine 101 matches the word indicating the result of the voice recognition, and displays the result of the determination.

【００７１】また、認識テスト処理部１２６は、使用者
によりテストを実行する旨がユーザ辞書制御部１２５に
入力されたとき、グラフィカルなユーザインターフェー
スを表示させて、エンジン通信部１２２を介して、音声
認識エンジン１０１から供給された、音声を認識した結
果を示す単語が、辞書設定データベース１１４に記憶さ
れ、選択されている所定の１以上の単語に含まれるか否
かを判定し、その判定の結果を表示する。When the user inputs a command to the user dictionary control unit 125 to execute a test, the recognition test processing unit 126 displays a graphical user interface and outputs a voice via the engine communication unit 122. It is determined whether or not the word indicating the result of the voice recognition supplied from the recognition engine 101 is stored in the dictionary setting database 114 and is included in one or more selected predetermined words. Is displayed.

【００７２】音声コマンダ１０２が起動されると、ＵＩ
処理部１２３は、LCD２１に起動中を示す画像を表示さ
せるとともに、図１１に示す音声コマンダ１０２のウィ
ンドウを表示させ、音声認識エンジン１０１の起動を待
つ状態１に遷移する。When the voice commander 102 is activated, the UI
The processing unit 123 causes the LCD 21 to display an image indicating that the voice commander is running, displays the window of the voice commander 102 shown in FIG.

【００７３】音声コマンダウィンドウ１６１は、レベル
ゲージ１６２、認識結果表示部１６３、ランチャ設定ボ
タン１６４、辞書管理ボタン１６５、ヘルプボタン１６
６、最小化ボタン１６７、閉じるボタン１６８、認識状
態表示部１６９、および音声入力モード切り換えボタン
１７０を有する。The voice commander window 161 includes a level gauge 162, a recognition result display section 163, a launcher setting button 164, a dictionary management button 165, and a help button 16.
6, a minimize button 167, a close button 168, a recognition state display section 169, and a voice input mode switching button 170.

【００７４】レベルゲージ１６２は、マイクロフォン２
４を介して入力された使用者の音声のレベル（マイクロ
フォン２４が出力する信号の振幅）を表示する。認識結
果表示部１６３は、エンジン通信部１２２から供給され
た認識された音声に対応する単語または文を表示する。
ランチャ設定ボタン１６４は、電子メールプログラム５
４Ａ、ワードプロセッサプログラム１０６、または表計
算プログラム１０７の起動に関する設定をするとき、操
作される。The level gauge 162 is connected to the microphone 2
4 shows the level of the user's voice input via the microphone 4 (the amplitude of the signal output from the microphone 24). The recognition result display unit 163 displays a word or a sentence corresponding to the recognized voice supplied from the engine communication unit 122.
The launcher setting button 164 is used for the e-mail program 5
4A, the word processor program 106 or the spreadsheet program 107 is operated to make settings for activation.

【００７５】辞書管理ボタン１６５は、認識する音声に
関する設定を辞書設定データベース１１４に記憶させる
とき、操作される。ヘルプボタン１６６は、オンライン
ヘルプをLCD２１に表示させるとき、操作される。最小
化ボタン１６７は、音声コマンダウィンドウ１６１をLC
D２１から消去し、例えば、タスクトレイ上に所定のア
イコンを表示させるとき、操作される。閉じるボタン１
６８は、音声コマンダ１０２を終了させるとき、操作さ
れる。The dictionary management button 165 is operated when the setting relating to the voice to be recognized is stored in the dictionary setting database 114. The help button 166 is operated when displaying online help on the LCD 21. The minimize button 167 allows the voice commander window 161 to be
The operation is performed when the icon is deleted from D21 and a predetermined icon is displayed on the task tray, for example. Close button 1
Reference numeral 68 is operated when the voice commander 102 is terminated.

【００７６】認識状態表示部１６９は、音声認識エンジ
ン１０１の状態またはローカルコマンドが使用できるか
否か（所定のプログラムがアクティブであるか否か）な
どを表示する。音声入力モード切り換えボタン１７０
は、常時認識モードと通常の認識モードとを切り換える
ときに、操作される。The recognition state display section 169 displays the state of the speech recognition engine 101, whether a local command can be used (whether a predetermined program is active or not) and the like. Voice input mode switching button 170
Is operated when switching between the normal recognition mode and the normal recognition mode.

【００７７】音声認識エンジン１０１が起動された場
合、上述したような音声コマンダウィンドウ１６１がLC
D２１に表示される。このような状態で、閉じるボタン
１６８がクリックされると、ＵＩ処理部１２３は、音声
コマンダ１０２を終了させる。また、使用者が音声認識
に割り当てているキー（例えば、キーボード４のコント
ロールキーなど。以下、認識キーと称する）を押圧した
とき、ＵＩ処理部１２３は、音声入力可能な状態に遷移
する。When the voice recognition engine 101 is activated, the voice commander window 161 as described above
Displayed in D21. When the close button 168 is clicked in such a state, the UI processing unit 123 ends the voice commander 102. When the user presses a key assigned to voice recognition (for example, a control key of the keyboard 4; hereinafter, referred to as a recognition key), the UI processing unit 123 transits to a state in which voice input is possible.

【００７８】音声入力可能な状態に遷移するとき、ＵＩ
処理部１２３は、アプリケーション通信部１２１から静
止画撮影プログラム１０３、静止画閲覧プログラム１０
４、および電子ペットプログラム１０５の内、アクティ
ブであるプログラムを示すデータを受信し、アクティブ
であるプログラムの名称を音声コマンダウィンドウ１６
１の認識状態表示部１６９に表示させる。静止画撮影プ
ログラム１０３、静止画閲覧プログラム１０４、または
電子ペットプログラム１０５のいずれもアクティブでな
いとき、ＵＩ処理部１２３は、音声コマンダウィンドウ
１６１の認識状態表示部１６９にその旨（例えば、”Gl
obal Command”など）を表示させる。When transitioning to a state where voice input is possible, the UI
The processing unit 123 transmits the still image photographing program 103 and the still image browsing program 10 from the application communication unit 121.
4, and data indicating the active program among the electronic pet programs 105 is received, and the name of the active program is entered in the voice commander window 16.
1 is displayed on the recognition state display unit 169. When none of the still image photographing program 103, the still image browsing program 104, or the electronic pet program 105 is active, the UI processing unit 123 displays a message to that effect on the recognition status display unit 169 of the voice commander window 161 (for example, “Gl”).
obal Command ”).

【００７９】音声入力可能な状態において、使用者がマ
イクロフォン２４から入力させた音声に対応する信号が
音声認識エンジン１０１に供給され、音声認識エンジン
１０１に供給された音声に対応する信号のレベルに対応
するデータが、エンジン通信部１２２を介して、ＵＩ処
理部１２３に供給される。また、ＵＩ処理部１２３は、
音声に対応する信号のレベルに対応するデータに基づ
き、音声コマンダウィンドウ１６１のレベルゲージ１６
２の表示を更新する。In a state where voice input is possible, a signal corresponding to the voice input by the user from the microphone 24 is supplied to the voice recognition engine 101, and a signal corresponding to the level of the signal supplied to the voice recognition engine 101 is provided. Is supplied to the UI processing unit 123 via the engine communication unit 122. Also, the UI processing unit 123
Based on the data corresponding to the level of the signal corresponding to the voice, the level gauge 16 of the voice commander window 161 is used.
Update the display of 2.

【００８０】さらに、音声認識エンジン１０１が音声を
認識したとき、ＵＩ処理部１２３は、音声認識エンジン
１０１から認識した単語または文などのデータを受信
し、音声コマンダウィンドウ１６１の認識結果表示部１
６３に認識した単語または文を表示させる。Further, when the speech recognition engine 101 recognizes the speech, the UI processing unit 123 receives the data such as the recognized word or sentence from the speech recognition engine 101 and displays the data in the recognition result display unit 1 of the speech commander window 161.
63 displays the recognized word or sentence.

【００８１】音声認識可能な状態において、使用者が認
識キーを離したとき、ＵＩ処理部１２３は、アプリケー
ション通信部１２１または音声ランチャ制御部１２４
に、音声認識エンジン１０１から供給された、コード番
号、認識した単語または文、および認識した単語に関連
する文字列のデータに対応する、所定の動作（例えば、
電子メールプログラム５４Ａの起動など）を要求する。When the user releases the recognition key in a state in which the voice can be recognized, the UI processing unit 123 controls the application communication unit 121 or the voice launcher control unit 124.
A predetermined operation (for example, corresponding to the code number, the recognized word or sentence, and the character string data related to the recognized word, supplied from the voice recognition engine 101)
(E.g., activation of the e-mail program 54A).

【００８２】このとき、アプリケーション通信部１２１
は、ＵＩ処理部１２３からの要求に対応して、静止画撮
影プログラム１０３、静止画閲覧プログラム１０４、若
しくは電子ペットプログラム１０５のいずれかを起動さ
せ、または静止画撮影プログラム１０３、静止画閲覧プ
ログラム１０４、若しくは電子ペットプログラム１０５
のいずれかに所定のコマンドを送信する。At this time, the application communication unit 121
Starts one of the still image photographing program 103, the still image browsing program 104, and the electronic pet program 105 in response to a request from the UI processing unit 123, or executes the still image photographing program 103, the still image browsing program 104 Or electronic pet program 105
A predetermined command.

【００８３】このとき、音声ランチャ制御部１２４は、
ＵＩ処理部１２３からの要求に対応して、電子メールプ
ログラム５４Ａ、ワードプロセッサプログラム１０６、
若しくは表計算プログラム１０７のいずれかを起動さ
せ、または電子メールプログラム５４Ａに所定のデータ
（例えば、メールアドレスなど）を供給する。At this time, the voice launcher control unit 124
In response to the request from the UI processing unit 123, the e-mail program 54A, the word processor program 106,
Alternatively, one of the spreadsheet programs 107 is activated, or predetermined data (for example, a mail address) is supplied to the e-mail program 54A.

【００８４】アプリケーション通信部１２１または音声
ランチャ制御部１２４が所定のプログラムに対して、所
定の動作を完了させたとき、アプリケーション通信部１
２１または音声ランチャ制御部１２４はＵＩ処理部１２
３にその旨を通知し、ＵＩ処理部１２３は、動作の対象
となる所定のプログラムに応じて、動作の対象となる所
定のプログラムを使用者に認識させる画像をLCD２１に
表示させる。When the application communication unit 121 or the sound launcher control unit 124 completes a predetermined operation for a predetermined program, the application communication unit 1
21 or the voice launcher control unit 124
3 is notified to that effect, and the UI processing unit 123 causes the LCD 21 to display an image that allows the user to recognize the predetermined program to be operated according to the predetermined program to be operated.

【００８５】LCD２１に動作の対象となる所定のプログ
ラム認識させる画像が表示されるので、使用者は、音声
の認識の結果、および音声コマンダ１０２の動作を知る
ことができる。An image for recognizing a predetermined program to be operated is displayed on the LCD 21, so that the user can know the result of voice recognition and the operation of the voice commander 102.

【００８６】音声入力モード切り換えボタン１７０がク
リックされたとき、ＵＩ処理部１２３は、常時認識モー
ドである状態に遷移する。常時認識モードに遷移すると
き、ＵＩ処理部１２３は、アプリケーション通信部１２
１から静止画撮影プログラム１０３、静止画閲覧プログ
ラム１０４、および電子ペットプログラム１０５の内、
アクティブであるプログラムを示すデータを受信し、ア
クティブであるプログラムの名称を認識状態表示部１６
９に表示させる。静止画撮影プログラム１０３、静止画
閲覧プログラム１０４、または電子ペットプログラム１
０５のいずれもアクティブでないとき、ＵＩ処理部１２
３は、音声コマンダウィンドウ１６１の認識状態表示部
１６９にその旨（例えば、”Global Command”など）を
表示させる。When the voice input mode switching button 170 is clicked, the UI processing section 123 transits to a state in which it is always in the recognition mode. When transitioning to the constant recognition mode, the UI processing unit 123
1 to a still image photographing program 103, a still image browsing program 104, and an electronic pet program 105,
Receiving data indicating the active program and recognizing the name of the active program
9 is displayed. Still image shooting program 103, still image viewing program 104, or electronic pet program 1
05 is not active, the UI processing unit 12
No. 3 causes the recognition status display section 169 of the voice commander window 161 to display the fact (for example, “Global Command”).

【００８７】常時認識モードにおいては、音声コマンダ
２は、認識キーに対する操作に係わらず、音声認識エン
ジンが所定の音声を認識したとき、静止画撮影プログラ
ム１０３、静止画閲覧プログラム１０４、若しくは電子
ペットプログラム１０５のいずれかを起動させ、若しく
は静止画撮影プログラム１０３、静止画閲覧プログラム
１０４、若しくは電子ペットプログラム１０５のいずれ
かに所定のコマンドを送信し、または電子メールプログ
ラム５４Ａ、ワードプロセッサプログラム１０６、若し
くは表計算プログラム１０７のいずれかを起動させ、若
しくは電子メールプログラム５４Ａに所定のデータを供
給する。In the continuous recognition mode, when the voice recognition engine recognizes a predetermined voice irrespective of the operation of the recognition key, the voice commander 2 executes the still image photographing program 103, the still image browsing program 104, or the electronic pet program. 105, or sends a predetermined command to any one of the still image photographing program 103, the still image browsing program 104, and the electronic pet program 105, or the e-mail program 54A, the word processor program 106, or the spreadsheet. Activate any of the programs 107 or supply predetermined data to the e-mail program 54A.

【００８８】常時認識モードにおいて、音声入力モード
切り換えボタン１７０がクリックされたとき、ＵＩ処理
部１２３は、通常の認識モードに遷移する。When the voice input mode switching button 170 is clicked in the continuous recognition mode, the UI processing unit 123 shifts to the normal recognition mode.

【００８９】通常認識モードにおいて、音声コマンダウ
ィンドウ１６１の辞書管理ボタン１６５がクリックされ
ると、ＵＩ処理部１２３は、辞書を設定する状態に遷移
し、ユーザ辞書制御部１２５に辞書の設定の処理を要求
する。辞書を設定する状態において、ユーザ辞書制御部
１２５は、辞書設定用のダイアログをLCD２１に表示さ
せ、辞書設定用のダイアログへの操作に基づき、辞書設
定データベース１１４および認識単語・文法データベー
ス１１５に記憶されている設定を更新する。When the dictionary management button 165 of the voice commander window 161 is clicked in the normal recognition mode, the UI processing unit 123 makes a transition to a state in which a dictionary is set, and the user dictionary control unit 125 sends a dictionary setting process. Request. In the state of setting a dictionary, the user dictionary control unit 125 causes the LCD 21 to display a dialog for setting a dictionary, and stores the dialog in the dictionary setting database 114 and the recognized word / grammar database 115 based on an operation on the dialog for setting a dictionary. Update the settings you have.

【００９０】辞書を設定する状態において、辞書設定用
のダイアログに配置されているテストボタンがクリック
されると、ＵＩ処理部１２３は、音声認識テストを実行
する状態に遷移し、認識テスト処理部１２６に音声認識
テストの処理を要求する。認識テスト処理部１２６は、
音声認識テストのダイアログをLCD２１に表示させ、エ
ンジン通信部１２２を介して、音声認識エンジン１０１
から供給された、音声を認識した単語が、辞書設定デー
タベース１１４に登録されている単語と一致するか否か
を判定する音声認識のテストを実行し、その結果を表示
する。When a test button arranged in the dictionary setting dialog is clicked in a state where a dictionary is set, the UI processing unit 123 makes a transition to a state in which a speech recognition test is executed, and the recognition test processing unit 126 Request processing of the voice recognition test. The recognition test processing unit 126
A dialog for the voice recognition test is displayed on the LCD 21, and the voice recognition engine 101 is transmitted via the engine communication unit 122.
Performs a speech recognition test for determining whether or not the word whose speech has been recognized supplied from is matched with a word registered in the dictionary setting database 114, and displays the result.

【００９１】または、認識テスト処理部１２６は、音声
認識テストのダイアログをLCD２１に表示させ、エンジ
ン通信部１２２を介して、音声認識エンジン１０１から
供給された認識した単語が、辞書設定データベース１１
４に登録されている単語に含まれているか否かを判定す
る音声認識のテストを実行し、その結果を表示する。Alternatively, the recognition test processing unit 126 causes the LCD 21 to display a dialog of the speech recognition test, and the recognized word supplied from the speech recognition engine 101 via the engine communication unit 122 is used by the dictionary setting database 11
Then, a speech recognition test for determining whether the word is included in the words registered in No. 4 is executed, and the result is displayed.

【００９２】音声認識テストを実行する状態において、
音声認識テストのダイアログに配置されているテストボ
タンがクリックされると、ＵＩ処理部１２３は、辞書を
設定する状態に遷移する。辞書を設定する状態におい
て、辞書設定用のダイアログに配置されている閉じるボ
タンがクリックされると、ＵＩ処理部１２３は、通常認
識モードに遷移する。In the state where the voice recognition test is executed,
When a test button arranged in the dialog for the voice recognition test is clicked, the UI processing unit 123 transits to a state where a dictionary is set. When the close button arranged in the dictionary setting dialog is clicked in the state where the dictionary is set, the UI processing unit 123 transits to the normal recognition mode.

【００９３】通常認識モードにおいて、音声コマンダウ
ィンドウ１６１のランチャ設定ボタン１６４がクリック
されると、ＵＩ処理部１２３は、音声ランチャ制御部１
２４の電子メールプログラム５４Ａ、ワードプロセッサ
プログラム１０６、または表計算プログラム１０７を起
動する設定を行う状態に遷移し、音声ランチャ制御部１
２４にプログラムの起動の設定の処理を要求する。When the launcher setting button 164 of the voice commander window 161 is clicked in the normal recognition mode, the UI processing unit 123 causes the voice launcher control unit 1
24 e-mail program 54A, word processor program 106, or spreadsheet program 107.
24 is requested to perform the setting process of the program activation.

【００９４】起動の設定を行う状態において、音声ラン
チャ制御部１２４は、ランチャ設定用のダイアログをLC
D２１に表示させ、ランチャ設定用のダイアログへの操
作に基づき、ランチャ設定データベース１１３に記憶さ
れている設定を更新する。In a state in which the setting of starting is performed, the voice launcher control unit 124 displays a dialog for launcher setting in the LC.
D21 is displayed, and the setting stored in the launcher setting database 113 is updated based on the operation on the launcher setting dialog.

【００９５】次に、パーソナルコンピュータ１のLCD２
１に表示する画面について説明する。図１２は、音声コ
マンダ１０２、音声認識エンジン１０１、および電子ペ
ットプログラム１０５が起動しているとき、LCD２１に
表示される画面を示す図である。Next, the LCD 2 of the personal computer 1
The screen displayed in No. 1 will be described. FIG. 12 is a diagram illustrating a screen displayed on the LCD 21 when the voice commander 102, the voice recognition engine 101, and the electronic pet program 105 are running.

【００９６】LCD２１の画面の所定の位置に、電子メー
ルプログラム５４Ａに対応するアイコン１８１、ワード
プロセッサプログラム１０６に対応するアイコン１８
２、表計算プログラム１０７に対応するアイコン１８
３、音声コマンダウィンドウ１６１、および電子ペット
プログラム１０５が表示させる電子ペットウィンドウ１
９１が配置される。At a predetermined position on the screen of the LCD 21, an icon 181 corresponding to the electronic mail program 54A and an icon 18 corresponding to the word processor program 106 are displayed.
2. Icon 18 corresponding to spreadsheet program 107
3. Voice commander window 161 and electronic pet window 1 displayed by electronic pet program 105
91 are arranged.

【００９７】スティック式ポインティングデバイス５な
どを操作してアイコン１８１を選択して、起動コマンド
を実行する（図示せぬメニューなどから選択するなどの
操作をする）と、電子メールプログラム５４Ａが起動さ
れる。アイコン１８２を選択して、起動コマンドを実行
すると、ワードプロセッサプログラム１０６が起動され
る。アイコン１８３を選択して、起動コマンドを実行す
ると、表計算プログラム１０７が起動される。When the stick type pointing device 5 or the like is operated to select the icon 181 and execute a start command (an operation such as selecting from a menu or the like not shown), the e-mail program 54A is started. . When the icon 182 is selected and the start command is executed, the word processor program 106 is started. When the icon 183 is selected and a start command is executed, the spreadsheet program 107 is started.

【００９８】電子ペットウィンドウ１９１は、仮想空間
内で生息している電子ペットが表示される表示部２００
と、複数のボタン２０１乃至２０６から構成されてい
る。閉じるボタン２０１は、電子ペットプログラム１０
５を終了させたいときに操作され、拡大ボタン２０２
は、電子ペットウィンドウ１９１の表示を拡大させたい
ときに操作され、最小化ボタン２０３は、電子ペットウ
ィンドウ１９１をLCD２１上から消去し、タスクバーに
収納させたい時に操作される。The electronic pet window 191 displays the electronic pet inhabiting the virtual space.
And a plurality of buttons 201 to 206. The close button 201 is used for the electronic pet program 10
5 is operated when the user wants to end
Is operated when the display of the electronic pet window 191 is to be enlarged, and the minimize button 203 is operated when the electronic pet window 191 is to be deleted from the LCD 21 and stored in the task bar.

【００９９】また、音階指示ボタン２０４は、図示しな
いロボットに対して指示を出す場合に操作され、辞書管
理ボタン２０５は、辞書に仮想ペットの新たな名前など
を登録させたい時などに操作され、ヘルプボタン２０６
は、わからないことを調べたい時に操作される。The scale instruction button 204 is operated when giving an instruction to a robot (not shown), and the dictionary management button 205 is operated when it is desired to register a new name of the virtual pet in the dictionary. Help button 206
Is operated when you want to check what you do not know.

【０１００】図１２に示したような状態で、使用者が、
音声コマンダウィンドウ１６１の最小化ボタン１６７を
操作すると、その操作結果として、タスクトレイ２１１
上に、音声コマンダ１０２に対応するアイコンが、図１
３に示しように表示される。図１３において、タスクト
レイ２１１に表示されている複数のアイコンの内、アイ
コン２２１が、音声コマンダ１０２に対応するアイコン
である。In the state shown in FIG. 12, the user
When the minimize button 167 of the voice commander window 161 is operated, as a result of the operation, the task tray 211
An icon corresponding to the voice commander 102 is shown in FIG.
It is displayed as shown in FIG. In FIG. 13, among a plurality of icons displayed on the task tray 211, an icon 221 is an icon corresponding to the voice commander 102.

【０１０１】このアイコン２１１上に、使用者がカーソ
ル２３１を移動させ、クリックなどの所定の操作を行う
と、図１４に示すように、メニューが表示される。メニ
ューには、音声コマンダウィンドウ１６１を通常表示さ
せたいとき（図１２に示したような表示に戻したいと
き）に操作される”通常表示”、通常の認識モードと常
時認識モードとを切り換える時に操作される”入力モー
ド切換”、および音声コマンダ１０２を終了させる時に
操作される”終了”が表示されている。When the user moves the cursor 231 on this icon 211 and performs a predetermined operation such as clicking, a menu is displayed as shown in FIG. The menu includes a “normal display” which is operated when the voice commander window 161 is to be normally displayed (when it is desired to return to the display as shown in FIG. 12), and which is operated when switching between the normal recognition mode and the normal recognition mode. "Input mode switching" and "end" operated when terminating the voice commander 102 are displayed.

【０１０２】図１３に示したような表示状態のとき、す
なわち、音声コマンダウィンドウ１６１が、最小化され
ており、タスクトレイ２１１にアイコン２２１が表示さ
れている状態のとき、ユーザが音声認識を行わせるため
に、認識キーを操作すると、図１５に示したように、小
型表示ウィンドウ２４１が表示される。この小型表示ウ
ィンドウ２４１は、認識キーが操作された時点で、起動
されている（アクティブにされている）音声認識対応の
アプリケーションの近傍に表示される。、例えば、図１
５の場合、音声認識対応のアプリケーションとして電子
ペットプログラム１０５が起動されており、その電子ペ
ットプログラム１０５の電子ペットウィンドウ１９１の
左上に重ならないように表示されている。In the display state as shown in FIG. 13, that is, when the voice commander window 161 is minimized and the icon 221 is displayed on the task tray 211, the user performs voice recognition. Therefore, when the recognition key is operated, a small display window 241 is displayed as shown in FIG. The small display window 241 is displayed near the activated (activated) voice recognition compatible application when the recognition key is operated. For example, FIG.
In the case of 5, the electronic pet program 105 is activated as a voice recognition compatible application, and is displayed so as not to overlap the upper left of the electronic pet window 191 of the electronic pet program 105.

【０１０３】電子ペットウィンドウ１９１自体が、LCD
２１の端にあるために見切れてしまっている場合があ
る。そのような場合は、図１６に示すように、小型表示
ウィンドウ２４１は、電子ペットウィンドウ１９１に重
ねられて表示される。なお、アプリケーションに対する
小型表示ウィンドウ２４１の位置は、右上、右下、左下
など、どこでもよく、左上に限定されるものではない。The electronic pet window 191 itself has an LCD
There is a case where it is cut off because it is at the end of 21. In such a case, as shown in FIG. 16, the small display window 241 is displayed so as to overlap the electronic pet window 191. The position of the small display window 241 with respect to the application may be anywhere, such as the upper right, lower right, or lower left, and is not limited to the upper left.

【０１０４】音声認識に対応しているアプリケーション
が起動され、かつ、そのアプリケーションがアクティブ
な状態になっている場合は、上述したように、認識キー
が操作されると、そのアプリケーションの近傍に、小型
表示ウィンドウ２４１が表示されるが、音声認識に対応
しているアプリケーションが起動されていないとき、ま
たは、起動されているが、アクティブな状態になってい
ないときは、例えば、図１７に示すように、LCD２１の
画面の中央下側に、小型表示ウィンドウ２４１は表示さ
れる。勿論、その表示位置は、中央下側に限られるもの
ではない。When an application corresponding to voice recognition is activated and the application is in an active state, when the recognition key is operated as described above, a small size is placed near the application. Although the display window 241 is displayed, when an application corresponding to voice recognition is not activated, or when it is activated but is not in an active state, for example, as shown in FIG. , A small display window 241 is displayed at the lower center of the screen of the LCD 21. Of course, the display position is not limited to the lower side of the center.

【０１０５】小型表示ウィンドウ２４１の左側の丸い部
分は、レベルゲージ１６２（図１１）と同様の表示を行
う部分であり、右側の細長い部分は、認識状態表示部１
６９と同様の表示（音声認識された結果を表示）する部
分である。The round part on the left side of the small display window 241 is a part for performing the same display as the level gauge 162 (FIG. 11), and the elongated part on the right side is the recognition state display unit 1.
This is a portion for displaying the same as 69 (displaying the result of voice recognition).

【０１０６】図１８は、音声認識に対応している２つの
アプリケーションが起動されている状態を示している。
図１８には、音声認識に対応しているアプリケーション
として電子ペットプログラム１０５と静止画閲覧プログ
ラム１０４が起動されている状態を示している。このよ
うな状態において、静止画閲覧プログラム１０４がアク
ティブな状態であるときに、ユーザが認識キーを操作す
ると、そのアクティブな状態となっている静止画閲覧ウ
ィンドウ２５１の左上に、小型表示ウィンドウ２４１が
表示される。FIG. 18 shows a state in which two applications corresponding to voice recognition are activated.
FIG. 18 illustrates a state in which the electronic pet program 105 and the still image browsing program 104 have been activated as applications supporting voice recognition. In such a state, when the user operates the recognition key while the still image browsing program 104 is in an active state, a small display window 241 is displayed on the upper left of the active still image browsing window 251. Is displayed.

【０１０７】このように、音声認識された結果を、どの
アプリケーションに送るか、換言すれば、どのアプリケ
ーションに対して音声によるコマンドを出すのかを、ア
プリケーションに対応するウィンドウの近傍に、小型表
示ウィンドウ２４１を表示させることにより、使用者に
認識させることが可能となる。As described above, to which application the result of voice recognition is to be sent, in other words, to which application a voice command is to be issued, is displayed in the small display window 241 near the window corresponding to the application. Is displayed, the user can be recognized.

【０１０８】また、例えば、使用者が電子ペットプログ
ラム１０５に対して音声によるコマンドを出そうと思
い、認識キーを操作したにも関わらず、小型表示ウィン
ドウ２４１が、静止画閲覧ウィンドウ２５１の近傍に表
示されたり、画面中央下側に表示されたときには、使用
者は、電子ペットプログラム１０５がアクティブな状態
になっていないと判断することができ、その判断に基づ
いて、電子ペットプログラム１０５をアクティブな状態
にし、再度、認識キーを操作して電子ペットプログラム
１０５に対してコマンドを出すことができる。すなわ
ち、誤ったアプリケーションに対してコマンドを出して
しまうようなことを防ぐことが可能となる。Also, for example, the small display window 241 is positioned near the still image browsing window 251 even though the user operates the recognition key to give a command to the electronic pet program 105 by voice. When the electronic pet program 105 is displayed or displayed at the lower center of the screen, the user can determine that the electronic pet program 105 is not in an active state, and based on the determination, activates the electronic pet program 105. Then, the user can operate the recognition key again to issue a command to the electronic pet program 105. That is, it is possible to prevent a command from being issued to an incorrect application.

【０１０９】さらに、このような小型表示ウィンドウ２
４１にすることにより、音声コマンダウィンドウ１６１
を通常表示させる場合に比べ、その描画に係る時間（処
理能力）を軽減させることが可能となる。Further, such a small display window 2
41, the voice commander window 161
Can be reduced in the time required for the drawing (processing capability) as compared with the case where is normally displayed.

【０１１０】ところで、音声認識させるためには、予め
設定されているコマンドの形式に従って発話する必要が
ある。使用者は、音声認識により処理を実行させている
とき、所望な処理を実行させるためには、どのようなコ
マンド形式に従って発話すれば良いかわからない時があ
る。そのような場合、使用者は、”コマンド一覧”と発
話することにより、コマンドに関するヘルプ画面をLCD
２１上に表示させることができる。By the way, in order to perform voice recognition, it is necessary to speak in accordance with a preset command format. When executing a process by voice recognition, the user sometimes does not know what command format should be used in order to execute a desired process. In such a case, the user speaks “command list” to display a help screen for the command on the LCD.
21 can be displayed.

【０１１１】図１９は、使用者が”コマンド一覧”と発
話した結果。”コマンド一覧”と認識され、音声コマン
ダウィンドウ１６１の認識結果表示部１６３に表示され
た状態を示している。もちろん、音声コマンドウィンド
ウ１６１が、最小化表示が指示されていることにより、
小型表示ウィンドウ２４１が表示される場合、その小型
表示ウィンドウ２４１に”コマンド一覧”と表示され
る。このように、使用者が発話した”コマンド一覧”と
いう言葉が認識されることにより、図２０に示したよう
に、音声コマンドに関するヘルプ画面がLCD２１上に表
示される。FIG. 19 shows the result of the user saying “command list”. This shows a state where the command is recognized as “command list” and displayed on the recognition result display section 163 of the voice commander window 161. Of course, since the voice command window 161 is instructed to minimize the display,
When the small display window 241 is displayed, "command list" is displayed in the small display window 241. In this way, by recognizing the word “command list” spoken by the user, a help screen relating to voice commands is displayed on the LCD 21 as shown in FIG.

【０１１２】このように、音声認識に関するコマンドの
操作に困ったときには、音声により、ヘルプ画面を表示
させることができるようにすることにより、使用者は、
マウスなどを操作してヘルプ画面を表示させる操作を行
わなくて良く、一貫して音声にて操作を行うことがで
き、使い勝手が向上する。As described above, when it is difficult to operate a command related to voice recognition, the help screen can be displayed by voice so that the user can
It is not necessary to perform an operation of displaying a help screen by operating a mouse or the like, and the operation can be consistently performed by voice, thereby improving usability.

【０１１３】音声認識は、使用回数が多いほど、学習し
ていき、認識率が高くなる。そこで、使用者は、音声コ
マンダ１０２を何回使用したかという情報を知りたいと
きがある。そのような場合、例えば、コマンドとして”
何回使ったかな”という言葉を発話することにより、音
声コマンダ１０２を使用した回数を知ることができる。
具体的には、使用者が”何回使ったかな”と発話する
と、その発話が認識された結果が、図２１（Ａ）に示す
ように、音声コマンダウィンドウ１６１の認識結果表示
部１６３に表示される。そして、図２１（Ｂ）に示すよ
うに、認識状態表示部１６９に、使用回数が表示される
（図２１（Ｂ）の表示例の場合、８回である）。この表
示は所定時間表示された後に、消される。In speech recognition, as the number of uses increases, learning is performed, and the recognition rate increases. Therefore, the user sometimes wants to know information on how many times the voice commander 102 has been used. In such a case, for example,
By saying the word "how many times have you used?", The number of times the voice commander 102 has been used can be known.
Specifically, when the user utters “How many times have you used?”, The recognition result is displayed on the recognition result display section 163 of the voice commander window 161 as shown in FIG. Is done. Then, as shown in FIG. 21 (B), the number of times of use is displayed on the recognition state display unit 169 (in the case of the display example of FIG. 21 (B), the number is eight). This display is turned off after being displayed for a predetermined time.

【０１１４】次に、音声認識エンジン１０１および音声
コマンダ１０２を実行するCPU５２の音声によるコマン
ドの送信の処理を図２２のフローチャートを参照して説
明する。ここでは、音声コマンダウィンドウ１６１が最
小化表示が指示されており、音声認識に対応しているア
プリケーションが１以上起動されていることを前提とし
て説明する。Next, the processing of command transmission by voice of the CPU 52 executing the voice recognition engine 101 and the voice commander 102 will be described with reference to the flowchart of FIG. Here, the description will be given on the assumption that the voice commander window 161 has been instructed to minimize the display and that one or more applications corresponding to voice recognition have been activated.

【０１１５】ステップＳ１において、音声コマンダ１０
２は、常時入力モードが選択されているか否かを判定
し、常時入力モードが選択されていないと判定された場
合、ステップＳ２に進み、認識キーが押圧されているか
否かを判定する。ステップＳ２において、認識キーが押
圧されていないと判定された場合、ステップＳ２に戻
り、音声コマンダ１０２は、認識キーが押圧されるま
で、認識キーの押圧の判定の処理を繰り返す。In step S1, the voice commander 10
Step 2 determines whether or not the continuous input mode is selected. If it is determined that the continuous input mode is not selected, the process proceeds to step S2 to determine whether or not the recognition key is pressed. If it is determined in step S2 that the recognition key has not been pressed, the process returns to step S2, and the voice commander 102 repeats the process of determining whether the recognition key has been pressed until the recognition key is pressed.

【０１１６】ステップＳ２において、認識キーが押圧さ
れたと判定された場合、ステップＳ３に進み、小型表示
ウィンドウ２４１を、アクティブな状態になっている音
声認識に対応しているアプリケーションの近傍、ここで
は、左上に表示する（図１５に示したような状態）。そ
して、ステップＳ４において、音声コマンダ１０２は、
音声認識エンジン１０１に音声認識の処理を実行させ
る。ステップＳ５において、音声コマンダ１０２は、音
声認識エンジン１０１から音声認識の処理の結果を受信
する。If it is determined in step S2 that the recognition key has been pressed, the process proceeds to step S3, in which the small display window 241 is placed in the vicinity of an active application corresponding to voice recognition, here, It is displayed at the upper left (as shown in FIG. 15). Then, in step S4, the voice commander 102
It causes the speech recognition engine 101 to execute speech recognition processing. In step S5, the voice commander 102 receives the result of the voice recognition processing from the voice recognition engine 101.

【０１１７】ステップＳ６において、音声コマンダ１０
２は、音声認識エンジン１０１から受信した音声認識の
処理の結果を、小型表示ウィンドウ２４１に表示させ
る。ステップＳ７において、音声コマンダ１０２は、認
識キーが離されたか否かを判定し、認識キーが離された
と判定された場合、ステップＳ８に進み、表示されてい
る小型表示ウィンドウ２４１をLCD２１上から消し、ス
テップＳ９において、コマンドの送信を実行して、処理
は終了する。At step S6, the voice commander 10
2 displays the result of the speech recognition processing received from the speech recognition engine 101 on the small display window 241. In step S7, the voice commander 102 determines whether or not the recognition key has been released. If it is determined that the recognition key has been released, the process proceeds to step S8, where the displayed small display window 241 is erased from the LCD 21. In step S9, a command is transmitted, and the process ends.

【０１１８】ステップＳ７において、認識キーが離され
ていないと判定された場合、ステップＳ１０に進み、音
声コマンダ１０２は、解除キー（例えば、シフトキー）
が押圧されたか否かを判定する。ステップＳ１０におい
て、解除キーが押圧されたと判定された場合、ステップ
Ｓ１１に進み、音声コマンダ１０２は、音声認識の結果
をクリアして、ステップＳ４に戻り、音声認識の処理を
繰り返す。ステップＳ１０において、解除キーが押圧さ
れていないと判定された場合、ステップＳ１１はスキッ
プされ、ステップＳ４に戻り、音声認識の処理を繰り返
す。If it is determined in step S7 that the recognition key has not been released, the flow advances to step S10, where the voice commander 102 sets the release key (for example, the shift key).
It is determined whether or not is pressed. If it is determined in step S10 that the release key has been pressed, the process proceeds to step S11, in which the voice commander 102 clears the result of voice recognition, returns to step S4, and repeats the voice recognition process. If it is determined in step S10 that the release key has not been pressed, step S11 is skipped, the process returns to step S4, and the voice recognition process is repeated.

【０１１９】一方、ステップＳ１において、常時入力モ
ードが選択されていると判定された場合、ステップＳ１
２に進み、小型表示ウィンドウ２４１を、アクティブな
状態になっている音声認識に対応しているアプリケーシ
ョンの左上に表示される。そして、ステップＳ１３にお
いて、音声コマンダ１０２は、音声認識エンジン１０１
に音声認識の処理を実行させる。ステップＳ１４におい
て、音声コマンダ１０２は、音声認識エンジン１０１か
ら音声認識の処理の結果を受信する。On the other hand, if it is determined in step S1 that the constant input mode has been selected, the process proceeds to step S1.
Proceeding to 2, the small display window 241 is displayed at the upper left of the application corresponding to the active voice recognition. Then, in step S13, the voice commander 102 makes the voice recognition engine 101
To execute voice recognition processing. In step S14, the voice commander 102 receives the result of the voice recognition process from the voice recognition engine 101.

【０１２０】ステップＳ１５において、音声コマンダ１
０２は、音声認識エンジン１０１から受信した音声認識
の処理の結果を、小型表示ウィンドウ２４１に表示させ
る。ステップＳ１６において、音声コマンダ１０２は、
コマンドの送信を実行して、ステップＳ１に戻り、コマ
ンドの送信または起動の処理を繰り返す。In step S15, the voice commander 1
02 displays the result of the speech recognition process received from the speech recognition engine 101 on the small display window 241. In step S16, the voice commander 102
After transmitting the command, the process returns to step S1 to repeat the process of transmitting or starting the command.

【０１２１】このように、認識キーの操作により、使用
者は、音声コマンダウィンドウ１６１が最小化表示され
ている状態においても、音声が認識された結果を小型表
示ウィンドウ２４１を確認して、コマンドの送信をさせ
ることができ、誤った認識による処理の実行を防止する
ことができる。また、以上のように、使用者は、キーボ
ード４などを操作することなく、簡単に、電子ペットと
遊んだり、静止画像を閲覧したりすることができる。As described above, by operating the recognition key, even when the voice commander window 161 is displayed in a minimized state, the user confirms the result of voice recognition in the small display window 241 and checks the command. Transmission can be performed, and execution of processing due to incorrect recognition can be prevented. Further, as described above, the user can easily play with the electronic pet or browse the still image without operating the keyboard 4 or the like.

【０１２２】上述した一連の処理は、ハードウェアによ
り実行させることもできるが、ソフトウェアにより実行
させることもできる。一連の処理をソフトウェアにより
実行させる場合には、そのソフトウェアを構成するプロ
グラムが、専用のハードウェアに組み込まれているコン
ピュータ、または、各種のプログラムをインストールす
ることで、各種の機能を実行することが可能な、例えば
汎用のパーソナルコンピュータなどに、プログラム格納
媒体からインストールされる。The above-described series of processing can be executed by hardware, but can also be executed by software. When a series of processing is executed by software, a program constituting the software can execute various functions by installing a computer built into dedicated hardware or installing various programs. It is installed from a program storage medium to a possible general-purpose personal computer or the like.

【０１２３】コンピュータにインストールされ、コンピ
ュータによって実行可能な状態とされるプログラムを格
納するプログラム格納媒体は、図７に示すように、磁気
ディスク９５（フロッピディスクを含む）、光ディスク
９６（CD-ROM(Compact Disc-Read Only Memory)、DVD(D
igital Versatile Disc)を含む）、光磁気ディスク９７
（ＭＤ(Mini-Disc)を含む）、若しくは半導体メモリ９
８などよりなるパッケージメディア、または、プログラ
ムが一時的若しくは永続的に格納されるROM６９や、ハ
ードディスク５６などにより構成される。プログラム格
納媒体へのプログラムの格納は、必要に応じてルータ、
モデム５０などのインタフェースを介して、ローカルエ
リアネットワーク、インターネット９２、図示せぬデジ
タル衛星放送といった、有線または無線の通信媒体を利
用して行われる。As shown in FIG. 7, a program storage medium for storing a program installed in a computer and made executable by the computer includes a magnetic disk 95 (including a floppy disk), an optical disk 96 (CD-ROM ( Compact Disc-Read Only Memory), DVD (D
digital Versatile Disc), magneto-optical disc 97
(Including MD (Mini-Disc)) or semiconductor memory 9
8 or a ROM 69 for temporarily or permanently storing a program, a hard disk 56, or the like. The storage of the program in the program storage medium can be performed by a router,
This is performed using a wired or wireless communication medium such as a local area network, the Internet 92, or a digital satellite broadcast (not shown) via an interface such as the modem 50.

【０１２４】なお、本明細書において、プログラム格納
媒体に格納されるプログラムを記述するステップは、記
載された順序に沿って時系列的に行われる処理はもちろ
ん、必ずしも時系列的に処理されなくとも、並列的ある
いは個別に実行される処理をも含むものである。In this specification, the step of describing a program stored in a program storage medium is not limited to processing performed in chronological order according to the described order, but is not necessarily performed in chronological order. , And also includes processes executed in parallel or individually.

【０１２５】また、本明細書において、システムとは、
複数の装置により構成される装置全体を表すものであ
る。In this specification, the system is
It represents the entire device composed of a plurality of devices.

【０１２６】[0126]

【発明の効果】以上の如く、請求項１に記載の情報処理
装置、請求項３に記載の情報処理方法、および請求項４
に記載のプログラム格納媒体によれば、音声認識の結果
に対応して所定の処理を実行するプログラムが起動さ
れ、かつ、アクティブな状態になっていると判断された
場合、音声認識が指示されている状態を示す第１のウィ
ンドウを、プログラムに対応する第２のウィンドウの近
傍、または、重なる位置に表示されるように表示を制御
し、音声認識の結果に対応して所定の処理を実行するプ
ログラムは起動されていない、または、起動されてはい
るがアクティブな状態ではないと判断された場合、第１
のウィンドウが予め定められた所定の位置に表示される
ように表示を制御するようにしたので、使用者は、音声
認識の状態を確認することができ、もって、誤った処理
を指示することを防ぐことができる。As described above, the information processing apparatus according to the first aspect, the information processing method according to the third aspect, and the fourth aspect.
According to the program storage medium described in the above, a program for executing a predetermined process corresponding to the result of voice recognition is started, and when it is determined that it is in an active state, voice recognition is instructed The display is controlled so that the first window indicating the present state is displayed near or overlapping with the second window corresponding to the program, and a predetermined process is executed in response to the result of voice recognition. If it is determined that the program has not been started or has been started but is not active, the first
Since the display is controlled so that the window is displayed at a predetermined position, the user can confirm the state of the voice recognition, and thus can instruct an erroneous process. Can be prevented.

[Brief description of the drawings]

【図１】パーソナルコンピュータ１の外観斜視図であ
る。FIG. 1 is an external perspective view of a personal computer 1. FIG.

【図２】パーソナルコンピュータ１の平面図である。FIG. 2 is a plan view of the personal computer 1.

【図３】パーソナルコンピュータ１の左側側面図であ
る。FIG. 3 is a left side view of the personal computer 1.

【図４】パーソナルコンピュータ１の右側側面図であ
る。FIG. 4 is a right side view of the personal computer 1.

【図５】パーソナルコンピュータ１の正面図である。FIG. 5 is a front view of the personal computer 1.

【図６】パーソナルコンピュータ１の底面図である。FIG. 6 is a bottom view of the personal computer 1.

【図７】パーソナルコンピュータ１の構成を示すブロッ
ク図である。FIG. 7 is a block diagram showing a configuration of the personal computer 1.

【図８】パーソナルコンピュータ１の機能ブロックを示
す図である。FIG. 8 is a diagram showing functional blocks of the personal computer 1.

【図９】パーソナルコンピュータ１の機能ブロックを示
す図である。FIG. 9 is a diagram showing functional blocks of the personal computer 1.

【図１０】SAPIを説明する図である。FIG. 10 is a diagram illustrating SAPI.

【図１１】音声コマンダウィンドウ１６１を示す図であ
る。FIG. 11 is a diagram showing a voice commander window 161.

【図１２】LCD２１に表示される画面を説明する図であ
る。FIG. 12 is a diagram illustrating a screen displayed on the LCD 21.

【図１３】LCD２１に表示される画面を説明する図であ
る。FIG. 13 is a diagram illustrating a screen displayed on the LCD 21.

【図１４】LCD２１に表示される画面を説明する図であ
る。FIG. 14 is a diagram illustrating a screen displayed on the LCD 21.

【図１５】LCD２１に表示される画面を説明する図であ
る。FIG. 15 is a diagram illustrating a screen displayed on the LCD 21.

【図１６】LCD２１に表示される画面を説明する図であ
る。FIG. 16 is a diagram illustrating a screen displayed on the LCD 21.

【図１７】LCD２１に表示される画面を説明する図であ
る。FIG. 17 is a diagram illustrating a screen displayed on the LCD 21.

【図１８】LCD２１に表示される画面を説明する図であ
る。FIG. 18 is a diagram illustrating a screen displayed on the LCD 21.

【図１９】LCD２１に表示される画面を説明する図であ
る。FIG. 19 is a diagram illustrating a screen displayed on the LCD 21.

【図２０】LCD２１に表示される画面を説明する図であ
る。FIG. 20 is a diagram illustrating a screen displayed on the LCD 21.

【図２１】LCD２１に表示される画面を説明する図であ
る。FIG. 21 is a diagram illustrating a screen displayed on the LCD 21.

【図２２】音声認識に関する処理を説明するフローチャ
ートである。FIG. 22 is a flowchart illustrating processing related to speech recognition.

[Explanation of symbols]

１パーソナルコンピュータ，４キーボード，１
０シャッタボタン，２１ LCD，２３ CCDビデオカ
メラ，２４マイクロフォン，５２ CPU，５９
ROM，５４ RAM，８４Ａ電子メールプログラ
ム，８６前押しスイッチ，８８ドライブ，９
２インターネット，９５磁気ディスク，９６
光ディスク，９７光磁気ディスク，９８半導体
メモリ，１０１音声認識エンジン，１０２音声コ
マンダ，１０３静止画撮影プログラム，１０４
静止画閲覧プログラム，１１１読み仮名辞書データ
ベース，１１２エンジン用認識単語・文法データベ
ース，１１３ランチャ設定データベース，１１４
辞書設定データベース，１１５認識単語データベ
ース，１２１アプリケーション通信部，１２２
エンジン通信部，１２３ＵＩ処理部，１２４音声
ランチャ制御部，１２５ユーザ辞書制御部，１２
６認識テスト処理部，２４１小型表示ウィンドウ1 personal computer, 4 keyboard, 1
0 shutter button, 21 LCD, 23 CCD video camera, 24 microphone, 52 CPU, 59
ROM, 54 RAM, 84A E-mail program, 86 Front switch, 88 drive, 9
2 Internet, 95 Magnetic disk, 96
Optical disk, 97 magneto-optical disk, 98 semiconductor memory, 101 voice recognition engine, 102 voice commander, 103 still image photographing program, 104
Still image browsing program, 111 reading kana dictionary database, 112 recognition word / grammar database for engine, 113 launcher setting database, 114
Dictionary setting database, 115 recognized word database, 121 application communication unit, 122
Engine communication unit, 123 UI processing unit, 124 voice launcher control unit, 125 user dictionary control unit, 12
6 Recognition test processing unit, 241 Small display window

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/00 Ｇ１０Ｌ 3/00 ５５１Ｐ 15/28 ５６１Ｃ 15/22 ５７１Ｈ (72)発明者米倉修二東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者笹井崇司東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5B076 AB17 5D015 KK03 LL02 LL05 5E501 AA02 AC37 CA03 CB15 EA21 EB05 FA06 FA23 FA43 FB22 9A001 DD11 HH17 HH34 ──────────────────────────────────────────────────の Continued on the front page (51) Int.Cl. ⁷ Identification code FI Theme coat ゛ (Reference) G10L 15/00 G10L 3/00 551P 15/28 561C 15/22 571H (72) Inventor Shuji Yonekura Shinagawa, Tokyo 6-7-35 Kita-Shinagawa-ku, Sony Corporation (72) Takashi Sasai Inventor 6-35, Kita-Shinagawa, Shinagawa-ku, Tokyo Sony Corporation F-term (reference) 5B076 AB17 5D015 KK03 LL02 LL05 5E501 AA02 AC37 CA03 CB15 EA21 EB05 FA06 FA23 FA43 FB22 9A001 DD11 HH17 HH34

Claims

[Claims]

A first determining means for determining whether or not a state of recognizing a voice is instructed; and if the first determining means determines that the state of recognizing the voice is instructed, A second determining means for determining whether or not a program for executing a predetermined process according to a result of the voice recognition is activated and in an active state; and When it is determined that the program for executing the predetermined process corresponding to the result is activated and is in the active state, the first window indicating the state where the voice recognition is instructed is added to the program. A first controlling display so as to be displayed near or overlapping with the corresponding second window;
The display control unit and the second determination unit determine that the program for executing the predetermined process in response to the result of the voice recognition is not activated, or is activated but not active. And a second display control means for controlling the display so that the first window is displayed at a predetermined position.

2. A third display for controlling the display so that the list of commands is displayed, when the result of the speech recognition is an instruction to display a list of commands for executing a predetermined process. The information processing apparatus according to claim 1, further comprising a control unit.

3. A first judging step of judging whether or not a state of recognizing a voice has been instructed, and it has been determined in the processing of the first judging step that a state of recognizing a voice has been instructed. A second determining step of determining whether or not a program for executing a predetermined process according to a result of the voice recognition is activated and in an active state; and a process of the second determining step When a program that executes a predetermined process corresponding to the result of voice recognition is started, and it is determined that it is in an active state,
A first display control step of controlling display so that a first window indicating a state in which voice recognition is instructed is displayed near or overlapping with a second window corresponding to the program; If it is determined in the processing of the second determination step that the program for executing the predetermined processing corresponding to the result of the voice recognition has not been activated, or that it has been activated but is not in an active state, A second display control step of controlling the display so that the first window is displayed at a predetermined position.

4. A first judging step of judging whether or not a state of recognizing a voice has been instructed, and it has been determined in the processing of the first judging step that a state of recognizing a voice has been instructed. In the case, a second determination step of determining whether or not a program for voice recognition is activated and is in an active state; and a predetermined processing corresponding to a result of voice recognition in the processing of the second determination step. If it is determined that the program that executes the processing of the above is activated and is in the active state,
A first display control step of controlling display so that a first window indicating a state in which voice recognition is instructed is displayed near or overlapping with a second window corresponding to the program; If it is determined in the processing of the second determination step that the program for executing the predetermined processing corresponding to the result of the voice recognition has not been activated, or that it has been activated but is not in an active state, And a second display control step of controlling display so that the first window is displayed at a predetermined position. A program storage medium storing a computer-executable program. .