JPH04163618A

JPH04163618A - Sound operation computer

Info

Publication number: JPH04163618A
Application number: JP2290177A
Authority: JP
Inventors: Hiroyuki Noto; 広之野戸
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1990-10-26
Filing date: 1990-10-26
Publication date: 1992-06-09

Abstract

PURPOSE:To easily input a command even if the performance of a sound recognition device is low by limiting sound being the object of recognition to sound related to an object being the object of a present operation at the time of executing a computer operation in the sound recognition device. CONSTITUTION:Control information 107 of the object and sound information 118 for improving the performance of sound recognition in a data structure itself are added to a secondary storage device 102 and it stores them. When plural objects being the object of the present operation are decided, sound information peculiar to the object is automatically transmitted to the sound recognition device 101 and it is operated. It restricts the reading, the feature amount and the speaker of the object of recognition based on sound information peculiar to the object and recognizes sound. Thus, the command can easily be inputted even if the sound recognition device has low performance.

Description

【発明の詳細な説明】（産業上の利用分野）この発明は、音声認識装置を付加したコンピュータに関
するものである。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a computer equipped with a speech recognition device.

（従来の技術）従来、コンピュータの操作方法は次のようなものであっ
た。(Prior Art) Conventionally, the method of operating a computer has been as follows.

コンピュータは、あおよそ第２図（Ａ）に示すように、
コンピュータ（こ対しでコマシトを入力するキーボード
２０５、プログラムを実行する制御装置２Ｏ６、実行中
のプログラムや処理中のデータを格納する主記憶装置２
０Ｂ、実行可能なプログラムや高速で処理する必要のな
い大量のデータを保存する２次記憶装［２０２を備えで
いてコマンド入力のための音声認識装置を備えていない
構成となっている。ここでは、その他の出力装置、通信
製Ｍなどについての図示と説明は省略する。The computer, approximately as shown in Figure 2 (A),
Computer (keyboard 205 for inputting commands, control device 2O6 for executing programs, main memory 2 for storing programs being executed and data being processed)
OB is equipped with a secondary storage device [202] for storing executable programs and a large amount of data that does not need to be processed at high speed, and is not equipped with a voice recognition device for inputting commands. Here, illustrations and explanations of other output devices, Tsushin M, etc. will be omitted.

主記憶装置２０８の内部にはオペレーティングシステム
と呼ばれるプログラムか存在し、制御装置２０６の動作
手順を定めている。A program called an operating system exists inside the main storage device 208, and defines the operating procedure of the control device 206.

２次記惚装Ｍ２Ｏ２にはティレフト１ノ領域２０３と呼
ばれる記憶領域かあり、多数のファイルの名称（以降フ
ァイル名と称する）・属性・記憶場所などの管理情報２
０７を保存している。ここではファイル名の例として実
行可能ファイルの属性を持つワードプロセッサのプログ
ラムファイルのファイル名「ワードプロセッサ」と、表
計算のプログラムファイルのファイル名「表計算」か示
されているとして、以下の説明を行う。The secondary memory M2O2 has a storage area called TiLeft 1 area 203, which contains management information 2 such as file names (hereinafter referred to as file names), attributes, and storage locations of a large number of files.
07 is saved. Here, the following explanation will be given assuming that the file names are ``Word Processor'' for a word processor program file that has an executable file attribute, and ``Spreadsheet'' for a spreadsheet program file. .

先す、コンピュータをキーボード２０５によって操作す
る手順を第２図（Ｂ）および（Ｃ）を参照して説明する
。キーボード２０５によって使用者か例えば「表計算」
とコマンドを入力すると、制御表Ｍ２Ｏ６はオペレーテ
ィングシステム２０９のプログラムに基づいて、このコ
マンド名に対応したファイル名を持つ管理情報をディレ
クトリ指定（２５０）およびバス指定（２５１）に基づ
いてディレクトリ領域２０３の中から模索し、そのファ
イル名に対応した属性を読む（笥２図（Ｂ）に点線で示
す）。このファイル名のファイルか実行可能ファイルで
あるという属性を持っていれば、管理情報２０７の中の
記憶場所（図示せず）に基づいてファイル２０４（この
場合には表計算プログラムファイル）を読み出し、主記
憶装Ｎ２Ｏ３上に転送して、このファイル２０４内のプ
ログラムの美行を開始する（第２図（Ｃ）に点線で示す
）。First, the procedure for operating the computer using the keyboard 205 will be explained with reference to FIGS. 2(B) and 2(C). The keyboard 205 allows the user to select, for example, "spreadsheet".
When a command is input, the control table M2O6, based on the program of the operating system 209, stores the management information with the file name corresponding to this command name in the directory area 203 based on the directory specification (250) and bus specification (251). Search inside the file and read the attribute corresponding to the file name (shown by the dotted line in Figure 2 (B)). If the file with this file name has the attribute of being an executable file, the file 204 (in this case, a spreadsheet program file) is read based on the storage location (not shown) in the management information 207, The data is transferred to the main memory N2O3 and the program in this file 204 starts running (as shown by the dotted line in FIG. 2(C)).

尚、通常、上記ティレフト「ノ領域は他のディレクトリ
領域を参照できるようになっており、使用するティレフ
トリの変更もキーボードからのコマンドによって指定で
きるようになっている（この指定をディレクトリ指定２
５０と称する）。また通常、上記ファイルの検索の検索
対象となるディレクトリ領域は任意に指定可能となって
いる（この指定をバス指定２５１と称する）。Normally, the TiLeft area mentioned above can refer to other directory areas, and the TiLeft area to be used can also be specified by a command from the keyboard (this specification can be done using Directory Specification 2).
50). Further, normally, the directory area to be searched for in the file search can be arbitrarily designated (this designation is referred to as bus designation 251).

一方、内部コマンドと呼ばれるコマンドに対してはオペ
レーティングシステム２０９か処理を行い、必すしも上
記のような２次記憶装置２０２からの読み出しを必要と
じない。この内部コマンドの場合には一般にその実行手
順が主記憶装置２０８或いは制御表［２０６の中に存在
し、入力されたコマンドか内部コマシトである場合には
制御表Ｍ２Ｏ６か即座に天性を行う。On the other hand, commands called internal commands are processed by the operating system 209 and do not necessarily require reading from the secondary storage device 202 as described above. In the case of this internal command, the execution procedure generally exists in the main memory 208 or the control table 206, and if the input command is an internal command, the control table M2O6 immediately executes its execution.

（発明か解決しようとする課題）しかしながら、上述した従来の操作方法では、コマンド
入力の手段としでキーボードしかないため、キーボード
入力に手間力＼かがり、操作までに時間かかかる。(Problems to be Solved by the Invention) However, in the conventional operating method described above, since the keyboard is the only means for inputting commands, keyboard input requires effort and time, and it takes time to complete the operation.

また、従来構成のコンピュータでは、キーボード入力の
代わりに音声認識による入力によってファイルを指定し
て操作しようとするためには、全てのコマンド名やファ
イルの数の語索に対応できる大語業の、従って高性能な
音声認識装置か必要であった。In addition, in computers with conventional configurations, in order to specify and operate files by voice recognition input instead of keyboard input, it is necessary to use a large language service that can handle all command names and word searches for the number of files. Therefore, a high-performance speech recognition device was needed.

この発明の目的は、以上述べたキーボード入力によって
操作するまでに手間と時間がかかるという問題点を除去
し、少数の語業に対応した音声認識装置によって音声の
司令により操作できるコンピュータを提供することにあ
る。An object of the present invention is to eliminate the above-mentioned problem that it takes time and effort to operate through keyboard input, and to provide a computer that can be operated by voice commands using a voice recognition device that is compatible with a small number of language languages. It is in.

（課題を解決するための手段）この目的の達成を図るため、この発明の音声操作コンピ
ュータによれば、コマンドを入力するキーボードと、このキーボードと接
続されプログラムを実行する制御装置と、この制御装置
と接続され実行中のプログラムや処理中のデータを格納
する主記憶装置と、制御装置と接続され美行可能なプロ
グラムや高速で処理する必要のないデータを保存する２
次記憶装置と、制御装置と接続され発声音声の認識処理
を行って認識結果を制御装置へ出力する音声認識装置を
備えており、２次記憶装苫は、オブジェクトの管理情報やそのデータ
構造自体（こ音声認識の性能の向上を図るための音声情
報を付加しで記憶してあり、制御装置は、キーボードか
ら入力したコマントに応じて現在の操作の対象となって
いるオブジェクト固有の音声情報を、認識のための基準
パターンとして、音声認識装置に伝達する機能を具えて
いることを特徴とする。(Means for Solving the Problem) In order to achieve this object, the voice-operated computer of the present invention includes a keyboard for inputting commands, a control device connected to the keyboard for executing a program, and a control device for executing a program. The main memory device is connected to the controller and stores the programs being executed and the data being processed, and the main memory device is connected to the control device to store programs that can be easily processed and data that does not need to be processed at high speed.
The secondary storage device is equipped with a voice recognition device that is connected to the control device and performs speech recognition processing and outputs the recognition result to the control device.The secondary storage device stores object management information and its data structure itself. (Speech information is added and stored in order to improve the performance of speech recognition, and the control device receives speech information specific to the object currently being operated in response to commands entered from the keyboard.) , is characterized in that it has a function of transmitting it to a speech recognition device as a reference pattern for recognition.

（作用）コンピュータは通常、実行可能な全てのコマンド、プロ
グラムを操作の対象とはせず、サーチバスと呼ばれるコ
マンド・プログラムの探索領域指定を行ったり、カレン
トディレクトリを定めで使用者に表示するなど、操作の
対象となるデータ、プログラムなどを限定する手段を持
つ。(Function) Computers usually do not operate on all executable commands and programs, but specify search areas for commands and programs called a search bus, or display the current directory to the user in a fixed manner. , has a means to limit the data, programs, etc. that are subject to operations.

この発明では使用者にとって現在の操作の対象となるデ
ータ、プログラム、絵文字（アイコン）、ボタンなどの
対象物（以降これをオブジェクトと称する）の管理情報
やそのデータ構造自体に、それぞれのオブジェクト固有
の、読み方・音声の特徴量・話者などの音声認識の性能
を向上させるための情報（以降これを音声情報と称する
）を付加してあき、現在の操作の対象となる複数のオブ
ジェクトが定まったときにオブジェクト固有の音声情報
を自動的に音声認識装置に伝達し、音声認識袋Ｎを動作
させる。音声認識装置はこのオブジェクト固有の音声情
報をもとに、認識対象の読み・特８！量・話者等を限定
し、音声認識を行う。音声が発声され、認識結果が得ら
れると、その認識結果は制御装置に入力され、コンピュ
ータはキーボードやマウスによるコマンド入力と同様の
処理によって操作される。In this invention, the management information and data structure of objects (hereinafter referred to as objects) such as data, programs, pictograms (icons), buttons, etc. that are the targets of current operations for the user, and the data structure itself, are unique to each object. , information to improve the performance of speech recognition, such as pronunciation, voice features, speaker, etc. (hereinafter referred to as speech information) was added, and multiple objects to be the target of the current operation were determined. Sometimes, object-specific voice information is automatically transmitted to the voice recognition device, and the voice recognition bag N is operated. The speech recognition device uses this object-specific speech information to determine the reading/toku8! of the object to be recognized. Speech recognition is performed by limiting the amount and speakers. When a voice is uttered and a recognition result is obtained, the recognition result is input to the control device, and the computer is operated by processing similar to command input using a keyboard or mouse.

（実施例）以下、図面を参照しで、この発明の実施例につき説明す
る。(Embodiments) Hereinafter, embodiments of the present invention will be described with reference to the drawings.

この発明の一実施例を第１図、第３図および第４図を用
いて説明する。An embodiment of the present invention will be described with reference to FIGS. 1, 3, and 4.

この発明の音声操作コンピュータの全体の構成はおおよ
そ第１図（Ａ）に示すような構成を持つ。このコンピュ
ータは、これに対してコマンドを入力するキーボード１
０５、プログラムを実行する制御装置１０６、実行中の
プログラムや処理中のデータを格納する主記憶装置１０
８、実行可能なプログラムや高速で処理する必要のない
大量のデータを保存する２次記憶装！１０２！備えでい
る。ここでは、その他の出力装置、通信装置は、この発
明とは直接間係ないのでこれらについての図示と説明は
省略する。２次記憶装Ｍ１０２には通常複数のディレク
トリ領域１０３と呼ばれる記憶領域かある。ここまでは
通常のコンピュータと同様の構成となっている。The overall configuration of the voice-operated computer of the present invention is approximately as shown in FIG. 1(A). This computer has a keyboard 1 for inputting commands.
05. A control device 106 that executes programs, and a main storage device 10 that stores programs that are being executed and data that is being processed.
8. Secondary storage for storing executable programs and large amounts of data that does not need to be processed at high speed! 102! Be prepared. Here, since other output devices and communication devices are not directly related to the present invention, illustrations and descriptions thereof will be omitted. The secondary storage device M102 usually has a plurality of storage areas called directory areas 103. Up to this point, the configuration is similar to that of a normal computer.

この発明では、入力装置としではキーホード］０５のほ
かに音声認識装置７０１を備えている。この実施例にお
いては、音声認識装置１１０１は一例として単語単位の
音声認識装置％用いるとする。また、この発明では、フ
ァイルの名称・属性・記憶場所などの管理情報１０７に
は特別に音声認識の性能を向上させるための情報１１８
を保存している。In this invention, a voice recognition device 701 is provided in addition to a keyboard] 05 as an input device. In this embodiment, the speech recognition device 1101 is assumed to be a word-by-word speech recognition device, for example. In addition, in the present invention, the management information 107 such as file name, attribute, storage location, etc. includes information 118 specifically for improving speech recognition performance.
is saved.

また、第３図には制御表Ｍ１０６の詳細を示す構成図、
第４図には制御装置７０８の動作を示すフローを示す。In addition, FIG. 3 is a configuration diagram showing details of the control table M106,
FIG. 4 shows a flowchart showing the operation of the control device 708.

第３図において、３０１は内部コマンド記憶手段、３０
２はコマンド模索手段、３０３はコマンド実行手段を示
す。内部コマンド記憶手段３０７は内部コマンドの名称
、美行手順などとともに、内部コマシト固有の音声特徴
パタンを記憶してあり、音声特徴バタンは音声認識装置
］０１に必要に応しで転送される。コマンド検索手段３
０２は入力されたコマンド８検索する。In FIG. 3, 301 is an internal command storage means;
Reference numeral 2 indicates a command searching means, and 303 indicates a command execution means. The internal command storage means 307 stores internal command names, beauty procedures, etc., as well as voice characteristic patterns unique to the internal command, and the voice characteristic patterns are transferred to the voice recognition device 01 as necessary. Command search means 3
02 searches for the input command 8.

コマンド検索手段３０２は、外部コマンドを検索するた
めに２次記憶装Ｍ］０２と情報のやりとりができる構成
となっている。コマンド実行手段３０３はコマンドを実
行する。そのため、このコマンド実行手段３０３は、キ
ーボード１０５および音声認識装置１０１と情報のヤり
とりができる構成となっていると共に、２次記憶装置１
０２および主記憶装置１０８と情報のやりとりができる
構成ともなっている。The command search means 302 is configured to be able to exchange information with the secondary storage device M]02 in order to search for external commands. Command execution means 303 executes commands. Therefore, this command execution means 303 is configured to be able to exchange information with the keyboard 105 and the voice recognition device 101, and the secondary storage device 1
02 and the main storage device 108.

第４図においで（Ｓ］）〜（Ｓｌ○）は制御装置１０６
の内部の処理ステップを示す。In FIG. 4, (S]) to (Sl○) are the control device 106.
shows the internal processing steps.

ここで説明する実施例（こおいで、ある一つのディレク
トリ領域１０３の名称が［業務Ｊであるとする。また、
ファイル名「ワードプロセッサ」を持つワードプロセッ
サプログラムファイルに対しては単語音声「ワープロＪ
の音声特徴バタンを管理情報１０７の中に保存し、ファ
イル名「表計算」を持つ表計算プログラムファイルに対
しては単語音声「表計算ｊの音声特徴バタンを管理情報
１０７の中に保存しておくとする。In the example described here, it is assumed that the name of one directory area 103 is [Business J].
For a word processor program file with the file name "Word Processor", the word audio "Word Processor J" is displayed.
The sound characteristic button of the word "spreadsheet j" is saved in the management information 107, and for the spreadsheet program file with the file name "spreadsheet", the sound characteristic button of the word "spreadsheet j" is saved in the management information 107. I'm going to leave it.

また、第３図の内部コマンド記憶手段３０１にはオペレ
ーティングシステムの内部コマンド固有の名称およびそ
の操作に対応する音声特徴バタンか予め格納されている
とする。Further, it is assumed that the internal command storage means 301 in FIG. 3 stores in advance the names specific to the internal commands of the operating system and the voice characteristic buttons corresponding to their operations.

次に、このような情況下にある実施例における音声操作
コンピュータの動作例を、第１図（Ｂ）、（Ｃ）、（Ｄ
）、菓３図および第４図により説明する。Next, an example of the operation of the voice-operated computer in an embodiment under such circumstances is shown in FIGS.
), and will be explained with reference to Figures 3 and 4.

〈音声特徴バタンの消去・書き換え〉先ず、コンピュータの起動時における動作ヤコマントの
入力によってバス指定、ディレクトリ指定の設定・変更
を行う。このバス指定、ディレクトリ指定の設定・変更
の際、制御装置１０６は先ず、直前まで用いられていた
バス或いはディレクトリに対応した音声特徴バタンを消
去し、指定・変更後のバス指定、ディレクトリ指定に対
応し、現在の操作の対象となる可能性かあるファイルの
音声特徴バタンを音声認識装置１０１に送る。<Deleting/Rewriting Audio Feature Buttons> First, the bus designation and directory designation are set and changed by inputting the action command when starting up the computer. When setting or changing this bus designation or directory designation, the control device 106 first erases the audio characteristic button corresponding to the bus or directory that was used immediately before responding to the bus designation or directory designation after the designation or change. Then, the audio feature button of the file that may be the target of the current operation is sent to the speech recognition device 101.

これにつき説明する、この実施例の場合、例えば、ＭＳ
−ＤＯ３と同様のコマンドを持つオペレーティングシス
テムとした場合には、オペレータは、通常、キーボード
操作によって”ｃｄ￥業務”かディレクトリ指定変更の
コマンドとして入力する。このとき、先ず制御装置１０
６の内部のコマンド検索手段３０２か第４図における処
理ステップＳ１、Ｓ２１経てこのコマンド行を入力する
。In this case, for example, MS
- In the case of an operating system having commands similar to DO3, the operator normally inputs a command such as "cd\business" or a command for changing the directory designation using the keyboard. At this time, first, the control device 10
This command line is input through the command retrieval means 302 inside the computer 6 or through the processing steps S1 and S21 in FIG.

コマンド検索手段３０２は処理ステップＳ３により、オ
ペレーティングシステムの内部コマンドとして内部コマ
ンド記憶手段３０１に予め記憶されでいる内部コマンド
特有の文字列との照合を行う。このコマンド”ｃｄ”は
内部コマンドの１　ｆｉ類であるので、照合は内部コマ
ンドと一致し、Ｓ４の判断により、処理ステップＳ５に
進む。もしも入力されたコマンドか外部コマンドであっ
たり、プログラムであった場合には、処理ステップＳ１
０により、外部コマンド或いはプログラムを実行して処
理ステップＳ２に戻る。In processing step S3, the command search means 302 performs a comparison with a character string unique to an internal command stored in advance in the internal command storage means 301 as an internal command of the operating system. Since this command "cd" is a 1fi class of internal commands, the comparison results in a match with the internal command, and based on the determination in S4, the process proceeds to step S5. If the input command is an external command or a program, processing step S1
0, the external command or program is executed and the process returns to step S2.

処理ステップＳ５では、コマンド検索手段３０２におい
て、キーボード１０５から入力されたこの内部コマンド
かディレクトリ指定やバス指定の変更であるかどうかを
判断する。このコマンド″ｃｃｊ″はディレクトリ指定
のコマンドなので、処理ステップ８６に進む６もしも入
力されたコマンドがディレクトリ指定やバス指定の変更
でなければ、処理ステップＳ９で当該内部コマンドを実
行し、処理ステップＳ２に戻る。In processing step S5, the command search means 302 determines whether this internal command input from the keyboard 105 is a change in directory designation or bus designation. Since this command "ccj" is a command for specifying a directory, the process proceeds to step 86.6 If the input command does not change the directory specification or bus specification, the internal command is executed in process step S9, and the process proceeds to process step S2. return.

処理ステップＳ６では、コマンド実行手段３０３が音声
認識装置］０１に対して音声認識装置１０１の内部にあ
る音声特徴バタンの消去を指示する。In processing step S6, the command execution means 303 instructs the voice recognition device]01 to erase the voice characteristic button inside the voice recognition device 101.

次に、処理ステップＳ７では、主記憶装百１０８の内部
のバス指定領域１５１およびディレクトリ指定領域１１
０を入力されたコマンドの内容に従って変更する。ここ
では、ディレクトリ指定をディレクトリ名′°￥業務”
で書き換える。Next, in processing step S7, the bus designated area 151 and the directory designated area 11 inside the main memory 108 are
0 is changed according to the contents of the input command. Here, the directory specification is the directory name′°\business”
Rewrite with .

次に、処理Ｓ８では、バス指定、ディレクトリ指定によ
って音声認識の対象となる内部コマンド、外部コマンド
、プログラムのディレクトリ領域から音声特徴バタンを
音声認識装置１１０１に転送する。そのため、ここでは
、先ず、内部コマンド記憶手段３０１の内部に蓄積され
でいる内部コマンドに対応した音声特徴バタンを音声認
識装置１０１に転送する。次に、ティレフトリ指定変更
後のカレントディレクトリ”￥業務”の情報％９照して
２次記憶装Ｎ１０２の中に存在するディレクトリ”￥業
務”の中に存在する、外部コマンドおよびプログラムの
ディレクトリ領ｔ１ｉ　１０３からそれぞれに対応した
音声特徴バタンを音声認識装置ｆｌＯ１に転送する。こ
こではディレクトリ”￥業務”の中に存在するプログラ
ム”ワードプロセッサパ、”表計算”１０４のディレク
トリ領域１０３に格納されでいる音声特徴バタン「ワー
プロ」、「表計算」１１８が音声認識装置１０１に転送
される。さらに、必要ならば、現在のバス指定によって
示されるディレクトリ内の外部コマンドおよびプログラ
ムのディレクトリ領域から音声認識装置１０１へ音声特
徴バタンを転送する。Next, in step S8, voice characteristic buttons are transferred to the voice recognition device 1101 from the directory area of internal commands, external commands, and programs to be voice recognized by bus designation and directory designation. Therefore, here, first, the voice characteristic button corresponding to the internal command stored in the internal command storage means 301 is transferred to the voice recognition device 101. Next, the directory area t1i of external commands and programs existing in the directory "\business" existing in the secondary storage device N102 according to the information %9 of the current directory "\business" after the Tileft reference designation has been changed. 103, the corresponding voice feature buttons are transferred to the voice recognition device flO1. Here, the speech feature buttons "Word Processor" and "Spreadsheet" 118 stored in the directory area 103 of the programs "Word Processor" and "Spreadsheet" 104 existing in the directory "\Business" are transferred to the speech recognition device 101. be done. Furthermore, if necessary, the voice feature button is transferred from the external command and program directory area in the directory indicated by the current bus designation to the voice recognition device 101.

以上で、現在操作の対象となるオブジェクトに対応した
音声特徴バタンか全て音声認識装置１０１に転送され、
次の操作を音声認識によって入力する準備か整ったので
、処理ステップＳ２に戻り、次のコマンドの入力と処理
を続ける。With the above steps, all the voice characteristic buttons corresponding to the object currently being operated are transferred to the voice recognition device 101.
Since preparations for inputting the next operation by voice recognition are complete, the process returns to step S2 to continue inputting and processing the next command.

このようにディレクトリ指定変更のコマンドに連動して
、音声認識装置１０１の認識対象となる音声はｒワープ
ロＪとｒ表計算Ｊとなる（第１図　。In this way, in conjunction with the directory designation change command, the voices to be recognized by the voice recognition device 101 are r word processor J and r spreadsheet J (Fig. 1).

（Ｂ））。この実施例の場合にはディレクトリ指定の変
更時の動作について記したか、バス指定の変更時も同様
の動作を行う。(B)). In this embodiment, the operation when the directory designation is changed has been described, and the same operation is performed when the bus designation is changed.

このように、音声認識装置１０１の標準バタンは、現在
の操作の対象となっているオブジェクト固有の音声に間
する情報（音声情報）によって自動的に書き換えられる
ため、音声認識装置１０１ヘマイクロホシ等の適当な音
声入力手段を介して発声音声を入力させ、この入力音声
によってコンピュータを操作するための音声認識の対象
となる音声は、現在のコンピュータの操作の対象となる
オブジェクトに関する音声に限定されることとなる。In this way, the standard button of the speech recognition device 101 is automatically rewritten with the information (speech information) that is specific to the object that is the object of the current operation. The voice to be inputted through a voice input means and the voice to be recognized for operating the computer using this input voice is limited to the voice related to the object currently being operated by the computer. Become.

〈音声認識装置による操作〉次に、コンピュータを音声認識装置１０１によって操作
する手順を第１図（Ｃ）および（Ｄ）によって説明する
。音声認識装置ｔｌＯ１に対して使用者がコマンドに対
応した音声、例えばｒ表計算Ｊを発声すると、音声認識
装置１０１は従来周知の認識処理を行って認識結果「表
計算Ｊを制御装置１０６に出力する。制御装置１０６は
第４図の処理フロー（３２〜Ｓ４．５１０）に従ってこ
の認識結果に対応したプログラムのファイルをディレク
トリ領域１０３の中から検索しく第１図（Ｃ）に点線で
示す）、管理情報１０７の中の記憶場所に基づいて表計
算ファイル１０４を読み出し、主記憶装Ｍ１０８上に転
送して、このファイル１０４内のプログラムの実行を開
始する（第１図（Ｄ）に点線で示す）。また、発声内容
かプログラムに対応した音声ではなく、内部コマンドや
タト部コマンドに対応したものであれば、その発声内容
に対応した所望の動作が行われる。<Operation by Speech Recognition Device> Next, the procedure for operating the computer by the speech recognition device 101 will be explained with reference to FIGS. 1(C) and (D). When the user utters a voice corresponding to a command, for example r spreadsheet J, to the voice recognition device tlO1, the voice recognition device 101 performs a conventional recognition process and outputs the recognition result “spreadsheet J” to the control device 106. The control device 106 searches the directory area 103 for the program file corresponding to this recognition result according to the processing flow (32 to S4.510) in FIG. 4 (indicated by a dotted line in FIG. 1C), The spreadsheet file 104 is read out based on the storage location in the management information 107, transferred to the main storage device M108, and execution of the program in this file 104 is started (as shown by the dotted line in FIG. 1(D)). ).Furthermore, if the uttered content is not a voice corresponding to the program, but corresponds to an internal command or a tato part command, the desired action corresponding to the uttered content is performed.

以上説明した実施例のように、２次記憶装置１０２の中
に存在するコマンド（通常、外部コマンドと呼ばれる）
のほかに、内部コマンドの実行も音声認識によって操作
できるように構成するのか好適である。この場合には、
内部コマンドに対応した音声特徴バタンか２次記憶装置
１０２の中に存在するオペレーティングシステムファイ
ルの管理情報（図示せず）とともに存在し、コンピュー
タの電源投入時にオペレーティングシステムファイル（
図示せず）が２次記憶装置１０２から制御装冨内の内部
コマンド記憶手段３０１に転送される際に音声特徴バタ
ンか音声認識装置１０１に転送するように構成するのか
好適である。As in the embodiment described above, commands existing in the secondary storage device 102 (usually called external commands)
In addition to this, it is preferable to configure the system so that the execution of internal commands can also be operated by voice recognition. In this case,
The sound characteristic button corresponding to the internal command exists together with the management information (not shown) of the operating system file existing in the secondary storage device 102, and when the computer is powered on, the operating system file (
It is preferable that the voice feature button is transferred to the voice recognition device 101 when the command (not shown) is transferred from the secondary storage device 102 to the internal command storage means 301 in the control device.

また、オペレーティングシステムに対するコマンドの指
定を音声認識で行うのと同様に、アプリケーションプロ
グラムに対するコマンドやファイルの指定を音声認識で
行う場合にも、音声特徴バタンかアプリケーションプロ
グラムファイルの管理領域ヤ揉作の対象となるファイル
の管理情報に付加しておくのが好適である。In addition, in the same way that voice recognition is used to specify commands to the operating system, when voice recognition is used to specify commands and files to application programs, voice characteristics and application program file management areas are subject to manipulation. It is preferable to add this to the management information of the file.

さらに、上述した実施例では、音声認識装置として単語
音声認識袋Ｎを想定したか、この発明における音声認識
装置は必すしも単語単位の音声認識装置である必要はな
く、連続音声認識製雪であってもよい。この場合には、
管理情報の中には音声特徴バタンの代わりに、好ましく
は、ファイル固有の読み等の情報を音声に関する情報と
しで付加するのが良い。Furthermore, in the above-mentioned embodiment, the word speech recognition bag N was assumed as the speech recognition device, but the speech recognition device in this invention does not necessarily have to be a word-by-word speech recognition device, but may be a continuous speech recognition device. There may be. In this case,
Instead of the sound feature button, it is preferable to add information such as file-specific pronunciation to the management information as sound-related information.

ざらに、一般にコンピュータの操作の対象となる対象物
（以降これをオブジェクトと称する）は２次記憶装置上
のファイル、外部コマンドや制御装置内部の内部コマン
ドのみてはなく、Ｃ日Ｔティスプレィ上のカーソル、文
字、アイコン、ボタンを表現するデータ構造なども考え
られる。上述した実施例では内部コマンド、外部コマン
ドファイルやプログラムファイルをオブジェクトの一例
として挙げたか、オブジェクトは必すしもファイルのみ
に限定されるものではなく、例えば、マウス模作の対象
となるＣ日Ｔデイスプレィ上のアイコンやボタンを表現
するデータ構造をオブジェクトとして扱ってもよい。In general, the objects that are the targets of computer operations (hereinafter referred to as objects) are not only files on the secondary storage device, external commands, and internal commands inside the control device, but also objects on the C/T display. Data structures representing cursors, characters, icons, buttons, etc. can also be considered. In the embodiments described above, internal commands, external command files, and program files were cited as examples of objects, but objects are not necessarily limited to files. Data structures representing icons and buttons may be handled as objects.

（発明の効果）この発明によれば、音声認識装置によってコンピュータ
に対する操作を行う際に、認識対象となる音声を現在の
操作の対象となるオブジェクト（ファイル、データ構造
）に関連した音声に限定することかでき、音声認識の性
能か低く、少数の語粟しか認識できない場合でも容易に
コマンドの入力ができる。(Effects of the Invention) According to the present invention, when an operation is performed on a computer using a speech recognition device, the speech to be recognized is limited to the speech related to the object (file, data structure) that is the object of the current operation. This allows commands to be input easily even when voice recognition performance is low and only a few words can be recognized.

[Brief explanation of the drawing]

菓１図（Ａ）〜（Ｄ）は、この発明の音声掃作コンピュ
ータの説明に供する図、第２図（Ａ）〜（Ｃ）は、従来のコシどユータの説明図
、第３図は、この発明の音声操作コンピュータが備える制
御装置の説明に供する図、笥４図は、第３図の動作フローである。］０１・・・音声認識装置、１０２・・・２次記憶装冒
１０３−・・ディレクトリ領域］０４・・・ファイル、　　１０５・・・キーボード’
１０６・・・制御装置、　　　１０８・・・主記憶装置
］０９・・・オへレーティングシステム］］○・・・デ
ィレクトリ指定］５１・・・パス指定。Figures 1 (A) to (D) are diagrams for explaining the voice sweeping computer of the present invention, Figures 2 (A) to (C) are diagrams for explaining a conventional computer, and Figure 3 is a diagram for explaining the conventional computer. FIG. 4 is a diagram illustrating the control device included in the voice-operated computer of the present invention. FIG. 4 is an operational flowchart of FIG. 3. ]01...Speech recognition device, 102...Secondary storage device 103-...Directory area]04...File, 105...Keyboard'
106...Control device, 108...Main memory]09...Operating system]]○...Directory specification]51...Path specification.

Claims

[Claims]

(1) A keyboard for inputting commands, a control device connected to this keyboard to execute programs, a main storage device connected to this control device for storing programs being executed and data being processed, and the control device a secondary storage device that is connected to store executable programs and data that does not need to be processed at high speed, and a speech recognition device that is connected to the control device and performs recognition processing on vocalized sounds and outputs recognition results to the control device. The secondary storage device stores object management information and the data structure itself with voice information added to improve voice recognition performance, and the control device stores information from the keyboard. A voice-operated computer, characterized in that it has a function of transmitting voice information specific to the object currently being operated, as a reference pattern for recognition, to the voice recognition device according to an input command. .