JPH1153162A

JPH1153162A - Voice processor

Info

Publication number: JPH1153162A
Application number: JP9212765A
Authority: JP
Inventors: Osahisa Okamoto; 長久岡本; Koji Aizawa; 浩二相沢
Original assignee: Hitachi Engineering and Services Co Ltd
Current assignee: Hitachi Engineering and Services Co Ltd
Priority date: 1997-08-07
Filing date: 1997-08-07
Publication date: 1999-02-26

Abstract

PROBLEM TO BE SOLVED: To carry out all operations from the start to the end of application including data input and data storage with voice commands by freely registering a series of operations on a computer device in a dictionary and specifying them with vocal words and phrases. SOLUTION: A voice dictionary 6 on the side of a voice recognition device contains reading and indexes and a voice dictionary 13 on the side of the computer device contain texts and indexes corresponding to them in dictionary form. The index value of a recognized word is sent from the voice recognition device 1 to the computer device 2, which instructs an application operation part 12 to perform key operation that the index indicates, thereby performing application operation. A series of operations for moving a cursor and pressing a return key are represented in text form and registered in the dictionary 13 to enable key operations corresponding to the contents, so that the computer device 2 can be operated with voice.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は音声認識し、音声処
理を行う音声処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech processing apparatus for recognizing speech and performing speech processing.

【０００２】[0002]

【従来の技術】従来の音声処理装置にあっては、音声と
カーソル，キーなどの操作を表わすテキストとを対応さ
せて記憶し、入力された音声を認識したときに記憶した
テキストに沿って処理実行するものであった。2. Description of the Related Art In a conventional voice processing apparatus, a voice and a text representing an operation of a cursor, a key or the like are stored in association with each other, and processing is performed according to the stored text when an input voice is recognized. Was to run.

【０００３】[0003]

【発明が解決しようとする課題】前述のように、従来か
ら声でパソコンを操作できるシステムはあったが、これ
らは操作の登録が面倒であったり、アプリケーション毎
に使える命令が固定化されてしまっていたりで、決して
使いやすいものではなかった。As described above, there have conventionally been systems that can operate a personal computer by voice. However, in these systems, registration of operations is troublesome, and commands usable for each application are fixed. It was not easy to use.

【０００４】プラントあるいは機器の点検結果の収集，
各種検計機器の検討結果の収集などに音声処理装置を採
用し、直ちにこれらの結果がコンピュータ装置（パソコ
ン）内にデータとして取り込まれ、まとめると言った省
力化が必要とされて来ている。[0004] Collection of inspection results of plants or equipment,
It is necessary to use a sound processing device for collecting the examination results of various measuring instruments, etc., and immediately take these results as data in a computer device (personal computer), and to save the labor to collect the results.

【０００５】本発明は、このようなニーズに応え、省力
化に貢献し、作業者が音声を使用して使い易く、直ちに
作業内容を実行に移すことの出来る音声処理装置を提供
することを目的とする。An object of the present invention is to provide a voice processing apparatus that meets such needs, contributes to labor saving, is easy to use by a worker using voice, and can immediately execute work contents. And

【０００６】[0006]

【課題を解決するための手段】本発明は、この技術課題
を解決するために、コンピュータ装置に対する一連のキ
ー操作を自由に辞書に登録できるようになし、これを音
声・語句で指定できるようにした。こうすることで、ア
プリケーションの起動からデータ入力，データ格納，終
了まで、全での操作を音声指令で操作できるようにし
た。SUMMARY OF THE INVENTION In order to solve this technical problem, the present invention allows a series of key operations on a computer device to be freely registered in a dictionary, so that the operation can be specified by voice or phrase. did. By doing so, all operations from starting the application to data input, data storage, and termination can be operated by voice commands.

【０００７】本発明は具体的には、音声を入力する音声
入力装置と、入力された音声を認識する音声認識装置
と、指定した読みを記憶する読み記憶手段と、読みとカ
ーソル，キーなどの操作を表わすテキストとを対応させ
て記憶するテキスト記憶手段と、読みの内容をテキスト
に従って実行する一連のテキストを記憶するテキストフ
ァイル記憶手段と、前記音声認識装置で認識した音声か
ら指定された読みを認識し、該読みに対応して、前記テ
キストファイル記憶手段から一連のテキストの実行手順
を求めて一連のテキストの内容を実行する実行手段とを
備えたことを特徴とする音声処理装置を提供する。More specifically, the present invention provides a voice input device for inputting a voice, a voice recognition device for recognizing the input voice, a reading storage means for storing a specified reading, and a reading and cursor, key and the like. Text storage means for storing text representing operations in association with each other; text file storage means for storing a series of texts that execute the contents of reading according to the text; and reading specified from the voice recognized by the voice recognition device. Executing means for recognizing and reading a series of texts from the text file storage means in response to the reading and executing the contents of the series of texts. .

【０００８】本発明は音声を入力する音声入力装置と、
入力された音声を認識する音声認識装置と、指定した読
みを記憶する読み記憶手段と、読みとカーソル，キーな
どの操作を表わすテキストとを対応させて記憶するテキ
スト記憶手段と、読みの内容をテキストに従って実行す
る一連のテキストを記憶するテキストファイル記憶手段
と、前記読みで起動，データ入力，データ格納，終了ま
でのアプリケーション操作の内容を前記テキスト記憶手
段およびテキストファイル記憶手段で記憶されたテキス
トに基づいて実行する実行手段とを備えたことを特徴と
する音声処理装置を提供する。[0008] The present invention provides a voice input device for inputting voice,
A voice recognition device for recognizing an input voice, a reading storage means for storing a specified reading, a text storage means for storing a reading and a text representing an operation of a cursor, a key or the like in association with each other; Text file storage means for storing a series of texts to be executed in accordance with the text, and contents of the application operation from start to reading, data input, data storage, and termination to the text stored in the text storage means and text file storage means And an execution means for executing the sound processing based on the sound processing.

【０００９】[0009]

【発明の実施の形態】以下、本発明にかかる一実施例を
図面に基づいて説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment according to the present invention will be described below with reference to the drawings.

【００１０】図１において、音声処理装置は、音声認識
装置１およびコンピュータ装置（パソコン）２からな
る。In FIG. 1, the speech processing device comprises a speech recognition device 1 and a computer device (personal computer) 2.

【００１１】音声認識装置１は、外部のマイクロフォン
３から入力された音声指令をＡ／Ｄ変換するＡ／Ｄ変換
部４、その変換された信号を音声として認識する音声認
識部５、これとデータのやり取りのされる音声辞書（装
置側）６，演算処理装置（ＣＰＵ）９からなる。この音
声辞書は記憶手段として機能し、読み７とそれに対応し
たインデックス８とを記憶する。このインデックスが通
信回路１０を介してコンピュータ装置２に送信される。The voice recognition device 1 includes an A / D conversion unit 4 for A / D converting a voice command input from an external microphone 3, a voice recognition unit 5 for recognizing the converted signal as voice, and a voice recognition unit 5. And a processing unit (CPU) 9 for exchanging voice data. This speech dictionary functions as a storage unit, and stores the reading 7 and the index 8 corresponding thereto. This index is transmitted to the computer device 2 via the communication circuit 10.

【００１２】コンピュータ装置２は、演算処理装置（Ｃ
ＰＵ）１１、これを指令を受けて作動するアプリケーシ
ョン操作部１２、これらとデータとのやり取りのされる
音声辞書（パソコン側）１３からなる。この音声辞書は
記憶手段として機能し、テキスト１４とインデックス１
５とを記憶する。このインデックスは通信回路１０を介
して送信されて来た前述のインデックス８に対応する。The computer device 2 includes an arithmetic processing unit (C
PU) 11, an application operation unit 12 that operates upon receiving a command, and a voice dictionary (PC side) 13 for exchanging data with these. This speech dictionary functions as a storage unit, and stores the text 14 and the index 1
5 is stored. This index corresponds to the aforementioned index 8 transmitted via the communication circuit 10.

【００１３】従って、この二つの音声辞書は「読み」と
「テキスト」との対応関係を示す対応部１６に示すよう
に読み，テキストおよびインデックスの関係を有する。
音声認識装置側音声辞書６には、読みとインデックスが
記憶され、それに対応するテキストとインデックスをコ
ンピュータ装置側音声辞書１３が辞書として持つ。音声
認識装置１からは、認識された単語のインデックス値が
コンピュータ装置２へ送られ、コンピュータ装置２で
は、そのインデックスが示すキー操作をアプリケーショ
ン操作部１２に対し、アプリケーション操作を実行させ
る。Accordingly, these two voice dictionaries have a relationship between reading, text and index, as shown in a correspondence section 16 indicating the correspondence between "reading" and "text".
The speech recognition device-side speech dictionary 6 stores readings and indexes, and the computer device-side speech dictionary 13 has corresponding texts and indexes as dictionaries. The speech recognition device 1 sends the index value of the recognized word to the computer device 2, and the computer device 2 causes the application operation unit 12 to execute the key operation indicated by the index in the application operation.

【００１４】カーソルを移動したり、エンターキーを押
したりする一連の操作を前述したテキストで表現し、そ
れを辞書に登録しておくことでその内容に相当するキー
操作等をパソコンに実施できることとした。A series of operations such as moving a cursor and pressing an enter key are expressed in the above-described text, and by registering them in a dictionary, key operations and the like corresponding to the contents can be performed on a personal computer. did.

【００１５】テキストについて更に詳述すれば次のよう
になる。テキストおよびテキストに従って実行する一連
のテキストからなるテキストファイルについての構成は
次のようになる。The text will be described in more detail as follows. The structure of a text file consisting of text and a series of texts to be executed according to the text is as follows.

【００１６】テキスト，テキストファイル読み〔ＤＯＷＮ〕カーソルした〔ＰＧＵＰ〕スクルールうえ〔ＥＮＴＥＲ〕リターン〔ＵＰ２〕〔ＲＩＧＨＴ〕カーソルふたつうえでみぎ〔ＣＴＲＬ〕〔ＥＳＣ〕rcalc.exe, でんたくのじっこうこのようにすることで、パソコンに対する操作をあらか
じめ音声辞書に登録しておけるため、パソコンを音声で
操作することが可能になる。Reading text, text file [DOWN] Cursor [PGUP] Scroller [ENTER] Return [UP2] [RIGHT] Cursor double [CTRL] [ESC] rcalc.exe, like this By doing so, the operation of the personal computer can be registered in the voice dictionary in advance, so that the personal computer can be operated by voice.

【００１７】ここで記号の説明をしておくと、基本的に
はパソコンの１つ１つのキーをテキストとすることがで
きる。また、対象キーに対応するテキストは、自由に設
定することができるものとする。テキストと対象キーの
一例を下記に示す。To explain the symbols here, basically, each key of the personal computer can be a text. The text corresponding to the target key can be set freely. An example of the text and target key is shown below.

【００１８】テキスト対象キー＾：ＣＴＲＬ［ＥＮＴＥＲ］：ＥＮＴＥＲ［ＤＯＷＮ］： ↓ ［ＵＰ］： ↑ ［ＲＩＧＨＴ］： → ［ＬＥＦＴ］： ← ・・・・次にテキストファイルで編集した事例を示す。Key for text ＾: CTRL [ENTER]: ENTER [DOWN]: ↓ [UP]: ↑ [RIGHT]: → [LEFT]: ← Next, an example of editing with a text file is shown.

【００１９】〔ＵＰ〕，カーソルうえ〔ＵＰ２〕，カーソルふたつうえ〔ＵＰ３〕，カーソルみっつうえ〔ＤＯＷＮ〕，カーソルした〔ＤＯＷＮ２〕，カーソルふたつした〔ＤＯＷＮ３〕，カーソルみっつした〔ＲＩＧＨＴ〕，カーソルみぎ〔ＲＩＧＨＴ２〕，カーソルふたつみぎ〔ＲＩＧＨＴ３〕，カーソルみっつみぎ〔ＬＥＦＴ〕，カーソルひだり〔ＬＥＦＴ２〕，カーソルふたつひだり〔ＬＥＦＴ３〕，カーソルみっつひだり〔ＰＧＵＰ〕，スクロールうえ〔ＰＧＤＮ〕，スクロールした〔ＢＳ〕，こうたい〔ＢＳ２〕，ふたつこうたい〔ＢＳ３〕，みっつこうたい〔ＥＮＴＥＲ〕，リターン〔ＥＮＴＥＲ〕〔ＲＩＧＨＴ〕〔ＵＰ〕，かくてい〔ＥＮＴＥＲ〕，とうろく＾〔ＨＯＭＥ〕〔ＤＯＷＮ４〕〔ＲＩＧＨＴ〕，ひづけ＾〔ＨＯＭＥ〕〔ＤＯＷＮ４〕〔ＲＩＧＨＴ８〕，てん
こう〔ＨＯＭＥ〕，ぎょうのせんとう〔ＨＯＭＥ〕，てんけんじこく〔ＴＡＢ〕，タブ〔ＩＮＳＥＲＴ〕，インサート〔ＤＥＬ〕，デリート＾（ｏ），ファイルをひらく＾（ｎ），しんきさくせい＾（ｐ），いんさつ＾（ｓ），うわがきほぞん＾（ｃ），コピー＾（ｖ），はりつけ＾（ｘ），きりとり＾（ｚ），もとにもどす＾（ｙ），くりかえし＾（ｆ），けんさく＾（ｈ），ちかん〔Ｆ１〕，へルプのひょうじ〔Ｆ２〕，へんしゅう〔ＵＰ〕＾〔ＩＮＳＥＲＴ〕〔ＤＯＷＮ〕〔ＥＮＴＥ
Ｒ〕〔ＲＩＧＨＴ〕，うえにおなじ％〔Ｆ４〕，アプリケーションのしゅうりょうＹ，はいＮ，いいえ〔ＥＳＣ〕，キャンセル％〔Ｆ〕，メニューのファイル％〔Ｅ〕，メニューのへんしゅう％〔Ｖ〕，メニューのひょうじ％〔Ｉ〕，メニューのそうにゅう％〔Ｏ〕，メニューのしょしき％〔Ｔ〕，メニューのツール％〔Ｗ〕，メニューのウィンドゥ％〔Ｈ〕，メニューのヘルプ＾〔Ｆ９〕，さいしょうか＾〔Ｆ１０〕，さいだいか＾〔Ｆ５〕，もとのサイズ＾〔Ｆ４〕，ファイルをとじる！ＥＮＴＥＲ，けってい！ＮＥＸＴ，へんかん！ＢＥＦＯＲＥ，まえ！ＡＦＴＥＲ，つぎ！ＣＡＮＣＥＬ，とりけし！ＩＭＥＯＮ，にほんごモードせってい！ＩＭＥＯＦＦ，にほんごモードかいじょ＾〔ＥＳＣ〕rnotepad.exe，メモちょうのじっこう＾〔ＥＳＣ〕rcalc,exc, でんたくのじっこう＾〔ＥＳＣ〕rc：\msoffice\excel\excel.exe, エクセ
ルのじっこう＾〔ＥＳＣ〕rc：\ＴＫ\ＴＫ．ＢＡＴ，てんけんのじっ
こう＾（ｇ）ヘリウム〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，へりうむきょうきゅうせつび HELIUM ＾（ｇ）アルゴン〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，あるごんきょうきゅうせつび ARUGON ＾（ｇ）各種流量〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，かくしゅがすりゅうりょう KAKUSYURYUYO ＾（ｇ）Ｈ２系〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，えっちつーけいH2KEI ＾（ｇ）Ｎ２系〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，えぬつーけいN2KEI ＾（ｇ）混合流量〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，こんごうがすりゅうりよう KONGORYUYO ＾（ｇ）混合ガス系〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷ
Ｎ２〕，こんごうがすけい KONGOGASUKEI ＾（ｇ）Ｈ２分析〔ＥＮＴＥＲ〕〔ＥＮＤ〕〔ＤＯＷＮ
２〕，えっちつーぶんせきけい H2BUNSEKI ＾（ｇ）各種記事〔ＥＮＴＥＲ〕〔ＤＯＷＮ〕,かくし
ゅがすきじ KAKUSYUKIJI ＾（ｇ）混合記事〔ＥＮＴＥＲ〕〔ＤＯＷＮ〕，こんご
うがすきじ KONGOKIJI ％〔ＴＡＢ〕，アプリケーションのきりかえ＃１，おんせいじしょいち＃２，おんせいじしょに＃３，おんせいじしょさん＃４，おんせいじしょよん＾〔ＰＧＵＰ〕，まえのシート＾〔ＰＧＤＮ〕，つぎのシート〔ＢＳ〕，もどる〔ＤＥＬ〕，いれなおしこのように、自由な発声で好きなキー操作を、しかもテ
キストファイルで容易に編集・登録できるため、専用プ
ログラムを作成することなく、かつパソコン上のアプリ
ケーションに手を加えることなく、簡単に音声処理装置
とすることができる。[UP], cursor up [UP2], cursor up [UP3], cursor up [DOWN], cursor up [DOWN2], cursor up [DOWN3], cursor up [RIGHT], cursor up [RIGHT2], cursor two [RIGHT3], cursor three [LEFT], cursor left [LEFT2], cursor two [LEFT3], cursor three [PGUP], scroll up [PGDN], scroll [ [BS2], two [BS3], three [ENTER], return [ENTER] [RIGHT] [UP], 〔[ENTER], ＾ [HOME] [DOWN4] [ RIGHT], [HOME] [DOWN4] [RIGHT8], Health [HOME], Gyo no Sento [HOME], Health [TAB], Tab [INSERT], Insert [DEL], Delete ＾ (o), Open a file ＾ (n), き (p), ＾ (s), ほ (c), copy ＾ (v), crucifix ＾ (x), cut とり (z), Return ＾ (y), repeat ＾ (f), kensaku ＾ (h), chikan [F1], help's display [F2], change [UP] ＾ [INSERT] [DOWN] [ENTER
R] [RIGHT], same as above [F4], application Y, yes N, no [ESC], cancel% [F], menu file% [E], menu change% [V] , Menu information% [I], menu information% [O], menu information% [T], menu tool% [W], menu window% [H], menu help ＾ [F9] , いしょうＦ [F10], いかさ [F5], original size ＾ [F4], close the file! Enter! NEXT, epilepsy! Before, before! AFTER, next! CANCEL, take it! IMEON, Japanese mode! IMEOFF, Japanese mode ＾ [ESC] rnotepad.exe, memo じ [ESC] rcalc, exc,, た＾ [ESC] rc: \ msoffice \ excel \ excel.exe, じセル[ESC] rc: \ TK \ TK. BAT, balance て (g) Helium [ENTER] [END] [DOWN
2), HELIUM ＾ (g) argon [ENTER] [END] [DOWN
ARGUON II (g) Various flow rates [ENTER] [END] [DOWN]
2], KAKUSYURYUYO ＾ (g) H2 series [ENTER] [END] [DOWN
2], ETSU-KEI H2KEI (g) N2 series [ENTER] [END] [DOWN
2], Entsu-kei N2KEI (g) Mixing flow rate [ENTER] [END] [DOWN]
2) 、 Kongoryuyo ＾ (g) Mixed gas system [ENTER] [END] [DOW
N2], KONGOGASUKEI ＾ (g) H2 analysis [ENTER] [END] [DOWN
2), H2BUNSEKI ＾ (g) Various articles [ENTER] [DOWN], KAKUSYUKIJI ＾ (g) Mixed articles [ENTER] [DOWN], KONGOKIJI% [TAB] , Application change # 1, onset # 2, onset # 3, onset # 4, onset ＾ [PGUP], previous sheet ＾ [PGDN], next sheet [ BS], return [DEL], rewrite In this way, you can easily edit and register your favorite key operations with a free utterance, and also with a text file. The audio processing device can be easily formed without any modification.

【００２０】アプリケーション操作命令に関係して、テキストファイル読み例１＾〔ＨＯＭＥ〕〔ＤＯＷＮ△〕〔ＲＩＧＨＴ〕ひづけ読み「ひづけ」の場合に、どのように機能するかを図２
に基づいて説明する。In connection with the application operation instruction, a text file reading example 1 {[HOME] [DOWN}] [RIGHT] HISHI In the case of reading "HISHI", how it functions is shown in FIG.
It will be described based on.

【００２１】入力エリア（Ａ）の位置で、日付入力エリ
ア（Ｂ）に移動する場合、「ひづけ」と発声する。この
音声指令により登録したテキストに従って実行が自動的
になされ、日付エリア（Ｂ）へと移動する。これによっ
て、日付エリア（Ｂ）に日付を入力することが可能にな
る。＾［ＨＯＭＥ］，［ＤＯＷＮ４］，［ＲＩＧＨＴ］
を説明すれば次の通りである。When moving to the date input area (B) at the position of the input area (A), "his" is uttered. The execution is automatically performed according to the text registered by the voice command, and moves to the date area (B). This makes it possible to input a date in the date area (B). ＾ [HOME], [DOWN4], [RIGHT]
Is described as follows.

【００２２】キー操作該当（イ）＾[ＨＯＭＥ］＝ＣＴＲＬ＋ＨＯＭＥキーを押
した状態（ロ）［ＤＯＷＮ４］＝ ↓キーを４回押した状態（ハ）［ＲＩＧＨＴ］＝ →キーを１回押した状態 “＾［ＨＯＭＥ］［ＤＯＷＮ４］［ＲＩＧＨＴ］てんこ
う”についても同様であることが理解できよう。Key operation applicable (a)） [HOME] = CTRL + HOME key pressed (b) [DOWN4] = ↓ key pressed four times (c) [RIGHT] = → key pressed once It can be understood that the same applies to "@ [HOME] [DOWN4] [RIGHT]".

【００２３】例２＾（ｇ）ヘリウム〔ＥＮＴＥＲ〕
〔ＥＮＤ〕〔ＤＯＷＮ２〕，へりうむきょうきゅうせつ
び読み「へりうむきょうきゅうせつび」の場合に、どのよ
うに機能するかを図３に基づいて説明する。Example 2 ＾ (g) helium [ENTER]
[END] [DOWN2], how to function in the case of the reading "Herimukyokyusetsu" will be described with reference to FIG.

【００２４】入力範囲（Ａ）の入力エリア（ＡＡ）の位
置で、入力範囲「ヘリウム」の入力エリア（ＢＢ）に移
動させたい場合、「へりうむきょきゅうせつび」と発声
する。この音声指令により登録したテキストに従って実
行が自動的になされて入力範囲が切替えられ、入力エリ
アが（ＢＢ）に移動する。すなわち、（ニ）入力範囲を
（Ａ）からヘリウムに切替える機能（ホ）入力エリア
（ＢＢ）に移動させる機能によって実行がなされる。こ
れによって、プラントの「ヘリウム」の状態をコンピュ
ータ装置に入力エリア（ＢＢ）に入力することが可能に
なる。When the user wants to move to the input area (BB) of the input range "helium" at the position of the input area (AA) of the input range (A), he utters "Helium". The execution is automatically performed according to the text registered by the voice command, the input range is switched, and the input area moves to (BB). That is, (d) the function of switching the input range from (A) to helium (e) the function of moving to the input area (BB) is executed. This makes it possible to input the state of "helium" of the plant to the computer device in the input area (BB).

【００２５】＾（ｇ）ヘリウム［ＥＮＴＥＲ］はテキス
トを表し、［ＥＮＤ］［ＤＯＷＮ２］は読みを表す。キ
ー操作該当状態を表せば、次のようである。(G) Helium [ENTER] represents text, and [END] [DOWN2] represents reading. The state corresponding to the key operation is as follows.

【００２６】キー操作該当＾（ｇ）＝ＣＴＲＬ＋Ｇキーを押した状態ヘリウム［ＥＮＴＥＲ］＝「ヘリウム」とキー入力した後にＥＮＴＥＲキーを押した状態［ＥＮＤ］＝ＥＮＤキーを押した状態［ＤＯＷＮ２］＝ ↓キーを２回押した状態前述のように“＾（ｇ）アルゴン［ＥＮＴＥＲ］［ＥＮ
Ｄ］［ＤＯＷＮ２］，あるごんきょうきゅうせつび”な
どが使用されている。これらについても機能は同様であ
り、容易に理解することができよう。Key operation applicable ＾ (g) = CTRL + G key pressed Helium [ENTER] = key input of “helium” and then ENTER key pressed [END] = END key pressed [DOWN2] = ↓ key pressed twice As described above, “＾ (g) Argon [ENTER] [EN
D] [DOWN2], and certain functions are also used. These functions are the same and can be easily understood.

【００２７】[0027]

【発明の効果】以上のように本発明によれば、自由な音
声指令（読み）で好きなキー操作・カーソル操作を、し
かもテキストファイルを用いることによって容易に編集
・登録できるため、専用プログラムを作成することなく
音声処理することができる。これによって、音声指令
で、起動・データ入力，データ格納，終了までのアプリ
ケーション操作の内容を実行することができることにな
る。As described above, according to the present invention, the user can easily edit and register any key operation and cursor operation with a free voice command (reading) by using a text file. Audio processing can be performed without creating. As a result, it is possible to execute the contents of the application operation up to start-up, data input, data storage, and termination by a voice command.

[Brief description of the drawings]

【図１】本発明の実施例を説明するためのブロック図。FIG. 1 is a block diagram for explaining an embodiment of the present invention.

【図２】本発明の機能を示すための機能図。FIG. 2 is a functional diagram showing the functions of the present invention.

【図３】本発明の機能を示すための機能図。FIG. 3 is a functional diagram showing the functions of the present invention.

[Explanation of symbols]

１…音声認識装置、２…コンピュータ装置、５…音声認
識部、６…音声辞書部（装置側）、７…読み、８…イン
デックス、９…ＣＰＵ、１０…通信回路、１１…ＣＰ
Ｕ、１２…アプリケーション操作部、１３…音声辞書
（パソコン側）、１４…テキスト、１５…インデック
ス、１６…対応部。DESCRIPTION OF SYMBOLS 1 ... Speech recognition apparatus, 2 ... Computer apparatus, 5 ... Speech recognition part, 6 ... Speech dictionary part (device side), 7 ... Reading, 8 ... Index, 9 ... CPU, 10 ... Communication circuit, 11 ... CP
U, 12: application operation unit, 13: voice dictionary (PC side), 14: text, 15: index, 16: corresponding unit.

Claims

[Claims]

A voice input device for inputting voice, a voice recognition device for recognizing the input voice, a reading storage means for storing a specified reading, and a text representing a reading and an operation of a cursor, a key or the like. Text storage means for storing in association with each other, text file storage means for storing a series of texts for executing the contents of reading according to the text, and recognizing a specified reading from the voice recognized by the speech recognition device, And an execution unit for executing a series of text contents by obtaining an execution procedure of the series of texts from the text file storage unit.

2. A voice input device for inputting voice, a voice recognition device for recognizing the input voice, a reading storage means for storing a specified reading, and a text representing a reading and an operation of a cursor, a key or the like. Text storage means for storing corresponding texts, text file storage means for storing a series of texts for executing the contents of reading according to the texts, and contents of application operations up to activation, data input, data storage, and termination in the readings. An audio processing device comprising: a text storage unit; and an execution unit that executes based on text stored in a text file storage unit.