JP3129893B2

JP3129893B2 - Voice input word processor

Info

Publication number: JP3129893B2
Application number: JP05262357A
Authority: JP
Inventors: 夏樹湯浅
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1993-10-20
Filing date: 1993-10-20
Publication date: 2001-01-31
Anticipated expiration: 2016-01-31
Also published as: JPH07121651A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は音声入力ワープロに関
し、より詳細には言語に関して入力されたデータが認識
できなかった場合に入力されたデータをそのままの形態
で出力し得る音声入力ワープロに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech input word processor , and more particularly to a speech input word processor capable of outputting input data as it is when the input data for a language cannot be recognized. Related to voice input word processor .

【０００２】[0002]

【従来の技術】従来の言語認識装置は、手書文字認識装
置や音声認識装置を初めとして、いろいろなものが実現
されているが、言語を認識できなかった場合の処理とし
ては、「認識できなかった」という印を記録するもの
や、認識できるまで繰り返し入力を要求するものがあ
る。2. Description of the Related Art Conventionally, various types of language recognition devices have been realized, such as a handwritten character recognition device and a speech recognition device. Some records the mark "No," and others require repeated input until they can be recognized.

【０００３】また、ＯＣＲにおいては、原画像を保存し
ておいて認識結果と原画像とを対比させることによっ
て、誤認識された文字を修正し易くしているシステムが
存在する。In OCR, there is a system that stores an original image and compares the recognition result with the original image, thereby making it easy to correct an erroneously recognized character.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、従来の
技術の一番目の方式では、認識できなかったときに入力
されていたデ−タは失われてしまうことになる。認識装
置が認識できなかったデ−タでも人間には認識できる場
合が多いので、このデ−タは生かすべきである。However, in the first method of the prior art, data input when recognition was not possible is lost. Since data that could not be recognized by the recognition device can often be recognized by humans, this data should be utilized.

【０００５】また、従来の技術の二番目の方式ではＯＣ
Ｒで読み取った全画像デ−タを保存しておく必要があ
り、記憶領域を大量に必要とする。[0005] Further, in the second method of the prior art, the OC method is used.
It is necessary to store all image data read by R, which requires a large storage area.

【０００６】さらに、従来の方式では繰り返し入力を求
めるよりは無理矢理一番近い文字に認識してしまった方
が使用者からの印象が良いだろうという判断からか、認
識時の判断が悪くても無理矢理、ある文字として認識さ
せてしまう場合があった。Further, in the conventional method, it is difficult to judge whether a character is recognized as the closest to a character rather than repeatedly seeking an input. In some cases, it was forced to be recognized as a certain character.

【０００７】本発明の目的は、言語に関する入力データ
について認識できなかった場合にはその入力データその
ものを保存することによって、使用者が後からそのデー
タを参照した時に、何を入力したのか判別できる音声入
力ワープロを提供することにある。An object of the present invention is to save input data itself when language input data cannot be recognized, so that when the user refers to the data later, what is input can be determined. Voice In
To provide a power word processor .

【０００８】[0008]

【課題を解決するための手段】上述した目的は、音声デ
ータを入力する音声入力手段と、該音声入力手段より入
力された音声データを認識処理する音声認識手段と、該
音声認識手段の認識結果に応じて、該音声認識手段が前
記入力された音声データを認識した場合は文字データと
して記憶するとともに、前記記音声認識手段が前記入力
された音声データを認識しなかった場合は前記入力され
た音声データを記憶する手段と、前記記音声認識手段が
前記入力された音声データを認識しなかった場合はワー
プロの画面に不認識を表示し、かつ前記不認識表示部分
への指示により前記記憶手段に記憶した音声データを音
声出力する制御手段を有してなることを特徴とする音声
入力ワープロによって達成される。SUMMARY OF THE INVENTION An object of the present invention is to provide a voice input means for inputting voice data, a voice recognition means for recognizing and processing voice data input from the voice input means, and a voice recognition apparatus. depending on the result of recognition means, when the speech recognition means in together when the case of recognizing the voice data the input are stored as character data, did not recognize the voice data to which the Symbol speech recognition means is the input means for storing the audio data said input, said Symbol speech recognition means
If the input voice data is not recognized, an unrecognition is displayed on the screen of the word processor, and the voice data stored in the storage unit is sounded by an instruction to the unrecognized display portion.
This is achieved by a voice input word processor having a voice output control means.

【０００９】[0009]

【作用】音声認識手段は、音声入力手段より入力された
音声データを認識処理する。The voice recognition means is input from the voice input means.
Recognize voice data.

【００１０】[0010]

【００１１】[0011]

【００１２】制御手段は、前記音声認識手段が認識しな
かった音声データを、ワープロの画面に不認識を表示
し、かつ前記不認識表示部分への指示により前記記憶手
段に記憶した音声データより音声出力する。 [0012] The control means is configured so that the voice recognizing means does not recognize.
Unrecognized audio data is displayed on the word processor screen.
And an instruction to the unrecognized display portion causes the memory
The voice is output from the voice data stored in the row.

【００１３】[0013]

【実施例】図１は、本発明に関連のある言語認識装置の
一実施例を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of a language recognition apparatus related to the present invention.

【００１４】本実施例による言語認識装置は、認識すべ
きデ−タを入力する入力手段１と、言語の認識を行う認
識手段２と、認識された（コ−ド化された）デ−タある
いは認識できなかった（入力デ−タそのままの）デ−タ
を記憶する記憶手段３と、記憶手段３に記憶されている
認識されたデ−タに応じた言語または入力デ−タそのま
まを出力する出力手段４と、装置全体の作動を制御する
制御手段５とを備え、認識できなかった入力デ−タも、
認識されコ−ド化された入力デ−タと一緒に記憶手段３
に保存しておくことによって、使用者が後からそのデ−
タを参照した時に、何を入力したのかを判別できるもの
である。The language recognition apparatus according to the present embodiment has an input means 1 for inputting data to be recognized, a recognition means 2 for recognizing a language, and recognized (coded) data. Alternatively, storage means 3 for storing unrecognized data (as input data), and a language or input data corresponding to the recognized data stored in storage means 3 being output as it is. Output means 4 and control means 5 for controlling the operation of the entire apparatus.
Storage means 3 together with the recognized and coded input data
The user can save the data later.
When referring to the data, it is possible to determine what has been input.

【００１５】すなわち、入力手段１から入力される入力
デ−タが、制御手段５によって認識手段２に渡される。
ここで、入力デ−タは必要に応じて出力手段４にも渡さ
れ出力される。That is, input data input from the input means 1 is passed to the recognition means 2 by the control means 5.
Here, the input data is also passed to the output means 4 and output as required.

【００１６】認識手段２は、受け取った入力デ−タに基
づいて認識処理を行う。そして、認識できた場合には、
認識後のコ−ド化されたデ−タを記憶手段３に記憶さ
せ、認識できなかった場合には、認識できなかった入力
デ−タそのものを記憶手段３に記憶させる。The recognition means 2 performs a recognition process based on the received input data. And if it is recognized,
The coded data after recognition is stored in the storage means 3. If the recognition is not possible, the input data itself which is not recognized is stored in the storage means 3.

【００１７】出力時には記憶手段３の内容に応じて、認
識後のコ−ド化されたデ−タならそれを文字や音声等入
力時の形態に変換して出力手段４から出力するが、認識
できなかったデ−タの場合には、入力デ−タそのものが
記憶されているので、それをそのまま出力手段４から出
力する。At the time of output, according to the contents of the storage means 3, if the coded data after recognition is converted into a form at the time of inputting characters, voices, etc., it is outputted from the output means 4, but is output from the output means 4. In the case of data that could not be obtained, the input data itself is stored, so that it is output from the output means 4 as it is.

【００１８】図２は、言語認識装置の実施例の第１変形
例を示すブロック図である。FIG. 2 is a block diagram showing a first modification of the embodiment of the language recognition apparatus.

【００１９】言語認識装置の実施例の第１変形例は、入
力手段１１と、手書文字認識手段１２と、記憶手段１３
と、出力手段１４と、装置全体の作動を制御する制御手
段１５とである。制御手段１５は装置の中心に位置し、
マイクロプロセッサまたは専用ＬＳＩを備えている。入
力手段１１はライトペンや感圧シ−トまたは電磁誘導等
を利用した電子ペン等を備えたタブレット装置であり、
記入された手書文字の筆跡を制御手段１５に入力する。
出力手段１４はＣＲＴや液晶等のディスプレイで構成さ
れており、入力手段１１から入力された筆跡や、記憶手
段１３に保存されている文字や認識できなかった筆跡な
どを表示する。A first modification of the embodiment of the language recognition apparatus is as follows: an input unit 11, a handwritten character recognition unit 12, and a storage unit 13.
And output means 14 and control means 15 for controlling the operation of the entire apparatus. The control means 15 is located at the center of the device,
A microprocessor or dedicated LSI is provided. The input means 11 is a tablet device provided with a light pen, a pressure-sensitive sheet, an electronic pen using electromagnetic induction or the like,
The handwriting of the written handwritten character is input to the control means 15.
The output unit 14 is configured by a display such as a CRT or a liquid crystal, and displays handwriting input from the input unit 11, characters stored in the storage unit 13, and unrecognized handwriting.

【００２０】手書文字認識手段１２も制御手段１５と同
様にマイクロプロセッサまたは専用ＬＳＩを備え（制御
手段１５と共用し得る）、入力手段１１から入力された
筆跡のデ−タの認識を行う。The handwritten character recognizing means 12 also includes a microprocessor or a dedicated LSI (which can be shared with the control means 15) similarly to the control means 15, and recognizes handwriting data input from the input means 11.

【００２１】手書文字認識手段１２は、入力された筆跡
を認識することができれば、認識できたことを表すフラ
グとその文字コ−ドを制御手段１５を介して記憶手段１
３に記憶させるが、入力された筆跡が認識できなけれ
ば、認識できなかったことを表すフラグとその入力デ−
タそのものを記憶手段１３に記憶させる。つまり、記憶
手段１３から読み出すデ−タには、それが文字コ−ドな
のか入力デ−タそのものなのかを表すフラグが付されて
いる。If the handwritten character recognition means 12 can recognize the input handwriting, the handwriting character recognition means 12 stores a flag indicating the recognition and the character code via the control means 15 into the storage means 1.
If the input handwriting cannot be recognized, a flag indicating that the handwriting could not be recognized and the input data are stored.
The data itself is stored in the storage means 13. That is, the data read from the storage means 13 is provided with a flag indicating whether the data is character code or input data itself.

【００２２】図３は図２の第１変形例による文字の入力
について説明するフロ−チャ−トである。FIG. 3 is a flowchart for explaining character input according to a first modification of FIG.

【００２３】図３において、手書文字認識手段１２が入
力された筆跡の認識処理を行う（３−１）。もし認識で
きたなら文字コ−ドであることを表すフラグと認識でき
た文字の文字コ−ドとを記憶手段１３に記憶させる（３
−２、３−３）。もし認識できなければ認識できなかっ
たことを表すフラグ（これはそのデ−タが入力デ−タそ
のものであることを表すフラグとも言える）と認識でき
なかった入力デ−タそのものとを記憶手段１３に記憶さ
せる（３−２、３−４）。In FIG. 3, the handwritten character recognizing means 12 performs an input handwriting recognition process (3-1). If the character code is recognized, the flag indicating the character code and the character code of the recognized character are stored in the storage means 13 (3.
-2, 3-3). If the recognition is not possible, the storage means 13 stores a flag indicating that recognition was not possible (this can also be referred to as a flag indicating that the data is the input data itself) and the input data itself which could not be recognized. (3-2, 3-4).

【００２４】次に、このように記憶された情報を出力す
るときの処理を説明する。Next, a process for outputting the information stored in this manner will be described.

【００２５】このときには記憶手段１３からデ−タを読
み込み、そのデ−タに付加されているフラグの値を調べ
る。それが文字コ−ドを表すフラグであるなら、そのデ
−タを文字コ−ドとみなし、その文字コ−ドで表される
文字を出力手段１４に出力する。それが認識することが
できなかったことを表すフラグであるなら、そのデ−タ
を入力デ−タそのもの（つまり入力時の筆跡）とみな
し、その入力デ−タそのものを出力手段１４に出力す
る。At this time, the data is read from the storage means 13 and the value of the flag added to the data is checked. If it is a flag representing a character code, the data is regarded as a character code, and the character represented by the character code is output to the output means 14. If it is a flag indicating that it cannot be recognized, the data is regarded as input data itself (that is, handwriting at the time of input), and the input data itself is output to the output means 14. .

【００２６】図４は、図２の第１変形例による文字の出
力について説明するフロ−チャ−トである。FIG. 4 is a flowchart for explaining the output of characters according to the first modification of FIG.

【００２７】記憶手段１３からデ−タを読み込み（４−
１）、デ−タに付加されているフラグの値を調べる（４
−２）。Data is read from the storage means 13 (4-
1) Check the value of the flag added to the data (4)
-2).

【００２８】このフラグが文字コ−ドを表すフラグであ
るなら読み込んだデ−タを文字コ−ドとみなし、その文
字コ−ドに対応する文字を出力手段１４に出力する（４
−３）、一方、このフラグが入力デ−タを表すフラグで
あるならそのデ−タを入力デ−タそのもの（つまり入力
時の筆跡）とみなして、出力手段１４に出力する（４−
４）。If the flag is a flag representing a character code, the read data is regarded as a character code, and a character corresponding to the character code is output to the output means 14 (4).
-3) On the other hand, if this flag is a flag representing the input data, the data is regarded as the input data itself (that is, the handwriting at the time of input) and is output to the output means 14 (4).
4).

【００２９】デ−タにフラグを付加させる方法はいろい
ろあるが、簡単な例を以下に説明する。Although there are various methods for adding a flag to data, a simple example will be described below.

【００３０】（１）入力時の筆跡をグラフィックデ−タ
として格納する場合最初のビットをフラグとし、０なら「文字コ−ド」、１
なら「入力デ−タ」を表すことにする。このフラグが０
のときはその後に文字コード自体が続く。このフラグが
１の時には、筆跡を表すグラフィックデ−タの大きさを
表す数値がその後に続き、さらにその後に筆跡を表すグ
ラフィックデ−タ自体を置く。(1) When handwriting at the time of input is stored as graphic data The first bit is set as a flag.
Then, it represents "input data". This flag is 0
In the case of, the character code itself follows. When this flag is 1, a numerical value representing the size of the graphic data representing the handwriting follows, followed by the graphic data itself representing the handwriting.

【００３１】図５は、入力時の筆跡を認識できた場合の
デ−タ構造を説明する図、図６は入力時の筆跡を認識で
きなかった場合のデ−タ構造を説明する図である。FIG. 5 is a diagram for explaining the data structure when the handwriting at the time of input is recognized, and FIG. 6 is a diagram for explaining the data structure when the handwriting at the time of input is not recognized. .

【００３２】図５において（筆跡を認識できた場合）、
２０はフラグ０を入力すべき記憶領域、２１は文字コ−
ドそのものを入力すべき記憶領域、２２は文字「あ」を
表すグラフィックデ−タを示しており、記憶手段１３の
記憶領域２３にはフラグ０が、記憶領域２４には「あ」
の文字コ−ド（０００１）が記憶される。In FIG. 5 (when handwriting is recognized)
20 is a storage area for inputting the flag 0, 21 is a character code.
In the storage area to which the character itself is to be inputted, reference numeral 22 denotes graphic data representing the character "A", a flag 0 is stored in the storage area 23 of the storage means 13, and "A" is stored in the storage area 24.
Character code (0001) is stored.

【００３３】図６において（筆跡を認識できなかった場
合）、２５はフラグ１を入力すべき記憶領域、２６は入
力デ−タのサイズ（横方向ドット数および縦方向のドッ
ト数）を入力すべき記憶領域、２７は入力デ−タそのも
の（ビットイメ−ジデ−タ）を入力すべき記憶領域、２
８は文字「あ」を表すグラフィックデ−タを示してお
り、記憶手段１３の記憶領域２９にはフラグ１が、記憶
領域３０にはこの場合のビットイメ−サイズである横１
１ドット、縦１１ドットを表すデ−タが、３１にはビッ
トイメ−ジの各行を４ドットずつ区切って３桁の１６進
数で表したもの１１個からなるビットイメ−ジデ−タ
（０００，１３８，・・・４５０，０００）が記憶され
る。なお、この例ではビットイメ−ジの横のドット数は
４の倍数であることが望ましいが、サイズのデ−タを参
照すれば横のドット数が１１であることが分り、最後の
１ビット分はダミ−デ−タであることが判別できるので
問題ない。In FIG. 6 (when handwriting cannot be recognized), 25 is a storage area to which the flag 1 is to be input, and 26 is the size of input data (the number of horizontal dots and the number of vertical dots). A storage area 27 to which the input data itself (bit image data) is to be input;
Numeral 8 denotes graphic data representing the character "A", a flag 1 is stored in the storage area 29 of the storage means 13, and a horizontal 1 which is the bit image size in this case is stored in the storage area 30.
Data representing 1 dot and 11 vertical dots, and 31 is a bit image data (000, 138, 31) consisting of 11 pieces of three-digit hexadecimal numbers obtained by dividing each row of the bit image by 4 dots. ... 450,000) is stored. In this example, the number of horizontal dots in the bit image is desirably a multiple of four. However, referring to the size data, the number of horizontal dots is eleven, and the number of horizontal dots is eleven. Can be determined to be dummy data, so there is no problem.

【００３４】（２）入力時の筆跡をストロ−クデ−タと
して格納する場合最初のビットをフラグとし、０なら「文字コ−ド」、１
なら「入力デ−タ」を表す。このフラグが１のときに
は、筆跡を表すストロ−クデ−タの大きさ（ストロ−ク
数）を表す数値がその後に続き、さらにその後に筆跡を
表すストロ−クデ−タ自体を置く。(2) When the handwriting at the time of input is stored as stroke data The first bit is set as a flag.
Represents "input data". When this flag is 1, a numerical value representing the size (stroke number) of the stroke data representing the handwriting follows, followed by the stroke data itself representing the handwriting.

【００３５】図７は、入力時の筆跡を認識できた場合の
他のデ−タ構造を説明する図、図８は入力時の筆跡を認
識できなかった場合の他のデ−タ構造を説明する図であ
る。FIG. 7 is a diagram for explaining another data structure when handwriting at the time of input is recognized. FIG. 8 is a diagram for explaining another data structure when handwriting at the time of input is not recognized. FIG.

【００３６】図７において（筆跡を認識できた場合）、
３５はフラグ０を入力すべき記憶領域、３６は文字コ−
ドそのものを入力すべき記憶領域、３７は文字「い」を
表すグラフィックデ−タを示しており、記憶手段１３の
記憶領域３８にはフラグ０が、記憶領域３９には「い」
の文字コ−ド（０００２）が記憶される。In FIG. 7 (when handwriting is recognized)
35 is a storage area for inputting the flag 0, and 36 is a character code.
In the storage area to which the character itself is to be input, reference numeral 37 denotes graphic data representing the character "i". The storage area 38 of the storage means 13 has a flag 0, and the storage area 39 has "i".
Is stored.

【００３７】図８において（筆跡を認識できなかった場
合）、４０はフラグ１を入力すべき記憶領域、４１は入
力デ−タそのもののうちストロ−ク数を入力すべき記憶
領域、４２は入力デ−タそのもののうちベクトルデ−タ
を入力すべき記憶領域、４３は文字「い」を表すグラフ
ィックデ−タを示しており、記憶手段１３の記憶領域４
４にはフラグ１が、記憶領域４５にはストロ−ク数２
が、記憶領域４６にはストロ−クデ−タ（( ２，２，
２)(−１，６，６)(１，１，１) ( ９，２，１) (０，
１，５））が記憶される。この場合ストロ−クデ−タは
開始点のｘ座標、開始点のｙ座標、連続直線の個数の後
に連続直線の個数分だけ（向きのｘ成分、向きのｙ成
分、長さ）が続いたものとして表している。In FIG. 8 (when handwriting cannot be recognized), reference numeral 40 denotes a storage area for inputting the flag 1, 41 denotes a storage area for inputting the number of strokes of the input data itself, and 42 denotes an input area. Of the data itself, a storage area to which vector data is to be input, 43 indicates graphic data representing the character "i", and a storage area 4 of the storage means 13
4 is a flag 1 and a storage area 45 is 2 strokes.
However, in the storage area 46, the stroke data ((2, 2,
2) (-1, 6, 6) (1, 1, 1) (9, 2, 1) (0,
1, 5)) are stored. In this case, the stroke data has the x-coordinate of the starting point, the y-coordinate of the starting point, and the number of continuous straight lines, followed by the number of continuous straight lines (x component of direction, y component of direction, length). It is expressed as something.

【００３８】図９は、言語認識装置の実施例の第２変形
例を示すブロック図である。FIG. 9 is a block diagram showing a second modification of the embodiment of the language recognition apparatus.

【００３９】図９において、５１は入力手段、５２は音
声認識手段、５３は記憶手段、５４は出力手段、５５は
装置全体の作動を制御する制御手段である。In FIG. 9, reference numeral 51 denotes input means, 52 denotes voice recognition means, 53 denotes storage means, 54 denotes output means, and 55 denotes control means for controlling the operation of the entire apparatus.

【００４０】装置の中心に制御手段５５が配置されてお
り、マイクロプロセッサあるいは専用ＬＳＩなどを備え
ている。入力手段５１はマイクロホン等を備えた音声入
力装置であり、発音された音声を制御手段５５に入力す
る。出力手段５４は音声合成装置やスピ−カ等を備えた
音声出力装置で構成されている。A control means 55 is arranged at the center of the apparatus, and includes a microprocessor or a dedicated LSI. The input unit 51 is a voice input device provided with a microphone or the like, and inputs a pronounced voice to the control unit 55. The output means 54 is composed of a voice output device provided with a voice synthesizer, a speaker, and the like.

【００４１】音声認識手段５２も制御手段５５と同様、
マイクロプロセッサあるいは専用ＬＳＩなどを備え（制
御手段５５と共用し得る）、入力手段５１から入力され
た音声の認識を行う。The voice recognition means 52 is similar to the control means 55,
A microprocessor or a dedicated LSI is provided (which can be shared with the control unit 55), and recognizes voice input from the input unit 51.

【００４２】音声認識手段５２は、入力された音声を認
識することができれば、認識できたことを表すフラグと
その文字コ−ドを制御手段５５を介して記憶手段５３に
記憶させるが、入力された音声が認識できなければ認識
できなかったことを表すフラグとその入力デ−タそのも
のとを記憶手段５３に記憶させる。つまり、記憶手段５
３から読み出すデ−タには、それが文字コ−ドなのか入
力デ−タそのものなのかを表すフラグが付加されてい
る。If the voice recognition means 52 can recognize the input voice, it stores a flag indicating the recognition and the character code in the storage means 53 via the control means 55. If the voice cannot be recognized, the flag indicating that the voice cannot be recognized and the input data itself are stored in the storage means 53. That is, the storage unit 5
A flag is added to the data read from No. 3 to indicate whether the data is character code or input data itself.

【００４３】図１０は、図９に示す第２変形例による言
語の入力について説明するフローチャートである。[0043] Figure 10 is a word according to a second modification shown in FIG. 9
It is a flowchart explaining input of a word .

【００４４】図１０において、音声認識手段５２が入力
された音声の認識処理を行う（１０−１）。In FIG. 10, the voice recognition means 52 performs a recognition process on the input voice (10-1).

【００４５】ここで、、認識できたなら文字コ−ドであ
ることを表すフラグとその文字コ−ドとを制御手段５５
を介して記憶手段５３に記憶させる（１０−２、１０−
３）。Here, if the character code is recognized, a flag representing the character code and the character code are stored in the control means 55.
Through the storage means 53 (10-2, 10-
3).

【００４６】また、認識できなければ認識できなかった
ことを表すフラグ（これはそのデ−タが入力デ−タその
ものであることを表すフラグとも言える）と認識できな
かった入力デ−タそのものとを記憶手段５３に記憶させ
る（１０−２、１０−４）。If recognition is not possible, a flag indicating that recognition was not possible (this can also be referred to as a flag indicating that the data is input data itself) and the input data itself that could not be recognized Is stored in the storage means 53 (10-2, 10-4).

【００４７】次に、このように記憶された情報を出力す
るときの処理を説明する。Next, a process for outputting the information stored as described above will be described.

【００４８】まず、記憶手段５３からデ−タを読み込
み、そのデ−タに付加されているフラグの値を調べる。First, data is read from the storage means 53 and the value of the flag added to the data is checked.

【００４９】フラグが文字コ−ドを表すフラグであるな
ら、そのデ−タを文字コ−ドとみなし、その文字コ−ド
で表される文字の音声を出力手段５４に出力する。If the flag is a flag representing a character code, the data is regarded as a character code, and the voice of the character represented by the character code is output to the output means 54.

【００５０】フラグが認識することができなかったこと
を表すフラグであるなら、そのデ−タを入力デ−タその
もの（つまり入力時の音声）とみなし、その入力デ−タ
そのものを出力手段５４に出力する。If the flag is a flag indicating that it could not be recognized, the data is regarded as input data itself (that is, voice at the time of input), and the input data itself is output. Output to

【００５１】図１１は図９に示す第２変形例による言語
の出力について説明するフローチャートである。FIG. 11 is a flowchart for explaining the output of the language according to the second modification shown in FIG.

【００５２】記憶手段５３からデ−タを読み込み（１１
−１）、デ−タに付加されているフラグの値を調べる
（１１−２）。これが文字コ−ドを表すフラグであるな
ら読み込んだデ−タを文字コ−ドとみなし、その文字コ
−ドに対応する音声を出力手段５４に出力し（１１−
３）、このフラグが入力デ−タを表すフラグであるなら
そのデ−タを入力デ−タそのもの（つまり入力時の音
声）とみなして、出力手段５４に出力する（１１−
４）。Data is read from the storage means 53 (11)
-1) Check the value of the flag added to the data (11-2). If this is a flag indicating a character code, the read data is regarded as a character code, and a sound corresponding to the character code is output to the output means 54 (11-).
3) If this flag is a flag representing input data, the data is regarded as the input data itself (that is, voice at the time of input) and output to the output means 54 (11-).
4).

【００５３】デ−タにフラグを付加させる方法はいろい
ろあるが、簡単な例を以下に説明する。There are various methods for adding a flag to data. A simple example will be described below.

【００５４】最初のビットをフラグとし、０なら「文字
コ−ド」、１なら「入力デ−タ」を表すことにする。こ
のフラグが０のときはその後に文字コ−ド自体が続く。
このフラグが１の時には、音声を表すＰＣＭデ−タの大
きさを表す数値がその後に続き、さらにその後に音声を
表すＰＣＭデ−タ自体を置く。The first bit is used as a flag, and if 0, it indicates "character code", and if 1, it indicates "input data". When this flag is 0, the character code itself follows.
When this flag is 1, a numerical value representing the magnitude of the PCM data representing the voice follows, followed by the PCM data itself representing the voice.

【００５５】図１２は、入力時の音声を認識できた場合
のデ−タ構造を説明する図、図１３は、入力時の音声を
認識できなかった場合のデ−タ構造を説明する図であ
る。FIG. 12 is a diagram for explaining the data structure when the voice at the time of input can be recognized, and FIG. 13 is a diagram for explaining the data structure when the voice at the time of input cannot be recognized. is there.

【００５６】図１２において（音声を認識できた場
合）、６０はフラグ０を入力すべき記憶領域、６１は文
字コ−ドそのものを入力すべき記憶領域、６２は音声
「う」の波形を示しており、記憶手段５３の記憶領域６
３にはフラグ０が、記憶領域６４には「う」の文字コ−
ド（０００３）が記憶される。In FIG. 12 (when speech can be recognized), reference numeral 60 denotes a storage area for inputting the flag 0, 61 denotes a storage area for inputting the character code itself, and 62 denotes a waveform of the voice "U". And the storage area 6 of the storage means 53
3 is a flag 0, and the storage area 64 is a character code "U".
(0003) is stored.

【００５７】図１３において（音声を認識できなかった
場合）、６５はフラグ１を入力すべき記憶領域、６６は
入力デ−タのサイズ（入力デ−タのバイト数）を入力す
べき記憶領域、６７は入力デ−タそのもの（ＰＣＭデ−
タ）を入力すべき記憶領域、６８は入力された音声
「う」の波形を示しており、記憶手段５３の記憶領域６
９にはフラグ１が、記憶領域７０にはこの場合のサイズ
（９７５３）が、記憶領域７１にはＰＣＭデ−タ（８
０，２５，・・・３０，９６）が記憶される。In FIG. 13 (when speech cannot be recognized), 65 is a storage area for inputting the flag 1, and 66 is a storage area for inputting the size of input data (the number of bytes of input data). , 67 are the input data itself (PCM data).
And 68, a waveform of the input voice "U", and a storage area 68 of the storage means 53.
9, the flag 1 is stored in the storage area 70, and the PCM data (853) is stored in the storage area 71.
0, 25,... 30, 96) are stored.

【００５８】本第２変形例では、音声を認識して格納
し、その音声を後で出力できる装置を例にあげたが、こ
れは音声入力ワープロなどにも応用できる。本発明は、
音声が認識できなかったときにはそのことを表す記号を
ワープロの画面に表示させ、その部分をカーソルキーや
電子ペン等で指し示すと認識できなかった音声データそ
のものを出力するような音声入力ワープロである。In the second modified example, an apparatus which can recognize and store a voice and output the voice later is described as an example, but this can be applied to a voice input word processor or the like. The present invention
When the voice cannot be recognized, a symbol indicating the fact is displayed on a screen of a word processor, and when the portion is pointed by a cursor key, an electronic pen, or the like, the voice input word processor outputs the voice data itself that cannot be recognized.

【００５９】本実施例は、認識装置が認識できなかった
入力デ−タについてもその入力デ−タそのものを保存す
ることによって、使用者が後からそのデ−タを参照した
ときに何を入力したかを判別できるような認識処理を行
えるものである。In the present embodiment, even if input data that the recognition device cannot recognize is stored, the input data itself is stored, so that what is input when the user later refers to the input data. It is possible to perform a recognition process that can determine whether or not it has been performed.

【００６０】[0060]

【発明の効果】本発明は上述のように音声入力ワープロ
において、音声認識手段が認識できなかった音声データ
でも使用者が認識可能であることが多いため、音声認識
手段が入力した音声をデータを認識できないときは、ワ
ープロの画面での不認識の表示と、不認識表示部分への
指示による、記憶手段からの認識できなかった音声デー
タの出力により、音声認識させるとき入力した音声デー
タを生かして効率のよい音声入力が達成できるものであ
る。 According to the present invention, as described above, a speech input word processor is used.
, The voice data that the voice recognition means could not recognize
However, since the user can often recognize it, speech recognition
If the means cannot recognize the input voice,
-Unrecognized display on the professional screen and
Unrecognized audio data from the storage means according to the instruction
The voice data input when performing voice recognition
That can achieve efficient voice input
You.

[Brief description of the drawings]

【図１】本発明に関連のある言語認識装置の実施例を示
すブロック図である。FIG. 1 is a block diagram showing an embodiment of a language recognition device related to the present invention.

【図２】言語認識装置の実施例の第１変形例を示すブロ
ック図である。FIG. 2 is a block diagram showing a first modification of the embodiment of the language recognition device.

【図３】図２の第１変形例による文字の入力について説
明するフロ−チャ−トである。FIG. 3 is a flowchart for explaining character input according to a first modification of FIG. 2;

【図４】図２の第１変形例による文字の出力について説
明するフロ−チャ−トである。FIG. 4 is a flowchart for explaining character output according to a first modification of FIG. 2;

【図５】入力時の筆跡を認識できた場合のデ−タ構造を
説明する図である。FIG. 5 is a diagram illustrating a data structure when handwriting at the time of input is recognized.

【図６】入力時の筆跡を認識できなかった場合のデ−タ
構造を説明する図である。FIG. 6 is a view for explaining a data structure when handwriting at the time of input cannot be recognized.

【図７】入力時の筆跡を認識できた場合の他のデ−タ構
造を説明する図である。FIG. 7 is a diagram illustrating another data structure when handwriting at the time of input is recognized.

【図８】入力時の筆跡を認識できなかった場合の他のデ
−タ構造を説明する図である。FIG. 8 is a diagram for explaining another data structure when handwriting at the time of input cannot be recognized.

【図９】言語認識装置の実施例の第２変形例を示すブロ
ック図である。FIG. 9 is a block diagram showing a second modification of the embodiment of the language recognition device.

【図１０】図９の第２変形例による言語の入力について
説明するフローチャートである。FIG. 10 is a flowchart illustrating input of a language according to a second modification of FIG. 9;

【図１１】図９の第２変形例による言語の出力について
説明するフローチャートである。FIG. 11 is a flowchart illustrating language output according to a second modification of FIG. 9;

【図１２】入力時の音声を認識できた場合のデ−タ構造
を説明する図である。FIG. 12 is a diagram for explaining a data structure in a case where speech at the time of input has been recognized.

【図１３】入力時の音声を認識できなかった場合のデ−
タ構造を説明する図である。FIG. 13 is a diagram showing data when speech at the time of input cannot be recognized.
FIG. 3 is a diagram illustrating a data structure.

[Explanation of symbols]

１入力手段２認識手段３記憶手段４出力手段５制御手段 DESCRIPTION OF SYMBOLS 1 Input means 2 Recognition means 3 Storage means 4 Output means 5 Control means

フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 15/22 G06F 17/22 503 ＩＮＳＰＥＣ（ＤＩＡＬＯＧ) ＪＩＣＳＴファイル（ＪＯＩＳ) ＷＰＩ（ＤＩＡＬＯＧ) ＩＢＭＩｎｔｅｌｌｅｃｔｕａｌＰｒｏｐｅｒｔｙＮｅｔｗｏｒｋContinuation of the front page (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 15/22 G06F 17/22 503 INSPEC (DIALOG) JICST file (JOIS) WPI (DIALOG) IBM Intellectual Property Network

Claims

(57) [Claims]

A voice input unit for inputting voice data; a voice recognition unit for recognizing and processing voice data input from the voice input unit; together when recognizing the voice data the input is stored as character data, and means if the Symbol voice recognition unit does not recognize the voice data the input is for storing audio data said input, said The voice recognition means recognizes the input voice data.
If not, a control means for displaying unrecognition on a screen of a word processor and outputting voice data stored in the storage means in response to an instruction to the unrecognized display portion. Input word processor.