JPH08180137A

JPH08180137A - Character input device

Info

Publication number: JPH08180137A
Application number: JP6323031A
Authority: JP
Inventors: Jun Ito; 純伊藤; Hiromi Tomita; ひろみ冨田; Akira Nakajima; 晃中島; Hiroyuki Kumai; 裕之隈井; Yasumasa Matsuda; 泰昌松田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1994-12-26
Filing date: 1994-12-26
Publication date: 1996-07-12

Abstract

PURPOSE: To provide a character input device which outputs the most likelihood recognition result as Japanese and can improve a character recognition rate by utilizing general trends of Japanese. CONSTITUTION: A control means 103 executes a program stored in a storage means 102 to output character codes indicating >=1 character that is similar to writing data inputted by a coordinate input means 101 and further outputs one of the character codes as a recognition result. At this time, a character code indicating a character of the same kind as the kind of a character that a character code outputted as a last recognition result indicates is outputted preferentially as the recognition result by utilizing the general trend of Japanese that characters of the same kind appear successively.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字入力装置に係り、
特に、手書きされた文字の筆記データを文字コードに変
換し、変換した文字コードで表される文字を表示する文
字入力装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character input device,
In particular, the present invention relates to a character input device that converts handwritten data of handwritten data into a character code and displays a character represented by the converted character code.

【０００２】[0002]

【従来の技術】近年、コンピュータの小型化に伴い、可
搬性を重視した「ペンコンピュータ」と称される情報処
理装置が普及している。一般に、ペンコンピュータは、
持ち運ばれることを前提としているので、ペンおよびタ
ブレットを入力手段としており、操作者は、一方の手で
本体を持ち、もう一方の手でペンを持ちながらタブレッ
トに対して入力操作を行うことができる。ペンコンピュ
ータは、キーボードやマウスのように固定台を必要とし
ないので、移動中や立ったままでの操作に適している。2. Description of the Related Art In recent years, along with the miniaturization of computers, information processing apparatuses called "pen computers" which emphasize portability have become widespread. In general, pen computers
Since it is assumed to be carried around, the pen and tablet are used as input means, and the operator can perform input operations on the tablet while holding the main body with one hand and the pen with the other hand. it can. Since the pen computer does not require a fixed base unlike a keyboard and a mouse, it is suitable for operation while moving or standing.

【０００３】このように、操作者は、卓上コンピュータ
において文字を入力する場合は、キーボードで文字を入
力していたが、ペンコンピュータにおいては、ペンでタ
ブレットに対して文字を手書きして入力する。As described above, when the operator inputs a character on the desktop computer, the operator inputs the character on the keyboard, but on the pen computer, the operator manually writes the character on the tablet.

【０００４】このようにして入力された筆記データは、
文字認識処理が行われることによって、文字コードに変
換され、変換された文字コードで表される文字が表示さ
れる。The writing data input in this way is
By performing the character recognition process, the character code is converted into a character code, and the character represented by the converted character code is displayed.

【０００５】文字認識処理とは、文字の標準的な筆記パ
ターンである標準筆記パターンおよび文字を表す文字コ
ードを文字ごとに記憶している文字認識辞書を参照する
ことにより、入力された筆記データが標準筆記パターン
に似通っている度合いを示す一致度を求め、求めた一致
度が予め決められた値より大きい標準筆記パターンに対
応する文字コードを認識結果として出力する処理である
が、操作者が意図した文字を表す文字コードが常に認識
結果として出力されるわけではない。The character recognition processing refers to a character recognition dictionary that stores, for each character, a standard writing pattern that is a standard writing pattern of a character and a character code that represents the character. This is a process that calculates the degree of similarity indicating the degree of similarity to the standard writing pattern and outputs the character code corresponding to the standard writing pattern for which the calculated degree of matching is greater than a predetermined value as the recognition result. The character code that represents the specified character is not always output as the recognition result.

【０００６】例えば、「カ（カタカナ）」という文字と
「力（漢字）」という文字とは非常に似通っており、ま
た、「一（漢字）」という文字と「ー(長音)」という文
字とは非常に似通っているので、いずれの文字の筆記デ
ータが入力されたかを明確に区別することが困難であ
り、操作者が意図した文字を表す文字コードを認識結果
として出力することができない場合がある。このとき、
操作者は、認識結果を修正するための認識結果修正操作
を行わねばならず、操作上の負担となる。For example, the characters "ka" and "power" are very similar, and the characters "1" (kanji) and "-(long sound)" are very similar. Are very similar, it is difficult to clearly distinguish which character's writing data was input, and it may not be possible to output the character code representing the character intended by the operator as the recognition result. is there. At this time,
The operator has to perform a recognition result correction operation for correcting the recognition result, which is an operational burden.

【０００７】この操作者の負担を減らすためには、文字
認識処理の文字認識率（正しい認識結果を出力できる割
合）を向上させることが重要であり、現在は、９０％〜
９５％の文字認識率が得られるようになっている。In order to reduce the burden on the operator, it is important to improve the character recognition rate of character recognition processing (the rate at which a correct recognition result can be output).
A character recognition rate of 95% can be obtained.

【０００８】ところが、従来の文字認識処理は、１文字
ごとに、該文字の筆記データの一致度を求めるようにな
っていることから、上述した例のように似通っている文
字については、いずれの文字の筆記データが入力された
かを明確に区別することが依然として困難であった。However, in the conventional character recognition processing, since the degree of coincidence of the writing data of each character is obtained for each character, any character that is similar to the above-mentioned example will be recognized. It was still difficult to clearly distinguish whether or not the writing data of characters was input.

【０００９】そこで、文字認識処理に加えて、認識結果
が日本語文であることを前提にして、前後の認識結果を
参照することにより、認識結果を自動的に修正するとい
う単語処理を行うようにすることができる。Therefore, in addition to the character recognition processing, word processing is performed to automatically correct the recognition result by referring to the previous and next recognition results on the assumption that the recognition result is a Japanese sentence. can do.

【００１０】単語処理は、例えば、「カ（カタカナ）」
という文字の筆記データが入力されたか「力（漢字）」
という文字の筆記データが入力されたかを区別するため
に、前の認識結果を参照し、前の認識結果が「重」とい
う文字を表す文字コードであった場合には、「重力」と
いう単語を構成する文字の筆記データが入力されたらし
いと推測し、「カ（カタカナ）」という文字を表す文字
コードより「力（漢字）」という文字を表す文字コード
を、優先的に認識結果として出力するものである。な
お、単語処理を行うためには、日本語として存在する全
ての単語ごとに、該単語を構成する文字を表す文字コー
ドを記憶している単語辞書を備えるようにする必要があ
る。The word processing is, for example, "Ka (Katakana)".
"Writing power" (Kanji) has been entered or not
In order to distinguish whether or not the writing data of the character was input, the previous recognition result was referenced, and if the previous recognition result was the character code representing the character "heavy", the word "gravity" was used. It presumes that the writing data of the constituent characters seems to have been input, and preferentially outputs the character code indicating the character "Kanji" rather than the character code indicating the character "Ka (katakana)" as the recognition result. Is. In addition, in order to perform word processing, it is necessary to provide a word dictionary that stores, for each word existing in Japanese, a character code that represents a character forming the word.

【００１１】また、さらに、単語同士の接続関係を示す
情報を単語辞書に加えることによって、単語同士が接続
される可能性をも調べるようにしたの日本語処理を行う
ようにすることもできる。Furthermore, by adding information indicating the connection relation between words to the word dictionary, it is possible to perform Japanese processing so as to check the possibility that the words are connected.

【００１２】単語処理および日本語処理の詳細について
は、特開平１−２３１１８５号公報，特開平２−１２４
８１号公報を参照されたい。For details of word processing and Japanese language processing, see Japanese Patent Application Laid-Open Nos. 1-231185 and 2-124.
See the '81 publication.

【００１３】[0013]

【発明が解決しようとする課題】ところで、一般に、文
字認識処理は、実際には、１つの文字の筆記データにつ
いて、まず、１０個程度の文字コードを認識候補として
出力し、それらのうちの１つを認識結果として出力する
ようになっている。そこで、１つの文字の筆記データに
ついて、いずれの認識候補を認識結果として出力するか
を決定するためには、全ての認識候補の組み合わせに対
して単語処理や日本語処理を行わなければならず、例え
ば、２つの文字の筆記データについては、１０×１０通
りの認識候補の組み合わせが生じ、３つの文字の筆記デ
ータについては、１０×１０×１０通りの認識候補の組
み合わせが生じるので、これらの全ての認識候補の組み
合わせに対して単語処理や日本語処理を行うためには、
多大な処理時間を要し、また、ペンコンピュータのよう
な小型の情報処理装置の処理速度では、実用上困難であ
る。By the way, in general, in character recognition processing, first, about 10 character codes are first output as recognition candidates for writing data of one character, and one of them is output as a recognition candidate. One is output as a recognition result. Therefore, in order to determine which recognition candidate is to be output as the recognition result for the writing data of one character, word processing and Japanese processing must be performed on all combinations of the recognition candidates. For example, 10 × 10 combinations of recognition candidates occur for writing data of two characters, and 10 × 10 × 10 combinations of recognition candidates occur for writing data of three characters. In order to perform word processing and Japanese processing for the combination of recognition candidates of
It requires a great deal of processing time and is practically difficult at the processing speed of a small information processing apparatus such as a pen computer.

【００１４】そこで、本発明の目的は、単語処理や日本
語処理を行わずに、日本語の一般的な傾向を利用するこ
とにより、日本語として尤もらしい認識結果を出力し、
文字認識率を向上させることが可能な文字入力装置を提
供することにある。Therefore, an object of the present invention is to output a recognition result that is likely to be Japanese by utilizing the general tendency of Japanese without performing word processing and Japanese processing.
It is to provide a character input device capable of improving the character recognition rate.

【００１５】[0015]

【課題を解決するための手段】本発明で利用する日本語
の一般的な傾向としては、第１に、日本語においては、
同じ字種の文字が、いくつかの文字数分だけ連続して出
現することが多いという傾向が挙げられる。例えば、カ
タカナおよび英字は、文章中では名詞（外来語や固有名
詞等）として使用されることが多く、また、数字は、２
桁以上続くものが多い。As a general tendency of Japanese used in the present invention, firstly, in Japanese,
The tendency is that characters of the same character type often appear consecutively for several characters. For example, katakana and English letters are often used as nouns (foreign words, proper nouns, etc.) in sentences, and the number is 2
Many last more than one digit.

【００１６】また、本発明で利用する日本語の一般的な
傾向としては、第２に、日本語においては、数十万語の
単語が存在するが、日常生活で使用される単語は、たか
だか数百語程度であり、特定の単語が繰り返し出現する
ことが多いという傾向が挙げられる。特に、テーマが絞
られた１つの文章中では、第２の傾向は顕著である。As a general tendency of Japanese used in the present invention, secondly, there are hundreds of thousands of words in Japanese, but the words used in daily life are at most It is about several hundred words, and there is a tendency that a specific word often appears repeatedly. In particular, the second tendency is remarkable in one sentence with a narrowed theme.

【００１７】そこで、本発明は、上記第１の傾向を利用
して、文字認識処理によって出力された文字コードのう
ち、前回認識結果として出力された文字コードで表され
る文字の字種と同じ字種となるような文字コードを、優
先的に認識結果として出力するようにしている。Therefore, the present invention utilizes the first tendency described above, and of the character codes output by the character recognition processing, the same character type of the character represented by the character code output as the previous recognition result. Character codes that are character types are preferentially output as recognition results.

【００１８】また、本発明は、上記第２の傾向を利用し
て、認識結果として出力されたことがある連続する複数
の文字コードを、１つの単語を構成する要素である複数
の文字を表す文字コードであるとみなして記憶してお
き、以降は、この記憶内容に基づいて認識結果を出力す
るようにしている。ただし、認識結果として出力された
ことがある連続する２つの文字コードの全てを記憶する
わけではなく、例えば、連続する２つの漢字，連続する
２つのカタカナ，連続する２つの英字を表す文字コード
を記憶するようにしている。Further, according to the present invention, by utilizing the second tendency, a plurality of consecutive character codes that have been output as a recognition result represent a plurality of characters which are elements constituting one word. It is regarded as a character code and stored, and thereafter, the recognition result is output based on the stored content. However, it does not store all of the two consecutive character codes that have been output as recognition results. For example, a character code that represents two consecutive kanji characters, two consecutive katakana characters, and two consecutive alphabetic characters may be stored. I try to remember.

【００１９】具体的には、本発明は、上記第１の傾向を
利用した文字入力装置として、（１）手書きされた文字
の筆記データを入力する手書き入力手段、（２）文字ご
とに、該文字の標準的な筆記パターンである標準筆記パ
ターン、該文字を表す文字コード、および、該文字の字
種を記憶している文字認識辞書記憶手段、（３）上記手
書き入力手段が入力した筆記データが上記標準筆記パタ
ーンに似通っている度合いを示す一致度を求め、求めた
一致度が予め決められた値より大きい標準筆記パターン
に対応する文字コードおよび字種を、求めた一致度と共
に、文字認識情報として出力する文字認識手段、（４）
上記文字認識手段が出力した文字認識情報を記憶する文
字認識情報記憶手段、（５）文字の字種ごとに、該字種
の文字の次に出現する文字が何の字種となるかの確率を
示す、予め決められた確率値を記憶している字種連接情
報記憶手段、（６）上記字種連接情報記憶手段が記憶し
ている確率値に応じて、上記文字認識情報記憶手段が記
憶している文字認識情報ごとに、該文字認識情報が示す
一致度に加算すべき加算点を計算し、該一致度と計算し
た加算点との合計値が最も大きい文字認識情報を、認識
結果として出力する加算点計算手段、（７）上記手書き
入力手段が入力した筆記データ、および、上記加算点計
算手段が認識結果として出力した文字認識情報が示す文
字コードで表される文字を表示する表示手段、を備えた
文字入力装置を提供している。Specifically, the present invention provides a character input device utilizing the above first tendency, (1) handwriting input means for inputting handwritten data of handwritten characters, and (2) for each character, A standard writing pattern that is a standard writing pattern of characters, a character code that represents the character, and a character recognition dictionary storage unit that stores the character type of the character, (3) Writing data input by the handwriting input unit. The degree of similarity indicating the degree of similarity to the standard writing pattern is obtained, and the character code and the character type corresponding to the standard writing pattern for which the obtained degree of matching is larger than a predetermined value, together with the obtained degree of matching, are recognized. Character recognition means for outputting as information, (4)
Character recognition information storage means for storing the character recognition information output by the character recognition means, (5) For each character type, the probability that the character that appears next to the character of the character type will be Indicating that the character recognition information storage means stores a predetermined probability value, and (6) the character recognition information storage means stores in accordance with the probability value stored in the character connection information storage means. For each character recognition information being calculated, an addition point to be added to the degree of coincidence indicated by the character recognition information is calculated, and the character recognition information having the largest total value of the degree of coincidence and the calculated addition point is used as the recognition result. (7) Display means for displaying the character represented by the character code indicated by the character recognition information output as the recognition result by the handwriting input means, and the added point calculation means for outputting (7) the handwriting input means. Providing a character input device equipped with It is.

【００２０】なお、上記字種連接情報記憶手段が記憶し
ている確率値は、次に出現する文字の字種が前の文字の
字種と同じ字種となる確率値が、次に出現する文字の字
種が前の文字の字種と異なる字種となる確率値より大き
くなるようになっており、上記加算点計算手段は、上記
字種連接情報記憶手段が記憶している確率値に応じて文
字認識情報についての加算点を計算する際に、該文字認
識情報が示す字種と前回認識結果として出力された文字
認識情報が示す字種とから、該文字認識情報に対応する
確率値を求め、求めた確率値が大きいほど、該文字認識
情報についての加算点が大きくなるようにすることが好
ましい。The probability value stored in the character type concatenation information storage means is a probability value in which the character type of the character that appears next is the same as the character type of the previous character that appears next. The character type of the character is set to be larger than the probability value that the character type of the previous character is different from the character type of the previous character, and the addition point calculation means uses the probability value stored in the character type connection information storage means. Accordingly, when calculating the addition point for the character recognition information, the probability value corresponding to the character recognition information is determined from the character type indicated by the character recognition information and the character type indicated by the character recognition information output as the previous recognition result. It is preferable that the larger the calculated probability value, the larger the addition point for the character recognition information.

【００２１】また、上記加算点計算手段が認識結果とし
て出力した文字認識情報が示す字種と同じ字種を含む文
字認識情報が、それまでにいくつだけ連続して認識結果
として出力されているかを示す連続字種数を計算する連
続字種数計算手段をさらに備えるようにすることがで
き、このようにした場合は、上記加算点計算手段は、さ
らに、上記連続字種数計算手段が計算した連続字種数が
大きいほど、前回認識結果として出力された文字認識情
報が示す字種と同じ字種を含む文字認識情報についての
加算点が小さくなるようにすることができる。Further, how many character recognition information including the same character type as the character type indicated by the character recognition information output as the recognition result by the addition point calculation means have been continuously output as the recognition result until then. It is possible to further include a continuous character type number calculating means for calculating the number of continuous character types shown, and in this case, the addition point calculating means further calculates the continuous character type number calculating means. The larger the number of continuous character types, the smaller the addition point for the character recognition information including the same character type as the character type indicated by the character recognition information output as the previous recognition result.

【００２２】一方、本発明は、上記第２の傾向を利用し
た文字入力装置として、（１）手書きされた文字の筆記
データを入力する手書き入力手段、（２）文字ごとに、
該文字の標準的な筆記パターンである標準筆記パター
ン、該文字を表す文字コード、および、該文字の字種を
記憶している文字認識辞書記憶手段、（３）上記手書き
入力手段が入力した筆記データが上記標準筆記パターン
に似通っている度合いを示す一致度を求め、求めた一致
度が予め決められた値より大きい標準筆記パターンに対
応する文字コードおよび字種を、求めた一致度と共に、
文字認識情報として出力する文字認識手段、（４）上記
文字認識手段が出力した文字認識情報を記憶する文字認
識情報記憶手段、（５）１つの単語を構成する要素とな
る複数の文字を表す文字コードを組にした連接文字情報
を記憶する連接文字情報記憶手段、（６）上記連接文字
情報記憶手段が記憶している連接文字情報に基づいて、
上記文字認識情報記憶手段が記憶している文字認識情報
ごとに、該文字認識情報が示す一致度に加算すべき加算
点を計算し、該一致度と計算した加算点との合計値が最
も大きい文字認識情報を、認識結果として出力する加算
点計算手段、（７）上記手書き入力手段が入力した筆記
データ、および、上記加算点計算手段が認識結果として
出力した文字認識情報が示す文字コードで表される文字
を表示する表示手段、を備えた文字入力装置を提供して
いる。On the other hand, the present invention is, as a character input device utilizing the above second tendency, (1) handwriting input means for inputting handwritten data of handwritten characters, (2) for each character,
A standard writing pattern that is a standard writing pattern of the character, a character code that represents the character, and a character recognition dictionary storage unit that stores the character type of the character, (3) Writing input by the handwriting input unit The degree of coincidence indicating the degree to which the data is similar to the standard writing pattern is obtained, and the degree of coincidence obtained is a character code and character type corresponding to the standard writing pattern larger than a predetermined value, together with the obtained degree of coincidence.
Character recognition means for outputting as character recognition information, (4) Character recognition information storage means for storing the character recognition information output by the character recognition means, and (5) Characters representing a plurality of characters constituting an element of one word. A concatenated character information storage means for storing concatenated character information in the form of a code, (6) based on the concatenated character information stored in the concatenated character information storage means,
For each character recognition information stored in the character recognition information storage means, an addition point to be added to the degree of coincidence indicated by the character recognition information is calculated, and the sum of the degree of coincidence and the calculated addition point is the largest. The character recognition information is output as a recognition result by an addition point calculation means, (7) written by the handwriting input means, and a character code indicated by the character recognition information output by the addition point calculation means as a recognition result. There is provided a character input device provided with a display unit for displaying the displayed characters.

【００２３】そして、上記連接文字情報記憶手段は、予
め決められた同じ字種を示す複数の文字認識情報が認識
結果として連続して出力されているならば、これら複数
の文字認識情報が示す文字コードで表される文字が１つ
の単語を構成する要素となる複数の文字であるとみなし
て、これら複数の文字認識情報が示す文字コードを組に
したものを、上記連接文字情報として記憶するようにし
ている。If a plurality of character recognition information indicating the same predetermined character type is continuously output as the recognition result, the concatenated character information storage means stores the characters indicated by the plurality of character recognition information. It is assumed that the character represented by the code is a plurality of characters constituting one word, and a set of character codes indicated by the plurality of character recognition information is stored as the concatenated character information. I have to.

【００２４】なお、上記構成に加えて、上記表示手段が
表示した文字を表す文字コードを含む文字認識情報と、
該該文字認識情報以外の他の文字認識情報とを入れ替え
る旨の指示を受付ける受付手段をさらに備えるようにし
てもよく、このようにした場合は、上記表示手段は、上
記受付手段が入れ替える旨の指示を受付けた場合に、表
示している文字を、該入れ替える旨が指示された文字認
識情報が示す文字コードで表される文字に変更して表示
するようにし、また、上記連接文字情報記憶手段は、上
記受付手段が入れ替える旨の指示を受付けた場合に、予
め決められた同じ字種を示す複数の文字認識情報が認識
結果として連続して出力されているならば、これら複数
の文字認識情報が示す文字コードで表される文字が１つ
の単語を構成する要素となる複数の文字であるとみなし
て、これら複数の文字認識情報が示す文字コードを組に
したものを、上記連接文字情報として記憶するようにす
ることができる。In addition to the above configuration, character recognition information including a character code representing a character displayed by the display means,
The display means may further include a reception means for receiving an instruction to replace the character recognition information other than the character recognition information. In this case, the display means indicates that the reception means replaces the character recognition information. When the instruction is accepted, the displayed character is changed to the character represented by the character code indicated by the character recognition information instructed to be replaced, and displayed, and the concatenated character information storage means is also provided. If a plurality of character recognition information indicating the same predetermined character type is continuously output as a recognition result when the above-mentioned receiving means receives an instruction to replace the plurality of character recognition information, Considering that the character represented by the character code indicated by is a plurality of characters that form one word, and combining the character codes indicated by the plurality of character recognition information, It can be configured to store a contact character information.

【００２５】上記加算点計算手段は、上記連接文字情報
記憶手段が記憶している連接文字情報に基づいて文字認
識情報についての加算点を計算する際に、該文字認識情
報が示す文字コードと前回認識結果として出力された文
字認識情報が示す文字コードとを組にしたものを含む連
接文字情報が上記連接文字情報記憶手段に記憶されてい
る文字認識情報についての加算点を、予め決められた値
とするようにすることができる。The addition point calculating means calculates the addition point for the character recognition information based on the concatenated character information stored in the concatenated character information storage means, and the character code indicated by the character recognition information and the last time. The addition point for the character recognition information in which the concatenated character information including a combination with the character code indicated by the character recognition information output as the recognition result is stored in the concatenated character information storage means is a predetermined value. You can do so.

【００２６】なお、本明細書において、「文字」という
記述には、記号を含むものとする。In the present specification, the term "character" includes a symbol.

【００２７】[0027]

【作用】上述したように、一致度は、筆記データについ
て文字単位で文字認識処理を行った結果求められる値で
あり、一致度が最も大きい文字認識情報を認識結果とし
て出力すると、前後関係がいっさい参照されていない認
識結果が出力されることとなるので、区別することが困
難な文字の筆記データについて、操作者の意図通りの認
識結果が出力されない場合がある。そこで、本発明で
は、前回認識結果として出力された文字認識情報の内容
を参照することにより計算した加算点と一致度との合計
値が最も大きい文字認識情報を認識結果として出力する
ようにしている。As described above, the degree of coincidence is a value obtained as a result of performing character recognition processing on the writing data on a character-by-character basis, and when the character recognition information having the highest degree of coincidence is output as a recognition result, the context is not significant. Since the recognition result that is not referred to is output, the recognition result as intended by the operator may not be output for the writing data of the characters that are difficult to distinguish. Therefore, in the present invention, the character recognition information having the largest total value of the addition point and the degree of coincidence calculated by referring to the content of the character recognition information output as the previous recognition result is output as the recognition result. .

【００２８】前回認識結果として出力された文字認識情
報の内容のうち、文字認識情報についての加算点を計算
する際に参照する内容とは、上記第１の傾向を利用する
場合は字種であり、上述した第２の傾向を利用する場合
は文字コードである。Of the contents of the character recognition information output as the previous recognition result, the contents referred to when calculating the addition point for the character recognition information is the character type when the above first tendency is used. The character code is used when the second tendency described above is used.

【００２９】詳しくは、上記第１の傾向を利用した文字
入力装置において、上記字種連接情報記憶手段が記憶し
ている確率値は、字種ごとに、該字種の文字の次に出現
する文字の字種が何であるかを統計的に調査した結果に
基づいて、予め決められた値であり、特に、次に出現す
る文字の字種が前の文字の字種と同じ字種となる確率値
が、次に出現する文字の字種が前の文字の字種と異なる
字種となる確率値より大きくなるようになっている。Specifically, in the character input device using the first tendency, the probability value stored in the character type connection information storage means appears next to the character of the character type for each character type. It is a predetermined value based on the result of statistically investigating what kind of character the character type is, in particular, the character type of the next character will be the same as the character type of the previous character. The probability value is set to be larger than the probability value that the character type of the character that appears next is a character type that is different from the character type of the previous character.

【００３０】そこで、上記加算点計算手段が上記確率値
に応じて計算する文字認識情報についての加算点は、前
回認識結果として出力された文字認識情報が示す字種と
同じ字種を含む文字認識情報についての加算点が、前回
認識結果として出力された文字認識情報が示す字種と異
なる字種を含む文字認識情報についての加算点より大き
くなる。Therefore, the addition point for the character recognition information calculated by the addition point calculation means according to the probability value is the character recognition including the same character type as the character type indicated by the character recognition information output as the previous recognition result. The addition point for information is larger than the addition point for character recognition information including a character type different from the character type indicated by the character recognition information output as the previous recognition result.

【００３１】また、上記加算点計算手段は、文字認識情
報の加算点を計算すると、文字認識情報が示す一致度と
計算した加算点との合計値が最も大きい文字認識情報
を、認識結果として出力するので、前回認識結果として
出力された文字認識情報が示す字種と同じ字種を含む文
字認識情報が、優先的に認識結果として出力されること
となる。Further, when the addition point calculation means calculates the addition point of the character recognition information, the character recognition information having the largest sum of the degree of coincidence indicated by the character recognition information and the calculated addition point is output as the recognition result. Therefore, the character recognition information including the same character type as the character type indicated by the character recognition information output as the previous recognition result is preferentially output as the recognition result.

【００３２】従って、日本語として尤もらしい認識結果
を出力し、文字認識率を向上させることができるように
なる。Therefore, it is possible to output a recognition result that is likely to be Japanese and improve the character recognition rate.

【００３３】また、詳しくは、上記第２の傾向を利用し
た文字入力装置において、上記連接文字情報記憶手段が
記憶している連接文字情報は、１つの単語を構成する要
素となる複数の文字を表す文字コードを組にしたもので
ある。Further, in detail, in the character input device utilizing the second tendency, the concatenated character information stored in the concatenated character information storage means includes a plurality of characters which are elements constituting one word. It is a set of character codes to represent.

【００３４】そこで、上記加算点計算手段が上記連接文
字情報に基づいて計算する文字認識情報についての加算
点は、該文字認識情報が示す文字コードと前回認識結果
として出力された文字認識情報が示す文字コードとを組
にしたものを含む連接文字情報が上記連接文字情報記憶
手段に記憶されている文字認識情報についての加算点の
みが、予め決められた値となる。Therefore, the addition point for the character recognition information calculated by the addition point calculation means based on the concatenated character information is indicated by the character code indicated by the character recognition information and the character recognition information output as the previous recognition result. Only the addition points of the character recognition information stored in the concatenated character information storage unit of the concatenated character information including a combination of the character code and the character code have a predetermined value.

【００３５】また、上記加算点計算手段は、文字認識情
報についての加算点を計算すると、文字認識情報が示す
一致度と計算した加算点との合計値が最も大きい文字認
識情報を、認識結果として出力し、上記連接文字情報記
憶手段は、予め決められた同じ字種（例えば、カタカ
ナ，漢字，英字）を示す複数の文字認識情報が認識結果
として連続して出力されているならば、これら複数の文
字認識情報が示す文字コードで表される文字が１つの単
語を構成する要素となる複数の文字であるとみなして、
これら複数の文字認識情報が示す文字コードを組にした
ものを、上記連接文字情報として記憶するので、１度で
も認識結果として出力されたことがある、同じ字種を含
む連続する複数の文字認識情報は、以降は、優先的に認
識結果として出力されることとなる。Further, when the addition point calculation means calculates the addition point for the character recognition information, the character recognition information having the largest total value of the matching degree indicated by the character recognition information and the calculated addition point is set as the recognition result. If a plurality of character recognition information indicating the same predetermined character type (eg, Katakana, Kanji, English character) is continuously output as a recognition result, the concatenated character information storage means outputs the plurality of character recognition information. Considering that the character represented by the character code indicated by the character recognition information of is a plurality of characters that are elements that form one word,
Since a set of character codes indicated by the plurality of character recognition information is stored as the concatenated character information, a plurality of consecutive character recognitions including the same character type that have been output as a recognition result even once. After that, the information is preferentially output as the recognition result.

【００３６】従って、単語として学習するのと同様の効
果が得られるので、単語処理や日本語処理を行わなくて
も、日本語として尤もらしい認識結果を出力し、文字認
識率を向上させることができるようになる。さらに、単
語処理や日本語処理を行わないことから、高速に認識結
果を出力することができるようになる。Therefore, since the same effect as learning as a word can be obtained, it is possible to output a recognition result that is likely to be Japanese and improve the character recognition rate without performing word processing or Japanese processing. become able to. Furthermore, since word processing and Japanese language processing are not performed, the recognition result can be output at high speed.

【００３７】[0037]

【実施例】以下、本発明の実施例について図面を参照し
て説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００３８】図１は本実施例の文字入力装置の機能ブロ
ック図である。FIG. 1 is a functional block diagram of the character input device of this embodiment.

【００３９】図１において、１０１は座標入力手段、１
０２は記憶手段、１０３は制御手段、１０４は表示手
段、１０５は外部記憶手段、１０６は電源、１０７は電
源スイッチである。In FIG. 1, 101 is coordinate input means, 1
Reference numeral 02 is a storage unit, 103 is a control unit, 104 is a display unit, 105 is an external storage unit, 106 is a power supply, and 107 is a power switch.

【００４０】座標入力手段１０１は、具体的には、手書
きされた筆記データをオンラインで入力するタブレット
や、手書きされている筆記データをオフラインで入力す
るスキャナーであり、本実施例では、座標入力手段１０
１がタブレットである場合について説明する。The coordinate input means 101 is specifically a tablet for inputting handwritten writing data online or a scanner for inputting handwritten writing data offline. In the present embodiment, the coordinate input means 101 is used. 10
A case where 1 is a tablet will be described.

【００４１】記憶手段１０２は、具体的には、主メモリ
であり、各種プログラムおよび文字認識辞書等を格納し
ている。また、制御手段１０３は、具体的には、ＣＰＵ
であり、記憶手段１０２に格納されている各種プログラ
ムを実行する。The storage means 102 is specifically a main memory, and stores various programs, a character recognition dictionary, and the like. Further, the control means 103 is specifically a CPU
And executes various programs stored in the storage unit 102.

【００４２】表示手段１０４は、具体的には、液晶表示
装置やＣＲＴであり、座標入力手段１０１が入力した筆
記データや制御手段１０３による各種プログラムの実行
結果を表示する。本実施例では、表示手段１０４が液晶
表示装置である場合について説明する。The display unit 104 is specifically a liquid crystal display device or a CRT, and displays the writing data input by the coordinate input unit 101 and the execution results of various programs by the control unit 103. In this embodiment, a case where the display unit 104 is a liquid crystal display device will be described.

【００４３】外部記憶手段１０５は、具体的には、フロ
ッピィディスクやＩＣカードである。また、電源１０６
は、これらの手段に電流を供給し、電源スイッチ１０７
は、電源１０６による電流供給の入／切を操作者が指示
するためものである。The external storage means 105 is specifically a floppy disk or an IC card. In addition, the power source 106
Supplies current to these means, and the power switch 107
Is for the operator to instruct turning on / off of current supply by the power source 106.

【００４４】図２は本実施例の文字入力装置の外観図で
ある。FIG. 2 is an external view of the character input device of this embodiment.

【００４５】図２に示すように、本実施例の文字入力装
置は、大別して、本体２０１と付属のペン２０２とから
構成されている。本体２０１は、記憶手段１０２，制御
手段１０３，電源１０６を内蔵しており、また、電源ス
イッチ１０７，透明なタブレットの下に液晶表示装置を
配置した表示一体型タブレット２０３，外部記憶手段１
０５の挿入口２０４を備えている。As shown in FIG. 2, the character input device of this embodiment is roughly composed of a main body 201 and an attached pen 202. The main body 201 has a built-in storage means 102, control means 103, and power supply 106, a power switch 107, a display-integrated tablet 203 in which a liquid crystal display device is arranged under a transparent tablet, and an external storage means 1.
05 is provided with an insertion port 204.

【００４６】操作者が表示一体型タブレット２０３上を
ペン２０２で触れると、ペン２０２で触れられた位置
は、タブレットによってその座標データが検知され、検
知された座標データは、制御手段１０３に出力される。
これにより、操作者は、キーボードを用いずに、ペン２
０２で、筆記データを手書きしたり各種指示を行うこと
ができる。When the operator touches the display-integrated tablet 203 with the pen 202, the coordinate data of the position touched by the pen 202 is detected by the tablet, and the detected coordinate data is output to the control means 103. It
This allows the operator to use the pen 2 without using the keyboard.
With 02, handwriting data can be handwritten and various instructions can be given.

【００４７】次に、本実施例の文字入力装置における操
作者の操作の流れについて、図３〜図１０を用いて説明
する。Next, the flow of operations by the operator in the character input device of this embodiment will be described with reference to FIGS.

【００４８】図３は電源スイッチ１０７が投入された直
後の表示一体型タブレット２０３の画面表示例を示して
いる。FIG. 3 shows a screen display example of the display-integrated tablet 203 immediately after the power switch 107 is turned on.

【００４９】図３に示すように、表示一体型タブレット
２０３には、文字（文字コードで表される文字）を表示
する本文領域３０１と、次の文字の表示位置を示すカー
ソル３０２と、操作者が文字の筆記データを手書きする
文字枠３０３と、手書きされた筆記データについて文字
認識処理を行う旨を指示する認識ボタン３０４と、文字
認識処理によって出力された全ての認識候補の文字（文
字コードで表される文字）を表示する認識候補枠３０５
と、認識結果の文字（文字コードで表される文字）を本
文領域３０１に表示する旨を指示する確定ボタン３０６
とが表示されている。As shown in FIG. 3, the display-integrated tablet 203 has a body area 301 for displaying characters (characters represented by character codes), a cursor 302 for indicating the display position of the next character, and an operator. Is a character frame 303 for handwriting writing data of characters, a recognition button 304 for instructing to perform character recognition processing on the handwritten writing data, and all recognition candidate characters (in character codes) output by the character recognition processing. Recognition candidate frame 305 displaying (characters represented)
And a confirmation button 306 for instructing to display the recognition result character (character represented by the character code) in the body area 301.
And are displayed.

【００５０】操作者は、文字入力を行う場合に、まず、
図４に示すように、文字枠３０３にペン２０２で文字の
筆記データを手書きする。これにより、タブレットによ
って該筆記データの座標データが検知され、検知された
座標データに対応する液晶表示板の画素が反転表示され
て、該筆記データの筆跡が文字枠３０３内に表示され
る。なお、ここでは、操作者は、「日」という文字を意
図した筆記データを手書きしている。When inputting characters, the operator firstly
As shown in FIG. 4, the writing data of a character is handwritten in the character frame 303 with the pen 202. Accordingly, the tablet detects the coordinate data of the handwriting data, the pixel of the liquid crystal display plate corresponding to the detected coordinate data is reversely displayed, and the handwriting of the handwriting data is displayed in the character frame 303. Note that, here, the operator is handwriting the writing data intended for the character “day”.

【００５１】続いて、操作者は、図５に示すように、認
識ボタン３０４をペン２０２で触れることにより、該筆
記データについて文字認識処理を行う旨を指示する。こ
れにより、該筆記データについての文字認識処理が行わ
れて、１つ以上の認識候補が出力され、これらの認識候
補のうちの第１の認識候補の文字が、認識結果の文字と
して、該筆記データの代わりに文字枠３０３に表示され
る。ここでは、操作者の意図通りの文字（「日」）が文
字枠３０３に表示されている。なお、１つ以上の認識候
補のうちのいずれを第１の認識候補として決定するかに
ついては後述する。Then, as shown in FIG. 5, the operator touches the recognition button 304 with the pen 202 to instruct to perform the character recognition processing on the handwritten data. As a result, character recognition processing is performed on the writing data, one or more recognition candidates are output, and the character of the first recognition candidate among these recognition candidates is the character as the recognition result. It is displayed in the character frame 303 instead of the data. Here, the character (“day”) as intended by the operator is displayed in the character frame 303. Which of the one or more recognition candidates is to be determined as the first recognition candidate will be described later.

【００５２】続いて、操作者は、上述した操作と同様
に、図６に示すように、文字枠３０３にペン２０２で文
字の筆記データを手書きし、認識ボタン３０４をペン２
０２で触れることにより、該データについて文字認識処
理を行う旨を指示する。なお、ここでは、操作者は、
「立」という文字を意図した筆記データを手書きしてい
る。Then, similarly to the above-mentioned operation, the operator handwrites the writing data of the character with the pen 202 in the character frame 303 and presses the recognition button 304 with the pen 2 as shown in FIG.
By touching 02, it is instructed to perform character recognition processing on the data. Here, the operator is
I am handwriting handwritten data intended for the character "standing".

【００５３】これにより、該筆記データについての文字
認識処理が行われて、１つ以上の認識候補が出力され、
これらの認識候補のうちの第１の認識候補の文字が、認
識結果の文字として、該筆記データの代わりに文字枠３
０３に表示されるが、ここで、例えば、図７に示すよう
に、操作者の意図通りの文字（「日」）とな異なる文字
（「主」）が文字枠３０３に表示されてしまった場合
に、操作者は、第１の認識候補（認識結果）以外の他の
認識候補のうちの所望の認識候補を第１の認識候補に修
正するための認識結果修正操作を行う。As a result, character recognition processing is performed on the writing data, and one or more recognition candidates are output.
The character of the first recognition candidate among these recognition candidates is the character frame 3 instead of the writing data as the character of the recognition result.
03, but here, for example, as shown in FIG. 7, a different character (“main”) different from the character intended by the operator (“day”) is displayed in the character frame 303. In this case, the operator performs a recognition result correction operation for correcting a desired recognition candidate among the recognition candidates other than the first recognition candidate (recognition result) to the first recognition candidate.

【００５４】すなわち、操作者は、図８に示すように、
文字枠３０３に表示されている認識結果の文字
（「主」）をペン２０２で触れることにより、該文字認
識結果を第１の認識候補とする全ての認識候補の文字を
認識候補枠３０５に表示する旨を指示する。これによ
り、第１の認識候補を含む全ての認識候補の文字（ここ
では、「主」，「立」，「安」，「広」，「皿」）が認
識候補枠３０５に表示されるので、操作者は、これらの
文字のうちの意図通りの文字（「立」）をペン２０２で
触れることにより、該文字の認識候補を第１の認識候補
に修正する旨を指示する。これにより、操作者が修正す
る旨を指示した文字（「立」）の認識候補が第１の認識
候補に修正され、図９に示すように、修正後の第１の認
識候補の文字（「立」）が、修正前の第１の認識候補の
文字（「主」）に代わって文字枠３０３に表示される。That is, the operator, as shown in FIG.
By touching the character (“main”) of the recognition result displayed in the character frame 303 with the pen 202, all recognition candidate characters having the character recognition result as the first recognition candidate are displayed in the recognition candidate frame 305. Instruct to do so. As a result, all recognition candidate characters including the first recognition candidate (here, “main”, “standing”, “cheap”, “wide”, “plate”) are displayed in the recognition candidate frame 305. The operator touches the intended character (“upright”) of these characters with the pen 202 to instruct that the recognition candidate of the character should be corrected to the first recognition candidate. As a result, the recognition candidate of the character (“standing”) that the operator has instructed to correct is corrected to the first recognition candidate, and as shown in FIG. 9, the character of the first recognition candidate after correction (“ "" Is displayed in the character frame 303 in place of the character ("main") of the first recognition candidate before correction.

【００５５】操作者は、認識結果修正操作を終了する
と、図１０に示すように、確定ボタン３０６をペン２０
２で触れることにより、文字枠３０３に表示されている
認識結果の文字を本文領域３０１に表示する旨を指示す
る。これにより、文字枠３０３に表示されている認識結
果の文字（「日」，「立」）が、本文領域３０１に表示
される。When the operator finishes the recognition result correction operation, the operator pushes the confirm button 306 with the pen 20 as shown in FIG.
By touching with 2, it is instructed to display the character of the recognition result displayed in the character frame 303 in the body area 301. As a result, the characters (“day”, “standing”) as the recognition result displayed in the character frame 303 are displayed in the body area 301.

【００５６】さて、本実施例の文字入力装置は、文字認
識処理によって出力された１つ以上の認識候補のうちの
から第１の認識候補（認識結果）を決定する際に、従来
のような単語処理や日本語処理を行わずに、日本語の一
般的な傾向を利用することにより、日本語として尤もら
しい認識結果を出力し、文字認識率を向上させることを
可能としたものである。Now, the character input device of this embodiment is different from the conventional one when determining the first recognition candidate (recognition result) from the one or more recognition candidates output by the character recognition processing. By utilizing the general tendency of Japanese language without performing word processing or Japanese language processing, it is possible to output a recognition result that is likely to be Japanese and improve the character recognition rate.

【００５７】本実施例で利用する日本語の一般的な傾向
としては、第１に、日本語において、カタカナおよび英
字は、文章中では名詞（外来語や固有名詞等）として使
用されることが多く、従って、いくつかの文字数分だけ
連続して出現することが多いという傾向が挙げられる。
また、日本語において、数字は、２桁以上続くものが多
く、従って、いくつかの文字数分だけ連続して出現する
ことが多いという傾向が挙げられる。As a general tendency of Japanese used in this embodiment, firstly, in Japanese, katakana and English characters are used as nouns (foreign words, proper nouns, etc.) in sentences. Therefore, there is a tendency that a large number of characters appear consecutively in many cases.
Further, in Japanese, there are many numbers in which two or more digits continue, and therefore, there is a tendency that the numbers often appear continuously for several characters.

【００５８】この第１の傾向を利用すると、例えば、筆
記データの手書きの仕方によっては、操作者が「シ（カ
タカナ）」，「ス（カタカナ）」，「テ（カタカ
ナ）」，「ム（カタカナ）」という文字を意図していた
のに、「シ」，「又（漢字）」，「千（漢字）」，
「ム」という認識結果の文字が表示されてしまったり、
操作者が「Ｐ（英字）」，「Ｏ（英字）」，「Ｔ（英
字）」，「Ａ（英字）」，「Ｔ（英字）」，「Ｏ（英
字）」という文字を意図していたのに、「Ｐ」，「０
（数字）」，「丁（漢字）」，「Ａ」，「丁（漢
字）」，「０（数字）」という認識結果の文字が表示さ
れてしまうことを防止することができる。When this first tendency is used, for example, depending on how the handwriting data is handwritten, the operator can select "shi (katakana)", "su (katakana)", "te (katakana)", "mu ( "Katakana)" was intended, but "shi", "mata (kanji)", "thousand (kanji)",
The character of the recognition result "Mu" is displayed,
The operator intends the characters "P (alphabet)", "O (alphabet)", "T (alphabet)", "A (alphabet)", "T (alphabet)", "O (alphabet)" However, "P", "0
It is possible to prevent the characters of the recognition result such as "(number)", "Ding (kanji)", "A", "Ding (kanji)", and "0 (number)" from being displayed.

【００５９】そこで、この第１の傾向を利用するための
機能として、本実施例では、前の認識結果の文字の字種
を引き継ぐような認識候補を第１の認識候補（認識結
果）として決定する機能を設けるようにしている。以
下、この機能を字種継承機能と称する。Therefore, as a function for utilizing this first tendency, in the present embodiment, a recognition candidate that inherits the character type of the character of the previous recognition result is determined as the first recognition candidate (recognition result). I have a function to do. Hereinafter, this function is referred to as a character type inheritance function.

【００６０】また、本実施例で利用する日本語の一般的
な傾向としては、第２に、日本語においては、数十万語
の単語が存在するが、日常生活で使用する単語はたかだ
か数百語程度であり、特定の単語が繰り返し出現するこ
とが多いという傾向が挙げられる。As a general tendency of Japanese used in this embodiment, secondly, there are hundreds of thousands of words in Japanese, but only a few words are used in daily life. It is about a hundred words, and there is a tendency that a specific word often appears repeatedly.

【００６１】そこで、この第２の傾向を利用するための
機能として、本実施例では、１度入力された単語を学習
しておき、以降は、学習内容に基づいて認識結果（第１
の認識候補）を決定する機能を設けるようにしている。
ただし、本実施例では、従来のように単語辞書を参照せ
ずに、連続する２つの漢字，連続する２つのカタカナ，
連続する２つの英字を、１つの単語を構成する要素とみ
なして、学習するようにする。以下、この機能を連接学
習機能と称する。Therefore, as a function for utilizing this second tendency, in the present embodiment, the word input once is learned, and thereafter, the recognition result (first
(Recognition candidate of) is provided.
However, in the present embodiment, two consecutive Kanji, two consecutive Katakana,
Two consecutive letters are regarded as the elements that make up one word, and learning is performed. Hereinafter, this function is referred to as a connection learning function.

【００６２】なお、本実施例では、字種継承機能および
連接学習機能の２つの機能を設けるようにして説明して
いるが、いずれか一方のみを設けるようにすることも容
易にできることはいうまでもない。In the present embodiment, two functions, that is, the character type inheritance function and the connection learning function are provided, but it is easy to provide only one of them. Nor.

【００６３】本実施例の文字入力装置において、文字認
識処理、および、字種継承機能および連接学習機能を実
現するための処理は、具体的には、制御手段１０３が、
記憶手段１０２に格納されている各種プログラムを実行
することにより実現することができる。また、記憶手段
１０２には、各種プログラムが使用する各種データ（文
字認識辞書を含む。）も格納される。In the character input device of the present embodiment, the character recognition processing and the processing for realizing the character type inheritance function and the connection learning function are specifically performed by the control means 103.
It can be realized by executing various programs stored in the storage unit 102. The storage unit 102 also stores various data (including a character recognition dictionary) used by various programs.

【００６４】図１１は記憶手段１０２の内容を模式的に
示す図である。FIG. 11 is a diagram schematically showing the contents of the storage means 102.

【００６５】図１１に示すように、記憶手段１０２は、
各種プログラム領域１１０１と、認識候補データ領域１
１０２と、連接頻度データ領域１１０３と、減衰パラメ
ータデータ領域１１０４と、字種継承文字数領域１１０
５と、連接記憶データ領域１１０６と、座標データ領域
１１０７と、本文データ領域１１０８とから構成されて
いる。As shown in FIG. 11, the storage means 102 is
Various program areas 1101 and recognition candidate data area 1
102, a concatenation frequency data area 1103, an attenuation parameter data area 1104, and a character type inherited character number area 110.
5, a connected storage data area 1106, a coordinate data area 1107, and a body data area 1108.

【００６６】各種プログラム領域１１０１は、文字認識
処理を行う文字認識処理プログラム，後述するメイン処
理プログラム，後述する連接学習処理プログラム，後述
する加算点計算処理プログラムを含む各種プログラムが
格納される領域である。The various program areas 1101 are areas for storing various programs including a character recognition processing program for performing character recognition processing, a main processing program described later, a connection learning processing program described later, and an addition point calculation processing program described later. .

【００６７】認識候補データ領域１１０２は、文字認識
処理によって出力された認識候補が格納される領域であ
り、この詳細については後述する。The recognition candidate data area 1102 is an area in which the recognition candidates output by the character recognition processing are stored, and details thereof will be described later.

【００６８】連接頻度データ領域１１０３，減衰パラメ
ータデータ領域１１０４，字種継承文字数領域１１０５
は、字種継承機能を実現するために必要なデータが格納
される領域であり、これらの詳細については後述する。Concatenation frequency data area 1103, attenuation parameter data area 1104, character type inherited character number area 1105
Is an area for storing data necessary for realizing the character type inheritance function, and details of these areas will be described later.

【００６９】連接記憶データ領域１１０６は、連接学習
機能を実現するために必要なデータが格納される領域で
あり、この詳細については後述する。The concatenation storage data area 1106 is an area for storing data necessary for realizing the concatenation learning function, and details thereof will be described later.

【００７０】座標データ領域１１０７は、手書きされた
筆記データの座標データが格納される領域であり、本文
データ領域１１０８は、本文領域３０１に表示されてい
る文字の文字コードが格納される領域である。The coordinate data area 1107 is an area in which the coordinate data of handwritten writing data is stored, and the body data area 1108 is an area in which the character code of the character displayed in the body area 301 is stored. .

【００７１】なお、これらの領域１１０１〜１１０８
は、必要に応じて、一部または全部が独立した個々のメ
モリで実現するようにすることもできる。Incidentally, these areas 1101 to 1108
Can be realized by individual memories, which are partially or wholly independent, as required.

【００７２】図１２は認識候補データ領域１１０２の内
容を模式的に示す図であり、図７に示した画面表示とな
っている状態での内容を示している。FIG. 12 is a diagram schematically showing the contents of the recognition candidate data area 1102, and shows the contents in the screen display shown in FIG.

【００７３】図１２において、１２０１は１つ目の文字
枠３０３に手書きされた筆記データについての認識候補
が格納される認識候補記憶領域、１２０２は２つ目の文
字枠３０３に手書きされた筆記データについての認識候
補が格納される認識候補記憶領域であり、認識候補デー
タ領域１１０４には、少なくとも文字枠３０３の数分だ
けの認識候補記憶領域が設けられている。In FIG. 12, 1201 is a recognition candidate storage area in which recognition candidates for handwritten data handwritten in the first character frame 303 are stored, and 1202 is handwritten data handwritten in the second character frame 303. Is a recognition candidate storage area in which the recognition candidates are stored, and the recognition candidate data area 1104 is provided with at least as many recognition candidate storage areas as the number of character frames 303.

【００７４】認識候補記憶領域は、全ての認識候補のそ
れぞれについて、該認識候補の文字コードが格納される
文字コードフィールド１２０３と、該認識候補の文字の
字種が格納される字種フィールド１２０４と、該認識候
補の一致度が格納される一致度フィールド１２０５と、
字種継承機能または連接学習機能の出力結果である加算
点が格納される加算点フィールド１２０６とから構成さ
れている。The recognition candidate storage area includes, for each of all the recognition candidates, a character code field 1203 in which the character code of the recognition candidate is stored, and a character type field 1204 in which the character type of the character of the recognition candidate is stored. , A matching degree field 1205 in which the matching degree of the recognition candidate is stored,
The addition point field 1206 stores addition points which are output results of the character type inheritance function or the connection learning function.

【００７５】例えば、認識候補記憶領域１２０１におい
て、文字コードフィールド１２０３には、「日」，
「月」，「円」，「目」，「白」とう５つの文字の認識
候補の文字コードが格納されており、字種フィールド１
２０４には、それぞれの文字が漢字である旨を示す
「漢」が格納されている。For example, in the recognition candidate storage area 1201, in the character code field 1203, "day",
The character codes of the recognition candidates for the five characters "month", "yen", "eyes", and "white" are stored in the character type field 1
In “204”, “Kan” indicating that each character is a Kanji is stored.

【００７６】なお、文字認識処理においては、文字ごと
に、標準的な筆記データである標準筆記パターンとそれ
に対応する文字コードとが格納されている文字認識辞書
を参照することにより、認識候補を出力するようになっ
ている。すなわち、手書きされた筆記データと文字認識
辞書中の標準筆記パターンとを照合して、両者の形が似
通っている度合いを示す値（ここでは、値が大きいほど
似通っている度合いが高いとものとする。）を一致度と
して求め、求めた一致度が予め決められた閾値より大き
い場合に、該標準筆記パターンに対応する文字コード
を、求めた一致度と共に、認識候補として出力するよう
になっている。そこで、出力された認識候補の文字コー
ドが、文字コードフィールド１２０３に格納され、ま
た、該認識候補の一致度が、一致度フィールド１２０５
に格納されることとなる。In the character recognition process, a recognition candidate is output for each character by referring to a character recognition dictionary that stores standard writing patterns that are standard writing data and corresponding character codes. It is supposed to do. That is, by collating the handwritten writing data with the standard writing pattern in the character recognition dictionary, a value indicating the degree of similarity between the two shapes (here, the larger the value, the higher the degree of similarity. ) As the degree of coincidence, and when the degree of coincidence obtained is larger than a predetermined threshold value, the character code corresponding to the standard writing pattern is output as a recognition candidate together with the degree of coincidence obtained. There is. Therefore, the character code of the output recognition candidate is stored in the character code field 1203, and the matching degree of the recognition candidate is stored in the matching degree field 1205.
Will be stored in.

【００７７】本実施例では、文字認識辞書中に、さら
に、文字の字種（ひらがな，カタカナ，漢字，数字，英
字，記号）が格納されるようにしており、文字認識処理
が、文字コードおよび一致度と共に、字種も出力するよ
うにしている。そこで、出力された認識候補の字種が、
字種フィールド１２０４に格納されることとなる。In this embodiment, the character recognition dictionary stores the character types of the characters (Hiragana, Katakana, Kanji, numbers, letters, symbols). The character type is also output together with the degree of coincidence. Therefore, the output recognition character type is
It is stored in the character type field 1204.

【００７８】ところで、従来の文字認識処理において
は、一致度が最も大きい認識候補が第１の認識候補とし
て決定され、優先的に認識結果として出力されるように
なっていた。しかし、この一致度は、筆記データと標準
筆記パターンとを文字単位で照合した結果に基づいて求
められた値であるので、前後関係がいっさい参照されて
いない認識結果が出力されることとなっていた。このた
め、上述したように、「シ」，「ス」，「テ」，「ム」
という文字の認識結果や「Ｐ」，「Ｏ」，「Ｔ」，
「Ａ」，「Ｔ」，「Ｏ」という文字の認識結果が意図通
り得られない場合がある。By the way, in the conventional character recognition processing, the recognition candidate having the highest degree of coincidence is determined as the first recognition candidate and is preferentially output as the recognition result. However, this degree of matching is a value obtained based on the result of collating the writing data with the standard writing pattern on a character-by-character basis, so that a recognition result in which the context is not referenced is output. It was Therefore, as described above, "shi", "su", "te", "mu"
"P", "O", "T",
In some cases, the recognition result of the characters "A", "T", "O" cannot be obtained as intended.

【００７９】そこで、本実施例では、前の文字の認識結
果を参照することにより、認識候補ごとの加算点を計算
し、計算した加算点を加算点フィールド１２０６に格納
するようにしている。そして、１つ以上の認識候補のう
ちから第１の認識候補を決定する際に、一致度フィール
ド１２０５に格納されている一致度と加算点フィールド
１２０６に格納されている加算点とを合計した値（以
下、優先度と称す。）が最も大きい認識候補を第１の認
識候補として決定するようにしている。なお、加算点を
計算する加算点計算処理の詳細については後述する。Therefore, in the present embodiment, the addition point for each recognition candidate is calculated by referring to the recognition result of the previous character, and the calculated addition point is stored in the addition point field 1206. A value obtained by summing the matching score stored in the matching score field 1205 and the addition points stored in the addition score field 1206 when determining the first recognition candidate from the one or more recognition candidates. The recognition candidate having the highest (hereinafter referred to as priority) is determined as the first recognition candidate. The details of the addition point calculation processing for calculating the addition points will be described later.

【００８０】図１３は連接頻度データ領域１１０３の内
容を模式的に示す図である。FIG. 13 is a diagram schematically showing the contents of the concatenation frequency data area 1103.

【００８１】連接頻度データ領域１１０３には、字種ご
とに、該字種の文字の次に出現する文字の字種が何であ
るかを統計的に調査した結果に基づいて予め決められた
値が格納されている。In the concatenation frequency data area 1103, for each character type, a value determined in advance based on the result of statistically investigating what kind of character the character that appears next to the character type is It is stored.

【００８２】図１３において、ｉは前の文字の字種を示
し、ｊはその次の文字の字種を示している。ｉ行ｊ列は
字種ｉの次に字種ｊが出現する確率Ｐijを示している。
例えば、前の文字がカタカナである場合に、次の文字が
カタカナである確率は、７０％であることを示してい
る。In FIG. 13, i indicates the character type of the previous character, and j indicates the character type of the next character. The i-th row and j-th column shows the probability Pij that the character type j appears next to the character type i.
For example, when the previous character is katakana, the probability that the next character is katakana is 70%.

【００８３】図１４は減衰パラメータデータ領域１１０
４の内容を模式的に示す図である。FIG. 14 shows the attenuation parameter data area 110.
It is a figure which shows the content of 4 typically.

【００８４】カタカナ，英字，数字は、上述したように
連続して出現することが多いという傾向があるが、連続
する文字数は有限である。そこで、本実施例では、字種
継承機能が、同じ字種の文字が連続するごとに、徐々に
その機能を減衰していくようになっており、減衰パラメ
ータデータ領域１１０４には、そのための減衰率が格納
されている。Katakana, letters, and numbers tend to appear consecutively as described above, but the number of consecutive characters is finite. Therefore, in the present embodiment, the character type inheritance function is designed to gradually attenuate the function each time a character of the same character type continues, and the attenuation parameter data area 1104 includes the attenuation for that purpose. The rate is stored.

【００８５】同じ字種の文字が連続する文字数の平均は
字種ごとに異なるので、減衰率は、字種ごとに、該字種
の文字が連続する文字数を統計的に調査した結果に基づ
いて予め決められた値となっている。Since the average number of consecutive characters of the same character type differs for each character type, the attenuation rate is based on the result of statistical investigation of the number of consecutive characters of the character type for each character type. It is a predetermined value.

【００８６】図１５は継承文字数領域１１０５の内容を
模式的に示す図である。FIG. 15 is a diagram schematically showing the contents of the inherited character number area 1105.

【００８７】継承文字数領域１１０５には、減衰パラメ
ータデータ領域１１０４に格納されている減衰率と共
に、字種継承機能を減衰するために使用される継承文字
数が格納される。継承文字数は、第１の認識候補として
決定した認識候補の文字の字種が、それまでに何文字分
だけ連続して同じ字種であったかを示す値である。例え
ば、図９に示したように、操作者が認識結果修正操作を
行った後の画面表示となっている状態では、継承文字数
領域１１０５には、「２」が格納されることとなる。In the inherited character number area 1105, the number of inherited characters used for attenuating the character type inheritance function is stored together with the attenuation rate stored in the attenuation parameter data area 1104. The inherited character number is a value indicating how many characters the character type of the character of the recognition candidate determined as the first recognition candidate has been the same character type up to that time. For example, as shown in FIG. 9, “2” is stored in the inherited character number area 1105 in a state where the screen is displayed after the operator performs the recognition result correction operation.

【００８８】図１６は連接記憶データ領域１１０６の内
容を模式的に示す図である。FIG. 16 is a diagram schematically showing the contents of the concatenated storage data area 1106.

【００８９】連接記憶データ領域１１０６には、操作者
が認識結果修正操作を行った場合に、修正する旨が指示
された文字の文字コードと１つ前の文字の文字コードと
を組にしたもの（以下、連接文字コード組と称す。）が
格納される。例えば、図９に示したように、操作者が認
識結果修正操作を行った後の画面表示となっている状態
では、連接記憶データ領域１１０６には、「日」という
文字の文字コードと「立」という文字の文字コードとを
組にした連接文字コード組が格納されることとなる。In the concatenated storage data area 1106, when the operator performs a recognition result correction operation, the character code of the character for which correction is instructed and the character code of the preceding character are paired. (Hereinafter referred to as a concatenated character code set) is stored. For example, as shown in FIG. 9, in the state where the screen display is displayed after the operator performs the recognition result correction operation, the character code of the character “day” and the “standing” are displayed in the connected storage data area 1106. A concatenated character code set that is a combination of the character code of the character "" is stored.

【００９０】ただし、連接記憶データ領域１１０６に
は、操作者が修正する旨を指示した文字の字種と１つ前
の文字（文字枠３０３に表示されている文字）の字種と
が同じ場合であり、かつ、その字種が漢字，カタカナ，
英字のうちのいずれかであった場合に限り、その内容が
格納される。However, in the concatenated storage data area 1106, when the character type of the character instructed by the operator to be corrected is the same as the character type of the preceding character (the character displayed in the character frame 303). And the type is Kanji, Katakana,
The content is stored only if it is one of the alphabetic characters.

【００９１】漢字，カタカナ，英字は、単語として繰り
返し入力されることが多いので、１度入力された連続す
る２つの漢字，連続する２つのカタカナ，連続する２つ
の英字を、１つの単語を構成する要素とみなして、それ
らの文字コードを組にした連接文字コード組を連接記憶
データ領域１１０６に格納しておき、以降は、連接記憶
データ領域１１０６に連接文字コード組として格納され
ている文字コードの認識候補を優先的に第１の認識候補
として決定することにより、単語処理や日本語処理を行
わなくても、日本語として尤もらしい認識結果を出力
し、文字認識率を高めることができる。また、単語処理
や日本語処理を行わないことから、高速に認識結果を出
力することができるようになる。Since Kanji, Katakana, and English characters are often repeatedly input as words, two consecutive Kanji characters, two consecutive Katakana, and two consecutive alphabetical characters that have been input once constitute one word. The concatenated character code set, which is a combination of those character codes, is stored in the concatenated storage data area 1106, and thereafter, the character code stored as the concatenated character code set in the concatenated storage data area 1106. By preferentially deciding the recognition candidate of No. 1 as the first recognition candidate, it is possible to output a recognition result that is likely to be Japanese and improve the character recognition rate without performing word processing or Japanese processing. Also, since word processing and Japanese processing are not performed, the recognition result can be output at high speed.

【００９２】次に、本実施例の文字入力装置において、
字種継承機能および連接学習機能を実現するための処理
について、図１７〜図２２を用いて説明する。Next, in the character input device of this embodiment,
Processing for realizing the character type inheritance function and the connection learning function will be described with reference to FIGS. 17 to 22.

【００９３】図１７はメイン処理プログラムのフローチ
ャートである。FIG. 17 is a flow chart of the main processing program.

【００９４】メイン処理プログラムは、電源スイッチ１
０７が投入されると起動され、まず、ハードウェア的な
初期化処理を行う（ステップ１７０１）。続いて、タブ
レットが座標データを検知したか否かを判定し（ステッ
プ１７０２）、座標データを検知したと判定した場合
は、該座標データが、文字枠３０３内にあるか（ステッ
プ１７０３）、認識ボタン３０４内にあるか（ステップ
１７０４）、確定ボタン３０６内にあるか（ステップ１
７０５）、認識候補枠３０５内にあるか（ステップ１７
０６）を調べる。The main processing program is the power switch 1
It is activated when 07 is input, and first, a hardware-like initialization process is performed (step 1701). Subsequently, it is determined whether or not the tablet has detected the coordinate data (step 1702). When it is determined that the coordinate data has been detected, whether the coordinate data is within the character frame 303 (step 1703), recognition is performed. Whether it is in the button 304 (step 1704) or in the confirm button 306 (step 1)
705), is it within the recognition candidate frame 305 (step 17)
06).

【００９５】タブレットが検知した座標データが文字枠
３０３内にある場合は（ステップ１７０３）、文字枠３
０３に筆記データが手書きされた場合（図４に示した画
面表示となっている状態）、および、文字枠３０３に表
示されている文字が触れられた場合と（図８に示した画
面表示となっている状態）の２つの場合が考えられるの
で、文字枠３０３に文字が表示されているか否かを調べ
る（ステップ１７０７）。If the coordinate data detected by the tablet is within the character frame 303 (step 1703), the character frame 3
When handwriting data is written in 03 (the screen display shown in FIG. 4) and when the characters displayed in the character frame 303 are touched (screen display shown in FIG. 8). Since there are two possible cases, it is checked whether or not characters are displayed in the character frame 303 (step 1707).

【００９６】文字枠３０３に文字が表示されていないな
らば、文字枠３０３に筆記データが手書きされた場合で
あることを意味しているので、タブレットが検知した座
標データに対応する液晶表示装置の画素を反転表示さ
せ、図４に示した画面表示のように、手書きされた筆記
データの筆跡を表示すると共に（ステップ１７０８）、
該座標データを座標データ領域１１０７に格納する（ス
テップ１７０９）。また、文字枠３０３に文字が表示さ
れているならば、文字枠３０３に表示されている文字が
触れられた場合であることを意味しているので、該文字
の認識候補を第１の認識候補とする全ての認識候補を、
認識候補データ領域１１０２から取り出して、図８に示
した画面表示のように、取り出した認識候補の文字を認
識候補枠３０５に表示する（ステップ１７１０）。If no character is displayed in the character frame 303, it means that the writing data is handwritten in the character frame 303. Therefore, the liquid crystal display device corresponding to the coordinate data detected by the tablet is displayed. The pixels are displayed in reverse, and the handwriting of handwritten writing data is displayed as in the screen display shown in FIG. 4 (step 1708),
The coordinate data is stored in the coordinate data area 1107 (step 1709). Further, if a character is displayed in the character frame 303, it means that the character displayed in the character frame 303 is touched. Therefore, the recognition candidate of the character is the first recognition candidate. All recognition candidates
The character of the extracted recognition candidate is extracted from the recognition candidate data area 1102 and displayed in the recognition candidate frame 305 as in the screen display shown in FIG. 8 (step 1710).

【００９７】また、タブレットが検知した座標データが
認識ボタン３０４内にある場合は（ステップ１７０
４）、認識ボタン３０４が触れられたことを意味してい
るので、座標データ領域１１０７に座標データが格納さ
れているか否かを調べ（ステップ１７１１）、座標デー
タが格納されているならば、該座標データ（筆記デー
タ）について文字認識処理を行うように、文字認識処理
プログラムを起動する（ステップ１７１２）。文字認識
処理プログラムが行う文字認識処理は上述した通りであ
り、文字認識処理プログラムが文字認識処理を終了する
と、文字認識処理によって出力された認識候補（文字コ
ード，字種，一致度）を、一致度が大きい順に、認識候
補データ領域１１０２に格納し（ステップ１７１３）、
格納した時点での第１の認識候補の文字の字種と同じ字
種の文字（文字枠３０３に表示されている文字）がそれ
までに何文字分だけ連続しているかを、該第１の認識候
補の文字から前にさかのぼって計算し、計算結果を継承
文字数領域１１０５に格納する（ステップ１７１４）。
この計算は、具体的には、認識候補データ領域１１０２
において、該第１の認識候補が格納されている認識候補
記憶領域より前の全ての認識候補記憶領域に格納されて
いる第１の認識候補の字種フィールド１２０５を参照す
ることにより行うことができる。If the coordinate data detected by the tablet is in the recognition button 304 (step 170).
4) This means that the recognition button 304 has been touched. Therefore, it is checked whether or not the coordinate data is stored in the coordinate data area 1107 (step 1711). A character recognition processing program is activated so as to perform character recognition processing on coordinate data (writing data) (step 1712). The character recognition processing performed by the character recognition processing program is as described above. When the character recognition processing program ends the character recognition processing, the recognition candidates (character code, character type, degree of coincidence) output by the character recognition processing are matched. The data are stored in the recognition candidate data area 1102 in descending order (step 1713),
It is possible to determine how many characters the characters of the same character type (characters displayed in the character frame 303) as the character type of the first recognition candidate character at the time of storage are consecutive by the first character. It calculates backward from the recognition candidate character and stores the calculation result in the inherited character number area 1105 (step 1714).
Specifically, this calculation is performed on the recognition candidate data area 1102.
, The first recognition candidate storage area in which the first recognition candidate is stored can be referred to by referring to the character type field 1205 of the first recognition candidate stored in all the recognition candidate storage areas. .

【００９８】続いて、座標データ領域１１０７を初期化
した後（ステップ１７１５）、ステップ１７１４で認識
候補データ領域１１０２に格納された認識候補の加算点
を計算する加算点計算処理を行うように、加算点計算処
理プログラムを起動する（ステップ１７１６）。なお、
加算点計算処理の詳細については後述する。Subsequently, after the coordinate data area 1107 is initialized (step 1715), addition is performed so that addition point calculation processing for calculating the addition points of the recognition candidates stored in the recognition candidate data area 1102 in step 1714 is performed. The point calculation processing program is started (step 1716). In addition,
Details of the addition point calculation processing will be described later.

【００９９】加算点計算処理プログラムが加算点計算処
理を終了すると、認識候補データ領域１１０２におい
て、計算された加算点が加算点フィールド１２０６に格
納され、一致度フィールド１２０５に格納されている一
致度と加算点フィールド１２０６に格納されている加算
点とを合計した優先度が大きい順に、認識候補が並び替
えられているので、該優先度が最も大きい認識候補（１
番目に格納されている認識候補）を第１の認識候補とし
て決定し、決定した第１の認識候補の文字を、図５およ
び図７に示した画面表示のように、文字枠３０３に表示
する（ステップ１７１７）。When the addition point calculation processing program ends the addition point calculation processing, in the recognition candidate data area 1102, the calculated addition points are stored in the addition point field 1206 and the matching degree stored in the matching degree field 1205. Since the recognition candidates are rearranged in descending order of priority, which is the sum of the addition points stored in the addition point field 1206, the recognition candidate with the highest priority (1
The second stored recognition candidate) is determined as the first recognition candidate, and the determined first recognition candidate character is displayed in the character frame 303 as in the screen display shown in FIGS. 5 and 7. (Step 1717).

【０１００】また、タブレットが検知した座標データが
確定ボタン３０６内にある場合は（ステップ１７０
５）、確定ボタン３０６が触れられたことを意味してい
るので、認識候補データ領域１１０２に認識候補が格納
されているか否かを調べ（ステップ１７１８）、認識候
補が格納されているならば、そのうちの第１の認識候補
の文字コードを本文データ領域１１０８のデータ末尾に
追加して格納すると共に（ステップ１７１９）、該文字
コードの文字を本文領域３０１のカーソル３０２が示す
表示位置に表示する（ステップ１７２０）。If the coordinate data detected by the tablet is in the confirm button 306 (step 170)
5) Since it means that the confirm button 306 has been touched, it is checked whether or not the recognition candidate is stored in the recognition candidate data area 1102 (step 1718). If the recognition candidate is stored, The character code of the first recognition candidate among them is added to the end of the data in the body data area 1108 and stored (step 1719), and the character of the character code is displayed at the display position indicated by the cursor 302 in the body area 301 ( Step 1720).

【０１０１】続いて、認識候補データ領域１１０２を初
期化し（ステップ１７２１）、継承文字数領域１１０５
を初期化し（ステップ１７２２）、文字枠３０３内を初
期化する（ステップ１７２３）。Subsequently, the recognition candidate data area 1102 is initialized (step 1721) and the inherited character number area 1105 is set.
Is initialized (step 1722), and the inside of the character frame 303 is initialized (step 1723).

【０１０２】また、タブレットが検知した座標データが
認識候補枠３０５内にある場合は（ステップ１７０
６）、認識候補枠３０５に文字が表示されているか否か
を調べ（ステップ１７２４）、文字が表示されているな
らば、図９に示した画面表示のように、認識候補枠３０
５に表示されている文字が触れられたことを意味してい
るので、認識候補データ領域１１０２において、触れら
れた文字の認識候補、すなわち、修正する旨が指示され
た文字の認識候補とその時点の第１の認識候補とを入れ
替えて、修正する旨が指示された文字の認識候補を第１
の認識候補とする（ステップ１７２５）。If the coordinate data detected by the tablet is within the recognition candidate frame 305 (step 170)
6) It is checked whether or not the character is displayed in the recognition candidate frame 305 (step 1724). If the character is displayed, the recognition candidate frame 30 is displayed as in the screen display shown in FIG.
Since it means that the character displayed in No. 5 is touched, in the recognition candidate data area 1102, the recognition candidate of the touched character, that is, the recognition candidate of the character instructed to be corrected and its time point. The first recognition candidate of the character is replaced by the first recognition candidate of the character
(Step 1725).

【０１０３】続いて、修正する旨が指示された文字の認
識候補を学習する連接学習処理を行うように、連接学習
処理プログラムを起動する（ステップ１７２６）。な
お、連接学習処理の詳細については後述する。Subsequently, the connection learning processing program is activated so as to perform the connection learning processing for learning the recognition candidates of the character instructed to be corrected (step 1726). The details of the connection learning process will be described later.

【０１０４】連接学習処理プログラムが連接学習処理を
終了すると、ステップ１７２５で入れ替えられた後の第
１の認識候補の文字コードの文字を、文字枠３０３に表
示されている文字の代わりに表示し（ステップ１７２
７）、認識候補枠３０５内を初期化する（ステップ１７
２８）。また、ステップ１７２５で入れ替えられた後の
第１の認識候補の文字の字種と同じ字種の文字（文字枠
３０３に表示されている文字）がそれまでに何文字分だ
け連続しているかを、該文字から前にさかのぼって計算
し、計算結果を継承文字数領域１１０５に格納する（ス
テップ１７２９）。この計算は、具体的には、認識候補
データ領域１１０２において、該第１の認識候補の文字
の認識候補が格納されている認識候補記憶領域より前の
全ての認識候補記憶領域に格納されている第１の認識候
補の字種フィールド１２０５を参照することにより行う
ことができる。When the connection learning processing program ends the connection learning processing, the character of the character code of the first recognition candidate after the replacement in step 1725 is displayed in place of the character displayed in the character frame 303 ( Step 172
7), the inside of the recognition candidate frame 305 is initialized (step 17).
28). In addition, the number of consecutive characters by the same character type (characters displayed in the character frame 303) as the character type of the first recognition candidate character after being replaced in step 1725 is determined. , Is calculated backward from the character, and the calculation result is stored in the inherited character number area 1105 (step 1729). Specifically, this calculation is stored in all the recognition candidate storage areas before the recognition candidate storage area in which the recognition candidates of the character of the first recognition candidate are stored in the recognition candidate data area 1102. This can be done by referring to the character type field 1205 of the first recognition candidate.

【０１０５】続いて、ステップ１７２５で入れ替えられ
た後の第１の認識候補の文字以降の文字（文字枠３０３
に表示る文字）の認識候補の加算点を計算する加算点計
算処理を行うように、加算点計算処理プログラムを起動
する（ステップ１７３０）。Subsequently, the characters after the first recognition candidate character after the replacement in step 1725 (character frame 303
The addition point calculation processing program is started so as to perform the addition point calculation processing for calculating the addition point of the recognition candidate of the character displayed in (step 1730).

【０１０６】ここで、ステップ１７３０で加算点計算処
理を行う理由について説明する。Here, the reason why the addition point calculation processing is performed in step 1730 will be described.

【０１０７】操作者が、複数文字分の筆記データを文字
枠３０３に手書きした後に認識ボタン３０４をペン２０
２で触れた場合は、該複数文字分の筆記データの認識結
果の文字が文字枠３０３に表示されることとなるが、こ
のとき、操作者が、途中の文字について認識結果修正操
作を行うと、ステップ１７２９で継承文字数領域１１０
５に格納された継承文字数が以前に格納されていた値と
異なることがあり、それに伴って、該文字以降の文字の
認識候補の加算点が変わってくることがある。そこで、
このような場合に、修正する旨が指示された文字以降の
全ての文字の認識候補の加算点を再計算し、一致度と再
計算した加算点とを合計した優先度が大きい順に、認識
候補を入れ替えるようにする。ステップ１７３０で加算
点計算処理を行うのは、この加算点の再計算を行うため
である。After the operator handwrites the writing data for a plurality of characters in the character frame 303, the operator presses the recognition button 304 with the pen 20.
When touched with 2, the characters of the recognition result of the writing data for the plurality of characters are displayed in the character frame 303. At this time, if the operator performs the recognition result correction operation for the character in the middle. , Inherited character number area 110 in step 1729
The number of inherited characters stored in 5 may be different from the value previously stored, and accordingly, the addition points of the recognition candidates of the characters after the character may change. Therefore,
In such a case, the addition points of the recognition candidates of all the characters after the character for which correction is instructed are recalculated, and the recognition candidates are sorted in descending order of priority, which is the sum of the degree of coincidence and the recalculated addition points. To be replaced. The addition point calculation process is performed in step 1730 in order to recalculate this addition point.

【０１０８】加算点計算処理プログラムが加算点計算処
理を終了すると、認識候補データ領域１１０２におい
て、再計算された加算点が加算点フィールド１２０６に
格納され、一致度フィールド１２０５に格納されている
一致度と加算点フィールド１２０６に格納されている加
算点とを合計した優先度が大きい順に、認識候補が並び
替えられているので、該優先度が最も大きい認識候補
（１番目に格納されている認識候補）を第１の認識候補
として決定し、決定した第１の認識候補の文字を文字枠
３０３に表示する（ステップ１７３１）。When the addition point calculation processing program ends the addition point calculation processing, in the recognition candidate data area 1102, the recalculated addition points are stored in the addition point field 1206 and the matching degree stored in the matching degree field 1205. And the addition points stored in the addition point field 1206 are sorted in descending order of priority, the recognition candidates are rearranged. Therefore, the recognition candidate having the highest priority (the recognition candidate stored first) ) Is determined as the first recognition candidate, and the character of the determined first recognition candidate is displayed in the character frame 303 (step 1731).

【０１０９】なお、本実施例では、操作者が認識結果修
正操作を行った場合にのみ、ステップ１７２６で連接学
習処理を行うようにしているが、記憶手段１０２の容量
がある程度大きいならば、操作者が文字の筆記データを
手書きする度に連接学習処理を行って、学習する機会を
増やすようにしてもよい。In this embodiment, the connection learning process is performed in step 1726 only when the operator performs the recognition result correcting operation. However, if the capacity of the storage means 102 is large to some extent, the operation is performed. It is also possible to increase the chances of learning by performing the connection learning process every time the person handwrites the writing data of characters.

【０１１０】また、本実施例では、ステップ１７１２で
行われる文字認識処理によって、１つ以上の認識候補が
必ず出力されすることを前提としているが、文字認識処
理において、操作者が手書きした筆記データとの間の一
致度が予め決められた閾値より大きい標準筆記データが
文字認識辞書に格納されていない場合には、認識候補が
１つも出力されなくなるので、このような場合は、閾値
を下げて、再度、文字認識処理を行うことにより、１つ
以上の認識候補が必ず出力されるようになっている。と
ころが、一致度が小さい認識候補、すなわち、形があま
り似通っていない認識候補が出力されることとなるの
で、このような場合には、後述するように、連接記憶デ
ータ領域１１０６を参照し、過去に１度でも手書きされ
たことがある文字の認識候補があるならば、該認識候補
以外の認識候補を切り捨てるようにすることにより、余
分な認識候補を出力しないようにすることができる。Further, in the present embodiment, it is premised that one or more recognition candidates are always output by the character recognition processing performed in step 1712. However, in the character recognition processing, the handwritten data written by the operator is written. If no standard writing data in which the degree of coincidence between and is larger than a predetermined threshold is stored in the character recognition dictionary, no recognition candidate is output. In such a case, lower the threshold. By performing the character recognition process again, one or more recognition candidates are always output. However, since recognition candidates having a low degree of coincidence, that is, recognition candidates having a shape that is not very similar to each other, are output, in such a case, the concatenated storage data area 1106 is referred to and the past If there is a recognition candidate of a character that has been handwritten even once, it is possible to prevent the recognition candidate other than the recognition candidate from being output, so that an extra recognition candidate is not output.

【０１１１】図１８はステップ１７２６の連接学習処理
を行う連接学習処理プログラムのフローチャートであ
る。FIG. 18 is a flow chart of a connection learning processing program for performing the connection learning processing in step 1726.

【０１１２】連接学習処理は、操作者が認識結果修正操
作を行った場合に、修正する旨が指示された文字の文字
コードと１つ前の文字（文字枠３０３に表示されている
文字）の文字コードとを組にした連接文字コード組を、
連接記憶データ領域１１０６に格納する処理である。た
だし、修正する旨が指示された文字の字種と１つ前の文
字の字種とが同じ字種であり、かつ、該字種が漢字，カ
タカナ，英字のうちのいずれかであった場合に限って、
連接文字コード組を格納する。In the connection learning process, when the operator performs the recognition result correction operation, the character code of the character instructed to be corrected and the character code immediately before (the character displayed in the character frame 303) A concatenated character code set that is a combination of character code and
This is a process of storing in the concatenated storage data area 1106. However, if the character type of the character instructed to be modified is the same as the character type of the previous character, and the character type is one of Kanji, Katakana, or English. Only
Stores the concatenated character code set.

【０１１３】具体的には、操作者が修正する旨を指示し
た文字の字種が、漢字，カタカナ，英字のうちのいずれ
かであるか否かを調べ（ステップ１８０１）、漢字，カ
タカナ，英字のうちのいずれかであるならば、修正する
旨が指示された文字の字種と１つ前の文字の字種とが同
じであるか否かを判定し、同じであるならば、連接文字
コード組を取得する（ステップ１８０２）。連接文字コ
ード組の取得は、具体的には、認識候補データ領域１１
０２において、修正する旨が指示された文字の認識候補
が格納されている認識候補記憶領域より１つ前の認識候
補記憶領域を参照することにより行うことができる。Specifically, it is checked whether the character type of the character instructed to be corrected by the operator is kanji, katakana, or alphabet (step 1801), and kanji, katakana, or alphabet. If any of the above, it is determined whether or not the character type of the character designated to be modified is the same as the character type of the previous character. A code set is acquired (step 1802). To obtain the concatenated character code set, specifically, the recognition candidate data area 11
In 02, it can be performed by referring to the recognition candidate storage area immediately preceding the recognition candidate storage area in which the recognition candidates of the character instructed to be corrected are stored.

【０１１４】ステップ１８０２で連接文字コード組が取
得できたならば、取得した連接文字コード組を連接記憶
データ領域１１０６に格納する（ステップ１８０３）。If the concatenated character code set can be acquired in step 1802, the acquired concatenated character code set is stored in the concatenated storage data area 1106 (step 1803).

【０１１５】なお、連接文字コード組を格納する際に、
既に同じ連接文字コード組が格納されていることが考え
られるが、このようなときには、既に格納されている連
接文字コード組に上書きして格納するようにしてもよい
し、重複する連接文字コード組を格納することを許容す
るようにしてもよい。When storing the concatenated character code set,
It is possible that the same concatenated character code set has already been stored. In such a case, the already stored concatenated character code set may be overwritten and stored. May be allowed to be stored.

【０１１６】また、連接文字コード組を新しい順に格納
するようにすることもでき、この場合は、既に同じ連接
文字コード組が格納されているときには、既に格納され
ている連接文字コード組を削除してから、新たに取得し
た連接文字コード組を格納するようにする。It is also possible to store the concatenated character code sets in the newest order. In this case, when the same concatenated character code set is already stored, the already stored concatenated character code set is deleted. Then, the newly acquired concatenated character code set is stored.

【０１１７】図１９はステップ１７１６およびステップ
１７３０の加算点計算処理を行う加算点計算処理プログ
ラムのフローチャートである。FIG. 19 is a flow chart of an addition point calculation processing program for executing the addition point calculation processing in step 1716 and step 1730.

【０１１８】上述したように、文字認識処理が行われた
後、文字認識処理によって出力された認識候補（文字コ
ード，字種，一致度）が、一致度が大きい順に、認識候
補データ領域１１０２に格納されるが、一致度は、筆記
データについて文字単位で文字認識処理を行った結果で
あり、前後関係は、いっさい参照していない。そこで、
ステップ１７１６で、図１９のフローチャートに示す加
算点計算処理を行うことにより、認識候補データ領域１
１０２に格納された認識候補の加算点を計算し、一致度
と計算した加算点とを合計した優先度が大きい順に、認
識候補を入れ替えるようにしている。As described above, after the character recognition processing is performed, the recognition candidates (character code, character type, matching degree) output by the character recognition processing are stored in the recognition candidate data area 1102 in descending order of matching degree. Although stored, the degree of coincidence is the result of performing character recognition processing on the writing data on a character-by-character basis, and does not refer to any context. Therefore,
In step 1716, by performing the addition point calculation process shown in the flowchart of FIG.
The addition points of the recognition candidates stored in 102 are calculated, and the recognition candidates are replaced in the descending order of priority, which is the sum of the coincidence and the calculated addition points.

【０１１９】また、上述したように、操作者が途中の文
字について認識結果修正操作を行うと、継承文字数が変
更されることがあり、それに伴って、該文字以降の文字
の認識候補の加算点が変わってくることがある。そこ
で、ステップ１７３０で、操作者が修正する旨を指示し
た文字以降の文字数分だけ、図１９のフローチャートに
示す加算点計算処理を行うことにより、操作者が修正す
る旨を指示した文字以降の全ての文字について、該文字
の認識候補の加算点を再計算し、一致度と再計算した加
算点とを合計した優先度が大きい順に、認識候補を入れ
替えるようにしている。Further, as described above, when the operator performs a recognition result correction operation for a character in the middle, the number of inherited characters may be changed. Along with this, the addition points of the recognition candidates of the characters after the character may be changed. May change. Therefore, in step 1730, by performing the addition point calculation process shown in the flowchart of FIG. 19 by the number of characters after the character that the operator has instructed to correct, all the characters after the character instructed to be corrected by the operator are performed. With respect to the character, the addition points of the recognition candidates of the character are recalculated, and the recognition candidates are replaced in descending order of the sum of the matching degree and the recalculated addition point.

【０１２０】加算点には、以下に説明するように、連接
学習機能による加算点と字種継承機能による加算点があ
るが、本実施例では、連接学習機能による加算点を優先
するようにしている。すなわち、通常は、字種継承機能
による加算点を用い、特に、操作者が過去に認識結果修
正操作を行ったことがある文字に対しては、連接学習機
能による加算点を優先して用いるようにすることによ
り、日本語の一般的な傾向を利用しながら、操作者の操
作に合わせて文字認識率を向上させることができるよう
にしている。As will be described below, the addition points include addition points by the concatenation learning function and addition points by the character type inheritance function. In the present embodiment, the addition points by the concatenation learning function are prioritized. There is. That is, normally, the addition point by the character type inheritance function is used, and especially for the character for which the operator has performed the recognition result correction operation in the past, the addition point by the connection learning function is preferentially used. By using the general tendency of Japanese, the character recognition rate can be improved according to the operation of the operator.

【０１２１】具体的には、加算点を計算しようとしてい
る認識候補が格納されている認識候補記憶領域と１つ前
の認識候補記憶領域とを参照し、前者に格納されている
全ての認識候補の文字コードと後者に格納されている第
１の認識候補の文字コードとをそれぞれ組にしたものに
ついて、該組と一致する連接文字コード組が連接記憶デ
ータ領域１１０６に格納されているか否かを調べる（ス
テップ１９０１）。一致する連接文字コード組が格納さ
れているならば（ステップ１９０２）、該組にした文字
コードの認識候補について、連接学習機能による加算点
を計算し、計算した加算点を加算点フィールド１２０６
に格納する（ステップ１９０３）。連接学習機能による
加算点は、具体的には、予め決められた正の値である。Specifically, all the recognition candidates stored in the former are referenced by referring to the recognition candidate storage area in which the recognition candidate for which the addition point is to be calculated is stored and the previous recognition candidate storage area. Of the character code of the first recognition candidate stored in the latter and the character code of the first recognition candidate stored in the latter, and whether or not a concatenated character code set matching the group is stored in the concatenated storage data area 1106. Check (step 1901). If a concatenated concatenated character code set is stored (step 1902), an addition point by the concatenation learning function is calculated for the character code recognition candidates in the set, and the calculated addition point is added to the addition point field 1206.
(Step 1903). Specifically, the addition point by the connection learning function is a predetermined positive value.

【０１２２】なお、連接学習処理において、連接記憶デ
ータ領域１１０６に重複する連接文字コード組を格納す
ることを許容するようにした場合は、一致する連接文字
コード組が２つ以上格納されていることもあるので、そ
のときは、予め決められた正の値に一致する連接文字コ
ード組が格納されている数を乗じた値を加算点とするこ
とによって、学習回数を加算点に反映させることができ
る。また、連接学習処理において、連接記憶データ領域
１１０６に連接文字コード組を新しい順に格納するよう
にした場合は、一致する連接文字コード組が格納された
時点が新しいほど、大きい値となるようにすることによ
って、学習の新／旧を加算点に反映させることができ
る。In the connection learning process, when it is allowed to store overlapping connection character code sets in the connection storage data area 1106, two or more matching connection character code sets must be stored. Therefore, at that time, the number of learnings can be reflected in the addition point by setting the value obtained by multiplying the number of stored concatenated character code sets matching the predetermined positive value as the addition point. it can. Further, in the concatenation learning process, when the concatenation character code sets are stored in the concatenation storage data area 1106 in the newest order, the newer the time when the matching concatenation character code set is stored, the larger the value becomes. By doing so, new / old learning can be reflected in the addition points.

【０１２３】続いて、加算点を計算しようとしている認
識候補の全てについて、加算点の計算を終了すると、認
識候補データ領域１１０２において、一致度と加算点と
を合計した優先度が大きい順に、認識候補を並び替える
（ステップ１９０４）。Then, when the calculation of the addition points is completed for all the recognition candidates for which the addition points are to be calculated, recognition is performed in the recognition candidate data area 1102 in descending order of the sum of the degree of coincidence and the addition points. The candidates are rearranged (step 1904).

【０１２４】図２０は連接学習機能による加算点の具体
例を示す図であり、図９に示した画面表示のように、操
作者が認識結果修正操作を行った後、操作者が、２度目
に、「日」，「立」という文字を意図した筆記データを
手書きした場合の認識候補データ領域１１０２の内容を
示している。FIG. 20 is a diagram showing a specific example of the addition points by the concatenation learning function. As shown in the screen display of FIG. 9, after the operator performs the recognition result correction operation, the operator does the second time. The contents of the recognition candidate data area 1102 in the case of handwriting the writing data intended for the characters "day" and "standing" are shown in FIG.

【０１２５】操作者が「日」，「立」という文字を意図
した筆記データを手書きした場合に、１度目は、操作者
の意図とは異なる「日」，「主」という文字の認識候補
が第１の認識候補として決定されてしまい、誤認識され
ることとなるが、「立」という文字についての認識結果
修正操作が行われたことにより、「日」という文字の文
字コードと「立」という文字の文字コードとを組にした
連接文字コード組が連接記憶データ領域１１０６に格納
されているので、２度目は、連接学習機能による加算点
が用いられて、図２０に示すように、「立」という文字
の認識候補の加算点が「＋５０」となる。そこで、一致
度と学習内容に応じた加算点とを合計した優先度が最も
大きい「立」という文字の認識候補が「主」という文字
の認識候補と入れ替わり、「日」，「立」の文字の認識
候補が第１の認識候補として決定されて、操作者の意図
通りの認識結果が得られるようになる。When the operator handwrites the writing data intended for the characters “day” and “standing”, the first time, the recognition candidates of the characters “day” and “main” different from the operator's intention are obtained. Although it is determined as the first recognition candidate and is erroneously recognized, the character code of the character "Sun" and the character "Stand" are recognized due to the recognition result correction operation for the character "Stand". Since a concatenated character code set that is a set of the character code of the character is stored in the concatenated storage data area 1106, the addition point by the concatenation learning function is used for the second time, as shown in FIG. The addition point of the recognition candidate of the character "standing" is "+50". Therefore, the recognition candidate of the character "standing" having the highest priority, which is the sum of the degree of coincidence and the addition point according to the learning content, is replaced with the recognition candidate of the character "main", and the characters "day" and "standing" are replaced. Is determined as the first recognition candidate, and the recognition result as intended by the operator can be obtained.

【０１２６】このように、加算点を計算しようとしてい
る全ての認識候補の文字コードと１つ前の文字の文字コ
ード（第１の認識候補の文字コード）とをそれぞれ組に
したもののうち、連接記憶データ領域１１０６に格納さ
れている連接文字コード組と一致する組は、該組が１つ
の単語を構成する要素であることを意味しているので、
該組にした文字コードの認識候補の加算点（連接学習機
能による加算点）が計算されて、第１の認識候補として
決定される度合いが大きくなる。従って、単語処理や日
本語処理を行わなくても、連接学習機能によって、
「日」，「立」という、操作者の意図通りの認識結果を
出力し、文字認識率を向上させることができる。As described above, among the combinations of the character codes of all the recognition candidates whose addition points are to be calculated and the character code of the character immediately before (the character code of the first recognition candidate), the concatenation is performed. A set that matches the concatenated character code set stored in the storage data area 1106 means that the set is an element that constitutes one word.
The addition points of the character code recognition candidates in the set (addition points by the concatenation learning function) are calculated, and the degree of determination as the first recognition candidate increases. Therefore, even if you do not perform word processing or Japanese processing, by the connection learning function,
It is possible to improve the character recognition rate by outputting the recognition result of "day" and "standing" as intended by the operator.

【０１２７】一方、一致する連接文字コード組が格納さ
れていないならば（ステップ１９０２）、字種継承機能
による加算点を計算する（ステップ１９０５）。On the other hand, if the matching concatenated character code set is not stored (step 1902), the addition point by the character type inheritance function is calculated (step 1905).

【０１２８】ここで、ステップ１９０５の字種継承機能
による加算点の計算処理について、図２１のフローチャ
ートを用いて説明する。Here, the calculation processing of the addition point by the character type inheritance function of step 1905 will be described with reference to the flowchart of FIG.

【０１２９】上述したように、従来のように文字単位で
文字認識処理を行った場合は、「ＰＯ」，「Ｔ」，
「Ａ」，「Ｔ」，「Ｏ」のように識別しにくい文字の場
合は、誤認識が生じてしまう。そこで、本実施例では、
字種ごとに、該字種の文字の次に出現する文字の字種が
何であるかを統計的に調査した結果に基づいて予め決め
られた値（確率）を、連接頻度データ領域１１０３に格
納しておき、字種継承機能を実現することができるよう
にしている。また、カタカナ，英字，数字は、単語とし
て連続して出現することが多いという傾向があるが、連
続する文字数は有限であるので、字種継承機能を徐々に
減衰していくようにするために、字種ごとに、該字種の
文字が連続する文字数を統計的に調査した結果に基づい
て予め決められた値（減衰率）を、減衰パラメータデー
タ領域１１０４に格納しておくと共に、第１の認識候補
として決定した認識候補の文字の字種が、それまでに何
文字分だけ連続して同じ字種であったかを示す値（継承
文字数）を、継承文字数領域１１０５に格納するように
している。As described above, when character recognition processing is performed character by character as in the conventional case, "PO", "T",
In the case of difficult-to-identify characters such as "A", "T", and "O", erroneous recognition occurs. Therefore, in this embodiment,
For each character type, a value (probability) predetermined based on the result of statistically investigating the character type of the character that appears next to the character of the character type is stored in the concatenation frequency data area 1103. In addition, the character type inheritance function can be realized. In addition, katakana, letters, and numbers tend to appear consecutively as words in many cases, but since the number of consecutive characters is finite, in order to gradually attenuate the character type inheritance function, For each character type, a value (attenuation rate) determined in advance based on the result of statistically examining the number of consecutive characters of the character type is stored in the attenuation parameter data area 1104. A value (the number of inherited characters) indicating how many characters the character type of the character of the recognition candidate determined as the recognition candidate has been the same character type until then is stored in the inherited character number area 1105. .

【０１３０】具体的には、認識候補データ領域１１０２
において、加算点を計算しようとしている認識候補が格
納されている認識候補記憶領域の１つ前の認識候補記憶
領域に認識候補が格納されているか否かを調べることに
より、認識候補の加算点を計算しようとしている文字の
１つ前に文字（文字枠３０３に表示されている文字）が
あるか否かを調べ（ステップ２１０１）、文字があるな
らば、変数ｉに、１つ前の文字の字種（１つ前の文字の
認識候補のうちの第１の認識候補の文字の字種）を設定
し（ステップ２１０２）、変数ｊに、現在処理対象とし
ている認識候補（ｋ番目の認識候補）の文字の字種を設
定する（ステップ２１０３）。Specifically, the recognition candidate data area 1102
In, the addition point of the recognition candidate is determined by checking whether or not the recognition candidate is stored in the recognition candidate storage area immediately preceding the recognition candidate storage area in which the recognition candidate whose addition point is to be calculated is stored. It is checked whether or not there is a character (character displayed in the character frame 303) immediately before the character to be calculated (step 2101). If there is a character, the variable i is set to the character before the character. The character type (the character type of the character of the first recognition candidate of the recognition candidates of the previous character) is set (step 2102), and the variable j is the recognition candidate currently being processed (the kth recognition candidate). The character type of the character) is set (step 2103).

【０１３１】続いて、変数ｉ（１つ前の文字の字種）お
よび変数ｊ（ｋ番目の認識候補の文字の字種）から、
（式１）を用いて、ｋ番目の認識候補の加算点Ｄｉｊを
計算する（ステップ２１０４）。Subsequently, from the variable i (character type of the character immediately before) and the variable j (character type of the character of the kth recognition candidate),
The addition point Dij of the kth recognition candidate is calculated using (Equation 1) (step 2104).

【０１３２】Ｄｉｊ＝（Ｐｉｊ＋Ｎ×αｉ×Ｐｉｊ）×Ａ …（式１）ここで、Ｐｉｊは、連接頻度データ領域１１０３に格納
されている確率であり、Ｎは、継承文字数領域１１０５
に格納されている継承文字数であり、αｉは、減衰パラ
メータデータ領域１１０４に格納されている減衰率であ
る。また、Ａは、字種継承機能による加算点の重み付け
を調整するためのパラメータであり、予め決められた値
となっている。Dij = (Pij + N × αi × Pij) × A (Equation 1) Here, Pij is the probability stored in the concatenation frequency data area 1103, and N is the inherited character number area 1105.
Is the number of inherited characters stored in, and αi is the attenuation rate stored in the attenuation parameter data area 1104. Also, A is a parameter for adjusting the weighting of the addition points by the character type inheritance function, and has a predetermined value.

【０１３３】連接頻度データ領域１１０３において、図
１３に示したように、確率Ｐｉｊは、ひらがな，カタカ
ナ，英字，数字が連続する傾向が強くなるように、その
値が予め決められているので、前の文字の字種と同じ字
種である文字の認識候補の加算点Ｄｉｊが大きくなり、
字種継承機能を実現することができるようになってい
る。In the concatenation frequency data area 1103, as shown in FIG. 13, the probability Pij is determined in advance so that the tendency that hiragana, katakana, letters and numbers are continuous becomes strong. The addition point Dij of the recognition candidates of the character having the same character type as that of
The character type inheritance function can be realized.

【０１３４】また、継承文字数Ｎが大きくなるほど、加
算点Ｄｉｊは徐々に減衰する。例えば、カタカナの場合
は、Ｎ＝０では、Ｄｉｊ＝（７０＋０）×０.５＝３５
となり、Ｎ＝１では、Ｄｉｊ＝（７０＋１×（−０.
２）×７０）×０.５＝２８となり、Ｎ＝２では、Ｄｉ
ｊ＝２１となり、Ｎ＝３では、Ｄｉｊ＝１４となり、Ｎ
＝４では、Ｄｉｊ＝７となる。なお、ここでは、Ａ＝
０.５として計算しているが、例えば、認識候補の一致
度の平均値が大きいほど、Ａの値を大きくするようにす
るなど、一致度の大きさに応じてＡの値を変更すること
ができる。Further, as the number N of inherited characters increases, the addition point Dij gradually decreases. For example, in the case of katakana, when N = 0, Dij = (70 + 0) × 0.5 = 35
Therefore, when N = 1, Dij = (70 + 1 × (−0.
2) × 70) × 0.5 = 28, and when N = 2, Di
j = 21, N = 3, Dij = 14, N
= 4, Dij = 7. Here, A =
Although calculated as 0.5, the value of A should be changed according to the degree of coincidence, for example, the value of A is increased as the average value of the degree of coincidence of recognition candidates is increased. You can

【０１３５】このように、継承文字数Ｎが大きくなるに
従って、ある時点で、加算点Ｄｉｊは負の数に転じるの
で、加算点Ｄｉｊが正であるか否かを調べ（ステップ２
１０５）、正であるならば、加算点Ｄｉｊを認識候補デ
ータ領域１１０２の加算点フィールド１２０６に格納す
る（ステップ２１０６）。なお、加算点Ｄｉｊの正／負
を判定しないようにすると、前の文字の字種と同じ字種
である文字の認識候補は、ある文字数分だけ連続してい
る間は、優先度が大きくなり、該文字数を超えて連続す
ると、逆に、優先度が小さくなるようにすることが可能
となる。As described above, as the number of inherited characters N increases, the addition point Dij turns to a negative number at a certain point, so it is checked whether or not the addition point Dij is positive (step 2
105) If it is positive, the addition point Dij is stored in the addition point field 1206 of the recognition candidate data area 1102 (step 2106). Note that if the addition point Dij is not determined to be positive or negative, the recognition candidate of a character having the same character type as the previous character has a high priority while it continues for a certain number of characters. When the number of characters exceeds the number of consecutive characters, the priority can be reduced.

【０１３６】続いて、加算点を計算しようとしている全
ての認識候補について、加算点Ｄijを計算し、計算した
加算点Ｄｉｊを加算点フィールド１２０６に格納した後
は（ステップ２１０７）、認識候補データ領域１１０２
において、一致度と加算点とを合計した優先度が大きい
順に、認識候補を並び替える（ステップ２１０８）。そ
して、並び替えた結果、第１の認識候補となった認識候
補の文字の字種と同じ字種の文字が、それまでに何文字
分だけ連続しているかを、該第１の認識候補の文字から
前にさかのぼって計算し、計算結果を継承文字数領域１
１０５に格納する（ステップ２１０９）。この計算は、
具体的には、認識候補データ領域１１０２において、該
第１の認識候補が格納されている認識候補記憶領域より
前の全ての認識候補記憶領域に格納されている第１の認
識候補の字種フィールド１２０５を参照することにより
行うことができる。Subsequently, after calculating the addition points Dij for all the recognition candidates for which the addition points are to be calculated and storing the calculated addition points Dij in the addition point field 1206 (step 2107), the recognition candidate data area is calculated. 1102
In step S2108, the recognition candidates are rearranged in the descending order of priority, which is the sum of the coincidence and the addition points. Then, as a result of the rearrangement, the number of consecutive characters of the same character type as the character type of the recognition candidate character that has become the first recognition candidate is determined by the first recognition candidate. It calculates backward from the character and calculates the calculation result.
It is stored in 105 (step 2109). This calculation is
Specifically, in the recognition candidate data area 1102, the character type field of the first recognition candidate stored in all the recognition candidate storage areas before the recognition candidate storage area in which the first recognition candidate is stored. This can be done by referring to 1205.

【０１３７】図２２は字種継承機能による加算点の具体
例を示す図であり、操作者が、「シ」，「ス」，
「テ」，「ム」という文字を意図した筆記データを手書
きした場合の認識候補データ領域１１０２の内容を示し
ている。FIG. 22 is a diagram showing a specific example of the addition points by the character type inheritance function.
The contents of the recognition candidate data area 1102 in the case where the handwriting data intended for the characters "te" and "mu" are handwritten are shown.

【０１３８】操作者が「シ」，「ス」，「テ」，「ム」
という文字を意図した筆記データを手書きした場合に、
字種継承機能による加算点を用いないと、図２２（ａ）
に示すように、一致度が最も大きい「シ」，「又」，
「千」，「ム」という文字の認識候補が第１の認識候補
として決定されてしまい、誤認識されることとなるが、
字種継承機能による加算点を用いると、図２２（ｂ）に
示すように、一致度と加算点Ｄｉｊとを合計した優先度
が最も大きい「ス」，「テ」という文字の認識候補が
「又」，「千」という文字の認識候補と入れ替わるの
で、「シ」，「ス」，「テ」，「ム」という文字の認識
候補が第１の認識候補として決定されて、操作者の意図
通りの認識結果が得られるようになる。The operator selects "shi", "su", "te", "mu".
When handwriting the writing data intended for the character
22 (a), unless the addition point by the character type inheritance function is used.
As shown in, the highest degree of coincidence is "shi", "mata",
Although the recognition candidates of the characters "thousand" and "mu" are determined as the first recognition candidate, they are erroneously recognized.
When the addition point by the character type inheritance function is used, as shown in FIG. 22B, the recognition candidates of the characters “su” and “te” having the highest priority, which is the sum of the coincidence degree and the addition point Dij, are “ Since it is replaced with the recognition candidates of the characters “m” and “thousand”, the recognition candidates of the characters “shi”, “su”, “te”, and “mu” are determined as the first recognition candidates, and the intention of the operator is determined. You will get the same recognition results.

【０１３９】このように、「シ」という文字以降の文字
の認識候補については、１つ前の文字の字種がカタカナ
であるので、該文字の字種がカタカナであるような認識
候補の加算点が計算されて、第１の認識候補として決定
される度合いが大きくなる。従って、単語処理や日本語
処理を行わなくても、字種継承機能によって、「シ」，
「ス」，「テ」，「ム」という、日本語として尤もらし
い認識結果を出力し、文字認識率を向上させることがで
きる。As described above, with respect to the recognition candidates of the characters after the character "shi", since the character type of the preceding character is katakana, the addition of the recognition candidates whose character type is katakana is added. The degree to which the points are calculated and determined as the first recognition candidate increases. Therefore, even if word processing or Japanese processing is not performed, the character type inheritance function enables
It is possible to improve the character recognition rate by outputting Japanese-like recognition results such as "su", "te", and "mu".

【０１４０】[0140]

【発明の効果】以上説明したように、本発明によれば、
単語処理や日本語処理を行わなくても、日本語の一般的
な傾向を利用することにより、日本語として尤もらしい
認識結果を出力し、文字認識率を向上させることが可能
な文字入力装置を提供することが可能となる。As described above, according to the present invention,
By using the general tendency of Japanese language without using word processing or Japanese language processing, a character input device that can output a recognition result that seems to be Japanese and can improve the character recognition rate is provided. It becomes possible to provide.

【０１４１】また、単語処理や日本語処理を行わないこ
とから、高速に認識結果を出力することができるように
なる。Further, since word processing and Japanese language processing are not performed, the recognition result can be output at high speed.

[Brief description of drawings]

【図１】本実施例の文字入力装置の機能ブロック図。FIG. 1 is a functional block diagram of a character input device according to an embodiment.

【図２】本実施例の文字入力装置の外観図。FIG. 2 is an external view of the character input device according to the present embodiment.

【図３】本実施例の画面表示例を示す説明図。FIG. 3 is an explanatory diagram showing a screen display example of the present embodiment.

【図４】本実施例の画面表示例を示す説明図。FIG. 4 is an explanatory diagram showing a screen display example of the present embodiment.

【図５】本実施例の画面表示例を示す説明図。FIG. 5 is an explanatory diagram showing a screen display example of the present embodiment.

【図６】本実施例の画面表示例を示す説明図。FIG. 6 is an explanatory diagram showing a screen display example of the present embodiment.

【図７】本実施例の画面表示例を示す説明図。FIG. 7 is an explanatory diagram showing a screen display example of the present embodiment.

【図８】本実施例の画面表示例を示す説明図。FIG. 8 is an explanatory diagram showing a screen display example of the present embodiment.

【図９】本実施例の画面表示例を示す説明図。FIG. 9 is an explanatory diagram showing a screen display example of the present embodiment.

【図１０】本実施例の画面表示例を示す説明図。FIG. 10 is an explanatory diagram showing a screen display example of the present embodiment.

【図１１】記憶手段の内容を模式的に示す説明図。FIG. 11 is an explanatory diagram schematically showing the contents of storage means.

【図１２】認識候補データ領域の内容を模式的に示す説
明図。FIG. 12 is an explanatory diagram schematically showing the contents of a recognition candidate data area.

【図１３】連接頻度データ領域の内容を模式的に示す説
明図。FIG. 13 is an explanatory diagram schematically showing the contents of a concatenation frequency data area.

【図１４】減衰パラメータデータ領域の内容を模式的に
示す説明図。FIG. 14 is an explanatory diagram schematically showing the contents of an attenuation parameter data area.

【図１５】継承文字数領域の内容を模式的に示す説明
図。FIG. 15 is an explanatory diagram schematically showing the content of the inherited character number area.

【図１６】連接記憶データ領域の内容を模式的に示す説
明図。FIG. 16 is an explanatory diagram schematically showing the contents of a concatenated storage data area.

【図１７】メイン処理プログラムのフローチャート。FIG. 17 is a flowchart of a main processing program.

【図１８】連接学習処理プログラムのフローチャート。FIG. 18 is a flowchart of a connection learning processing program.

【図１９】加算点計算処理プログラムのフローチャー
ト。FIG. 19 is a flowchart of an addition point calculation processing program.

【図２０】連接学習機能による加算点の具体例を示す説
明図。FIG. 20 is an explanatory diagram showing a specific example of an addition point by the connection learning function.

【図２１】字種継承機能による加算点計算処理のフロー
チャート。FIG. 21 is a flowchart of addition point calculation processing by the character type inheritance function.

【図２２】字種継承機能による加算点の具体例を示す説
明図。FIG. 22 is an explanatory diagram showing a specific example of addition points by the character type inheritance function.

[Explanation of symbols]

１０１…座標入力手段、１０２…記憶手段、１０３…制
御手段、１０４…表示手段、１０５…外部記憶手段、１
０６…電源、１０７…電源スイッチ、２０１…本体、２
０２…ペン、２０３…表示一体型タブレット、２０４…
挿入口、３０１…本文領域、３０２…カーソル、３０３
…文字枠、３０４…認識ボタン、３０５…認識候補枠、
３０６…確定ボタン、１１０１…各種プログラム領域、
１１０２…認識候補データ領域、１１０３…連接頻度デ
ータ領域、１１０４…減衰パラメータデータ領域、１１
０５…字種継承文字数領域、１１０６…連接記憶データ
領域、１１０７…座標データ領域、１１０８…本文デー
タ領域、１２０１，１２０２…認識候補記憶領域、１２
０３…文字コードフィールド、１２０４…字種フィール
ド、１２０５…一致度フィールド、１２０６…加算点フ
ィールド。101 ... Coordinate input means, 102 ... Storage means, 103 ... Control means, 104 ... Display means, 105 ... External storage means, 1
06 ... Power source, 107 ... Power switch, 201 ... Main body, 2
02 ... Pen, 203 ... Tablet with integrated display, 204 ...
Insertion slot, 301 ... Body area, 302 ... Cursor, 303
... character frame, 304 ... recognition button, 305 ... recognition candidate frame,
306 ... Confirm button, 1101 ... Various program areas,
1102 ... Recognition candidate data area, 1103 ... Concatenation frequency data area, 1104 ... Attenuation parameter data area, 11
05 ... Character type inherited character number area, 1106 ... Concatenated storage data area, 1107 ... Coordinate data area, 1108 ... Body data area, 1201, 1202 ... Recognition candidate storage area, 12
03 ... Character code field, 1204 ... Character type field, 1205 ... Matching degree field, 1206 ... Addition point field.

───────────────────────────────────────────────────── フロントページの続き (72)発明者隈井裕之神奈川県横浜市戸塚区吉田町292番地株式会社日立製作所映像メディア研究所内 (72)発明者松田泰昌神奈川県横浜市戸塚区吉田町292番地株式会社日立製作所映像メディア研究所内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Hiroyuki Kumai, Inventor Hiroyuki Kumai, 292 Yoshida-cho, Totsuka-ku, Yokohama, Kanagawa, Ltd.Inside the Hitachi Media Visual Media Research Institute (72) Yasumasa Matsuda, 292 Yoshida-cho, Totsuka-ku, Yokohama, Kanagawa Hitachi, Ltd. Visual Media Research Center

Claims

[Claims]

1. A handwriting input means for inputting handwritten data of handwritten characters, a standard writing pattern which is a standard writing pattern of the character, a character code representing the character, and a A character recognition dictionary storage unit that stores a character type and a matching degree that indicates the degree to which the writing data input by the handwriting input unit resembles the standard writing pattern are obtained.
A character recognition means for outputting the character code and the character type corresponding to the standard writing pattern for which the obtained coincidence is larger than a predetermined value, together with the obtained coincidence, as character recognition information, and the character output by the character recognizing means. A character recognition information storage unit that stores recognition information, and a predetermined probability value that indicates, for each character type, the probability that the character that appears next to the character of the character type will be. According to the stored character type connection information storage means and the probability value stored in the character type connection information storage means, the character recognition is performed for each character recognition information stored in the character recognition information storage means. An addition point calculating means for calculating an addition point to be added to the coincidence degree indicated by the information, and outputting character recognition information having the largest total value of the coincidence degree and the calculated addition point as a recognition result; Written by And a display unit for displaying a character represented by a character code indicated by the character recognition information output as the recognition result by the addition point calculation unit.

2. The character input device according to claim 1, wherein character recognition information including a character code representing a character displayed by said display means is replaced with character recognition information other than said character recognition information. The display means is further provided with a receiving means for receiving an instruction, and when the receiving means receives the instruction for replacing, the displayed character is represented by a character code indicated by the character recognition information instructed to replace the character. When the display means changes and displays the character, the addition point calculation means displays the character as the recognition result after the character recognition information including the character code representing the character. A character input device, which calculates the addition points again for all the character recognition information outputting the character recognition information, and outputs the recognition result again.

3. A handwriting input means for inputting handwriting data of a handwritten character, a standard writing pattern which is a standard writing pattern of the character, a character code representing the character, and a character code of the character. A character recognition dictionary storage unit that stores a character type and a matching degree that indicates the degree to which the writing data input by the handwriting input unit resembles the standard writing pattern are obtained.
A character recognition means for outputting the character code and the character type corresponding to the standard writing pattern for which the obtained coincidence is larger than a predetermined value, together with the obtained coincidence, as character recognition information, and the character output by the character recognizing means. Character recognition information storage means for storing recognition information, concatenated character information storage means for storing concatenated character information in which character codes representing a plurality of characters that are elements forming one word are stored, and the concatenated character information storage Based on the concatenated character information stored in the means, the addition point to be added to the matching degree indicated by the character recognition information is calculated for each character recognition information stored in the character recognition information storage means, and the matching is calculated. The character recognition information having the largest sum of the degree and the calculated addition point, as the recognition result, the addition point calculation means, the handwriting data input by the handwriting input means, and the above The addition point calculation means includes a display means for displaying a character represented by a character code indicated by the character recognition information output as a recognition result, and the concatenated character information storage means is a plurality of predetermined character types indicating the same character type. If the character recognition information is continuously output as the recognition result, it is considered that the character represented by the character code indicated by the plurality of character recognition information is a plurality of characters that are elements forming one word. A character input device characterized in that a set of character codes indicated by the plurality of character recognition information is stored as the concatenated character information.

4. A handwriting input means for inputting handwritten data of handwritten characters, a standard writing pattern that is a standard writing pattern of the character, a character code representing the character, and a character code of the character, for each character. A character recognition dictionary storage unit that stores a character type and a matching degree that indicates the degree to which the writing data input by the handwriting input unit resembles the standard writing pattern are obtained.
A character recognition means for outputting the character code and the character type corresponding to the standard writing pattern for which the obtained coincidence is larger than a predetermined value, together with the obtained coincidence, as character recognition information, and the character output by the character recognizing means. Character recognition information storage means for storing recognition information, concatenated character information storage means for storing concatenated character information in which character codes representing a plurality of characters that are elements forming one word are stored, and the concatenated character information storage Based on the concatenated character information stored in the means, the addition point to be added to the matching degree indicated by the character recognition information is calculated for each character recognition information stored in the character recognition information storage means, and the matching is calculated. Degree calculation means for outputting the character recognition information having the largest sum of the degree and the calculated addition point as a recognition result, and the character including the character code representing the character displayed by the display means. Receiving means for receiving an instruction to replace the recognition information and other character recognition information other than the character recognition information, handwriting data input by the handwriting input means, and the addition point calculation means output as a recognition result. Display means for displaying the character represented by the character code indicated by the character recognition information, and the display means replaces the displayed character when the receiving means receives an instruction to replace the character. The display is changed to a character represented by the character code indicated by the character recognition information instructed to the effect, and the concatenated character information storage means is predetermined when the acceptance means accepts the instruction to replace. If a plurality of character recognition information indicating the same character type is continuously output as the recognition result, the character represented by the character code indicated by the plurality of character recognition information indicates one word. Character input characterized in that it is regarded as a plurality of characters that are elements to be formed, and a combination of character codes indicated by the plurality of character recognition information is stored as the concatenated character information. apparatus.

5. The character input device according to claim 4, wherein the addition point calculation means, when the display means changes and displays a character, after the character recognition information including a character code representing the character, A character input device, which calculates the addition point again for all the character recognition information that has already output the character recognition information as the recognition result, and outputs the recognition result again.

6. The character input device according to claim 3, 4 or 5, wherein the addition point calculation means adds points for character recognition information based on the concatenated character information stored in the concatenated character information storage means. When calculating, the concatenated character information including a combination of the character code indicated by the character recognition information and the character code indicated by the character recognition information output as the previous recognition result is stored in the concatenated character information storage means. A character input device, characterized in that the addition point for the character recognition information is set to a predetermined value.

7. The character input device according to claim 3, 4, 5 or 6, wherein, for each character type, a probability of what character type the character that appears next to the character type is The character point connection information storage means for storing a predetermined probability value is further provided, and the addition point calculation means is any one of the character recognition information stored by the character recognition information storage means. When the concatenated character information including a combination of the character code indicated by the character recognition information and the character code indicated by the character recognition information output as the previous recognition result is stored in the concatenated character information storage means, The addition point is calculated based on the concatenated character information stored in the concatenated character information storage means, and if not, the addition point is further calculated according to the probability value stored in the character type concatenation information storage means. Sentences characterized by calculations Input device.

8. The character input device according to claim 1, 2 or 7, wherein the probability value stored in said character type connection information storage means is such that the character type of the character that appears next is the character type of the preceding character. The probability value to be the same as the character type is larger than the probability value that the character type of the character that appears next is different from the character type of the previous character. When calculating the addition point for the character recognition information according to the probability value stored in the character type connection information storage means, the character type indicated by the character recognition information and the character recognition information output as the previous recognition result indicate A character input device, wherein a probability value corresponding to the character recognition information is obtained from a character type, and the larger the obtained probability value is, the larger the addition point for the character recognition information is.

9. The character input device according to claim 8, wherein the character recognition information including the same character type as the character type indicated by the character recognition information output as the recognition result by the addition point calculation means is continuous until then. And further includes a continuous character type number calculating means for calculating the number of continuous character types indicating whether or not it is output as a recognition result, wherein the addition point calculating means further includes the continuous character type calculated by the continuous character type number calculating means. A character input device characterized in that the larger the number, the smaller the addition point for the character recognition information including the same character type as the character type indicated by the character recognition information output as the previous recognition result.

10. A first step of outputting a character code representing one or more characters having a high degree of similarity to the handwritten character's writing data, and a character code currently output. Or 1
And a second step of outputting the result as a recognition result. In the second step, the character code output as the previous recognition result is displayed among the one or more character codes currently output. A character recognition method characterized by preferentially outputting a character code that represents a character having the same character type as the character type of the specified character as a recognition result.

11. One of a first step of outputting a character code representing one or more characters having a high degree of similarity to the handwritten character's writing data and a character code currently output. Or 1
And a second step of outputting one as a recognition result, and in the second step, the character type of the character represented by the character code among the one or more character codes currently output. And the character type of the character represented by the character code output as the previous recognition result is the same predetermined character type, and the character code and the character code output as the previous recognition result are paired. The character code that has been output in the past as two consecutive recognition results,
A character recognition method characterized by preferentially outputting as a recognition result.