JPH09106433A

JPH09106433A - Character recognizing processor

Info

Publication number: JPH09106433A
Application number: JP7289402A
Authority: JP
Inventors: Kenji Okano; 健治岡野
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1995-10-11
Filing date: 1995-10-11
Publication date: 1997-04-22

Abstract

PROBLEM TO BE SOLVED: To obtain a result in a quick response by selecting one kind of combination of a character recognizing part and a dictionary whenever one character is inputted and using one of the combinations as a most suitable one. SOLUTION: Characters are inputted in an order from the first one through the use of a character input part 5. Here, the stroke number of the character, namely a picture, and the stroke shape of it, namely the string of coordinate points, are compared with a dictionary 13 to recognize the character. A character recognition part 10 extracts the stroke number and the shape of the stroke from the inputted character and compares them with data in the dictionary 13 to select the character which is the most similar to the piece of data as the recognizing result. Then several characters from the most similar one are outputted as the recognizing result. In addition recognizing result from a first to fifth candidates concerning each character are displayed. Then a handwriting user inputting characters through the use of a character input part 5 collates the results with the characters inputted with a keyboard, a mouse, a pen, etc., to specify correctly recognized characters.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、手書き文字の如き
同一の文字であっても各種の字形で入力される文章を読
み取って文字認識する文字認識処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition processing device for recognizing characters by reading sentences input in various character shapes even if they are the same characters such as handwritten characters.

【０００２】[0002]

【従来の技術】情報処理装置に各種のデータ等を入力す
る方法として、文字のイメージを読み取りこれを認識処
理してデータ化する方法がある。これには、タイプ等で
印刷した文書や手書きにより入力する文書等が対象とな
る。例えば、手書き文字を認識する装置では、ペンを用
いてタブレット上で文字を描くことにより、その位置座
標列を文字のストロークとして受け入れ、辞書と照合し
て認識するものがある。このような手書き文字の入力に
は、予めその筆記者の文字認識に適合する辞書を用意す
る。また、文字の認識方法には文字の切り出し方法やパ
ターンとの比較方法等の各種の方法がある。このような
各種の認識方法と辞書とを組み合せる場合には認識率が
異なってくる。2. Description of the Related Art As a method of inputting various data and the like to an information processing apparatus, there is a method of reading an image of a character and recognizing and processing the image to convert it into data. This applies to documents printed in type and the like, and documents input by handwriting. For example, in a device for recognizing a handwritten character, there is a device that draws a character on a tablet with a pen, accepts the position coordinate sequence as a stroke of the character, and collates and recognizes the stroke. For inputting such handwritten characters, a dictionary suitable for the character recognition of the writer is prepared in advance. There are various methods for recognizing characters, such as a method of cutting out a character and a method of comparing with a pattern. When combining such various recognition methods and dictionaries, the recognition rates differ.

【０００３】そこで、通常は筆記者が自己の入力した文
字を適切に認識する辞書や認識方法の組合せを予め選択
している。一方、このような選択作業を自動化するため
に、認識用のプログラムと認識用の辞書の全ての組合せ
を用意し、１文字入力する毎に全ての組合せによって認
識処理を行い、その結果から最も適合するものを選択す
るといった方法も紹介されている（特開平５−３２４９
０３号公報）。Therefore, a writer usually selects in advance a dictionary or a combination of recognition methods for appropriately recognizing a character input by the writer. On the other hand, in order to automate such selection work, all combinations of the recognition program and the recognition dictionary are prepared, and the recognition processing is performed by all the combinations every time one character is input, and the result is the most suitable. A method of selecting a desired one is also introduced (Japanese Patent Laid-Open No. 5-3249).
03 publication).

【０００４】[0004]

【発明が解決しようとする課題】ところで、上記のよう
な従来の文字認識処理装置には次のような解決すべき課
題があった。認識方法や文字認識のための辞書をいくつ
か用意すると、これらの組合せが多数存在することか
ら、１文字入力される毎にこれらの全ての組合せについ
て認識処理を行うと、処理量が増大し、認識結果を出力
するまでの時間が非常に長くなる。従って、複数の利用
者が存在し、多くの辞書が用意されているような場合に
は、筆記者が文字を入力してから認識結果が表示される
までのレスポンスが悪く、操作性の点で問題があった。
また、認識結果が多数出力されるので、多量の認識結果
を一時的に記憶するための容量の大きいメモリが必要に
なる。The conventional character recognition processing apparatus as described above has the following problems to be solved. When several recognition methods and dictionaries for character recognition are prepared, there are many combinations of these, so performing recognition processing for all these combinations each time one character is input increases the processing amount, It takes a very long time to output the recognition result. Therefore, when there are multiple users and many dictionaries are prepared, the response from the writer's input of characters until the recognition result is displayed is poor, and in terms of operability. There was a problem.
Moreover, since a large number of recognition results are output, a memory having a large capacity for temporarily storing a large amount of recognition results is required.

【０００５】[0005]

【課題を解決するための手段】本発明は以上の点を解決
するため次の構成を採用する。〈構成１〉本発明の文字認識処理装置は、認識対象とな
る文字を入力する文字入力部と、その文字を予め用意し
た辞書を参照して認識処理する複数の文字認識部と、文
字が１字入力する度に、予め用意した複数の辞書と文字
認識部との組合せを、所定の順に切り換えて選択する組
み合せ選択部と、認識結果を候補文字のリストとして表
示する表示部と、この表示部の表示した認識結果を評価
して、正しい認識結果を指定する指定部と、指定の結果
に基づいて、認識率の高い辞書と文字認識部の組合せを
その文書の認識処理用として決定する制御部とを備え
る。The present invention employs the following structure to solve the above problems. <Structure 1> The character recognition processing device of the present invention includes a character input unit for inputting a character to be recognized, a plurality of character recognition units for performing recognition processing by referring to a dictionary prepared in advance, A combination selection unit that selects a combination of a plurality of prepared dictionaries and a character recognition unit each time a character is input, in a predetermined order, a display unit that displays the recognition result as a list of candidate characters, and this display unit. The evaluation unit that evaluates the displayed recognition result, and a control unit that specifies a correct recognition result, and a control unit that determines a combination of a dictionary and a character recognition unit with a high recognition rate for the recognition process of the document based on the specified result With.

【０００６】〈説明〉入力される認識対象となる文字
は、手書き文字でもタイプ文字でもよい。この発明は同
一の文字であっても２種以上の字形で入力されるとき、
各字形に対応するパターンを記憶した２以上の辞書を選
択して参照し、文字認識を行う際に適用される。複数の
文字認識部はそれぞれ別々の認識処理方法で文字認識を
行う。１字入力する度に複数の辞書と文字認識部の組合
せのうちから１つを選択してこれを参照し、その認識結
果を出力するから、その都度全ての辞書と文字認識部の
組合せにより認識処理をする場合に比べて認識処理のレ
スポンスが速い。また、出力のバッファのためのメモリ
も少容量でよい。この選択対象となる辞書や文字認識部
は、全ての辞書の中から予め絞り込まれていてもよい。<Explanation> The input character to be recognized may be a handwritten character or a type character. This invention, even if the same character is entered in two or more types,
It is applied when character recognition is performed by selecting and referring to two or more dictionaries that store patterns corresponding to each character shape. The plurality of character recognition units perform character recognition by different recognition processing methods. Each time one character is input, one of the combinations of the dictionary and the character recognition unit is selected and referenced, and the recognition result is output. Therefore, each time the character is recognized by the combination of the dictionary and the character recognition unit. The response of the recognition process is faster than when processing is performed. Also, the memory for the output buffer may have a small capacity. The dictionary or character recognition unit to be selected may be narrowed down in advance from all dictionaries.

【０００７】辞書と文字認識部の組合せは１字あるいは
数文字認識処理する度に違うものに切り換えられ、その
都度あるいは数文字認識する毎に表示されてオペレータ
の評価を受ける。この処理を適当量の文字について繰り
返した結果、認識率の高い辞書と文字認識部の組合せが
その文書全体の認識に最適なものとして使用される。な
お、このとき最適なものとして使用される組合せは１種
でなく２種以上でもよい。候補文字とは、辞書との照合
の結果、最も良く似た字から順にピックアップした結果
で、認識不能な場合には候補文字が無いこともある。The combination of the dictionary and the character recognition section is switched to a different one each time one character or several characters are recognized, and is displayed each time or every time several characters are recognized and evaluated by the operator. As a result of repeating this processing for an appropriate amount of characters, the combination of the dictionary and the character recognition unit having a high recognition rate is used as the optimum one for recognition of the entire document. The combination used as the optimum one at this time may be two or more kinds instead of one kind. The candidate character is a result of picking up the most similar characters in order as a result of collation with the dictionary, and there may be no candidate character when it is unrecognizable.

【０００８】〈構成２〉予め設定された数の文字を認識
した後に、表示部の表示した認識結果を評価して、正し
い認識結果を指定する指定部と、指定の結果に基づい
て、認識率の高い辞書と文字認識部の組合せをその文書
の認識処理用として決定する制御部とを備えるとよい。〈説明〉適当な数の文字について、用意された辞書を順
に適用して認識結果が得られたときには、辞書等の組合
せ決定のための作業を終える。このために、適当な認識
文字数等を予め設定しておく。<Structure 2> After recognizing a preset number of characters, the recognition result displayed on the display unit is evaluated, and a recognition unit that specifies a correct recognition result and a recognition rate based on the specified result A control unit that determines a combination of a high-performance dictionary and a character recognition unit for recognition processing of the document. <Explanation> When a recognition result is obtained by sequentially applying prepared dictionaries for an appropriate number of characters, the work for determining a combination of dictionaries and the like is completed. Therefore, an appropriate number of recognized characters and the like are set in advance.

【０００９】〈構成３〉予め設定された数の文字を認識
した後に、表示部の表示した認識結果を評価して、正し
い認識結果を指定する指定部と、指定の結果、認識率の
低い辞書と文字認識部との組合せを選択の対象から除外
する組み合せ選択部とを備えることが好ましい。〈説明〉複数の辞書を用いた認識とその評価を繰り返す
ことにより、適切な辞書の絞り込みを行うが、その際に
不適切な辞書を除外することもよい。<Structure 3> After recognizing a preset number of characters, the recognition result displayed on the display unit is evaluated to specify a correct recognition result, and a dictionary having a low recognition rate as a result of the specification. It is preferable to include a combination selection unit that excludes the combination of the character recognition unit and the character recognition unit from the selection target. <Explanation> Appropriate dictionaries are narrowed down by repeating recognition using a plurality of dictionaries and evaluation thereof, but in that case, inappropriate dictionaries may be excluded.

【００１０】[0010]

【発明の実施の形態】以下、本発明の実施の形態を具体
例を用いて説明する。〈具体例〉図１は、本発明の文字認識処理装置の具体例
を示す概略図である。この装置は、ディスプレイ１と、
キーボード２及び制御部３を備えた情報処理装置に対し
記憶部４や文字入力部５等を接続したものである。ディ
スプレイ１はこの場合、文字認識結果の表示や評価のた
めに使用される。なお、本発明では、このディスプレイ
１を表示部と呼んでいる。また、キーボード２は評価の
結果を入力したり、その他の操作に用いられる。本発明
ではこれを指定部と呼んでいる。また、制御部３はプロ
セッサ等を備えており、文字認識処理プログラムやその
他の制御機構を備えている。記憶部４はハードディスク
やフロッピーディスク等から構成され、この内部に文字
認識のために参照される各種の辞書１３が格納されてい
る。なお、これらの辞書１３は複数の筆記者の手書き入
力あるいは各種の字体の文字が入力された場合に、その
認識を行うために予め用意されたものである。このよう
な辞書の生成方法等は従来よく知られた方法であり、こ
こでは詳細な説明は省略する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to specific examples. <Specific Example> FIG. 1 is a schematic view showing a specific example of the character recognition processing apparatus of the present invention. This device has a display 1 and
A storage unit 4, a character input unit 5, and the like are connected to an information processing apparatus including a keyboard 2 and a control unit 3. In this case, the display 1 is used for displaying and evaluating the character recognition result. In the present invention, this display 1 is called a display unit. The keyboard 2 is also used for inputting evaluation results and other operations. In the present invention, this is called a designation unit. The control unit 3 includes a processor and the like, and includes a character recognition processing program and other control mechanisms. The storage unit 4 is composed of a hard disk, a floppy disk or the like, and stores various dictionaries 13 referred to for character recognition therein. It should be noted that these dictionaries 13 are prepared in advance in order to recognize handwritten input by a plurality of writers or characters of various fonts. A method of generating such a dictionary is a well-known method in the related art, and a detailed description thereof will be omitted here.

【００１１】制御部３には、この装置の文字認識処理の
ために複数の文字認識部１０が組み込まれている。これ
らの文字認識部１０は、それぞれ別々の認識方法で文字
認識処理を行う処理プログラム等から構成されており、
例えば文字の切り出し方法、ストロークの認識方法、辞
書との照合方法等、これまでよく知られた各種の方法を
指している。これらは入力された文字の性質から最も適
したものが選ばれるよう複数用意されている。選択部１
１は、文字認識の際に予め設定された順番等で、文字認
識部１０と辞書１３のいずれかの組合せを選択して文字
認識処理を実行させる部分である。文字入力部５はペン
６を用いて文字を入力する場合、そのストロークを座標
点列として検出し、制御部３に送り込むよく知られた装
置である。A plurality of character recognition units 10 are incorporated in the control unit 3 for the character recognition processing of this apparatus. These character recognition units 10 are composed of processing programs and the like for performing character recognition processing by different recognition methods,
For example, various well-known methods such as a character cutting method, a stroke recognition method, and a dictionary matching method are used. A plurality of these are prepared so that the most suitable one can be selected from the characteristics of the input characters. Selector 1
Reference numeral 1 denotes a portion for selecting a combination of the character recognition unit 10 and the dictionary 13 and executing the character recognition process in a preset order or the like at the time of character recognition. When inputting a character using the pen 6, the character input unit 5 is a well-known device that detects the stroke as a sequence of coordinate points and sends it to the control unit 3.

【００１２】なお、本発明の装置は、このような手書き
文字をオンラインで入力する場合に特に適するが、既に
手書き文字がフロッピーディスク７等の記憶装置に記録
されている場合でも同様に適用できる。また、この他に
イメージスキャナやＣＣＤカメラ等、光学的な読取り装
置等を用いてデータを入力する場合にも適用できる。更
に、本発明装置は、文字入力部５はキーボードやマウス
等のポインティングデバイス等に置き換えてディスプレ
イ１上に手書き文字を描くといった方法で文字入力を行
う場合にも適用可能である。Although the apparatus of the present invention is particularly suitable for inputting such handwritten characters online, it can be similarly applied even when the handwritten characters are already recorded in a storage device such as the floppy disk 7. In addition to this, it is also applicable to the case of inputting data using an optical reading device such as an image scanner or a CCD camera. Further, the device of the present invention can be applied to a case where the character input unit 5 is replaced with a pointing device such as a keyboard or a mouse, and a character is input on the display 1 by handwriting.

【００１３】図２に、辞書と文字認識部の組合せ説明図
を示す。即ち、図２は文字認識部と辞書の組合せによっ
てどのような結果が得られるかを説明するための図であ
る。例えば、ここに筆記者がＡ，Ｂ，Ｃ，Ｄ，Ｅの５人
いるとする。この場合、文字認識部がＰ，Ｑ，Ｒの３種
類、辞書がＸ，Ｙ，Ｚの３種類用意されていたとする。
このとき、筆記者Ａについては文字認識部をＱとし、辞
書をＹとした場合に最も認識率が高いとすれば、これが
適合する組合せとなる。以下、筆記者Ｂについては文字
認識部はＱ、辞書はＸ、筆記者Ｃ，Ｄ，Ｅについては、
それぞれ文字認識部がＰ，Ｒ，Ｑ、辞書がＸ，Ｚ，Ｙと
いった組合せが適合する組合せとなっている。本発明に
おいては、このような筆記者毎の文字認識部と辞書の組
合せを可能な限り自動的に迅速に決定することを主構成
としている。FIG. 2 is an explanatory view of a combination of the dictionary and the character recognition section. That is, FIG. 2 is a diagram for explaining what kind of result is obtained by the combination of the character recognition unit and the dictionary. For example, suppose there are five writers A, B, C, D, and E here. In this case, it is assumed that three types of character recognition units P, Q, and R and three types of dictionaries X, Y, and Z are prepared.
At this time, for the writer A, if the character recognition unit is Q and the dictionary is Y, and the recognition rate is the highest, this is a suitable combination. For the writer B, the character recognition unit is Q, the dictionary is X, and the writers C, D, and E are
Combinations such as P, R, Q for the character recognition unit and X, Z, Y for the dictionary are suitable combinations. In the present invention, the main configuration is to determine such a combination of the character recognition unit and the dictionary for each writer automatically and as quickly as possible.

【００１４】図３には、辞書と文字認識部の組合せ番号
説明図を示す。上記のように文字認識部がＰ，Ｑ，Ｒの
３種類、辞書がＸ，Ｙ，Ｚの３種類存在するとすれば、
この図に示すように、両者の組合せは９通り生じる。こ
の実施例では、これらの組合せにそれぞれ組合せ番号Ｓ
を付け、かつ文字が１文字入力される毎に、このうちの
いずれかの組合せ番号Ｓを選択し、認識処理を実行す
る。この選択方法には、例えば組合せ番号Ｓが１番から
２，３，４と順に変化させる方法と、常にある組合せが
最後になるのを防止するためにその都度ランダムに組合
せ番号の配列を決定して順序を自由に変化させる方法が
ある。また、予め不必要な組合せは除外することもでき
る。FIG. 3 is an explanatory view of combination numbers of the dictionary and the character recognition section. If there are three types of character recognition units P, Q, and R and three types of dictionaries X, Y, and Z as described above,
As shown in this figure, there are nine combinations of the two. In this embodiment, a combination number S is assigned to each of these combinations.
Each time one character is input, the combination number S is selected and the recognition process is executed. This selection method includes, for example, a method in which the combination number S is sequentially changed from 1 to 2, 3, and 4, and a sequence of combination numbers is randomly determined in order to prevent a certain combination from being the last. There is a method to change the order freely. Also, unnecessary combinations can be excluded in advance.

【００１５】図４には、入力文書の例説明図を示す。こ
の例では、１文字から８文字目まで「私は社会人で
す。」という文字が順に入力される。一方、図５には、
例えば２番目の文字についてある組合せを選定したとき
の候補文字表示例説明図を示す。ディスプレイ１には２
番目の文字を認識した結果が、例えばこの図に示すよう
に、５種類表示される。この５種類が、例えば１番左か
ら順に可能性の高いものを並べる。即ち、対象となる文
字と辞書とを照合し、辞書の各文字と入力された文字と
の差に相当する距離データが得られるが、この距離デー
タが最も近いものを第１候補として１番左に置く。以
下、第２〜第５候補まで表示される。筆記者はこのよう
な表示を見ながら、図１に示すキーボード２を用いて実
際の入力文書の該当する文字と一致するものを指定す
る。このようにして、文字認識結果の評価が行われる。FIG. 4 shows an example explanatory diagram of an input document. In this example, the characters "I am a member of society" are sequentially input from the first character to the eighth character. On the other hand, in FIG.
For example, an explanatory view of a candidate character display example when a certain combination is selected for the second character is shown. 2 for display 1
As a result of recognizing the th character, for example, five types are displayed as shown in this figure. Of these five types, for example, the ones with high possibility are arranged in order from the leftmost. That is, the target character is collated with the dictionary, and distance data corresponding to the difference between each character in the dictionary and the input character is obtained. The one having the closest distance data is the first candidate and is the leftmost. Put on. Hereinafter, the second to fifth candidates are displayed. While seeing such a display, the writer uses the keyboard 2 shown in FIG. 1 to specify what matches the corresponding character of the actual input document. In this way, the character recognition result is evaluated.

【００１６】〈動作〉以下、図６を用いて上記のような
入力文書に対し具体的な本発明の装置の動作を説明す
る。図６は、認識結果表示例説明図である。まず、図１
に示す装置の文字入力部５を用いて図４に示した文字が
第１番目から順に入力される。このとき、その文字のス
トロークの数、即ち画数と、そのストロークの形状、即
ち座標点列とを辞書と比較して文字認識を行う。辞書に
は複数の人から認識対象となる文字を収集し、画数やス
トロークの形を検出して平均化処理等を行ったデータを
記憶してある。複数の辞書は、それぞれ例えば別々の人
から収集したデータから生成し、あるいは各種のデータ
を元に生成してある。文字認識部１０は、このような入
力文字から画数やストロークの形を抽出し、辞書のデー
タと比較して最も類似度の高い文字を認識結果とする。
そして、図５に示すように類似度の高い文字から数文字
を認識結果として出力する。この文字認識部１０は文字
の画数やストロークの始点と終点を結ぶベクトル、スト
ロークの終点と次のストロークの始点を結ぶベクトル等
を、入力された順番に辞書とマッチングして認識を行
う。この図に示すように、１文字目、２文字目、３文字
目という順に８文字目まで文字認識部と辞書の組合せ番
号を１，２，３，４と設定し、これを２回繰り返してい
る。そして、各文字について、第１候補から第５候補ま
での認識結果が表示される。文字入力部５を用いて文字
を入力した筆記者はキーボードやマウスあるいはペン等
によって入力文字と照合し、正しく認識した文字を指定
する。例えば、２番目の文字については、番号が２の組
合せが使用され、第１候補は誤認識している。従って、
筆記者は第２候補の「は」を指定する。従って、最初の
文字は第１候補、２番目の文字は第２候補、３番目の文
字は第１候補、４番目の文字は第１候補、５番目の文字
は第３候補、６番目の文字は第１候補、７番目の文字は
第１候補、８番目の文字は第３候補というように指示が
入力される。図１に示す制御部３はこの入力を受け入れ
て適合度の計算を行う。なお、このような正解の指定は
第１候補については省略して良いようにし、負担を軽減
することもできる。<Operation> The specific operation of the apparatus of the present invention for the above-described input document will be described below with reference to FIG. FIG. 6 is a diagram illustrating a recognition result display example. First, FIG.
The characters shown in FIG. 4 are input in order from the first using the character input unit 5 of the device shown in FIG. At this time, character recognition is performed by comparing the number of strokes of the character, that is, the number of strokes, and the shape of the stroke, that is, the coordinate point sequence, with a dictionary. The dictionary stores data obtained by collecting characters to be recognized from a plurality of people, detecting the number of strokes and stroke shapes, and performing averaging processing. The plurality of dictionaries are generated from, for example, data collected from different persons, or based on various data. The character recognition unit 10 extracts the number of strokes and the stroke shape from such input characters, compares them with the dictionary data, and sets the character having the highest degree of similarity as the recognition result.
Then, as shown in FIG. 5, several characters are output as a recognition result from the character having high similarity. The character recognizing unit 10 recognizes the number of strokes of a character, a vector connecting a start point and an end point of a stroke, a vector connecting an end point of a stroke and a start point of the next stroke, and the like in a matching order with a dictionary. As shown in this figure, the combination numbers of the character recognition unit and the dictionary are set to 1, 2, 3, 4 up to the 8th character in the order of the 1st character, the 2nd character, and the 3rd character, and this is repeated twice. There is. Then, for each character, the recognition results from the first candidate to the fifth candidate are displayed. The writer who has input a character using the character input unit 5 collates the input character with a keyboard, a mouse, a pen or the like, and designates the correctly recognized character. For example, for the second character, the combination with the number 2 is used, and the first candidate is erroneously recognized. Therefore,
The writer specifies the second candidate, "ha". Therefore, the first character is the first candidate, the second character is the second candidate, the third character is the first candidate, the fourth character is the first candidate, the fifth character is the third candidate, the sixth character. Is the first candidate, the seventh character is the first candidate, the eighth character is the third candidate, and so on. The control unit 3 shown in FIG. 1 receives this input and calculates the goodness of fit. Note that such correct answer designation may be omitted for the first candidate, and the burden can be reduced.

【００１７】図７に、このような評価の結果を適合度と
して算出した説明図を示す。ここでは、正解文字が第ｉ
候補の場合、その文字の適合度をＶ（ｉ）と表現する。
候補中に正解が無い場合は適合度は“０”となる。従っ
て、Ｖ（１）は１．０となり、Ｖ（５）は０．２とな
る。こうして、図６に示した結果から適合度を設ける
と、この図に示すように、第１番目、３番目、４番目、
６番目及び７番目の文字は適合度が１．０となるが、そ
の他の文字は適合度が０．８あるいは０．６となってい
る。ここで、各文字について、それぞれ文字認識部と辞
書の組合せを切り換えいているため、同一の組合せを用
いた場合の適合度を合計する。FIG. 7 shows an explanatory diagram in which the result of such evaluation is calculated as the goodness of fit. Here, the correct character is the i-th
In the case of a candidate, the matching degree of the character is expressed as V (i).
If there is no correct answer in the candidates, the goodness of fit is “0”. Therefore, V (1) becomes 1.0 and V (5) becomes 0.2. In this way, if the goodness of fit is provided from the result shown in FIG. 6, as shown in this figure, the first, third, fourth,
The sixth and seventh characters have a goodness of fit of 1.0, while the other characters have a goodness of fit of 0.8 or 0.6. Here, since the combination of the character recognition unit and the dictionary is switched for each character, the matching degrees when the same combination is used are summed up.

【００１８】図８には、この適合度の合計結果の説明図
を示す。この図に示すように、組合せ番号１，２，３，
４の適合度の合計結果を見ると、組合せ番号３が最も適
合していると判断できる。こうして、文字認識部と辞書
との組合せが決定される。なお、上記の場合、組合せの
数を４組とし、１文字ずつ組合せを変えて、これを２回
ずつ繰り返すために８文字の認識を行った段階で最終的
な評価を行った。このように、適合度の合計を一定の文
字だけ認識処理した後に比較することによって最適な組
合せを決定できる。この場合の文字数は組合せの数の正
数倍となる。FIG. 8 shows an explanatory diagram of the result of the sum of the goodness of fit. As shown in this figure, combination numbers 1, 2, 3,
It can be judged that the combination number 3 is the most suitable when looking at the total result of the matching degrees of 4. In this way, the combination of the character recognition unit and the dictionary is determined. In the above case, the number of combinations was set to 4, and the combination was changed by one character, and the final evaluation was performed when 8 characters were recognized in order to repeat this twice. In this way, the optimum combination can be determined by comparing the total of the matching degrees after recognizing only certain characters. In this case, the number of characters is a positive multiple of the number of combinations.

【００１９】なお、上記の例では、１文字毎に別の組合
せを選択する例を説明したが、数文字は同一の組合せを
用い、続く数文字は別の組合せを用いるといった方法も
可能である。また、上記のような適合度を用いる他、各
種の評価方法を採用して差し支えない。また、適合度の
高い組合せが２種以上ある場合、１つの組合せだけでな
く、２種の組合せを用いてその文書の認識処理用として
使用することも可能である。また、予め一定の文字数を
設定しておき、その文字数の認識が終えると、一定の組
合せを選択するという実施例を説明したが、評価の都度
不適当な組合せを１つずつ削除したり、あるいは評価の
都度上位半数の組合せを残し、他の半数を削除するとい
った方法によって何段階かで最終的な組合せを決定する
こともできる。このようにすることによって、可能な限
り適切な辞書と文字認識部の組合せを決定してその後の
処理を容易にすることができる。In the above example, an example in which another combination is selected for each character has been described, but a method of using the same combination for several characters and another combination for the following several characters is also possible. . Further, in addition to using the degree of conformity as described above, various evaluation methods may be adopted. Further, when there are two or more combinations having a high degree of conformity, it is possible to use not only one combination but also two combinations for recognition processing of the document. Further, although an example in which a certain number of characters is set in advance and a certain combination is selected when the recognition of the number of characters is finished has been described, an inappropriate combination is deleted one by one each time evaluation is performed, or It is also possible to decide the final combination in several stages by a method of leaving the combination of the upper half for each evaluation and deleting the other half. By doing so, it is possible to determine the most suitable combination of the dictionary and the character recognition section and facilitate the subsequent processing.

【００２０】また、一旦決定した組合せは、その後適当
な時間の後、初期状態に戻し、改めて組合せの選択処理
を再開したり、一定の文字数以上文字が入力された場合
に、同様の選択処理を再開するといった方法も可能であ
る。The combination once decided is returned to the initial state after an appropriate time, the selection process of the combination is restarted again, or the same selection process is performed when more than a certain number of characters are input. A method of restarting is also possible.

【００２１】〈効果〉以上説明した文字認識処理装置
は、１文字入力される毎に１種類の文字認識部と辞書と
の組合せを選択し、これによって認識した結果を順に表
示し、所定回数これを繰り返していずれかの組合せを最
も適合するものとして使用するようにしたので、１文字
毎の認識処理は１つの組合せとなり、その処理量を少な
くし、速いレスポンスで結果を得ることができる。ま
た、文字認識部側で認識結果を一時保存するバッファメ
モリを設けると、１文字毎にその認識結果を保持するた
めのメモリは少量でよく、メモリの小型化と低コスト化
を図ることができる。なお、本発明の装置は、ＯＣＲ
（光学的文字読取り装置）、コンピュータ、電子手帳、
電子記帳台、その他各種の文字を認識する装置に広く採
用することができる。<Effect> The character recognition processing apparatus described above selects a combination of one type of character recognition unit and a dictionary each time one character is input, displays the recognition results in order, and stores this result a predetermined number of times. Since any combination is repeatedly used as the most suitable one, the recognition processing for each character becomes one combination, the processing amount can be reduced, and the result can be obtained with a fast response. Further, if a buffer memory for temporarily storing the recognition result is provided on the character recognition unit side, a small amount of memory is required to hold the recognition result for each character, and it is possible to reduce the size and cost of the memory. . The device of the present invention is
(Optical character reader), computer, electronic notebook,
It can be widely applied to electronic book readers and other devices that recognize various characters.

[Brief description of the drawings]

【図１】本発明の文字認識処理装置実施例を示す概略図
である。FIG. 1 is a schematic diagram showing an embodiment of a character recognition processing device of the present invention.

【図２】辞書と文字認識部の組合せ説明図である。FIG. 2 is an explanatory diagram of a combination of a dictionary and a character recognition unit.

【図３】辞書と文字認識部の組合せ番号説明図である。FIG. 3 is an explanatory diagram of combination numbers of a dictionary and a character recognition unit.

【図４】入力文書の例説明図である。FIG. 4 is a diagram illustrating an example of an input document.

【図５】候補文字表示例説明図である。FIG. 5 is a diagram illustrating a candidate character display example.

【図６】認識結果表示例説明図である。FIG. 6 is a diagram illustrating a recognition result display example.

【図７】適合度の説明図である。FIG. 7 is an explanatory diagram of a goodness of fit.

【図８】適合度合計結果の説明図である。FIG. 8 is an explanatory diagram of a result of total fitness.

[Explanation of symbols]

１ディスプレイ（表示部）２キーボード（指定部）３制御部４記憶部５文字入力部６ペン１０文字認識部１１選択部１３辞書 1 Display (display unit) 2 Keyboard (designating unit) 3 Control unit 4 Storage unit 5 Character input unit 6 Pen 10 Character recognition unit 11 Selection unit 13 Dictionary

Claims

[Claims]

1. A character input unit for inputting a character to be recognized, a plurality of character recognition units for recognizing the character by referring to a dictionary prepared in advance, and a character prepared in advance each time one character is input. A combination selection unit for selecting and selecting a combination of the plurality of dictionaries and the character recognition unit in a predetermined order, a display unit for displaying the recognition result as a list of candidate characters, and an evaluation of the recognition result displayed on this display unit. And a control unit that determines a combination of a dictionary having a high recognition rate and the character recognition unit for recognition processing of the document based on the result of the designation. Characterized character recognition processing device.

2. A recognition unit which evaluates the recognition result displayed on the display unit after recognizing a preset number of characters, and specifies a correct recognition result, and a recognition rate of the recognition rate based on the specified result. The character recognition processing apparatus according to claim 1, further comprising a control unit that determines a combination of a high dictionary and the character recognition unit for recognition processing of the document.

3. A designation unit for designating a correct recognition result by evaluating a recognition result displayed on the display unit after recognizing a preset number of characters, and a dictionary having a low recognition rate as a result of the designation. The character recognition processing device according to claim 1, further comprising a combination selection unit that excludes a combination with the character recognition unit from selection targets.