JPH06309505A

JPH06309505A - Character recognizing device

Info

Publication number: JPH06309505A
Application number: JP5115344A
Authority: JP
Inventors: Masahiro Kishi; 岸　　正弘
Original assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Current assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Priority date: 1993-04-19
Filing date: 1993-04-19
Publication date: 1994-11-04

Abstract

PURPOSE:To improve the character recognizing factor by means of two types of dictionaries having high precision. CONSTITUTION:The dimension number of character feature value is set by a dimension setting part 1 and then sent to a single character end detecting part 2 and an address generating part 3. An address counter 4 successively sends its count value to the part 2 and an address converting part 5. The part 5 converts the received count value into an address and sends this address to an unknown character memory 6. The part 3 generates an address based on the dimension number set by the part 1 and the count value supplied via the part 2 and sends the address to a dictionary memory 7. The feature value equivalent to a single unknown character is previously written in the memory 6, and four pieces of data are read out of the memory 6 for each dimension of feature value and in the order of addresses and sent to a correlation with metic part 9. The feature value data equivalent to several thousands of characters are previously stored in the memory 7, and four pieces of data are read out of the memory 7 for each dimension of feature value and in the order of addresses and sent to the part 9, respectively.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、辞書パターンを用いて
未知の文字パターンを認識する文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for recognizing an unknown character pattern using a dictionary pattern.

【０００２】[0002]

【従来の技術】従来の文字認識装置は、未知文字から特
徴量を抽出し、予め辞書データとして文字ごとに記憶し
ておいた特徴量と比較し、相関演算により類似度を求
め、最も類似度の大きい文字を認識結果としている。図
６は、従来の文字認識装置の構成を示すブロック図であ
る。図において、アドレスカウンタ４からは、周期的に
アドレス信号が未知文字メモリ６および辞書メモリ７へ
送られる。未知文字メモリ７では、アドレス信号が入力
されると、該当するアドレスに格納されている未知文字
の特徴量が読み出されて相関演算部９へ送られる。辞書
メモリ７では、アドレス信号が入力されると、先頭文字
から末尾文字までの特徴が順に読み出されて相関演算部
９へ送られる。2. Description of the Related Art A conventional character recognition device extracts a feature amount from an unknown character, compares it with a feature amount stored in advance as dictionary data for each character, calculates a similarity degree by a correlation operation, and obtains the highest similarity degree. The large character is used as the recognition result. FIG. 6 is a block diagram showing the configuration of a conventional character recognition device. In the figure, an address signal is periodically sent from the address counter 4 to the unknown character memory 6 and the dictionary memory 7. When the address signal is input to the unknown character memory 7, the characteristic amount of the unknown character stored in the corresponding address is read and sent to the correlation calculator 9. In the dictionary memory 7, when the address signal is input, the features from the first character to the last character are read out in order and sent to the correlation calculator 9.

【０００３】相関演算部９は、送られてきた特徴量をそ
れぞれ比較して、その類似度を相関演算により求めて、
ソーティング部１０へ送る。ソーティング部１０は、得
られた類似度をその都度ソーティングしていき、辞書全
文字の類似度のソーティングが終了したら、最も大きい
類似度をした辞書の文字を認識結果として出力する。こ
こで用いられている辞書は、文字ごとの特徴をＮビット
からなる特徴量としてそれぞれＭ個づつ記憶しておいた
ものであり、辞書の文字総数をＳとすると、辞書の総デ
ータ数はＮ×Ｍ×Ｓビットとなる。なお、一般に１文字
あたりの特徴量の個数を次元数と呼んでいる。The correlation calculator 9 compares the transmitted characteristic quantities and obtains the degree of similarity by the correlation calculation.
Send to the sorting unit 10. The sorting unit 10 sorts the obtained similarities each time, and when the sorting of the similarity of all the characters of the dictionary is completed, the dictionary character having the highest similarity is output as a recognition result. The dictionary used here stores M features each of which is a feature amount consisting of N bits, and when the total number of characters in the dictionary is S, the total number of data in the dictionary is N. × M × S bits. The number of feature quantities per character is generally called the number of dimensions.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、従来の
文字認識装置では、通常品質文字用の辞書か、低品質文
字用の辞書のいずれか一方をのみを備えているため、未
知文字の品質レベルと辞書の品質レベルが対応しない場
合は認識率が低下するという問題があった。例えば、通
常品質文字用の辞書を使用している場合に、かすれやつ
ぶれのある低品質文字を認識すると誤りやすい。また、
認識率を向上させようとして、文字特徴量の構成データ
数を多くした辞書を用いようとしても、相関演算部のハ
ード構成が簡単に変更できないため、高精度の辞書が使
用できないという問題があった。本発明は上記問題点を
解決するためになされたもので、その目的とするところ
は、より高精度の辞書の使用を可能にして認識精度を向
上するとともに、処理時間の効率化をはかることのでき
る文字認識装置を提供することにある。However, since the conventional character recognition device is provided with only one of the dictionary for normal quality characters and the dictionary for low quality characters, the quality level of unknown characters is There is a problem that the recognition rate decreases when the quality level of the dictionary does not correspond. For example, when using a dictionary for normal quality characters, it is easy to make an error when recognizing low quality characters with faintness or crushing. Also,
Even if an attempt is made to use a dictionary having a large number of character feature amount constituent data in order to improve the recognition rate, there is a problem that a highly accurate dictionary cannot be used because the hardware configuration of the correlation calculation unit cannot be easily changed. . The present invention has been made to solve the above problems, and an object of the present invention is to enable the use of a highly accurate dictionary to improve the recognition accuracy and to improve the efficiency of the processing time. The object is to provide a character recognition device that can do this.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するため
に、第１の発明は、未知の文字パターンから特徴量を抽
出して予め辞書として各文字ごとに記憶させておいた特
徴量とそれぞれ比較し得られた類似度の最も大きい文字
を認識結果として出力する文字認識装置において、文字
ごとに比較される特徴量の１次元あたりのデータ数を指
定する手段と、文字ごとの特徴量を指定されたデータ数
と同じくして構成した辞書とを備えたことを特徴とす
る。In order to achieve the above object, a first aspect of the present invention is to extract a feature amount from an unknown character pattern and store the feature amount in advance as a dictionary for each character. In a character recognition device that outputs a character having the highest similarity obtained by comparison as a recognition result, a means for designating the number of data per dimension of the feature quantity to be compared for each character and a feature quantity for each character. It is characterized in that it is provided with a dictionary configured in the same manner as the number of generated data.

【０００６】第２の発明は、未知の文字パターンから特
徴量を抽出して予め辞書として各文字ごとに記憶させて
おいた特徴量とそれぞれ比較し得られた類似度の最も大
きい文字を認識結果として出力する文字認識装置におい
て、互いに品質レベルの異なる文字群ごとに特徴量を記
憶している複数の辞書を備えたことを特徴とする。第３
の発明は、第２の発明において、複数の辞書の中から使
用する辞書を指定する手段を備えたことを特徴とする。A second aspect of the invention is to recognize the character having the highest degree of similarity obtained by extracting a feature amount from an unknown character pattern and comparing it with a feature amount stored in advance as a dictionary for each character. In the character recognition device for outputting as, a plurality of dictionaries storing feature quantities for character groups having different quality levels are provided. Third
According to the second invention, in the second invention, there is provided means for designating a dictionary to be used from a plurality of dictionaries.

【０００７】第４の発明は、第２の発明において、最も
大きな類似度が得られた辞書の品質レベルに基づき、未
知文字の品質レベルを特定する手段と、連続して文字認
識がなされた文字列の文字ごとに特定された品質レベル
を比較し、他の多数文字と品質レベルが異なる少数文字
を他文字の品質レベルに修正する手段と、修正された文
字を新たな品質レベルに該当する辞書を用いて再度比較
処理を行う手段とを備えたことを特徴とする。A fourth aspect of the present invention is, in the second aspect, a means for specifying the quality level of an unknown character based on the quality level of the dictionary for which the greatest degree of similarity is obtained, and a character for which character recognition is continuously performed. A method of comparing the quality levels specified for each character in the column, correcting the minority characters whose quality level is different from other majority characters to the quality level of other characters, and a dictionary in which the corrected characters correspond to the new quality level. And means for performing the comparison process again using.

【０００８】[0008]

【作用】第１の発明においては、文字ごとに比較される
特徴量の１次元あたりのデータ数が指定されるととも
に、指定された１次元あたりデータ数と同一のデータ数
により辞書が構成されている。それにより、比較する特
徴量のデータ数の変更が可能になり、１次元あたりデー
タ数を多くした高精度の辞書の使用が可能になる。第２
の発明においては、互いに品質レベルの異なる文字群ご
とに特徴量を記憶している複数の辞書を備えたことによ
り、複数の辞書ごとに未知文字の特徴が比較されてその
中の最も大きい類似度が採用される。それにより、未知
文字の品質レベルが変化しても、常に高精度の認識が行
われる。According to the first aspect of the invention, the number of data per dimension of the feature amount to be compared for each character is designated, and the dictionary is constructed with the same number of data as the designated number of data per dimension. There is. As a result, it is possible to change the number of feature quantity data to be compared, and it is possible to use a highly accurate dictionary having a large number of data per dimension. Second
In the invention of claim 1, since a plurality of dictionaries that store the feature amount for each character group having different quality levels are provided, the feature of the unknown character is compared for each of the plurality of dictionaries, and the largest similarity among them is obtained. Is adopted. As a result, even if the quality level of the unknown character changes, high-accuracy recognition is always performed.

【０００９】第３の発明においては、備えられた複数の
辞書の中から使用する辞書を指定することが可能とな
る。それにより、認識する未知文字の品質に応じて最適
な辞書を指定して、認識率を低下させることなく処理時
間を短縮することができる。第４の発明においては、最
も大きな類似度が得られた辞書の品質レベルに基づき、
未知文字の品質レベルが特定される。次いで、連続して
文字認識がなされた文字列の文字ごとに特定された品質
レベルが比較され、他の多数文字と品質レベルが異なる
少数文字は他文字の品質レベルに修正されるとともに、
修正された文字は新たな品質レベルに該当する辞書によ
り再度比較処理が行われる。それにより、認識率がさら
に向上する。In the third invention, it is possible to specify the dictionary to be used from among the plural dictionaries provided. As a result, the optimum dictionary can be designated according to the quality of the unknown character to be recognized, and the processing time can be shortened without lowering the recognition rate. In the fourth invention, based on the quality level of the dictionary for which the largest similarity is obtained,
The quality level of the unknown character is identified. Then, the quality level specified for each character of the character string in which character recognition is continuously performed is compared, and the minority character having a quality level different from that of other majority characters is corrected to the quality level of other characters, and
The corrected characters are compared again with the dictionary corresponding to the new quality level. As a result, the recognition rate is further improved.

【００１０】[0010]

【実施例】以下、図に沿って本発明の実施例を説明す
る。図１は第１の発明に係る第１の実施例の構成を示す
ブロック図である。この実施例は、図６に示した従来例
に、次元数設定部１，一文字終了検知部２，アドレス発
生部３，アドレス変換部５，認識処理終了検知部８を増
設したものである。次元設定部１は、予め文字特徴量の
次元数が設定されており、その設定値を一文字終了検知
部２とアドレス発生部３へ送る。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a first embodiment according to the first invention. In this embodiment, a dimension number setting unit 1, a one-character end detection unit 2, an address generation unit 3, an address conversion unit 5, and a recognition processing end detection unit 8 are added to the conventional example shown in FIG. The dimension setting unit 1 sets the dimension number of the character feature amount in advance, and sends the set value to the one-character end detection unit 2 and the address generation unit 3.

【００１１】アドレスカウンタ４は、図２に示すように
順次カウントアップしてそのカウント値を一文字終了検
知部２とアドレス変換部５へ送る。アドレス変換部５は
入力されたカウント値に基づき、図２に示すように、未
知文字メモリ６のアドレスに変換して、未知文字メモリ
６へ送る。アドレス発生部３は、設定された次元数と、
一文字終了検知部２を介して入力されたカウント値に基
づき、図２に示すように、辞書メモリ７のアドレスを発
生して、辞書メモリ７および認識処理終了検知部８へ送
る。As shown in FIG. 2, the address counter 4 sequentially counts up and sends the count value to the one-character end detection unit 2 and the address conversion unit 5. Based on the input count value, the address conversion unit 5 converts the count value into the address of the unknown character memory 6 and sends it to the unknown character memory 6. The address generator 3 has a set number of dimensions,
As shown in FIG. 2, an address of the dictionary memory 7 is generated based on the count value input via the one-character end detection unit 2 and sent to the dictionary memory 7 and the recognition process end detection unit 8.

【００１２】未知文字メモリ６には未知な１文字分の画
像データから抽出した特徴量が予め書き込まれており、
入力されたアドレスに従い特徴量１次元あたり４個のデ
ータが順次読み出されて相関演算部９へ送られる。辞書
メモリ７には数千文字分の特徴を示すデータが書き込ま
れており、入力されたアドレスに従い特徴量１次元あた
り４個のデータが順次読み出されて相関演算部９へ送ら
れる。In the unknown character memory 6, the characteristic amount extracted from the unknown image data of one character is written in advance,
Four pieces of data per one-dimensional feature amount are sequentially read according to the input address and sent to the correlation calculation unit 9. Data indicating the characteristics of several thousand characters is written in the dictionary memory 7, and four pieces of data per one dimension of the characteristic amount are sequentially read according to the input address and sent to the correlation calculation unit 9.

【００１３】なお図２では、辞書メモリ７から読み出さ
れた４個のデータ「１−１」、「１−２」、「１−
３」、「１−４」が未知文字の１番目の特徴量である
「１」に対応している。さらに続いて読み出される辞書
メモリ７のデータ「２−１」〜「２−４」が、未知文字
メモリ６の２番目の特徴量であるデータ「２」にそれぞ
れ対応する。これは、例えば、「１１」という値を、
「３」，「３」，「３」，「２」の４個のデータの和と
して表すことができることを意味している。In FIG. 2, four pieces of data "1-1", "1-2", "1-" read from the dictionary memory 7 are used.
"3" and "1-4" correspond to "1" which is the first feature amount of the unknown character. Further, the data “2-1” to “2-4” of the dictionary memory 7 that are subsequently read correspond to the data “2” that is the second feature amount of the unknown character memory 6. For example, the value "11" is
This means that it can be represented as the sum of four data, "3", "3", "3", "2".

【００１４】相関演算部９は送られた特徴量のデータを
比較して相関演算を実行し、辞書の各文字ごとに類似度
を求め、ソーティング部１０へ送る。ソーティング部１
０は、辞書の各文字ごとに入力されてくる類似度につい
て大きい値から順番にソーティングを実行する。辞書メ
モリ７の全文字について類似度のソーティングが実行さ
れたら、その中の最も大きな類似度を示した文字を認識
結果として出力する。The correlation calculation unit 9 compares the sent characteristic amount data and executes the correlation calculation, obtains the similarity for each character of the dictionary, and sends it to the sorting unit 10. Sorting section 1
With 0, sorting is executed in order from the largest similarity value input for each character in the dictionary. When the similarity is sorted for all the characters in the dictionary memory 7, the character having the highest similarity among them is output as the recognition result.

【００１５】なお、図示例では、特徴量１次元あたりの
構成データ数を４個としているが、次元数設定部１で１
〜Ｎ個の任意の個数に設定することができる。このよう
に、特徴量１次元あたりのデータ個数を可変にしたこと
により、辞書メモリ７の特徴量１次元あたりのデータ個
数も可変にすることが可能となって、辞書の有効桁数を
従来よりもアップすることが可能となる。すなわち、１
次元あたりデータ数を多くした高精度の辞書の使用が可
能となり、その分、認識精度が向上する。In the illustrated example, the number of constituent data per one dimension of the feature amount is 4, but the dimension number setting unit 1
It can be set to any number from N to N. As described above, by changing the number of data items per one dimension of the feature amount, it is possible to change the number of data items per one dimension of the feature amount in the dictionary memory 7 as well. It is also possible to upload. Ie 1
It is possible to use a highly accurate dictionary with a large number of data per dimension, and the recognition accuracy is improved accordingly.

【００１６】次に、第２、第３、第４の発明に係る第２
の実施例について説明する。図３は第２の実施例の構成
を示すブロック図である。図において、１１，１２はと
もに未知文字メモリであり、同一内容をした未知文字パ
ターンの特徴量が格納されている。１３は相関演算・ソ
ーティング部であり、未知文字メモリ１１から読み出し
た特徴量と通常品質文字用辞書メモリ１４から読み出し
た特徴量とを比較して相関演算を実行し辞書メモリ１４
の各文字ごとに類似度を求めるとともに、辞書の各文字
ごとに求められた類似度を大きい値から順番にソーティ
ングする。Next, the second according to the second, third and fourth inventions
An example will be described. FIG. 3 is a block diagram showing the configuration of the second embodiment. In the figure, 11 and 12 are unknown character memories, in which characteristic quantities of unknown character patterns having the same contents are stored. Reference numeral 13 denotes a correlation calculation / sorting unit, which compares the feature amount read from the unknown character memory 11 with the feature amount read from the normal quality character dictionary memory 14 to execute the correlation operation to execute the correlation operation.
The degree of similarity is obtained for each character of, and the degree of similarity obtained for each character of the dictionary is sorted in descending order.

【００１７】同じく、１５は相関演算・ソーティング部
であり、未知文字メモリ１２から読み出した特徴量と低
品質文字用辞書メモリ１６から読み出した特徴量とを比
較して相関演算を実行し辞書メモリ１６の各文字ごとに
類似度を求めるとともに、辞書の各文字ごとに求められ
た類似度を大きい値から順番にソーティングする。１７
はプロセッサであり、相関演算・ソーティング部１３，
１５でそれぞれ求められた最も類似度の大きい文字に基
づき、最終的に未知文字の認識を行う。Similarly, 15 is a correlation calculation / sorting unit, which compares the characteristic amount read from the unknown character memory 12 with the characteristic amount read from the low quality character dictionary memory 16 to execute the correlation operation and execute the dictionary operation 16 The degree of similarity is obtained for each character of, and the degree of similarity obtained for each character of the dictionary is sorted in descending order. 17
Is a processor, the correlation calculation / sorting unit 13,
Finally, the unknown character is recognized based on the character having the highest degree of similarity obtained in step 15.

【００１８】図４は、第２の実施例の動作を示すフロー
チャートであり、以下、このフローチャートに基づいて
動作を説明する。先ず、文字読取りが開始されると、文
字品質の設定の有無を判別する（Ｓ１）。設定されてい
る場合は、その設定内容が低品質か否かを判別する（Ｓ
２）。低品質ではなく通常品質であれば、通常品質文字
用辞書を選択する（Ｓ３）。次いで、選択された通常品
質文字用辞書を用いて、未知文字に対する認識処理を
し、認識結果を出力する（Ｓ４，５）。FIG. 4 is a flowchart showing the operation of the second embodiment, and the operation will be described below based on this flowchart. First, when the character reading is started, it is determined whether or not the character quality is set (S1). If it is set, it is determined whether or not the setting content is of low quality (S
2). If the quality is not low quality but normal quality, the normal quality character dictionary is selected (S3). Next, using the selected normal quality character dictionary, recognition processing is performed on the unknown character and the recognition result is output (S4, 5).

【００１９】Ｓ２において、低品質である場合は、低品
質文字用辞書を選択し（Ｓ６）、低品質文字用辞書を用
いて、認識処理をして認識結果を出力する（Ｓ７，
８）。Ｓ１において、文字品質の設定がない場合は、通
常品質文字用辞書および低品質文字用辞書の並行実行を
選択する（Ｓ９）。次いで、各辞書ごとに認識処理を実
行し、それぞれに得られた類似度を比較する（Ｓ１０，
１１）。得られた類似度のうち高い方の類似度を正解と
して、いったん、その類似度の文字を対象未知文字の読
みとする（Ｓ１２）。When the quality is low in S2, a dictionary for low quality characters is selected (S6), the recognition process is performed using the dictionary for low quality characters, and the recognition result is output (S7,
8). If no character quality is set in S1, parallel execution of the normal quality character dictionary and the low quality character dictionary is selected (S9). Next, the recognition process is executed for each dictionary, and the similarities obtained are compared (S10,
11). Of the obtained similarities, the higher similarity is set as the correct answer, and the character having the similarity is once set as the reading of the target unknown character (S12).

【００２０】ここで正解とされた未知文字が属する文字
列における他の文字が低品質文字用辞書により認識され
たものであるか否かを判別する（Ｓ１３）。低品質では
なく、通常品質である場合は、Ｓ１２で正解とした認識
結果が通常品質であるか否かを判別する（Ｓ１４）。通
常品質ではなく低品質である場合は、Ｓ１２で正解とし
た認識結果を、通常品質文字用辞書で得られた類似度の
文字に切り換えて認識結果とする（Ｓ１５）。通常品質
である場合は、Ｓ１２で正解とした認識結果をそのまま
出力する（Ｓ１６）。Here, it is determined whether or not another character in the character string to which the unknown character which has been regarded as the correct answer belongs is recognized by the low quality character dictionary (S13). If it is not low quality but normal quality, it is determined whether or not the recognition result as the correct answer in S12 is normal quality (S14). When the quality is not the normal quality but the low quality, the recognition result determined as the correct answer in S12 is switched to the character having the similarity obtained in the normal quality character dictionary to be the recognition result (S15). If the quality is normal, the recognition result that is correct in S12 is output as it is (S16).

【００２１】Ｓ１３において、低品質文字用辞書により
認識されたものである場合は、Ｓ１２で正解とした認識
結果が低品質であるか否かを判別する（Ｓ１７）。低品
質ではなく通常品質である場合は、Ｓ１２で正解とした
認識結果を、低品質文字用辞書で得られた類似度の文字
に切り換えて認識結果とする（Ｓ１８）。低品質である
場合は、Ｓ１２で正解とした認識結果をそのまま出力す
る（Ｓ１９）。なお、上述した各処理は、図３のプロセ
ッサ１７において実行される。In S13, if it is recognized by the low quality character dictionary, it is determined whether or not the recognition result which is the correct answer in S12 is of low quality (S17). When the quality is not the low quality but the normal quality, the recognition result which is the correct answer in S12 is switched to the recognition result by switching to the character having the similarity degree obtained in the low quality character dictionary (S18). When the quality is low, the recognition result which is correct in S12 is output as it is (S19). The above-mentioned processes are executed by the processor 17 shown in FIG.

【００２２】図５は、図４における認識処理を示すフロ
ーチャートであり、以下、このフローチャートに基づい
て認識処理を説明する。認識処理が開始されると、先
ず、辞書メモリ用のアドレスを初期化するとともに、未
知文字メモリ用のアドレスを初期化する（Ｓ２１，２
２）。次いで、次元数Ｍが設定される（Ｓ２３）。次に
指定されるアドレスに基づき、辞書メモリから特徴量の
データを読み出すとともに、未知文字メモリから特徴量
のデータを読み出す（Ｓ２４，２５）。FIG. 5 is a flow chart showing the recognition processing in FIG. 4, and the recognition processing will be described below based on this flow chart. When the recognition process is started, first, the address for the dictionary memory is initialized and the address for the unknown character memory is initialized (S21, S2).
2). Next, the number of dimensions M is set (S23). Next, based on the designated address, the feature amount data is read from the dictionary memory and the feature amount data is read from the unknown character memory (S24, 25).

【００２３】ここで読み出されたデータを互いに比較し
て、相関演算を行う（Ｓ２６）。さらにアドレスを更新
して、設定されたＭ次元が終了するまで次の次元のデー
タを読み出して相関演算を繰り返す（Ｓ２７，２８）。
こうして、辞書の１文字について相関演算により得られ
た類似度を順次ストアするとともに、類似度をソーティ
ングする（Ｓ２９，３０）。これらの類似度のストアお
よびソーティングを辞書の全文字について終了するまで
繰り返すことにより、最も類似度の大きい文字が認識結
果となる（Ｓ３１）。The data read here are compared with each other to perform a correlation operation (S26). Further, the address is updated, the data of the next dimension is read out and the correlation calculation is repeated until the set M dimension is completed (S27, 28).
In this way, the similarities obtained by the correlation calculation for one character in the dictionary are sequentially stored, and the similarities are sorted (S29, 30). By repeating the storage and sorting of these similarities for all the characters in the dictionary, the character with the highest similarity becomes the recognition result (S31).

【００２４】第２の実施例では上述したように、文字品
質の異なる２種類の辞書を備えたことにより未知文字の
品質が変化してもそれに対応した辞書を用いることが可
能となり、認識精度が向上する。また、文字列を認識す
る際は、構成する文字ごとに品質に応じた辞書を選択し
ながら認識がなされ、得られた認識結果のなかで、文字
列の他の文字の品質と異なる品質の辞書により認識され
た文字があった場合は、その文字についての認識結果は
誤りの可能性があるので、他の文字と同じ品質の辞書に
より再度認識が行われる。それにより、さらに認識精度
が向上する。In the second embodiment, as described above, by providing the two types of dictionaries having different character qualities, it is possible to use the dictionary corresponding to the unknown character quality even if the quality thereof changes, and the recognition accuracy is improved. improves. When recognizing a character string, recognition is performed while selecting a dictionary according to the quality of each of the constituent characters, and in the obtained recognition result, a dictionary with a quality different from the quality of other characters in the character string is obtained. If there is a character recognized by, the recognition result for that character may be erroneous, and therefore the recognition is performed again using the dictionary having the same quality as the other characters. Thereby, the recognition accuracy is further improved.

【００２５】なお、第２の実施例では２種類の辞書を用
いたが３種類以上の辞書を同様にして使用することも可
能である。また、第１の実施例および第２の実施例は互
いを組み合わせて実現することも、またそれぞれ単独で
文字認識装置に適用することも可能である。Although the two types of dictionaries are used in the second embodiment, three or more types of dictionaries can be used in the same manner. Further, the first and second embodiments can be realized by combining them with each other, or can be independently applied to the character recognition device.

【００２６】[0026]

【発明の効果】以上述べたように第１の発明によれば、
文字ごとに比較される特徴量の１次元あたりのデータ数
の変更を可能にしたことにより、１次元あたりデータ数
を多くした高精度の辞書の使用が可能となり認識精度が
向上する。第２の発明によれば、互いに品質レベルの異
なる文字群ごとに特徴量を記憶している複数の辞書を備
え、各辞書ごとに未知文字の特徴を比較することによ
り、未知文字の品質レベルが変化しても、常に高精度の
認識が行われる。As described above, according to the first invention,
By making it possible to change the number of data for one dimension of the feature amount to be compared for each character, it is possible to use a highly accurate dictionary having a large number of data for one dimension and improve the recognition accuracy. According to the second aspect of the present invention, a plurality of dictionaries storing feature quantities for character groups having different quality levels are provided, and by comparing the characteristics of unknown characters for each dictionary, the quality level of unknown characters can be improved. Even if it changes, high-precision recognition is always performed.

【００２７】第３の発明によれば、備えられた複数の辞
書の中から使用する辞書を指定することにより、認識す
る未知文字の品質に応じて最適な辞書を指定して、認識
率を低下させることなく処理時間を短縮することができ
る。第４の発明によれば、連続して文字認識がなされた
文字列の中に異なる品質レベルの文字がある場合は、そ
の文字の品質レベルを他と同じくなるように修正してか
ら該当する辞書により再度比較処理を行うようにして、
認識率をさらに向上することができる。According to the third aspect of the present invention, the dictionary to be used is specified from among the plural dictionaries provided, so that the optimum dictionary is specified according to the quality of the unknown character to be recognized and the recognition rate is lowered. It is possible to shorten the processing time without causing it. According to the fourth invention, when there are characters of different quality levels in a character string in which character recognition is continuously performed, the quality level of the character is corrected to be the same as the others, and then the corresponding dictionary By doing the comparison process again,
The recognition rate can be further improved.

[Brief description of drawings]

【図１】第１の発明に係る第１の実施例の構成を示すブ
ロック図である。FIG. 1 is a block diagram showing a configuration of a first embodiment according to the first invention.

【図２】図１のメモリからの読み出されるデータ内容を
示す説明図である。2 is an explanatory diagram showing data contents read from the memory of FIG. 1. FIG.

【図３】第２乃至第４の発明に係る第２の実施例の構成
を示すブロック図である。FIG. 3 is a block diagram showing a configuration of a second embodiment according to the second to fourth inventions.

【図４】第２の実施例の動作を示すフローチャートであ
る。FIG. 4 is a flowchart showing the operation of the second embodiment.

【図５】図４における認識処理を詳細に示したフローチ
ャートである。FIG. 5 is a flowchart showing in detail the recognition processing in FIG.

【図６】従来例の構成を示すブロック図である。FIG. 6 is a block diagram showing a configuration of a conventional example.

[Explanation of symbols]

１次元数設定部２一文字終了検知部３アドレス発生部４アドレスカウンタ５アドレス変換部６未知文字メモリ７辞書メモリ８認識処理終了検知部９相関演算部１０ソーティング部１１，１２未知文字メモリ１３相関演算・ソーティング部１４通常品質文字用辞書メモリ１５相関演算・ソーティング部１６低品質文字用辞書メモリ１７プロセッサ 1 dimension number setting unit 2 one-character end detection unit 3 address generation unit 4 address counter 5 address conversion unit 6 unknown character memory 7 dictionary memory 8 recognition processing end detection unit 9 correlation calculation unit 10 sorting unit 11, 12 unknown character memory 13 correlation calculation • Sorting unit 14 Dictionary memory for normal quality characters 15 Correlation operation / Sorting unit 16 Dictionary memory for low quality characters 17 Processor

Claims

[Claims]

1. A character whose characteristic value is extracted from an unknown character pattern and which is compared with a characteristic value stored in advance as a dictionary for each character and which has the highest degree of similarity, is output as a recognition result. The recognition device is provided with a means for designating the number of data of the feature amount per one dimension to be compared for each character, and a dictionary configured so that the feature amount of each character is the same as the designated data amount. Characterized character recognition device.

2. A character whose characteristic value is extracted from an unknown character pattern and is compared with a characteristic value stored in advance as a dictionary for each character and which has the highest degree of similarity and is output as a recognition result. A character recognition device comprising a plurality of dictionaries that store feature quantities for character groups having different quality levels.

3. The character recognition device according to claim 2,
A character recognition device comprising means for designating a dictionary to be used from a plurality of dictionaries.

4. The character recognition device according to claim 2, wherein means for specifying the quality level of the unknown character based on the quality level of the dictionary for which the largest similarity is obtained, and characters for which character recognition is continuously performed. A means to compare the quality levels specified for each character in the column, to correct minority characters whose quality level is different from other majority characters to the quality level of other characters, and a dictionary of corrected characters to the new quality level. A character recognition device comprising: means for performing a comparison process again using.