JPH0620098A

JPH0620098A - Character recognition device

Info

Publication number: JPH0620098A
Application number: JP4173165A
Authority: JP
Inventors: Junji Mitani; 純司三谷; Yoshihiro Kitamura; 義弘北村
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1992-06-30
Filing date: 1992-06-30
Publication date: 1994-01-28
Anticipated expiration: 2014-08-23
Also published as: JP2938276B2

Abstract

PURPOSE:To quickly perform the character recognition processing by collating characters with each standard character picture classified in groups each of which has the similarity higher than a preliminarily determined threshold. CONSTITUTION:A dictionary flag 16 as an 8X8-bit binary mesh flag, a dictionary 17 for rough classification as a file of standard patterns representing respective categories classified by rough classification, and a threshold 18 for each category which is used for detailed discrimination only in the case of the simple similarity of the category higher than the simple similarity obtained as the result of collation between the feature pattern of an inputted character picture and the dictionary 17 for rough classification are stored in a rough classification dictionary part 14. Groups of standard flag data matched to flag data are obtained, and collation with the standard character pictures of each group is performed, and collation with each of standard character pictures classified to groups whose similarities exceed the preliminarily determined threshold 18 is performed, and standard character pictures are outputted in the order of similarity as recognition results.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字を含む原稿を画像
として読取り、読取った文字画像を文字として認識する
文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition apparatus for reading a document containing characters as an image and recognizing the read character image as a character.

【０００２】[0002]

【従来の技術】従来、漢字を含む文字を対象とする文字
認識において、様々な認識方法が提案されているが、漢
字は１字１字のパターンが複雑で、字種も当用漢字だけ
でも１９４５字あり、さらに印刷活字でも明朝体やゴシ
ック体などがあり、手書き文字まで含めると、機械で認
識させることが非常に難しい。このため、個々の装置に
おいて、マッチング法や構造解析などの種々の方法の組
合せによって認識精度向上および処理の高速化のための
工夫が試みられている。2. Description of the Related Art Conventionally, various recognition methods have been proposed for character recognition of characters including Chinese characters, but each Chinese character has a complicated pattern of one character, and even if the Chinese character type is only applicable Chinese characters. There are 1945 characters, and even in print type there are Mincho typeface and Gothic typeface. If handwritten characters are included, it is very difficult for the machine to recognize them. Therefore, attempts have been made to improve recognition accuracy and speed up processing by combining various methods such as a matching method and a structural analysis in each device.

【０００３】また前記のように、漢字は複雑な上に字種
が多いので、１度に１つの文字を認識してしまうより
も、特徴によって大分類をして、似通った文字のグルー
プ（カテゴリ）に分けて、次にそのグループの中を細か
く識別して、最後に１つの文字として認識するという方
法が採られている。As described above, since Kanji is complicated and has many character types, rather than recognizing one character at a time, it is roughly classified according to the characteristics and a group of similar characters (category). ), Then finely identify the group, and finally recognize it as one character.

【０００４】[0004]

【発明が解決しようとする課題】図６は、従来行われて
いる一般的な大分類の動作を示すフローチャートであ
る。ステップｋ１で切出された１つの文字画像を認識す
る場合、ステップｋ２で特徴抽出を行い、ステップｋ３
〜ｋ５で、大分類辞書に含まれる全てのカテゴリ毎に、
各カテゴリを代表する標準パターンと照合し、その標準
パターンと入力された文字画像とがどの程度類似してい
るのかを数値で表すための単純類似度計算を行い、ステ
ップｋ６で、この結果得られた数値に基づいて、類似度
順に全カテゴリを並べ換え、ステップｋ７で、上位第１
位〜ｎ位までのカテゴリを大分類の分類結果とする。FIG. 6 is a flow chart showing a general operation of general classification which has been conventionally performed. When recognizing one character image cut out in step k1, feature extraction is performed in step k2, and step k3 is performed.
~ K5, for each category included in the large dictionary,
A simple similarity calculation is performed to compare the standard pattern representing each category with a numerical value to show how similar the standard pattern and the input character image are, and the result is obtained in step k6. All categories are rearranged in the order of similarity based on the numerical value of
The categories from the nth rank to the nth rank are set as the classification results of the major classification.

【０００５】ステップｋ８〜ｋ１０では、ステップｋ７
で出力されたカテゴリに含まれる文字認識辞書内の全て
の標準パターンと詳細な照合を行い、類似度を計算し、
ステップｋ１１でこの類似度に基づいて文字コードを並
べ換え、ステップｋ１２で、そのうち上位１位〜ｍ位の
文字コードを認識結果として出力する。In steps k8 to k10, step k7
Performs detailed matching with all standard patterns in the character recognition dictionary included in the category output in, calculates the degree of similarity,
In step k11, the character codes are rearranged based on this degree of similarity, and in step k12, the first to mth highest character codes are output as the recognition result.

【０００６】このような場合、大分類においても単純類
似度計算および計算結果の並べ換えを行うので、計算量
が多く、認識処理に比較的長時間を要する。In such a case, since the simple similarity calculation and the rearrangement of the calculation results are performed even in the large classification, the calculation amount is large and the recognition processing requires a relatively long time.

【０００７】本発明の目的は、認識処理における計算量
を減少し、高速な文字認識処理を行える文字認識装置を
提供することである。An object of the present invention is to provide a character recognition device which reduces the amount of calculation in the recognition process and can perform high-speed character recognition process.

【０００８】[0008]

【課題を解決するための手段】本発明は、文字画像を入
力する入力手段と、複数の規準文字画像が互いに類似す
るグループに分類され、各グループ毎に特徴を示す標準
フラグデータおよび標準文字画像が記憶される文字辞書
と、入力文字画像の特徴を示すフラグデータを求め、こ
のフラグデータと適合する標準フラグデータのグループ
を求め、各グループの標準文字画像との照合を行い、類
似度が予め定めるしきい値以上のグループに分類されて
いる各規準文字画像との照合を行い、類似度順に認識結
果として出力する認識手段とを含むことを特徴とする文
字認識装置である。According to the present invention, an input means for inputting a character image and a plurality of standard character images are classified into groups similar to each other, and standard flag data and a standard character image showing a characteristic for each group. The character dictionary in which is stored and the flag data indicating the characteristics of the input character image are obtained, the group of the standard flag data that matches this flag data is obtained, the standard character image of each group is collated, and the similarity is calculated in advance. A character recognition device characterized by including a recognition means for collating with each standard character image classified into a group equal to or larger than a predetermined threshold value and outputting as a recognition result in order of similarity.

【０００９】[0009]

【作用】本発明に従えば、文字認識装置は、複数の規準
文字画像が互いに類似するグループに分類され、各グル
ープ毎に特徴を示す標準フラグデータおよび標準文字画
像が記憶される文字辞書を有し、認識手段において入力
手段によって入力された文字画像の特徴を示すフラグデ
ータを求め、このフラグデータと適合する標準フラグデ
ータのグループを求め、各グループの標準文字画像との
照合を行い、類似度が予め定めるしきい値以上のグルー
プに分類されている各規準文字画像との照合を行い、類
似度順に認識結果として出力する。According to the present invention, the character recognition apparatus has a character dictionary in which a plurality of standard character images are classified into groups similar to each other, and standard flag data and a standard character image showing a characteristic are stored for each group. Then, the flag data indicating the characteristics of the character image input by the input means is obtained in the recognition means, a group of standard flag data that matches this flag data is obtained, and the standard character image of each group is collated to determine the similarity. Collates with each standard character image classified into a group equal to or larger than a predetermined threshold value, and outputs the recognition result in the order of similarity.

【００１０】[0010]

【実施例】図１は、本発明の一実施例の全体の構成を示
すブロック図である。ＣＰＵ（中央処理装置）２は、本
装置全体を制御する演算処理用の中央処理部である。Ｒ
ＯＭ（リードオンリメモリ）３は、ＣＰＵ２の処理プロ
グラムなどを格納するメモリである。ＲＡＭ（ランダム
アクセスメモリ）４は、ＣＰＵ２の作業バッファ２２を
提供するとともに、後述する入力部５からの入力画像の
座標データを記憶する画像メモリ１９、特徴抽出部７か
らの特徴データメモリ２１、特徴抽出部７において、特
徴データから作成される大分類フラグメモリ２０などを
含む。1 is a block diagram showing the overall construction of an embodiment of the present invention. A CPU (central processing unit) 2 is a central processing unit for arithmetic processing that controls the entire apparatus. R
The OM (Read Only Memory) 3 is a memory that stores processing programs of the CPU 2. A RAM (random access memory) 4 provides a work buffer 22 for the CPU 2, an image memory 19 that stores coordinate data of an input image from an input unit 5 described later, a feature data memory 21 from the feature extraction unit 7, and a feature. The extraction unit 7 includes a large classification flag memory 20 created from the characteristic data.

【００１１】入力部５は、イメージスキャナなどの文字
画像入力装置であり、画像メモリ１９に、これらの文字
画像を出力する。切出部６は、画像メモリに蓄えられた
画像から、１文字分の文字画像を切出す。前述のＣＰＵ
２が、これを行ってもよい。The input unit 5 is a character image input device such as an image scanner, and outputs these character images to the image memory 19. The cutout unit 6 cuts out a character image for one character from the image stored in the image memory. CPU mentioned above
2 may do this.

【００１２】特徴抽出部７は、予め定められた方法で、
切出された文字画像の特徴量を抽出し、後述の認識用特
徴パターンを作成して、これをもとに大分類フラグを作
成し、ＲＡＭ４の大分類フラグメモリ２０、特徴データ
メモリ２１にそれぞれ格納する。The feature extraction unit 7 uses a predetermined method to
The feature amount of the cut-out character image is extracted, a recognition feature pattern to be described later is created, a large classification flag is created based on this, and the large classification flag memory 20 and the characteristic data memory 21 of the RAM 4 are respectively generated. Store.

【００１３】認識部８は、大分類演算部１２と、詳細識
別部１３と、大分類辞書部１４と、認識用辞書部１５と
から成る。大分類辞書部１４は、後述の８×８ビットの
２値メッシュフラグである辞書フラグ１６と、大分類に
よって分類される各カテゴリを代表する標準パターンの
ファイルである大分類用辞書１７、入力された文字画像
の特徴パターンと、大分類用辞書１７との照合の結果と
して得られる単純類似度と比較して、カテゴリの単純類
似度がこの値を超えた場合のみ、後述の詳細識別を行う
ところの、各カテゴリ毎のしきい値１８とを記憶するメ
モリである。The recognition section 8 comprises a large classification calculation section 12, a detailed identification section 13, a large classification dictionary section 14, and a recognition dictionary section 15. The large classification dictionary unit 14 receives a dictionary flag 16 that is a binary mesh flag of 8 × 8 bits, which will be described later, and a large classification dictionary 17 that is a standard pattern file representing each category classified according to the large classification. Compared with the simple similarity obtained as a result of collating the character image characteristic pattern with the large classification dictionary 17, detailed identification to be described later is performed only when the simple similarity of the category exceeds this value. Is a memory for storing the threshold value 18 for each category.

【００１４】大分類演算部１２は、大分類辞書部１４に
記憶された標準データをもとに大分類を行う。認識用辞
書部１５は、各１文字に対応して作成された標準パター
ンを記憶するメモリであり、詳細識別部１３は、大分類
によって選び出されたカテゴリ内の全文字について、１
文字単位の標準パターンと詳細な照合を行う。インタフ
ェイス９は、プリンタや磁気ディスクなどの外部出力１
０に、文字認識装置１からのデータを出力するための接
続部である。表示部１１は、液晶表示装置などの表示装
置であり、文字認識装置１の認識結果などを表示する。The large classification calculation unit 12 performs large classification based on the standard data stored in the large classification dictionary unit 14. The recognition dictionary unit 15 is a memory that stores a standard pattern created corresponding to each character, and the detailed identification unit 13 sets 1 for all characters in the category selected by the major classification.
Performs detailed matching with the standard pattern for each character. The interface 9 is an external output 1 such as a printer or a magnetic disk.
0 is a connection unit for outputting data from the character recognition device 1. The display unit 11 is a display device such as a liquid crystal display device, and displays the recognition result of the character recognition device 1 and the like.

【００１５】図２は、抽出された文字画像の特徴パター
ンのうち、８×８のメッシュ特徴および８×８ビットの
２値メッシュフラグである大分類フラグを説明する図で
ある。FIG. 2 is a diagram for explaining the 8 × 8 mesh feature and the major classification flag which is an 8 × 8 bit binary mesh flag in the extracted feature pattern of the character image.

【００１６】図２（１）は、８×８のメッシュ特徴を説
明する図である。８×８のメッシュ特徴とは、１文字画
像を縦横にそれぞれ８等分して６４メッシュに区切り、
それぞれのメッシュの中で画像の特徴量を抽出し、８×
８次元ベクトルあるいはマトリクスで表したものであ
る。FIG. 2A is a diagram for explaining the 8 × 8 mesh feature. The 8 × 8 mesh feature means that one character image is divided into 8 meshes vertically and horizontally, and divided into 64 meshes.
Extract the feature quantity of the image in each mesh and
It is represented by an eight-dimensional vector or matrix.

【００１７】図２（２）は、図２（１）に示されるメッ
シュ特徴から作成される２値メッシュフラグである。図
２（１）のメッシュ特徴における行が、それぞれ２値メ
ッシュフラグの各８ビットに対応しており、メッシュ特
徴の各行で特徴量が最大になる位置に、対応するメッシ
ュフラグのビットを１に、その他のビットを０にする。
たとえば、ある行の特徴として（０，１１，２５，３
６，３３，２０，８，７）が得られた場合、その行に対
するフラグには、（０００１００００）を設定する。FIG. 2 (2) shows a binary mesh flag created from the mesh features shown in FIG. 2 (1). The row in the mesh feature of FIG. 2 (1) corresponds to each 8 bits of the binary mesh flag, and the bit of the corresponding mesh flag is set to 1 at the position where the feature amount is maximum in each row of the mesh feature. , Other bits are set to 0.
For example, as a feature of a row (0, 11, 25, 3
6, 33, 20, 8, 7) is obtained, (00010000) is set in the flag for that row.

【００１８】図３は、８×８のメッシュ特徴量抽出の一
例を示す図である。まず、切出された文字画像を８×８
のメッシュに分割する。図３（１）に示すように、文字
画像を縦横に８等分する。特徴抽出部７は、たとえば図
３（２）に示すように、６４等分された各メッシュにつ
いて濃度分布を求める。濃度分布特徴量は、一般に、メ
ッシュ中の黒点の数を求め、さらに縦横のバランスを図
るため、メッシュを構成する全画素数で割ったものを定
数倍して整数化し用いる。FIG. 3 is a diagram showing an example of 8 × 8 mesh feature amount extraction. First, cut out the character image 8 × 8
Of mesh. As shown in FIG. 3A, the character image is vertically and horizontally divided into eight equal parts. For example, as shown in FIG. 3B, the feature extraction unit 7 obtains a density distribution for each mesh divided into 64 equal parts. In general, the density distribution feature amount is obtained by dividing the number of black dots in the mesh by the total number of pixels forming the mesh and multiplying it by a constant to obtain an integer, in order to obtain a vertical and horizontal balance.

【００１９】図３（１）の左下のメッシュに着目する
と、このメッシュ内の焦点の数は４５であり、横方向の
画素数は８（ドットの数）である。図３（１）において
は、メッシュ特徴量を簡略化して説明するため、黒点の
数をメッシュの横方向の画素数で割っている。ここで、
濃度分布特徴量は約５である。このようにして、メッシ
ュ毎の特徴量を抽出し、図３（３）のメッシュ特徴を得
る。Focusing on the lower left mesh in FIG. 3A, the number of focal points in this mesh is 45, and the number of pixels in the horizontal direction is 8 (the number of dots). In FIG. 3A, in order to simplify the description of the mesh feature amount, the number of black dots is divided by the number of pixels in the horizontal direction of the mesh. here,
The density distribution feature amount is about 5. In this way, the feature amount for each mesh is extracted, and the mesh feature of FIG. 3C is obtained.

【００２０】図４は、８×８ビットの２値メッシュフラ
グである大分類フラグと辞書フラグとを説明する図であ
る。FIG. 4 is a diagram for explaining the large classification flag and the dictionary flag, which are 8 × 8 bit binary mesh flags.

【００２１】図４（１）は、図３（３）のメッシュ特徴
から作成された大分類フラグである。図３（３）に示す
メッシュ特徴の各行の特徴量が最大である位置に対応す
る大分類フラグのビット、たとえば、メッシュ特徴の１
行４列目、２行４，５列目、３行５列目、…、８行２，
７列目に対応して、大分類フラグの１行４列目、２行
４，５列目、３行５列目、…、８行２，７列目の各ビッ
トが１に、他のビットは０に設定されている。FIG. 4 (1) is a large classification flag created from the mesh features of FIG. 3 (3). The bit of the major classification flag corresponding to the position where the feature amount of each line of the mesh feature shown in FIG. 3C is the maximum, for example, 1 of the mesh feature.
Row 4th column, 2nd row 4, 5th column, 3rd row 5th column, ..., 8th row 2,
Corresponding to the 7th column, each bit in the 1st row, 4th column, 2nd row 4, 5th column, 3rd row 5th column, ... Bit is set to 0.

【００２２】図４（２）は、カテゴリ毎に予め設定され
る辞書フラグ１６の一例である。辞書フラグ１６は、カ
テゴリ毎に各行の特徴量が最大になる可能性のある位置
に対応するビットが１に、可能性のないビットが０に設
定される。FIG. 4B shows an example of the dictionary flag 16 preset for each category. In the dictionary flag 16, the bit corresponding to the position where the feature amount of each line may be maximized for each category is set to 1 and the bit which is not possible is set to 0.

【００２３】大分類フラグと辞書フラグとの照合の際
に、大分類フラグにおいて１であり、辞書フラグにおい
てそれに対応するビットが０となるようなビットがあれ
ば、そのカテゴリについては以後の認識処理を行わな
い。When the large classification flag and the dictionary flag are matched with each other, if there is a bit in the large classification flag that is 1 and the corresponding bit in the dictionary flag is 0, the subsequent recognition processing is performed for that category. Do not do.

【００２４】図５は、本実施例の文字認識装置１が１文
字の認識を行う際の動作を示すフローチャートである。FIG. 5 is a flow chart showing the operation when the character recognition device 1 of this embodiment recognizes one character.

【００２５】ステップｍ１で切出部６は、画像メモリ１
９に記憶されている画像から、１文字分の文字画像を切
出し、特徴抽出部７に出力する。ステップｍ２で、特徴
抽出部７は、予め定められた方法で予め定められた数の
８×８メッシュ特徴を作成し、それぞれ特徴データメモ
リ２１に出力する。In step m1, the cut-out section 6 makes the image memory 1
A character image for one character is cut out from the image stored in 9, and is output to the feature extraction unit 7. In step m2, the feature extraction unit 7 creates a predetermined number of 8 × 8 mesh features by a predetermined method and outputs them to the feature data memory 21.

【００２６】ステップｍ３では、特徴抽出部７におい
て、前記８×８メッシュ特徴に基づいて８×８ビット２
値メッシュフラグである大分類フラグが作成され、大分
類フラグメモリ２０に記憶される。ステップｍ４では、
認識部８において、大分類辞書部１４内から、各カテゴ
リの辞書フラグ１６、大分類用辞書１７、しきい値１８
が大分類演算部１２に読出される。ステップｍ５におい
て、大分類辞書部１４内に照合すべきデータの有無を判
断し、データがあれば次のステップｍ６に移る。In step m3, the feature extraction unit 7 uses 8 × 8 bit 2 based on the 8 × 8 mesh feature.
A large classification flag that is a value mesh flag is created and stored in the large classification flag memory 20. In step m4,
In the recognition unit 8, the dictionary flag 16 of each category, the large classification dictionary 17, and the threshold value 18 from the large classification dictionary unit 14 are included.
Is read by the large classification calculation unit 12. In step m5, it is judged whether or not there is data to be collated in the large classification dictionary unit 14, and if there is data, the process proceeds to the next step m6.

【００２７】ステップｍ６では、大分類フラグと辞書フ
ラグとの照合が行われる。この際、大分類フラグにおい
て１で、辞書フラグの対応するビットが０となるような
ビットがあれば、そのカテゴリについては以後の処理を
行わず、ステップｍ４に戻り、次のカテゴリの大分類処
理を行う。大分類フラグと辞書フラグとの照合において
適合したカテゴリは、次のステップｍ７の処理を受け
る。In step m6, the large classification flag and the dictionary flag are collated. At this time, if there is a bit in the large classification flag that is 1 and the corresponding bit in the dictionary flag is 0, the subsequent processing is not performed for that category, the process returns to step m4, and the large classification processing of the next category is performed. I do. The category that matches the collation of the large classification flag and the dictionary flag is subjected to the processing of the next step m7.

【００２８】ステップｍ７において、大分類演算部１２
は、８×８メッシュ特徴と大分類用辞書７との照合を行
い、単純類似度を計算する。この際、単純類似度は、大
分類用辞書特徴ベクトルと、８×８メッシュ特徴ベクト
ルとの内積とする。In step m7, the large classification calculation unit 12
Compares the 8 × 8 mesh feature with the large classification dictionary 7 to calculate the simple similarity. At this time, the simple similarity is the inner product of the large classification dictionary feature vector and the 8 × 8 mesh feature vector.

【００２９】次のステップｍ８において、ステップｍ７
で計算された単純類似度の値が、予めカテゴリ毎に設定
されたしきい値と比較され、単純類似度の値がしきい値
以下となったカテゴリについては、以後の詳細識別を行
わず、ステップｍ４に戻り、次のカテゴリの大分類処理
に移る。単純類似度がしきい値を超えたカテゴリについ
ては、次のステップｍ９に移り、詳細識別を行う。In the next step m8, step m7
The value of the simple similarity calculated in, is compared with a threshold set in advance for each category, for the category of which the value of the simple similarity is less than or equal to the threshold, without performing the subsequent detailed identification, The process returns to step m4, and the process of the large classification of the next category starts. For categories whose simple similarity exceeds the threshold value, the process proceeds to the next step m9 to perform detailed identification.

【００３０】ステップｍ９では、詳細識別部１３に認識
用辞書１５が読出され、単純類似度がしきい値を超えた
カテゴリに属する全文字に関して、詳細な識別が行われ
る。詳細識別において、詳細識別部１３は、８×８メッ
シュ特徴ベクトルと認識用辞書１５との類似度を計算し
た後、ステップｍ４に戻り、大分類用辞書７内の次のカ
テゴリを読出す。次のステップｍ５で、大分類辞書７内
の全カテゴリとの照合が終了したと判断されると、ステ
ップｍ１０に移る。In step m9, the recognition dictionary 15 is read to the detailed identification section 13, and detailed identification is performed for all the characters belonging to the category whose simple similarity exceeds the threshold value. In the detailed identification, the detailed identification unit 13 calculates the similarity between the 8 × 8 mesh feature vector and the recognition dictionary 15, and then returns to step m4 to read the next category in the large classification dictionary 7. When it is determined in the next step m5 that the collation with all the categories in the large classification dictionary 7 is completed, the process proceeds to step m10.

【００３１】次のステップｍ１０において、ステップｍ
９で計算され、蓄積された類似度をもとに、類似度順に
対応する文字コードを並べ換える。ステップｍ１１で
は、ステップｍ１０で並べ換えられた文字コードの第１
位〜第ｎ位を、認識結果としてＲＡＭ３に出力する。In the next step m10, step m
Based on the similarities calculated and accumulated in 9, the corresponding character codes are rearranged in the order of the similarities. In step m11, the first of the character codes rearranged in step m10
The rank to the nth rank are output to the RAM 3 as the recognition result.

【００３２】このように本実施例において、予め設定し
た２値フラグと、得られたメッシュ特徴で決定される２
値フラグとの照合という簡単な計算によって、単純類似
度を求めるべきカテゴリを限定することができる。ま
た、さらに、単純類似度の値に基づくカテゴリの並べ換
えを行わず、カテゴリ毎に予め設定されたしきい値と比
較することによって、適合、不適合を決定する。これに
よって、大分類の計算量を減少することができ、文字認
識処理の高速化を図ることができる。As described above, in this embodiment, the binary flag determined in advance and the value determined by the obtained mesh feature are used.
The category for which the simple similarity is to be calculated can be limited by a simple calculation of matching with the value flag. Further, the category is not rearranged based on the value of the simple similarity, and the conformity or nonconformity is determined by comparing with a threshold value set in advance for each category. As a result, the calculation amount of the large classification can be reduced, and the character recognition processing can be speeded up.

【００３３】[0033]

【発明の効果】以上のように本発明によれば、入力文字
画像の特徴を示すフラグデータを求め、このフラグデー
タと適合する標準フラグデータのグループを求め、各グ
ループの標準文字画像との照合を行い、類似度が予め定
めるしきい値以上のグループに分類されている各規準文
字画像との照合を行うので、入力文字画像が属するグル
ープの認識処理において計算量が少なく、したがって高
速な文字認識処理を行うことができる。As described above, according to the present invention, the flag data indicating the characteristics of the input character image is obtained, the group of the standard flag data that matches this flag data is obtained, and the standard character image of each group is collated. Since the similarity is compared with each of the reference character images classified into groups whose similarity is greater than or equal to a predetermined threshold value, the amount of calculation is small in the recognition process of the group to which the input character image belongs, and therefore high-speed character recognition is performed. Processing can be performed.

[Brief description of drawings]

【図１】本発明の一実施例の全体の構成を示すブロック
図である。FIG. 1 is a block diagram showing the overall configuration of an embodiment of the present invention.

【図２】抽出された文字画像の特徴パターンのうち、８
×８のメッシュ特徴および８×８ビットの２値メッシュ
フラグである大分類フラグを説明する図である。FIG. 2 shows 8 of the characteristic patterns of the extracted character image.
It is a figure explaining the large classification flag which is a * 8 mesh feature and a binary mesh flag of 8x8 bits.

【図３】８×８のメッシュ特徴量抽出の一例を示す図で
ある。FIG. 3 is a diagram illustrating an example of 8 × 8 mesh feature amount extraction.

【図４】８×８ビットの２値メッシュフラグである大分
類フラグと、辞書フラグとを説明する図である。FIG. 4 is a diagram illustrating a large classification flag which is a binary mesh flag of 8 × 8 bits and a dictionary flag.

【図５】本実施例の文字認識装置１が１文字の認識を行
う際の動作を示すフローチャートである。FIG. 5 is a flowchart showing an operation when the character recognition device 1 of the present embodiment recognizes one character.

【図６】従来行われている一般的な大分類の動作を示す
フローチャートである。FIG. 6 is a flowchart showing a general classification operation which is conventionally performed.

[Explanation of symbols]

１文字認識装置２ＣＰＵ５入力部７特徴抽出部８認識部１２大分類演算部１３詳細識別部１４大分類辞書部１５認識用辞書部１６辞書フラグ１７大分類用辞書１８しきい値 1 character recognition device 2 CPU 5 input unit 7 feature extraction unit 8 recognition unit 12 large classification calculation unit 13 detailed classification unit 14 large classification dictionary unit 15 recognition dictionary unit 16 dictionary flag 17 large classification dictionary 18 threshold

Claims

[Claims]

1. An input means for inputting a character image, a character dictionary in which a plurality of reference character images are classified into groups similar to each other, and standard flag data and a standard character image indicating the characteristics of each group are stored. Find the flag data that shows the characteristics of the input character image, find the group of standard flag data that matches this flag data, and compare with the standard character image of each group, and make a group whose similarity is greater than or equal to a predetermined threshold value. A character recognizing device, comprising: a recognizing unit that performs a collation with each of the classified standard character images and outputs the result as a recognition result in order of similarity.