JPH0773275A

JPH0773275A - Character recognition device

Info

Publication number: JPH0773275A
Application number: JP5218280A
Authority: JP
Inventors: Akiko Konno; 章子紺野; Yasuo Hongo; 保夫本郷
Original assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Current assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Priority date: 1993-09-02
Filing date: 1993-09-02
Publication date: 1995-03-17

Abstract

PURPOSE:To improve the recognition accuracy of a pattern of large curvature by recognizing mainly the curvature feature value of a contour picture element with a curvature changing point position defined as a feature. CONSTITUTION:A curvature feature value calculating part 10 calculates the complicacy of a character image and the curvature feature value of a contour line. A normalizing part 11 acquires a normalized size of an input pattern. A correlation calculating part 12 calculates the correlation between the curvature feature value of the normalized input pattern and a curvature feature value dictionary 13 and calculates the curvature resemblance. A directive feature value calculating part 14 calculates the directive feature value in a method similar to the conventional one and by the directive differentiation of 4 or 8 directions. A normalizing part 15 normalizes the size of the directive feature value, and a correlation calculating part 16 calculates the correlation between the directive feature value and the character stored in a directive feature value dictionary 17 and to be recognized. Then a general resemblance calculating part 18 multiplying the curvature resemblance and the directive resemblance by a proper weight coefficient and adds both resemblance together to calculate the general resemblance. A sorting part 19 sorts the recognized characters and transmits the character recognizing results.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、方向性特徴を用いて
文字を認識する文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for recognizing characters using directional features.

【０００２】[0002]

【従来の技術】光学式文字認識装置（ＯＣＲ）による文
字認識手法には、原パターンを正規化したものを特徴量
とする方式、方向性の微分を行なった結果の画像を特徴
量とする方式、定位置に配置したセルから各ストローク
への方向・距離を特徴量とする方式、ストローク構造を
解析する方式など様々な手法がある。その中の１つとし
て、文字パターンと背景の境界部分の方向性を特徴量と
し、特徴間のマッチング度合いを２つのパターンの特徴
ベクトル空間の相関値から求めた類似度ｃｏｓθで計測
する方法がある。2. Description of the Related Art Character recognition methods using an optical character recognition device (OCR) include a method in which a feature amount is obtained by normalizing an original pattern, and a method in which an image resulting from directional differentiation is used as a feature amount. There are various methods such as a method in which the direction / distance from a cell arranged at a fixed position to each stroke is used as a feature amount, and a method in which a stroke structure is analyzed. As one of them, there is a method in which the directionality of the boundary portion between the character pattern and the background is used as the feature amount, and the matching degree between the features is measured by the similarity cos θ obtained from the correlation value of the feature vector space of the two patterns. .

【０００３】[0003]

【発明が解決しようとする課題】この２次元特徴相関法
は文字のストローク、または背景を４または８方向の方
向性微分画像を特徴量とすることにより表現したもので
あるが、比較的直線成分が多い明朝体やゴシック体の印
刷漢字については効果があることが確認されている。し
かし、印刷漢字でもゆるやかな曲線成分を多く含む平仮
名や教科書体の文字に対しては、このような４方向また
は８方向の方向性特徴だけでは対応できず、認識精度が
低下するという問題がある。This two-dimensional feature correlation method is expressed by using a stroke or a background of a character as a feature amount of a directional differential image in 4 or 8 directions. It has been confirmed that it is effective for printing Kanji in Mincho and Gothic fonts, which are often used. However, even in the case of printed Chinese characters, it is not possible to deal with characters in hiragana or textbooks containing a lot of gentle curve components only with such directional features in four or eight directions, and there is a problem that the recognition accuracy deteriorates. .

【０００４】また、手書き文字に対しても、限られた方
向の方向性特徴だけでは認識精度に限界がある。これを
解決するためには、方向の刻みを１６，３２と増加させ
る方法もあるが、もとの画像の並びが格子状であること
から、８方向を越える方向に対しては方向性特徴量自体
の精度が低下するだけでなく、特徴量の次元数，辞書容
量，認識時間などが増加するという問題がある。したが
って、この発明の課題は特徴量の次元数，辞書容量，認
識時間などの増大を抑えながら認識精度を向上させるこ
とにある。Further, even with respect to handwritten characters, there is a limit to the recognition accuracy only with the directional features in a limited direction. In order to solve this, there is also a method of increasing the number of steps in the direction to 16, 32, but since the original images are arranged in a grid pattern, the directional feature quantity is added to the direction beyond 8 directions. There is a problem in that not only the accuracy of itself decreases but also the number of dimensions of the feature amount, the dictionary capacity, the recognition time, etc. increase. Therefore, an object of the present invention is to improve recognition accuracy while suppressing increases in the number of dimensions of feature quantity, dictionary capacity, recognition time, and the like.

【０００５】[0005]

【課題を解決するための手段】このような課題を解決す
るため、この発明では、方向性特徴量を用いて文字の認
識を行なう文字認識装置において、１文字単位で切り出
された文字画像から文字の縦横のサイズを抽出する文字
サイズ抽出手段と、文字画像の線密度から文字の複雑度
を算出する複雑度算出手段と、４または８方向の２次元
方向性特徴量を抽出する方向性特徴量抽出手段と、前記
文字サイズと複雑度から輪郭画素の平滑化条件を決定す
る平滑化条件決定手段と、決定された平滑化条件で輪郭
画素の曲率を求め、その絶対値の大きい部分を特徴量と
して抽出する曲率特徴量抽出手段と、この曲率値の分散
から文字を構成するストロークの直線度を計算する直線
度算出手段と、前記方向性特徴量，曲率特徴量を用いて
未知パターンと辞書パターンとの間の各相関値を求め、
これらに前記直線度による重み係数をそれぞれ掛けて加
算することにより総合類似度を求める総合類似度演算手
段とを設け、この総合類似度から文字の認識を行なうこ
とを特徴としている。In order to solve such a problem, according to the present invention, in a character recognition device for recognizing a character by using a directional characteristic amount, characters are extracted from a character image cut out character by character. Size extracting means for extracting the vertical and horizontal sizes of the, a complexity calculating means for calculating the complexity of the character from the line density of the character image, and a directional feature quantity for extracting a two-dimensional directional feature quantity in four or eight directions. Extraction means, smoothing condition determining means for determining the smoothing condition of the contour pixel from the character size and complexity, the curvature of the contour pixel is determined under the determined smoothing condition, and a portion having a large absolute value is a feature amount. A curvature feature amount extraction means for extracting as a straight line, a straightness degree calculation means for calculating a straightness of a stroke forming a character from the variance of the curvature values, and an unknown pattern and a word using the directional feature amount and the curvature feature amount. Obtains the correlation values between the pattern,
It is characterized in that a total similarity calculating means for obtaining a total similarity by multiplying these by a weighting coefficient based on the linearity and adding them is provided, and characters are recognized from the total similarity.

【０００６】[0006]

【作用】従来の方向性特徴量に加え、境界成分の曲率を
特徴量として導入する。この境界成分の曲率はそのまま
では誤差が大きく、特徴量として使用できないので、ま
ず線密度等によって文字の複雑度を計測し、文字のサイ
ズと複雑度に応じて曲率の平滑化係数を変える。そし
て、このようにして求めた方向性特徴と曲率特徴を総合
して文字認識を行なう。この場合の方向性特徴と曲率特
徴の重みは未知パターンから求めた直線度により、パタ
ーンが直線を多く含む場合は方向性特徴の比重を重く
し、一方、曲線が多い場合は曲率特徴の比重を重くして
それぞれ認識を行なう。In addition to the conventional directional feature amount, the curvature of the boundary component is introduced as a feature amount. The curvature of the boundary component has a large error as it is and cannot be used as a feature amount. Therefore, the complexity of the character is first measured by the line density or the like, and the smoothing coefficient of the curvature is changed according to the size and the complexity of the character. Then, character recognition is performed by integrating the directional characteristics and the curvature characteristics thus obtained. In this case, the weights of the directional feature and the curvature feature are determined by the linearity obtained from the unknown pattern. When the pattern includes many straight lines, the specific gravity of the directional feature is increased. Weigh each and recognize each one.

【０００７】[0007]

【実施例】図１はこの発明の実施例を示すブロック図で
ある。同図において、１はＯＣＲに入力される文書、２
は画像入力装置としてのイメージスキャナを示し、３は
ＯＣＲ全体を示す。４は文字画像入力部でイメージスキ
ャナ２を制御し、文書画像を画像メモリに格納する。５
は行切出し部で、文書画像からテキストの書かれた行を
抽出する。６は文字切出し部で、１文字ずつの画像を切
り出す。1 is a block diagram showing an embodiment of the present invention. In the figure, 1 is a document input to OCR, 2
Represents an image scanner as an image input device, and 3 represents the entire OCR. A character image input unit 4 controls the image scanner 2 to store the document image in the image memory. 5
Is a line cutout unit that extracts a line in which text is written from a document image. Reference numeral 6 denotes a character cutout unit which cuts out an image of each character.

【０００８】７は１文字認識ブロックであり、６で切り
出された１文字画像について認識を行なう。８は文字画
像からノイズを除去する部分、９はノイズ除去された文
字画像から文字サイズおよび線密度を計測するサイズ・
複雑度計測部である。１０はこの複雑度と文字画像から
輪郭線の曲率特徴量を算出する曲率特徴量算出部、１１
はこれを規定のサイズに正規化する正規化部、１３は認
識対象の各文字についての曲率特徴量を格納している辞
書、１２は１１で正規化された入力パターンの曲率特徴
と、１３の曲率特徴量辞書との相関演算を行ない、曲率
類似度を求める相関演算部である。Reference numeral 7 is a 1-character recognition block, which recognizes the 1-character image cut out in 6. 8 is a portion for removing noise from the character image, 9 is a size for measuring character size and line density from the noise-removed character image.
It is a complexity measuring unit. Reference numeral 10 is a curvature feature amount calculation unit that calculates the curvature feature amount of the contour line from the complexity and the character image, and 11
Is a normalization unit that normalizes this to a prescribed size, 13 is a dictionary that stores the curvature feature amount for each character to be recognized, 12 is the curvature feature of the input pattern normalized by 11, and 13 This is a correlation calculation unit that performs a correlation calculation with the curvature feature amount dictionary to obtain a curvature similarity.

【０００９】１４は従来と同様の方法により、４方向ま
たは８方向の方向性微分により方向性特徴量を算出する
方向性特徴量算出部、１５はこの特徴量のサイズを正規
化する正規化部、１６はこうして得られた方向性特徴量
と、方向性特徴量辞書１７に記憶されている認識対象の
文字についての方向性特徴量との相関値を求める相関演
算部である。１８は曲率類似度と方向性類似度とに適当
な重み係数を掛け合わせて加算することにより、総合類
似度を求める総合類似度計算部である。１９は総合類似
度の大きい順にソーティングを行ない、文字認識結果を
出力するソーティング部、２０は文字単位の認識結果に
対して、知識処理等により後処理を施す後処理部であ
り、２１が最終的に得られた認識結果を示す。Reference numeral 14 is a directional feature quantity calculating section for calculating a directional feature quantity by directional differentiation in four or eight directions by a method similar to the conventional method, and 15 is a normalizing section for normalizing the size of this feature quantity. , 16 are correlation calculation units for obtaining a correlation value between the directional characteristic amount thus obtained and the directional characteristic amount of the character to be recognized stored in the directional characteristic amount dictionary 17. Reference numeral 18 denotes an overall similarity calculating unit that obtains the overall similarity by multiplying the curvature similarity and the directional similarity by an appropriate weighting factor and adding them. Reference numeral 19 denotes a sorting unit that performs sorting in order of increasing total similarity and outputs a character recognition result, 20 denotes a post-processing unit that performs post-processing such as knowledge processing on the recognition result in character units, and 21 denotes a final processing unit. The recognition results obtained are shown in.

【００１０】図２は図１における認識手順を説明するた
めのフローチャートである。まず、入力パターンについ
てノイズの除去を行ない（ステップＳ１）、ついでステ
ップＳ２で文字サイズ（高さ，幅）、ステップＳ３で複
雑度Ｆをそれぞれ計測する。その後、一方では方向性特
徴の抽出（ステップＳ４）、他方では輪郭画素の曲率特
徴抽出，パターンの直線度αｐの計算（ステップＳ６）
を行ない、それぞれ正規化した後（ステップＳ５，Ｓ
７）、各辞書カテゴリとの方向性特徴量の相関値Ｄｊを
計算し（ステップＳ９）、他方では各辞書カテゴリとの
曲率特徴量相関値Ｃｊを計算する（ステップＳ１０）。
これらに対し、入力パターンの直線度αｐによる重み付
けをして加算することにより、総合類似度Ｔｊを求める
（ステップＳ１１）。総合類似度の大きい順にソーティ
ングを行ない（ステップＳ１２〜１４）、ステップＳ１
５で認識結果を出力する。FIG. 2 is a flow chart for explaining the recognition procedure in FIG. First, noise is removed from the input pattern (step S1), then the character size (height, width) is measured at step S2, and the complexity F is measured at step S3. After that, on the one hand, the extraction of the directional feature (step S4), on the other hand, the extraction of the curvature feature of the contour pixel, and the calculation of the linearity αp of the pattern (step S6).
And normalize each (steps S5, S
7) Then, the correlation value Dj of the directional feature amount with each dictionary category is calculated (step S9), and the curvature feature amount correlation value Cj with each dictionary category is calculated on the other hand (step S10).
The total similarity Tj is obtained by weighting these with the linearity αp of the input pattern and adding them (step S11). Sorting is performed in descending order of total similarity (steps S12 to S14), and step S1.
At 5, the recognition result is output.

【００１１】図３は図１における辞書作成手順を説明す
るためのフローチャートである。まず、カテゴリ内の第
１番目（ｉ＝１）のパターンから順に、ノイズ除去，文
字サイズ（高さ，幅）の計測および文字の複雑度Ｆの計
測を行なう（ステップＳ１〜４）。その後は、並列に２
つの特徴量の計算を行なう。一方では、従来と同様に方
向性特徴量の抽出，サイズの正規化を行ない、得られた
特徴量ｄｉ₁〜ｄｉ₈を辞書領域へ足し込む（ステップ
Ｓ５〜７）。もう一方では、文字のサイズに比例し、複
雑度に反比例するような平滑化区間を設定し、輪郭画素
の曲率画像を求める（ステップＳ８）。これについて
も、同様にサイズの正規化を行ない、得られた特徴量ｃ
ｉ₊〜ｃｉ_-を、それぞれ辞書Ｃ₊，Ｃ_-に足し込む
（ステップＳ９，１０）。これをカテゴリ内の全パター
ンに対して行なった後正規化し、１つのカテゴリに対す
る辞書作成を終了する（ステップＳ１１〜１３）。FIG. 3 is a flow chart for explaining the dictionary creating procedure in FIG. First, noise removal, character size (height, width) measurement, and character complexity F measurement are performed in order from the first (i = 1) pattern in the category (steps S1 to 4). After that, 2 in parallel
Calculate two feature quantities. On the other hand, the directional feature quantity is extracted and the size is normalized as in the conventional case, and the obtained feature quantities di _{1 to} di ₈ are added to the dictionary area (steps S5 to 7). On the other hand, a smoothing section that is proportional to the character size and inversely proportional to the complexity is set, and the curvature image of the contour pixel is obtained (step S8). Also for this, the size is normalized in the same manner, and the obtained feature amount c
i ₊ ~ci _- a dictionary respectively C _+, C _- the Komu added (Step S9 and S10). This is performed for all patterns in the category and then normalized, and dictionary creation for one category is completed (steps S11 to 13).

【００１２】以下、具体的に説明する。図４に、文字画
像例を示す。これは、教科書体の漢字「乙」の例であ
る。この文字は直線と曲線とからなっているが、字体に
よって曲線の曲がり方が異なり、平仮名の「て」として
誤読されることも多い。なお、この文字の複雑度Ｆは、
例えば次の数１〜数５で示されるような、線の立ち上が
りをカウントする関数から求めることができる。A detailed description will be given below. FIG. 4 shows an example of a character image. This is an example of the kanji "Oto" in the textbook. This character consists of a straight line and a curved line, but the way the curve bends differs depending on the typeface, and is often misread as the hiragana “te”. The complexity F of this character is
For example, it can be obtained from a function that counts the rising of a line as shown in the following equations 1 to 5.

【００１３】[0013]

【数１】 [Equation 1]

【数２】 [Equation 2]

【数３】 [Equation 3]

【数４】 [Equation 4]

【数５】 [Equation 5]

【００１４】図４に示す文字画像から求まる複雑度は、
次のようになる。Ｆｘ＝１．３８Ｆｙ＝２．６７Ｆ＝２．０４The complexity obtained from the character image shown in FIG.
It looks like this: Fx = 1.38 Fy = 2.67 F = 2.04

【００１５】図４に示す文字画像から、従来方式により
８方向の方向性特徴（ストローク）を抽出する方法につ
いて、説明する。図５は８方向を説明するための説明
図、図６は方向性特徴を抽出するための３×３の局部メ
モリ構成図、図７は方向性特徴を抽出するための方向性
微分オペレータ例を示す説明図、図８は図４に対応する
微分画像例である。なお、図７（イ），（ロ）は良く知
られているＳｏｂｅｌオペレータの例であるが、方向性
の抽出が可能な他のオペレータで代用することができ
る。A method of extracting directional features (strokes) in eight directions by the conventional method from the character image shown in FIG. 4 will be described. FIG. 5 is an explanatory view for explaining 8 directions, FIG. 6 is a 3 × 3 local memory block diagram for extracting directional features, and FIG. 7 is an example of a directional differential operator for extracting directional features. 8A and 8B are examples of differential images corresponding to FIG. 7 (a) and 7 (b) are examples of the well-known Sobel operator, other operators capable of extracting the direction can be substituted.

【００１６】まず、図６の局部領域に図７（イ），
（ロ）に示すＳｏｂｅｌオペレータを掛け合わせて次の
数６，数７のように微分値∇ｘ（ｘ，ｙ），∇ｙ（ｘ，
ｙ）を求める。図４に対する微分画像例を図８に示す。
さらに、これを用いて数８のように境界成分の方向θ
（ｘ，ｙ）、すなわち角度を算出する。First, in the local area of FIG. 6, FIG.
By multiplying the Sobel operator shown in (b), differential values ∇x (x, y) and ∇y (x,
y) is calculated. An example of the differential image with respect to FIG. 4 is shown in FIG.
Furthermore, using this, the direction θ of the boundary component
(X, y), that is, the angle is calculated.

【数６】 [Equation 6]

【数７】 [Equation 7]

【数８】以上のようにして求めた境界成分の方向θ（ｘ，ｙ）
を、境界成分の輪郭順に並べた結果を図９に示す。[Equation 8] Boundary component direction θ (x, y) obtained as described above
FIG. 9 shows the result of arranging B in the order of the boundary components.

【００１７】曲率は、或る点Ｉ（ｘ，ｙ）での方向θ
（ｘ，ｙ）の変化量（角度の変化量）として表わすこと
ができる。ここでは、曲率Ｋ（ｘ，ｙ）を次の数９によ
り求める。ただし、数９の（ｘｉ，ｙｉ）は輪郭点列の
座標を示し、ｉの順に境界成分が並んでいることを示し
ている。また、図４の画像について、下記数９により求
めた曲率Ｋ（ｘ，ｙ）を図１０に示す。この図１０では
平滑化区間をｗ＝４としたが、この平滑化係数ｗは前に
求めた文字のサイズにほぼ比例し、文字の複雑度に反比
例するものである。The curvature is the direction θ at a point I (x, y).
It can be expressed as a variation amount (angle variation amount) of (x, y). Here, the curvature K (x, y) is calculated by the following equation 9. However, (xi, yi) in the equation 9 indicates the coordinates of the outline point sequence, and indicates that the boundary components are arranged in the order of i. Further, FIG. 10 shows the curvature K (x, y) obtained by the following Expression 9 for the image of FIG. In FIG. 10, the smoothing section is set to w = 4, but the smoothing coefficient w is almost proportional to the size of the previously obtained character and inversely proportional to the complexity of the character.

【数９】 [Equation 9]

【００１８】この曲率値を正規化し、絶対値が或るしき
い値（ここでは、π／２）以上の値を持つ画素だけを＋
面（曲率＞０）と−面（曲率＜０）に分けて、画像の形
に戻したものを図１１（イ），（ロ）に示し、これを曲
率特徴量とする。この曲率特徴量が出ている点は輪郭の
方向の変化が大きい点であり、同（イ）の＋面は凸の
面、同（ロ）の−面は凹の面を示しており、黒塗りの部
分が曲率の大きい頂点部分である。また、同（イ）の矢
印は画像の追跡方向を示している。This curvature value is normalized, and only the pixels whose absolute value is a certain threshold value (here, π / 2) or more are +
Surfaces (curvature> 0) and negative surfaces (curvature <0), which are returned to the image shape, are shown in FIGS. 11 (a) and 11 (b), which are referred to as curvature feature quantities. The point where this curvature feature amount appears is a point where the change in the direction of the contour is large, the + surface of (a) is a convex surface, and the − surface of (b) is a concave surface. The painted part is the apex part with a large curvature. Further, the arrow (a) indicates the tracking direction of the image.

【００１９】ここで、未知パターンの正規化された方向
性特徴量ベクトルをｄ₁〜ｄ₈、同じく正規化された曲
率特徴量ベクトルをｃ₊，ｃ_-とし、辞書ｊの方向性特
徴量ベクトルをｄｊ₁〜ｄｊ₈、同じく正規化された曲
率特徴量ベクトルをｃｊ₊，ｃｊ_-とすると、未知パタ
ーンと辞書の方向性特徴相関値Ｄｊ、および曲率特徴相
関値Ｃｊは、次の数１０，数１１および数１２により求
められる。なお、Ｎｋは特徴量ベクトルＤｋの次元数を
示す。また、ａ・ｂはベクトルａとベクトルｂの内積を
示し、符号を二重線で囲んでそのノルムを表わす。[0019] Here, ₊ normalized directional feature vectors of d ₁ to d ₈ of unknown pattern, like the normalized curvature feature vectors c, c _- a, and directional feature vector of the dictionary j Are dj ₁ to dj ₈ , and the normalized curvature feature amount vectors are cj ₊ and cj ₋ , the directional feature correlation value Dj of the unknown pattern and the dictionary, and the curvature feature correlation value Cj are It is obtained by the equations 11 and 12. Nk represents the number of dimensions of the feature amount vector Dk. Further, a · b represents the inner product of the vector a and the vector b, and the norm is represented by enclosing the code in double lines.

【００２０】[0020]

【数１０】 [Equation 10]

【数１１】 [Equation 11]

【数１２】 [Equation 12]

【００２１】また、曲率特徴量ベクトルから曲線の分散
に比例する直線度αｐが求められるので、方向性特徴相
関値Ｄｊおよび曲率特徴相関値Ｃｊに直線度αｐを掛け
て、総合相関値Ｔｊを次の数１３により求める。Since the linearity αp proportional to the variance of the curve can be obtained from the curvature feature vector, the directional characteristic correlation value Dj and the curvature characteristic correlation value Cj are multiplied by the linearity αp to obtain the total correlation value Tj as follows. It is obtained by the equation 13.

【数１３】 [Equation 13]

【００２２】[0022]

【発明の効果】この発明によれば、従来の方向性特徴量
に加えて輪郭画素の曲率特徴量を導入し、直線を多く含
むパターンは従来の方向性特徴量の比重を重くし、曲線
を多く含むパターンに対しては方向性特徴量の比重を少
なくし、曲率特徴量を主体に曲率変化点位置（頂点等）
を特徴として認識するようにしたので、曲線の多いパタ
ーンに対しても高精度の認識が可能となる利点が得られ
る。According to the present invention, the curvature feature amount of the contour pixel is introduced in addition to the conventional directional feature amount, and a pattern including a large number of straight lines increases the specific gravity of the conventional directional feature amount, and a curved line is formed. For patterns that contain many patterns, the gravity of directional features is reduced, and the curvature change point position (vertices, etc.) is mainly based on curvature features.
Since it is recognized as a feature, there is an advantage that a pattern with many curves can be recognized with high accuracy.

[Brief description of drawings]

【図１】この発明の実施例を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of the present invention.

【図２】図１における認識手順を説明するためのフロー
チャートである。FIG. 2 is a flowchart for explaining a recognition procedure in FIG.

【図３】図１における辞書作成手順を説明するためのフ
ローチャートである。FIG. 3 is a flowchart for explaining a dictionary creation procedure in FIG.

【図４】文字画像例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of a character image.

【図５】方向の定義を説明するための説明図である。FIG. 5 is an explanatory diagram for explaining the definition of a direction.

【図６】局部メモリを説明するための説明図である。FIG. 6 is an explanatory diagram illustrating a local memory.

【図７】微分オペレータの例を説明するための説明図で
ある。FIG. 7 is an explanatory diagram illustrating an example of a differential operator.

【図８】図４に対する微分画像例を説明するための説明
図である。FIG. 8 is an explanatory diagram for explaining an example of a differential image with respect to FIG. 4.

【図９】輪郭画像の方向（角度）を説明するための説明
図である。FIG. 9 is an explanatory diagram illustrating a direction (angle) of a contour image.

【図１０】輪郭画像の曲率を説明するための説明図であ
る。FIG. 10 is an explanatory diagram for explaining the curvature of a contour image.

【図１１】曲率特徴量を説明するための説明図である。FIG. 11 is an explanatory diagram for explaining a curvature feature amount.

[Explanation of symbols]

１…文書、２…イメージスキャナ、３…文字認識装置、
４…文字画像入力部、５…行切出し部、６…文字切出し
部、７…１文字認識ブロック、８…ノイズ除去部、９…
サイズ・複雑度計測部、１０…曲率特徴量算出部、１
１，１５…正規化部、１２，１６…相関演算部、１３…
曲率特徴量辞書、１４…方向性特徴量算出部、１８…総
合類似度計算部、１９…ソーティング部、２０…後処理
部、２１…認識結果。1 ... document, 2 ... image scanner, 3 ... character recognition device,
4 ... Character image input section, 5 ... Line cutout section, 6 ... Character cutout section, 7 ... 1 character recognition block, 8 ... Noise removal section, 9 ...
Size / complexity measurement unit, 10 ... Curvature feature amount calculation unit, 1
1, 15 ... Normalization unit, 12, 16 ... Correlation calculation unit, 13 ...
Curvature feature amount dictionary, 14 ... Directional feature amount calculation unit, 18 ... Total similarity calculation unit, 19 ... Sorting unit, 20 ... Post-processing unit, 21 ... Recognition result.

Claims

[Claims]

1. A character recognizing device for recognizing a character using a directional characteristic amount, and a character size extracting means for extracting a vertical and horizontal size of a character from a character image cut out in units of one character, and a line of the character image. A complexity calculating means for calculating the complexity of a character from the density, a directional characteristic quantity extracting means for extracting a two-dimensional directional feature quantity in 4 or 8 directions, and a smoothing condition for contour pixels from the character size and complexity. The smoothing condition determining means for determining, the curvature feature amount extracting means for obtaining the curvature of the contour pixel under the determined smoothing condition, and extracting the portion having a large absolute value as the feature amount, the character from the variance of the curvature values. And a linearity calculation means for calculating the linearity of the strokes, and the correlation values between the unknown pattern and the dictionary pattern using the directional feature quantity and the curvature feature quantity, and the linearity That the overall similarity calculating means for calculating an overall similarity by weighting factors and adding the multiplied respectively, the provided character recognition apparatus characterized by performing the recognition of characters from the overall similarity.