JPH0773275A - Character recognition device - Google Patents

Character recognition device

Info

Publication number
JPH0773275A
JPH0773275A JP5218280A JP21828093A JPH0773275A JP H0773275 A JPH0773275 A JP H0773275A JP 5218280 A JP5218280 A JP 5218280A JP 21828093 A JP21828093 A JP 21828093A JP H0773275 A JPH0773275 A JP H0773275A
Authority
JP
Japan
Prior art keywords
curvature
character
feature value
directive
directional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP5218280A
Other languages
Japanese (ja)
Inventor
Akiko Konno
章子 紺野
Yasuo Hongo
保夫 本郷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuji Electric Co Ltd
Fuji Facom Corp
Original Assignee
Fuji Electric Co Ltd
Fuji Facom Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuji Electric Co Ltd, Fuji Facom Corp filed Critical Fuji Electric Co Ltd
Priority to JP5218280A priority Critical patent/JPH0773275A/en
Publication of JPH0773275A publication Critical patent/JPH0773275A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To improve the recognition accuracy of a pattern of large curvature by recognizing mainly the curvature feature value of a contour picture element with a curvature changing point position defined as a feature. CONSTITUTION:A curvature feature value calculating part 10 calculates the complicacy of a character image and the curvature feature value of a contour line. A normalizing part 11 acquires a normalized size of an input pattern. A correlation calculating part 12 calculates the correlation between the curvature feature value of the normalized input pattern and a curvature feature value dictionary 13 and calculates the curvature resemblance. A directive feature value calculating part 14 calculates the directive feature value in a method similar to the conventional one and by the directive differentiation of 4 or 8 directions. A normalizing part 15 normalizes the size of the directive feature value, and a correlation calculating part 16 calculates the correlation between the directive feature value and the character stored in a directive feature value dictionary 17 and to be recognized. Then a general resemblance calculating part 18 multiplying the curvature resemblance and the directive resemblance by a proper weight coefficient and adds both resemblance together to calculate the general resemblance. A sorting part 19 sorts the recognized characters and transmits the character recognizing results.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】この発明は、方向性特徴を用いて
文字を認識する文字認識装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for recognizing characters using directional features.

【0002】[0002]

【従来の技術】光学式文字認識装置(OCR)による文
字認識手法には、原パターンを正規化したものを特徴量
とする方式、方向性の微分を行なった結果の画像を特徴
量とする方式、定位置に配置したセルから各ストローク
への方向・距離を特徴量とする方式、ストローク構造を
解析する方式など様々な手法がある。その中の1つとし
て、文字パターンと背景の境界部分の方向性を特徴量と
し、特徴間のマッチング度合いを2つのパターンの特徴
ベクトル空間の相関値から求めた類似度cosθで計測
する方法がある。
2. Description of the Related Art Character recognition methods using an optical character recognition device (OCR) include a method in which a feature amount is obtained by normalizing an original pattern, and a method in which an image resulting from directional differentiation is used as a feature amount. There are various methods such as a method in which the direction / distance from a cell arranged at a fixed position to each stroke is used as a feature amount, and a method in which a stroke structure is analyzed. As one of them, there is a method in which the directionality of the boundary portion between the character pattern and the background is used as the feature amount, and the matching degree between the features is measured by the similarity cos θ obtained from the correlation value of the feature vector space of the two patterns. .

【0003】[0003]

【発明が解決しようとする課題】この2次元特徴相関法
は文字のストローク、または背景を4または8方向の方
向性微分画像を特徴量とすることにより表現したもので
あるが、比較的直線成分が多い明朝体やゴシック体の印
刷漢字については効果があることが確認されている。し
かし、印刷漢字でもゆるやかな曲線成分を多く含む平仮
名や教科書体の文字に対しては、このような4方向また
は8方向の方向性特徴だけでは対応できず、認識精度が
低下するという問題がある。
This two-dimensional feature correlation method is expressed by using a stroke or a background of a character as a feature amount of a directional differential image in 4 or 8 directions. It has been confirmed that it is effective for printing Kanji in Mincho and Gothic fonts, which are often used. However, even in the case of printed Chinese characters, it is not possible to deal with characters in hiragana or textbooks containing a lot of gentle curve components only with such directional features in four or eight directions, and there is a problem that the recognition accuracy deteriorates. .

【0004】また、手書き文字に対しても、限られた方
向の方向性特徴だけでは認識精度に限界がある。これを
解決するためには、方向の刻みを16,32と増加させ
る方法もあるが、もとの画像の並びが格子状であること
から、8方向を越える方向に対しては方向性特徴量自体
の精度が低下するだけでなく、特徴量の次元数,辞書容
量,認識時間などが増加するという問題がある。したが
って、この発明の課題は特徴量の次元数,辞書容量,認
識時間などの増大を抑えながら認識精度を向上させるこ
とにある。
Further, even with respect to handwritten characters, there is a limit to the recognition accuracy only with the directional features in a limited direction. In order to solve this, there is also a method of increasing the number of steps in the direction to 16, 32, but since the original images are arranged in a grid pattern, the directional feature quantity is added to the direction beyond 8 directions. There is a problem in that not only the accuracy of itself decreases but also the number of dimensions of the feature amount, the dictionary capacity, the recognition time, etc. increase. Therefore, an object of the present invention is to improve recognition accuracy while suppressing increases in the number of dimensions of feature quantity, dictionary capacity, recognition time, and the like.

【0005】[0005]

【課題を解決するための手段】このような課題を解決す
るため、この発明では、方向性特徴量を用いて文字の認
識を行なう文字認識装置において、1文字単位で切り出
された文字画像から文字の縦横のサイズを抽出する文字
サイズ抽出手段と、文字画像の線密度から文字の複雑度
を算出する複雑度算出手段と、4または8方向の2次元
方向性特徴量を抽出する方向性特徴量抽出手段と、前記
文字サイズと複雑度から輪郭画素の平滑化条件を決定す
る平滑化条件決定手段と、決定された平滑化条件で輪郭
画素の曲率を求め、その絶対値の大きい部分を特徴量と
して抽出する曲率特徴量抽出手段と、この曲率値の分散
から文字を構成するストロークの直線度を計算する直線
度算出手段と、前記方向性特徴量,曲率特徴量を用いて
未知パターンと辞書パターンとの間の各相関値を求め、
これらに前記直線度による重み係数をそれぞれ掛けて加
算することにより総合類似度を求める総合類似度演算手
段とを設け、この総合類似度から文字の認識を行なうこ
とを特徴としている。
In order to solve such a problem, according to the present invention, in a character recognition device for recognizing a character by using a directional characteristic amount, characters are extracted from a character image cut out character by character. Size extracting means for extracting the vertical and horizontal sizes of the, a complexity calculating means for calculating the complexity of the character from the line density of the character image, and a directional feature quantity for extracting a two-dimensional directional feature quantity in four or eight directions. Extraction means, smoothing condition determining means for determining the smoothing condition of the contour pixel from the character size and complexity, the curvature of the contour pixel is determined under the determined smoothing condition, and a portion having a large absolute value is a feature amount. A curvature feature amount extraction means for extracting as a straight line, a straightness degree calculation means for calculating a straightness of a stroke forming a character from the variance of the curvature values, and an unknown pattern and a word using the directional feature amount and the curvature feature amount. Obtains the correlation values between the pattern,
It is characterized in that a total similarity calculating means for obtaining a total similarity by multiplying these by a weighting coefficient based on the linearity and adding them is provided, and characters are recognized from the total similarity.

【0006】[0006]

【作用】従来の方向性特徴量に加え、境界成分の曲率を
特徴量として導入する。この境界成分の曲率はそのまま
では誤差が大きく、特徴量として使用できないので、ま
ず線密度等によって文字の複雑度を計測し、文字のサイ
ズと複雑度に応じて曲率の平滑化係数を変える。そし
て、このようにして求めた方向性特徴と曲率特徴を総合
して文字認識を行なう。この場合の方向性特徴と曲率特
徴の重みは未知パターンから求めた直線度により、パタ
ーンが直線を多く含む場合は方向性特徴の比重を重く
し、一方、曲線が多い場合は曲率特徴の比重を重くして
それぞれ認識を行なう。
In addition to the conventional directional feature amount, the curvature of the boundary component is introduced as a feature amount. The curvature of the boundary component has a large error as it is and cannot be used as a feature amount. Therefore, the complexity of the character is first measured by the line density or the like, and the smoothing coefficient of the curvature is changed according to the size and the complexity of the character. Then, character recognition is performed by integrating the directional characteristics and the curvature characteristics thus obtained. In this case, the weights of the directional feature and the curvature feature are determined by the linearity obtained from the unknown pattern. When the pattern includes many straight lines, the specific gravity of the directional feature is increased. Weigh each and recognize each one.

【0007】[0007]

【実施例】図1はこの発明の実施例を示すブロック図で
ある。同図において、1はOCRに入力される文書、2
は画像入力装置としてのイメージスキャナを示し、3は
OCR全体を示す。4は文字画像入力部でイメージスキ
ャナ2を制御し、文書画像を画像メモリに格納する。5
は行切出し部で、文書画像からテキストの書かれた行を
抽出する。6は文字切出し部で、1文字ずつの画像を切
り出す。
1 is a block diagram showing an embodiment of the present invention. In the figure, 1 is a document input to OCR, 2
Represents an image scanner as an image input device, and 3 represents the entire OCR. A character image input unit 4 controls the image scanner 2 to store the document image in the image memory. 5
Is a line cutout unit that extracts a line in which text is written from a document image. Reference numeral 6 denotes a character cutout unit which cuts out an image of each character.

【0008】7は1文字認識ブロックであり、6で切り
出された1文字画像について認識を行なう。8は文字画
像からノイズを除去する部分、9はノイズ除去された文
字画像から文字サイズおよび線密度を計測するサイズ・
複雑度計測部である。10はこの複雑度と文字画像から
輪郭線の曲率特徴量を算出する曲率特徴量算出部、11
はこれを規定のサイズに正規化する正規化部、13は認
識対象の各文字についての曲率特徴量を格納している辞
書、12は11で正規化された入力パターンの曲率特徴
と、13の曲率特徴量辞書との相関演算を行ない、曲率
類似度を求める相関演算部である。
Reference numeral 7 is a 1-character recognition block, which recognizes the 1-character image cut out in 6. 8 is a portion for removing noise from the character image, 9 is a size for measuring character size and line density from the noise-removed character image.
It is a complexity measuring unit. Reference numeral 10 is a curvature feature amount calculation unit that calculates the curvature feature amount of the contour line from the complexity and the character image, and 11
Is a normalization unit that normalizes this to a prescribed size, 13 is a dictionary that stores the curvature feature amount for each character to be recognized, 12 is the curvature feature of the input pattern normalized by 11, and 13 This is a correlation calculation unit that performs a correlation calculation with the curvature feature amount dictionary to obtain a curvature similarity.

【0009】14は従来と同様の方法により、4方向ま
たは8方向の方向性微分により方向性特徴量を算出する
方向性特徴量算出部、15はこの特徴量のサイズを正規
化する正規化部、16はこうして得られた方向性特徴量
と、方向性特徴量辞書17に記憶されている認識対象の
文字についての方向性特徴量との相関値を求める相関演
算部である。18は曲率類似度と方向性類似度とに適当
な重み係数を掛け合わせて加算することにより、総合類
似度を求める総合類似度計算部である。19は総合類似
度の大きい順にソーティングを行ない、文字認識結果を
出力するソーティング部、20は文字単位の認識結果に
対して、知識処理等により後処理を施す後処理部であ
り、21が最終的に得られた認識結果を示す。
Reference numeral 14 is a directional feature quantity calculating section for calculating a directional feature quantity by directional differentiation in four or eight directions by a method similar to the conventional method, and 15 is a normalizing section for normalizing the size of this feature quantity. , 16 are correlation calculation units for obtaining a correlation value between the directional characteristic amount thus obtained and the directional characteristic amount of the character to be recognized stored in the directional characteristic amount dictionary 17. Reference numeral 18 denotes an overall similarity calculating unit that obtains the overall similarity by multiplying the curvature similarity and the directional similarity by an appropriate weighting factor and adding them. Reference numeral 19 denotes a sorting unit that performs sorting in order of increasing total similarity and outputs a character recognition result, 20 denotes a post-processing unit that performs post-processing such as knowledge processing on the recognition result in character units, and 21 denotes a final processing unit. The recognition results obtained are shown in.

【0010】図2は図1における認識手順を説明するた
めのフローチャートである。まず、入力パターンについ
てノイズの除去を行ない(ステップS1)、ついでステ
ップS2で文字サイズ(高さ,幅)、ステップS3で複
雑度Fをそれぞれ計測する。その後、一方では方向性特
徴の抽出(ステップS4)、他方では輪郭画素の曲率特
徴抽出,パターンの直線度αpの計算(ステップS6)
を行ない、それぞれ正規化した後(ステップS5,S
7)、各辞書カテゴリとの方向性特徴量の相関値Djを
計算し(ステップS9)、他方では各辞書カテゴリとの
曲率特徴量相関値Cjを計算する(ステップS10)。
これらに対し、入力パターンの直線度αpによる重み付
けをして加算することにより、総合類似度Tjを求める
(ステップS11)。総合類似度の大きい順にソーティ
ングを行ない(ステップS12〜14)、ステップS1
5で認識結果を出力する。
FIG. 2 is a flow chart for explaining the recognition procedure in FIG. First, noise is removed from the input pattern (step S1), then the character size (height, width) is measured at step S2, and the complexity F is measured at step S3. After that, on the one hand, the extraction of the directional feature (step S4), on the other hand, the extraction of the curvature feature of the contour pixel, and the calculation of the linearity αp of the pattern (step S6).
And normalize each (steps S5, S
7) Then, the correlation value Dj of the directional feature amount with each dictionary category is calculated (step S9), and the curvature feature amount correlation value Cj with each dictionary category is calculated on the other hand (step S10).
The total similarity Tj is obtained by weighting these with the linearity αp of the input pattern and adding them (step S11). Sorting is performed in descending order of total similarity (steps S12 to S14), and step S1.
At 5, the recognition result is output.

【0011】図3は図1における辞書作成手順を説明す
るためのフローチャートである。まず、カテゴリ内の第
1番目(i=1)のパターンから順に、ノイズ除去,文
字サイズ(高さ,幅)の計測および文字の複雑度Fの計
測を行なう(ステップS1〜4)。その後は、並列に2
つの特徴量の計算を行なう。一方では、従来と同様に方
向性特徴量の抽出,サイズの正規化を行ない、得られた
特徴量di1 〜di8 を辞書領域へ足し込む(ステップ
S5〜7)。もう一方では、文字のサイズに比例し、複
雑度に反比例するような平滑化区間を設定し、輪郭画素
の曲率画像を求める(ステップS8)。これについて
も、同様にサイズの正規化を行ない、得られた特徴量c
+ 〜ci- を、それぞれ辞書C+ ,C- に足し込む
(ステップS9,10)。これをカテゴリ内の全パター
ンに対して行なった後正規化し、1つのカテゴリに対す
る辞書作成を終了する(ステップS11〜13)。
FIG. 3 is a flow chart for explaining the dictionary creating procedure in FIG. First, noise removal, character size (height, width) measurement, and character complexity F measurement are performed in order from the first (i = 1) pattern in the category (steps S1 to 4). After that, 2 in parallel
Calculate two feature quantities. On the other hand, the directional feature quantity is extracted and the size is normalized as in the conventional case, and the obtained feature quantities di 1 to di 8 are added to the dictionary area (steps S5 to 7). On the other hand, a smoothing section that is proportional to the character size and inversely proportional to the complexity is set, and the curvature image of the contour pixel is obtained (step S8). Also for this, the size is normalized in the same manner, and the obtained feature amount c
i + ~ci - a dictionary respectively C +, C - the Komu added (Step S9 and S10). This is performed for all patterns in the category and then normalized, and dictionary creation for one category is completed (steps S11 to 13).

【0012】以下、具体的に説明する。図4に、文字画
像例を示す。これは、教科書体の漢字「乙」の例であ
る。この文字は直線と曲線とからなっているが、字体に
よって曲線の曲がり方が異なり、平仮名の「て」として
誤読されることも多い。なお、この文字の複雑度Fは、
例えば次の数1〜数5で示されるような、線の立ち上が
りをカウントする関数から求めることができる。
A detailed description will be given below. FIG. 4 shows an example of a character image. This is an example of the kanji "Oto" in the textbook. This character consists of a straight line and a curved line, but the way the curve bends differs depending on the typeface, and is often misread as the hiragana “te”. The complexity F of this character is
For example, it can be obtained from a function that counts the rising of a line as shown in the following equations 1 to 5.

【0013】[0013]

【数1】 [Equation 1]

【数2】 [Equation 2]

【数3】 [Equation 3]

【数4】 [Equation 4]

【数5】 [Equation 5]

【0014】図4に示す文字画像から求まる複雑度は、
次のようになる。 Fx=1.38 Fy=2.67 F =2.04
The complexity obtained from the character image shown in FIG.
It looks like this: Fx = 1.38 Fy = 2.67 F = 2.04

【0015】図4に示す文字画像から、従来方式により
8方向の方向性特徴(ストローク)を抽出する方法につ
いて、説明する。図5は8方向を説明するための説明
図、図6は方向性特徴を抽出するための3×3の局部メ
モリ構成図、図7は方向性特徴を抽出するための方向性
微分オペレータ例を示す説明図、図8は図4に対応する
微分画像例である。なお、図7(イ),(ロ)は良く知
られているSobelオペレータの例であるが、方向性
の抽出が可能な他のオペレータで代用することができ
る。
A method of extracting directional features (strokes) in eight directions by the conventional method from the character image shown in FIG. 4 will be described. FIG. 5 is an explanatory view for explaining 8 directions, FIG. 6 is a 3 × 3 local memory block diagram for extracting directional features, and FIG. 7 is an example of a directional differential operator for extracting directional features. 8A and 8B are examples of differential images corresponding to FIG. 7 (a) and 7 (b) are examples of the well-known Sobel operator, other operators capable of extracting the direction can be substituted.

【0016】まず、図6の局部領域に図7(イ),
(ロ)に示すSobelオペレータを掛け合わせて次の
数6,数7のように微分値∇x(x,y),∇y(x,
y)を求める。図4に対する微分画像例を図8に示す。
さらに、これを用いて数8のように境界成分の方向θ
(x,y)、すなわち角度を算出する。
First, in the local area of FIG. 6, FIG.
By multiplying the Sobel operator shown in (b), differential values ∇x (x, y) and ∇y (x,
y) is calculated. An example of the differential image with respect to FIG. 4 is shown in FIG.
Furthermore, using this, the direction θ of the boundary component
(X, y), that is, the angle is calculated.

【数6】 [Equation 6]

【数7】 [Equation 7]

【数8】 以上のようにして求めた境界成分の方向θ(x,y)
を、境界成分の輪郭順に並べた結果を図9に示す。
[Equation 8] Boundary component direction θ (x, y) obtained as described above
FIG. 9 shows the result of arranging B in the order of the boundary components.

【0017】曲率は、或る点I(x,y)での方向θ
(x,y)の変化量(角度の変化量)として表わすこと
ができる。ここでは、曲率K(x,y)を次の数9によ
り求める。ただし、数9の(xi,yi)は輪郭点列の
座標を示し、iの順に境界成分が並んでいることを示し
ている。また、図4の画像について、下記数9により求
めた曲率K(x,y)を図10に示す。この図10では
平滑化区間をw=4としたが、この平滑化係数wは前に
求めた文字のサイズにほぼ比例し、文字の複雑度に反比
例するものである。
The curvature is the direction θ at a point I (x, y).
It can be expressed as a variation amount (angle variation amount) of (x, y). Here, the curvature K (x, y) is calculated by the following equation 9. However, (xi, yi) in the equation 9 indicates the coordinates of the outline point sequence, and indicates that the boundary components are arranged in the order of i. Further, FIG. 10 shows the curvature K (x, y) obtained by the following Expression 9 for the image of FIG. In FIG. 10, the smoothing section is set to w = 4, but the smoothing coefficient w is almost proportional to the size of the previously obtained character and inversely proportional to the complexity of the character.

【数9】 [Equation 9]

【0018】この曲率値を正規化し、絶対値が或るしき
い値(ここでは、π/2)以上の値を持つ画素だけを+
面(曲率>0)と−面(曲率<0)に分けて、画像の形
に戻したものを図11(イ),(ロ)に示し、これを曲
率特徴量とする。この曲率特徴量が出ている点は輪郭の
方向の変化が大きい点であり、同(イ)の+面は凸の
面、同(ロ)の−面は凹の面を示しており、黒塗りの部
分が曲率の大きい頂点部分である。また、同(イ)の矢
印は画像の追跡方向を示している。
This curvature value is normalized, and only the pixels whose absolute value is a certain threshold value (here, π / 2) or more are +
Surfaces (curvature> 0) and negative surfaces (curvature <0), which are returned to the image shape, are shown in FIGS. 11 (a) and 11 (b), which are referred to as curvature feature quantities. The point where this curvature feature amount appears is a point where the change in the direction of the contour is large, the + surface of (a) is a convex surface, and the − surface of (b) is a concave surface. The painted part is the apex part with a large curvature. Further, the arrow (a) indicates the tracking direction of the image.

【0019】ここで、未知パターンの正規化された方向
性特徴量ベクトルをd1 〜d8 、同じく正規化された曲
率特徴量ベクトルをc+ ,c- とし、辞書jの方向性特
徴量ベクトルをdj1 〜dj8 、同じく正規化された曲
率特徴量ベクトルをcj+ ,cj- とすると、未知パタ
ーンと辞書の方向性特徴相関値Dj、および曲率特徴相
関値Cjは、次の数10,数11および数12により求
められる。なお、Nkは特徴量ベクトルDkの次元数を
示す。また、a・bはベクトルaとベクトルbの内積を
示し、符号を二重線で囲んでそのノルムを表わす。
[0019] Here, + normalized directional feature vectors of d 1 to d 8 of unknown pattern, like the normalized curvature feature vectors c, c - a, and directional feature vector of the dictionary j Are dj 1 to dj 8 , and the normalized curvature feature amount vectors are cj + and cj , the directional feature correlation value Dj of the unknown pattern and the dictionary, and the curvature feature correlation value Cj are It is obtained by the equations 11 and 12. Nk represents the number of dimensions of the feature amount vector Dk. Further, a · b represents the inner product of the vector a and the vector b, and the norm is represented by enclosing the code in double lines.

【0020】[0020]

【数10】 [Equation 10]

【数11】 [Equation 11]

【数12】 [Equation 12]

【0021】また、曲率特徴量ベクトルから曲線の分散
に比例する直線度αpが求められるので、方向性特徴相
関値Djおよび曲率特徴相関値Cjに直線度αpを掛け
て、総合相関値Tjを次の数13により求める。
Since the linearity αp proportional to the variance of the curve can be obtained from the curvature feature vector, the directional characteristic correlation value Dj and the curvature characteristic correlation value Cj are multiplied by the linearity αp to obtain the total correlation value Tj as follows. It is obtained by the equation 13.

【数13】 [Equation 13]

【0022】[0022]

【発明の効果】この発明によれば、従来の方向性特徴量
に加えて輪郭画素の曲率特徴量を導入し、直線を多く含
むパターンは従来の方向性特徴量の比重を重くし、曲線
を多く含むパターンに対しては方向性特徴量の比重を少
なくし、曲率特徴量を主体に曲率変化点位置(頂点等)
を特徴として認識するようにしたので、曲線の多いパタ
ーンに対しても高精度の認識が可能となる利点が得られ
る。
According to the present invention, the curvature feature amount of the contour pixel is introduced in addition to the conventional directional feature amount, and a pattern including a large number of straight lines increases the specific gravity of the conventional directional feature amount, and a curved line is formed. For patterns that contain many patterns, the gravity of directional features is reduced, and the curvature change point position (vertices, etc.) is mainly based on curvature features.
Since it is recognized as a feature, there is an advantage that a pattern with many curves can be recognized with high accuracy.

【図面の簡単な説明】[Brief description of drawings]

【図1】この発明の実施例を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of the present invention.

【図2】図1における認識手順を説明するためのフロー
チャートである。
FIG. 2 is a flowchart for explaining a recognition procedure in FIG.

【図3】図1における辞書作成手順を説明するためのフ
ローチャートである。
FIG. 3 is a flowchart for explaining a dictionary creation procedure in FIG.

【図4】文字画像例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of a character image.

【図5】方向の定義を説明するための説明図である。FIG. 5 is an explanatory diagram for explaining the definition of a direction.

【図6】局部メモリを説明するための説明図である。FIG. 6 is an explanatory diagram illustrating a local memory.

【図7】微分オペレータの例を説明するための説明図で
ある。
FIG. 7 is an explanatory diagram illustrating an example of a differential operator.

【図8】図4に対する微分画像例を説明するための説明
図である。
FIG. 8 is an explanatory diagram for explaining an example of a differential image with respect to FIG. 4.

【図9】輪郭画像の方向(角度)を説明するための説明
図である。
FIG. 9 is an explanatory diagram illustrating a direction (angle) of a contour image.

【図10】輪郭画像の曲率を説明するための説明図であ
る。
FIG. 10 is an explanatory diagram for explaining the curvature of a contour image.

【図11】曲率特徴量を説明するための説明図である。FIG. 11 is an explanatory diagram for explaining a curvature feature amount.

【符号の説明】[Explanation of symbols]

1…文書、2…イメージスキャナ、3…文字認識装置、
4…文字画像入力部、5…行切出し部、6…文字切出し
部、7…1文字認識ブロック、8…ノイズ除去部、9…
サイズ・複雑度計測部、10…曲率特徴量算出部、1
1,15…正規化部、12,16…相関演算部、13…
曲率特徴量辞書、14…方向性特徴量算出部、18…総
合類似度計算部、19…ソーティング部、20…後処理
部、21…認識結果。
1 ... document, 2 ... image scanner, 3 ... character recognition device,
4 ... Character image input section, 5 ... Line cutout section, 6 ... Character cutout section, 7 ... 1 character recognition block, 8 ... Noise removal section, 9 ...
Size / complexity measurement unit, 10 ... Curvature feature amount calculation unit, 1
1, 15 ... Normalization unit, 12, 16 ... Correlation calculation unit, 13 ...
Curvature feature amount dictionary, 14 ... Directional feature amount calculation unit, 18 ... Total similarity calculation unit, 19 ... Sorting unit, 20 ... Post-processing unit, 21 ... Recognition result.

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 方向性特徴量を用いて文字の認識を行な
う文字認識装置において、 1文字単位で切り出された文字画像から文字の縦横のサ
イズを抽出する文字サイズ抽出手段と、 文字画像の線密度から文字の複雑度を算出する複雑度算
出手段と、 4または8方向の2次元方向性特徴量を抽出する方向性
特徴量抽出手段と、 前記文字サイズと複雑度から輪郭画素の平滑化条件を決
定する平滑化条件決定手段と、 決定された平滑化条件で輪郭画素の曲率を求め、その絶
対値の大きい部分を特徴量として抽出する曲率特徴量抽
出手段と、 この曲率値の分散から文字を構成するストロークの直線
度を計算する直線度算出手段と、 前記方向性特徴量,曲率特徴量を用いて未知パターンと
辞書パターンとの間の各相関値を求め、これらに前記直
線度による重み係数をそれぞれ掛けて加算することによ
り総合類似度を求める総合類似度演算手段と、 を設け、この総合類似度から文字の認識を行なうことを
特徴とする文字認識装置。
1. A character recognizing device for recognizing a character using a directional characteristic amount, and a character size extracting means for extracting a vertical and horizontal size of a character from a character image cut out in units of one character, and a line of the character image. A complexity calculating means for calculating the complexity of a character from the density, a directional characteristic quantity extracting means for extracting a two-dimensional directional feature quantity in 4 or 8 directions, and a smoothing condition for contour pixels from the character size and complexity. The smoothing condition determining means for determining, the curvature feature amount extracting means for obtaining the curvature of the contour pixel under the determined smoothing condition, and extracting the portion having a large absolute value as the feature amount, the character from the variance of the curvature values. And a linearity calculation means for calculating the linearity of the strokes, and the correlation values between the unknown pattern and the dictionary pattern using the directional feature quantity and the curvature feature quantity, and the linearity That the overall similarity calculating means for calculating an overall similarity by weighting factors and adding the multiplied respectively, the provided character recognition apparatus characterized by performing the recognition of characters from the overall similarity.
JP5218280A 1993-09-02 1993-09-02 Character recognition device Pending JPH0773275A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5218280A JPH0773275A (en) 1993-09-02 1993-09-02 Character recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5218280A JPH0773275A (en) 1993-09-02 1993-09-02 Character recognition device

Publications (1)

Publication Number Publication Date
JPH0773275A true JPH0773275A (en) 1995-03-17

Family

ID=16717384

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5218280A Pending JPH0773275A (en) 1993-09-02 1993-09-02 Character recognition device

Country Status (1)

Country Link
JP (1) JPH0773275A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012147844A1 (en) * 2011-04-27 2012-11-01 日本電気株式会社 Recognition/search method for object/form, system for same, and program for same
CN116612483A (en) * 2023-07-19 2023-08-18 广州宏途数字科技有限公司 Recognition method and device for handwriting vector of intelligent pen

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012147844A1 (en) * 2011-04-27 2012-11-01 日本電気株式会社 Recognition/search method for object/form, system for same, and program for same
CN116612483A (en) * 2023-07-19 2023-08-18 广州宏途数字科技有限公司 Recognition method and device for handwriting vector of intelligent pen
CN116612483B (en) * 2023-07-19 2023-09-29 广州宏途数字科技有限公司 Recognition method and device for handwriting vector of intelligent pen

Similar Documents

Publication Publication Date Title
JP3027321B2 (en) Method and apparatus for online recognition of unrestricted handwritten alphanumeric characters
JP3453134B2 (en) How to determine equivalence of multiple symbol strings
US9418283B1 (en) Image processing using multiple aspect ratios
KR100658119B1 (en) Apparatus and Method for Recognizing Character
Liu et al. Handwriting recognition: soft computing and probabilistic approaches
US8588529B2 (en) Method and system for detecting text in raster images
EP2534612B1 (en) Efficient scale-space extraction and description of interest points
US20120002867A1 (en) Feature point generation system, feature point generation method, and feature point generation program
JPH06348896A (en) Segmenting method for character and device therefor
JP4300098B2 (en) Object identification method and apparatus, program and recording medium
CN111444907A (en) Character recognition method, device, equipment and storage medium
Mohammadpoor et al. A novel method for persian handwritten digit recognition using support vector machine
CN105844299A (en) Image classification method based on bag of words
JPH0773275A (en) Character recognition device
Lattner et al. A combination of machine learning and image processing technologies for the classification of image regions
JP2770849B2 (en) Image processing device
JP2909604B2 (en) Fuzzy pattern recognition method and apparatus
JP2576350B2 (en) String extraction device
Sadri et al. Statistical characteristics of slant angles in handwritten numeral strings and effects of slant correction on segmentation
Chen et al. Independent component analysis segmentation algorithm
KR940007935B1 (en) Candidate classification method for recognizing character
Mahmood et al. Surface Patch Detection of 3D Point Cloud Using Local Shape Descriptor
JPH08315155A (en) Graphic preprocessing unit
JPH0259507B2 (en)
JP2974167B2 (en) Large Classification Recognition Method for Characters