JP3250317B2

JP3250317B2 - Character recognition device

Info

Publication number: JP3250317B2
Application number: JP09319493A
Authority: JP
Inventors: 敏幸吉武; 東善裴; 美佐子諏訪; 晴雄秋元
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1993-04-20
Filing date: 1993-04-20
Publication date: 2002-01-28
Anticipated expiration: 2017-01-28
Also published as: JPH06309501A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to a character recognition device that recognizes handwritten and printed characters. In recent years, with the spread of information processing devices such as computers, character recognition technology that reads handwritten or printed characters entered on forms and the like has been developed as a means of input to these devices. In particular, because handwritten characters are subject to greater deformation, such as distortion and slant, than printed characters, highly accurate recognition technology is required.

[0002]

[Prior Art] A conventional character recognition device is described below with reference to the drawings. FIG. 10 is a block diagram of a conventional character recognition device. In FIG. 10, reference numeral 100 denotes a character image input unit, 101 a segment extraction unit, 102 a feature extraction unit, 103 a candidate determination unit, and 104 a dictionary.

[0003] In the conventional character recognition device, the character image input unit 100 first reads handwritten or printed characters entered at specific positions on a form or the like as an image and converts it into a black-and-white binary image. The segment extraction unit 101 then extracts character segments from the binary image, for example its contour lines. Next, the feature extraction unit 102 extracts geometric shape information of the character image as the feature quantity of the extracted contour lines. The candidate determination unit 103 receives this feature quantity, calculates its distance from the feature quantities of the dictionary segments stored in advance in the dictionary 104, and outputs the dictionary entry with the smallest distance as the recognition result.

[0004] FIG. 11 illustrates character recognition based on the geometric shape of a character image. As shown in FIG. 11(A), the contour of the character image "2" has, as geometric shapes, for example one concavity on the left side and one on the right side (the parts indicated by arrows in the figure), whereas the contour of the character image "3" shown in FIG. 11(B) has two concavities on the left side and one on the right side.
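The distance-based decision of the conventional device can be sketched as follows. This is only an illustration: the feature vector (left and right concavity counts), the Euclidean distance, and the name `nearest_category` are assumptions, since the source does not specify the feature representation or the distance measure.

```python
import math


def nearest_category(features, dictionary):
    """Return the category whose stored feature vector is closest to `features`.

    `dictionary` maps a category label to a feature vector of the same
    length as `features` (hypothetical layout, not from the source).
    """
    return min(dictionary, key=lambda cat: math.dist(features, dictionary[cat]))


# Illustrative dictionary: (concavities on the left, concavities on the right),
# using the counts given for "2" and "3" in FIG. 11.
CONCAVITY_DICTIONARY = {"2": (1, 1), "3": (2, 1)}
```

A character whose contour shows two left concavities and one right concavity would be assigned to "3" under these assumptions. A deformed "2" with an extra left concavity would be assigned to "3" as well, which is the weakness the invention addresses.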

[0005] In this way, geometric shapes that differ from one character image to another, such as the concavities and convexities of the contour, are used as feature quantities for identifying character images, and characters are recognized by focusing on the shapes peculiar to each category. The approach is therefore particularly useful for recognizing characters whose shapes differ only locally.

[0006]

[Problem to Be Solved by the Invention] However, when identifying which part of a dictionary segment stored in the dictionary a local shape of the character segment of the character to be recognized corresponds to, there is a problem that, particularly for heavily deformed characters, unless this association is performed efficiently, the recognition rate drops and the recognition processing becomes slower.

[0007] For example, when the dictionary segment has one concavity on the left side, as shown in FIG. 11(C), but the character image to be recognized has two concavities on the left side, as shown in FIG. 11(D), it is necessary to work out which of the two concavities corresponds to the concavity of the dictionary segment. In view of this, an object of the present invention is to provide a character recognition device that can recognize characters by accurately establishing the correspondence between character segments and dictionary segments.

[0008]

[Means for Solving the Problem] FIG. 1 is a block diagram showing the principle of the present invention. As shown in FIG. 1, the means for solving the above problem comprises: a dictionary 10 that stores segments of characters extracted from character images as dictionary segments and stores the features of each dictionary segment at reference points taken on that dictionary segment as dictionary feature information of the character image; correspondence determination means 11 that uses a DP matching method based on the cumulative value of the angle change to establish the correspondence between points on a character segment extracted from the character image of the character to be recognized and points on the dictionary segment; inspection point detection means 12 that, based on the correspondence obtained by the correspondence determination means 11, sets the points on the character segment corresponding to the reference points on the dictionary segment as inspection points and extracts, at those inspection points, the feature information of the character segment corresponding to the dictionary feature information of the dictionary segment at the reference points; and candidate determination means 13 that compares the feature information of the character segment extracted by the inspection point detection means 12 with the dictionary feature information of the corresponding reference points on the dictionary segment and thereby determines recognition candidates for the character image to be recognized.

[0009]

[Operation] According to the present invention, with the above configuration, the device examines which point on the character segment of the character image to be recognized each point on the dictionary segment corresponds to, and obtains the inspection points on the character segment that correspond to the predetermined reference points on the dictionary segment.

[0010] Then, at each inspection point obtained, the feature information of the character segment corresponding to the feature information at the reference point on the dictionary segment is calculated and compared with the feature information of the dictionary segment, and it is determined whether the dictionary segment under comparison is a recognition candidate for the character image to be recognized. Characters can thus be recognized easily, by efficiently comparing the feature information at the points on the dictionary segment that carry the recognition information characteristic of each category with the feature information at the corresponding points on the character segment of the character image to be recognized.

[0011]

[Embodiment] An embodiment of the present invention is described below with reference to the drawings. FIG. 2 is a block diagram of a character recognition device according to the embodiment. In FIG. 2, reference numeral 20 denotes a character image input unit, 21 a segment extraction unit, 22 a correspondence determination unit, 23 an inspection point detection unit, 24 a candidate determination unit, and 25 a dictionary.

[0012] The character image input unit 20 inputs characters written on a form or the like as a character image in the form of a black-and-white binary two-dimensional pattern. It consists of a scanner 201, which inputs the characters entered on the form as a character image (a two-dimensional pattern image), and a binarization unit 202, which converts the character image input from the scanner 201 into a black-and-white binary image.

[0013] The segment extraction unit 21 extracts character segments from the binary character image obtained by the character image input unit 20. It consists of a contour extraction unit 211, an end point detection unit 212, and a segment dividing unit 213. FIG. 4 illustrates the extraction of character segments. FIG. 4(A) is the binary image of a character image, in which the hatched area is the character portion represented by black pixels. FIG. 4(B) shows the contour line extracted from the binary image of FIG. 4(A), with the dotted line indicating the original binary image. FIG. 4(C) shows the contour line of FIG. 4(B) divided into segments (S1 and S2) and end point portions (T1 and T2).

[0014] The contour extraction unit 211 extracts the contour line of the character portion, represented by black pixels, in the binary character image obtained by the binarization unit 202 of the character image input unit 20. The end point detection unit 212 detects, from the extracted contour line, the end points corresponding to where the strokes of the character begin and end. FIG. 5 illustrates the detection of end points.

[0015] As shown in FIG. 5, an end point is a place where the contour line changes abruptly. Three points (A, B, C) are taken on the contour line (S) at fixed intervals, and an end point is obtained as the contour region in which the angle (θ) formed with the middle point (A) of the three as its vertex is at or below a predetermined value. The segment dividing unit 213 divides the contour line into segments at the detected end points and stores the segments extracted for each character image.
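The end point test described above can be sketched as follows. This is a minimal illustration under stated assumptions: the contour is a closed list of (x, y) points with more than `2 * step` entries, and the parameters `step` (spacing between the three points) and `max_angle_deg` (the "predetermined value") are hypothetical names whose values the source does not give.

```python
import math


def vertex_angle(prev_pt, vertex, next_pt):
    """Unsigned angle in degrees at `vertex` between the rays to the two neighbours."""
    ax, ay = prev_pt[0] - vertex[0], prev_pt[1] - vertex[1]
    bx, by = next_pt[0] - vertex[0], next_pt[1] - vertex[1]
    dot = ax * bx + ay * by
    cross = ax * by - ay * bx
    return abs(math.degrees(math.atan2(cross, dot)))


def find_end_points(contour, step, max_angle_deg):
    """Indices of contour points whose vertex angle is at or below the threshold.

    For each point, the neighbours `step` positions before and after it on
    the closed contour form the other two of the three points.
    """
    n = len(contour)
    hits = []
    for i in range(n):
        before = contour[(i - step) % n]
        after = contour[(i + step) % n]
        if vertex_angle(before, contour[i], after) <= max_angle_deg:
            hits.append(i)
    return hits
```

On the contour of a thin stroke, the angle stays near 180 degrees along the sides and collapses at the two tips, so only the tip points are reported. Cutting the contour at those runs of indices leaves the segments.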

[0016] The correspondence determination unit 22 obtains, for each character segment extracted from the character image, its correspondence with the dictionary segments stored in the dictionary 25. It consists of a representative point determination unit 221, an angle change detection unit 222, and a dynamic programming (DP) matching unit 223. The representative point determination unit 221 places the representative points used for DP matching at fixed distances along the character segment stored in the segment dividing unit 213 of the segment extraction unit 21. The angle change detection unit 222 obtains the angle formed by each set of three consecutive representative points and calculates, as the feature quantity at each representative point, the cumulative value of the angle change from the first representative point of the character segment to that representative point.
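Placing representative points at fixed distances along a segment can be sketched as a resampling by arc length. The function name `representative_points`, the polyline representation of the segment, and the choice to start at the first vertex are assumptions made for illustration; the source does not state how the distance is measured.

```python
import math


def representative_points(polyline, spacing):
    """Points every `spacing` units of arc length along `polyline`.

    `polyline` is a list of (x, y) vertices and `spacing` must be positive.
    The first vertex is always the first representative point.
    """
    result = [polyline[0]]
    need = spacing  # arc length still to travel before the next point
    for (x0, y0), (x1, y1) in zip(polyline, polyline[1:]):
        edge = math.hypot(x1 - x0, y1 - y0)
        pos = 0.0  # arc length already consumed on this edge
        while edge - pos >= need:
            pos += need
            t = pos / edge
            result.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
            need = spacing
        need -= edge - pos  # carry the remainder over to the next edge
    return result
```

Carrying the remainder across edges keeps the spacing constant even when the segment bends between two representative points.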

[0017] FIG. 6 illustrates the detection of the angle change. In FIG. 6, representative points (A, B, C, ...) are taken at arbitrary intervals on the contour line (S), a vector AB is drawn from point A to point B, and a vector BC is drawn from point B to point C. The angle (θ2) between vector AB and vector BC is the angle change at point B.
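The angle change at a representative point can be computed from the two vectors with a cross product and a dot product. The sketch below returns a signed angle (counter-clockwise positive); the sign convention is an assumption, since the source only speaks of the angle between the two vectors, and the name `angle_change` is hypothetical.

```python
import math


def angle_change(a, b, c):
    """Signed angle in degrees from vector AB to vector BC at point B."""
    abx, aby = b[0] - a[0], b[1] - a[1]
    bcx, bcy = c[0] - b[0], c[1] - b[1]
    cross = abx * bcy - aby * bcx
    dot = abx * bcx + aby * bcy
    return math.degrees(math.atan2(cross, dot))
```

Three collinear points give an angle change of zero, and a right-angle turn gives plus or minus 90 degrees depending on its direction.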

[0018] The angle change at point A, one end point of the contour line (S), serves as the initial value of the angle change and is defined as the angle (θ1) between the vector GA, drawn from the centroid (G) of the character to point A, and the vector AB. The feature quantity at each representative point is the accumulated angle change from point A, which carries this initial value, up to that representative point; for example, the feature quantity at point B is θ1 + θ2. The DP matching unit 223 performs one-dimensional DP matching between the representative points of the character segment selected by the representative point determination unit 221 and the representative points of the dictionary segment stored in the dictionary 25. Taking as the distance the absolute value of the difference between the feature quantity calculated by the angle change detection unit 222 and the feature quantity of a dictionary-segment representative point stored in the dictionary 25, the matching associates the representative points that are closest in distance.
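The cumulative feature and the one-dimensional DP matching can be sketched together as follows. Several details are assumptions rather than the source's method: angles are signed, the last representative point gets no feature because it has no outgoing vector, and the DP uses a common step pattern (diagonal, vertical, horizontal) with both ends of the two segments tied together. The names `cumulative_features` and `dp_match` are hypothetical.

```python
import math


def signed_angle(u, v):
    """Signed angle in degrees from vector u to vector v."""
    return math.degrees(math.atan2(u[0] * v[1] - u[1] * v[0],
                                   u[0] * v[0] + u[1] * v[1]))


def cumulative_features(points, centroid):
    """Cumulative angle change at each representative point except the last.

    The first term is the angle between the centroid-to-A vector and AB.
    """
    feats, total, prev = [], 0.0, centroid
    for cur, nxt in zip(points, points[1:]):
        incoming = (cur[0] - prev[0], cur[1] - prev[1])
        outgoing = (nxt[0] - cur[0], nxt[1] - cur[1])
        total += signed_angle(incoming, outgoing)
        feats.append(total)
        prev = cur
    return feats


def dp_match(char_feats, dict_feats):
    """Minimum-cost alignment as a list of (char_index, dict_index) pairs.

    The local distance is the absolute difference of the two features.
    Both lists must be non-empty.
    """
    n, m = len(char_feats), len(dict_feats)
    inf = float("inf")
    cost = [[inf] * m for _ in range(n)]
    back = [[None] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            d = abs(char_feats[i] - dict_feats[j])
            if i == 0 and j == 0:
                cost[i][j] = d
                continue
            best, arg = inf, None
            for pi, pj in ((i - 1, j - 1), (i - 1, j), (i, j - 1)):
                if pi >= 0 and pj >= 0 and cost[pi][pj] < best:
                    best, arg = cost[pi][pj], (pi, pj)
            cost[i][j] = d + best
            back[i][j] = arg
    path, node = [], (n - 1, m - 1)
    while node is not None:  # walk back from the end to (0, 0)
        path.append(node)
        node = back[node[0]][node[1]]
    path.reverse()
    return path
```

Because the path may repeat an index on either side, one dictionary point can be matched to several character points and vice versa, which is the many-to-many behaviour the later inspection point selection has to resolve.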

[0019] FIG. 7 illustrates the correspondence between segments. FIG. 7(A) shows a character segment, with white circles marking its representative points, and FIG. 7(B) shows a dictionary segment, with black circles marking its representative points. The arrows indicate which representative points on the two segments correspond. The inspection point detection unit 23 uses the correspondence obtained by the correspondence determination unit 22 to find inspection points on the character segment and to calculate inspection information. It consists of an inspection point determination unit 231 and an inspection information detection unit 232.

[0020] The inspection point determination unit 231 takes as the inspection points (c1, c2) the representative points on the character segment shown in FIG. 7(C) that correspond to the reference points (d1, d2) defined in advance on the dictionary segment shown in FIG. 7(D). Because DP matching can yield many-to-many correspondences, the selection depends on how many representative points are matched. When a single representative point on the character segment corresponds to a reference point on the dictionary segment, that representative point becomes the inspection point. When several representative points on the character segment correspond to one reference point, a single inspection point is chosen from among them according to a preset rule, such as the point nearest an end point or the middle one of the corresponding points.
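The selection rule for inspection points can be sketched as below. The input format (a list of `(char_index, dict_index)` pairs from the matching step), the function name `select_inspection_points`, and the rule names `"middle"` and `"first"` are assumptions; `"first"` stands in for "the point nearest an end point" by taking the match closest to the start of the segment.

```python
def select_inspection_points(path, reference_indices, rule="middle"):
    """Map each dictionary reference point index to one character point index.

    `path` lists the matched (char_index, dict_index) pairs and must
    contain at least one pair for every index in `reference_indices`.
    """
    matches = {}
    for char_idx, dict_idx in path:
        matches.setdefault(dict_idx, []).append(char_idx)

    inspection = {}
    for ref in reference_indices:
        candidates = sorted(matches[ref])
        if rule == "middle":
            inspection[ref] = candidates[len(candidates) // 2]
        elif rule == "first":
            inspection[ref] = candidates[0]
        else:
            raise ValueError(f"unknown rule: {rule}")
    return inspection
```

With a single matched point both rules return that point, so the one-to-one case needs no special handling.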

[0021] FIG. 8 is a flowchart of the inspection point determination process. In FIG. 8, the number (i) that designates a particular reference point among the plurality of reference points is first set to 1 (S80). The reference point di is selected from among the representative points on the dictionary segment (S81), and the representative point on the character segment corresponding to di is then determined as the inspection point ci (S82).

[0022] Next, it is determined whether any unprocessed reference points remain (S83). If none remain, the inspection point determination process ends. If some remain, the number designating the reference point is set to i + 1 (S84), the process returns to S81, and the steps S82 and S83 are repeated in the same way. The inspection information detection unit 232 calculates inspection information at the inspection points on the character segment. For a single inspection point, this is for example absolute position information indicating where the point lies within the character image as a whole; for two inspection points, relative position information such as the distance and direction between them; and for two or more inspection points, information such as the angle change or linearity between them.
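Two of the kinds of inspection information listed above can be sketched as follows. How the device actually encodes them is not stated in the source; normalizing the absolute position to the character's bounding box and expressing direction as an angle in degrees are assumptions, as are the function names.

```python
import math


def absolute_position(point, bbox):
    """Position of `point` relative to the character bounding box.

    `bbox` is (x_min, y_min, x_max, y_max) with non-zero width and height.
    Each returned coordinate runs from 0.0 to 1.0 inside the box.
    """
    x_min, y_min, x_max, y_max = bbox
    return ((point[0] - x_min) / (x_max - x_min),
            (point[1] - y_min) / (y_max - y_min))


def relative_position(p, q):
    """Distance from p to q and direction of q as seen from p, in degrees."""
    dx, dy = q[0] - p[0], q[1] - p[1]
    return math.hypot(dx, dy), math.degrees(math.atan2(dy, dx))
```

Normalizing by the bounding box makes the absolute position independent of the size at which the character was written.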

[0023] The candidate determination unit 24 determines the category of the input character image based on the inspection information at the inspection points on the character segment obtained by the inspection point detection unit 23. It consists of an inspection information judgment unit 241 and a recognition result output unit 242. The inspection information judgment unit 241 compares the inspection information calculated by the inspection information detection unit 232 of the inspection point detection unit 23 with the inspection information of the reference points on the dictionary segment stored in advance in the dictionary 25, and judges whether or not the inspection points on the character segment satisfy the judgment conditions stored in advance in the dictionary 25.

[0024] One example of a judgment condition, when the inspection information is the angle change along the character segment from inspection point c1 to inspection point c2 shown in FIG. 7(C), is that a character image whose character segment has an angle change of 60 degrees or more belongs to the category of the character candidate "2" of the matched dictionary segment. Based on the judgment against each dictionary segment, the recognition result output unit 242 determines the category of the dictionary segment whose judgment conditions are satisfied as the recognition result for the input image and outputs it.

[0025] FIG. 9 is a flowchart of the recognition candidate determination process. In FIG. 9, the number (j) that designates a particular item of inspection information among the plurality of items is first set to 1 (S90). Inspection information j is selected from among the items of inspection information at a given reference point di on the dictionary segment (S91), and inspection information j is then calculated at the inspection point ci on the character segment corresponding to that reference point di (S92).

[0026] Next, the calculated inspection information j is compared with the judgment condition for inspection information j at the reference point di on the dictionary segment to determine whether the condition is satisfied (S93). If it is not satisfied, it is concluded that the character segment does not belong to the category of this dictionary segment (S94); the next dictionary that has not yet undergone recognition candidate determination is then selected (S95), and the recognition candidate determination process for the dictionary currently being processed ends.

[0027] If the judgment condition is satisfied, it is determined whether any unprocessed inspection information remains (S96). If none remains, it is concluded that the character segment belongs to the category of this dictionary segment (S97), the recognition result is output (S98), and the recognition candidate determination process ends. If unprocessed inspection information remains, the number designating the inspection information is set to j + 1 (S99), the process returns to S91, and the steps S92 to S98 are repeated in the same way.
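The loop of FIG. 9 amounts to checking every item of inspection information and rejecting the dictionary segment at the first failed condition. The sketch below assumes each item is represented as a pair of callables, one that measures a value from the inspection points and one that tests it; this representation and the name `is_candidate` are illustrative, not taken from the source.

```python
def is_candidate(checks, inspection_points):
    """Return True only if every judgment condition is satisfied.

    `checks` is a sequence of (measure, condition) pairs:
    `measure(inspection_points)` computes one item of inspection
    information and `condition(value)` applies its judgment condition.
    """
    for measure, condition in checks:
        value = measure(inspection_points)  # S92: calculate the information
        if not condition(value):            # S93: compare with the condition
            return False                    # S94: not this category
    return True                             # S97: all conditions satisfied
```

The 60 degree example given earlier could be expressed as one check whose measure is the difference between the cumulative angle features at c2 and c1 and whose condition is `value >= 60.0`.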

[0028] The dictionary 25 stores, for each category, the information used to identify the category of an input character image: the dictionary segments, reference points, inspection information, and judgment conditions are stored in a dictionary segment storage unit 251, a reference point storage unit 252, an inspection point information storage unit 253, and a judgment condition storage unit 254, respectively. The dictionary 10, correspondence determination means 11, inspection point detection means 12, and candidate determination means 13 in FIG. 1 correspond respectively to the dictionary 25, correspondence determination unit 22, inspection point detection unit 23, and candidate determination unit 24 in FIG. 2.

[0029] The processing flow of the character recognition device configured as above is described next. FIG. 3 is a flowchart of the processing operation of the character recognition device according to the embodiment. First, the character image input by scanning a form or the like with the scanner 201 of the character image input unit 20 is converted into a black-and-white binary image by the binarization unit 202 (S30), and the segment extraction unit 21 extracts character segments from the binary image (S31).

[0030] Then, from the dictionary segments stored in the dictionary segment storage unit, an arbitrary dictionary segment that has not yet undergone correspondence processing with the character segment is selected (S32), and the correspondence determination unit 22 performs correspondence processing between the selected dictionary segment and the character segment (S33). When the correspondence processing is complete, the inspection point detection unit 23 performs inspection point determination processing, in which it determines the inspection points from among the representative points taken on the character segment and calculates the inspection information (S34).

[0031] Next, candidate determination processing is performed, in which the character segment and the dictionary segment are compared on the basis of the inspection information to judge whether the character segment matches the dictionary segment that is the recognition candidate (S35). It is then determined whether a candidate has been decided (S36); if so, the recognition result is output and the recognition processing for the input character image ends. If, on the other hand, the dictionary segment selected in S32 is not decided as a candidate, it is determined whether any unprocessed dictionary remains that has not undergone correspondence processing with the character segment (S37). If one remains, the process returns to S32, where a dictionary segment is selected, and the steps S33 to S36 are repeated in the same way.

[0032] If no unprocessed dictionary remains, the input character image is judged to be unrecognizable, a recognition result to that effect is output, and the recognition processing ends.
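The outer loop of FIG. 3 (S32 to S37) can be sketched as a search over dictionary entries that stops at the first accepted candidate. The three processing stages are passed in as callables so the sketch stays self-contained; the function name `recognize`, the callable signatures, and the `"category"` key are assumptions made for illustration.

```python
def recognize(char_segment, dictionary, match, pick_points, is_candidate):
    """Return the category of the first matching dictionary entry, or None.

    `match(char_segment, entry)` performs the correspondence processing,
    `pick_points(path, entry)` determines the inspection points, and
    `is_candidate(entry, points)` applies the entry's judgment conditions.
    """
    for entry in dictionary:                # S32: next unprocessed entry
        path = match(char_segment, entry)   # S33: correspondence processing
        points = pick_points(path, entry)   # S34: inspection points
        if is_candidate(entry, points):     # S35, S36: candidate decided?
            return entry["category"]
    return None                             # no entries left: unrecognizable
```

Returning `None` corresponds to the "unrecognizable" result that the device outputs when every dictionary entry has been tried without success.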

[0033]

[Effects of the Invention] As described above, according to the present invention, when the character image to be recognized is matched against the dictionary, the correspondence with the input character image is obtained for each dictionary being matched, and the inspection points at which feature information is compared are specified accordingly. This has the effect that a stable recognition rate can be obtained despite the deformations found in handwritten characters and the like.

[0034] In addition, by setting on the dictionary segment the locations to be matched for each category, and by calculating the feature information at the corresponding inspection points on the character segment to be recognized and making the recognition judgment from it, the recognition accuracy for similar characters can be greatly improved.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the principle of the present invention.

FIG. 2 is a block diagram of a character recognition device according to an embodiment of the present invention.

FIG. 3 is a flowchart showing the processing operation of the character recognition device according to the embodiment of the present invention.

FIG. 4 is a diagram illustrating the extraction of character segments.

FIG. 5 is a diagram illustrating the detection of end points.

FIG. 6 is a diagram illustrating the detection of the angle change.

FIG. 7 is a diagram illustrating the correspondence between segments.

FIG. 8 is a flowchart showing the inspection point determination process.

FIG. 9 is a flowchart showing the recognition candidate determination process.

FIG. 10 is a block diagram of a conventional character recognition device.

FIG. 11 is a diagram illustrating character recognition based on the geometric shape of a character image.

[Explanation of symbols]

10 ... Dictionary
11 ... Correspondence determination means
12 ... Inspection point detection means
13 ... Candidate determination means

Continuation of the front page

(72) Inventor: Haruo Akimoto, c/o Fujitsu Limited, 1015 Kamikodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa

(56) References:
JP-A-58-66176 (JP, A)
JP-A-3-282896 (JP, A)
JP-A-61-190682 (JP, A)
JP-A-61-198381 (JP, A)
JP-A-2-217981 (JP, A)
JP-A-58-163080 (JP, A)
Hirozo Yamada, "Contour DP Matching Method and Application to Handwritten Kanji Recognition," IEICE Transactions, Vol. J67-D, No. 3, pp. 351-358 (1984)

(58) Fields investigated (Int. Cl.7, DB name): G06K 9/46 - 9/62; JICST file (JOIS)

Claims

(57) [Claims]

1. A character recognition device comprising: a dictionary that stores, as a dictionary segment, a line segment obtained by cutting the end portions off a character contour line extracted from a character image, and that stores the features of the dictionary segment at reference points taken on the dictionary segment as dictionary feature information of the character image; correspondence determination means for establishing a correspondence between representative points on a character segment, which is a line segment obtained by cutting the end portions off a character contour line extracted from the character image of a character to be recognized, and representative points on the dictionary segment, by a DP matching method using the cumulative value of the angle change from a specific representative point on the character segment to each representative point; inspection point detection means for setting, based on the correspondence obtained by the correspondence determination means, a point on the character segment corresponding to a reference point on the dictionary segment as an inspection point, and for extracting the feature information of the character segment at the inspection point that corresponds to the dictionary feature information of the dictionary segment at the reference point; and candidate determination means for determining a recognition candidate for the character image to be recognized by comparing the feature information of the character segment extracted by the inspection point detection means with the dictionary feature information of the corresponding reference point on the dictionary segment.