JP2918666B2 - Text image extraction method

Info

Publication number: JP2918666B2
Application number: JP2253850A
Authority: JP
Inventors: 明子中島
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1990-09-21
Filing date: 1990-09-21
Publication date: 1999-07-12
Anticipated expiration: 2014-07-12
Also published as: JPH04130979A

Description

[Industrial field of application]

The present invention relates to a method of extracting character images from a document image input as a binary image in a character recognition system.

[Conventional technology]

In a character recognition system, lines are extracted from a document image input as a binary image by a scanner or similar device, and character images are extracted from each line for character recognition.

Methods of extracting character images fall broadly into two classes: those based on projection and those based on circumscribed rectangles of connected black pixels. With either approach, however, a single block of the projection or a single circumscribed rectangle does not necessarily correspond to the data of one character, so a technique is needed for combining or splitting those data to obtain the data of one character.

One conventionally known method (Japanese Unexamined Patent Publication No. 61-17670) extracts the circumscribed rectangles of connected black pixels (character elements), compares the width of a single circumscribed rectangle or of a combination of several circumscribed rectangles with a preset standard character width, and extracts a character image using, as the character frame, a circumscribed rectangle or combined region whose width can be regarded as that of one character.

Another known method treats a single circumscribed rectangle or a combined region as a character image, performs character recognition on it, and fixes the character image by evaluating the certainty of the result (Japanese Patent Application No. 133424 of 1988 (Showa 63), by the same applicant).

[Problems to be solved by the invention]

Characters that consist only of line segments perpendicular to the line direction, such as the kanji numerals "一", "二", "三" in a vertically written line or the kanji "川" in a horizontally written line, often do not have the width of a full character. Moreover, when such characters appear consecutively, a plausible recognition result is obtained wherever they are split.

Consequently, when such characters appear consecutively, the conventional methods described above are prone to extraction errors even if the widths of the preceding and following blanks are taken into account.

An object of the present invention is to provide a character image extraction method that can extract characters accurately even when character images such as kanji numerals, which consist only of line segments perpendicular to the line direction, appear consecutively.

[Means for solving the problem]

The present invention is a character image extraction method that obtains character frames by integrating character-element circumscribed rectangles extracted from a line image and extracts character images from the line image. The method is characterized in that it focuses on a character-element circumscribed rectangle whose width is smaller than a predetermined value, performs a condition determination on the lengths of the focused rectangle and the succeeding character-element circumscribed rectangles and on their width when integrated, and controls the integration of the focused rectangle with the succeeding rectangles according to the result, thereby obtaining the character frame of a specific character consisting only of line segments perpendicular to the line direction.

(Operation)

For a specific character consisting only of line segments perpendicular to the line direction, such as the kanji numerals "一二三" in vertical writing, each line segment of each character is extracted as a character element. The circumscribed rectangle of each such element is sufficiently narrow, and the lengths of the line segments stand in a ratio characteristic of the character. The width of the whole character is also characteristic of the character.

Therefore, according to the invention described above, taking the character string "一二三" in a vertically written line as an example, the characters can be extracted accurately as follows.

First, the character-element circumscribed rectangle of "一" is sufficiently narrow, so it becomes the focus. Its width when integrated with the next character-element circumscribed rectangle (the upper line segment of "二") is too large for a standard character width, because an inter-character space lies between them. The focused rectangle is therefore judged to belong to a different character from the next rectangle and is not integrated; the focused rectangle alone is used as the character frame to extract the image of "一".

Next, the character-element circumscribed rectangle corresponding to the upper line segment of "二" becomes the focus. Its width when integrated with the rectangle of the lower line segment is not excessive for a standard character width, because no inter-character space lies between them. In addition, the length of the focused rectangle (the length of the line segment) is shorter than that of the next rectangle. From these conditions the two rectangles are judged to constitute "二" and are integrated, and the integrated rectangle is used as the character frame to extract the image of "二".

Next, the character-element circumscribed rectangle corresponding to the top line segment of "三" becomes the focus. Its width when integrated with the next rectangle is not excessive for a standard character width, but the length relationship is the reverse of that for "二". The width when integrated up to the rectangle after that is still not excessive for a standard character width, and the last rectangle is longer than the middle one. From these conditions the three rectangles are judged to constitute "三" and are integrated, and the integrated rectangle is used as the character frame to extract the image of "三".

[Example]

FIG. 1 is a block diagram of a character recognition system according to one embodiment of the present invention.

The document image to be processed is input as binary image data from a scanner or similar device and stored in the original image memory 1. The character image cutout processing unit 2 extracts character images from the document image in the original image memory 1 and stores them in the character image memory 3. The character recognition processing unit 4 normalizes each character image, extracts features, and matches them against a dictionary to recognize the character, and stores the recognition result in the recognition result memory 5.

In the character image cutout processing unit 2, the line cutout unit 6 extracts the character elements of one line from the document image and stores the character element data in the character element data memory 7. For example, it searches for connections of black pixels in the document image, extracts each block of connected black pixels as a character element (a group of black pixels constituting a character or part of a character), and extracts the coordinates of the circumscribed rectangle of each character element (for example, the coordinates of two diagonal vertices) as character element data. It then obtains the horizontal and vertical distances between character-element circumscribed rectangles, extracts as one line a set of rectangles whose distances in both directions are smaller than a certain threshold, and stores that data in the character element data memory 7. Line extraction of this kind is more robust against document skew and the like than line extraction by projection in the line direction.
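The extraction of character elements described above can be illustrated with a minimal Python sketch. It is only one possible reading: the function name `circumscribed_rectangles`, the representation of the image as a list of rows of 0/1 values, and the use of 8-connectivity are assumptions made for illustration, not details given in the text.

```python
from collections import deque


def circumscribed_rectangles(image):
    """Return (x_min, y_min, x_max, y_max) for each block of connected 1-pixels.

    Assumption: pixels touching horizontally, vertically or diagonally
    (8-connectivity) belong to the same character element.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    rects = []
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first search over one connected block of black pixels.
            seen[y][x] = True
            queue = deque([(x, y)])
            x0 = x1 = x
            y0 = y1 = y
            while queue:
                cx, cy = queue.popleft()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            # Diagonal vertices of the circumscribed rectangle.
            rects.append((x0, y0, x1, y1))
    return rects
```

Each returned tuple holds the two diagonal vertices of one circumscribed rectangle, which corresponds to the character element data mentioned in the text. Grouping the rectangles into lines by their mutual distances is not shown here.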

The parameter setting unit 8 surveys all the character element data in the character element data memory 7, sets the relative parameters used for character extraction (character element integration and character pattern synthesis), and stores them in the parameter memory 9. Character element integration is performed several times; each time one pass finishes, the parameters are set again based on the corrected character element data in the character element data memory 7.

The parameters to be set are as follows.

(a) Line height (the distance from the bottom to the top of a horizontally written line, or from the left edge to the right edge of a vertically written line)

This is the maximum height of the character-element circumscribed rectangles constituting the line. Here, the height of a character-element circumscribed rectangle is the distance to the farthest side of the rectangle, measured from the bottom edge for a horizontally written line or from the left edge for a vertically written line.

(b) Standard character width (width means the size in the line direction)

A value slightly larger than the line height, for example [line height] + (dots).

(c) Minimum character spacing

The minimum spacing between adjacent character-element circumscribed rectangles. It is determined, however, only from character-element circumscribed rectangles having at least a certain width, for example half of [standard character width] or more.

Before the first character element integration pass, it is set to the following initial value.

[Initial value] = −[line height] / 30

(A negative value means that the character-element circumscribed rectangles overlap.)

(d) Maximum character width

The maximum width of the character-element circumscribed rectangles constituting the line.

(e) Knth

The maximum width of a character-element circumscribed rectangle that can be regarded as a character element consisting of only one line segment perpendicular to the line. It is set to, for example, [line height] / 8.
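As a rough illustration of parameters (a) to (e), the following Python sketch computes them for a horizontally written line. Several points are assumptions rather than statements of the text: the names `Rect`, `Params` and `set_parameters`; a y-axis that increases downward; measuring each rectangle's height from the bottom edge of the line; the hypothetical `margin_dots` offset for the standard character width (the text does not give its value); and the choice to use only gaps between two adjacent rectangles that are both at least half the standard character width.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Rect:
    xs: float  # start x (left), along the line direction
    xe: float  # end x (right)
    ys: float  # start y (top); y is assumed to grow downward
    ye: float  # end y (bottom)

    @property
    def width(self) -> float:
        return self.xe - self.xs


@dataclass
class Params:
    line_height: float
    std_width: float
    min_spacing: float
    max_width: float
    knth: float


def set_parameters(rects: List[Rect], first_pass: bool,
                   margin_dots: float = 2.0) -> Params:
    # (a) line height: largest distance from the line's bottom edge
    #     to the farthest (top) side of any rectangle.
    line_bottom = max(r.ye for r in rects)
    line_height = max(line_bottom - r.ys for r in rects)
    # (b) standard character width: slightly larger than the line height.
    #     margin_dots is a hypothetical value, not taken from the text.
    std_width = line_height + margin_dots
    # (c) minimum character spacing: initial value before the first pass,
    #     otherwise the smallest gap between adjacent wide rectangles.
    min_spacing = -line_height / 30
    if not first_pass:
        ordered = sorted(rects, key=lambda r: r.xs)
        gaps = [b.xs - a.xe for a, b in zip(ordered, ordered[1:])
                if a.width >= std_width / 2 and b.width >= std_width / 2]
        if gaps:  # assumption: keep the initial value if no pair qualifies
            min_spacing = min(gaps)
    # (d) maximum character width and (e) Knth.
    max_width = max(r.width for r in rects)
    knth = line_height / 8
    return Params(line_height, std_width, min_spacing, max_width, knth)
```

Calling `set_parameters(rects, first_pass=True)` gives the parameters for the first integration pass. Later passes would call it with `first_pass=False` on the corrected character element data.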

The character element integration unit 10 uses condition determinations based on the parameters in the parameter memory 9 to integrate, in order from the character element at the head of the line, the character-element circumscribed rectangles regarded as belonging to the same character, and corrects the character element data in the character element data memory 7. This character element integration processing consists of a normal processing mode and an exception processing mode.

First, the exception processing mode is described. FIG. 2 is a flowchart for explaining the exception processing mode. Here, w1 is the character width of the focused character element, w2 is the character width of the character element following the focused one, and w3 is the character width of the character element after that. L1 is the length of the focused character element, L2 is the length of the character element following the focused one, and L3 is the length of the character element after that. w1.2 is the character width when the focused character element and the next character element are integrated, and w1.2.3 is the character width when the focused character element and the two following character elements are integrated.

Specific characters such as "川" in a horizontally written line or "一二三" in a vertically written line are decomposed into character elements each consisting of a single line segment perpendicular to the line direction, with wide spacing between those elements. Such characters can be extracted in the normal processing mode if their character element spacing is sufficiently wider than that of other characters, but few documents satisfy that condition. The exception processing mode is a mode that performs character element integration so that such specific characters are extracted accurately.

The character width w1 of the focused character element 1 (the size of its circumscribed rectangle in the line direction) is compared with the parameter Knth (a step in the flowchart of FIG. 2). If w1 < Knth, the focused character element 1 is regarded as consisting of only one line segment perpendicular to the line direction, and the exception processing mode is entered.

On entering the exception processing mode, the character width w1.2 obtained when the focused character element 1 and the next character element 2 are integrated is first compared with the standard character width. If w1.2 ≥ [standard character width], the focused character element 1 is fixed as a standalone character element. For example, in the character string "一二三" in a vertically written line as shown in FIG. 3, when the first character element, "一", is the focus, w1.2 ≥ [standard character width], so it is fixed as a standalone character element.

If that comparison gives w1.2 < [standard character width], the length L1 of the focused character element 1 (the size of its circumscribed rectangle in the direction perpendicular to the line direction) is compared with the length L2 of the next character element 2. If L2 > L1, character elements 1 and 2 are integrated and fixed as one character element. For example, if the upper line segment of "二" shown in FIG. 3 is the focused character element 1, this condition is satisfied, so the two character elements constituting "二" are definitively integrated into one character element.

If instead L2 ≤ L1, the character width w1.2.3 obtained by integrating the focused character element 1, the next character element 2, and the following character element 3 is compared with the standard character width. If w1.2.3 < [standard character width], the length L2 of character element 2 is compared with the length L3 of character element 3, and if L3 > L2, character elements 1, 2, and 3 are definitively integrated. For example, if the top line segment of "三" shown in FIG. 3 is the focused character element 1, these conditions are satisfied, so the three character elements constituting "三" are definitively integrated.

As is clear from the above, for the character string shown in FIG. 3, which is difficult to extract in the normal processing mode, the character element integration result shown in FIG. 4 is obtained, so each character can be extracted correctly.
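The decisions of the exception processing mode can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the names `Element`, `merged_width` and `exception_mode` are hypothetical, each character element is reduced to its start and end coordinates along the line direction plus its length perpendicular to it, and the function returns `None` (meaning "leave it to the normal processing mode") in the cases the text does not cover, such as a missing following element or a failed final comparison.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Element:
    start: float   # start coordinate along the line direction
    end: float     # end coordinate along the line direction
    length: float  # size perpendicular to the line direction

    @property
    def width(self) -> float:
        return self.end - self.start


def merged_width(elems: List[Element]) -> float:
    """Width along the line direction of the rectangle enclosing all elems."""
    return max(e.end for e in elems) - min(e.start for e in elems)


def exception_mode(elems: List[Element], i: int,
                   knth: float, std_width: float) -> Optional[int]:
    """Return how many elements, starting at index i, form one character.

    1, 2 or 3 means the character frame is fixed; None means the exception
    mode reaches no decision (an assumption about cases the text omits).
    """
    e1 = elems[i]
    if e1.width >= knth:          # w1 < Knth is required to enter the mode
        return None
    if i + 1 >= len(elems):       # assumption: no following element
        return None
    e2 = elems[i + 1]
    if merged_width([e1, e2]) >= std_width:   # w1.2 >= standard width
        return 1                              # e.g. a standalone "一"
    if e2.length > e1.length:                 # L2 > L1
        return 2                              # e.g. "二"
    if i + 2 < len(elems):
        e3 = elems[i + 2]
        if (merged_width([e1, e2, e3]) < std_width   # w1.2.3 < standard width
                and e3.length > e2.length):          # L3 > L2
            return 3                                 # e.g. "三"
    return None


# Hypothetical strokes of "一二三" in a vertically written line.
example = [
    Element(20, 23, 36),                                               # 一
    Element(70, 73, 20), Element(92, 95, 36),                          # 二
    Element(140, 143, 28), Element(154, 157, 22), Element(170, 173, 36),  # 三
]
```

With the hypothetical values `knth=5` and `std_width=42`, calling `exception_mode(example, i, 5, 42)` for `i` equal to 0, 1 and 3 follows the three branches described for "一", "二" and "三" respectively.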

Next, the normal processing mode is described with reference to FIGS. 5 to 9. For convenience, a horizontally written line is assumed here.

1) If two adjacent character-element circumscribed rectangles a and b satisfy conditional expressions (1) and (2) simultaneously, they are integrated into one character-element circumscribed rectangle (see FIG. 5).

Wab ≤ [standard character width]    conditional expression (1)

where Wab is the character width of the rectangle obtained by integrating a and b.

Xsb − Xea < [minimum character spacing]    conditional expression (2)

where Xsb is the x-coordinate of the start point of b and Xea is the x-coordinate of the end point of a.

Conditional expression (1) states that the width Wab of the integrated character-element circumscribed rectangle does not exceed [standard character width], and conditional expression (2) states that the gap Gab between a and b is smaller than [minimum character spacing].

However, when conditional expression (3) applies, the rectangles are integrated even if the standard character width is exceeded.

Xeb < Xea    conditional expression (3)

where Xeb is the x-coordinate of the end point of b. As for the positional relationship between a and b, a is on the left (toward the head of the line) and b is on the right.

That is, as shown in FIG. 6, two completely overlapping character-element circumscribed rectangles a and b, for which Dab < 0, are integrated even if Wab ≥ [standard character width].
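Rule 1) of the normal processing mode can be sketched as a single predicate. The sketch below is illustrative only: the names `Rect` and `should_integrate` are hypothetical, only x-coordinates are modelled, and conditional expression (3) is taken to mean that b ends before a ends (Xeb < Xea), which is an assumed reading of the complete-overlap case in FIG. 6.

```python
from dataclasses import dataclass


@dataclass
class Rect:
    xs: float  # x-coordinate of the start point (left side)
    xe: float  # x-coordinate of the end point (right side)


def should_integrate(a: Rect, b: Rect,
                     std_width: float, min_spacing: float) -> bool:
    """Decide whether adjacent rectangles a (left) and b (right) are integrated."""
    # Conditional expression (3), assumed reading: b lies entirely within
    # the x-range of a, so integrate regardless of the resulting width.
    if b.xe < a.xe:
        return True
    # Conditional expression (1): width of the integrated rectangle.
    wab = max(a.xe, b.xe) - min(a.xs, b.xs)
    # Conditional expression (2): gap between a and b.
    gap = b.xs - a.xe
    return wab <= std_width and gap < min_spacing
```

For example, with a standard character width of 34 and a minimum character spacing of 3, `should_integrate(Rect(0, 10), Rect(11, 20), 34, 3)` is true because both expressions (1) and (2) hold, while a pair separated by a gap of 5 is left unintegrated.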

2) When conditional expression (1) in 1) above is satisfied but conditional expression (2) is not, if the right-hand character-element circumscribed rectangle (call it b) satisfies all of the following conditional expressions (4) to (7), it is regarded as the circumscribed rectangle of a dakuten (voiced sound mark) or handakuten (semi-voiced sound mark) and is integrated as an exception (see FIG. 7).

Wb ≤ fth    conditional expression (4)
Hb ≤ fth    conditional expression (5) (Hb is the height of b)
Yeb − Ysa ≤ fth    conditional expression (6) (Yeb is the y-coordinate of the end point of b, Ysa is the y-coordinate of the start point of a)
Xsb − Xea < [minimum character spacing] + α    conditional expression (7)

where fth (threshold) = [line height] / β. α and β are empirical values, chosen as, for example, α = 2 and β = 3.

3) Even when conditional expressions (1) and (2) in 1) above are satisfied, if the right-hand character-element circumscribed rectangle (call it b) satisfies the following conditional expressions (8) to (10), it is regarded as a punctuation mark and, as an exception, is not integrated (see FIG. 8).

Wb ≤ fth    conditional expression (8)
Hb ≤ fth    conditional expression (9)
base − Yeb ≤ fth / γ    conditional expression (10)

where base is the y-coordinate of the baseline and γ is an empirical value, chosen as, for example, 4.
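The punctuation test of conditional expressions (8) to (10) can be sketched in the same way. The names `Rect` and `is_punctuation` are hypothetical, the y-axis is assumed to increase downward, Hb is taken as the rectangle's own height, and the divisor in expression (10) is treated as an empirical parameter `gamma` with the example value 4. That last point is an interpretation of the text.

```python
from dataclasses import dataclass


@dataclass
class Rect:
    xs: float  # start x (left)
    xe: float  # end x (right)
    ys: float  # start y (top); y is assumed to grow downward
    ye: float  # end y (bottom)


def is_punctuation(b: Rect, line_height: float, base: float,
                   beta: float = 3.0, gamma: float = 4.0) -> bool:
    """Check whether b looks like a punctuation mark sitting on the baseline."""
    fth = line_height / beta
    return (b.xe - b.xs <= fth             # (8) b is narrow
            and b.ye - b.ys <= fth         # (9) b is short
            and base - b.ye <= fth / gamma)  # (10) b's bottom is near the baseline
```

A rectangle for which `is_punctuation` is true would be kept separate even if expressions (1) and (2) hold, matching the exception stated in 3).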

Each time the integration processing in the normal processing mode described above has been completed once for the character element data of one line (excluding character elements fixed in the exception processing mode), the parameter setting unit 8 sets the parameters again based on the corrected character element data and rewrites the parameter memory 9. Using the rewritten parameters, the character element integration unit 10 executes the normal processing mode again.

In this way the character element integration processing is repeated while the parameters are dynamically corrected. The repetition ends, for example, when the parameters no longer change or when the amount of change in the parameters falls to or below a certain threshold.
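The repetition described above amounts to a fixed-point loop, which might be sketched as follows. The function name `integrate_until_stable`, the representation of the parameters as a tuple of numbers, the two callables passed in, and the `max_passes` safety limit are all assumptions made for this illustration.

```python
from typing import Callable, List, Sequence, Tuple


def integrate_until_stable(
        elements: List,
        set_parameters: Callable[[List, bool], Sequence[float]],
        integrate_once: Callable[[List, Sequence[float]], List],
        tolerance: float = 0.0,
        max_passes: int = 20) -> Tuple[List, Sequence[float]]:
    """Repeat integration passes until the parameters stop changing.

    set_parameters(elements, first_pass) returns a tuple of numbers;
    integrate_once(elements, params) returns the corrected element list.
    max_passes is a hypothetical guard against endless repetition.
    """
    params = set_parameters(elements, True)
    for _ in range(max_passes):
        elements = integrate_once(elements, params)
        new_params = set_parameters(elements, False)
        # Largest change in any single parameter since the previous pass.
        change = max(abs(n - p) for n, p in zip(new_params, params))
        params = new_params
        if change <= tolerance:
            break
    return elements, params
```

With `tolerance=0.0` the loop stops when the parameters no longer change. A positive tolerance corresponds to stopping when the change falls to or below a threshold. Character elements fixed in the exception processing mode would be kept out of `elements` by the caller.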

FIG. 9 shows an example of character element integration in the normal processing mode for a horizontally written line. From the line image shown in (a), the line cutout unit 6 extracts the character elements shown in (b). These character elements are integrated as shown in (c) by the first integration pass and as shown in (d) by the second integration pass.

At the stage where integration processing is complete, as shown in (d), most character-element circumscribed rectangles correspond to the circumscribed rectangle (character frame) of a single character. That is, the circumscribed rectangles of most characters have been generated. However, character elements that should have been integrated may remain separated, as with "小" in this example.

Because such incompletely integrated character elements may remain, the character pattern synthesis unit 11 of the character image cutout processing unit 2 does not simply extract character images from the original image using each character-element circumscribed rectangle after integration as a character frame. Instead, it combines consecutive character-element circumscribed rectangles to the extent that [maximum character width] (stored in the parameter memory) is not exceeded and, using each combined rectangle (including single character-element circumscribed rectangles) as a character frame, extracts character image candidates from the original image and outputs them to the character image memory 3. For character elements fixed in the exception processing mode of the character element integration processing, however, only the character image of that element alone is extracted.
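The generation of character image candidates can be illustrated with a short Python sketch. The function name `candidate_frames`, the representation of each rectangle as a `(start, end)` pair along the line direction, and the `fixed` set of indices for elements fixed in the exception processing mode are assumptions made for this example.

```python
from typing import FrozenSet, List, Tuple


def candidate_frames(rects: List[Tuple[float, float]],
                     max_width: float,
                     fixed: FrozenSet[int] = frozenset()
                     ) -> List[Tuple[int, int]]:
    """List candidate character frames as (first_index, last_index) pairs.

    rects must be sorted along the line direction. Every single rectangle
    is a candidate; runs of consecutive rectangles are added while their
    combined width does not exceed max_width. Rectangles whose index is in
    `fixed` are only ever used alone.
    """
    candidates = []
    for i in range(len(rects)):
        candidates.append((i, i))
        if i in fixed:
            continue
        for j in range(i + 1, len(rects)):
            combined_width = rects[j][1] - rects[i][0]
            if j in fixed or combined_width > max_width:
                break
            candidates.append((i, j))
    return candidates
```

Each returned pair identifies a run of rectangles whose enclosing rectangle would serve as one candidate character frame. Overlapping candidates are expected and are resolved later by the recognition step.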

The character recognition processing unit 4 performs character recognition on the character image candidates stored in the character image memory 3. For character image candidates that share character elements, it evaluates each recognition result, selects the candidate with the highest certainty as a character (that is, fixes the character extraction from the recognition results), and outputs only that recognition result to the recognition result memory 5.

The method of recognizing combinations of character elements and fixing the character extraction by evaluating the recognition results is described in detail in the specification and drawings of the aforementioned Japanese Patent Application No. 133424 of 1988 (Showa 63).

[Effects of the invention]

As described in detail above, according to the present invention, accurate character image extraction is possible even when specific characters consisting only of line segments perpendicular to the line direction (for example, the kanji numerals "一二三" in a vertically written line) appear consecutively.

[Brief description of the drawings]

FIG. 1 is a block diagram of a character recognition system according to one embodiment of the present invention. FIG. 2 is a flowchart for explaining the exception processing mode of character element integration. FIG. 3 shows a character string that is a target of the exception processing mode. FIG. 4 shows the result of integration processing for the character string shown in FIG. 3. FIGS. 5 to 8 are explanatory diagrams of the integration conditions for the normal processing mode of character element integration. FIG. 9 shows an example of character element integration.

1: original image memory, 2: character image cutout processing unit, 3: character image memory, 4: character recognition processing unit, 5: recognition result memory, 6: line cutout unit, 7: character element data memory, 8: parameter setting unit, 9: parameter memory, 10: character element integration unit, 11: character pattern synthesis unit.

Claims

(57) [Claims]

A character image extraction method in which character frames are obtained by integrating character-element circumscribed rectangles extracted from a line image and character images are extracted from the line image, characterized in that attention is paid to a character-element circumscribed rectangle whose width is smaller than a predetermined value, a condition determination is performed on the lengths of the focused character-element circumscribed rectangle and the succeeding character-element circumscribed rectangles and on the width when they are integrated, and the integration of the focused character-element circumscribed rectangle with the succeeding character-element circumscribed rectangles is controlled according to the result of the condition determination, thereby obtaining the character frame of a specific character consisting only of line segments perpendicular to the line direction.