JPH0573718A - Area attribute identifying system - Google Patents

Area attribute identifying system

Info

Publication number
JPH0573718A
JPH0573718A JP3233518A JP23351891A JPH0573718A JP H0573718 A JPH0573718 A JP H0573718A JP 3233518 A JP3233518 A JP 3233518A JP 23351891 A JP23351891 A JP 23351891A JP H0573718 A JPH0573718 A JP H0573718A
Authority
JP
Japan
Prior art keywords
rectangle
area
character
coupling counter
circumscribing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP3233518A
Other languages
Japanese (ja)
Other versions
JP3095470B2 (en
Inventor
Yumiko Ikemure
由美子 池牟禮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP03233518A priority Critical patent/JP3095470B2/en
Publication of JPH0573718A publication Critical patent/JPH0573718A/en
Application granted granted Critical
Publication of JP3095470B2 publication Critical patent/JP3095470B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)
  • Character Input (AREA)

Abstract

PURPOSE:To decide a character assembling direction even for a document with narrow character gap and to automatically perform character recognition by detecting the connecting direction of a character from distance information between circumscribing rectangles of connection of black picture elements. CONSTITUTION:A character assembling direction decision part 10 checks the connecting states of all the black picture elements in an area decided as a character area, and detects the coordinates of the rectangular circumscribing to a cluster in which the black picture elements are connected. A circumscribing rectangle nearest to the circumscribing rectangle is detected for each circumscribing rectangle, and the position relation of it is detected. When it is connected in the horizontal direction, a horizontal direction coupling counter is counted by one, and when it is connected in the vertical direction, a vertical direction coupling counter is counted by one. When the processing of all the circumscribing rectangulars are completed, the value of the horizontal direction coupling counter is compared with that of the vertical direction coupling counter, and it is decided that the area is a horizontal writing area when the value of the horizontal direction coupling counter is higher, and that it is a vertical writing area when the value of the vertical direction coupling counter is higher.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、文字・図形などが混在
する印刷文書を文字・図形などの領域の属性に応じた認
識を行うことが可能な文字認識装置の前処理に関するも
のである。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a preprocessing of a character recognition device capable of recognizing a print document in which characters and / or graphics are mixed according to the attributes of areas such as characters and / or graphics.

【0002】[0002]

【従来の技術】従来の領域属性識別方式について以下説
明する。まず、スキャナによって取り込まれた2値デー
タの白画素情報を基に領域を分割する。そして、黒画素
の水平/垂直のヒストグラムおよび外接矩形から分割さ
れた領域が、文字領域であるか、図形領域であるか、ま
たは、写真領域であるかを判定する。判定された領域が
図形であれば図形のベクトル化を行い、写真領域であれ
ば、画像領域に対してデータ圧縮処理を行う。判定され
た領域が文字領域の場合には、さらに、文字の組み方向
を縦書き領域か、横書き領域かを決定した後、縦組み文
書/横組み文書に応じた文字切り出し処理を経て、文字
認識処理を行う。
2. Description of the Related Art A conventional area attribute identification method will be described below. First, the area is divided based on the white pixel information of the binary data captured by the scanner. Then, it is determined whether the area divided from the horizontal / vertical histogram of black pixels and the circumscribed rectangle is a character area, a graphic area, or a photographic area. If the determined area is a figure, vectorization of the figure is performed, and if it is a photographic area, data compression processing is performed on the image area. If the determined area is a character area, it is further determined whether the writing direction of the character is a vertical writing area or a horizontal writing area, and character recognition is performed through character cutting processing according to the vertical writing document / horizontal writing document. Perform processing.

【0003】従来の文字組み方向決定では、以下の処理
を行っていた。文字領域と判定された領域に対して、水
平/垂直方向それぞれのヒストグラムをとり、各ヒスト
グラムのピーク値を検出し、ピークの距離を求める。こ
の距離は、文字サイズ+行間、あるいは、文字サイズ+
文字間であり、行間は文字間よりも大きいという特徴か
ら、水平方向のヒストグラムのピーク間距離の平均と垂
直方向の平均値を比較して、小きい方向を文字組み方向
とする。
The following processes are performed in the conventional determination of the character set direction. Histograms in the horizontal and vertical directions are taken for the area determined to be the character area, the peak value of each histogram is detected, and the peak distance is obtained. This distance is character size + line spacing or character size +
Since it is between characters and the line spacing is larger than the character spacing, the average of the peak-to-peak distances of the histogram in the horizontal direction and the average value in the vertical direction are compared, and the smaller direction is defined as the character-set direction.

【0004】[0004]

【発明が解決しようとする課題】従来の領域属性識別方
式では、文字間隔の狭い新聞などの文書に対してヒスト
グラムをとった場合、文字組み方向でない方向のヒスト
グラムはなだらかな形状となる。したがって、ヒストグ
ラムのピーク値を正確に検出することは困難となり、文
字組み方向を正しく検出することができないといった問
題を有していた。
In the conventional area attribute identification method, when a histogram is taken for a document such as a newspaper where the character spacing is narrow, the histogram in the direction other than the character set direction has a gentle shape. Therefore, it is difficult to accurately detect the peak value of the histogram, and there is a problem that it is not possible to correctly detect the direction of a character set.

【0005】本発明は、前記問題点を解決するためなさ
れたもので、文字間隔の狭い文書についても文字組み方
向の決定が可能となる領域属性識別方式を提供すること
を目的としてなされたものである。
The present invention has been made to solve the above-mentioned problems, and has been made with the object of providing an area attribute identification method capable of determining the direction of a character set even for a document having a narrow character space. is there.

【0006】[0006]

【課題を解決するための手段】本発明は、前記問題点を
解決するため、2値化された認識対象文書の領域分割を
行い、分割された領域の属性を識別し、認識を行う文字
認識装置において、被認識文書を2値データに変換する
手段と、2値データの白画素情報から領域を分割する手
段と、分割された各領域について領域内の黒画素情報か
ら領域の属性を識別する手段と、識別結果が文字領域の
場合に、文字のつながり方向を黒画素連結の外接矩形間
の距離情報から検出する手段を備え、自動で文字組み方
向を検出して文字認識を行う。
SUMMARY OF THE INVENTION In order to solve the above problems, the present invention performs character segmentation in which a binarized recognition target document is divided into regions, the attributes of the divided regions are identified, and recognition is performed. In the apparatus, means for converting a recognized document into binary data, means for dividing an area from white pixel information of the binary data, and identification of an attribute of the area from black pixel information in the area for each divided area. And means for detecting the connecting direction of the characters from the distance information between the circumscribing rectangles of the black pixel connection when the identification result is the character area, and the character recognition is performed by automatically detecting the character setting direction.

【0007】[0007]

【作用】本発明は、前記の構成により、分割された領域
の属性を識別し、文字領域と判定された領域に対して、
領域内のすべての黒画素の連結状態を調べ、黒画素が連
結しているかたまりに外接する矩形の座標を検出して、
外接矩形を昇順に格納する。
According to the present invention, with the above configuration, the attributes of the divided areas are identified, and the area determined to be the character area is
Check the connection state of all black pixels in the area, detect the coordinates of the rectangle circumscribing the lump where the black pixels are connected,
Store the circumscribed rectangles in ascending order.

【0008】つぎに、昇順に格納された外接矩形につい
て順番に以下の処理を行う。(この外接矩形は他矩形と
の距離を検出する基点となるので以下は基点矩形と呼
ぶ)基点矩形から最も近い距離の矩形を検索する範囲を
基点矩形毎に決定する。その範囲内にある矩形(以下、
対象矩形とする)と基点矩形との距離を算出して、最も
短い距離の矩形を決定する。決定された最小距離矩形と
基点矩形との位置関係を調べ、水平方向に位置していれ
ば水平方向結合カウンタを1つすすめる。また、垂直方
向に位置していれば、垂直方向結合カウンタを1つすす
める。
Next, the following processing is performed in order on the circumscribed rectangles stored in ascending order. (Because this circumscribed rectangle is a base point for detecting the distance from other rectangles, it will be referred to as a base point rectangle hereinafter.) The range in which the rectangle having the closest distance from the base point rectangle is searched is determined for each base point rectangle. A rectangle within that range (below,
The distance between the target rectangle) and the base rectangle is calculated, and the rectangle with the shortest distance is determined. The positional relationship between the determined minimum distance rectangle and the base point rectangle is examined, and if it is located in the horizontal direction, one horizontal coupling counter is recommended. Also, if it is located in the vertical direction, one vertical coupling counter is recommended.

【0009】すべての外接矩形について処理が終了した
ら、水平方向結合カウンタと垂直方向結合カウンタの値
を比較する。水平方向結合カウンタが多い場合は横書き
領域と判定され、垂直方向結合カウンタが多い場合は縦
書き領域と判定される。
When the processing is completed for all the circumscribed rectangles, the values of the horizontal direction coupling counter and the vertical direction coupling counter are compared. If there are many horizontal coupling counters, it is determined to be a horizontal writing area, and if there are many vertical coupling counters, it is determined to be a vertical writing area.

【0010】[0010]

【実施例】本発明の一実施例について図面を参照して説
明する。図1は、本発明の一実施例における領域属性識
別を実行する装置ブロックを示したものである。図1に
おいて、1は領域属性識別プログラムが格納されている
ROMである。2は領域属性識別を行うCPUであっ
て、図2の領域分割部6、領域判定部7、図形ベクトル
化処理部8、画像圧縮部9、文字組み方向決定部10、
文字切り出し部11、文字認識部12を有する。3はス
キャナ4によって取り込まれた2値データを格納するR
AMである。5はCPUによって識別された領域の認識
結果を表示する表示装置である。
An embodiment of the present invention will be described with reference to the drawings. FIG. 1 shows a device block for performing area attribute identification in one embodiment of the present invention. In FIG. 1, reference numeral 1 is a ROM in which an area attribute identification program is stored. Reference numeral 2 denotes a CPU for identifying a region attribute, which includes a region dividing unit 6, a region determining unit 7, a graphic vectorization processing unit 8, an image compressing unit 9, a character framing direction determining unit 10 in FIG.
It has a character cutout unit 11 and a character recognition unit 12. 3 is an R for storing binary data captured by the scanner 4
AM. A display device 5 displays the recognition result of the area identified by the CPU.

【0011】6の領域分割部では、認識を行う文書を2
値データとしてスキャナ4から取り込み、RAM3に格
納する。取り込んだ2値データから水平方向の空白帯
と、垂直方向の空白帯を検出し、検出された空白帯によ
って囲まれている範囲内を1領域として、その領域の左
上と右下の座標値をRAM3に格納する。
In the area dividing unit 6 there are 2 documents to be recognized.
The value data is fetched from the scanner 4 and stored in the RAM 3. The horizontal blank band and the vertical blank band are detected from the captured binary data, and the area surrounded by the detected blank band is defined as one area, and the coordinate values of the upper left and lower right of the area are set. Store in RAM3.

【0012】7の領域判定部では、分割された領域が写
真領域か、図形領域か、それ以外かを判定する。文字領
域と判定された領域については、文字組み方向決定部1
0で文字組み方向を決定し、文字切り出し部11で文字
組み方向に応じた文字切り出し処理を行い、文字認識部
12において認識処理を行う。
The area determination unit 7 determines whether the divided area is a photographic area, a graphic area, or another area. For the area determined to be the character area, the character embedding direction determination unit 1
The character set direction is determined by 0, the character cutout unit 11 performs character cutout processing according to the character set direction, and the character recognition unit 12 performs recognition processing.

【0013】以下、文字組み方向決定について図3のフ
ローチャートと、図4、図5の一領域例を参照しながら
説明する。文字領域内のすべての黒画素の連結状態を調
べ、黒画素が連結しているかたまりに外接する矩形の座
標の(x1,y1,x2,y2)の値を検出して(図4
を参照)、外接矩形のy1昇順に格納する(ステップ
1,2)。図5(b) の例では、外接矩形1の座標は
(4,4,14,12)外接矩形2の座標は(21,
4,31,16)となり、例えば、矩形1,矩形2、・
・・矩形6の順に格納される(但し、y1を昇順に格納
するため矩形1,矩形2が逆になることがある)。ま
た、文字方向を決定するための水平方向結合カウンタy
oko、垂直方向結合カウンタtateを0クリアする
(ステップ3)。
The determination of the direction of the letter combination will be described below with reference to the flow chart of FIG. 3 and the region examples of FIGS. 4 and 5. The connection state of all the black pixels in the character area is checked, and the value of (x1, y1, x2, y2) of the coordinates of the rectangle circumscribing the block in which the black pixels are connected is detected (see FIG. 4).
), And the y1 of the circumscribing rectangle is stored in ascending order (steps 1 and 2). In the example of FIG. 5B, the circumscribed rectangle 1 has coordinates (4, 4, 14, 12) and the circumscribed rectangle 2 has coordinates (21,
4, 31, 16), for example, rectangle 1, rectangle 2, ...
.. are stored in the order of rectangle 6 (however, since y1 is stored in ascending order, rectangle 1 and rectangle 2 may be reversed). Also, a horizontal combination counter y for determining the character direction.
ok, the vertical combination counter tate is cleared to 0 (step 3).

【0014】まず、昇順に格納された1番目の外接矩形
を基点矩形とする(ステップ4)。基点矩形から最も近
い距離を検索する対象矩形の範囲は、対象矩形のy1の
値が以下の値よりも小さい場合である。
First, the first circumscribed rectangle stored in ascending order is set as a base point rectangle (step 4). The range of the target rectangle in which the closest distance from the base point rectangle is searched is when the value of y1 of the target rectangle is smaller than the following value.

【0015】 基点矩形のy2+基点矩形の幅と高さの短い方の1/2 図5(b) の矩形1の幅は11(14−4+1)、高さは
9(12−4+1)であるので検索範囲は12+9/2
=16となり、矩形のy1の値が16よりも小さい矩形
は矩形2と矩形3である。したがって、矩形1を基点矩
形とした場合の検索対象矩形は矩形2と矩形3の2つと
なる(ステップ5)。
Y2 of the base point rectangle + 1/2 of the shorter width and height of the base point rectangle The width of the rectangle 1 in FIG. 5B is 11 (14-4 + 1) and the height is 9 (12-4 + 1). So the search range is 12 + 9/2
= 16, and the rectangles whose y1 values are smaller than 16 are rectangle 2 and rectangle 3. Therefore, there are two search target rectangles, rectangle 2 and rectangle 3, when rectangle 1 is the base rectangle (step 5).

【0016】最小距離矩形を検出するため、最小距離矩
形minBox、最小距離minDisの値をクリアす
る(ステップ6)。対象矩形のy1の値が範囲内に入っ
ているか否かチェックして、範囲内に入っていれば以下
の処理を行い、入っていなければ、1基点矩形に対する
距離検索終了と見なし、ステップ14へジャンプする。
矩形1を基点矩形とした場合は、対象矩形が矩形4とな
った時にジャンプする(ステップ7)。
In order to detect the minimum distance rectangle, the values of the minimum distance rectangle minBox and the minimum distance minDis are cleared (step 6). It is checked whether the value of y1 of the target rectangle is within the range, and if it is within the range, the following processing is performed. If not, it is considered that the distance search for one base point rectangle is completed, and the process proceeds to step 14. To jump.
When the rectangle 1 is the base rectangle, the process jumps when the target rectangle becomes the rectangle 4 (step 7).

【0017】次に対象矩形のy1の値と基点矩形のy2
の値を比較する(ステップ8)。対象矩形y1の値が基
点矩形y2よりも大きい場合は垂直方向結合のチェック
処理(ステップ9)へ進み、それ以外の場合は水平方向
結合のチェック処理(ステップ11)へ進む。矩形2は
水平方向処理部へ進み、矩形3は垂直方向処理部へ進
む。
Next, the value of y1 of the target rectangle and y2 of the base point rectangle
Are compared (step 8). If the value of the target rectangle y1 is larger than the base point rectangle y2, the process goes to the check process for vertical direction connection (step 9), and if not, the process goes to the check process for horizontal direction connection (step 11). Rectangle 2 advances to the horizontal processing unit, and rectangle 3 advances to the vertical processing unit.

【0018】(ステップ9)では、次の条件を満たせば
最小距離矩形の候補となる。対象矩形の幅と基点矩形の
幅の比が2以下で、かつ、垂直方向に対して矩形が重な
る場合は最小距離矩形の候補となり、ステップ10へ進
む。対象矩形のy1−基点矩形のy2の値が矩形間距離
で、最小距離minDisよりもここで得た距離が小さ
い場合は最小距離minDisに、ここでの距離(矩形
間距離)がセットされる。また、最小距離矩形minB
oxにはこの対象矩形の情報がセットされる。
At (step 9), if the following conditions are satisfied, it becomes a candidate for the minimum distance rectangle. If the ratio of the width of the target rectangle to the width of the base rectangle is 2 or less and the rectangles overlap each other in the vertical direction, they become candidates for the minimum distance rectangle, and the process proceeds to step 10. If the value of y1 of the target rectangle is the inter-rectangular distance and the distance obtained here is smaller than the minimum distance minDis, the minimum distance minDis is set to this distance (inter-rectangle distance). Also, the minimum distance rectangle minB
Information of this target rectangle is set in ox.

【0019】(ステップ11)では、次の条件を満たせ
ば最小距離矩形の候補となる。対象矩形の高さと基点矩
形の高さの比が2以下で、かつ、対象矩形のx2の値が
基点矩形のx1−基点矩形の幅と高さの短い方の1/2
より大きく、かつ、対象矩形のx1の値が基点矩形のx
2+基点矩形の幅と高さの短い方の1/2よりも小さい
場合は、最小距離矩形の候補となりステップ12へ進
む。なお、ステップ11は、矩形1,矩形2が逆に格納
されることがあるのを考慮している。対象矩形が基点矩
形の左に位置している場合は基点矩形のx1−対象矩形
のx2の値が矩形間距離となり、対象矩形が基点矩形の
右に位置している場合は対象矩形のx1−基点矩形のx
2の値が矩形間距離となる。最小距離minDisより
もここで得た距離が小さい場合は最小距離minDis
に、ここでの距離がセットされる。また、最小距離矩形
mimBoxにはこの対象矩形の情報がセットされる。
In (step 11), a candidate for a minimum distance rectangle is obtained if the following conditions are satisfied. The ratio of the height of the target rectangle and the height of the base rectangle is 2 or less, and the value of x2 of the target rectangle is x1-1 / 2 of the width and height of the base rectangle of the base rectangle x1.
It is larger and the value of x1 of the target rectangle is x of the base rectangle
If it is smaller than ½ of the shorter width and height of the 2 + base point rectangle, it becomes a candidate for the minimum distance rectangle and proceeds to step 12. Note that step 11 considers that the rectangle 1 and the rectangle 2 may be stored in reverse. When the target rectangle is located to the left of the base rectangle, the value of x1 of the base rectangle is the inter-rectangle distance, and when the target rectangle is located to the right of the base rectangle, x1 of the target rectangle. Base point x
The value of 2 is the distance between the rectangles. If the distance obtained here is smaller than the minimum distance minDis, the minimum distance minDis
The distance here is set. The information of the target rectangle is set in the minimum distance rectangle mimBox.

【0020】以上の処理によって検出された最小距離矩
形が基点矩形に対して、水平方向に位置していれば水平
方向結合カウンタyokoを1つすすめ、また、垂直方
向に位置していれば、垂直方向結合カウンタtateを
1つすすめる(ステップ14〜16)。基点矩形がなく
なるまで、以上の処理を続ける(ステップ17,1
8)。
If the minimum distance rectangle detected by the above processing is located in the horizontal direction with respect to the base point rectangle, then one horizontal direction coupling counter yoko is recommended, and if it is located in the vertical direction, it is vertical. One directional coupling counter tate is recommended (steps 14-16). The above processing is continued until there is no base point rectangle (steps 17 and 1).
8).

【0021】最後に、すべての外接矩形について処理が
終了したら、水平方向結合カウンタyokoと垂直方向
結合カウンタtateの値を比較する。水平方向結合カ
ウンタyokoが多い場合は横書き領域と判定され、垂
直方向結合カウンタtateが多い場合は縦書き領域と
判定される(ステップ19〜21)。
Finally, when all the circumscribed rectangles have been processed, the values of the horizontal direction coupling counter yoko and the vertical direction coupling counter tate are compared. When the horizontal combination counter yoko is large, it is determined to be a horizontal writing area, and when the vertical combination counter tate is large, it is determined to be a vertical writing area (steps 19 to 21).

【0022】図5の1領域例では、 矩形1の最小距離矩形・・・矩形3 矩形2の最小距離矩形・・・矩形5 矩形3の最小距離矩形・・・なし 矩形4の最小距離矩形・・・なし 矩形5の最小距離矩形・・・なし 矩形6の最小距離矩形・・・なし となり、水平カウンタyoko=0、垂直カウンタta
te=2という結果が得られこの領域は縦組み領域と判
定される。
In the example of one area of FIG. 5, the minimum distance rectangle of rectangle 1 ... rectangle 3 the minimum distance rectangle of rectangle 2 ... rectangle 5 the minimum distance rectangle of rectangle 3 ... none the minimum distance rectangle of rectangle 4 .. None. Minimum distance rectangle of rectangle 5 ... None. Minimum distance rectangle of rectangle 6 ... None. Horizontal counter yoko = 0, vertical counter ta.
The result of te = 2 is obtained, and this area is determined to be a vertically set area.

【0023】[0023]

【発明の効果】今回の方式は、文字領域内の文字のつな
がり方向に着目して、文字組み方向を決定するので、ヒ
ストグラムによる文字組方向検出では不可能であった文
字の文字間隔の狭い文書についても文字組み方向の決定
が可能となり、自動で文字認識を行うことが可能とな
る。
According to the present method, the character set direction is determined by paying attention to the connecting direction of the characters in the character area. Therefore, it is impossible to detect the character set direction by the histogram. With respect to, it is also possible to determine the direction of character combination, and it is possible to automatically perform character recognition.

【0024】また、領域内のすべての矩形と比較しない
ように矩形を昇順にソートし、矩形毎に検索範囲を決定
しているため、高速に処理を行うことが可能である。と
いう効果を有する。
Further, since the rectangles are sorted in ascending order so as not to be compared with all the rectangles in the area and the search range is determined for each rectangle, the processing can be performed at high speed. Has the effect.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における領域識別を実行する
装置ブロック図である。
FIG. 1 is a block diagram of an apparatus for performing area identification according to an embodiment of the present invention.

【図2】本発明の実施例のブロック図である。FIG. 2 is a block diagram of an embodiment of the present invention.

【図3】本発明の実施例におけるフローチャートであ
る。
FIG. 3 is a flowchart in the embodiment of the present invention.

【図4】本発明の実施例におけるフローチャートであ
る。
FIG. 4 is a flowchart in the embodiment of the present invention.

【図5】外接矩形の座標を示す図である。FIG. 5 is a diagram showing coordinates of a circumscribed rectangle.

【図6】文字領域を示す図である。FIG. 6 is a diagram showing a character area.

【符号の説明】[Explanation of symbols]

1 ROM 2 CPU 3 RAM 4 スキャナ 5 表示装置 6 領域分割部 7 領域判断部 10 文字組み方向決定部 1 ROM 2 CPU 3 RAM 4 Scanner 5 Display 6 Area Division 7 Area Judgment Section 10 Character Set Direction Determining Section

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 2値化された認識対象文書の領域分割を
行い、分割された領域の属性を識別し、認識を行う文字
認識装置において、被認識文書を2値データに変換する
手段と、2値データの白画素情報から領域を分割する手
段と、分割された各領域について領域内の黒画素情報か
ら領域の属性を識別する手段と、識別結果が文字領域の
場合に、文字のつながり方向を黒画素連結の外接矩形間
の距離情報から検出する手段を備え、自動で文字組み方
向を検出して文字認識を行う領域属性識別方式。
1. A character recognition device for dividing an area of a binarized recognition target document, identifying attributes of the divided area, and recognizing the divided document, and means for converting the recognized document into binary data. A means for dividing the area from the white pixel information of the binary data, a means for identifying the attribute of the area from the black pixel information in the area for each divided area, and a connecting direction of characters when the identification result is a character area. An area attribute identification method that includes means for detecting the distance from the circumscribing rectangles connected to black pixels, and automatically detects the character set direction to perform character recognition.
JP03233518A 1991-09-13 1991-09-13 Character recognition device Expired - Fee Related JP3095470B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP03233518A JP3095470B2 (en) 1991-09-13 1991-09-13 Character recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP03233518A JP3095470B2 (en) 1991-09-13 1991-09-13 Character recognition device

Publications (2)

Publication Number Publication Date
JPH0573718A true JPH0573718A (en) 1993-03-26
JP3095470B2 JP3095470B2 (en) 2000-10-03

Family

ID=16956290

Family Applications (1)

Application Number Title Priority Date Filing Date
JP03233518A Expired - Fee Related JP3095470B2 (en) 1991-09-13 1991-09-13 Character recognition device

Country Status (1)

Country Link
JP (1) JP3095470B2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8457404B2 (en) 2010-08-24 2013-06-04 Fuji Xerox Co., Ltd. Image processing apparatus, computer readable medium for image processing and computer data signal for image processing
US8787676B2 (en) 2010-08-03 2014-07-22 Fuji Xerox, Co., Ltd. Image processing apparatus, computer readable medium storing program, and image processing method
US9280725B2 (en) 2013-02-14 2016-03-08 Fuji Xerox Co., Ltd. Information processing apparatus, information processing method, and non-transitory computer readable medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5512768A (en) * 1978-07-13 1980-01-29 Mitsubishi Electric Corp Manufactureing method of double-layer multi-crystal silicon structure mos type integrated circuit
JPS61193276A (en) * 1985-02-20 1986-08-27 Fujitsu Ltd Character string extracting system
JPS6257069A (en) * 1985-09-06 1987-03-12 Fujitsu Ltd Character string extracting system
JPS6271379A (en) * 1985-09-25 1987-04-02 Nippon Telegr & Teleph Corp <Ntt> Processing system for picture information
JPS62197881A (en) * 1986-02-26 1987-09-01 Hitachi Ltd Vertical or horizontal writing deciding system for document image
JPS63304387A (en) * 1987-06-04 1988-12-12 Mitsubishi Electric Corp Document reader
JPH03126181A (en) * 1989-10-11 1991-05-29 Oki Electric Ind Co Ltd Area dividing method for document image
JPH04352295A (en) * 1991-05-30 1992-12-07 Toshiba Corp System and device for identifing character string direction

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5512768A (en) * 1978-07-13 1980-01-29 Mitsubishi Electric Corp Manufactureing method of double-layer multi-crystal silicon structure mos type integrated circuit
JPS61193276A (en) * 1985-02-20 1986-08-27 Fujitsu Ltd Character string extracting system
JPS6257069A (en) * 1985-09-06 1987-03-12 Fujitsu Ltd Character string extracting system
JPS6271379A (en) * 1985-09-25 1987-04-02 Nippon Telegr & Teleph Corp <Ntt> Processing system for picture information
JPS62197881A (en) * 1986-02-26 1987-09-01 Hitachi Ltd Vertical or horizontal writing deciding system for document image
JPS63304387A (en) * 1987-06-04 1988-12-12 Mitsubishi Electric Corp Document reader
JPH03126181A (en) * 1989-10-11 1991-05-29 Oki Electric Ind Co Ltd Area dividing method for document image
JPH04352295A (en) * 1991-05-30 1992-12-07 Toshiba Corp System and device for identifing character string direction

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8787676B2 (en) 2010-08-03 2014-07-22 Fuji Xerox, Co., Ltd. Image processing apparatus, computer readable medium storing program, and image processing method
US8457404B2 (en) 2010-08-24 2013-06-04 Fuji Xerox Co., Ltd. Image processing apparatus, computer readable medium for image processing and computer data signal for image processing
US9280725B2 (en) 2013-02-14 2016-03-08 Fuji Xerox Co., Ltd. Information processing apparatus, information processing method, and non-transitory computer readable medium

Also Published As

Publication number Publication date
JP3095470B2 (en) 2000-10-03

Similar Documents

Publication Publication Date Title
US6839466B2 (en) Detecting overlapping images in an automatic image segmentation device with the presence of severe bleeding
JP3308032B2 (en) Skew correction method, skew angle detection method, skew correction device, and skew angle detection device
US5892854A (en) Automatic image registration using binary moments
EP1081648B1 (en) Method for processing a digital image
JP2002133426A (en) Ruled line extracting device for extracting ruled line from multiple image
JPH05233873A (en) Area dividing method
JP3006466B2 (en) Character input device
JP2868134B2 (en) Image processing method and apparatus
JPH0573718A (en) Area attribute identifying system
US7103220B2 (en) Image processing apparatus, method and program, and storage medium
JP3476595B2 (en) Image area division method and image binarization method
JPH06187489A (en) Character recognizing device
JPH07220081A (en) Segmenting method for graphic of image recognizing device
EP0767941B1 (en) Automatic determination of landscape scan in binary images
JP3406942B2 (en) Image processing apparatus and method
JPH1021332A (en) Non-linear normalizing method
JPH0449148B2 (en)
JP2982221B2 (en) Character reader
JPH05274472A (en) Image recognizing device
JPH06223224A (en) Method for segmenting line
JPH0656625B2 (en) Feature extraction method
JP2882056B2 (en) How to identify specific patterns
JPH0757047A (en) Character segmentation system
JPH07118016B2 (en) Figure recognition device
JPH05128305A (en) Area dividing method

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20070804

Year of fee payment: 7

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080804

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080804

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090804

Year of fee payment: 9

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090804

Year of fee payment: 9

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20100804

Year of fee payment: 10

LAPS Cancellation because of no payment of annual fees