JPS62298888A - Input decision system for print character - Google Patents

Input decision system for print character

Info

Publication number
JPS62298888A
Authority
JP
Japan
Prior art keywords
character
input
angle
character string
input angle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP14255386A
Other languages
Japanese (ja)
Inventor
Hiromichi Iwase
岩瀬 洋道
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd
Priority to JP14255386A
Publication of JPS62298888A

Landscapes

  • Image Input (AREA)

Abstract

PURPOSE: To decide the inclination of input characters at high speed and with high accuracy by calculating the input angle of a character string from the peaks of a direction histogram of the edges found by taking the first derivative of a character print area.

CONSTITUTION: A direction histogram calculating means 1 takes the first derivative of the character print area of an input image to find the direction histogram of the edges. A character string input angle calculating means 2 calculates the input angle of the character string from the peak values of that direction histogram. An input angle deciding means 3 compares the character string input angle with a preset permissible angle to decide whether or not the input angle is within the permissible range. When the input angle is within the permissible range, character segmentation and then recognition are performed. When it is not, either the user is prompted to re-input the document, or the input image is rotated by the calculated character string input angle before character segmentation and recognition are performed.

Description

[Detailed Description of the Invention]

[Summary] An edge direction histogram is obtained from the character printing area in the image data of an input document containing printed characters. From this histogram the direction in which the character strings are printed is detected, and it is determined whether the direction of the input character strings is within the allowable range for subsequent processing.

[Industrial Field of Application] The present invention relates to pattern recognition, and in particular to document input for printed character recognition.

The scope of character recognition has been expanding from printed characters to handwritten characters. For printed characters, however, the angle of the character strings relative to the sensor that reads the character image is a problem. This is because, in the subsequent processing, character segmentation is performed on the basis of projections, and in a printed character recognition method based on pattern matching, inclination of the characters lowers the recognition rate.

It is therefore necessary to determine whether or not the input angle of the character strings is within the allowable range of the character segmentation method and the recognition method.

[Prior Art] Conventionally, there have been the following two methods for determining whether or not printed characters have been input at an angle within the permissible range.

(1) A method in which projections are calculated while changing the projection direction, and the angle at which the character strings are printed is detected as the direction in which the variance of the projection is maximum. When the projection direction matches the direction of the character strings, the blank spaces between lines appear in the projection data, and the variance becomes maximum.

(2) A method in which a projection is calculated for each of several vertically long small areas, correspondences are found between these projections, and a straight line is fitted to them to detect the angle of the character strings.
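The first conventional method, searching over projection directions for the one with maximum variance, can be sketched as follows. This is only an illustration under assumptions that are not in the patent: the image is represented as a list of black-pixel coordinates, the projection is binned to integer positions, and the names `projection_variance` and `estimate_angle_by_projection` are hypothetical.

```python
import math

def projection_variance(points, angle_deg):
    """Variance of the projection profile taken along direction angle_deg."""
    t = math.radians(angle_deg)
    counts = {}
    for x, y in points:
        # Coordinate perpendicular to the candidate text-line direction,
        # rounded to an integer bin (assumed binning, not from the patent).
        key = round(-x * math.sin(t) + y * math.cos(t))
        counts[key] = counts.get(key, 0) + 1
    lo, hi = min(counts), max(counts)
    # Include empty bins so that blank gaps between lines raise the variance.
    profile = [counts.get(k, 0) for k in range(lo, hi + 1)]
    mean = sum(profile) / len(profile)
    return sum((v - mean) ** 2 for v in profile) / len(profile)

def estimate_angle_by_projection(points, candidates):
    """Return the candidate angle whose projection profile has maximum variance."""
    return max(candidates, key=lambda a: projection_variance(points, a))

# Synthetic example: five text "lines" drawn at a skew of 5 degrees.
skew = math.tan(math.radians(5))
points = [(x, x * skew + 12 * k) for k in range(5) for x in range(200)]
angle = estimate_angle_by_projection(points, range(-10, 11))
```

Every candidate angle requires a full pass over the pixels, which illustrates why a search of this kind becomes slow as the angular step is made finer.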

As shown in FIG. 5(a), when the direction of the printed character strings matches the direction in which the projection is taken, the blank spaces between lines are visible. When it does not match, as shown in (b), the blank spaces between lines are not visible. If the image is therefore divided perpendicular to the projection direction and a projection is taken for each part, the blank spaces become visible, as shown in (c).

Looking at the projection data in (c), it can be seen that the projections are shifted in the direction perpendicular to the projection direction. The direction of the character strings can be obtained from this amount of shift. In the case of (d) in the figure, a shift of 2 in the x direction corresponds to a shift of 4 in the y direction, so the rotation angle of the character strings can be obtained as 22.5°.
It can be calculated as 2.5°.

[Problems to be Solved by the Invention] The conventional methods described above have the following problems.

In method (1), the projection direction in which the variance is maximum is not known at all in advance, so the direction must be changed in steps of some precision and the projection calculated many times, which takes a long processing time.

In method (2), actual projections cannot be obtained as neatly as shown in FIG. 4, so the correspondence between projections cannot always be established clearly. In addition, dividing the image and calculating the projections takes time.

The present invention aims to provide a novel input determination system that solves these conventional problems.

[Means for Solving the Problems] FIG. 1 is a block diagram showing the principle of the printed character input determination system of the present invention.

In the figure, reference numeral 1 denotes a direction histogram calculation means, which applies a first-order differential to the character printing area of the input image and obtains the direction histogram of the edges.

Reference numeral 2 denotes a character string input angle calculation means, which calculates the character string input angle from the peak values of the direction histogram calculated by the direction histogram calculation means 1.

Reference numeral 3 denotes an input angle determination means, which compares the character string input angle calculated by the character string input angle calculation means 2 with a preset permissible angle and determines whether or not it is within the permissible range.

If the input angle of the character strings is determined to be within the permissible range, character segmentation and then recognition are performed.

If the input angle of the character strings is determined to be outside the permissible range, either the user is notified and prompted to re-input the document, or the input image is rotated by the calculated character string input angle before character segmentation and recognition are performed.
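The branching described above can be sketched as a small decision function. This is a minimal illustration, not the patent's circuit: the function name, the returned labels, the `can_rotate` flag that selects between the two out-of-range alternatives, and the choice to treat the boundary value as acceptable are all assumptions.

```python
def handle_input(angle_deg, tolerance_deg, can_rotate):
    """Decide the next processing step from the character string input angle."""
    if abs(angle_deg) <= tolerance_deg:
        # Within the permissible range: go straight to segmentation.
        return "segment_and_recognize"
    if can_rotate:
        # Out of range: rotate the image by the calculated angle first.
        return "rotate_then_segment_and_recognize"
    # Out of range and no rotation available: ask for the document again.
    return "prompt_reinput"
```

For example, with a tolerance of 3 degrees, a measured angle of 1.5 degrees proceeds directly to segmentation, while 8 degrees leads to either rotation or a re-input prompt.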

[Operation] Japanese characters on printed matter are composed mainly of vertical and horizontal line segments, together with line segments inclined at 45 degrees from them.

That is, if the distribution of line segment angles in the character printing area of printed matter is obtained, it has very high peaks at the four angles of 0, 90, 180, and 270 degrees, and somewhat high peaks at the four angles midway between them, as shown in FIG. 2.

In the present invention, to obtain this angle, the character printing area is scanned once with a first-order differential filter to obtain the direction histogram of the edges of the line segments that make up the characters, and the angle of the printed matter relative to the reading sensor can be calculated from the peaks of that histogram.

[Embodiment] The present invention will now be described more specifically with reference to the embodiment shown in FIGS. 3 and 4.

FIG. 3 is a block diagram of one embodiment of the present invention.

In the figure, reference numeral 4 denotes an input device, which optically scans an input printed document, photoelectrically converts the reflected light, and binarizes the resulting signal.

Reference numeral 5 denotes an image memory, which stores the image data binarized by the input device 4.

Reference numeral 6 denotes an area separation circuit, which reads out the contents of the image memory 5 and identifies and separates the character printing areas, figure and table areas, and photograph areas.

Reference numeral 7 denotes a part of the image memory 5, which stores the coordinate data of each area separated by the area separation circuit 6.

Reference numeral 1 denotes a direction histogram calculation circuit. Based on the area coordinate data stored in the image memory 7, it applies a first-order differential to the character printing area of the input image stored in the image memory 5 and obtains the histogram of the edge directions. Reference numeral 8 denotes a memory, which stores the direction histogram data calculated by the direction histogram calculation circuit 1.

Reference numeral 2 denotes a character string input angle calculation circuit. It finds the peak of the direction histogram stored in the memory 8 and takes, as the input angle of the character strings, the smallest of the differences between the peak angle and each of 0°, 90°, 180°, and 270°.
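The peak-to-angle step above can be sketched as follows. The sketch assumes a histogram with 360 one-degree bins, returns a signed angle, and treats angles near 360° as close to 0° (wrap-around); none of these details are stated in the patent, and the function name is hypothetical.

```python
def input_angle_from_histogram(histogram):
    """histogram[d] = number of edge pixels with direction d degrees (0..359)."""
    # Direction with the highest count.
    peak = max(range(len(histogram)), key=lambda d: histogram[d])
    best = None
    for ref in (0, 90, 180, 270):
        # Signed difference in [-180, 180), so 358 degrees is -2 from 0.
        diff = (peak - ref + 180) % 360 - 180
        if best is None or abs(diff) < abs(best):
            best = diff
    return best

# Example: the strongest peak sits 3 degrees away from 90 degrees.
hist = [0] * 360
hist[93] = 10
hist[3] = 8
angle = input_angle_from_histogram(hist)
```

Because only the single highest bin is used here, a noisy histogram could shift the result; smoothing or combining the four main peaks would be a possible refinement, but the source does not describe one.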

Reference numeral 9 denotes a memory, which stores the character string input angle calculated by the character string input angle calculation circuit 2.

Reference numeral 10 denotes a memory, which stores the smaller of the permissible ranges of the character segmentation unit and the recognition unit.

Reference numeral 3 denotes an input angle determination circuit, which compares the character string input angle stored in the memory 9 with the permissible angle stored in the memory 10 and determines whether or not the document was input at an angle exceeding the permissible range.

FIG. 4 is a diagram showing how the direction histogram calculation circuit 1 calculates the direction histogram using first-order differential filters.

Each filter has a size of, for example, 3×3 (9 pixels) as shown in the figure, and the same location in the image data is scanned with both filter (a) and filter (b).

When an image such as that shown in (c) is scanned in the horizontal direction, the sum of the products of each image pixel and the corresponding filter element is, at the edge, "-3" for filter (a) and "0" for filter (b).

When an image such as that shown in (d) is scanned, the value at the edge is "0" for filter (a), while filter (b) yields a run of "3".
3" are continuous in bl.

In this way, filter (a) detects changes in the y direction, and filter (b) detects changes in the horizontal direction.

Accordingly, when a 45-degree edge such as that shown in (e) is scanned, the value is "-2" for filter (a) and "2" for filter (b).

The values obtained from filter (a) and filter (b) when the same location is scanned represent the y component and the x component of the edge normal direction, as shown in FIG. 4(f), so the direction of the edge can be determined from the ratio of these values.
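The filter arithmetic described above can be checked with a short sketch. The patent shows the filter coefficients only in FIG. 4, so the Prewitt-style kernels below are an assumption, chosen because they reproduce the quoted responses (-3 and 0, 0 and 3, -2 and 2) on a binary image. The sample windows, the y-up angle convention, and the function names are likewise hypothetical.

```python
import math

# Assumed Prewitt-style coefficients (not given in the text of the patent).
FILTER_A = [[1, 1, 1], [0, 0, 0], [-1, -1, -1]]   # responds to change in y
FILTER_B = [[-1, 0, 1], [-1, 0, 1], [-1, 0, 1]]   # responds to change in x

def filter_response(window, kernel):
    """Sum of element-wise products of a 3x3 window and a 3x3 kernel."""
    return sum(window[r][c] * kernel[r][c] for r in range(3) for c in range(3))

def edge_normal_degrees(window):
    """Angle of the edge normal in [0, 360), or None where there is no edge."""
    a = filter_response(window, FILTER_A)   # y component
    b = filter_response(window, FILTER_B)   # x component
    if a == 0 and b == 0:
        return None
    return math.degrees(math.atan2(a, b)) % 360.0

# Binary 3x3 windows (1 = black) straddling three kinds of edge.
horizontal_edge = [[0, 0, 0], [0, 0, 0], [1, 1, 1]]
vertical_edge = [[0, 0, 1], [0, 0, 1], [0, 0, 1]]
diagonal_edge = [[0, 0, 0], [0, 0, 1], [0, 1, 1]]
```

Using `atan2` on the two responses rather than dividing them avoids a division by zero when filter (b) returns 0, and it distinguishes opposite edge polarities, which is why the histogram can span a full 360 degrees.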

If the entire character printing area is scanned once with filters (a) and (b), and a histogram is created of the edge directions obtained from the ratio of the resulting values, a direction histogram like that of FIG. 2 is obtained.
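A single-pass scan that accumulates the direction histogram might look like the sketch below. It again assumes Prewitt-style kernels (the patent gives the coefficients only in FIG. 4), a binary image stored as a list of rows, and one-degree bins; the function name and the test image are hypothetical.

```python
import math

# Assumed Prewitt-style coefficients (not given in the text of the patent).
FILTER_A = [[1, 1, 1], [0, 0, 0], [-1, -1, -1]]   # y component
FILTER_B = [[-1, 0, 1], [-1, 0, 1], [-1, 0, 1]]   # x component

def direction_histogram(image):
    """Scan a binary image once and count edge directions in 1-degree bins."""
    hist = [0] * 360
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            a = b = 0
            # Apply both filters to the same 3x3 neighbourhood.
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    v = image[r + dr][c + dc]
                    a += v * FILTER_A[dr + 1][dc + 1]
                    b += v * FILTER_B[dr + 1][dc + 1]
            if a or b:   # skip flat regions, which have no direction
                hist[round(math.degrees(math.atan2(a, b))) % 360] += 1
    return hist

# Upright 6x6 black square on a 12x12 white background.
image = [[1 if 3 <= r <= 8 and 3 <= c <= 8 else 0 for c in range(12)]
         for r in range(12)]
hist = direction_histogram(image)
```

For this upright square the largest bins fall at 0, 90, 180, and 270 degrees, with only a few counts elsewhere from the corners. A rotated document would shift all of these peaks by the same amount.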

[Effects of the Invention] As explained above, according to the present invention the inclination of input characters can be determined at high speed and with high accuracy, which contributes greatly to improving the performance of printed character recognition devices.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of the principle of the present invention, FIG. 2 is a diagram showing the angular distribution of printed character line segments, FIG. 3 is a block diagram of an embodiment of the present invention, FIG. 4 is a diagram explaining the direction histogram calculation method, and FIG. 5 is a diagram showing a conventional rotation angle detection method.

In the drawings, 1 is the direction histogram calculation means (circuit), 2 is the character string input angle calculation means (circuit), 3 is the input angle determination means (circuit), 4 is the input device, 5 and 7 are image memories, 6 is the area separation circuit, 8 is the direction histogram storage memory, 9 is the character string input angle storage memory, and 10 is the permissible range angle storage memory.

Claims (1)

[Claims] In a character recognition device that separates and recognizes characters one by one from an image of an input document containing printed characters, an input determination system for printed characters characterized by comprising: direction histogram calculation means (1) for applying a first-order differential to the character printing area of the input image and obtaining the direction histogram of the edges; character string input angle calculation means (2) for calculating the character string input angle from the peaks of the direction histogram calculated by the direction histogram calculation means (1); and input angle determination means (3) for comparing the character string input angle calculated by the character string input angle calculation means (2) with a preset permissible angle and determining whether or not it is within the permissible range, the system being configured to determine whether or not the character strings of the input document are suitable for subsequent processing.
JP14255386A 1986-06-18 1986-06-18 Input decision system for print character Pending JPS62298888A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP14255386A JPS62298888A (en) 1986-06-18 1986-06-18 Input decision system for print character

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP14255386A JPS62298888A (en) 1986-06-18 1986-06-18 Input decision system for print character

Publications (1)

Publication Number Publication Date
JPS62298888A true JPS62298888A (en) 1987-12-25

Family

ID=15318019

Family Applications (1)

Application Number Title Priority Date Filing Date
JP14255386A Pending JPS62298888A (en) 1986-06-18 1986-06-18 Input decision system for print character

Country Status (1)

Country Link
JP (1) JPS62298888A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5040831A (en) * 1990-01-12 1991-08-20 Phil Lewis Non threaded pipe connector system


Similar Documents

Publication Publication Date Title
O'Gorman The document spectrum for page layout analysis
US5613016A (en) Area discrimination system for text image
EP0543593B1 (en) Method for determining boundaries of words in text
US20070230784A1 (en) Character string recognition method and device
KR100383858B1 (en) Character extracting method and device
JPH0229886A (en) Method for extracting feature variable
JP3006466B2 (en) Character input device
EP0505729B1 (en) Image binarization system
Lehal et al. A range free skew detection technique for digitized Gurmukhi script documents
JPS62298888A (en) Input decision system for print character
JPS6325391B2 (en)
JP2000357287A (en) Method and device for number plate recognition
JP3153439B2 (en) Document image tilt detection method
JP4439054B2 (en) Character recognition device and character frame line detection method
JPH04276888A (en) Character reader
JPH09179982A (en) Specific pattern detecting method
JP3381803B2 (en) Tilt angle detector
JPH0573718A (en) Area attribute identifying system
JP2980636B2 (en) Character recognition device
JPH11250179A (en) Character reocognition device and its method
JPS63101983A (en) Character string extracting system
JPS63282584A (en) Detecting method for angle of rotation
JPH04167193A (en) Character recognizing method
JPH103517A (en) Device for detecting tilt angle of document picture
JPH03250387A (en) Character segmenting system