JPS61156484A

JPS61156484A - Picture image processor

Info

Publication number: JPS61156484A
Application number: JP59276463A
Authority: JP
Inventors: Kenichi Oota; 健一太田; Masatoshi Okutomi; 正敏奥富; Tetsuo Sueda; 末田　哲夫
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1984-12-28
Filing date: 1984-12-28
Publication date: 1986-07-16

Abstract

PURPOSE:To carry out appropriate binarization processing by carrying out pre-processing for recognizing information read by reading means with the aid of multivalued digital data. CONSTITUTION:Picture images in a character frame position detection area are taken out sequentially out of a multivalued picture image data 31 which is an output data from an A/D converter 23, and transferred to adders 32 and 35. When picture taking out is of a luster scan system in an x direction, density of a y direction projection distribution is available by adding density distribution in one line, and addition completes by reciprocating scanning between the adder 32 and a latch 33, thereby storing said distribution into a memory 34. As for the x direction, scanning reciprocates between the adder 35 and a memory 36. A threshold value deciding circuit 39 decides a threshold value from a detected frame position and the picture image data 31, and send said value to a comparator 38, thereby obtaining a secondary value picture image 30 from the picture image data 31 and the comparator 38. Thus, a character area to be recognized is extracted, and the appropriate binarization processing with respect to a character image can be carried out.

Description

【発明の詳細な説明】〔技術分計〕本発明は画像処理装置特に光学的文字認識装置における
前処理装置特に文字枠とその内部に書き込まれた文字、
記号とを識別し、文字、記号のみを抽出する文字抽出装
置に関するものである。[Detailed Description of the Invention] [Technical Summary] The present invention relates to a pre-processing device for an image processing device, particularly an optical character recognition device, particularly a character frame and characters written inside the character frame.
The present invention relates to a character extraction device that identifies characters and symbols and extracts only characters and symbols.

[Prior art]

従来光学的文字読み取り装置において、読み取るべき帳
票上の文字枠は、読み取り装置の分光感度では帳票下地
と区別できないような色の、いわゆるドロップアウトカ
ラーで印刷することにより、文字部のみを検出するとい
った方法が一般的であった。しかしこの方法においては
、文字枠に上記のような制約があるため、一般的な筆記
用具である鉛筆、ボールペンやサインペン或いはＰＰＣ
等により、文字枠を作成し、帳票を自家製造することは
不可能である。また、文字枠にドロップアウトカラーを
用いない場合としては、文字及び文字枠を２値画像に変
換した後に適当なフィルタリングを行うことにより、文
字枠を検出する方法も提案されている。（特開昭５９−
７７５７７号公報）しかし、この方法では文字及び文字
枠の両方に対して一定値の閾値による２値化を行うため
、第１図のように、文字と文字枠の濃度差が大きい場合
や文字ごとの濃度のバラツキがある場合には、適正な２
値化閾値がうまく設定できず、文字の切れやつぶれ等が
生じやすいという欠点があった。In conventional optical character reading devices, the character frame on the form to be read is printed in a so-called dropout color, which is a color that cannot be distinguished from the form base using the spectral sensitivity of the reading device, so that only the text is detected. The method was common. However, in this method, due to the above-mentioned restrictions on the character frame, general writing instruments such as pencils, ballpoint pens, felt-tip pens, or PPC
etc., it is impossible to create character frames and create forms in-house. Furthermore, in the case where a dropout color is not used for the character frame, a method has also been proposed in which the character frame is detected by converting the characters and the character frame into a binary image and then performing appropriate filtering. (Unexamined Japanese Patent Publication No. 59-
(No. 77577) However, since this method performs binarization using a fixed threshold value for both characters and character frames, as shown in Figure 1, when there is a large density difference between characters and character frames, or when each character If there are variations in the concentration of
There was a drawback that the digitization threshold could not be set well, and characters were likely to be cut off or blurred.

〔the purpose〕

以上の点に鑑み、本願発明の目的は文字枠等にドロップ
アウトカラーを使用するといった制約なしで、認識すべ
き文字領域を抽出し、該文字に対して適正な２値化処理
を行うことが可能な画像処理装置を提供することにある
。In view of the above points, an object of the present invention is to extract a character area to be recognized and perform appropriate binarization processing on the character without the restriction of using dropout colors for character frames etc. The purpose of the present invention is to provide a capable image processing device.

又、２値化する以前に、前処理段階で、確実に認識対象
物とノイズ、文字枠との識別が可能な画像処理装置を提
供することを目的としている。Another object of the present invention is to provide an image processing apparatus that can reliably distinguish between a recognition target object, noise, and character frames in a preprocessing stage before binarization.

〔Example〕

次に、図面を参照して本発明の詳細な説明する。第２図
に本発明の適用が可能のシステムの１例を示す。１はリ
ーダであり、ＣＣＤ等で原稿を読み取り、該読み取った
画像データを編集処理し、プリンタ３に出力したり、外
部機器へネットワークを介して、伝達したりすることが
可能である。なお、２はＯＣＲ及び画像処理の制御部で
ある。又、４は、ディスプレイを有したコンピュータで
、５のキーボードからの入力により、機器の制御指令等
を行うものである。Next, the present invention will be described in detail with reference to the drawings. FIG. 2 shows an example of a system to which the present invention can be applied. Reference numeral 1 denotes a reader, which is capable of reading a document with a CCD or the like, editing the read image data, outputting it to the printer 3, or transmitting it to an external device via a network. Note that 2 is a control unit for OCR and image processing. Further, numeral 4 is a computer having a display, and is used to issue control commands for equipment through input from the keyboard 5.

なお本願の適用が可能な機器は他に１元ディスク等のフ
ァイル、マイクロフィルム、レーザビームプリンタ、各
機器を制御するワークステーションを有したシステムの
中の１つであってもよい。Note that the device to which the present application can be applied may also be one of the systems having a file such as a single-source disk, a microfilm, a laser beam printer, and a workstation for controlling each device.

第３図に本発明の適用が可能な機器のブロック図を示す
。読み取られるべき原稿２１は光源で照明され、充電変
換素子２２上に結像され、ラスタスキャンにより、時系
列の電気信号に変換され、２３のＡ−Ｄ変換器により多
値のデジタル信号となり、２４．２５の走査方向及び副
走査方向の加算器及び相関器へ別々に送られる。FIG. 3 shows a block diagram of equipment to which the present invention can be applied. The original 21 to be read is illuminated with a light source, imaged on the charging conversion element 22, converted into a time-series electric signal by raster scanning, converted into a multi-value digital signal by an A-D converter 23, and converted into a multi-value digital signal by an A-D converter 24. .25 scan direction and sub-scan direction adders and correlators separately.

相関器２５からは文字領域情報が２６の２値化回路へ送
られ、画像の２値化、つづいて２７の認識回路で、認識
が行われる。本発明は、上述の２値化を行う直前までの
前処理に係るものである。The character area information is sent from the correlator 25 to a binarization circuit 26, where the image is binarized and then recognized by a recognition circuit 27. The present invention relates to preprocessing immediately before performing the above-mentioned binarization.

第４図に、第３図に示した加算器２４、相関器２５につ
いて、更に詳細に説明する。なお、第５図にその手順を
示す。In FIG. 4, the adder 24 and correlator 25 shown in FIG. 3 will be explained in more detail. The procedure is shown in FIG.

第３図に示したＡ／Ｄ変換器２３からの出力データであ
る多値画像データ３１から後述の第６図に示すように、
文字枠位置探査領域の画像を順次とり出し、３２．３５
の加算器へ転送する。画像取り出しがＸ方向のラスター
スキャン方式である場合を考えると、ｙ方向投影投炭分
布は、１ライン中の濃度分布を加算して行えばよいので
、３２の加算器と３３のラッチの間を往復することによ
り加算が完了し、それを３４のメモリに収容すればよい
。Ｘ方向については、１ラインスキヤンの間のデータを
それぞれ別々のメモリに収容して、ラインが変わる毎に
、それをよみ出して、データを加算することになるので
、３５．３６の間を往復することＫなる。As shown in FIG. 6, which will be described later, from the multivalued image data 31 which is the output data from the A/D converter 23 shown in FIG.
Sequentially extract images of the character frame position detection area, 32.35
transfer to the adder. Considering the case where image retrieval is performed using a raster scan method in the The addition is completed by going back and forth, and it is sufficient to store it in 34 memories. Regarding the X direction, the data for one line scan is stored in separate memories, and each time the line changes, it is read out and the data is added, so it is necessary to go back and forth between 35.36 It becomes K to do.

以上により得られた投影濃度分布は、３７のマイクロプ
ロセッサで相関演算を施こされ、枠位置が検出される。The projected density distribution obtained in the above manner is subjected to correlation calculation by 37 microprocessors, and the frame position is detected.

この部分については後述する。This part will be described later.

検出された枠位置と３１の画像データから、上記とは独
立に設けられた３９の１闇値決定回路が闇値を決定し、
３８の比較器に送りこまれ、３１と３８から２値画像３
０が得られる。Based on the detected frame position and 31 image data, 39 1 darkness value determination circuits provided independently from the above determine the darkness value,
It is sent to the comparator of 38, and binary image 3 is generated from 31 and 38.
0 is obtained.

第５図には以上の手順が示されており、Ｘ方向について
のみ説明する。The above procedure is shown in FIG. 5, and only the X direction will be explained.

まず、枠探査領域が指定されると（ステップ２）、ｘ方
向投影濃度分布を作成しくステップ３）、次に、濃度分
布のピークを検出するため、ステップ４で、初期化し、
ステップ５において、ｔをパラメータとして増加させ、
ステップ６でマスク演算を行う（後述の相関演算）。そ
して、ステップ７．８において、最大値を求め、枠を検
出する。First, when the frame search area is specified (step 2), an x-direction projected density distribution is created (step 3), and then, in order to detect the peak of the density distribution, in step 4, initialization is performed.
In step 5, increase t as a parameter,
In step 6, a mask calculation is performed (correlation calculation to be described later). Then, in step 7.8, the maximum value is determined and a frame is detected.

更に第３図について、詳細に説明する。Further, FIG. 3 will be explained in detail.

読み取るべき原稿２１は第６図の如く、あらかじめ、相
関器２５に大きさ及び線幅が記憶された複数の文字枠３
１内に、文字、記号３２が書きこまれているようなもの
であるとする。Ａ−Ｄ変換器２３からの多値デジタル信
号は２４の加算器で順次加算されＸ軸方向及びｙ軸方向
への投影濃度分布を作成する。第７図に投影濃度分布の
一例を示す。濃度分布は、文字枠の少くとも１つが入り
２つ以上は入らないような領域、例えば第７図の点＠４
１で囲まれた領域についてとるようにする。第７図の曲
Ｉ！４２がＸ軸投影濃度分布、４３がｙ軸投影濃度分布
を示して〜する。第７図中の長さ４４．４５は、あらか
じめ既知の文字枠のＸ方向、ｙ方向の大きさとほぼ等し
くなっている。第３図２５の相関器は上記投影濃度分布
と、与えられたマスクパターンとの相関演算を行い、文
字枠位置を検出するもので、以下にこの相関器の作用（
第４図のＭＰＵ内）について説明する。As shown in FIG. 6, the original 21 to be read has a plurality of character frames 3 whose sizes and line widths are stored in the correlator 25 in advance.
Assume that a character or symbol 32 is written in 1. The multivalued digital signals from the A-D converter 23 are sequentially added by 24 adders to create projected density distributions in the X-axis direction and the y-axis direction. FIG. 7 shows an example of the projected density distribution. The density distribution is a region where at least one character frame is included and two or more are not included, for example, point @4 in Figure 7.
Let us take the area surrounded by 1. Song I in Figure 7! Reference numeral 42 indicates an X-axis projected density distribution, and 43 indicates a y-axis projected density distribution. The length 44.45 in FIG. 7 is approximately equal to the previously known size of the character frame in the X and Y directions. The correlator shown in FIG. 325 detects the character frame position by calculating the correlation between the projected density distribution and the given mask pattern.The function of this correlator (
(inside the MPU in FIG. 4) will be explained.

相関とは、この場合、ある−次元の関数ｆと、与えられ
たマスクパターンｇの檀を有限区間にわたって積分し、
ｇを少しずつずらしながら積分値の最大値を探し出すと
いった操作を指す。In this case, correlation is the integration of a certain -dimensional function f and a given mask pattern g over a finite interval,
Refers to the operation of finding the maximum value of the integral value while shifting g little by little.

Ｘ方向の枠位置を検出するには、ｆはＸ方向投影濃度分
布ｆ（ｘ）、ｇは例えば、第８図に示すようなマスクパ
ターンであり、点ｊを原点にとれば、となる。ここで２つの凸部の間隔ωと凸部の大きさｈを
Ｘ方向の文字枠の大きさ及び、文字枠の線幅にほぼ等し
くとっておけばｆとｇの相関値は第７図のｔ＝ａにおい
てまず最大となる。従って、この最大値を検出すれば枠
位置が求められることになる。ｙ方向についても同様の
手続きをすれば、同様にして第７図す点が求められるの
で２次元的な枠位置Ａ点が検出されることになる。又、
枠の長さがわかっているので、Ｂ点も容易に検出できる
。To detect the frame position in the X direction, f is the X direction projected density distribution f(x), g is a mask pattern as shown in FIG. 8, for example, and if point j is taken as the origin, then the following equation is obtained. Here, if the interval ω between the two convex parts and the size h of the convex parts are set approximately equal to the size of the character frame in the X direction and the line width of the character frame, the correlation value between f and g will be as shown in Figure 7. It first becomes maximum at t=a. Therefore, by detecting this maximum value, the frame position can be determined. If the same procedure is carried out in the y direction, the points shown in FIG. 7 can be found in the same way, so that the two-dimensional frame position point A can be detected. or,
Since the length of the frame is known, point B can also be easily detected.

次に文字枠を抽出した後、文字領域の抽出方法について
説明する。Next, a method for extracting a character area after extracting a character frame will be described.

前述のＡ、Ｂ点からそれぞれ枠の内側に投影濃度分布を
たどり、最初の極小点（第９図矢印）をもって文字抽出
範囲とするのである。又、他の２点についても同様であ
る。文字抽出範囲は、第９図一点鎖線で囲まれた領域と
なる。以上説明したように文字領域の判別までＡ−Ｄ変
換後の多値画像のままで行うので、該領域に対して、２
値化閾値を決めれば、適正な２値化を行うことができ、
またその後、バタンマツチング方法の精度を高めること
ができ非常に精度の高い画像処理装置を提供することが
可能となった。The projected density distribution is traced inside the frame from the aforementioned points A and B, respectively, and the first minimum point (arrow in FIG. 9) is taken as the character extraction range. The same applies to the other two points. The character extraction range is the area surrounded by the dashed line in FIG. As explained above, since the character area is determined using the multivalued image after A-D conversion, two
By determining the digitization threshold, proper binarization can be performed.
Further, since then, the accuracy of the slam matching method has been improved, and it has become possible to provide an image processing device with extremely high accuracy.

前記第８図に示したマスクパターンのω、ｈには多少の
冗長性があるのである範囲内の値であれば、充分な機能
を発揮する。またその形も例えば第１０図（ａ）　、　
（ｂ）のようにいくつかの変形が考えられ、第３図のも
のだけに限定されるわけではない。また、第７図の投影
濃度分布を得る際、領域４１内の全ての点を抽出する必
要はなく、例えばＸ軸方向投影濃度分布を作成するとき
は、第１１図の如く、走査ラインをｙ方向に何本かおき
にとる、即ち図中８１の一点鎖線上の濃度分布のみをＸ
方向投影濃度分布として積算する、といったことも可能
である。Since there is some redundancy in ω and h in the mask pattern shown in FIG. 8, a sufficient function can be achieved if the values are within a certain range. The shape is also as shown in Figure 10(a), for example.
Several modifications are possible, as shown in (b), and the invention is not limited to the one shown in FIG. Furthermore, when obtaining the projected density distribution shown in FIG. 7, it is not necessary to extract all points within the region 41. For example, when creating the projected density distribution in the In other words, only the concentration distribution on the dashed line 81 in the figure
It is also possible to integrate as a directional projection density distribution.

以上、本発明によれば、所定の枠内に書きこまれた文字
等の画像を抽出する際に、ドロップアウトカラー等の細
工をする必要がなく、例えば、ＰＰＣにより自家作製し
た帳票を用いることも可能となる。また多値画像のまま
処理を行うため、２値画像処理で不適正なしきい値を設
定した場合におこりがちな帳票の汚れ、かすれ等の誤動
作を減少させ安定して文字を抽出することができ、その
後の２値化、認識等の処理も容易となる。As described above, according to the present invention, when extracting images such as characters written in a predetermined frame, there is no need to use tricks such as drop-out colors, and, for example, a form made in-house using PPC can be used. is also possible. In addition, since processing is performed as a multivalued image, characters can be extracted stably by reducing malfunctions such as smearing and blurring of forms that tend to occur when an inappropriate threshold is set in binary image processing. , subsequent processing such as binarization and recognition becomes easier.

〔effect〕

以上、詳述したように本願発明により、文字枠にドロッ
プアウトカラー等を使用することなく、認識すべき文字
領域を抽出し、該文字画像に対して、適正な２値化処理
を行うことが可能となった。As detailed above, according to the present invention, it is possible to extract the character area to be recognized without using dropout colors or the like in the character frame, and to perform appropriate binarization processing on the character image. It has become possible.

本願発明により、画像データの２値化処理以前に多値画
像のまま前処理を行うので、適正な２値化を容易に行う
ことが可能となった。According to the present invention, since preprocessing is performed on the multivalued image before the image data is binarized, it is possible to easily perform proper binarization.

[Brief explanation of the drawing]

第１図は、文字枠濃度と文字濃度との差が大きいか、或
いは文字同士の濃度にバラツキがあるような帳票の一例
を示す図、第２図は本発明の適用が可能な装置の例を示
す図、第３図は本発明適用の装置の構成ブロック図、第
４図は加惇器、相関器の説明図、第５図は文字枠の識別
の７０−チャート、第６図は帳票上の文字及び文字枠の
配列例及び座標軸の説明図、第７図は投影濃度分布を説
明する図、第８図は相関処理に用いるマスクパターンを
示す図、第９図は文字領域の切り出し範囲を説明する図
、第１０図は相関処理に用いるマスクパターンの他の例
を示す図である。第１１図は走査ラインをｙ方向に何本
かおきにとった場合の説明図。FIG. 1 is a diagram showing an example of a document in which there is a large difference between the character frame density and character density, or in which there is variation in the density between characters, and FIG. 2 is an example of an apparatus to which the present invention can be applied. 3 is a configuration block diagram of the device to which the present invention is applied, FIG. 4 is an explanatory diagram of the adder and correlator, FIG. 5 is a 70-chart for character frame identification, and FIG. 6 is a form diagram. Figure 7 is a diagram explaining the projected density distribution, Figure 8 is a diagram showing the mask pattern used for correlation processing, Figure 9 is the cutout range of the character area. FIG. 10 is a diagram illustrating another example of a mask pattern used for correlation processing. FIG. 11 is an explanatory diagram when scanning lines are taken every few lines in the y direction.

Claims

[Claims]

A reading means capable of reading frame information and image information;
An image processing apparatus characterized in that preprocessing for recognizing information read by a reading means is performed on multivalued digital data.