JPH05506115A

JPH05506115A - Correlation masking process for deskewing, filtering and recognition of vertically segmented characters

Info

Publication number: JPH05506115A
Application number: JP91504482A
Authority: JP
Inventors: ガボウスキ，ロジャー・スティーヴン
Original assignee: イーストマン・コダック・カンパニー
Priority date: 1990-02-02
Filing date: 1991-01-31
Publication date: 1993-09-02
Also published as: US5052044A; WO1991011780A1; EP0513171A1

Abstract

(57)【要約】本公報は電子出願前の出願データであるため要約のデータは記録されません。 (57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】垂直方向にセグメント化されたキャラクタのデヌ牛ユーイングろ波及び認識のｔ ζめの相関マスキング処理元型の背量挾血云野本発明は　垂直方向にセグメント化されたキャラクタを認識する光学式文字認識装置に関する。[Detailed description of the invention] Denu cow Ewing filtering and recognition of vertically segmented characters The amount of correlation masking processing archetype of ζ Hanketsu Yunno The present invention is an optical character recognition system that recognizes vertically segmented characters. Regarding equipment.

貨−探術垂直方向にセグメント化されたキャラクタは、肉眼で読み取り可能であると共に、機械的に読み取り可能なシンボルを必要とする　例えば銀行小切手のような書類上に印刷される。一つのキャラクタにおいて、垂直方向のセグメントは、そのキャラクタが肉眼で読み取り可能となるように、特定シンボルの輪郭に一致させる。一方、そのキャラクタにおいて　垂直方向に異なるセグメント間で水平方向に異なる間隔のパターンは特定のシンボルを固有に定めるので　キャラクタは機械的に読み取り可能なものとなる。この考えは、ファイセル（Ｆｅｉ＋５ｅｌｌ他に対する米国特許第３．３０９．６６７号、及びルイエール（Ｌ’ｈｉｌｌｉｅｒｌに対する米国特許第３．６８８．９５５号に開示されている。Currency - Exploration Vertically segmented characters are readable to the naked eye and , documents that require machine-readable symbols, such as bank checks. printed on the class. For one character, the vertical segment is its Match the outline of a particular symbol so that the character is readable to the naked eye. Ru. On the other hand, in that character, between vertically different segments, horizontally The different spacing patterns uniquely define a particular symbol, so the character Become machine-readable. This idea is based on Fei + 5ell No. 3,309,667 to et al., and L'hilli et al. No. 3,688,955 to erl.

ハンチエフ）　（Ｈａｏｃｈｅ１１１他に対する米国特許第３，５３９，９８９号、及びラフエベールズ（Ｌａｔｅマｅｒｓ）他に対する米国特許第４．０５３．７３７号に開示されているように　周知の技術では、文書から垂直方向にセグメント化されたキャラクタを読み込むために、ピーク検出を採用している１文書イメージ・データは狭い垂直方向の窓により水平方向に走査され、窓におけるビクセル密度が寥の水平方向位置の関数としてプロットされる。プロットにおけるピークはピーク検出器により位置決めされ　ピーク間のスペースのパターンは既知シンボルのパターンと比較される。この技術の一つの利点は　垂直窓の狭さが文書上に印刷された垂直方向のセグメントの厚さにおける変動からの影響を最小化させることである。(U.S. Patent No. 3,539,989 to Haoche 111 et al. No. 4.053, and U.S. Patent No. 4.053 to Latemers et al. ．． As disclosed in No. 737, well-known techniques include vertically segmenting a document A document that employs peak detection to read characterized characters. The image data is scanned horizontally through a narrow vertical window; The density of cells is plotted as a function of the horizontal position of the pig. in the plot The peaks are located by a peak detector, and the pattern of the spaces between the peaks is already It is compared with the pattern of knowledge symbols. One advantage of this technology is that the vertical window is narrow. Minimizes the impact from variations in the thickness of vertical segments printed on documents It is to make it possible.

一つの問題は、文書そのものがスキニーを持ったり、又は湾曲して印刷されたために、垂直方向にセグメント化されたキャラクタのイメージがそのイメージの平面上で歪められたり、又は回転されることがあるということにある。スキューを持ったイメージは垂直方向にセグメント化されたキャラクタの認識を妨げることがある。特に、スキニーの角度が増加するに従って４　ピーク検出器により検出されるピークが拡散又は幅広となる。更に、このことはピーク間の距離を測定して既知のシンボルのパターンと一致するパターンを形成できるようにする精度を低下させる。更にスキューの角度が増加するに従い、ある点でピークが広げられ、かつ低いものとなるので　ピーク間の距離を確実に測定できず、従って与えられたキャラクタにおける間隔）＜ターンを正確に定められず、キャラクタを互いに区別することができない。One problem is that the document itself has skinny or curved prints. For example, a vertically segmented character image is The problem is that it can be distorted or rotated on the surface. skew Images that have been segmented vertically may impede recognition of characters There is. In particular, as the skinny angle increases, detected by the 4 peak detector peaks become diffuse or broad. Furthermore, this allows us to measure the distance between peaks. accuracy that allows the formation of patterns that match patterns of known symbols. lower. Furthermore, as the skew angle increases, the peak broadens at a certain point. , and will be low, so the distance between the peaks cannot be reliably measured, and therefore the given (distance between characters) cannot be distinguished.

従って、本発明の目的は、スキニーが存在しても信頼性を低下させることなく垂直方向にセグメント化されたキャラクタを識別することにある。Therefore, it is an object of the present invention to improve the performance of the present invention without reducing reliability even in the presence of skinny. The purpose is to identify vertically segmented characters.

発男Ω開示垂直方向にセグメント化されたキャラクタは、光学式文字認識のために、短い空間周波数を有する平行な複数の垂直ラインからなるマスクによる畳み込みにより前処理される。セグメント化された各キャラクタに対するマスクの水平方向アライメントは、最大相互相関が得られるまで調整される１次いで、マスクとキャラクタ・イメージの積が形成される。マスクの各垂直ラインに沿った積イメージにおける「オン」画素数のヒストグラムは、このような既知シンボルのヒストグラムのライブラリと比較され、最良の一致によりキャラクタ・イメージにより表わされているシンボルを識別する。Male birth Ω disclosure Vertically segmented characters have short blanks for optical character recognition. By convolution with a mask consisting of parallel vertical lines with frequencies between Pre-processed. Horizontal alignment of the mask for each segmented character The mask and character are adjusted in the first order until the maximum cross-correlation is obtained. A product of vector images is formed. to the product image along each vertical line of the mask. The histogram of the number of “on” pixels in The best match represents the character image. identify the symbol being used.

好ましい実施例において、マスクにおける垂直ラインの間隔は１画素であるがいずれの場合であっても垂直方向にセグメント化されたキャラクタのうちの種々のキャラクタのパターンに採用された、あり得る全ての画素間隔の共通約数でなければならない、好ましい実施例において、マスクにおける垂直ラインの厚さは１画素であるが　垂直方向のセグメントの厚さが多数画素である場合に、厚さを更に厚くすることができる。In a preferred embodiment, the vertical lines in the mask are spaced one pixel apart. Even in case of misalignment, various of the vertically segmented characters Must be a common divisor of all possible pixel spacings used in the character's pattern. In the preferred embodiment, the thickness of the vertical lines in the mask is 1 pixel, but if the vertical segment thickness is many pixels, change the thickness. It can be made thicker.

最大相互相関点においてマスクを水平方向に位置決めすることの利点は、キャラクタが未知量のスキューを持っていてもキャラクタにおける垂直方向のセグメントを対応する垂直マスク　ラインのキャラクタ上に中心付けることである。マスク・イメージとキャラクタ　イメージとの積をめる利点は、キャラクタにおいて垂直方向のセグメントの厚さの変動を結果として得られる積イメージから除去されることにある。垂直マスク　ラインに沿って「オン」画素のヒストグラムを計算することの利点は、キャラクタ・イメージにスキューが存在していても、異なるキャラクタの間隔パターン間の区別を確実にすることにである。このことは　複数の垂直マスク・ラインがそれぞれ１垂直キヤラクタ・セグメント以上と交差するものであっても成立する。The advantage of positioning the mask horizontally at the point of maximum cross-correlation is that vertical segments in the character even if the character has an unknown amount of skew. centering the mark on the character of the corresponding vertical mask line. trout The advantage of multiplying the character image by the character image is that Vertical segment thickness variations are removed from the resulting product image. It's about being able to do something. Vertical Mask Plans the histogram of “on” pixels along the line. The advantage of calculating is that even if there is skew in the character image, different The objective is to ensure differentiation between character spacing patterns. This thing is Multiple vertical mask lines each intersect one or more vertical character segments It holds true even if it is.

２血の間車ｌ認朋添付する図を参照して本発明を以下で詳細に説明する。2 Bloody Car Licensed Tomo The invention will be explained in more detail below with reference to the accompanying figures.

第１図は機械により読み取り可能　かつ肉眼により読み取り可能な垂直方向にセグメント化された従来技術のキャラクタであって　本発明を含む装置により読み出されるべきキャラクタの例示的文書イメージを示す図であり第２図は本発明を含み　第１図のキャラクタを読み出すシステムのブロック図であり第３図は第１図のイメージから垂直方向にセグメント化されたキャラクタと、垂直方向に配列されたマスクとについて第２図のシステムにより実行される畳み込みを示す図であり。Figure 1 is a machine-readable and eye-readable vertical orientation. Characters of the prior art that have been made into FIG. 2 is a diagram showing an exemplary document image of characters to be displayed; FIG. This is a block diagram of a system for reading out the characters shown in Figure 1. Figure 3 shows a character segmented vertically from the image in Figure 1 and a vertically segmented character. Convolution performed by the system of Figure 2 with orthogonally aligned masks This is a diagram showing the details.

第４図は第３図の畳み込みの積の図であり、第５図は第４図の積の「オン」画素のヒストグラムを示す図であり、第６図は第５図に対応する図であり　第５図のヒストグラムから構築された二進数の符号ワードを示す図である。Figure 4 is an illustration of the convolution product of Figure 3, and Figure 5 is an illustration of the "on" pixel of the product of Figure 4. FIG. 6 is a diagram corresponding to FIG. 5, and FIG. FIG. 3 illustrates a binary code word constructed from a histogram.

杢光叫の去施倒第１図を参照すると　ファイセル（Ｆｅ＋５ｅｌｌ他に対する米国特許第３．３０９，６６７号に開示されている型式の垂直方向にセグメント化されたキャラクタは　それぞれ肉眼により読み取り可能なシンボルの内輪郭と外輪郭との間に位置する垂直方向に平行なセグメントを構成する１例えば、数字の「６」は文書イメージ１００内で垂直方向のセグメン）ｌｏｏａ、１００ｂ、１００ｃ・・により表わされ　各セグメントの終端は数字の「６」の内輪郭及び外輪郭に接している。このように　各セグメントの長さはキャラクタを肉眼により読み取り得るように選択されている。隣接するセグメント間で水平方向に測定される間隔は異なっており　各シンボルに固有な線形のシーケンス即ち間隔のパターンを定めている。従って、数字の「６」の場合に、第１対のセグメント１００ａ、１００ｂ間の間隔は、第２対のセグメン）１００ｂと　１０００との間の間隔より広く、一方策３対のセグメン）１００ｃと、１００ｄとの間の間隔は前記２つのものより太きい０間隔パターンは容易に確認され、これを簡単な装置により一組の２知のパターンと合せられる。従って、隣接するセグメントの各対間における間隔はキャラクタを機械により読み取り得るように選択される。Mokkou's death Referring to FIG. 1, U.S. Patent No. 3.3 to Fe+5ell et al. Vertically segmented characters of the type disclosed in No. 09,667 Each symbol is located between the inner and outer contours of the symbol that can be read by the naked eye. For example, the number "6" is Vertical segment within image 100) looa, 100b, 100c... The end of each segment touches the inner and outer contours of the number "6". Ru. In this way, the length of each segment is such that the character can be read with the naked eye. The sea urchin has been selected. The spacing measured horizontally between adjacent segments is different It defines a unique linear sequence or pattern of intervals for each symbol. Ru. Therefore, in the case of the number "6", between the first pair of segments 100a and 100b is wider than the spacing between the second pair of segments) 100b and 1000; The interval between 100c and 100d (segments of 3 pairs of strategies) is larger than the above two. The thick 0-interval pattern is easily confirmed and can be detected using a simple device. Matches the pattern. Therefore, the spacing between each pair of adjacent segments is The character is selected to be machine readable.

本発明により第１図のキャラクタを読み取るシステムを第２図に示す、スキャナ１０１は第１図に示す型式のキャラクタの文書を走査し、文書イメージを発生する。この文書イメージは、好ましくは　画素が垂直な列及び水平な行に編成された二進数のデータ　ブロックであって、その文書イメージにおいて各二進ビットが２値トーン画素を表わしている。相関器１０２は、スキャナ１０１からの文書イメージと、第２図のメモリ１０４に記憶され、第３図に示す平行な垂直ラインを除外してなるマスク　イメージ３００との積を計算する。マスク・イメージ３００は　好ましくは　画素が垂直な列及び水平な行に編成された他の二進数のデータ・ブロックであって、各二進ビットがこのマスク　イメージ３００における画素を表わしている。A system for reading the characters of FIG. 1 according to the present invention is shown in FIG. 2, using a scanner. 101 scans a character document of the type shown in FIG. 1 and generates a document image. Ru. This document image preferably has pixels organized into vertical columns and horizontal rows. A block of binary data that represents each binary bit in the document image. represents a binary tone pixel. The correlator 102 receives the document from the scanner 101. image and parallel vertical lines stored in memory 104 of FIG. 2 and shown in FIG. Calculate the product with the mask image 300 excluding . mask image 3 00 is preferably another binary number whose pixels are organized in vertical columns and horizontal rows. data block, where each binary bit is It represents a pixel.

第３図に示すように、マスク・イメージ３００における各垂直マスク・ライン３００ａ、３００ｂ、３００Ｃ−・は、厚さが１イメ一ジ画素である。垂直マスク・ライン３００ａ、３００ｂ、３００ｃ・・は　全で第１図のキャラクタにおいて隣接する垂直方向のセグメント間の最小インターバルに等しい距離ｄにより均一に隔てられている。その代りに、マスク　インターバルｄは、第１図に対応し。As shown in FIG. 3, each vertical mask line 3 in the mask image 300 00a, 300b, 300C-. have a thickness of one image pixel. vertical mask ・The lines 300a, 300b, 300c, etc. all correspond to the characters in Figure 1. by a distance d equal to the minimum interval between adjacent vertical segments. are separated by one. Instead, the mask interval d corresponds to FIG. .

セグメント化された全てのキャラクタ・セットに採用された異なる全てのインターバルＡ、Ｂ、Ｃ・−の共通除数でもよい、更に、他の実施例として、垂直マスク・ライン３００ａ、３００ｂ、３００Ｃ・・の輻は、第１図のキャラクタの各垂直方向におけるセグメントの厚さＴが多数のイメージ画素である場合に　１イメ一ン画素より大きくてもよい。All different interfaces adopted for all segmented character sets In addition, as another embodiment, the vertical mass The convergence of the lines 300a, 300b, 300C, etc. of each character in FIG. If the thickness of the segment in the vertical direction T is a number of image pixels, It may be larger than the main pixel.

文書イメージ１００とマスク　イメージ３００との積を有効にするためには、まず文書イメージ１００に対してマスク　イメージ３００の正しい水平位置を見出す必要がある。垂直セグメントがマスク　ラインに対して一平行でなく一スキューを持っていても、正しい水平位置は各垂直セグメント１００ａ、１０１００ｂ１００が垂直マスク　ライン３００ａ、３００ｂ、３００ｃｍのうちで対応する一つに中心付けされたものである。スキューを持った垂直セグメントが垂直マスク　ライン上に中心付けられることを第３図に、屯線により示す、マスク・イメージ３００のこの正しい水平位置を見出すために、相関器１０２はマスク　イメージ３００を増分ステップにより文書イメージ１００上を水平方向に移動させ各ステップにおいて文書イメージ１００とマスク　イメージ３００との画素毎の積を計算する０文書イメージ１００に対するマスク　イメージ３００の移動方向は、特に文書イメージ１００が第３図に示すようにスキューを持っているときは文書イメージ１００における水平画素行の方向と平行であってはならないことに注意すべきである。好ましくは、各増分ステップの長さは、垂直マスク・ライン間の距離ｄの一部分である。To enable the product of document image 100 and mask image 300, First, find the correct horizontal position of the mask image 300 relative to the document image 100. It is necessary to If the vertical segment is not parallel to the mask line but one skew The correct horizontal position is for each vertical segment 100a, 10100b. 100 corresponds to vertical mask lines 300a, 300b, 300cm It is centered on one thing. A vertical segment with skew is a vertical mass The mask image is centered on the square line, which is shown in Figure 3 by the tomb line. To find this correct horizontal position of image 300, correlator 102 uses a mask image. page 300 is moved horizontally over document image 100 in incremental steps. In the step, the pixel-by-pixel product of the document image 100 and the mask image 300 is The moving direction of the mask image 300 for the document image 100 is , especially when the document image 100 has a skew as shown in FIG. Note that it must not be parallel to the direction of the horizontal pixel rows in the image 100. should be taken into consideration. Preferably, the length of each incremental step is between vertical mask lines. is a part of the distance d.

このようにして、相関器１０２により発生した積イメージ４００の例を第４図に示す、第４図の積イメージ４００は　文書イメージ１００における各画素の二進値をマスク−イメージ３００において対応する画素と掛算することにより、発生したものである。従って　積イメージ４００は垂直マスク・ライン３００ａ３００ｂ、３００Ｃ・の画素位置に沿って配置された「オン」画素を有する。第４図の積イメージ４００は文書イメージ１００に対するマスク・イメージ３００の「正しい」水平位置に一致して対応しており　文書イメージ１００において各垂直キャラクタ　セグメント１００ａ、１００ｂ・・は対応する垂直マスク　ライン３００ａ、３００ｂ　上に中心付けられている。しかし、この点において文書イメージ１００の正しい水平位置は達成され得るとは限らないことを理解すべきである。An example of a product image 400 generated by the correlator 102 in this manner is shown in FIG. The product image 400 shown in FIG. 4 is the binary representation of each pixel in the document image 100. generated by multiplying the value by the corresponding pixel in the mask-image 300. This is what I did. Therefore, product image 400 is vertical mask line 300a30 It has "on" pixels located along pixel positions 0b, 300C. Figure 4 The product image 400 is the product image 400 of the mask image 300 for the document image 100. Each vertical position in the document image 100 corresponds to the correct horizontal position. Character segments 100a, 100b... are the corresponding vertical mask lines It is centered on 300a, 300b. However, the document It should be understood that the correct horizontal position of the image 100 may not always be achieved. be.

マスク　イメージ３００の各増分ステップにおいて相関器＋０２が発生する各積イメージ４００のために、加算器１０３はマスク　イメージの各垂直マスクライン３００ａ、３００ｂ、３００ｃに沿い「オン」画素の数の総和を計算する。Each product generated by the correlator +02 at each incremental step of the mask image 300 For image 400, adder 103 adds each vertical mask line of the mask image. The sum of the number of "on" pixels along lines 300a, 300b, 300c is calculated.

相関器１０２が文書イメージ１００上のマスク　イメージ３００を増分的にステップするに従って　プロセッサ１０５は各増分ステップにより加算器１０３が計算した全垂直マスク　ライン３００ａ、３００ｂ　についての総和を対応するメモリ　ビンに格納して、各増分ステップについて第５図に示すようなヒストグラムを形成する。プロセンサ１０５は、垂直キャラクタ・セグメン）１００ｃと１００ｄとの間で少なくとも最長のインターバルＣ（第３図）を覆う多数の増分ステップを完了した後に、格納した総和を調べ　どの増分ステップにおいて加算器１０３が最大の総和を発生したかを判断する。加算器１０３が最大の総和を発生した増分ステップは１文書イメージ１０ｏの「正しい」水平位ｌてあり、そこでは各垂直キャラクタ　セグメントが、第３図に示すように、垂直マスク・ラインのうちの対応する一つの上に中心付けられている。Correlator 102 incrementally steps mask image 300 over document image 100. With each incremental step, processor 105 adds The sum of the calculated vertical mask lines 300a and 300b is calculated using the corresponding menu. For each incremental step, a histogram like the one shown in Figure 5 is created. form a system. The prosensor 105 has vertical character segments) 100c and 1 00d, covering at least the longest interval C (Fig. 3). After completing a step, examine the stored sum and at which incremental step the adder 103 to determine whether the maximum sum has been generated. Adder 103 generates the maximum sum The incremental steps taken are at the "correct" horizontal position of the document image 10o, where is a vertical mask line where each vertical character segment is shown in Figure 3. centered on the corresponding one of the two.

このようにして、プロセッサ１０５は、「正しい」水平位置を識別すると直ちに、乗算器１０６に指令して、スキャナから受け取る文書イメージ１００に対して正しい増分ステップでメモリ１０４からのマスク　イメージ３００を配置させかつイメージを互いに掛算させて第４図の積イメージを発生させ１次いでこれをメモリ１０７に記憶させる。加算器１０８は第４図の積イメージから第５図のヒストグラムを計算する。他の実施例において　乗算器１０６及び加算器１０８は省略される。当該他の実施例において、プロセッサ１０５は、マスク・イメージ３００の全ての増分ステップについて格納した全てのヒストグラムからそのビンが最高の総和を有するヒストグラム５００を選択すると共に、このヒストグラムを出力する。最高の総和を有するヒストグラムはマスク・イメージ位置の増分ステップに対応しており、この位置では文書イメージ１００の垂直キャラクタ　セグメントがマスク　イメージ３００の対応する垂直マスク・ライン上に中心付けられている。In this way, as soon as processor 105 identifies the "correct" horizontal position, , for the document image 100 received from the scanner. Place the mask image 300 from memory 104 in the correct incremental steps. The two images are multiplied together to generate the product image shown in Figure 4. It is stored in the memory 107. The adder 108 converts the histogram in FIG. 5 from the product image in FIG. Calculate the totogram. In other embodiments, multiplier 106 and adder 108 may be omitted. Omitted. In such other embodiments, processor 105 may include mask image 3 From all histograms stored for all incremental steps of 00, that bin is Select the 500 histograms with the highest sum and set this histogram to Output. The histogram with the highest sum is the incremental step of the mask image position. This position corresponds to the vertical character segment of document image 100. center on the corresponding vertical mask line of the mask image 300. It is.

好ましい実施例において加算器１０８により発生した。又は他の実施例においてプロセッサ１０５により選択されたヒストグラムは、プロセッサ１０９に入力される。プロセッサ１０９は　０又は非常に小さな値を有する連続的なビンを検索することにより、隣接するキャラクタのヒストグラムから個別的なキャラクタのヒストグラムを分離する。これらのビンは文書イメージ１００において隣接す較器１１０に送信する。比較器１１０は、当該技術分野において周知のパターン一致技術を用いて　ヒストグラムとメモリ１１１に記憶されている基準ヒストグラムのライブラリのうちの各一つとの間の相互相関各計算する。比較器１１０は最高相関を有する基準ヒストグラムを「勝者」と宣言し、従ってこのキャラクタ・イメージを識別するものである。generated by adder 108 in the preferred embodiment. or in other embodiments The histogram selected by processor 105 is input to processor 109. It will be done. Processor 109 searches for consecutive bins with 0 or very small values. By Separate histograms. These bins correspond to adjacent bins in the document image 100. 110. Comparator 110 uses a pattern well known in the art. The histogram and the reference histogram stored in the memory 111 are Each calculates the cross-correlation between each one of the program libraries. The comparator 110 Declare the reference histogram with a high correlation as the “winner” and therefore It identifies the image.

基準ヒストグラムのライブラリは　スキャナ１０１に既知のシンボルの連続的なイメージを供給して、プロセッサ１０９が受け取ったヒストグラムをメモリ１１１に対応するシンボルの識別と共に格納することにより、第２図のシステムの「プログラムコモードにより発生される。The library of reference histograms consists of a series of symbols known to the scanner 101. The histogram received by the processor 109 is stored in the memory 11. By storing the symbol with the identification corresponding to 1, the system of FIG. Generated by program commode.

本発明の他の実施例において　プロセッサ１０９は、個別的な各キャラクタのヒストグラムを比較器１１０の代わりに符号ワード変換器１１２に送信する。符号ワード変換器１１２は、「オン」画素カウントが０でない第５図のヒストグラム５００の各ビン５００ａ、５００ｂ、５００ｃ・・に二進数の「１」を割り付け。In another embodiment of the invention, the processor 109 is configured to The stogram is sent to codeword converter 112 instead of comparator 110. sign Word converter 112 converts the histogram of FIG. Assign a binary number "1" to each of the 500 bins 500a, 500b, 500c... .

「オン画素カウントが０の各ビンに二進数の「０」を割り付ける０次いで、符号ワード変換器１１２は連続する１及びＯを配列して第６図に示す符号ワードを形成する比較器１１３は符号ワードをメモリ１１４に格納されている基準符号ワードのライブラリと比較する。比較器１１３は、符号ワード変換器１１２により形成された符号ワードに対しで最高の相関を有する基準符号ワードを「勝者」であると宣言し、これによって対応するキャラクタを識別する。``Assign a binary ``0'' to each bin with an on pixel count of 0.0 then sign Word converter 112 arranges consecutive 1's and O's to form the code word shown in FIG. Comparator 113 converts the code word into a reference code word stored in memory 114. Compare with the code library. The comparator 113 is configured by the code word converter 112. The “winner” is the reference codeword with the highest correlation to the codewords created. , and identify the corresponding character.

基準符号ワードのライブラリは、スキャナ１０１に既知のキャラクタの連続的なイメージを供給して　符号ワード変換器１１２により形成される符号ワードをメモリ１１４に格納するＣとにより、「プログラム」モードにおいて第２図のシステムにより発生される。The library of reference code words consists of a sequence of characters known to the scanner 101. the code word formed by the code word converter 112. C stored in the memory 114 allows the system shown in FIG. generated by the system.

タ　び　のｒ本発明は　肉眼により読み取り可能、かつ機械により読み取り可能な型式の垂直方向にセグメント化されたキャラクタのイメージについてデスキューし　ろ渡して読み取るシステムとして有用である。Tabi no r The present invention provides a visually readable and machine readable type vertical Deskew the image of the character segmented in the direction It is useful as a reading system.

口にセグメント　されたキャラクタのデスキューイングろ　びＵのための　マスキング几理炙−約−１光学式文字認識のために１画素の空間周波数を有する平行な垂直ラインからなるマスクによる畳き込みによってスキュー及び雑音を除くように、垂直方向にセグメント化されたキャラクタを前処理する。各セグメント化キャラクタに対するマスクの水平方向アライメントは、最大相互相関を見出すまで調整される０次いで、マスクとキャラクタ　イメージとの積が形成される。マスクの各垂直ラインに沿って積イメージにおける画素数のヒストグラムを既知のシンボルのヒストグラムのライブラリと比較し、最良の一致によりキャラクタ　イメージにより表わされたシンボルを識別する。その代りに、前記ヒストグラムを二進数の符号ワードに変換しこれを一組の既知のシンボルに対応する符号ワードのライブラリと比較する。Mass for deskewing and U of characters segmented into mouth King 几り Roasted - about -1 Consisting of parallel vertical lines with a spatial frequency of 1 pixel for optical character recognition Segment vertically to remove skew and noise by convolution with a mask. Preprocessing the mentized character. The map for each segmented character The horizontal alignment of the cross-correlation is adjusted by zero-order until finding the maximum cross-correlation , the product of the mask and the character image is formed. for each vertical line of the mask The histogram of the number of pixels in the product image along with the histogram of known symbols character image with the best match. Identifies the symbol. Instead, the histogram can be expressed as a binary code word and compare this with a library of codewords corresponding to a set of known symbols do.

手続補正書彷炙平成　５年ケ月、修′日≦Procedural amendment report 1993 month, school day ≦

Claims

[Claims]

1. Segment vertically from a document image in horizontal and vertical rows of parallel pixels In an optical character recognition device that reads characters that are means for defining a mask image with a mask image and one of a plurality of incremental steps; positioning the mask image with respect to the document image to an incremental mask that correlates different pixel pairs between the image and the mask image; stepping means and corresponding pixels of said document image and said mask image; means for generating a product image by multiplying the pairs together; which corresponds to the maximum correlation between the document image and the master image between the document image and the master image; Generate a histogram of the product image corresponding to the incremental step with maximum correlation means for determining individual characters in the document image from the histogram; means to determine the identity of the An optical character recognition device characterized by:

2. The means for generating a histogram is arranged in a plurality of vertical columns in the product image. 2. The method according to claim 1, characterized by means for calculating the sum of "on" pixels in the Optical character recognition device.

3. Said means for generating a product image generates a product image for each said incremental step. and the means for generating the histogram is the product image of each of the incremental steps. said means for generating a histogram from said incremental steps and determining a maximum correlation; which of the pixels is calculated by said means of calculating the sum of "on" pixels. Claim 2 characterized by a means for determining whether the invention corresponds to the maximum sum of optical character recognition device.

4. The incremental mask stepping means steps the mask image in the parallel vertical direction. Directly move on the document image horizontally with respect to the mask line and vertically The mask line and the vertical character segment are less within the skew angle. 2. The optical character recognition device according to claim 1, wherein the optical character recognition device is substantially parallel to the optical character recognition device. Place.

5. a plurality of different adjacent ones of said vertical character segments; the pairs are separated by different separation lengths, and the vertical mask lines are separated by different separation lengths. Each incremental step is arranged at a spatial interval d that is a common divisor of the separation length. is a fraction of d, and the total incremental step covers a distance at least equal to d. 5. The optical character recognition device according to claim 4, characterized by:

6. The thickness of the vertical mask line is less than the thickness of the vertical character segment. 6. The optical character recognition device according to claim 5, wherein the optical character recognition device is characterized by being thin.

7. The vertical mask line has a thickness equal to one pixel, and the vertical mask line The spatial interval d of in is characterized by being one pixel. The optical character recognition device according to claim 6.

8. The means for determining the identity of the character stores a set of reference histograms. means for storing each histogram corresponding to the maximum correlation in the set of reference histograms; and by means of identifying the reference histogram with the greatest correlation compared to the gram. An optical character recognition device according to claim 1, characterized in that:

9. The means for determining the identity of the character is configured to determine the identity of the character corresponding to the maximum correlation. Convert each summation in each histogram to binary bits and create one histogram. means for forming an image code word of corresponding binary bits; means for storing a set of reference code words; and means for storing said image code words in said set of reference code words; Compare with the reference histogram and identify the reference code word with the greatest correlation 2. An optical character recognition device according to claim 1, characterized by:

10. Said means for converting each sum into binary bits converts each sum into binary bits if said sum is not zero or zero. according to claim 9, characterized by converting to 1 or 0 according to whether Optical character recognition device.

11. The plurality of vertical columns for calculating the summation of the histogram are 3. The optical character recognition device according to claim 2, characterized by screen lines.

12. In optical character recognition methods, document images in horizontal and vertical rows of parallel pixels are A method for reading vertically segmented characters from a Steps defining a mask image characterized by a number of parallel vertical lines and, forming the mask image on the document image in one of a plurality of incremental steps; the document image and the mask image. correlating the pairs; multiplying corresponding pixel pairs of the document image and the mask image together; a step of generating a product image by Determine whether the correspondence corresponds to the maximum correlation between the document image and the mask image. a maximum correlation between the document image and the mask image; generating a histogram of the product image corresponding to the incremental step of and identify individual characters in the document image from the histogram. The step of determining the difference How to read vertically segmented characters characterized by .

13. Said step of generating a histogram further includes a plurality of said product images. A request characterized by the step of calculating the sum of "on" pixels in a vertical column. The method according to claim 12.

14. Said step of generating a product image further includes each one of said incremental steps. It is characterized by generating a product image for and generating a histogram. The step generates a histogram from the product image of each incremental step. said step of determining the maximum correlation characterized by said increment Which of the steps calculates the sum of "on" pixels? characterized by the step of determining whether the calculated sum corresponds to the largest sum. 14. The method according to claim 13.

15. The step of placing the incremental mask further includes placing the incremental mask on the document image. moving said mask image horizontally with respect to parallel vertical mask lines; , the vertical mask line and the vertical character segment have a skew angle 13. The method according to claim 12, characterized by being parallel within.

16. adjacent pairs of vertical character segments are different; separated by separation lengths, and the vertical mask lines are a common divisor of the different separation lengths. , and each incremental step is a fraction of d. , and the total incremental step is by covering a distance at least equal to d. 16. The method of claim 15 characterized.

17. The vertical mask line has a thickness equal to that of the vertical character segment. 17. The method of claim 16, characterized by being thinner than thick.

18. The vertical mask line has a thickness equal to one pixel, and the vertical mask line has a thickness equal to one pixel. The spatial interval d of the line was characterized by being one pixel. 18. The method according to claim 17.

19. Said step of determining said identity of said character comprises a set of reference histories. storing each histogram corresponding to the maximum correlation. A set of reference histograms is compared to identify the reference histogram with the greatest correlation. 13. The optical character recognition device according to claim 12, characterized by the step of:

20. The step of determining the identity of the character is based on the maximum correlation. Convert each summation in each corresponding histogram into binary bits to obtain one histogram. forming an image code word of binary bits corresponding to the gram; storing a set of reference code words; and storing the image code words in the set of reference code words; identify the reference codeword with the greatest correlation by comparing it with a set of reference histograms. 13. The method according to claim 12, characterized by the step of: