JPS60120485A

JPS60120485A - Pattern recognizing system

Info

Publication number: JPS60120485A
Application number: JP58226853A
Authority: JP
Inventors: Yasuaki Nakano; 中野　康明; Kunihiro Okada; 邦弘岡田; Osamu Kunisaki; 国崎　修; Hiromichi Fujisawa; 浩道藤澤; Toshihiro Hananoi; 花野井　歳弘; Masashi Kushibiki; 櫛引　政司; Akizo Kadota; 門田　彰三
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1983-12-02
Filing date: 1983-12-02
Publication date: 1985-06-27
Also published as: JPH0427592B2

Abstract

PURPOSE:To process a preprinted part in the same way as that applied to a part printed in dropout colors, by subtracting a reference slip image from an inputted slip image in which a pattern to be read is recorded. CONSTITUTION:By using a slip (reference slip) on which no patterns to be read, such as sharacters, graphic forms, etc., are recoded, information (reference slip image) on the reference slip is stored in a memory 51. The slip image of an inputted slip is taken into the memory 51 in the same way and stored in another memory 52. A subtraction process between the normalizing image (memory 5) of the inputted slip and the thicker image (memory 56) of the reference slip is performed by a control section 2 and the part of the reference slip image contained in the inputted slip is erased and the remaining image is stored in a memory 56. The content of the memory 56 is the characters, symbols, or graphic forms recorded on the inputted slip and a recognition processing section 6 recognizes the content.

Description

【発明の詳細な説明】〔発明の利用分野〕本発明はパターン認識方式に係り、特に複雑な背景の中
に書かれた文字や図形の認識に好適なパターン認識方式
に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Application of the Invention] The present invention relates to a pattern recognition method, and particularly to a pattern recognition method suitable for recognizing characters and figures written in a complex background.

[Background of the invention]

従来のパターン認識方式、例えば文字認識方式は、ドロ
ップアウトカラーで文字記入枠を印刷し、その中に記入
された文字を読み取るものが大半であった。しかし、印
刷コストの増加により、文字認識方式を意識せずに任意
に作成された帳票を使用したいという要求が強くなって
いる。このような帳票を使用するときの問題点は、既に
印刷され丸文字や表・線などＣ以下プレプリントと呼ぶ
）が妨害情報となり、読取対象文字を切り出すことがで
きないことである。プレプリントの部分も含めて認識し
、その内容を理解してから読取対象文字だけを編集すれ
ばよいとの考えもあるが、完全な内容理解は困難である
こと、本来読み取らなくてもよい部分を読み取るため読
み取り速度が低下することなどの問題がある。Most of the conventional pattern recognition methods, such as character recognition methods, print a character entry frame in dropout color and read the characters written in the frame. However, due to the increase in printing costs, there is a growing demand for using forms that are arbitrarily created without being aware of the character recognition method. The problem when using such a form is that already printed circular characters, tables, lines, etc. (hereinafter referred to as "preprints") become interfering information, making it impossible to extract the characters to be read. There is an idea that it would be possible to recognize the preprint including the part, understand its content, and then edit only the characters to be read, but it is difficult to fully understand the content, and it is necessary to read parts that do not originally need to be read. There are problems such as a decrease in reading speed due to the reading of .

[Purpose of the invention]

本発明の目的は、プレプリントｆ壱する任意の帳票を用
いても、この帳票に書かれた読取対象パターン、例えば
、文字、記号１図形等のみを容易に読み取ることのでき
るパターン認識方式を提供することにある。An object of the present invention is to provide a pattern recognition method that can easily read only a pattern to be read written on a preprinted form, such as a character, a symbol, or a single figure, even if an arbitrary form is used. It's about doing.

[Summary of the invention]

かかる目的を達成するため、本発明においては読取るべ
き対象パターンの記入していない帳票のディジタルイメ
ージＣ基準帳票イメージ）を予め記憶しておき、読取対
象パターンの記入された入力帳票イメージから基準帳票
イメージを減算することにより、読取対象パターン、例
えば記入文字パターンのみを残し、見かけ上プレプリン
ト部分をドロップアウトカラーで印刷されたものと同様
に処理できるようにしたことを特徴とするものである。In order to achieve this purpose, in the present invention, a digital image C (reference form image) of a form in which the target pattern to be read is not filled out is stored in advance, and a reference form image is created from the input form image in which the target pattern to be read is written. By subtracting , only the pattern to be read, for example, the written character pattern, is left, and the preprint portion can be processed in the same manner as if it were printed in dropout color.

[Embodiments of the invention]

以下、本発明を図面を用いて詳細に説明する。 Hereinafter, the present invention will be explained in detail using the drawings.

第１図は本発明の認識方式を実施する装置の構成を示す
ブロック図である。装置の各部はバス１に接続され、全
体の動作は制御部２によシ制御される。帳票３上の情報
（帳票イメージ）は光電変換装置４によ多走査され、光
電変換され、さらにディジタル化され、バス１を介して
メモリ５１に格納される。ディジタル化の際、公知の高
効率符号化処理を行なってもよく、これによシ帳票イメ
ージを記憶するメモリの記憶容量を節約でき、コストの
面で肩利になることがある。また、以下の説明では１画
素１ビツトに二値化するものとするが、１画素を多値で
表現してもよく、またカラースキャナにより色情報を付
与してもよい。FIG. 1 is a block diagram showing the configuration of an apparatus that implements the recognition method of the present invention. Each part of the device is connected to a bus 1, and the overall operation is controlled by a control unit 2. The information (form image) on the form 3 is scanned multiple times by the photoelectric conversion device 4, photoelectrically converted, further digitized, and stored in the memory 51 via the bus 1. At the time of digitization, a known highly efficient encoding process may be performed, which can save the storage capacity of the memory that stores the form image, which may be advantageous in terms of cost. Further, in the following explanation, it is assumed that each pixel is binarized into one bit, but one pixel may be expressed with multiple values, or color information may be added using a color scanner.

まず、文字又は図形等の読取対象パターンを記入してい
ない帳票（基準帳票）を用い、この基準帳票上の情報（
基準帳票イメージ）をメモリ５１の中に格納する。この
基準帳票イメージに対し、制御部２によシ後述する傾き
検出処理、傾き補正処理を行って得られる正規化イメー
ジがメモリ５２に格納される。この正規化イメージに対
し、制御部２のプログラム制御により後述する線抽出処
理が行われ、抽出された線パターンがメモリ５３に格納
される。その後、上記正規化イメージに対し、制御部２
により公知の太め処理を行い得られた太めイメージがメ
モリ５４に格納される。First, use a form (standard form) in which no pattern to be read, such as characters or figures, is written, and the information on this standard form (
A reference form image) is stored in the memory 51. A normalized image obtained by performing tilt detection processing and tilt correction processing, which will be described later, by the control unit 2 on this reference form image is stored in the memory 52. Line extraction processing, which will be described later, is performed on this normalized image under program control of the control unit 2, and the extracted line pattern is stored in the memory 53. After that, the control unit 2
A thick image obtained by performing a known thickening process is stored in the memory 54.

以上で基準帳票に対する処理が終了し、入力帳票すなわ
ち読取対象パターンである文字、記号１図形等の記入さ
れた帳票の処理に移る。The processing for the standard form is thus completed, and the processing moves on to the input form, that is, the form on which characters, symbols, figures, etc., which are patterns to be read, have been written.

入力帳票の帳票イメージは基準帳票と同様にメモリ５１
の中に取り込まれ、その正規化イメージがメモリ５２に
格納される。入力帳票の正規化イメージから抽出された
線パターンがメモリ５５に格納される。制御部２により
、基準帳票の線パターン（メモリ５３）と入力帳票の線
パターン（メモリ５５）とについて後述する位置合せ処
理が行われ、位置補正量（ｇ、ｈ）が制御部２の中の一
時記憶に格納される。制御部２により、入力帳票の正規
化イメージ（メモリ５２）と基準帳票の太めイメージ（
メモリ５６）との位置合せ処理及び減算処理が行われ、
入力帳票中に含捷れる基準帳票イメージの部分が消去さ
れ、残りのイメージがメモリ５６に格納される。位置合
せ処理及び減算処理の詳細については後述する。The form image of the input form is stored in the memory 51 in the same way as the standard form.
, and its normalized image is stored in memory 52. The line pattern extracted from the normalized image of the input form is stored in the memory 55. The control unit 2 performs alignment processing, which will be described later, on the line pattern of the reference form (memory 53) and the line pattern of the input form (memory 55), and the position correction amounts (g, h) are calculated in the control unit 2. Stored in temporary memory. The control unit 2 creates a normalized image of the input form (memory 52) and a thicker image of the reference form (
Alignment processing and subtraction processing with the memory 56) are performed,
The portion of the reference form image that is omitted from the input form is erased, and the remaining image is stored in the memory 56. Details of the alignment process and subtraction process will be described later.

かくして、メモリ５６の内容は入力帳票に記入された認
識の文字・記号または図形であり、これらは認識処理部
６によって切り出し、認識が行われるが、その処理の内
容は公知であるので省略する。Thus, the contents of the memory 56 are recognized characters, symbols, or figures written on the input form, and these are extracted and recognized by the recognition processing section 6, but the contents of the processing are well known and will therefore be omitted.

前述したように、装置全体の制御は制御部２によって行
われるが、使用者からの指示、たとえば装置に供給され
た帳票３が基準帳票であるか入力帳票であるかなどは、
キーボード７から入力され、制御部２からの指示や応答
はディスプレイ８上に表示される。As mentioned above, the entire apparatus is controlled by the control unit 2, but instructions from the user, such as whether the form 3 supplied to the apparatus is a reference form or an input form, are determined by the control unit 2.
Input from the keyboard 7 and instructions and responses from the control unit 2 are displayed on the display 8.

次に制御部２で行なう各種の処理について説明する。ま
ず、第２図に傾き検出処理の流れ図をＰ　ＡＤ　（Ｐｒ
ｏｇｉａｍ　７７ｎａｌｙｓｉｓ　］）ｉａｇｒａｍ　
）形式で示す。第２図において、帳票上の各画素はＰ（
ｉ、ｊ）で表され、ｊは走査線番号（０からＪ−１まで
の５本）、ｉは各走査線の中の画素番号である（各走査
線画り１画素）。２０１は傾き検出処理への入口であり
、メモリ５１に格納されてｂる帳票イメージＰが与えら
れる。２０２で変数Ｓ、、、の初期化を行う。２０３で
は、２０４〜２１４の処理を角度θについて一θｌから
θλ捷でΔθきざみで繰り返す。２０４〜２１４の処理
はθ方向の尖鋭度Ｓａをめ、その中の最大値Ｓｍを与え
る方向θ、をめる処理である。２０４は水平方向周辺分
布Ｈ（ｊ）を走査線ｊについてめるもので、そのとき角
度θだけ積分方向を傾ける。２０５はＨ（ｊ）の初期化
であり、２０６はｉについて２０７から２０９−！でを
繰り返し積分値Ｈ（ｊ）をめる処理である。２０７では
角度θだけ回転させたとき座標（ｉ、ｊ）に写像される
原イメージでの座標（ｉｅ　ｌ　Ｊｌｌ　）をめるもの
であり、で定義されるが、近似的にで代用してもよい。２０８は座標の限界チェックであり
、２０９で積分が行われる。２１０〜２１１は尖鋭度Ｓ
、をめる処理であり、２１０は初期化、２１１〜２１２
はで定義される尖鋭度Ｓ、をめる処理である。との尖鋭度
Ｓ、は、一般に帳票にプレプリントされる文字は水平方
向に伸びる行であり、また罫線などの横線も多く含まれ
ていることに着目したもので、帳票上の文字行や横線と
積分方向が一致したときに大きくなる性質を有している
。水平方向の周辺分布だけでなく、垂直方向の周辺分布
も同時（９）に評価すればなおよいが、容易に類推できるため省略す
る。２１３と２１４は８θの最大値Ｓ、とその方向をめ
る部分である。２１５は傾き検出処理からの出口であり
、パラメータθ１が出力に与えられる。Next, various processes performed by the control section 2 will be explained. First, FIG. 2 shows a flowchart of the tilt detection process P AD (Pr
ogiam 77nalysis])iagram
) format. In Figure 2, each pixel on the form is P(
i, j), where j is the scanning line number (5 lines from 0 to J-1) and i is the pixel number in each scanning line (one pixel per scanning line). Reference numeral 201 is an entrance to the inclination detection process, and a form image P stored in the memory 51 is provided. In step 202, the variables S, . . . are initialized. In step 203, the processes of steps 204 to 214 are repeated for the angle θ in steps of Δθ from 1θl to θλ. Processes 204 to 214 are processes for determining the sharpness Sa in the θ direction and determining the direction θ that gives the maximum value Sm. Reference numeral 204 calculates the horizontal peripheral distribution H(j) about the scanning line j, and at this time, the direction of integration is tilted by an angle θ. 205 is the initialization of H(j), and 206 is 207 to 209-! for i! This is a process of repeatedly calculating the integral value H(j). 207 calculates the coordinates (ie l Jll) in the original image that are mapped to the coordinates (i, j) when rotated by the angle θ, and is defined by, but it can also be approximately substituted by good. 208 is a coordinate limit check, and 209 is an integration. 210-211 is sharpness S
, 210 is initialization, 211 to 212
This is the process of calculating the sharpness S defined by . The sharpness S, which is calculated based on the fact that characters preprinted on forms are generally lines that extend horizontally, and also contains many horizontal lines such as ruled lines, It has the property of becoming larger when the integration directions match. It would be better if not only the horizontal peripheral distribution but also the vertical peripheral distribution were evaluated at the same time (9), but this is omitted as this can be easily inferred. 213 and 214 are the parts that determine the maximum value S of 8θ and its direction. 215 is an exit from the tilt detection process, and the parameter θ1 is given to the output.

第３図に傾き補正処理の流れ図をＰＡＤ形式で示す。３
０１は入口であり、帳票イメージＰと回転角θ。が与え
られる。３０２．３０３の二重ループによ如、走査線番
号ｊと走査線内画素番号ｉとについて以下の処理を繰り
返す。３０４は座標ｍは座標の限界チェックであり、限
界内にあるとき３０６により原イメージ内の画素Ｐ（ｉ
、。FIG. 3 shows a flowchart of the tilt correction process in PAD format. 3
01 is the entrance, the form image P and the rotation angle θ. is given. According to the double loop of 302 and 303, the following process is repeated for the scanning line number j and the pixel number i within the scanning line. 304 is a coordinate limit check where the coordinate m is within the limit, and when it is within the limit, the pixel P(i
,.

ｊｅ）を新イメージ内の画素Ｑ（ｉ、ｊ）に転写する。je) to pixel Q(i,j) in the new image.

（ｉ、＋ｊ、、）が限界外に飛び出すときは３０７でＱ
（ｉ、ｊ）け０とする。３０８は傾き補正処理の出口で
あり、回転後の正規化イメージＱ（ｉ、ｊ）を出力に与
え、メモリ５２に格納する。When (i, +j,,) jumps out of the limit, Q with 307
Let (i, j) be 0. 308 is an exit of the tilt correction process, which outputs the rotated normalized image Q(i, j) and stores it in the memory 52.

次に、プレプリントパターンの中から直線部分（１０）を抽出する処理について説明する。第１１図は、その原
理を示す図であり、（５）のようなプレプリントパター
ンからある長さ以上の線分のみを抽出してＱ３）のよう
なパターンを作る。これを水平方向に投影して０のよう
なパターン（これを線パターンと呼ぶ）を作る。実際に
は■のパターンを作る必要はなく、（３）から（Ｑへ直
接変換する。Next, the process of extracting the straight line portion (10) from the preprint pattern will be explained. FIG. 11 is a diagram showing the principle. Only line segments of a certain length or more are extracted from a preprint pattern like (5) to create a pattern like Q3). This is projected horizontally to create a 0-like pattern (this is called a line pattern). Actually, there is no need to create the pattern (3), and (3) is directly converted to (Q).

第４図にこの線抽出処理の流れ図をＰＡＤ形式で示す。FIG. 4 shows a flowchart of this line extraction process in PAD format.

線抽出には水平線抽出と垂直線抽出とがあるが、ここで
は水平線抽出について述べ、垂直線抽出については同様
であるので省略する。第４図で４０１は線抽出への入口
であり、メモリ５２に格納されている正規化イメージＱ
が与えられる。Line extraction includes horizontal line extraction and vertical line extraction, but horizontal line extraction will be described here, and vertical line extraction will be omitted since it is the same. In FIG. 4, 401 is the entrance to line extraction, and the normalized image Q stored in the memory 52
is given.

４０２では４０３〜４０９の処理を走査線番号ｊについ
て繰り返すことにより、黒長連積分値ＡＵ）を得る。４
０３は初期化である。４０４は走査線の中の画素番号ｉ
についての繰す返しである。In step 402, the processes in steps 403 to 409 are repeated for scanning line number j to obtain the long black continuous integral value AU). 4
03 is initialization. 404 is the pixel number i in the scanning line
This is a repetition of

４０５で画素Ｑ（ｉ、ｊ）が１か０かを判定し、１の場
合には４０６で黒の連長Ｂをカウントする。In 405, it is determined whether the pixel Q (i, j) is 1 or 0, and if it is 1, the black run length B is counted in 406.

Ｑ（ｉ、ｊ）がＯの場合には、４０７の判定によ（１１
）りその前の画素までの連長Ｂが閾値εより大きいとき、
４０８で積分処理を行い、４０９で連長Ｂをリセットす
る。ループ終了後４１０で積分値Ａ（ｊ）にＢを加えて
いるのは、４０４以下のループでは最終点（ｉ＝Ｉ−１
）における積分がなされないからである。４０７の判定
を加えたことにより、十分長い黒の水平線分があるとき
のみＡ（ｊ）に積分が行われるので雑音の影響を受け難
く、帳票上に記入されている文字、記号は短い線分から
なっているので、はとんど無視できる。長い線分が存在
しないような帳票では閾値εを適宜小さくとればよい。If Q(i,j) is O, then (11
) When the run length B to the previous pixel is greater than the threshold ε,
Integration processing is performed in 408, and the run length B is reset in 409. The reason why B is added to the integral value A(j) at 410 after the loop ends is that in loops below 404, the final point (i=I-1
) is not integrated. By adding the judgment of 407, integration is performed on A(j) only when there is a sufficiently long black horizontal line segment, so it is less affected by noise, and characters and symbols written on the form are separated from short line segments. Therefore, it can be ignored for the most part. For documents that do not have long line segments, the threshold value ε may be set appropriately small.

４１１はこの処理の出口であり、長連積分値Ａを出力に
与え、メモリ５３に格納する。Reference numeral 411 is an exit of this process, and the long continuous integral value A is given to the output and stored in the memory 53.

以上の説明から判るように長連積分値Ａは線分の存在を
反映したパターンになっており、これを線パターンと呼
ぶ。なお１．線パターンＡからピークの抽出を行えば長
い線分そのものの存在も検知できる。As can be seen from the above explanation, the long continuous integral value A has a pattern that reflects the existence of line segments, and this is called a line pattern. Note 1. If peaks are extracted from line pattern A, the presence of long line segments themselves can be detected.

第５図に位置合せ処理の流れ図をＰＡＤで示す。FIG. 5 shows a flowchart of the alignment process in PAD.

位置合せは垂直方向については水平線パターンＡ（１２
）を用いて行い、水平方向については垂直線パターンを用
いて行うが、ここでは垂直方向についてのみ行う。５０
１は入口であり、二つの線パターンＡ、＋　、Ａ２が与
えられる。Ａ！は基準帳票イメージから抽出されたもの
（メモリ５３）、Ａ２は入力帳票イメージから抽出され
たもの（メモリ５５）とする。５（１２は初期化である
。５０３は位置ずれ量ｄについて５０４〜５０８の処理
を（ｄ＋）からｄｌまで繰シ返す処理である。５０４ｆ
ｄ初期化であり、５０５は走査線ｊについての繰り返し
であり、５０６では二つの線パターンの差を位置をｄだ
けずらして計算し、その絶対値を積分する処理である。For vertical alignment, horizontal line pattern A (12
), and a vertical line pattern is used for the horizontal direction, but here only for the vertical direction. 50
1 is the entrance, and two line patterns A, +, A2 are given. A! A2 is extracted from the reference form image (memory 53), and A2 is extracted from the input form image (memory 55). 5 (12 is initialization. 503 is the process of repeating the processes 504 to 508 for the positional deviation amount d from (d+) to dl. 504f
d initialization, 505 is a repetition for scanning line j, and 506 is a process of calculating the difference between two line patterns by shifting the position by d, and integrating the absolute value.

式で書けば、である。５０７．５０８でＣｄの最小値と、その最小値
を与えるｄ、がめられる。５０９は出口であり、出力に
ｄユが与えられ、制御部２の一時記憶に格納される。ｄ
、は垂直方向の位置ずれ補正量であり、基準帳票イメー
ジ（の正規化イメー（１３）ジ）を垂直方向にｄゆだけずらせれば入力帳票イメージ
に最もよくマツチすることを示す。If written as a formula, it is. 507 and 508, the minimum value of Cd and d that gives the minimum value can be seen. Reference numeral 509 denotes an exit, and the output is given dyu, which is stored in the temporary memory of the control unit 2. d
, is the vertical positional shift correction amount, and indicates that if the reference form image (the normalized image (13) of it) is shifted by d in the vertical direction, it will best match the input form image.

第６図に減算処理の流れ図をＰＡＤ形式で示す。FIG. 6 shows a flowchart of the subtraction process in PAD format.

６０１は入口であり、基準帳票イメージＱ１％入力帳票
イメージＱ２、水平位置ずれ補正量ｇ１垂直位置ずれ補
正量りが与えられる。ここでは、基準帳票イメージＱｌ
としては、メモリ５６に格納されている基準帳票の太め
イメージとし、入力帳票イメージＱ２としてはメモリ５
２に格納されている入力帳票の正規化イメージを用いる
ものとする。また、ｈけ第５図でｄ６と書いたものであ
り、ｇは第５図と同様な処理を水平方向につめて行って
得られるものである。６０２．６０３の二重ループでイ
メージ全体を走査する。６０４は添字がイメージ領域を
飛び出さないようにするだめの限界チェックである。６
０５では、基準帳票イメージＱｌ（ｉ、ｊ）を水平方向
にｇ１垂直方向にｈだけ位置補正して画素Ｑｌ（ｉ＋ｇ
、ｊ十ｈ）を取り出し、その否定と入力帳票イメージＱ
２（ｉ、ｊ）との論理積を取る。したがって、以上の処
理により（１４）作られるイメージＲは入力帳票イメージＱ２の中に含ま
れる基準帳票イメージの部分を消去したものとなってい
る。６０７は出口で、イメージＲが出力に与えられ、メ
モリ５６に格納される。601 is an entrance, and a reference form image Q1, an input form image Q2, a horizontal positional deviation correction amount g1, and a vertical positional deviation correction scale are given. Here, the standard form image Ql
is a thick image of the standard form stored in the memory 56, and the input form image Q2 is the thick image of the standard form stored in the memory 56.
Assume that the normalized image of the input form stored in 2 is used. Moreover, h is written as d6 in FIG. 5, and g is obtained by performing the same processing as in FIG. 5 in the horizontal direction. Scan the entire image in a double loop of 602.603. 604 is a limit check to prevent the subscript from jumping out of the image area. 6
In 05, the reference form image Ql(i,j) is positionally corrected by g1 in the horizontal direction and h in the vertical direction, and the pixel Ql(i+g
, j ten h), its negation and input form image Q
2(i,j). Therefore, by the above processing (14), the image R created is one in which the portion of the reference form image included in the input form image Q2 is deleted. 607 is an exit, and the image R is given as an output and stored in the memory 56.

以上により第１図の実施例の動作が理解される。The operation of the embodiment shown in FIG. 1 can be understood from the above.

なお、第１図でメモリ５１，５２．５３，５４゜５５．
５６は別のものとしだが、同一のメモリ内の異なる領域
を割り当ててもよい。捷だ、認識処理部６により行われ
る処理の全部または一部を制御部２のプログラム制御に
より実行してもよい。In addition, in FIG. 1, memories 51, 52, 53, 54°, 55.
56 is assumed to be different, but different areas within the same memory may be allocated. Alternatively, all or part of the processing performed by the recognition processing section 6 may be executed under program control of the control section 2.

また、傾き検出に用いる周辺分布パターンＨ（ｊ）と位
置補正に用いる線パターン（黒長連積分値）ＡＵ）とは
、いずれか一方で他を代用することもでき、役割を交換
してもよい。Furthermore, one of the peripheral distribution pattern H(j) used for tilt detection and the line pattern (black long continuous integral value) AU) used for position correction can be substituted for the other, and even if the roles are exchanged. good.

また、紙送り装置（図示せず）にガイド等を付けて帳票
のスキューが発生しないようにすれば傾き検出処理や傾
き補正処理は不要となる。また、帳票エツジや帳票上の
特定のマークを検出し、この部分を整合させるようにし
て傾き補正及び位置補正を行うこともできる。さらに、
基準帳票の傾（１５）き補正を省略し、入力帳票を複数の角度に回転させて最
もよく整合する角度と位置を探索するようにして入力帳
票の角度補正と位置合せを行ってもよい。基準帳票につ
いて、線縁の平滑化やボイド・スポットなどの雑音除去
を行うことも効果がある。まだ、認識手法として細線化
法を採用している場合は、基準帳票イメージを太める代
りに入力帳票イメージを細めてもよい。認識手法として
輪郭解析法を採用している場合は、入力帳票イメージか
ら輪郭抽出を行い、基準帳票イメージに含まれる部分を
消去してもよい。基準帳票イメージの中で読み取りたい
部分（たとえばタイトル部）があるとき、その部分を白
ペイントなどで抹消して基準帳票イメージを登録すれば
、抹消した部分も読取対象パターンに含ませることがで
きる。また、上記の説明では二値化パターンについて処
理するものとしたが、多値イメージについて行うことも
でき、そのとき第６図６０５の否定・論理積の部分を大
小比較演算でおきかえることができる。式（１６）％式％（））（）のようになる。太め演算も多値イメージについて大小比
較演算を用いて定義できる。また、色彩イメージに適用
することも可能である。Further, if a guide or the like is attached to the paper feeding device (not shown) to prevent the occurrence of skew of the form, the tilt detection process and the tilt correction process become unnecessary. It is also possible to perform tilt correction and position correction by detecting document edges or specific marks on the document and aligning these portions. moreover,
The tilt correction of the reference form (15) may be omitted, and the angle correction and alignment of the input form may be performed by rotating the input form at a plurality of angles and searching for the angle and position that best matches the input form. It is also effective to smooth line edges and remove noise such as voids and spots on the reference form. If the thinning method is still used as the recognition method, the input form image may be thinned instead of thickening the reference form image. If a contour analysis method is employed as the recognition method, contours may be extracted from the input form image and portions included in the reference form image may be deleted. When there is a part of the standard form image that you want to read (for example, a title part), by erasing that part with white paint or the like and registering the standard form image, the erased part can be included in the pattern to be read. Further, in the above description, processing is performed on a binary pattern, but multivalued images can also be processed, and in that case, the negation/logical product part in FIG. 6 605 can be replaced with a magnitude comparison operation. Equation (16) becomes as follows. A thick operation can also be defined using a magnitude comparison operation for a multivalued image. It can also be applied to color images.

次に第７図により本発明の他の実施例について説明する
。本実施例では、正規化、太めを行った基準帳票イメー
ジをメモリ５、例えばメモリ５４に複数種類登録してお
き、入力帳票の処理に際して使用者がたとえばキーボー
ドからの指示によりその入力帳票に対応する基準帳票イ
メージ（及びその線パターン）を選択して、位置合せ処
理及び減算処理を行う点が第一の実施例と異なり、他は
同様である。第７図は、基準帳票イメージの番号を指定
したとき、そのアドレスを計算してイメージを取り出す
処理の流れ図をＰＡＤ形式で示したものである。７０１
は入口で、基準帳票イメージの番号ｋが与えられる。７
０２では第に番目の基準帳票イメージのサイズＩｈ−Ｊ
ｋを取り出す。Next, another embodiment of the present invention will be described with reference to FIG. In this embodiment, a plurality of types of normalized and thickened standard form images are registered in the memory 5, for example, the memory 54, and when processing an input form, the user responds to the input form by, for example, an instruction from the keyboard. This embodiment differs from the first embodiment in that a reference form image (and its line pattern) is selected and alignment processing and subtraction processing are performed, but the rest is the same. FIG. 7 shows a flowchart in PAD format of the process of calculating the address and extracting the image when the number of the reference form image is specified. 701
is given the reference form image number k at the entrance. 7
In 02, the size of the th standard form image Ih-J
Take out k.

（１７）７０３はＫの初期値をＯとする。７０４は７０５の積和
を行い、（ｋ−１）番目までの基準帳票イメージの画素
数の総和に＝Σ工ｍＪ１．ｌをめる。基準帳票イメージ
はメモリエリアＧの中に順次格納されており、Ｋはその
中でのに番目の基準帳票イメージの先頭アドレスを示す
。７０６，７０７の二重ループによりＧ卸からＧ　（Ｋ
＋　Ｉ、、Ｊ、−１）までの内容を出力バッファエリア
Ｑに転送する処理７０８を繰り返す。７１０は出口で、
キーボードから指定された基準帳票イメージＱとそのサ
イズエに、Ｊｋを出力に与える。なお、複数個の基準帳
票イメージをメモリエリアＧに登録する処理は、第７図
と逆の処理の組み合せでできるので説明は省略する。(17) 703 sets the initial value of K to O. 704 performs the sum of products of 705, and the total number of pixels of the reference form image up to the (k-1)th is calculated as =Σ×mJ1. Add l. The reference form images are sequentially stored in the memory area G, and K indicates the start address of the second reference form image among them. A double loop of 706 and 707 allows G wholesale to G (K
The process 708 of transferring the contents up to +I, , J, -1) to the output buffer area Q is repeated. 710 is the exit,
Jk is given as output to the reference form image Q and its size specified from the keyboard. Note that the process of registering a plurality of reference form images in the memory area G can be performed by combining the processes reversed to those shown in FIG. 7, so a description thereof will be omitted.

第二の実施例で、キーボード等から指示した基準帳票イ
メージをディスプレイ８上に表示し、確認することも効
果がある。In the second embodiment, it is also effective to display the reference form image instructed from the keyboard or the like on the display 8 for confirmation.

本実施例によれば複数個の帳票をあらかじめ登録してお
き、その中から入力帳票と同種のものを選択できるので
、多種多様な業務で用いられる帳（１８）票を次々と処理するとき、帳票ごとに基準帳票を登録す
る必要がなく高速処理できる。According to this embodiment, a plurality of forms can be registered in advance and one of the same type as the input form can be selected from among them. There is no need to register a standard form for each form, allowing high-speed processing.

次に第８図、第９図により第三の実施例について説明す
る。本実施例では帳票イメージを分割し、その各分割領
域において位置合せを行う点が第一の実施例と異なる。Next, a third embodiment will be described with reference to FIGS. 8 and 9. This embodiment differs from the first embodiment in that the form image is divided and alignment is performed in each divided area.

第８図では四分割の場合を図示している。一般に水平方
向にＵ等分、垂直方向に７等分したとし、その（ｕ、ｖ
）分割領域について、第４図と第５図の処理を適用する
ことによりこの分割領域における位置補正量（ｇ、、　
、　ｈｕ、　）がめられる。この補正量を用いて第６図
に代る減算処理を行うのであるが、その流れ図（ＰＡＤ
形式）を第９図に示す。９０１は入口であり、基準帳票
イメージＱ１、入力帳票イメージＱ２、位置補正ｔｇ、
ｈが与えられる。ここでｇ、ｈはＵ×■の大きさの配列
である。９０２は９０３〜９０５を繰り返すループであ
り、９０３〜９０５は水平方向位置補正ｔ　ｇ　ｗ、ｖ
　が二次元配列として与えられているのを添字Ｖについ
て平均し、−次元配列Ｇ（ｕ）を得る処理である。Ｇ（
ｕ）は帳票を水平（１９）方向にＴＪ分割したとき、各分割領域における水平方向
位置補正量を与える。９０６〜９０９は上記の９０２〜
９０５と同様の処理を垂直方向について行い、垂直位置
補正量Ｈ（ｖ）を与える処理である。FIG. 8 shows the case of four divisions. Generally speaking, it is divided into U equal parts in the horizontal direction and 7 equal parts in the vertical direction, and the (u, v
) By applying the processing of FIGS. 4 and 5 to the divided area, the position correction amount (g, ,
, hu, ) are observed. This correction amount is used to perform subtraction processing in place of the one shown in Fig. 6, and its flowchart (PAD
(format) is shown in Figure 9. 901 is an entrance, which includes a reference form image Q1, an input form image Q2, position correction tg,
h is given. Here, g and h are arrays of size U×■. 902 is a loop that repeats 903 to 905, and 903 to 905 are horizontal position corrections t g w, v
is given as a two-dimensional array, and is averaged with respect to the subscript V to obtain a -dimensional array G(u). G(
u) gives the horizontal position correction amount in each divided area when the form is divided into TJs in the horizontal (19) direction. 906-909 are the above 902-
Processing similar to step 905 is performed in the vertical direction to provide a vertical position correction amount H(v).

９１０〜９１２は水平方向の画素当りの位置補正量Ｅ　
（ｉ）を与える処理である。９１０で分割領域の中点間
の区間（（Ｕ−１）個）について繰り返し、９１１では
その各区間内の画素数α＝Ｉ／Ｕについて繰り返す。９
１２は線形補間式である。９１３〜９１４は、両端の２
個の半区間についてのＥ（ｉ）を与えるもので、ここで
は領域の中間の値にスライドした値とした。９１５〜９
１９は上記の９１０〜９１４と同様の処理を垂直方向に
ついて行い、垂直方向の画素当りの位置補正量ＦＯ）を
与える処理であり、９１６のβはβ＝Ｊ／Ｖで定められ
る。910 to 912 are position correction amounts E per pixel in the horizontal direction
This is a process that gives (i). In step 910, it is repeated for the sections ((U-1)) between the midpoints of the divided regions, and in step 911, it is repeated for the number of pixels α=I/U in each section. 9
12 is a linear interpolation formula. 913-914 are 2 at both ends
E(i) for half intervals is given here, and here the value is set as a value that is slid to the middle value of the region. 915-9
19 is a process in which the same processing as 910 to 914 described above is performed in the vertical direction to give a position correction amount FO per pixel in the vertical direction, and β in 916 is determined by β=J/V.

９２０．９２１の二重ループによりイメージ全面を走査
する。９２２は減算処理であり、水平方向にＥ（ｉ）、
垂直方向にＦ（ｊ）だけ補正してめた基準帳票イメージ
上の画素Ｑｓ　（Ｅ（ｉ）　２．　Ｆ（ｊ）　）を取り
出し、その否定と入力帳票イメージＱｈ（”−ｊ）（２
０）との論理積を作る。したがって、以上の処理により作ら
れるイメージＲは入力帳票イメージＱ２の中に含まれる
基準帳票イメージの部分を消去したものとなっている。A double loop of 920.921 scans the entire image. 922 is a subtraction process, and in the horizontal direction E(i),
The pixel Qs (E(i) 2. F(j) ) on the standard form image corrected by F(j) in the vertical direction is extracted, and its negation and input form image Qh(''-j)(2
0). Therefore, the image R created by the above processing is obtained by erasing the portion of the reference form image included in the input form image Q2.

９２３は出口で、イメージＲが出力に与えられる。923 is an exit, and image R is given to the output.

なお、本実施例では二次元配列としてめた位置補正量ｇ
ｗｖ　、　ｈ□　を水平及び垂直方向について平均する
としたが、−次元化せずに二次元配列のまま扱い、各画
素での位置補正量を二次元補間によってめてもよい。ま
だ、本実施例ではイメージの分割を等分割としだが不等
分割するように修正することも容易である。壕だ、本実
施例のように分割領域ごとに線パターンＡＩ　＊　Ａ２
などをめて位置補正量を領域全体で行う代りに、全イメ
ージでの線パターンＡ！（ｊ）　とＡ２（Ｊ）　とを対
応付けし、Ａｒ（ＦＧ））とＡ２（ｊ）とが最もよく一
致するようなＦＯ）（及び水平方向のＥ（ｉ））をめて
減算処理を行ってもよい。このような対応付けは動的計
画法マツチングにより効率的にめられる。In addition, in this example, the position correction amount g determined as a two-dimensional array
Although wv and h□ are averaged in the horizontal and vertical directions, they may be treated as two-dimensional arrays without being converted into -dimensional arrays, and the position correction amount at each pixel may be determined by two-dimensional interpolation. Although the image is divided into equal parts in this embodiment, it is easy to modify the image to be divided into unequal parts. It's a moat, like in this example, line pattern AI * A2 for each divided area
Instead of adjusting the position correction amount for the entire area, line pattern A for the entire image! (j) and A2(J), find the FO) (and E(i) in the horizontal direction) that best matches Ar(FG)) and A2(j), and perform the subtraction process. You may go. Such correspondence can be efficiently determined by dynamic programming matching.

本実施例によれば紙の伸縮や光電変換装置の経（２１）午後化により、帳票の等制約な走査ピンチが変化して帳
票イメージが伸縮しても、基準帳票イメージを確実に消
去できるという効果がある。According to this embodiment, the standard form image can be reliably erased even if the form image expands or contracts due to changes in the restrictive scanning pinch of the form due to the expansion and contraction of paper and the effects of the photoelectric conversion device (21). effective.

次に、プレプリントを有する帳票の種類を自動的に識別
し、帳票識別番号を特に付与することなく異種帳票混在
時の読み取りを可能とした実施例について説明する。こ
の実施例は帳票のプレプリント情報を積極的に利用して
帳票の種類を自動的に識別する点が特徴である。Next, an embodiment will be described in which the type of a form having a preprint is automatically identified, and it is possible to read a mixture of different types of forms without specifically assigning a form identification number. This embodiment is characterized in that the type of the form is automatically identified by actively utilizing the preprint information of the form.

本実施例を第１０図を用いて説明する。第１０図は、帳
票の種類を識別する処理をＰＡＤ形式で示す流れ図であ
る。This embodiment will be explained using FIG. 10. FIG. 10 is a flowchart showing the process of identifying the type of form in PAD format.

１００１は各種の初期化であり、装置内部の初期化や後
述する既登録帳票の各種情報を外部記憶装置９から読み
込む処理もここで行う。１０ｏ２は登録モードか読取モ
ードかを選択する。登録モードの場合は１００３から１
００７の処理を行う。１００３ではキーボード７から登
録しようとする帳票の識別番号ｎをキーインする。１０
０４では、使用する基準帳票を光電変換装置４によりデ
ィジタルパタ（２２） −ンに変換し基準帳票イメージＰ　（ｎ）をメモリ５に
格納する。この基準帳票イメージＰ　（ｎ）は、前述し
たように必要に応じて、正規化、太め処理が施される。Reference numeral 1001 indicates various initializations, including initialization of the inside of the device and processing for reading various information of registered forms, which will be described later, from the external storage device 9. 10o2 selects registration mode or reading mode. 1003 to 1 in registration mode
007 processing is performed. At step 1003, the identification number n of the form to be registered is keyed in from the keyboard 7. 10
In step 04, the reference form to be used is converted into a digital pattern (22)-n by the photoelectric conversion device 4, and the reference form image P(n) is stored in the memory 5. This reference form image P (n) is subjected to normalization and thickening processing as necessary, as described above.

１００５では、この基準帳票イメージＰ　（ｎ）から周
辺分布パターン、線パターン等の特徴パターンＥ　（ｎ
）を抽出する。特徴パターンの抽出例については前述し
た通りであるが、基準帳票イメージＰ　（ｎ）をそのま
ま特徴パターンＥ（ｎ）として使ってもよい。１００６
ではキーボード７から書式情報Ｆ　（ｎ）を入力し、基
準帳票イメージＰ　（ｎ）又は特徴パターンＥ　（ｎ）
と共にメモリ５に格納する。１００７では、Ｐ（ｎ）、
　Ｅ（ｎ）、　Ｆ（ｎ）などの情報を外部記憶装置９へ
書き出すが、メモリ５として不揮発メモリを用いる場合
は、必ずしも外部記憶装置９を用いる必要はない。この
場合、少なくとも、基準帳票に対する特徴パターン（特
徴パターンとして基準帳票イメージそのものを用いる場
合は、基準帳票イメージ）を記憶するメモリ、例えば、
第１図の構成では、メモリ５３及び５４が不揮発メモリ
であればよい。At step 1005, characteristic patterns E (n
). The example of extracting the feature pattern is as described above, but the reference form image P (n) may be used as it is as the feature pattern E(n). 1006
Now, input the format information F (n) from the keyboard 7, and input the standard form image P (n) or characteristic pattern E (n).
It is also stored in the memory 5. In 1007, P(n),
Information such as E(n) and F(n) is written to the external storage device 9, but if a nonvolatile memory is used as the memory 5, it is not necessarily necessary to use the external storage device 9. In this case, at least a memory that stores the feature pattern for the reference form (or the reference form image if the reference form image itself is used as the feature pattern), for example,
In the configuration of FIG. 1, the memories 53 and 54 may be nonvolatile memories.

（２３）１００２において読取モードが選択された場合、１００
８から１０１８の処理を行う。１００８では、読み取る
べき対象パターンの記入された帳票から入力帳票イメー
ジＰ。を侍、メモリ５に格納する。(23) If the reading mode is selected in 1002, 100
8 to 1018 are performed. At 1008, an input form image P is obtained from the form on which the target pattern to be read is written. Samurai stores it in memory 5.

１００９ではＰ、から周辺分布パターン、線パターン等
の特徴パターンＥｘを抽出するが、この処理については
第１の実施例で詳述したので、ここでは省略する。１０
１０は特徴照合であり、入力帳票の特徴パターンＥ！に
対応する帳票識別番号ｍをめる処理であるが、その内容
は１０１１から１０１５からなる。ここでは、入力帳票
の特徴パターンＥ、とじてメモリ５５に格納されている
線パターンを、基準帳票の特徴パターンＥ　（ｎ）とし
てメモリ５３に格納されている線パターンを用いるもの
とする。１０１１は既に登録されている基準帳票に関す
るループであってｎはその帳票識別番号、Ｎは登録済基
準帳票の種類である。１ｏ１２ではＥｘとＥ　（ｎ）の
類似度、例えば距離り、を計算する。At step 1009, a characteristic pattern Ex such as a peripheral distribution pattern or a line pattern is extracted from P, but since this process has been described in detail in the first embodiment, it will be omitted here. 10
10 is feature matching, and feature pattern E! of the input form is used. This is the process of calculating the form identification number m corresponding to , and its contents consist of 1011 to 1015. Here, it is assumed that the line pattern stored in the memory 55 is used as the feature pattern E of the input form, and the line pattern stored in the memory 53 is used as the feature pattern E (n) of the reference form. 1011 is a loop related to already registered standard forms, n is the form identification number, and N is the type of the registered standard forms. In 1o12, the degree of similarity, for example, the distance, between Ex and E (n) is calculated.

類似度の計算は通常のパターン認識に用いられるもので
よく、類似度の尺度としては距離、尤度相（２４）関を用いることができる。１０１３では距離Ｄ１１の最
小値を与える帳票識別番号ｍをめている。Similarity calculations may be those used in normal pattern recognition, and distance and likelihood correlation (24) may be used as similarity measures. In 1013, the form identification number m that gives the minimum value of the distance D11 is determined.

１０１１のループの終了後１０１４でＤＩＩｌが閾値θ
、。After the loop of 1011 ends, DIIl reaches the threshold θ at 1014.
,.

より小さいか否かを判定する。小さくなり場合は、入力
帳票に対応する登録帳票が見当らなかった場合であり、
１０１５のリジェクト処理を行う。リジェクト処理では
、ディスプレイ８に既登録帳票の一覧表を表示し、その
中から選択させたり、あるいは既登録帳票の中に該当す
るものがない場合には登録モードを実行して基準帳票を
追加したりするなどの処理を行う。以上により、帳票識
別番号ｍが決定されると、１０１６において入力帳票イ
メージＰ、から登録帳票イメージＰ（ホ）を減算し、記
入文字・記号等の読取対象パターンのみ残っているイメ
ージＲを得る。１０１７ではイメージＲから、書式情報
Ｆに）を用いて文字・記号の切り出し、認識を行う。１
０１８では認識結果をディスプレイ８又はプリンタ等の
出力装置へ表示する。Determine whether it is smaller. If it becomes smaller, it is because the registered form corresponding to the input form was not found.
1015 rejection processing is performed. In the reject process, a list of registered forms is displayed on the display 8, and the user is asked to select from the list, or if there is no matching form among the registered forms, the registration mode is executed to add a standard form. Perform processing such as When the form identification number m is determined as described above, the registered form image P (e) is subtracted from the input form image P at 1016 to obtain an image R in which only the patterns to be read, such as written characters and symbols, remain. In step 1017, characters and symbols are cut out and recognized using the format information F from the image R. 1
At 018, the recognition result is displayed on the display 8 or an output device such as a printer.

なお、使用する帳票のプレプリント部が文字・記号の記
入部と近接しておらず、切シ出しが容易（２５）にできる場合は１０１７を省略しても差し支えない。Note that if the preprinted part of the form to be used is not close to the part where characters and symbols are written and the cutout can be easily made (25), then 1017 may be omitted.

１０１９は終了処理であり、各種の情報の外部記憶装置
９への出力などを含む。1019 is a termination process, which includes outputting various information to the external storage device 9, etc.

本実施例では、特徴パターンとして線パターンを用しる
ので、帳票上に記入した文字・記号はほとんど無視でき
、帳票の同一性の判定がより正確に行える利点がある。In this embodiment, since a line pattern is used as a characteristic pattern, the characters and symbols written on the form can be almost ignored, and there is an advantage that the identity of the form can be determined more accurately.

なお、特徴パターンとして線パターンを用いた場合、異
なる帳票が同一の直線部を有していたときには区別がで
きないが、このような場合には、帳票上の部分領域を指
定して、その領域内での各種の特徴をとるとよい。既に
直線部が検出されているので、部分領域の指定を直線部
分を基準にして行うことができ、位置ずれの悪影響を避
けられる。Note that when a line pattern is used as a characteristic pattern, it is not possible to distinguish between different forms if they have the same straight line part, but in such a case, specify a partial area on the form and It is a good idea to take various characteristics of Since the straight line part has already been detected, the partial area can be specified based on the straight line part, and the adverse effects of positional deviation can be avoided.

このように、本実施例によれば、既存の帳票をそのまま
利用しても、異種帳票混在読み取りが可能となり、帳票
の新規設計の手間や印刷コストを大幅に削減できる効果
がある。In this way, according to this embodiment, even if existing forms are used as they are, it is possible to read a mixture of different types of forms, which has the effect of significantly reducing the effort for designing new forms and printing costs.

〔Effect of the invention〕

（２６）以上説明した如く本発明によれば、プレプリント部に関
係なく読み取り対象パターンのみを読み取ることが可能
であり、帳票設計・印刷をし直す必要がなく、通常の帳
票をそのまま利用することができる。(26) As explained above, according to the present invention, it is possible to read only the pattern to be read regardless of the preprint part, and there is no need to redesign and print forms, and normal forms can be used as they are. I can do it.

[Brief explanation of the drawing]

第１図は本発明によるパターン認識方式を実施する装置
の構成を示すブロック図、第２図、第３図、第４図、第
５図、第６図、好７図、第９図。第１０図はそれぞれ第１図の制御部における処理を説明
するだめの流れ図、第８図は帳票の分割状態の一例を示
す図、第１１図（ａ）（ｂ）（ｃ）は線パターンを説明
するだめの図である。１・・・バス、２・・・制御部、３・・・帳票、５・・
・メモリ、６・・・認識処理部、７・・・キーボード、
８・・・デイスプ（２７）活　１１　図（Δ）　（Ｂン５２１− （Ｃ）FIG. 1 is a block diagram showing the configuration of an apparatus implementing a pattern recognition method according to the present invention, and FIGS. 2, 3, 4, 5, 6, 7, and 9. 10 is a flowchart explaining the processing in the control section of FIG. 1, FIG. 8 is a diagram showing an example of a form division state, and FIGS. 11(a), (b), and (c) are diagrams showing line patterns. This is a diagram for illustration purposes only. 1... Bus, 2... Control unit, 3... Report, 5...
・Memory, 6... Recognition processing unit, 7... Keyboard,
8... Disp (27) Live 11 Figure (Δ) (Bn521- (C)

Claims

[Claims] 1. A standard form image converted into a digital image is stored, and a portion included in the standard form image is deleted from the input form image obtained by photoelectrically converting and digitizing the input form. The pattern recognition method is characterized in that the pattern to be read written in the input form is cut out from the erased image and recognized. 2. In the pattern recognition method according to claim 1, a direction with respect to the reference form image. A pattern recognition method characterized in that after normalizing the position, thickness, etc., and also normalizing the input form image, the input form image and the reference form image are aligned. 3. In the pattern recognition method according to claim 1 or 2, a desired reference form image is selected from among a plurality of reference form images, and the selected reference form image is extracted from the input form image. A pattern recognition method that is characterized by erasing parts included in . 4. In the pattern recognition method according to claim 1, 2, or 3, a plurality of partial regions are extracted from each of the input form image and the reference form image, and each of the plurality of regions is , calculate the alignment amount at each pixel on the form image from the alignment amount in each of the multiple areas obtained as a result, perform alignment using the alignment amount at each pixel, and then A pattern recognition method characterized by erasing a portion included in the reference form image from an input form image. 5. In the pattern recognition method according to claim 3, a feature pattern extracted in advance for each of the plurality of reference form images is stored, and the feature pattern is extracted from the input form image. , a pattern recognition method characterized in that the desired reference form image is selected by comparing it with a characteristic pattern of the reference form image. 6. In the pattern recognition method according to claim 5, predetermined format information is stored for each part of the plurality of reference form images, and the format information for the selected reference form is used to identify the target to be read. A pattern recognition method characterized by recognizing patterns.