JPH08255237A

JPH08255237A - Data storage device

Info

Publication number: JPH08255237A
Application number: JP7058611A
Authority: JP
Inventors: Takemichi Watanabe; 武道渡辺
Original assignee: HATSUSO KK; Toyo Ink Mfg Co Ltd
Current assignee: HATSUSO KK; Toyo Ink Mfg Co Ltd
Priority date: 1995-03-17
Filing date: 1995-03-17
Publication date: 1996-10-01

Abstract

PURPOSE: To improve a character recognition rate and to store massive image data by discriminating a model which is newly inputted and having characters described with handwriting or typing, out of plural models in a model image storage device after inclination correction is performed. CONSTITUTION: First of all, various kinds of slips, on which nothing is described, are previously inputted by an image input device 1, erected by an image distortion corrector 2 and stored in a model storage device 3. Next, the new model, in which characters are described, is inputted as object data while using the image input device 1 and the image distortion corrector 2. An image discriminator 4 discriminates which model among plural model images stored in the model storage device 3 corresponds to the object data. The object image is turned to the image composed of only described characters by finding difference from the model image. This image composed of only the characters is compressed by an image compressing device 6 and stored in a digital image storage device 7 together with model discrimination information.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明はファクシミリ、スキャ
ナ等を用いて入力した文字画像中の文字部分をコード化
して記憶媒体に格納するデータ蓄積装置に関するもので
ある。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data storage device for encoding a character portion in a character image input by using a facsimile, a scanner or the like and storing it in a storage medium.

【０００２】[0002]

【従来の技術】従来から各種の文字認識方法による文字
認識装置が発明され、実用に供されてきた。文字認識の
対象に関しては自由な体裁で書かれた媒体から文字を認
識するもの、予め伝票として決められた帳票の文字部分
を認識するものと色々あるが、後者の場合多くは予め記
入枠が設定されている。2. Description of the Related Art Conventionally, character recognition devices by various character recognition methods have been invented and put into practical use. Regarding character recognition, there are various things such as recognizing characters from a medium written in free format and recognizing the character part of a form that is predetermined as a slip, but in the latter case, a fill-in frame is often set in advance. Has been done.

【０００３】[0003]

【発明が解決しようとする課題】通常ファクシミリやス
キャナにはドロップアウト・カラーが設定されていて、
その色で印刷された文字、罫線等は読まない。従ってOC
R(Optical Character Reader) のユーザはその色を用い
て記入枠を印刷していた。しかし、ドロップアウト・カ
ラーは機種ごとに異なり、OCR 装置の運用上同一の入力
装置を買い揃える必要があった。また不特定多数からフ
ァクシミリが送られてくるような場合はときとして記入
枠がドロップアウトされず、記入された文字が記入枠に
重なっていたり部分的に飛び出したりしていて文字認識
の際の妨げになっていた。Usually, a facsimile or a scanner has a dropout color set,
Do not read characters or ruled lines printed in that color. Therefore OC
A user of R (Optical Character Reader) printed the entry frame using the color. However, the dropout color differs depending on the model, and it was necessary to purchase the same input device in order to operate the OCR device. In addition, when a fax is sent from an unspecified number of people, the entry frame is not sometimes dropped out, and the entered characters overlap the entry frame or partially pop out, which hinders character recognition. It was.

【０００４】ドロップアウト・カラーは印刷時に特色イ
ンキを使用する関係でOCR 用の用紙の印刷料金が高価に
なり、且つ色が比較的淡いので、記入者が記入しにくい
という問題点があった。また後々の問い合わせを考慮す
ると伝票そのものの保管、または読み取り画像のデジタ
ル保管が必要である。しかし、伝票そのものの保管は検
索時に多量の作業量と保管場所を必要とするため適切で
はない。デジタル蓄積は検索は容易であるが、伝票画像
が複雑な場合は画像圧縮を行っても高圧縮が望めず、多
数のデータの保管に問題があった。The drop-out color has a problem in that it is difficult for the writer to fill in because the printing fee for the OCR paper is expensive because the special color ink is used at the time of printing and the color is relatively light. Also, in consideration of future inquiries, it is necessary to store the slip itself or digitally store the read image. However, the storage of the slip itself is not appropriate because it requires a large amount of work and a storage location when searching. Digital storage is easy to search, but if the slip image is complicated, high compression cannot be expected even if image compression is performed, and there is a problem in storing a large amount of data.

【０００５】[0005]

【課題を解決するための手段】この目的に対応して、第
１の発明のデータ蓄積装置は、画像入力装置と画像歪み
補正装置と画像判別装置と雛形画像蓄積装置と文字認識
装置と画像圧縮装置とデジタルデータ蓄積装置とから構
成され、画像入力装置を用いて予め複数の雛形画像を入
力して画像歪み補正装置によって傾き補正を行った後に
雛形画像蓄積装置に登録しておき、画像入力装置を用い
て新たに入力された手書きまたは活字で文字が記述され
た雛形を前記画像補正装置を用いて傾き補正を行ったの
ちに前記画像判別装置を用いて前記雛形画像蓄積装置中
の複数の雛形の中から判別して前記新たに入力された雛
形から除去して文字のみからなる画像データとし、文字
部分を文字認識装置によって文字コード化した後に前記
画像圧縮装置によって圧縮した前記文字のみからなる画
像データと文字コードをデジタルデータ蓄積装置に蓄積
することを特徴としている。To solve this problem, a data storage device of the first invention is an image input device, an image distortion correction device, an image discrimination device, a template image storage device, a character recognition device, and an image compression device. Device and a digital data storage device. The image input device inputs a plurality of template images in advance, the image distortion correction device performs tilt correction, and the image data is then registered in the template image storage device. A plurality of templates in the template image accumulating device using the image discriminating device after tilt correction is performed using the image correcting device on a template newly input by using From the newly input template to obtain image data consisting of only characters, character parts are character coded by a character recognition device, and then the image compression device is used. It is characterized by the accumulation in the digital data storage device the image data and the character code consisting of only the characters compressed Te.

【０００６】[0006]

【作用】本発明のデータ蓄積装置は画像入力装置を備え
ている。この画像入力装置はファクシミリ装置、または
スキャナ装置であり、入力された画像データは２値化さ
れ、さらにファクシミリ装置の場合はＣＣＩＴＴ（国際
電信電話諮問委員会）が勧告している圧縮方法で圧縮さ
れたデータが出力される。また画像歪み補正装置は入力
時に傾きをもって入力された画像を正立させるもので、
一般には伝票には多くの垂直水平の罫線が印刷されてお
り、その罫線を検出して傾きを求めたり、活字の文章の
場合は行と行の隙間を検出して傾きを求めたりすること
によって画像を正立させる方法が知られている。The data storage device of the present invention comprises an image input device. This image input device is a facsimile device or a scanner device. The input image data is binarized, and in the case of a facsimile device, it is compressed by the compression method recommended by CCITT (International Telegraph and Telephone Consultative Committee). Data is output. In addition, the image distortion correction device erects the input image with a tilt at the time of input,
In general, many vertical and horizontal ruled lines are printed on a slip, and by detecting the ruled lines and determining the inclination, or in the case of print text, detecting the gap between lines and determining the inclination. A method of erecting an image is known.

【０００７】そして、前記画像入力装置によって入力さ
れた画像は前記画像歪み補正装置によって正立させられ
る。予め何も記入されていない複数の伝票画像が前記画
像入力装置、前記画像歪み補正装置を用いて入力され、
雛型として雛形蓄積装置に蓄積されている。一般に伝票
には文字、罫線、ロゴマーク等の図形が印刷されてい
る。新たに伝票に金額、個数、住所といった文字を記入
したものが前記画像入力装置、前記画像歪み補正装置を
用いて対象データとして入力される。画像判別装置は前
記の対象データが前記の雛形蓄積装置中に蓄積されてい
る複数の雛形画像の中のどの雛形に相当するのかを判別
する。The image input by the image input device is erected by the image distortion correction device. A plurality of slip images in which nothing is entered in advance are input using the image input device and the image distortion correction device,
It is stored in the template storage device as a template. In general, figures such as characters, ruled lines, and logo marks are printed on the slips. A new voucher in which characters such as the amount, the number, and the address are entered is input as target data using the image input device and the image distortion correction device. The image discriminating apparatus discriminates which template of the plurality of template images stored in the template storing apparatus corresponds to the target data.

【０００８】一般にこのような判別にはテンプレート・
マッチングという手法が使用されるが、これは重ね合わ
せて重なる割合を評価する方法である。対象画像は雛形
画像と差分をとることによって記入された文字画像のみ
となる。この文字画像のみとなった画像は記入枠が消去
されており、認識対象文字の切り出しが容易になり文字
認識装置によって通常にコード化されデジタルデータ蓄
積装置に蓄積される。またこの文字画像のみとなった画
像は画像圧縮装置によって圧縮された後に雛形識別情報
とともに前記デジタル画像蓄積装置に蓄積されるが、雛
形が付加されていた画像に比べて圧縮率が格段に向上
し、大量の画像データの蓄積が可能となる。Generally, a template
A technique called matching is used, which is a method of evaluating the overlapping ratio by overlapping. The target image is only the character image entered by taking the difference from the template image. The image of only this character image has the entry frame erased, the character to be recognized can be easily cut out, and is normally coded by the character recognition device and stored in the digital data storage device. Further, the image containing only the character image is stored in the digital image storage device together with the model identification information after being compressed by the image compression device, but the compression rate is remarkably improved as compared with the image to which the model is added. It is possible to store a large amount of image data.

【０００９】蓄積された画像データは必要に応じて雛形
画像と重畳することにより復元でき、問い合わせの確認
に用いることができる。The accumulated image data can be restored by superimposing it on a template image as needed, and can be used for confirmation of an inquiry.

【００１０】[0010]

【実施例】以下、この発明の詳細を図面を用いて説明す
る。図１はこの発明の装置の構成図である。図１におい
て、１は画像入力装置である。この画像入力装置はファ
クシミリ装置、またはスキャナ装置である。ファクシミ
リで入力された画像はＧIII規格では主走査方向は200dp
i(dots per inch) 、副走査方向は100dpi(dpi) であ
り、Ａ４サイズの原稿の場合は250Kバイト程度の画像と
なり、ＣＣＩＴＴが勧告しているＭＲ圧縮法(T4 勧告)
で圧縮される。スキャナの場合は縦横ともに200 〜300d
piで入力し、２値化した後500K〜1Mバイト程度の画像と
なる。DESCRIPTION OF THE PREFERRED EMBODIMENTS The details of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of the apparatus of the present invention. In FIG. 1, reference numeral 1 is an image input device. This image input device is a facsimile device or a scanner device. The image input by facsimile is 200 dp in the main scanning direction according to the GIII standard.
i (dots per inch), 100dpi (dpi) in the sub-scanning direction, an A4 size document has an image of about 250 Kbytes, and the MR compression method recommended by CCITT (T4 recommendation)
Compressed with. 200 to 300d for both vertical and horizontal for scanner
After inputting with pi and binarizing, it becomes an image of about 500K-1MB.

【００１１】予め何も記入されていない各種の伝票を前
記の画像入力装置１で入力し画像補正装置２で正立化し
た後前記雛形蓄積装置３に蓄積される。伝票画像の正立
化は伝票に印刷されている多くの垂直水平の罫線成分を
検出し、その傾きを求めて逆回転させることによって行
う。画像認識の分野ではHough 変換がよく知られてお
り、線分の検出に広く用いられている。図２を用いてHo
ugh 変換を説明する。図２において21は黒画像であり、
画像全体を走査し、黒ドットを検出した場合にその点を
通る各種の角度θの直線22を引き、原点からその直線に
対して垂線23を下ろして垂線の足の長さｒを求める。Various kinds of slips in which nothing is entered in advance are input by the image input device 1 and uprighted by the image correction device 2 and then stored in the template storage device 3. The erect image of the slip is detected by detecting many vertical and horizontal ruled line components printed on the slip, obtaining the inclination of the ruled line component, and rotating the slip backward. The Hough transform is well known in the field of image recognition and widely used for line segment detection. Using Figure 2 Ho
Explain the ugh transformation. In FIG. 2, 21 is a black image,
When the entire image is scanned and a black dot is detected, a straight line 22 passing through the point and having various angles θ is drawn, and a perpendicular line 23 is drawn from the origin to the straight line to obtain a foot length r of the perpendicular line.

【００１２】例えば、角度の解像度を１度とした場合
に、１つの黒ドットを検出する度に360 組の（ｒ、θ）
が得られる。その（ｒ、θ）の頻度分布のピークの位置
の（ｒ_k、θ_k) が罫線成分を表している。図１におい
て、新たに雛形に金額、個数、住所といった文字を記入
したものが前記画像入力装置１、前記画像歪み補正装置
２を用いて対象データとして入力される。図１において
４は画像判別装置であり、画像判別装置は前記の対象デ
ータが前記の雛形蓄積装置３の中に蓄積されている複数
の雛形画像の中のどの雛形に相当するのかを判別する。For example, assuming that the angle resolution is 1 degree, 360 pairs of (r, θ) are detected each time one black dot is detected.
Is obtained. The (r _k , θ _k ) of the peak position of the (r, θ) frequency distribution represents the ruled line component. In FIG. 1, a new model in which characters such as amount, number, and address are entered is input as target data using the image input device 1 and the image distortion correction device 2. In FIG. 1, reference numeral 4 denotes an image discriminating apparatus, and the image discriminating apparatus discriminates which template among the plurality of template images stored in the template storing apparatus 3 corresponds to the target data.

【００１３】画像の判別にはテンプレート・マッチング
が用いられる。これは、対象データと各種の雛形画像を
重ね合わせた場合の重なり割合を雛形ごとに計算し、一
番大きな重なり割合を示した雛形を相当する雛形と特定
する。画像が互いに上下左右にずれていた場合は重なり
割合は小さくなるが、上下左右に所定の範囲内でずらし
ながら前記重なり割合を計算し、最大の重なり割合を求
めてその雛形の重なり割合とする。但し、閾値を設けて
おき、特定された雛形の重なり割合が閾値以下の場合は
相当する雛形は存在しないとみなす。Template matching is used for image discrimination. This is to calculate the overlapping ratio for each model when the target data and various model images are superimposed, and specify the model showing the largest overlapping ratio as the corresponding model. When the images are vertically and horizontally displaced from each other, the overlapping ratio becomes small. However, the overlapping ratio is calculated by shifting the images vertically and horizontally within a predetermined range, and the maximum overlapping ratio is calculated and used as the overlapping ratio of the template. However, if a threshold value is set and the overlapping ratio of the specified template is equal to or less than the threshold value, it is considered that there is no corresponding template.

【００１４】対象画像は雛形画像と差分をとることによ
って記入された文字のみの画像となる。図３が差分の計
算を説明する。図３において、30は対象画像であり、31
は相当する雛形であり、32は文字画像のみとなった画像
である。この文字画像のみとなった画像は図１の文字認
識装置５によってコード化されデジタルデータ蓄積装置
６に蓄積される。図４において（Ａ）は文字枠40の中に
記入された９という数字41を示している。この場合、数
字は文字枠に接しており、従来認識対象文字の切り出し
に困難を来たし、認識率の低下をもたらしていた。
（Ｂ）は文字画像のみの画像の数字42を示し、認識対象
文字の切り出しが容易であり、通常に文字認識を行わせ
ることが可能である。The target image is an image of only the characters entered by taking the difference from the template image. FIG. 3 illustrates the difference calculation. In FIG. 3, 30 is the target image, and 31
Is a corresponding template, and 32 is an image with only character images. The image including only the character image is coded by the character recognition device 5 of FIG. 1 and stored in the digital data storage device 6. In FIG. 4, (A) shows the numeral 41 of 9 entered in the character frame 40. In this case, since the numbers are in contact with the character frame, it has been difficult to cut out the character to be recognized, and the recognition rate is lowered.
(B) shows the numeral 42 of the image of only the character image, the character to be recognized can be easily cut out, and the character can be normally recognized.

【００１５】またこの文字画像のみとなった画像は図１
の画像圧縮装置６によって圧縮された後に雛形識別情報
とともに前記デジタル画像蓄積装置７に蓄積される。圧
縮方法はファクシミリの分野でＣＣＩＴＴが勧告してい
るＭＲ圧縮法(T4 勧告) 、またはＭＭＲ圧縮法(T6 勧
告) を用いており、雛形付の無圧縮画像データに比べて
1/100 程度に圧縮することが可能である。蓄積された画
像データは必要に応じて雛形情報をもとに雛形蓄積装置
３から検索された雛形画像と重畳することにより復元
し、本発明の装置に画像表示装置や画像印刷装置や通信
装置を付加することによって問い合わせに対する確認の
ために供することが可能である。Further, the image which is only the character image is shown in FIG.
After being compressed by the image compression device 6, the image data is stored in the digital image storage device 7 together with the template identification information. As the compression method, the MR compression method (T4 recommendation) or MMR compression method (T6 recommendation) recommended by CCITT in the field of facsimile is used, and compared with the uncompressed image data with template.
It can be compressed to about 1/100. The stored image data is restored by superimposing it on the template image retrieved from the template storage device 3 based on the template information as necessary, and the image display device, the image printing device, or the communication device is added to the device of the present invention. By adding it, it can be used for confirmation of the inquiry.

【００１６】[0016]

【発明の効果】このように、この発明によればOCR 用紙
にドロップアウト・カラーを使用することなく従来の装
置に比べて文字認識率を向上させ、画像データの大量の
蓄積を可能にすることが可能である。As described above, according to the present invention, the character recognition rate is improved as compared with the conventional apparatus without using the dropout color for the OCR paper, and a large amount of image data can be stored. Is possible.

[Brief description of drawings]

【図１】本発明の構成を示す説明図である。FIG. 1 is an explanatory diagram showing a configuration of the present invention.

【図２】Hough 変換を説明する説明図である。FIG. 2 is an explanatory diagram illustrating Hough transform.

【図３】対象画像と雛形画像の差分画像を作成する説明
図である。FIG. 3 is an explanatory diagram of creating a difference image between a target image and a template image.

【図４】対象画像と、雛形画像との差分画像の違いを説
明する説明図である。FIG. 4 is an explanatory diagram illustrating a difference in a difference image between a target image and a template image.

[Brief description of reference numerals]

１画像入力装置２画像歪み補正装置３雛形蓄積装置４画像判別装置５文字認識装置６画像圧縮装置７デジタルデータ蓄積装置 21 黒画像 22 黒ドットを通る直線 23 原点を通る垂線 24 黒ドット 30 伝票に文字が記入された対象画像 31 雛形画像 32 文字画像のみとなった画像 40 記入枠 41 記入された文字 42 記入された文字 1 image input device 2 image distortion correction device 3 template storage device 4 image discrimination device 5 character recognition device 6 image compression device 7 digital data storage device 21 black image 22 straight line passing through black dot 23 perpendicular line passing through origin 24 black dot 30 on voucher Target image with text 31 Template image 32 Image with only text image 40 Entry frame 41 Text 42 Text entered

Claims

[Claims]

1. A data storage device for digitally inputting image data, converting character portions in an image into character codes, and storing the image data, an image input device, an image distortion correction device, an image discrimination device, a template image storage device, and a character recognition device. And an image compression device and a digital data storage device, the image input device is used to input a plurality of template images in advance, the image distortion correction device performs tilt correction, and then is registered in the template image storage device. After the inclination correction is performed using the image correction device for the template in which characters are newly written by handwriting or in print using the image input device, the template is stored in the template image storage device using the image discrimination device. Image data consisting only of characters is obtained by distinguishing from a plurality of templates and removing from the newly input template, and character parts are character coded by a character recognition device. Data storage apparatus characterized by storing the digital data storage device the image data and the character code consisting of only the characters that are compressed by the image compression apparatus after.