JP2001195543A

JP2001195543A - Device and method for processing document and storage medium

Info

Publication number: JP2001195543A
Application number: JP2000010138A
Authority: JP
Inventors: Shinobu Yamamoto; 忍山本
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2000-01-14
Filing date: 2000-01-14
Publication date: 2001-07-19

Abstract

PROBLEM TO BE SOLVED: To make accurately extractable only contents, which are described on a document later, from the image of the document even without designating the color of the document form of the document or color of the contents of characters or the like described later, beforehand. SOLUTION: When the color image of a document is read by a color image input means 2, an image classifying means classifies the color image into an image in the color comprising the document form previously described on the document and an image in the color comprising a character described on the document later, and these two images are extracted from the color image reading the document. On the basis of size comparison between the respective components of two images extracted by the image classifying means 3, one of two images is identified as the image of a character described later by an image identifying means 4. Concretely, the largest link component of pixels is extracted from two images, areas are found, the areas are compared, an image containing a larger circumscribed rectangle is defined as the image of the document form of the document and the other image is defined as the image of characters described later.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、帳票処理装置お
よびその方法ならびに記憶媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a form processing apparatus and method, and a storage medium.

【０００２】[0002]

【従来の技術】予め枠線や文字などからなる文書フォー
ムが記載されている帳票に対し、後から記入された文字
だけを認識する技術としては、特開平10-187882号公報
などに開示されている。2. Description of the Related Art Japanese Patent Application Laid-Open No. Hei 10-187882 discloses a technique for recognizing only a character entered later on a form in which a document form including frame lines and characters is described in advance. I have.

【０００３】ところで、一般に、ドロップアウトカラー
を用いない帳票においても、帳票に予め記載されている
文書フォームと、帳票に後から記入される文字との識別
しやすさなどを考慮して、この両者の色は異なる場合が
多い。このような帳票を読み取った画像から、帳票に記
載されている文書フォームの色の指定をあらかじめ行う
ことなく、帳票に後から記入される文字を抽出するため
に、前記特開平10-187882号公報に開示の技術では次の
ような処理を行っている。[0003] In general, even in a form that does not use a dropout color, in consideration of the easiness of discrimination between a document form described in advance on the form and characters to be entered later on the form, both of them are considered. Often have different colors. From the image read such a form, without previously specifying the color of the document form described in the form, in order to extract the characters to be entered later on the form, the JP-A-10-187882 Performs the following processing.

【０００４】すなわち、帳票に後から記入される文字を
黒色であると規定して、カラースキャナで読み取った帳
票の画像中で、帳票の文書フォームの色が後で記入され
た文字の色より明度が高くなることを利用し、しきい値
処理を行って、後から記入された文字だけを分離して識
別するようにしている。[0004] That is, it is defined that the characters to be entered later on the form are black, and in the image of the form read by the color scanner, the color of the document form of the form is lighter than the color of the character entered later. Utilizing the fact that is higher, threshold processing is performed to separate and identify only characters that are entered later.

【０００５】[0005]

【発明が解決しようとする課題】しかし、前記特開平10
-187882号公報に開示の技術では、文字色が黒でないよ
うな場合には帳票に後から記入された文字だけを抽出で
きないという不具合がある。帳票の中には、数枚つづり
のカーボンやノンカーボンの複写用紙を用いたものがあ
るが、その複写された紙では文字の色が黒であるとは限
らない。その場合、青色が用いられる場合が多いが、色
の彩度や明度などは帳票によって異なるため、それをあ
らかじめ規定しておくことができない場合が多い。SUMMARY OF THE INVENTION However, Japanese Patent Application Laid-Open
The technique disclosed in Japanese Patent Application Laid-Open No. 187882 has a problem that, when the character color is not black, it is not possible to extract only a character entered later on the form. Some forms use several sheets of carbon or non-carbon copy paper, but the color of the copied paper is not always black. In that case, blue is often used, but since the saturation and brightness of the color and the like differ depending on the form, it is often not possible to specify the color in advance.

【０００６】この発明の目的は、あらかじめ帳票の文書
フォームの色や後から記入された文字等の内容の色を指
定しなくても、帳票の画像から帳票に後から記入された
内容だけを精度よく抽出できるようにすることである。SUMMARY OF THE INVENTION An object of the present invention is to make it possible to accurately analyze only the contents of a form from a form image later without specifying the color of a document form of the form or the color of contents such as characters entered later. It is to be able to extract well.

【０００７】この発明の別の目的は、文書フォームとし
て帳票に枠線が記載されている場合などに、帳票の画像
から帳票に後から記入された内容だけを精度よく抽出で
きるようにすることである。[0007] Another object of the present invention is to be able to accurately extract only the contents that are later entered in a form from a form image, for example, when a frame is described in the form as a document form. is there.

【０００８】この発明の別の目的は、文書フォームとし
て帳票に罫線が記載されている場合などに、帳票の画像
から帳票に後から記入された内容だけを精度よく抽出で
きるようにすることである。[0008] Another object of the present invention is to be able to accurately extract only the contents that have been entered in a form later from an image of the form when a ruled line is described in the form as a document form. .

【０００９】この発明の別の目的は、帳票に記載された
文字のうち後から記入されたもののみを認識できるよう
にすることである。Another object of the present invention is to make it possible to recognize only the characters entered on the form which are entered later.

【００１０】[0010]

【課題を解決するための手段】請求項１に記載の発明
は、帳票に予め記載されている文書フォームを構成する
色の画像と前記帳票に後から記入された内容を構成する
色の画像とに各々分類して、この２つの画像を前記帳票
を読み取った画像から抽出する画像分類手段と、この画
像分類手段で抽出された前記２つの画像のそれぞれの構
成要素間における大小比較に基づいて前記２つの画像の
うちの一方を後から記入された前記内容の画像として識
別する画像識別手段と、を備えている帳票処理装置であ
る。According to a first aspect of the present invention, there is provided an image processing apparatus comprising: a color image forming a document form described in a form in advance; and a color image forming a content entered later in the form. Image classifying means for extracting the two images from the image obtained by reading the form, and based on a magnitude comparison between respective components of the two images extracted by the image classifying means. An image identification means for identifying one of the two images as an image of the content entered later.

【００１１】したがって、あらかじめ帳票の文書フォー
ムの色や後から記入された文字等の内容の色を指定しな
くても、帳票の画像から帳票に後から記入された内容だ
けを精度よく抽出することができる。Therefore, it is possible to accurately extract only the contents of a form afterward from an image of the form without specifying the color of the document form of the form or the color of the contents such as characters entered later. Can be.

【００１２】請求項２に記載の発明は、請求項１に記載
の帳票処理装置において、前記画像識別手段は、前記特
定部分の大小比較として前記２つの画像にそれぞれ含ま
れる画素の連結部分の大きさの比較を行うものである。According to a second aspect of the present invention, in the form processing apparatus according to the first aspect, the image identification unit determines a size of a connected portion of pixels included in each of the two images as a magnitude comparison of the specific portion. This is to make a comparison.

【００１３】したがって、文書フォームとして帳票に枠
線が記載されている場合などに、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。Therefore, when a frame is described in a form as a document form, for example, it is possible to accurately extract only the contents later entered in the form from the image of the form.

【００１４】請求項３に記載の発明は、請求項１に記載
の帳票処理装置において、前記画像識別手段は、前記構
成要素間の大小比較として前記２つの画像にそれぞれ含
まれる直線の長さの比較を行うものである。According to a third aspect of the present invention, in the form processing apparatus according to the first aspect, the image identification unit determines a length of a straight line included in each of the two images as a magnitude comparison between the constituent elements. This is to make a comparison.

【００１５】したがって、文書フォームとして帳票に枠
線が記載されている場合のみならず、枠線に代えて罫線
が記載されている場合などにも、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。Therefore, not only when a frame is described on a form as a document form, but also when a ruled line is described in place of the frame, the contents entered later on the form from the form image Can be accurately extracted.

【００１６】請求項４に記載の発明は、請求項１〜３の
いずれかの一に記載の帳票処理装置において、前記画像
識別手段により後から記入された前記内容のものとして
識別された画像から文字を構成する画像を抽出する文字
抽出手段と、この抽出された文字を構成する画像の文字
認識を行う文字認識手段と、を備えている。According to a fourth aspect of the present invention, there is provided the form processing apparatus according to any one of the first to third aspects, wherein the form identification unit is configured to register the image identified by the image identification unit as the content described later. The image processing apparatus includes character extracting means for extracting an image forming a character, and character recognizing means for performing character recognition of the image forming the extracted character.

【００１７】したがって、帳票に記載された文字のうち
後から記入されたもののみを認識することができる。[0017] Therefore, it is possible to recognize only the later entered characters among the characters described in the form.

【００１８】請求項５に記載の発明は、帳票に予め記載
されている文書フォームを構成する色の画像と前記帳票
に後から記入された内容を構成する色の画像とに各々分
類して、この２つの画像を前記帳票を読み取った画像か
ら抽出する画像分類工程と、この画像分類手段で抽出さ
れた前記２つの画像のそれぞれの構成要素間における大
小比較に基づいて前記２つの画像のうちの一方を後から
記入された前記内容の画像として識別する画像識別工程
と、を含んでなる帳票読取方法である。According to a fifth aspect of the present invention, a color image constituting a document form described in advance on a form and a color image constituting contents which are later entered in the form are classified into: An image classification step of extracting the two images from the image obtained by reading the form, and based on a magnitude comparison between respective constituent elements of the two images extracted by the image classification means, An image identification step of identifying one as an image of the content entered later.

【００１９】したがって、あらかじめ帳票の文書フォー
ムの色や後から記入された文字等の内容の色を指定しな
くても、帳票の画像から帳票に後から記入された内容だ
けを精度よく抽出することができる。Therefore, it is possible to accurately extract only the contents of a form from a form image later without specifying the color of the form of the form or the color of contents such as characters entered later. Can be.

【００２０】請求項６に記載の発明は、請求項５に記載
の帳票読取方法において、前記画像識別工程は、前記構
成要素間の大小比較として前記２つの画像にそれぞれ含
まれる画素の連結部分の大きさの比較を行うものであ
る。According to a sixth aspect of the present invention, in the form reading method according to the fifth aspect, the image identifying step includes a step of comparing a connected portion of pixels included in each of the two images as a magnitude comparison between the constituent elements. This is to compare the sizes.

【００２１】したがって、文書フォームとして帳票に枠
線が記載されている場合などに、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。Therefore, in the case where a frame line is described in a form as a document form, for example, it is possible to accurately extract only the contents later entered in the form from the image of the form.

【００２２】請求項７に記載の発明は、請求項５に記載
の帳票読取方法において、前記画像識別工程は、前記構
成要素間の大小比較として前記２つの画像にそれぞれ含
まれる直線の長さの比較を行うものである。According to a seventh aspect of the present invention, in the form reading method according to the fifth aspect, the image identification step includes determining a length of a straight line included in each of the two images as a magnitude comparison between the constituent elements. This is to make a comparison.

【００２３】したがって、文書フォームとして帳票に枠
線が記載されている場合のみならず、枠線に代えて罫線
が記載されている場合などにも、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。Therefore, not only when a frame is described in a form as a document form, but also when a ruled line is described in place of the frame, etc., the contents entered later on the form from the form image Can be accurately extracted.

【００２４】請求項８に記載の発明は、請求項１〜３の
いずれかの一に記載の帳票読取方法において、前記画像
識別工程により後から記入された前記内容のものとして
識別された画像から文字を構成する画像を抽出する文字
抽出工程と、この抽出された文字を構成する画像の文字
認識を行う文字認識工程と、を含んでなる。According to an eighth aspect of the present invention, there is provided the form reading method according to any one of the first to third aspects, wherein an image identified as the content later entered in the image identifying step is used. The method includes a character extracting step of extracting an image forming a character, and a character recognizing step of performing character recognition of an image forming the extracted character.

【００２５】したがって、帳票に記載された文字のうち
後から記入されたもののみを認識することができる。Therefore, it is possible to recognize only the characters entered later among the characters described in the form.

【００２６】請求項９に記載の発明は、帳票に予め記載
されている文書フォームを構成する色の画像と前記帳票
に後から記入された内容を構成する色の画像とに各々分
類して、この２つの画像を前記帳票を読み取った画像か
ら抽出する画像分類工程と、この画像分類手段で抽出さ
れた前記２つの画像のそれぞれの構成要素間における大
小比較に基づいて前記２つの画像のうちの一方を後から
記入された前記内容の画像として識別する画像識別工程
と、をコンピュータに実行させるプログラムを記憶して
いるコンピュータに読取可能な記憶媒体である。According to a ninth aspect of the present invention, a color image constituting a document form described in advance on a form and a color image constituting contents to be later entered in the form are classified into: An image classification step of extracting the two images from the image obtained by reading the form, and based on a magnitude comparison between respective constituent elements of the two images extracted by the image classification means, And a computer-readable storage medium storing a program for causing the computer to execute an image identification step of identifying one of the contents as an image of the content entered later.

【００２７】したがって、あらかじめ帳票の文書フォー
ムの色や後から記入された文字等の内容の色を指定しな
くても、帳票の画像から帳票に後から記入された内容だ
けを精度よく抽出することができる。Therefore, it is possible to accurately extract only the contents of a form later written from a form image without specifying the color of a document form of the form or the color of contents such as characters entered later. Can be.

【００２８】請求項１０に記載の発明は、請求項９に記
載の記憶媒体において、前記画像識別工程は、前記構成
要素間の大小比較として前記２つの画像にそれぞれ含ま
れる画素の連結部分の大きさの比較を行うものである。According to a tenth aspect of the present invention, in the storage medium according to the ninth aspect, in the image identifying step, a size of a connected portion of a pixel included in each of the two images is compared as a magnitude comparison between the constituent elements. This is to make a comparison.

【００２９】したがって、文書フォームとして帳票に枠
線が記載されている場合などに、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。Therefore, when a frame is described in a document as a document form, it is possible to accurately extract only the content that is later entered in the document from the image of the document.

【００３０】請求項１１に記載の発明は、請求項９に記
載の記憶媒体において、前記画像識別工程は、前記構成
要素間の大小比較として前記２つの画像にそれぞれ含ま
れる直線の長さの比較を行うものである。According to an eleventh aspect of the present invention, in the storage medium according to the ninth aspect, the image identifying step compares the length of a straight line included in each of the two images as a magnitude comparison between the constituent elements. Is what you do.

【００３１】したがって、文書フォームとして帳票に枠
線が記載されている場合のみならず、枠線に代えて罫線
が記載されている場合などにも、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。Therefore, not only when a frame line is described on a form as a document form, but also when a ruled line is described in place of the frame line, the content entered later on the form from the image of the form. Can be accurately extracted.

【００３２】請求項１２に記載の発明は、請求項９〜１
１のいずれかの一に記載の記憶媒体において、前記プロ
グラムは、さらに、後から記入された前記内容のものと
して前記画像識別工程により識別された画像から文字を
構成する画像を抽出する文字抽出工程と、この抽出され
た文字を構成する画像の文字認識を行う文字認識工程
と、をコンピュータに実行させるものである。According to the twelfth aspect of the present invention, there is provided the ninth to the first aspects.
1. The storage medium according to claim 1, wherein the program further includes: a character extracting step of extracting an image forming a character from an image identified by the image identifying step as the content entered later. And a character recognition step of performing character recognition of an image constituting the extracted character.

【００３３】したがって、帳票に記載された文字のうち
後から記入されたもののみを認識することができる。Therefore, it is possible to recognize only the characters entered later among the characters described in the form.

【００３４】[0034]

【発明の実施の形態】［発明の実施の形態１］この発明
の一実施の形態を発明の実施の形態１として説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS [First Embodiment of the Invention] One embodiment of the present invention will be described as a first embodiment of the present invention.

【００３５】図１は、この発明の実施の形態１である帳
票処理装置１の機能ブロック図である。図１に示すよう
に、この帳票処理装置１は、帳票のカラー画像を入力す
るカラー画像入力手段２と、この入力された帳票のカラ
ー画像を、予め帳票に記入されている枠線、文字、図形
などから構成される文書フォーム（予め文書作成用に用
意されているテンプレートなど）の色のものと、後から
記入された文字の色のものとに分類して各々抽出する画
像分類手段３と、この分類された画像が、予め帳票に記
入されている文書フォームと、後から記入された文字と
のうち、どちらのものであるかを判別する画像識別手段
４と、各帳票ごとに文字を記入すべき領域を示す情報を
記憶している文字領域情報辞書５を参照して、帳票に記
載されている文字を抽出する文字抽出手段６と、文字の
画像とその文字コードとを対応付けて文字ごとに記憶し
ている文字認識辞書７を参照して、文字抽出手段６によ
り抽出された文字を認識する文字認識手段８と、その文
字認識の結果を出力する出力手段９とにより構成され
る。FIG. 1 is a functional block diagram of a form processing apparatus 1 according to the first embodiment of the present invention. As shown in FIG. 1, the form processing apparatus 1 includes a color image input unit 2 for inputting a color image of a form, and a frame line, a character, An image classifying means 3 for classifying into a document form composed of figures and the like (a template prepared in advance for document creation and the like) and a color of a character entered later and extracting each of them; An image identification means 4 for determining which of the classified image is a document form pre-filled in a form and a character entered later, and a character for each form. Referring to a character area information dictionary 5 storing information indicating an area to be filled, a character extracting means 6 for extracting a character described in a form, a character image and its character code are associated with each other. Remember for each character That with reference to the character recognition dictionary 7, and recognizes the character recognition means 8 the extracted character by character extraction means 6, constituted by an output unit 9 for outputting a result of the character recognition.

【００３６】次に、帳票処理装置１が行う処理について
説明する。図２は、帳票処理装置１が行う処理の流れを
示すフローチャートである。以下では、図３に示すよう
な帳票１１を例にして説明する。図２に示すように、カ
ラー画像入力手段２によって帳票のカラー画像を入力す
る（ステップＳ１）。このカラー画像の画像データは、
画素ごとに、Ｒ（レッド），Ｇ（グリーン），Ｂ（ブル
ー）のそれぞれの色成分について多値の画素値をもつの
で、この画素値をもとに、まず帳票の文書フォームの色
をもつ画素と、後から記入された文字の色をもつ画素
と、背景色の画素を、画像分類手段３により分類する
（ステップＳ２）。これは、一般によく知られる統計的
な方法、たとえば判別分析法などによって、画素値を複
数の集合に分類することなどにより実現できる。そし
て、帳票の文書フォームの色をもつ画素と、後から記入
された文字の色をもつ画素とに分類して、その２つに分
類された画素を異なる２つの画像上に各々マップするこ
とによって、図３に示す帳票１１の読取画像は、図４に
示す帳票１１の文書フォームの色をもつ画素からなる画
像１２と、図５に示す後から帳票１１に記入された文字
の色をもつ画素からなる画像１３とに分類される。ステ
ップＳ２により、画像分類工程が実現される。Next, the processing performed by the form processing apparatus 1 will be described. FIG. 2 is a flowchart illustrating a flow of a process performed by the form processing apparatus 1. In the following, a description will be given using the form 11 as shown in FIG. 3 as an example. As shown in FIG. 2, a color image of a form is input by the color image input means 2 (step S1). The image data of this color image is
Each pixel has a multi-valued pixel value for each of the R (red), G (green), and B (blue) color components. Based on this pixel value, the color of the document form of the form is first determined. Pixels, pixels having the character color written later, and pixels having the background color are classified by the image classification means 3 (step S2). This can be realized by classifying pixel values into a plurality of sets by a generally well-known statistical method, for example, a discriminant analysis method or the like. Then, the pixels are classified into the pixels having the color of the document form of the form and the pixels having the color of the character entered later, and the two classified pixels are respectively mapped on two different images. The read image of the form 11 shown in FIG. 3 is composed of an image 12 composed of pixels having the color of the document form of the form 11 shown in FIG. 4 and a pixel having the color of a character written in the form 11 later shown in FIG. And an image 13 composed of The step S2 implements the image classification step.

【００３７】しかし、この時点では、画像を２つに分類
できたのみであり、どちらの色の画像が、帳票１１の文
書フォームの画像１２であり、帳票１１に後から記入さ
れた文字の画像１３であるのかが不明であるため、次
に、画像識別手段４により、画像１２，１３を識別する
（ステップＳ３）。その手段としては、２つの画像１
２，１３のそれぞれの構成要素間における大小比較に基
づいて行えば、あらかじめ帳票１１の文書フォームの色
や後から記入された文字の色を指定しなくても、帳票１
１の文書フォームの画像と、帳票１１に後から記入され
た文字の画像とを精度よく識別することができる。ステ
ップＳ３により画像識別工程が実現される。However, at this point, only the images can be classified into two, and the image of either color is the image 12 of the document form of the form 11, and the image of the character entered later on the form 11 Since it is unclear whether the image is 13 or not, the images 12 and 13 are identified by the image identifying means 4 (step S3). As a means, two images 1
By performing the comparison based on the magnitude comparison between the constituent elements 2 and 13, the form 1 can be used without specifying the color of the document form of the form 11 or the color of the character entered later.
The image of the first document form and the image of the character entered later on the form 11 can be identified with high accuracy. The image identification step is realized by step S3.

【００３８】具体的には次のように処理する。一般に、
帳票１１の文書フォームの方が、そこに後から記入する
文字よりも、図形的に大きいものが多く含まれる。この
ことを利用して、各画像１２，１３から画素の連結成分
のうち最も大きいものを抽出してその面積を求め、それ
らを比較して大きい方の外接矩形を含む画像を帳票１１
の文書フォームの画像１２、他方を後から記入された文
字の画像１３とすればよい。すなわち、図６に示すよう
に、画像１２の最大の連結成分の外接矩形１４を抽出
し、また、図７に示すように、画像１３の最大の連結成
分の外接矩形１５を抽出し、外接矩形１４と１５とを比
較すれば、この例では、外接矩形１４の方が大きいの
で、外接矩形１４を含む画像１２が、帳票１１の文書フ
ォームの色をもつ画素からなる画像であることがわか
る。Specifically, the processing is performed as follows. In general,
The document form of the form 11 includes many figures that are graphically larger than characters to be entered later. Utilizing this, the largest one of the connected components of pixels is extracted from each of the images 12 and 13 to obtain the area thereof, and the areas are compared.
Image 12 of the document form, and the image 13 of the character entered later. That is, as shown in FIG. 6, a circumscribed rectangle 14 of the largest connected component of the image 12 is extracted, and as shown in FIG. 7, a circumscribed rectangle 15 of the largest connected component of the image 13 is extracted. When 14 and 15 are compared, in this example, since the circumscribed rectangle 14 is larger, it is understood that the image 12 including the circumscribed rectangle 14 is an image composed of pixels having the color of the document form of the form 11.

【００３９】そして、文字領域情報辞書５を参照して、
帳票１１において文字を記入すべき領域を特定し、帳票
１１に後から記入された文字の画像１３中における当該
領域から文字の画像を、文字抽出手段６により抽出し
（ステップＳ４）、その抽出した文字の画像に対応する
文字コードを、文字認識辞書７を参照して文字認識手段
８で取得することにより、文字認識を実行して（ステッ
プＳ５）、その結果を出力手段９で出力する（ステップ
Ｓ６）。ステップＳ４により文字抽出工程が、ステップ
Ｓ５により文字認識工程が実現される。Then, referring to the character area information dictionary 5,
An area where a character is to be written in the form 11 is specified, and a character image is extracted from the area in the character image 13 written later in the form 11 by the character extracting means 6 (step S4), and the extracted character image is extracted. The character recognition is executed by acquiring the character code corresponding to the character image by the character recognition unit 8 with reference to the character recognition dictionary 7 (step S5), and the result is output by the output unit 9 (step S5). S6). The character extracting step is realized by step S4, and the character recognizing step is realized by step S5.

【００４０】図８は、帳票処理装置１の電気的な接続を
示すブロック図である。帳票処理装置１は、各種演算を
行い、各種の制御を行うＣＰＵ２１と、ＢＩＯＳなどが
格納されたＲＯＭ２２と、各種のデータを書換え可能に
記憶し、ＣＰＵ２１の作業エリアとなるＲＡＭ２３と
が、バス２４により接続されている。バス２４には、さ
らに、各種インターフェイス２５を介して、キーボー
ド、マウスなどの入力装置２６と、ディスプレイ、プリ
ンタなどの出力装置２７と、外部記憶装置であるハード
ディスク２８と、記憶媒体であるＣＤ−ＲＯＭ２９を読
み取るＣＤ−ＲＯＭドライブ３０と、フラットベッドタ
イプなどのイメージスキャナ３１と、インターネットな
どと通信を行う通信制御装置３２とが接続されている。FIG. 8 is a block diagram showing the electrical connection of the form processing apparatus 1. The form processing apparatus 1 includes a CPU 21 that performs various calculations and performs various controls, a ROM 22 that stores a BIOS and the like, and a RAM 23 that stores various data in a rewritable manner and is a work area of the CPU 21. Connected by The bus 24 further includes, via various interfaces 25, an input device 26 such as a keyboard and a mouse, an output device 27 such as a display and a printer, a hard disk 28 as an external storage device, and a CD-ROM 29 as a storage medium. A CD-ROM drive 30 for reading an image, an image scanner 31 such as a flatbed type, and a communication control device 32 for communicating with the Internet or the like are connected.

【００４１】ＣＤ−ＲＯＭ２９には各種のプログラムが
記憶されていて、このプログラムをＣＤ−ＲＯＭドライ
ブ３０で読み取り、ハードディスク３０にインストール
することにより、前記の各種処理の実行が可能な状態と
なる。すなわち、イメージスキャナ３１で帳票を読み取
ることでカラー画像入力手段２が実現され、帳票処理装
置１の内部処理として画像分類手段３、画像識別手段
４、文字抽出手段６および文字認識手段８が実現され、
これらの処理の結果を出力装置２７に出力することによ
り出力手段９が実現される。文字領域情報辞書５、文字
認識辞書７を記憶する領域もハードディスク２８に確保
される。Various programs are stored in the CD-ROM 29. The programs are read by the CD-ROM drive 30 and installed on the hard disk 30, whereby the above-mentioned various processes can be executed. That is, the form is read by the image scanner 31 to realize the color image input unit 2, and the internal processing of the form processing apparatus 1 is realized by the image classification unit 3, the image identification unit 4, the character extraction unit 6, and the character recognition unit 8. ,
The output unit 9 is realized by outputting the results of these processes to the output device 27. An area for storing the character area information dictionary 5 and the character recognition dictionary 7 is also secured on the hard disk 28.

【００４２】記憶媒体はＣＤ−ＲＯＭ２９に限定される
ものではなく、ＣＤ−Ｒ、ＣＤ−ＲＡＭ等、他の方式の
ＣＤや、ＤＶＤ、ＭＯ、ＦＤなどの各種の記録メディア
を用いることができる。また、インターネットなどから
通信制御装置３２を介して前記プログラムをダウンロー
ドして、ハードディスク３０にインストールするように
してもよく、この場合に、インターネットなどに接続さ
れた送信側の装置に用いられ、前記のプログラムを送信
可能に記憶している記憶装置も、本発明の記憶媒体であ
る。なお、前記のプログラムは、所定のＯＳ上で動作す
るものであってもよい。The storage medium is not limited to the CD-ROM 29, and various types of recording media such as CD-R, CD-RAM, and other types of CDs, DVDs, MOs, and FDs can be used. Further, the program may be downloaded from the Internet or the like via the communication control device 32 and installed on the hard disk 30. In this case, the program is used for a transmission-side device connected to the Internet or the like. A storage device that stores a program so that it can be transmitted is also a storage medium of the present invention. Note that the program may be operated on a predetermined OS.

【００４３】以上説明した帳票処理装置１によれば、複
数枚綴りの複写用紙の２枚目と３枚目のように、同じ種
類の帳票で色が異なるような帳票の画像が入力された場
合でも、帳票の文書フォームの色と、帳票に後から記入
された文字の色をあらかじめ指定しておくことなく、正
しく文字を抽出して認識することができる。また、文字
の色が黒である必要が無いため、複数枚つづりの複写用
紙でよく用いられるような、黒色以外の色の文字につい
ても、正しく文字を抽出して認識することができる。According to the form processing apparatus 1 described above, when an image of a form in which the same type of form is different in color is input, such as the second and third sheets of a plurality of spelled copy sheets. However, it is possible to correctly extract and recognize characters without having to specify in advance the colors of the document form of the form and the colors of the characters that are later entered in the form. In addition, since the color of the character does not need to be black, it is possible to correctly extract and recognize a character having a color other than black, which is often used for a copy sheet composed of a plurality of sheets.

【００４４】［発明の実施の形態２］この発明の別の実
施の形態を発明の実施の形態２として説明する。[Second Embodiment of the Invention] Another embodiment of the present invention will be described as a second embodiment of the present invention.

【００４５】この発明の実施の形態２において、発明の
実施の形態１と共通する事項については、同一符号を用
い、詳細な説明は省略する。発明の実施の形態２が発明
の実施の形態１と相違する点は、画像識別手段４により
実行されるステップＳ３の処理の内容にある。In the second embodiment of the present invention, items common to the first embodiment of the present invention are denoted by the same reference numerals, and detailed description is omitted. The difference between the second embodiment of the present invention and the first embodiment of the present invention lies in the content of the processing of step S3 executed by the image identifying means 4.

【００４６】すなわち、帳票の文書フォームが、図３に
示す帳票１１のような矩形の枠ではなく、罫線で構成さ
れるような場合には、連結成分の外接矩形の面積が小さ
くなるため、帳票の文書フォームの色と、帳票に後から
記入された文字とを正しく識別できない場合が生じるこ
とがある。例えば、図５のような帳票画像の場合、帳票
の文書フォームの連結成分における外接矩形の最大のも
のの面積は、文字の画像の面積に比べて大きいとはいえ
ず、識別を誤ることがある。That is, when the document form of the form is not a rectangular frame like the form 11 shown in FIG. 3 but is composed of ruled lines, the area of the circumscribed rectangle of the connected component becomes small. In some cases, it may not be possible to correctly identify the color of the document form and characters that are later entered in the form. For example, in the case of the form image as shown in FIG. 5, the area of the largest circumscribed rectangle in the connected component of the document form of the form is not large compared to the area of the character image, and may be erroneously identified.

【００４７】これを防ぐために、連結成分の外接矩形の
面積について大小を判断する代わりに、各画像に含まれ
る直線の最大長さを算出し、それらを比較して直線の最
大長さの大きい方を含む画像を帳票の文書フォームの画
像、他方を後から記入された文字の画像とすればよい。In order to prevent this, instead of judging the size of the area of the circumscribed rectangle of the connected component, the maximum length of the straight line included in each image is calculated, and these are compared to determine the maximum length of the straight line. May be an image of the document form of the form, and the other may be an image of characters entered later.

【００４８】すなわち、一般に帳票の文書フォームに含
まれる罫線は、縦または横方向にある程度の長さをもっ
た直線であることが多く、縦または横方向に画素の度数
分布をとれば、直線がある位置で度数が極端に大きくな
る。後から記入された文字の画像の度数分布は、これほ
どの値をもたない。この特徴を用いて両画像を識別する
ことで、枠をもたない帳票の文書フォームの画像につい
ても識別を誤ることがなく、正しく文字を抽出して認識
することができる。That is, in general, ruled lines included in a document form of a form are often straight lines having a certain length in the vertical or horizontal direction. The frequency becomes extremely large at a certain position. The frequency distribution of the image of the character entered later does not have such a value. By distinguishing both images using this feature, it is possible to correctly extract and recognize characters without erroneous identification of an image of a document form of a form without a frame.

【００４９】例えば、図９に示す帳票４１の場合、その
帳票４１の文書フォームの画像は、図１０に示す画像４
２であり、帳票４１に後から記入された文字の画像は、
図１１に示す画像４３である。この例で、それぞれの画
像４２，４３に対して縦または横方向に値をもつ画素の
度数分布を求め、それぞれの度数の最大値が大きい方を
帳票の文書フォームの画像４２とする。すなわち、図９
〜図１１に示す例では、図１２に示すように画像４２
（図１２（ａ））の画像の横方向に値をもつ画素の度数
分布を求め（図１２（ｂ））、また、図１３に示すよう
に画像４３（図１３（ａ））の画像の横方向に値をもつ
画素の度数分布を求め（図１３（ｂ））、両者の度数の
最大値を比べてみれば、画像４２の方が度数の最大値が
大きいため（図１２（ｂ）、図１３（ｂ））、画像４２
の方が帳票４１の文書フォームの画像であると判断する
ことができる。For example, in the case of the form 41 shown in FIG. 9, the image of the document form of the form 41 is the image 4 shown in FIG.
2, and the image of the character entered later in the form 41 is
It is an image 43 shown in FIG. In this example, the frequency distribution of pixels having a value in the vertical or horizontal direction is obtained for each of the images 42 and 43, and the larger one of the respective frequencies is defined as the image 42 of the document document form. That is, FIG.
In the example shown in FIG. 11 to FIG.
The frequency distribution of pixels having values in the horizontal direction of the image of FIG. 12A is obtained (FIG. 12B), and the image 43 of FIG. The frequency distribution of pixels having values in the horizontal direction is obtained (FIG. 13B), and when the maximum values of the frequencies are compared, the maximum value of the image 42 is larger than that of the image 42 (FIG. 12B). 13 (b)), image 42
Can be determined to be the image of the document form of the form 41.

【００５０】[0050]

【発明の効果】請求項１に記載の発明は、あらかじめ帳
票の文書フォームの色や後から記入された文字等の内容
の色を指定しなくても、帳票の画像から帳票に後から記
入された内容だけを精度よく抽出することができる。According to the first aspect of the present invention, it is possible to fill out a form from an image of a form later without specifying a color of a document form of the form or a color of contents such as characters entered later. Only the contents that have been extracted can be accurately extracted.

【００５１】請求項２に記載の発明は、請求項１に記載
の帳票処理装置において、文書フォームとして帳票に枠
線が記載されている場合などに、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。According to a second aspect of the present invention, in the form processing apparatus according to the first aspect, when a frame line is described in the form as a document form, the form is later filled out from the form image. Only the contents can be accurately extracted.

【００５２】請求項３に記載の発明は、請求項１に記載
の帳票処理装置において、文書フォームとして帳票に枠
線が記載されている場合のみならず、枠線に代えて罫線
が記載されている場合などにも、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。According to a third aspect of the present invention, in the form processing apparatus according to the first aspect, not only a case where a frame is described as a document form but also a ruled line is described instead of the frame line. In such a case, it is possible to accurately extract only the contents later entered in the form from the form image.

【００５３】請求項４に記載の発明は、請求項１〜３の
いずれかの一に記載の帳票処理装置において、帳票に記
載された文字のうち後から記入されたもののみを認識す
ることができる。According to a fourth aspect of the present invention, in the form processing apparatus according to any one of the first to third aspects, it is possible to recognize only a character entered later in the form. it can.

【００５４】請求項５に記載の発明は、あらかじめ帳票
の文書フォームの色や後から記入された文字等の内容の
色を指定しなくても、帳票の画像から帳票に後から記入
された内容だけを精度よく抽出することができる。According to the fifth aspect of the present invention, it is possible to provide a content that is later entered into a form from an image of the form without specifying the color of the document form of the form or the color of the content such as characters entered later. Can be accurately extracted.

【００５５】請求項６に記載の発明は、請求項５に記載
の帳票読取方法において、文書フォームとして帳票に枠
線が記載されている場合などに、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。According to a sixth aspect of the present invention, in the form reading method according to the fifth aspect, when a frame line is described in the form as a document form, the form is later entered from the form image into the form. Only the contents can be accurately extracted.

【００５６】請求項７に記載の発明は、請求項５に記載
の帳票読取方法において、文書フォームとして帳票に枠
線が記載されている場合のみならず、枠線に代えて罫線
が記載されている場合などにも、帳票の画像から帳票に
後から記入された内容だけを精度よく抽出することがで
きる。According to a seventh aspect of the present invention, in the form reading method according to the fifth aspect, not only a case where a frame is described in a form as a document form but also a ruled line is described in place of the frame. In such a case, it is possible to accurately extract only the contents later entered in the form from the form image.

【００５７】請求項８に記載の発明は、請求項１〜３の
いずれかの一に記載の帳票読取方法において、帳票に記
載された文字のうち後から記入されたもののみを認識す
ることができる。According to an eighth aspect of the present invention, in the form reading method according to any one of the first to third aspects, it is possible to recognize only a character entered later in the form. it can.

【００５８】請求項９に記載の発明は、あらかじめ帳票
の文書フォームの色や後から記入された文字等の内容の
色を指定しなくても、帳票の画像から帳票に後から記入
された内容だけを精度よく抽出することができる。According to a ninth aspect of the present invention, the contents of a form that is later entered from a form image without specifying the color of the document form of the form or the color of the contents such as characters entered later. Can be accurately extracted.

【００５９】請求項１０に記載の発明は、請求項９に記
載の記憶媒体において、文書フォームとして帳票に枠線
が記載されている場合などに、帳票の画像から帳票に後
から記入された内容だけを精度よく抽出することができ
る。According to a tenth aspect of the present invention, in the storage medium according to the ninth aspect, when a frame line is described in a form as a document form, the content that is later entered in the form from the form image Can be accurately extracted.

【００６０】請求項１１に記載の発明は、請求項９に記
載の記憶媒体において、文書フォームとして帳票に枠線
が記載されている場合のみならず、枠線に代えて罫線が
記載されている場合などにも、帳票の画像から帳票に後
から記入された内容だけを精度よく抽出することができ
る。According to an eleventh aspect of the present invention, in the storage medium according to the ninth aspect, not only a case where a frame is described in a form as a document form, but also a ruled line is described in place of the frame line. Even in such cases, it is possible to accurately extract only the contents later entered in the form from the form image.

【００６１】請求項１２に記載の発明は、請求項９〜１
１のいずれかの一に記載の記憶媒体において、帳票に記
載された文字のうち後から記入されたもののみを認識す
ることができる。The twelfth aspect of the present invention relates to the ninth to the first aspects.
In the storage medium according to any one of (1) and (3), only the characters written later in the form can be recognized.

[Brief description of the drawings]

【図１】この発明の実施の形態１である帳票処理装置の
機能ブロック図である。FIG. 1 is a functional block diagram of a form processing apparatus according to a first embodiment of the present invention.

【図２】前記帳票処理装置の処理を説明するフローチャ
ートである。FIG. 2 is a flowchart illustrating processing of the form processing device.

【図３】前記帳票処理装置で読み取る帳票の例を示す平
面図である。FIG. 3 is a plan view showing an example of a form read by the form processing apparatus.

【図４】前記帳票の画像から抽出されたあらかじめ帳票
に記入されている文書フォームの画像を示す平面図であ
る。FIG. 4 is a plan view showing an image of a document form which is extracted from the image of the form and which is previously entered in the form.

【図５】前記帳票の画像から抽出された後から帳票に記
入した文字の画像を示す平面図である。FIG. 5 is a plan view showing an image of a character written on a form after being extracted from the form image.

【図６】前記文書フォームの画像中で画素の連結成分が
最大である外接矩形を示す平面図である。FIG. 6 is a plan view showing a circumscribed rectangle in which a connected component of a pixel is maximum in the image of the document form.

【図７】前記文字の画像中で画素の連結成分が最大であ
る外接矩形を示す平面図である。FIG. 7 is a plan view showing a circumscribed rectangle in which a connected component of a pixel is maximum in the image of the character.

【図８】前記帳票処理装置の電気的な接続を示すブロッ
ク図である。FIG. 8 is a block diagram showing an electrical connection of the form processing device.

【図９】この発明の実施の形態２である帳票処理装置で
読み取る帳票の例を示す平面図である。FIG. 9 is a plan view showing an example of a form read by a form processing apparatus according to Embodiment 2 of the present invention;

【図１０】前記帳票の画像から抽出されたあらかじめ帳
票に記入されている文書フォームの画像を示す平面図で
ある。FIG. 10 is a plan view showing an image of a document form which is extracted from the image of the form and which is previously written in the form.

【図１１】前記帳票の画像から抽出された後から帳票に
記入した文字の画像を示す平面図である。FIG. 11 is a plan view showing an image of a character entered in a form after being extracted from the form image.

【図１２】前記文書フォームの画像中での横方向の画素
の度数分布を示す平面図である。FIG. 12 is a plan view showing a frequency distribution of pixels in the horizontal direction in the image of the document form.

【図１３】前記文字の画像中での横方向の画素の度数分
布を示す平面図である。FIG. 13 is a plan view showing a frequency distribution of horizontal pixels in the character image.

[Explanation of symbols]

１帳票処理装置３画像分類手段４画像識別手段６文字抽出手段７文字認識手段 DESCRIPTION OF SYMBOLS 1 Form processing device 3 Image classification means 4 Image identification means 6 Character extraction means 7 Character recognition means

Claims

[Claims]

1. A color image forming a document form previously described in a form and a color image forming a content which is later entered in the form, and the two images are classified into the form. An image classification unit that extracts from the read image, and one of the two images is later filled in based on a magnitude comparison between respective components of the two images extracted by the image classification unit. A form processing device comprising: an image identification unit that identifies the image as the content.

2. The form processing apparatus according to claim 1, wherein the image identification unit compares the size of a connected portion of pixels included in each of the two images as a magnitude comparison between the constituent elements. .

3. The form processing apparatus according to claim 1, wherein the image identification unit compares the lengths of straight lines included in the two images as a magnitude comparison between the constituent elements.

4. A character extracting means for extracting an image constituting a character from an image which has been subsequently identified by said image identifying means as said contents, and character recognition of an image constituting said extracted character. The form processing apparatus according to any one of claims 1 to 3, further comprising: a character recognition unit that performs the following.

5. A color image constituting a document form previously described on a form and a color image constituting contents which are later entered on the form are classified, and the two images are classified into the form. An image classification step of extracting from the read image, and one of the two images is filled in later based on a magnitude comparison between respective components of the two images extracted by the image classification means. An image identification step of identifying the image as the image of the content.

6. The form reading method according to claim 5, wherein the image identification step compares the size of a connected portion of pixels included in each of the two images as a magnitude comparison between the constituent elements. .

7. The form reading method according to claim 5, wherein the image identification step compares the lengths of straight lines included in the two images as a magnitude comparison between the constituent elements.

8. A character extracting step of extracting an image constituting a character from an image identified as having the content entered later in the image identifying step, and character recognition of the image constituting the extracted character The form reading method according to any one of claims 1 to 3, further comprising:

9. A color image forming a document form previously described in a form and a color image forming a content which is later entered in the form, and the two images are classified into the form. An image classification step of extracting from the read image, and one of the two images is filled in later based on a magnitude comparison between respective components of the two images extracted by the image classification means. A computer-readable storage medium storing a program for causing a computer to execute an image identification step of identifying the image as the content.

10. The image identification step includes comparing the sizes of connected portions of pixels included in the two images as a magnitude comparison between the constituent elements.
A storage medium according to claim 1.

11. The storage medium according to claim 9, wherein the image identification step compares the lengths of straight lines included in the two images as a magnitude comparison between the constituent elements.

12. The program further comprises: a character extracting step of extracting an image constituting a character from an image identified in the image identifying step as the content of the content entered later; The storage medium according to any one of claims 9 to 11, which causes a computer to execute a character recognition step of performing character recognition of an image to be formed.