JP2018124854A

JP2018124854A - Image processing apparatus and image processing program

Info

Publication number: JP2018124854A
Application number: JP2017017421A
Authority: JP
Inventors: Koshiro Inomata (猪股 浩司郎)
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2017-02-02
Filing date: 2017-02-02
Publication date: 2018-08-09
Anticipated expiration: 2037-02-02
Also published as: JP6907565B2

Abstract

PROBLEM TO BE SOLVED: To recognize character information corresponding to a selected position without requiring the advance work of setting characters and their positions in association with each other. SOLUTION: Image data of an unfilled document 51 and image data of a completed document 52, which contains additional recordings made on the unfilled document 51, are acquired. Characters on the unfilled document 51 are recognized. Additional recorded images (circle marks 521, 522, and 523), which are images additionally recorded on the unfilled document 51, are extracted from the completed document 52. From the characters recognized on the unfilled document 51, the characters recorded in the areas corresponding to the additional recorded images (circle marks 521, 522, and 523) are identified. SELECTED DRAWING: Figure 4

Description

The present invention relates to an image processing apparatus and an image processing program.

Printed forms (business forms, questionnaires, and the like) are often filled in by hand and submitted, for example as documents for government offices or as responses to various questionnaires. The party collecting the forms wants to read the completed questionnaires and similar forms automatically and tally the results.

In response to this demand, Patent Document 1 discloses a technique for forms that are answered by filling in marks, such as mark sheets: the mark sheet on which the answers have been entered is read with a scanner or the like and the answers are tallied.

Japanese Unexamined Patent Publication No. 2013-45309

However, with the technique disclosed in Patent Document 1, the position of a mark entered on the mark sheet can be detected, but what the mark at that position means must be set in advance as separate information.

An object of the present invention is to provide an image processing apparatus and an image processing program that recognize character information corresponding to a specified position without requiring the work of setting characters and their positions in association with each other in advance.

Claim 1 is an image processing apparatus comprising:
an image acquisition unit that acquires first image data representing a first image and second image data representing a second image in which an additional recording has been made on the first image data;
a character string recognition unit that recognizes character strings from the first image, where a character string may consist of a single character;
an additional recorded image extraction unit that extracts, from the second image, an additional recorded image, which is an image additionally recorded on the first image; and
a character string specifying unit that specifies, from among the character strings recognized by the character string recognition unit, the character string on the first image that corresponds to the additional recorded image.

Claim 2 is the image processing apparatus according to claim 1, wherein:
the character string recognition unit, in addition to recognizing character strings from the first image, associates with each recognized character string a first region, which is the region on the first image in which that character string was recorded and which may be expressed by the coordinates of one or more points;
the additional recorded image extraction unit extracts, for each individual additional recorded image (each individual recording making up the additional recorded image), a second region, which is the region on the second image in which that individual additional recorded image was recorded and which may be expressed by the coordinates of one or more points; and
the character string specifying unit specifies the character string associated with a first region that is in a predetermined first positional relationship with the second region.

Claim 3 is the image processing apparatus according to claim 2, wherein the character string recognition unit recognizes, as one character string, a plurality of recognized characters whose regions (each of which may be expressed by the coordinates of one or more points) are in a predetermined second positional relationship with one another, and associates with that character string, as the first region, the region on the first image in which the character string was recorded (which may likewise be expressed by the coordinates of one or more points).

Claim 4 is the image processing apparatus according to claim 2 or 3, wherein, when no first region in the first positional relationship with a second region exists, the character string specifying unit does not specify a character string for that second region.

Claim 5 is the image processing apparatus according to any one of claims 2 to 4, wherein, when the first image includes ruled lines, the character string recognition unit recognizes a character string for each region enclosed by the ruled lines.

Claim 6 is the image processing apparatus according to any one of claims 2 to 5, wherein, when the same character string associated with the same first region is specified for a plurality of second regions, the character string specifying unit keeps one of those specifications and ignores the same character string in the remaining specifications.

Claim 7 is an image processing program that is executed in an information processing apparatus that executes programs, and that causes the information processing apparatus to operate as an image processing apparatus comprising:
an image acquisition unit that acquires first image data representing a first image and second image data representing a second image in which an additional recording has been made on the first image data;
a character string recognition unit that recognizes character strings from the first image, where a character string may consist of a single character;
an additional recorded image extraction unit that extracts, from the second image, an additional recorded image, which is an image additionally recorded on the first image; and
a character string specifying unit that specifies, from among the character strings recognized by the character string recognition unit, the character string on the first image that corresponds to the additional recorded image.

According to the image processing apparatus of claim 1 and the image processing program of claim 7, character information corresponding to a specified position can be recognized without requiring the work of setting characters and their positions in association with each other in advance.

According to the image processing apparatus of claim 2, a character string can be specified more accurately than when it is specified without the concept of a predetermined first positional relationship.

According to the image processing apparatus of claim 3, even when the character string corresponding to one second region consists of a plurality of characters, that multi-character string can be specified.

According to the image processing apparatus of claim 4, erroneous recognition is suppressed compared with the case where a character string is specified for every second region.

According to the image processing apparatus of claim 5, character strings can be recognized more accurately than when ruled lines are recorded but coordinates are recognized without using them.

According to the image processing apparatus of claim 6, even when a figure or the like that should be recognized as a single second region is broken into several pieces by faint strokes or the like and is recognized as a plurality of second regions, the character string can be specified correctly.

A schematic diagram of a character recognition system.
A functional block diagram of an image processing apparatus realized by executing an image processing program in a notebook PC.
A flowchart of an image processing program as one embodiment of the present invention.
A first example of an unfilled document and a completed document.
A second example of an unfilled document and a completed document.
A third example of an unfilled document and a completed document.
A fourth example of an unfilled document and a completed document.
A flowchart of the character string and region recognition processing on an unfilled document.
A method of calculating the first region associated with a recognized character.
A part of a document on which ruled lines are drawn.
An example of combining first regions by using ruled lines.
A detailed flow of the character recognition processing shown as one step (step S09) in FIG. 3.
An example of a difference image.
A detailed flow of the character string specifying processing.
An example in which a second region and a first region overlap.
An example in which a first region exists on the right side of a second region.

Embodiments of the present invention will be described below.

FIG. 1 is a schematic diagram of a character recognition system.

The character recognition system 10 shown here includes a scanner 20 and a notebook personal computer (hereinafter abbreviated as "notebook PC") 30. The scanner 20 and the notebook PC 30 are connected by a communication cable 40.

The scanner 20 is an apparatus that reads an image recorded on a document and generates image data. When a document is placed on the document tray 21 of the scanner 20 and a start button (not shown) is pressed, or an instruction is given from the notebook PC, one document is fed into the scanner 20. The scanner 20 contains a sensor (not shown) that photoelectrically reads the image on a document; the image recorded on the fed document is read photoelectrically and image data is generated. After its image has been read, the document is discharged onto the paper discharge tray 22. Multiple documents can be stacked on the document tray 21, and the scanner 20 feeds them in one at a time, reads the image on each fed document, and discharges it onto the paper discharge tray 22.

The upper lid 23 of the scanner 20 can also be lifted, pivoting about a hinge (not shown) that is provided on the rear side and extends left and right. A single document can be placed under the lifted upper lid 23, the upper lid 23 closed, and the placed document read.

Image data obtained by reading with the scanner 20 is input to the notebook PC 30 via the communication cable 40.

The notebook PC 30 has a display screen 31 and a keyboard 32, and internally has the equipment needed to execute programs, such as a CPU and memory. The notebook PC 30 executes programs and performs processing according to the executed program. In the present embodiment, the notebook PC executes the image processing program described later. The image processing program executed in the notebook PC 30 corresponds to an example of the image processing program of the present invention. By executing this image processing program, the notebook PC 30 operates as an image processing apparatus according to one embodiment of the present invention.

FIG. 2 is a functional block diagram of the image processing apparatus realized by executing the image processing program in the notebook PC.

The image processing apparatus 60 of the present embodiment includes an image acquisition unit 61, a character string recognition unit 62, an additional recorded image extraction unit 63, and a character string specifying unit 64. Concrete examples are deferred until later; here, the units 61 to 64 are described in general terms. Because what is handled here is images in the form of data, such an image may be referred to simply as an "image" or a "document" without expressly noting that it is an image in data form, except where a distinction is needed.

The image acquisition unit 61 acquires an image of an unfilled document, on which the text of questionnaire questions is recorded and no answers have yet been entered, and images of completed documents, in which answers have been additionally recorded on the unfilled document. There is one unfilled document, but there are usually multiple completed documents, and the image acquisition unit 61 acquires all of these images. The unfilled document and the completed documents correspond to examples of the first image and the second image of the present invention, respectively.

The character string recognition unit 62 recognizes character strings from the unfilled document. The term "character string" here covers not only a string of multiple characters but also a string consisting of a single character.
In addition to recognizing character strings, the character string recognition unit 62 of the present embodiment associates with each recognized character string a first region, which is the region on the unfilled document in which that character string was recorded and which may be expressed by the coordinates of one or more points. The "first region" associated with a character string may be represented by, for example, the coordinates of a single point or the coordinates of the four corners of the region.
The character string recognition unit 62 of the present embodiment further recognizes, as one character string, a plurality of recognized characters whose regions (each of which may be expressed by the coordinates of one or more points) are in a predetermined second positional relationship with one another (the first positional relationship is described later). In that case, the region on the unfilled document in which that character string was recorded is associated with the character string as its first region. As an example, the "second positional relationship" adopted here is that the characters are arranged side by side, left and right, within a predetermined second threshold distance of each other.
Furthermore, when ruled lines are drawn on the unfilled document, the character string recognition unit 62 of the present embodiment recognizes a character string for each region enclosed by the ruled lines. This is because, when ruled lines are drawn, using them for character string recognition is expected to improve the recognition rate.
The additional recorded image extraction unit 63 extracts, from a completed document, an additional recorded image, which is an image additionally recorded on the unfilled document. Specifically, for example, it calculates the difference image between the completed document and the unfilled document and thereby extracts the additional recorded image, that is, the image of the answers that were additionally recorded.
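The grouping of individual characters into character strings by the "second positional relationship" described above for the character string recognition unit 62 can be sketched roughly as follows. This is only an illustrative sketch, not the embodiment's actual implementation: the box representation `(left, top, right, bottom)`, the function names `union` and `group_characters`, the parameter `max_gap` (standing in for the second threshold distance), and the "vertical overlap means same row" test are all assumptions introduced for the example.

```python
from typing import List, Tuple

# Hypothetical box representation: (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]


def union(a: Box, b: Box) -> Box:
    """Smallest box that contains both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def group_characters(chars: List[Tuple[str, Box]], max_gap: int) -> List[Tuple[str, Box]]:
    """Merge characters that sit side by side within max_gap into one string.

    max_gap plays the role of the "second threshold distance"; treating two
    boxes as being on the same row when they overlap vertically is an
    assumption made for this sketch.
    """
    strings: List[list] = []  # each entry is [text, first_region]
    # Visit characters from left to right so text is appended in reading order.
    for text, box in sorted(chars, key=lambda item: item[1][0]):
        for entry in strings:
            s_box = entry[1]
            same_row = box[1] < s_box[3] and s_box[1] < box[3]
            gap = box[0] - s_box[2]  # horizontal distance to the string's right edge
            if same_row and gap <= max_gap:
                entry[0] += text
                entry[1] = union(s_box, box)  # the first region grows with the string
                break
        else:
            strings.append([text, box])  # no neighbour found: start a new string
    return [(text, box) for text, box in strings]
```

With `max_gap=5`, two characters 2 pixels apart on the same row are merged into one string whose first region is the union of their boxes, while a character 48 pixels away, or one on a different row, stays separate.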

Here, for each individual additional recorded image, that is, each individual recording making up the additional recorded image, the additional recorded image extraction unit 63 of the present embodiment extracts the region on the completed document in which that individual additional recorded image was recorded; this region may be expressed by the coordinates of one or more points. The region extracted on the completed document is called the "second region" to distinguish it from the first region described above, and it corresponds to an example of the second region of the present invention. The second region is preferably extracted in the same coordinate system as the first region, after the unfilled document and the completed document have been aligned. The second region may be expressed, for example, as the coordinates of a single point or as a set of the coordinates of four points.
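As a rough illustration of the difference image and the per-mark second regions, the sketch below works on already aligned binary images stored as nested lists (1 = ink, 0 = paper). The text only states that a difference image is calculated; splitting it into individual marks with 8-connected component labeling, and the function names `difference_image` and `second_regions`, are assumptions made for this example.

```python
from collections import deque
from typing import List, Tuple

Bitmap = List[List[int]]          # hypothetical binary image: 1 = ink, 0 = paper
Box = Tuple[int, int, int, int]   # (left, top, right, bottom), inclusive pixel indices


def difference_image(unfilled: Bitmap, completed: Bitmap) -> Bitmap:
    """Keep only ink that is on the completed page but not on the unfilled page."""
    return [[1 if c and not u else 0 for u, c in zip(u_row, c_row)]
            for u_row, c_row in zip(unfilled, completed)]


def second_regions(diff: Bitmap) -> List[Box]:
    """Bounding box of every 8-connected blob of added ink (assumed segmentation)."""
    height = len(diff)
    width = len(diff[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    regions: List[Box] = []
    for y in range(height):
        for x in range(width):
            if not diff[y][x] or seen[y][x]:
                continue
            # Breadth-first flood fill over one blob, tracking its extent.
            seen[y][x] = True
            queue = deque([(x, y)])
            left = right = x
            top = bottom = y
            while queue:
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and diff[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            regions.append((left, top, right, bottom))
    return regions
```

A real scan would need alignment and some tolerance for noise before the subtraction; both are outside the scope of this sketch.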

The character string specifying unit 64 specifies, from among the character strings recognized by the character string recognition unit 62, the character string recorded in the region on the unfilled document that corresponds to the additional recorded image.
Here, the character string specifying unit 64 of the present embodiment identifies a first region that is in a predetermined first positional relationship with the second region, and specifies the character string associated with that first region. As an example, a first region is identified as being "in the predetermined first positional relationship with the second region" when it overlaps the second region or, if no first region overlaps the second region, when it lies to the right of the second region within a predetermined first threshold distance.
When the same character string associated with the same first region is specified for a plurality of second regions, the character string specifying unit 64 of the present embodiment keeps one of those specifications and ignores the same character string in the remaining specifications. For example, when the same character string recorded in the same first region is specified multiple times for a plurality of second regions, the specifications from the second one onward are ignored and only the first is kept.
Furthermore, when no first region in the first positional relationship with a second region exists, the character string specifying unit 64 of the present embodiment does not specify a character string for that second region, because forcing a match would increase erroneous recognition.
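The three rules described above for the character string specifying unit 64 (overlap first, otherwise a right-hand neighbour within the first threshold distance; ignore repeated hits on the same first region; skip second regions with no match) can be sketched as follows. The function names, the `(left, top, right, bottom)` box format, the `max_distance` parameter, the requirement that a right-hand neighbour overlap the mark vertically, and the tie-breaking (first overlapping region, nearest right-hand region) are assumptions for illustration only.

```python
from typing import List, Optional, Tuple

Box = Tuple[int, int, int, int]  # hypothetical format: (left, top, right, bottom)


def overlaps(a: Box, b: Box) -> bool:
    """True when the two boxes share some area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def right_gap(second: Box, first: Box) -> Optional[int]:
    """Horizontal gap if `first` lies to the right of `second` on the same row."""
    same_row = first[1] < second[3] and second[1] < first[3]
    gap = first[0] - second[2]
    return gap if same_row and gap >= 0 else None


def specify_strings(first_regions: List[Tuple[str, Box]],
                    second_regions: List[Box],
                    max_distance: int) -> List[str]:
    """Return the strings selected by the marks, one entry per first region."""
    selected: List[str] = []
    used = set()  # indices of first regions that have already been specified
    for second in second_regions:
        # Rule 1: prefer a first region that overlaps the mark.
        candidates = [i for i, (_, first) in enumerate(first_regions)
                      if overlaps(second, first)]
        if not candidates:
            # Rule 2: otherwise take the nearest first region to the right
            # that is within the first threshold distance.
            near = []
            for i, (_, first) in enumerate(first_regions):
                gap = right_gap(second, first)
                if gap is not None and gap <= max_distance:
                    near.append((gap, i))
            candidates = [i for _, i in sorted(near)]
        if not candidates:
            continue  # no first region in the relationship: specify nothing
        index = candidates[0]
        if index in used:
            continue  # same first region hit again (e.g. a broken circle): ignore
        used.add(index)
        selected.append(first_regions[index][0])
    return selected
```

In this sketch a circle drawn around "2" and a detached fragment of the same circle yield "2" only once, a check mark just to the left of a string selects that string, and a stray mark far from any text selects nothing.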

FIG. 3 is a flowchart of an image processing program as one embodiment of the present invention.

The scanner 20 shown in FIG. 1 reads the image on a document and generates image data, and the generated image data is input to the notebook PC 30 via the communication cable 40. The image processing program shown in FIG. 3 then starts and acquires the image data input to the notebook PC 30 via the communication cable 40 (step S01). As noted above, unless there is a particular need, the word "data" is omitted here and an image in data form may be referred to simply as an "image" or a "document".

When an image has been acquired in step S01, it is determined whether the image acquired this time is the first image or a second or subsequent image (step S02).

The present embodiment adopts the rule that the scanner 20 reads the unfilled document as the first sheet and then reads the completed documents in sequence as the second and subsequent sheets. Accordingly, when the acquired image is the first image, the image processing program temporarily stores it as the unfilled document (step S03). Image acquisition is repeated for the second and subsequent sheets (step S05), and every image acquired from the second sheet onward is temporarily stored as a completed document (step S04).
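The page-ordering rule of steps S01 to S05 amounts to a small classification loop. The sketch below is illustrative only; the function name `classify_scans` and the choice to raise an error when nothing was scanned are assumptions, and the images can be any objects.

```python
from typing import Iterable, List, Tuple, TypeVar

Image = TypeVar("Image")


def classify_scans(scans: Iterable[Image]) -> Tuple[Image, List[Image]]:
    """Split scanned pages into (unfilled document, completed documents)."""
    unfilled = None
    have_first = False
    completed: List[Image] = []
    for index, image in enumerate(scans):  # S01: acquire, repeated via S05
        if index == 0:                     # S02: is this the first sheet?
            unfilled = image               # S03: store as the unfilled document
            have_first = True
        else:
            completed.append(image)        # S04: store as a completed document
    if not have_first:
        raise ValueError("no pages were scanned")
    return unfilled, completed
```

For example, `classify_scans(["blank", "a", "b"])` returns `("blank", ["a", "b"])`.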

FIG. 4 shows a first example of an unfilled document and a completed document.
FIG. 4(A) shows a questionnaire sheet before it is filled in, that is, an unfilled document 51A. The questionnaire contains three questions, (1) to (3), and each is answered by circling one of the numbers 1 to 5 to select that number.

FIG. 4(B) shows a completed document 52A, in which a respondent has entered answers on a questionnaire sheet of the same format as the one shown in FIG. 4(A). There is not necessarily only one completed document: of the multiple documents read in sequence by the scanner 20, each of the second and subsequent documents is treated as a completed document.

In the single completed document 52A shown in FIG. 4(B), the numeral "3" is enclosed by a circle mark 521 for question (1), the numeral "1" is enclosed by a circle mark 522 for question (2), and the numeral "5" is enclosed by a circle mark 523 for question (3).

FIG. 5 shows a second example of an unfilled document and a completed document.
As in FIG. 4, FIG. 5(A) shows a questionnaire sheet before it is filled in, that is, an unfilled document 51B, and FIG. 5(B) shows a completed document 52B, in which a respondent has entered answers on a questionnaire sheet of the same format as the one shown in FIG. 5(A). There is not necessarily only one completed document: of the multiple documents read in sequence by the scanner 20, each of the second and subsequent documents is treated as a completed document.

Here the questionnaire contains four questions, (1) to (4). Questions (1) to (3) are answered by writing a mark such as a circle over one of "very good", "good", "average", "bad", and "very bad". Question (4) is answered by writing a mark such as a circle over one of "would definitely like to introduce it", "would rather like to introduce it", and "would not much like to introduce it". The content over which the mark is written is the selected answer.

FIG. 6 shows a third example of an unfilled document and a completed document.
As in FIGS. 4 and 5, FIG. 6(A) shows a questionnaire sheet before it is filled in, that is, an unfilled document 51C, and FIG. 6(B) shows a completed document 52C, in which a respondent has entered answers on a questionnaire sheet of the same format as the one shown in FIG. 6(A). There is not necessarily only one completed document: of the multiple documents read in sequence by the scanner 20, each of the second and subsequent documents is treated as a completed document.

Here the questionnaire contains two questions, (1) and (2), which are answered by entering a check mark in a □ box, as shown in FIG. 6(B). Entering a check mark in a □ box selects, as the answer, the content represented by the character string recorded immediately to the right of that box.

FIG. 7 shows a fourth example of an unfilled document and a completed document.
As in FIGS. 4 to 6, FIG. 7(A) shows a questionnaire sheet before it is filled in, that is, an unfilled document 51D, and FIG. 7(B) shows a completed document 52D, in which a respondent has entered answers on a questionnaire sheet of the same format as the one shown in FIG. 7(A). There is not necessarily only one completed document: of the multiple documents read in sequence by the scanner 20, each of the second and subsequent documents is treated as a completed document.

Here the questionnaire contains the same two questions, (1) and (2), as in FIG. 6. In this example, however, the character strings serving as answer candidates are recorded inside frames enclosed by ruled lines. As shown in FIG. 7(B), a question is answered by entering a check mark in the ruled frame immediately to the left of the character string to be chosen. Entering a check mark in a frame selects, as the answer, the content represented by the character string recorded in the frame immediately to the right of the check-mark frame.

Returning to FIG. 3, the description continues.

When the series of image acquisitions is finished (step S05), recognition processing is performed on the character strings recorded on the unfilled document and on the regions in which those character strings are recorded (step S06). In the first example shown in FIG. 4, recognizing digits alone would suffice, but because this embodiment handles a wide variety of questionnaire sheets, as shown in FIGS. 4 to 7, the types of characters to be recognized are not restricted.

FIG. 8 is a flowchart of the character string and region recognition processing performed on the unfilled document. The processing shown in FIG. 8 is executed in step S06 of FIG. 3.

First, for each individual character recorded on the unfilled document, the character and the region in which it is recorded (a first region) are recognized (step S61). This recognition of characters and first regions is performed over the entire surface of the unfilled document.

FIG. 9 is a diagram illustrating a method of calculating the first region associated with a recognized character.

Here, as shown in FIG. 9, suppose that the digit "3" has been recognized. A rectangle R circumscribing the digit "3" is calculated, and that rectangle R is associated with the recognized digit "3" as its first region. The first region need not be a region with two-dimensional extent, however; it may instead be, for example, the set of coordinates C1 to C4 of the four corners of the rectangle R, or the coordinates C0 of the single point at the center of the rectangle R.
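
The three representations of a first region described for FIG. 9 can be illustrated with a minimal Python sketch. The bitmap format (rows of 0/1 pixels), the `(left, top, right, bottom)` tuple layout, the corner ordering, and the function names are all assumptions made for illustration; the specification does not prescribe how the rectangle R is computed.

```python
from typing import List, Tuple

# Hypothetical layout: (left, top, right, bottom) in inclusive pixel coordinates.
Rect = Tuple[int, int, int, int]


def circumscribed_rect(bitmap: List[List[int]]) -> Rect:
    """Return the tightest rectangle R enclosing every ink (nonzero) pixel."""
    xs = [x for row in bitmap for x, v in enumerate(row) if v]
    ys = [y for y, row in enumerate(bitmap) if any(row)]
    if not xs:
        raise ValueError("bitmap contains no ink")
    return min(xs), min(ys), max(xs), max(ys)


def corners(r: Rect) -> List[Tuple[int, int]]:
    """Four corner points (the role of C1 to C4), in an assumed clockwise order."""
    left, top, right, bottom = r
    return [(left, top), (right, top), (right, bottom), (left, bottom)]


def center(r: Rect) -> Tuple[float, float]:
    """Center point of the rectangle (the role of C0)."""
    left, top, right, bottom = r
    return ((left + right) / 2, (top + bottom) / 2)


# A tiny made-up glyph, not an actual scan of the digit "3".
glyph = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 1, 1, 0, 0],
]
R = circumscribed_rect(glyph)
print(R, corners(R), center(R))
```

Any one of the three return values could serve as the first region; which one is convenient depends on the distance test used later.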

Returning to FIG. 8, the description continues.

Once the individual characters on the unfilled document and their corresponding first regions have been recognized (step S61), recognition of ruled lines on the unfilled document is attempted (step S62). Both questionnaire sheets without ruled lines, such as those shown in FIGS. 4 to 6, and questionnaire sheets with ruled lines, such as the one shown in FIG. 7, are processing targets, so a given sheet may or may not contain ruled lines. Accordingly, this step recognizes whether ruled lines are present on the unfilled document and, if they are, their positions, lengths, and so on.

When the recognition of the individual characters and first regions (step S61) and the recognition of ruled lines (step S62) are finished, the unfilled document is scanned sequentially from the upper left toward the lower right (step S63). Each time a recognized character is found, that one character and its corresponding first region are extracted (step S64). The following processing is then repeated until no characters remain to be extracted (step S65).

First, it is determined whether the character extracted this time is the leading (leftmost) character of a line (step S66). The subsequent processing cannot be carried out with only the leading character of a line, so if the character extracted this time is the leading character of a line, the process returns to step S63 and the next character and its corresponding first region are extracted (step S64).

Next, it is determined whether there is a ruled line enclosing the previously extracted character (step S67).

The case in which no such ruled line (a ruled line enclosing the previously extracted character) exists is described first. In that case, it is determined whether a joining condition is satisfied for joining the two first regions that correspond to the previously extracted character and the character extracted this time (step S68). The joining condition adopted here is that the first region corresponding to the character extracted this time lies immediately to the right of the first region corresponding to the previously extracted character and within a predetermined threshold distance of it.

FIG. 10 is an explanatory diagram of the joining condition for joining first regions.

The characters of "以下の質問にお答えください" ("Please answer the following questions") are lined up here. Suppose that the previously extracted character is "以" and the character extracted this time is "下". The first region surrounding the character "下" extracted this time (referred to here as the "current first region") lies immediately to the right of the first region surrounding the previously extracted character "以" (referred to here as the "previous first region") and within the predetermined threshold distance. Because the joining condition is satisfied, the two first regions are joined into a single first region corresponding to the character string "以下", which consists of the two characters "以" and "下" (FIG. 8, step S69).

The method of determining whether the current first region lies within the predetermined threshold distance of the previous first region is not limited to any particular one; for example, the following determination methods may be adopted.

For example, as shown in FIG. 10(A), the distance between the right side of the previous first region (the region enclosing the character "以") and the left side of the current first region (the region enclosing the character "下") is calculated, and it is determined whether that distance is within the threshold distance.

Alternatively, as also shown in FIG. 10(A), the distances between each of the four corner coordinates of the previous first region (the region enclosing the character "以") and each of the four corner coordinates of the current first region (the region enclosing the character "下") may be calculated, and the determination may be based on whether any of those distances is within the threshold distance.

Alternatively, as shown in FIG. 10(B), the distance between the center coordinates of the previous first region and the center coordinates of the current first region may be calculated, and the determination may be based on whether that distance is within the threshold distance.

Note that a threshold distance suited to each of these determination methods is adopted. A plurality of these determination methods may also be used in combination.
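
The three distance measures described for FIG. 10 can be compared side by side in a short Python sketch. The rectangle layout `(left, top, right, bottom)`, the function names, and the choice to require every combined test to pass are assumptions for illustration; the specification only states that the methods may be combined, not how. The sketch covers only the distance part of the joining condition, not the "to the right" part.

```python
import math

# Hypothetical rectangle layout: (left, top, right, bottom).


def _corners(r):
    left, top, right, bottom = r
    return [(left, top), (right, top), (right, bottom), (left, bottom)]


def _center(r):
    left, top, right, bottom = r
    return ((left + right) / 2, (top + bottom) / 2)


def side_gap(prev, cur):
    """Distance from the right side of prev to the left side of cur."""
    return cur[0] - prev[2]


def min_corner_distance(prev, cur):
    """Smallest of the 16 corner-to-corner distances."""
    return min(math.dist(p, q) for p in _corners(prev) for q in _corners(cur))


def center_distance(prev, cur):
    """Distance between the two center points."""
    return math.dist(_center(prev), _center(cur))


def within_threshold(prev, cur, checks):
    """checks: list of (measure, threshold) pairs; here all of them must pass."""
    return all(measure(prev, cur) <= limit for measure, limit in checks)


prev_region = (0, 0, 10, 10)   # made-up region around the previous character
cur_region = (12, 0, 22, 10)   # made-up region around the current character

print(side_gap(prev_region, cur_region))             # 2
print(min_corner_distance(prev_region, cur_region))  # 2.0
print(center_distance(prev_region, cur_region))      # 12.0
```

For the same pair of regions the center-based measure is much larger than the side-based one, which is why each method needs its own threshold distance.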

When the joining condition is determined to be satisfied by such a determination method, the first regions are joined (step S69). The above processing is repeated until all characters on the unfilled document have been extracted (step S65). For the character string shown in FIG. 10, this repetition produces a single first region corresponding to the entire character string "以下の質問にお答えください", as shown in FIG. 10(C).
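
The repeated joining of steps S63 to S69 might look roughly like the following Python sketch. It assumes the characters arrive already sorted in reading order, uses the side-to-side gap as the distance measure, and adds a same-line test based on overlapping vertical extents; the same-line test, the data layout, and all names are illustrative assumptions rather than details given in the specification.

```python
def union(a, b):
    """Smallest rectangle covering both a and b; layout (left, top, right, bottom)."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def join_characters(chars, threshold):
    """Join neighboring characters into character strings.

    chars: list of (text, rect) already in reading order (upper left to lower right).
    Returns a list of (string, rect), one entry per character string.
    """
    strings = []
    for text, rect in chars:
        if strings:
            prev_text, prev_rect = strings[-1]
            gap = rect[0] - prev_rect[2]
            # Assumed same-line test: the vertical extents overlap.
            same_line = rect[1] <= prev_rect[3] and prev_rect[1] <= rect[3]
            if same_line and 0 <= gap <= threshold:
                strings[-1] = (prev_text + text, union(prev_rect, rect))
                continue
        strings.append((text, rect))
    return strings


chars = [
    ("以", (0, 0, 10, 10)),
    ("下", (12, 0, 22, 10)),
    ("の", (24, 0, 34, 10)),
    ("答", (80, 0, 90, 10)),   # far from the others: forms a string by itself
]
print(join_characters(chars, threshold=4))
# [('以下の', (0, 0, 34, 10)), ('答', (80, 0, 90, 10))]
```

The last entry shows how a character that is far from its neighbors ends up as a character string of its own.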

Next, the case in which a ruled line enclosing the extracted character exists is described.

In this case, when it is determined in step S67 of FIG. 8 that a ruled line enclosing the previously extracted character exists, the process proceeds to step S71, where it is determined whether the character extracted this time lies in the same ruled-line region (frame bounded by ruled lines) as the previously extracted character. If the two characters are determined to lie in the same ruled-line region (the same frame bounded by ruled lines), the two first regions corresponding to those characters are joined (step S72).

FIG. 11 is a diagram showing an example of joining first regions using ruled lines.

Here, the character strings "Ver7.0", "Ver7.1", "Ver8.0", "Ver8.02", and "Ver8.05" are each recorded in its own ruled-line region (frame bounded by ruled lines). Taking "Ver7.0" as an example, the first regions of the individual characters "V", "e", "r", "7", ".", and "0" are joined to generate a single first region corresponding to the character string "Ver7.0". The same applies to the other character strings.
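
Joining by ruled-line region can be sketched in Python as follows. The sketch assumes that the recognized ruled lines have already been turned into rectangular cells, which the specification does not describe in detail; the cell and character coordinates, and the function names, are made up for illustration.

```python
def union(a, b):
    """Smallest rectangle covering both a and b; layout (left, top, right, bottom)."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def contains(cell, rect):
    """True if rect lies entirely inside cell."""
    return (cell[0] <= rect[0] and cell[1] <= rect[1]
            and rect[2] <= cell[2] and rect[3] <= cell[3])


def join_by_cell(chars, cells):
    """Join all characters that fall inside the same ruled-line cell.

    Returns {cell index: (string, rect)}. Characters outside every cell are
    skipped here; distance-based joining would handle them instead.
    """
    joined = {}
    for text, rect in chars:
        for i, cell in enumerate(cells):
            if contains(cell, rect):
                if i in joined:
                    s, r = joined[i]
                    joined[i] = (s + text, union(r, rect))
                else:
                    joined[i] = (text, rect)
                break
    return joined


cells = [(0, 0, 100, 20), (0, 20, 100, 40)]
chars = [
    ("V", (5, 5, 12, 15)), ("e", (14, 5, 20, 15)), ("r", (22, 5, 26, 15)),
    ("7", (60, 5, 67, 15)),                       # wide gap, but same cell
    ("V", (5, 25, 12, 35)), ("8", (60, 25, 67, 35)),
]
print(join_by_cell(chars, cells))
# {0: ('Ver7', (5, 5, 67, 15)), 1: ('V8', (5, 25, 67, 35))}
```

Unlike the distance-based condition, membership in the same cell joins characters regardless of how far apart they are within the frame.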

In this way, executing the processing shown in FIG. 8 generates a first region for each character string. If the character extracted this time is far from the previously extracted character, and the next character extracted is also far from it, a character string consisting of a single character results.

When the processing shown in FIG. 8, that is, step S06 of FIG. 3, has generated a first region for each character string (including character strings of only one character), the process proceeds to step S07 of FIG. 3. Here, one of the filled documents temporarily stored in step S04 is retrieved (step S07). Filled documents that have already undergone the character recognition processing of step S09 are excluded from retrieval. If an unprocessed filled document exists, that is, if one could be retrieved (step S08), the character recognition processing is executed on that filled document (step S09). The details of the character recognition processing are described later.

When no unprocessed filled document could be retrieved, that is, when the character recognition processing (step S09) has finished for all filled documents (step S08), the current execution of the character recognition routine ends.

FIG. 12 is a diagram showing the detailed flow of the character recognition processing, which is shown as a single step (step S09) in FIG. 3.

First, a difference image is generated between the one filled document retrieved in step S07 of FIG. 3 and the unfilled document temporarily stored in step S03 (step S21).

FIG. 13 is a diagram showing an example of a difference image.

The difference image 53A shown in FIG. 13 is the difference image between the unfilled document 51A shown in FIG. 4(A) and the topmost one of the filled documents 52A shown in FIG. 4(B). Three ○ marks 521, 522, and 523 entered by the respondent are extracted in this difference image 53A. The image that appears on the difference image corresponds to an example of the additional recorded image referred to in the present invention. Each of the images making up the additional recorded image is referred to here as an individual additional recorded image. The ○ mark 521 is split into two parts 521a and 521b because of, for example, faint strokes at the time of entry, so each of the two parts 521a and 521b becomes an individual additional recorded image.

Returning to FIG. 12, the description continues.

Once a difference image such as the one illustrated in FIG. 13 has been generated (step S21), the difference image is scanned from the upper left toward the lower right (step S22), and each time an individual additional recorded image is found, that one image is extracted (step S23). The following processing is then repeated until no unprocessed individual additional recorded images remain on the difference image being handled (step S24).

First, region calculation processing is performed (step S25). This processing calculates the region on the filled document (a second region) of the one individual additional recorded image extracted in step S23. In this embodiment, the second region is calculated by the same method as the first region on the unfilled document, shown in FIG. 9. That is, a rectangle R circumscribing the individual additional recorded image extracted in step S23 is calculated, and that rectangle R is associated with the individual additional recorded image as its second region. Alternatively, as with the first region, the set of coordinates of the four corners of the rectangle R, or the center coordinates of the rectangle R, may be used as the second region.

When the second region corresponding to one individual additional recorded image has been calculated (step S25), character string specifying processing is performed (step S26).
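
Steps S21 to S25 can be illustrated with a small pure-Python sketch. It assumes that both pages are already aligned and binarized (1 = ink), and it treats each 8-connected group of difference pixels as one individual additional recorded image; the specification does not say how individual images are separated, so the connected-component approach and all names below are assumptions.

```python
from collections import deque


def difference(unfilled, filled):
    """Keep only ink that is on the filled page but not on the unfilled page."""
    return [[1 if f and not u else 0 for u, f in zip(urow, frow)]
            for urow, frow in zip(unfilled, filled)]


def second_regions(diff):
    """Scan from upper left to lower right and return one circumscribed
    rectangle (left, top, right, bottom) per connected group of pixels."""
    h, w = len(diff), len(diff[0])
    seen = [[False] * w for _ in range(h)]
    rects = []
    for y in range(h):
        for x in range(w):
            if not diff[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(x, y)])
            left, top, right, bottom = x, y, x, y
            while queue:
                cx, cy = queue.popleft()
                left, top = min(left, cx), min(top, cy)
                right, bottom = max(right, cx), max(bottom, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < w and 0 <= ny < h
                                and diff[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            rects.append((left, top, right, bottom))
    return rects


unfilled = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
filled = [
    [1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
diff = difference(unfilled, filled)
print(second_regions(diff))
# [(0, 0, 1, 0), (4, 0, 4, 1)]
```

A mark that is broken into disconnected strokes yields one rectangle per stroke under this approach, which corresponds to the situation of the parts 521a and 521b.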

FIG. 14 is a diagram showing the detailed flow of the character string specifying processing.

Here, it is determined whether the second region calculated this time in step S25 of FIG. 12 overlaps any of the first regions (step S261).

FIG. 15 is a diagram showing examples in which a second region and a first region overlap.

In each of FIGS. 15(A) to 15(C), the second region (the region enclosing the ○ mark) overlaps a first region (the region enclosing a character string). When overlap is determined by calculating the distance between center coordinates, it may be determined according to whether the distance between the center coordinates of the second region and the center coordinates of the character that, among the characters making up the character string, is closest to the center of the second region is within a threshold distance.
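
The center-based overlap test just described can be written as a few lines of Python. The rectangle layout `(left, top, right, bottom)`, the sample coordinates, and the function names are assumptions for illustration only.

```python
import math


def center(r):
    left, top, right, bottom = r
    return ((left + right) / 2, (top + bottom) / 2)


def overlaps_by_center(mark_rect, char_rects, threshold):
    """Compare the mark's center with the nearest character center of one string."""
    mark_center = center(mark_rect)
    nearest = min(math.dist(mark_center, center(c)) for c in char_rects)
    return nearest <= threshold


# Made-up rectangles for the six characters of one character string.
char_rects = [(x, 0, x + 8, 10) for x in (0, 10, 20, 30, 40, 50)]
mark = (18, -3, 32, 13)        # a circle drawn roughly around the third character
far_mark = (200, 0, 214, 10)   # a mark well away from the string

print(overlaps_by_center(mark, char_rects, threshold=5))      # True
print(overlaps_by_center(far_mark, char_rects, threshold=5))  # False
```

Using the nearest character rather than the center of the whole string keeps the test meaningful when a small mark sits on one end of a long string.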

Returning to FIG. 14, the description continues.

As illustrated in FIG. 15, when a first region overlapping the second region exists, the character string corresponding to that overlapping first region is specified as the character string corresponding to the second region, that is, to the current individual additional recorded image (step S262).

When no first region overlaps the second region (step S261), it is determined whether a first region exists to the right of the second region and within a threshold distance of it (step S263). Various determination methods, similar to those described above for deciding whether to join first regions, may be adopted for this determination. When the determination is based on the distance between center coordinates, however, the center coordinates of the leftmost character of the character string corresponding to the first region under consideration are used.

FIG. 16 is a diagram showing an example in which a first region exists to the right of a second region.

Here, a first region enclosing the character string "Ver7.0" exists to the right of, and within the threshold distance of, the second region enclosing the check mark entered in the □ box. The character string "Ver7.0" corresponding to that first region is therefore specified as the character string corresponding to the second region, that is, to the check mark that is the current individual additional recorded image (FIG. 14, step S264). When a plurality of first regions exist to the right of the second region and within the threshold distance, the character string corresponding to the first region closest to the second region is specified as the character string corresponding to the second region.

When no first region overlaps the second region and no first region exists within the threshold distance to the right of the second region either, no character string is specified for the current second region, that is, for the current individual additional recorded image (step S265).
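
The decision flow of FIG. 14 (steps S261 to S265) can be sketched in Python as below. The sketch uses rectangle intersection for the overlap test and the side-to-side gap for the "to the right" test, and it adds a same-line check based on overlapping vertical extents; that check, the tuple layout `(left, top, right, bottom)`, and the names are assumptions not stated in the specification.

```python
def overlaps(a, b):
    """True if rectangles a and b intersect."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def specify_string(mark, strings, threshold):
    """Return the (text, rect) selected by mark, or None if nothing qualifies.

    strings: list of (text, rect) pairs recognized on the unfilled document.
    """
    # Steps S261/S262: a first region overlapping the mark is selected.
    for item in strings:
        if overlaps(mark, item[1]):
            return item
    # Steps S263/S264: otherwise take the nearest first region to the right.
    candidates = []
    for item in strings:
        rect = item[1]
        gap = rect[0] - mark[2]
        # Assumed same-line test: the vertical extents overlap.
        same_line = mark[1] <= rect[3] and rect[1] <= mark[3]
        if same_line and 0 < gap <= threshold:
            candidates.append((gap, item))
    if candidates:
        return min(candidates, key=lambda c: c[0])[1]
    # Step S265: no character string is specified.
    return None


strings = [
    ("3", (0, 40, 8, 50)),
    ("Ver7.0", (30, 0, 80, 10)),
    ("Ver7.1", (30, 20, 80, 30)),
]
circle = (-2, 38, 10, 52)       # drawn around "3"
check = (10, 1, 20, 9)          # check mark to the left of "Ver7.0"
stray = (200, 200, 210, 210)    # a mark near nothing

print(specify_string(circle, strings, 15))  # ('3', (0, 40, 8, 50))
print(specify_string(check, strings, 15))   # ('Ver7.0', (30, 0, 80, 10))
print(specify_string(stray, strings, 15))   # None
```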

Returning to FIG. 12, the description continues.

When the character string specifying processing shown in FIG. 14 for the current second region, that is, the character string specifying processing in step S26 of FIG. 12, is finished, it is determined whether the character string just specified belongs to the same first region as a character string specified earlier in step S26, a step that is passed through repeatedly as the individual additional recorded images are extracted and processed one at a time from step S22 onward (step S27).

For example, the ○ mark 521 shown in FIG. 13 is split into two parts 521a and 521b because of, for example, faint strokes when the mark was entered. Each of the parts 521a and 521b may therefore be recognized as a separate individual additional recorded image. In that case, the same character string at the same coordinates (here, the digit "3" shown in FIG. 4) is specified for both parts 521a and 521b. Because the same character string of the same first region is not needed a second or subsequent time, step S27 of FIG. 12 causes such repeated specifications to be ignored.

When step S27 finds that the specified character string belongs to a first region different from any so far, the specified character string is saved (step S28).

The above processing is executed for each individual additional recorded image on one difference image (steps S22 and S23). When all individual additional recorded images on that difference image have been processed (step S24), the processing shown in FIG. 12 for that difference image, that is, the character recognition processing shown as step S09 in FIG. 3, ends, and the process moves on to the character recognition processing for the next unprocessed filled document (step S07 of FIG. 3). When the character recognition processing has finished for all filled documents (step S08 of FIG. 3), the current execution of the image processing routine ends.

Thus, according to this embodiment, respondents' answers can be recognized without advance setup of the kind required for mark sheets, such as inputting beforehand, for each mark position, what a mark at that position means.
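
The duplicate handling of steps S27 and S28 amounts to remembering which first regions have already produced an answer. A minimal Python sketch follows; it assumes each specified result is a `(text, rect)` pair, or `None` when nothing was specified, and the names are hypothetical.

```python
def collect_answers(specified):
    """Keep one answer per first region.

    specified: iterable of (text, rect) or None, one per individual
    additional recorded image, in processing order.
    """
    seen_regions = set()
    answers = []
    for item in specified:
        if item is None:            # step S265: nothing was specified
            continue
        text, rect = item
        if rect in seen_regions:    # step S27: same first region as before
            continue
        seen_regions.add(rect)
        answers.append(text)        # step S28: save the character string
    return answers


# Parts 521a and 521b of one broken circle both point at the same "3".
specified = [
    ("3", (0, 40, 8, 50)),
    ("3", (0, 40, 8, 50)),
    None,
    ("5", (40, 40, 48, 50)),
]
print(collect_answers(specified))  # ['3', '5']
```

Keying on the region rather than on the text means that two identical strings printed at different positions on the form are still counted separately.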

In this embodiment, the rule is that a plurality of documents are read consecutively by the scanner 20, the first of those documents is treated as the unfilled document, and the second and subsequent documents are treated as filled documents. This makes it possible to acquire the image data of the unfilled document easily and reliably. In the present invention, however, a rule that places the unfilled document at a specific position such as the first sheet is not strictly necessary. The unfilled document may, for example, be inserted partway through a stack of filled documents. In that case, the image acquisition unit may perform processing that finds the unfilled document among the plurality of documents. As one example of such processing, differences between the first document and each of the second and subsequent documents are extracted in turn, and a document for which differences appear only on the first document is taken to be the unfilled document. Alternatively, an image may be created by extracting, from the data obtained by reading, the part common to the plurality of documents regardless of whether each is unfilled or filled; pattern matching is then performed between that image and each document read, and the document with the highest degree of match is taken to be the unfilled document.

Alternatively, even when filled documents are read in several batches, the unfilled document for a given type of document may be read only once and stored. The unfilled document corresponding to the filled documents read this time may then be identified by pattern matching against the filled documents, by feature point extraction, or by form recognition using the degree of match of regions bounded by straight lines.

Furthermore, in the present invention it is not even necessary to read an unfilled document. For example, an unfilled document may be created as data by extracting the part common to a plurality of filled documents. In that case, the unfilled document created as data by extracting the common part corresponds to the first image data representing the first image.
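
Extracting the common part of several pages, either to synthesize an unfilled document as described here or to locate the unfilled page within a mixed stack, could be sketched in Python as follows. The sketch assumes aligned, binarized pages (1 = ink) and uses a simple count of differing pixels as the degree of match; both are illustrative assumptions, and the names are hypothetical.

```python
def common_part(pages):
    """Ink only where every page has ink: a synthetic unfilled document."""
    height, width = len(pages[0]), len(pages[0][0])
    return [
        [1 if all(p[y][x] for p in pages) else 0 for x in range(width)]
        for y in range(height)
    ]


def mismatch(a, b):
    """Number of pixels on which two pages differ (lower means a better match)."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def find_unfilled(pages):
    """Index of the page that best matches the common part."""
    template = common_part(pages)
    return min(range(len(pages)), key=lambda i: mismatch(pages[i], template))


printed = [[1, 1, 0, 0], [0, 0, 0, 0]]     # the preprinted form
filled_a = [[1, 1, 0, 1], [0, 0, 0, 0]]    # form plus one mark
filled_b = [[1, 1, 0, 0], [0, 1, 0, 0]]    # form plus a different mark

print(find_unfilled([filled_a, printed, filled_b]))   # 1
print(common_part([filled_a, filled_b]) == printed)   # True
```

If every respondent happens to mark the same position, that mark survives in the common part, so a synthesized unfilled document is only as reliable as the variety of the filled documents it is built from.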

The description here has concerned an image processing apparatus made up of a notebook PC 30 connected to the scanner 20 by a communication cable 40, as shown in FIG. 1, but the image processing apparatus of the present invention need not take this form. For example, the functions of the image processing apparatus of the present invention may be built into a copier that combines a scanner and a printer, or into a multifunction machine with further functions. They may also be built into a portable terminal with a camera function, in which case an image obtained by photographing a document with the camera is the target of character recognition.

DESCRIPTION OF SYMBOLS

10 Character recognition system
20 Scanner
21 Document tray
22 Paper output tray
23 Top cover
30 Notebook personal computer (notebook PC)
31 Display screen
32 Keyboard
51A, 51B, 51C, 51D Unfilled document
52A, 52B, 52C, 52D Filled document
53A Difference image
521, 522, 523 ○ mark
551, 552 Individual additional recorded image
60 Image processing apparatus
61 Image acquisition unit
62 Character string recognition unit
63 Additional recorded image extraction unit
64 Character string specifying unit

Claims

An image processing apparatus comprising:
an image acquisition unit that acquires first image data representing a first image and second image data representing a second image in which additional recording has been made on the first image;
a character string recognition unit that recognizes character strings, including character strings consisting of a single character, from the first image;
an additional recorded image extraction unit that extracts, from the second image, an additional recorded image, which is an image additionally recorded on the first image; and
a character string specifying unit that specifies, from among the character strings recognized by the character string recognition unit, the character string corresponding to the additional recorded image on the first image.

The image processing apparatus according to claim 1, wherein:
the character string recognition unit, in addition to recognizing character strings from the first image, recognizes, for each recognized character string, a first region on the first image in which the character string is recorded, the first region including a region represented by the coordinates of one or more points;
the additional recorded image extraction unit extracts, for each individual additional recorded image making up the additional recorded image, a second region on the second image in which the individual additional recorded image is recorded, the second region including a region represented by the coordinates of one or more points; and
the character string specifying unit specifies the character string associated with a first region that is in a predetermined first positional relationship with the second region.

The image processing apparatus according to claim 2, wherein the character string recognition unit recognizes individual characters, recognizes as one character string a plurality of characters whose regions, each including a region represented by the coordinates of one or more points, are in a predetermined second positional relationship with one another, and associates with that character string, as its first region, a region on the first image in which the character string is recorded, including a region represented by the coordinates of one or more points.

The image processing apparatus according to claim 2, wherein the character string specifying unit specifies no character string for a second region when no first region is in the first positional relationship with that second region.

The image processing apparatus according to any one of the preceding claims, wherein, when the first image includes ruled lines, the character string recognition unit recognizes a character string for each region enclosed by the ruled lines.

The image processing apparatus according to claim 2, wherein, when the character string specifying unit has specified the same character string of the same first region for a plurality of second regions, it ignores that character string in all but one of those specifications.

An image processing program that, when executed in an information processing apparatus that executes programs, causes the information processing apparatus to operate as an image processing apparatus comprising:
an image acquisition unit that acquires first image data representing a first image and second image data representing a second image in which additional recording has been made on the first image;
a character string recognition unit that recognizes character strings, including character strings consisting of a single character, from the first image;
an additional recorded image extraction unit that extracts, from the second image, an additional recorded image, which is an image additionally recorded on the first image; and
a character string specifying unit that specifies, from among the character strings recognized by the character string recognition unit, the character string corresponding to the additional recorded image on the first image.