JP6888299B2

JP6888299B2 - Image processing equipment and image processing program

Info

Publication number: JP6888299B2
Application number: JP2017000149A
Authority: JP
Inventors: 猪股　浩司郎; 浩司郎猪股
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2017-01-04
Filing date: 2017-01-04
Publication date: 2021-06-16
Anticipated expiration: 2037-01-04
Also published as: JP2018109866A

Description

本発明は、画像処理装置および画像処理プログラムに関する。 The present invention relates to an image processing apparatus and an image processing program.

官公庁等に提出する書類や様々なアンケート用紙への記入等、印字された用紙（帳票やアンケート用紙等）に手書きで記入して提出する機会が多い。記入された用紙を集める側は、それらの用紙に記入された手書き文字や、○印等のマークで選択された文字（数字を含む）を自動で読み取って集計したいという要求がある。 There are many opportunities to handwrite and submit printed forms (forms, questionnaires, etc.) such as documents to be submitted to government offices and various questionnaires. There is a demand that the side that collects the filled-in forms wants to automatically read the handwritten characters written on those forms and the characters (including numbers) selected by the mark such as ○ mark and total them.

その要求に対し、特許文献１には、マークシートのように塗りつぶして回答する種類の帳票について、回答が記入されたマークシートをスキャナ等で読み取って集計する技術が開示されている。 In response to the request, Patent Document 1 discloses a technique of scanning a mark sheet on which an answer is written with a scanner or the like and totaling the form for a type of form to be filled and answered, such as a mark sheet.

特開平２０１３―４５３０９号公報Japanese Unexamined Patent Publication No. 2013-45309

しかしながら、上掲の特許文献１に開示された技術の場合、マークシートに記入されているマークの位置を検出することはできるが、その位置に記入されたマークが何を意味しているかは、別途の情報として事前設定しておく必要がある。 However, in the case of the technique disclosed in Patent Document 1 described above, the position of the mark written on the mark sheet can be detected, but what the mark written at that position means is separately determined. It is necessary to set in advance as the information of.

「本発明は、記入あるいは選択された位置に対応する文字情報を、あらかじめ文字と文字の位置とを対応づけて設定する作業を必要とすることなく認識する画像処理装置および画像処理プログラムを提供することを目的とする。
"The present invention provides an image processing device and an image processing program that recognize character information corresponding to an entered or selected position without requiring a work of associating a character with a character position in advance. The purpose is.

請求項１は、
第１の画像を表わす第１の画像データと、該第１の画像データに追加記録がなされた第２の画像を表わす第２の画像データとを取得する画像取得部と、
前記第２の画像の中から、前記第１の画像に対し追加記録された画像である追加記録画像を抽出する追加記録画像抽出部と、
前記追加記録画像に対応する第１の画像の領域に文字認識処理を実行する文字認識領域を設定し、該文字認識領域の面積を変化させながら、該文字認識領域について文字認識処理を繰り返し実行する文字認識処理実行部とを備え、
前記文字認識処理実行部が、認識の確からしさの算出を含む文字認識処理を実行するものであって、前記追加記録画像に対応する領域に、予め定められた閾値を越える確からしさの文字が認識されなかった場合に、該追加記録画像に対応する領域に設定された前記文字認識領域の位置を、他の第２の画像の追加記録画像の位置を参照して決定した位置にずらして、文字認識処理を実行することを特徴とする文字認識装置である。
請求項２は、
前記文字認識処理実行部は、前記追加記録画像に対応する領域に設定された前記文字認識領域の位置をずらすとともに、前記他の第２の画像の追加記録画像に対応する、予め定められた閾値を越える確からしさが得られたときの文字認識領域の面積に応じた面積の文字認識領域を設定して、該文字認識領域について文字認識処理を実行することを特徴とする請求項１に記載の文字認識装置である。 Claim 1 is
An image acquisition unit that acquires a first image data representing a first image and a second image data representing a second image in which additional recording is performed on the first image data.
An additional recorded image extraction unit that extracts an additional recorded image, which is an image additionally recorded with respect to the first image, from the second image.
A character recognition area for executing character recognition processing is set in the area of the first image corresponding to the additional recorded image, and the character recognition processing is repeatedly executed for the character recognition area while changing the area of the character recognition area. Equipped with a character recognition processing execution unit
The character recognition processing execution unit executes character recognition processing including calculation of recognition certainty, and recognizes characters with certainty exceeding a predetermined threshold in the area corresponding to the additional recorded image. If not, the position of the character recognition area set in the area corresponding to the additional recorded image is shifted to a position determined by referring to the position of the additional recorded image of the other second image, and the character is displayed. It is a character recognition device characterized by executing recognition processing.
Claim 2
The character recognition processing execution unit shifts the position of the character recognition area set in the area corresponding to the additional recording image, and has a predetermined threshold value corresponding to the additional recording image of the other second image. The first aspect of claim 1, wherein a character recognition area having an area corresponding to the area of the character recognition area when the certainty exceeding the above is obtained is set, and the character recognition process is executed for the character recognition area. It is a character recognition device.

請求項３は、
前記追加記録画像抽出部が、前記追加記録画像を構成する、相互に分離した複数の図形であっても、予め定められた閾値距離以内に互いに近づいた複数の図形については該複数の図形が同一の追加記録画像に属するものとみなすことを特徴とする請求項１または２に記載の文字認識装置である。 Claim 3
Even if the additional recorded image extraction unit constitutes the additional recorded image and is separated from each other, the plurality of figures are the same for the plurality of figures that are close to each other within a predetermined threshold distance. The character recognition device according to claim 1 or 2 , wherein the character recognition device is considered to belong to the additional recorded image of the above.

請求項４は、
前記文字認識処理実行部は、前記第１の画像上に前記文字認識領域を設定し、該第１の画像上の文字を認識対象とするものであることを特徴とする請求項１から３のうちのいずれか１項に記載の文字認識装置である。 Claim 4
The character recognition processing execution unit sets the character recognition area on the first image, and makes the character on the first image a recognition target, according to any one of claims 1 to 3 . The character recognition device according to any one of the items.

請求項５は、
前記文字認識処理実行部は、前記第２の画像上若しくは前記第１の画像と該第２の画像との差分の画像上に前記文字認識領域を設定し、該第２の画像上若しくは該差分の画像上の文字を認識対象とするものであることを特徴とする請求項１から４のうちのいずれか１項に記載の文字認識装置である。 Claim 5
The character recognition processing execution unit sets the character recognition area on the second image or on the image of the difference between the first image and the second image, and on the second image or the difference. The character recognition device according to any one of claims 1 to 4 , wherein the characters on the image of the above are to be recognized.

請求項６は、
前記文字認識処理実行部は、前記第１の画像上の、前記追加記録画像に対応する領域が
空白の領域であった場合に、前記第２の画像上若しくは前記第１の画像と該第２の画像との差分の画像上に前記文字認識領域を設定して該第２の画像上若しくは該差分の画像上の文字を認識対象とし、前記第１の画像上の前記追加記録画像に対応する領域が空白の領域ではなかった場合に、該第１の画像上に前記文字認識領域を設定して該第１の画像上の文字を認識対象とするものであることを特徴とする請求項１から５のうちのいずれか１項に記載の文字認識装置である。 Claim 6
When the region corresponding to the additional recorded image on the first image is a blank region, the character recognition processing execution unit is on the second image or on the first image and the second image. The character recognition area is set on the image of the difference from the image of the above, the character on the second image or the image of the difference is targeted for recognition, and corresponds to the additional recorded image on the first image. Claim 1 is characterized in that when the area is not a blank area, the character recognition area is set on the first image and the characters on the first image are to be recognized. The character recognition device according to any one of 5 to 5.

請求項７は、
プログラムを実行する情報処理装置内で実行されて、該情報処理装置を、
第１の画像を表わす第１の画像データと、該第１の画像データに追加記録がなされた第２の画像を表わす第２の画像データとを取得する画像取得部と、
前記第２の画像の中から、前記第１の画像に対し追加記録された画像である追加記録画像を抽出する追加記録画像抽出部と、
前記追加記録画像に対応する領域に文字認識処理を実行する文字認識領域を設定し、該文字認識領域の面積を変化させながら、該文字認識領域について文字認識処理を実行する文字認識処理実行部とを備え、
前記文字認識処理実行部が、認識の確からしさの算出を含む文字認識処理を実行するものであって、前記追加記録画像に対応する領域に、予め定められた閾値を越える確からしさの文字が認識されなかった場合に、該追加記録画像に対応する領域に設定された前記文字認識領域の位置を、他の第２の画像の追加記録画像の位置を参照して決定した位置にずらして、文字認識処理を実行する文字認識装置として動作させることを特徴とする文字認識プログラムである。 Claim 7
The information processing device is executed in the information processing device that executes the program.
An image acquisition unit that acquires a first image data representing a first image and a second image data representing a second image in which additional recording is performed on the first image data.
An additional recorded image extraction unit that extracts an additional recorded image, which is an image additionally recorded with respect to the first image, from the second image.
A character recognition processing execution unit that sets a character recognition area for executing character recognition processing in an area corresponding to the additional recorded image, and executes character recognition processing for the character recognition area while changing the area of the character recognition area. equipped with a,
The character recognition processing execution unit executes character recognition processing including calculation of recognition certainty, and recognizes characters with certainty exceeding a predetermined threshold in an area corresponding to the additional recorded image. If not, the position of the character recognition area set in the area corresponding to the additional recorded image is shifted to a position determined by referring to the position of the additional recorded image of the other second image, and the character is displayed. It is a character recognition program characterized by operating as a character recognition device that executes recognition processing.

請求項１の文字認識装置および請求項７の文字認識プログラムによれば、記入あるいは選択された位置に対応する文字情報を、あらかじめ文字と文字の位置とを対応づけて設定する作業を必要とすることなく認識することができる。
また、請求項１の文字認識装置および請求項７の文字認識プログラムによれば、他の第２の画像の追加記録画像の位置を参照しない場合と比べ、正しく認識される可能性が高まる。 According to the character recognition device of claim 1 and the character recognition program of claim 7 , it is necessary to set the character information corresponding to the entered or selected position in advance by associating the character with the character position. Can be recognized without.
Further, according to the character recognition device of claim 1 and the character recognition program of claim 7, the possibility of correct recognition is increased as compared with the case where the position of the additional recorded image of the other second image is not referred to.

請求項２の文字認識装置によれば、ずらした位置での文字認識領域の面積を初期面積から再スタートして文字認識処理を繰り返す場合と比べ、文字認識終了までの時間が短縮される可能性が高まる。
請求項３の文字認識装置によれば、物理的に分離した図形ごとに追加記録画像とみなす場合と比べ、正しく認識される可能性が高まる。 According to the character recognition device of claim 2 , the time until the end of character recognition may be shortened as compared with the case where the area of the character recognition area at the shifted position is restarted from the initial area and the character recognition process is repeated. Will increase.
According to the character recognition device of claim 3, there is an increased possibility that each physically separated figure is correctly recognized as compared with the case where it is regarded as an additional recorded image.

請求項４の文字認識装置によれば、認識対象の文字が第１の画像と第２の画像との双方に存在する場合に、第２の画像上に文字認識領域を設定する場合と比べ、正しく認識される可能性が高まる。 According to the character recognition device of claim 4, when the character to be recognized exists in both the first image and the second image, as compared with the case where the character recognition area is set on the second image, as compared with the case where the character recognition area is set on the second image. It is more likely to be recognized correctly.

請求項５の文字認識装置によれば、認識対象の文字が第１の画像には存在しない場合には、第２の画像上若しくは差分画像上に文字認識領域を設定することによって、文字認識を行なうことができる。 According to the character recognition device of claim 5, when the character to be recognized does not exist in the first image, the character recognition is performed by setting the character recognition area on the second image or the difference image. Can be done.

請求項６の文字認識装置によれば、文字認識領域を設定する画像を、第１の画像、第２の画像、若しくは差分画像のいずれかに固定した場合と比べ、広範囲の認識が可能である。 According to the character recognition device of claim 6 , a wide range of recognition is possible as compared with the case where the image for which the character recognition area is set is fixed to any one of the first image, the second image, or the difference image. ..

文字認識システムの外観図である。It is an external view of a character recognition system. ノートＰＣ内での文字認識プログラムの実行により実現する文字認識装置の機能ブロック図である。It is a functional block diagram of a character recognition device realized by executing a character recognition program in a notebook PC. 本発明の一実施形態としての文字認識プログラムのフローチャートを示した図である。It is a figure which showed the flowchart of the character recognition program as one Embodiment of this invention. 未記入原稿と記入済原稿の一例を示した図である。It is a figure which showed an example of the unfilled manuscript and the filled-out manuscript. 図３に１つのステップ（ステップＳ０８）で示した文字認識処理の第１例についての詳細フローを示した図である。FIG. 3 is a diagram showing a detailed flow of the first example of the character recognition process shown in one step (step S08). 差分画像の一例を示した図である。It is a figure which showed an example of the difference image. ２つに分離した図形を１つの個別追加記録画像として認識するための処理を示した模式図である。It is a schematic diagram which showed the process for recognizing the figure separated into two as one individual additional recorded image. 文字認識領域設定方法の説明図である。It is explanatory drawing of the character recognition area setting method. 文字認識領域の再設定方法を示した図である。It is a figure which showed the setting method of the character recognition area. 図３に１つのステップ（ステップＳ０８）で示した文字認識処理の第２例についての詳細フローを示した図である。FIG. 3 is a diagram showing a detailed flow of the second example of the character recognition process shown in one step (step S08). 回答としての○印が認識対象の文字からずれた位置に記入された場合の、文字認識領域を示した模式図である。It is a schematic diagram which showed the character recognition area when ○ mark as an answer is written in the position shifted from the character to recognize. 図３に１つのステップとして示した再認識処理の第１例についての詳細フロ―を示した図である。FIG. 3 is a diagram showing a detailed flow of the first example of the re-recognition process shown as one step in FIG. 図３に１つのステップとして示した再認識処理の第２例についての詳細フロ―を示した図である。It is a figure which showed the detailed flow about the 2nd example of the re-recognition processing shown as one step in FIG. 対象の差分画像上の１つの個別追加記録画像と、それに対応する、対象以外の差分画像上の１つの個別追加記録画像を、未記入原稿上に重ねて示した図である。It is a figure which superimposes one individual additional recording image on the target difference image, and corresponding one individual addition recording image on the non-target difference image on the unfilled manuscript.

以下、本発明の実施の形態について説明する。 Hereinafter, embodiments of the present invention will be described.

図１は、文字認識システムの外観図である。 FIG. 1 is an external view of a character recognition system.

ここに示す文字認識システム１０は、スキャナ２０とノート型パーソナルコンピュータ（以下、「ノートＰＣ」と略記する）３０とを備えている。スキャナ２０とノートＰＣ３０との間は、通信ケーブル４０で接続されている。 The character recognition system 10 shown here includes a scanner 20 and a notebook personal computer (hereinafter, abbreviated as "notebook PC") 30. The scanner 20 and the notebook PC 30 are connected by a communication cable 40.

スキャナ２０は、原稿に記録されている画像を読み取って画像データを生成する装置である。このスキャナ２０の原稿トレイ２１上に原稿を置き、スタートボタン（不図示）を押すと、あるいは、ノートＰＣから指示を与えると、原稿が１枚、スキャナ２０内に送り込まれる。スキャナ２０内には原稿上の画像を光電的に読み取るセンサ（不図示）が備えられていて、スキャナ２０内に送り込まれた原稿から、その原稿上に記録されている画像が光電的に読み取られて画像データが生成される。記録されている画像が読み取られた後の原稿は、排紙トレイ２２上に排出される。この原稿トレイ２１には複数枚の原稿を積み重ねて載置することができ、スキャナ２０は、原稿トレイ２１上の複数枚の原稿を１枚ずつ順次にスキャナ２０内に送り込み、その送り込まれた原稿上の画像を読み取り、排紙トレイ２２上に排出する。 The scanner 20 is a device that reads an image recorded on a document and generates image data. When a document is placed on the document tray 21 of the scanner 20 and the start button (not shown) is pressed, or when an instruction is given from the notebook PC, one document is sent into the scanner 20. The scanner 20 is provided with a sensor (not shown) that photoelectrically reads the image on the document, and the image recorded on the document is photoelectrically read from the document sent into the scanner 20. Image data is generated. After the recorded image is read, the original is ejected onto the output tray 22. A plurality of originals can be stacked and placed on the original tray 21, and the scanner 20 sequentially feeds the plurality of originals on the original tray 21 into the scanner 20 one by one, and the fed originals are fed. The above image is read and discharged onto the output tray 22.

また、このスキャナ２０は、背面側に設けられた左右に延びるヒンジ（不図示）を回転中心として上蓋２３を持ち上げることができる。この上蓋２３を持ち上げてその下に原稿を１枚置き、上蓋２３を閉じて、その置かれた原稿を読み取ることもできる。 Further, the scanner 20 can lift the upper lid 23 with a hinge (not shown) extending to the left and right provided on the back side as a center of rotation. It is also possible to lift the upper lid 23, place one document under it, close the upper lid 23, and read the placed document.

このスキャナ２０での読み取りにより得られた画像データは、通信ケーブル４０を経由してノートＰＣ３０に入力される。 The image data obtained by reading with the scanner 20 is input to the notebook PC 30 via the communication cable 40.

ノートＰＣ３０は、表示画面３１やキーボード３２を備えており、また、その内部には、プログラムを実行するためのＣＰＵやメモリ等の設備を備えている。このノートＰＣ３０ではプログラムが実行されて、その実行されたプログラムに応じた処理が行われる。本実施形態に対応しては、このノートＰＣでは、以下に説明する文字認識プログラムが実行される。このノートＰＣ３０内で実行される文字認識プログラムは、本発明の文字認識プログラムの一例に相当する。そして、このノートＰＣ３０は、この文字認識プログラムの実行により、本発明の一実施形態としての文字認識装置として動作する。 The notebook PC 30 is provided with a display screen 31 and a keyboard 32, and is provided with equipment such as a CPU and a memory for executing a program inside the notebook PC 30. A program is executed in the notebook PC 30, and processing is performed according to the executed program. Corresponding to this embodiment, the character recognition program described below is executed in this notebook PC. The character recognition program executed in the notebook PC 30 corresponds to an example of the character recognition program of the present invention. Then, the notebook PC 30 operates as a character recognition device as an embodiment of the present invention by executing this character recognition program.

図２は、ノートＰＣ内での文字認識プログラムの実行により実現する文字認識装置の機能ブロック図である。 FIG. 2 is a functional block diagram of a character recognition device realized by executing a character recognition program in a notebook PC.

本実施形態の文字認識装置５０は、画像取得部５１と、追加記録画像抽出部５２と、追加記録画像分離部５３と、文字認識処理実行部５４とを有する。具体的な実施形態の例示は後回しにして、ここでは、各部５１〜５４について概括的に説明する。なお、ここでは、データ上の画像を取り扱っており、したがって、ここでは、特に区別する必要がある場合を除き、データ上の画像であっても、データ上の画像であることを特に明記することなく、単に「画像」あるいは「原稿」と称することがある。 The character recognition device 50 of the present embodiment includes an image acquisition unit 51, an additional recording image extraction unit 52, an additional recording image separation unit 53, and a character recognition processing execution unit 54. Examples of specific embodiments will be postponed, and here, each part 51 to 54 will be described in detail. It should be noted that the image on the data is dealt with here, and therefore, unless it is necessary to distinguish it, it should be clearly stated that the image on the data is an image on the data. Instead, it may be simply referred to as an "image" or "manuscript."

画像取得部５１は、アンケートの設問としての文字が記録されていてその設問に対する回答が未記入の未記入原稿の画像と、その未記入原稿に回答が追加記録された記入済原稿の画像とを取得する。未記入原稿は１枚であるが、記入済原稿は通常は複数枚存在し、画像取得部５１は、それら全ての画像を取得する。これら未記入原稿および記入済原稿は、本発明にいう、それぞれ第１の画像および第２の画像の各一例に相当する。 The image acquisition unit 51 captures an image of an unfilled manuscript in which characters as a question of a questionnaire are recorded and an answer to the question is not filled in, and an image of a completed manuscript in which an answer is additionally recorded in the unfilled manuscript. get. Although there is one unfilled manuscript, there are usually a plurality of filled-in manuscripts, and the image acquisition unit 51 acquires all the images. These unfilled manuscripts and filled-in manuscripts correspond to examples of the first image and the second image, respectively, as referred to in the present invention.

また、追加記録画像抽出部５２は、記録済原稿と未記入原稿との差分の画像を算出することにより、記録済原稿の中から、未記入原稿に対し追加記録された回答の画像である追加記録画像を抽出する。 Further, the additional recorded image extraction unit 52 calculates an image of the difference between the recorded manuscript and the unfilled manuscript, and adds an image of the answer additionally recorded to the unfilled manuscript from the recorded manuscripts. Extract the recorded image.

また、追加記録画像分離部５３は、抽出された追加記録画像を、個別の記録ごとの追加記録画像（ここでは、個別の記録ごとの追加記録画像を「個別追加記録画像」と称する）に分離する。ここで、本実施形態における追加記録画像分離部５３は、追加記録画像を各個別追加記録画像に分離するにあたり、追加記録画像を構成する、相互に分離した複数の図形であっても、予め定められた閾値距離以内に互いに近づいた複数の図形についてはそれら複数の図形が同一の個別追加記録画像に属するものとみなして、追加記録画像を各個別追加記録画像に分離する。 Further, the additional recording image separation unit 53 separates the extracted additional recording image into an additional recording image for each individual recording (here, the additional recording image for each individual recording is referred to as an "individual additional recording image"). To do. Here, the additional recorded image separation unit 53 in the present embodiment determines in advance when separating the additional recorded image into each individual additional recorded image, even if it is a plurality of mutually separated figures constituting the additional recorded image. For a plurality of figures that approach each other within the specified threshold distance, the plurality of figures are regarded as belonging to the same individual additional recorded image, and the additional recorded image is separated into each individual additional recorded image.

さらに、文字認識処理実行部５４は、各個別追加記録画像に対応する各領域に文字認識処理を実行する文字認識領域を設定し、その文字認識領域の面積を変化させながら、その文字認識領域について文字認識処理を繰り返し実行する。ここで、本実施形態における文字認識処理実行部５４は、未記入原稿上の、個別追加記録画像に対応する領域が空白の領域であった場合には、記入済原稿上若しくは未記入原稿と記入済原稿との差分の画像上に文字認識領域を設定して記入済原稿上若しくは差分画像上の文字を認識対象とし、未記入原稿上の個別追加記録画像に対応する領域が空白の領域ではなかった場合には、未記入原稿上に文字認識領域を設定して未記入原稿上の文字を認識対象とする。 Further, the character recognition processing execution unit 54 sets a character recognition area for executing the character recognition processing in each area corresponding to each individual additional recorded image, and changes the area of the character recognition area for the character recognition area. Repeat the character recognition process. Here, when the area corresponding to the individually added recorded image on the unfilled manuscript is a blank area, the character recognition processing execution unit 54 in the present embodiment writes as an filled-in manuscript or an unfilled manuscript. A character recognition area is set on the image of the difference from the completed manuscript to recognize the characters on the filled-in manuscript or the difference image, and the area corresponding to the individually added recorded image on the unfilled manuscript is not a blank area. If this is the case, a character recognition area is set on the unfilled manuscript to recognize the characters on the unfilled manuscript.

また、この文字認識処理実行部５４は、認識の確からしさの算出を含む文字認識処理を実行するものであって、前記文字認識領域の面積を変化させながら、予め定められた認識処理停止条件を満足するまで、文字認識処理の実行を繰り返す。この認識処理停止条件としては、確からしさが予め定められた閾値を越えること、確からしさが極大値に達すること、若しくは、文字認識処理の実行を予め定められた回数繰り返すこと、などが採用される。 Further, the character recognition processing execution unit 54 executes character recognition processing including calculation of the certainty of recognition, and while changing the area of the character recognition area, a predetermined recognition processing stop condition is set. The execution of the character recognition process is repeated until it is satisfied. As the recognition processing stop condition, it is adopted that the certainty exceeds a predetermined threshold value, the certainty reaches a maximum value, or the execution of the character recognition processing is repeated a predetermined number of times. ..

ここで、本実施形態では、個別追加記録画像に対応する未記入原稿上の領域が空白の領域ではなかった場合には、未記入原稿上に文字認識領域が設定されて未記入原稿上の文字が認識対象となるが、本実施形態における文字認識処理実行部５４は、未記入原稿上に設定された文字認識領域に予め定められた閾値を越える確からしさの文字が認識されなかった場合に、その文字認識領域の位置をずらして、文字認識処理を実行する。この文字認識領域の位置をずらすにあたっては、文字認識処理実行部５４は、現在処理の対象としている記入済原稿とは異なる他の記入済原稿に記録されている個別追加記録画像の位置を参照して、文字認識領域をずらす位置を決定する。また、この文字認識処理実行部５４は、個別追加記録画像に対応する未記入原稿上の領域に設定された文字認識領域の位置をずらすとともに、上記の他の記入済原稿上の個別追加記録画像に対応する、閾値以上の確からしさが得られたときの文字認識領域の面積に応じた面積の文字認識領域を設定して、その面積の文字認識領域について文字認識処理を実行する。具体例は後述する。 Here, in the present embodiment, when the area on the unfilled manuscript corresponding to the individually added recorded image is not a blank area, the character recognition area is set on the unfilled manuscript and the characters on the unfilled manuscript are set. However, when the character recognition processing execution unit 54 in the present embodiment does not recognize a character with a certainty exceeding a predetermined threshold value in the character recognition area set on the unfilled manuscript, The character recognition process is executed by shifting the position of the character recognition area. In shifting the position of the character recognition area, the character recognition processing execution unit 54 refers to the position of the individually added recorded image recorded in another filled-in manuscript different from the filled-in manuscript currently being processed. To determine the position to shift the character recognition area. Further, the character recognition processing execution unit 54 shifts the position of the character recognition area set in the area on the unfilled manuscript corresponding to the individually added recorded image, and also shifts the position of the character recognition area on the other written manuscript described above. A character recognition area having an area corresponding to the area of the character recognition area when the certainty equal to or higher than the threshold value is obtained is set, and the character recognition process is executed for the character recognition area of that area. Specific examples will be described later.

図３は、本発明の一実施形態としての文字認識プログラムのフローチャートを示した図である。 FIG. 3 is a diagram showing a flowchart of a character recognition program as an embodiment of the present invention.

図１に示すスキャナ２０で原稿上の画像が読み取られて画像データが生成され、その生成された画像データが通信ケーブル４０を経由してノートＰＣ３０に入力される。すると、この図３に示す文字認識プログラムが起動し、通信ケーブル４０を経由してノートＰＣ３０に入力されてきた画像データが取得される（ステップＳ０１）。なお、前述の通り、ここでは、特に必要がある場合を除き、データ上の画像であっても「データ」を省略し、「画像」あるいは「原稿」と称している。 The image on the document is read by the scanner 20 shown in FIG. 1 to generate image data, and the generated image data is input to the notebook PC 30 via the communication cable 40. Then, the character recognition program shown in FIG. 3 is activated, and the image data input to the notebook PC 30 via the communication cable 40 is acquired (step S01). As described above, here, unless it is particularly necessary, even if it is an image on the data, the "data" is omitted and referred to as an "image" or a "manuscript".

ステップＳ０１にて画像を取得すると、今回取得した画像が１枚目の画像であるか２枚目以降の画像であるかが判定される（ステップＳ０２）。 When the image is acquired in step S01, it is determined whether the image acquired this time is the first image or the second and subsequent images (step S02).

本実施形態では、スキャナ２０に、１枚目は未記入原稿を読み取らせ、その後、２枚目以降に記入済原稿を順次読み取らせるというルールを置いている。そこで、この文字認識プログラムは、取得した画像が１枚目の画像のときは、その画像を未記入原稿として一時保存する（ステップＳ０３）。２枚目以降についても画像取得を繰り返し（ステップＳ０５）、２枚目以降に取得した画像は全て記入済原稿として一時保存する（ステップＳ０４）。 In the present embodiment, a rule is set in which the scanner 20 is made to read the unfilled manuscript on the first sheet, and then sequentially read the filled-in manuscript on the second and subsequent sheets. Therefore, when the acquired image is the first image, this character recognition program temporarily saves the image as an unfilled manuscript (step S03). Image acquisition is repeated for the second and subsequent sheets (step S05), and all the images acquired for the second and subsequent sheets are temporarily saved as completed manuscripts (step S04).

図４は、未記入原稿と記入済原稿の一例を示した図である。
図４（Ａ）は、記入前のアンケート用紙、すなわち未記入原稿５１を表している。ここでは、アンケート内容として（１）〜（３）の３つの設問があり、それら３つの設問のうち、（１）と（２）の設問に対する回答は、１〜５の数字のうちのいずれか１つの数字を○印で囲うことによりその数字を選択する方式のものである。（３）の設問は、その回答を、空白の回答欄５１１に自由に記入してもらう形式の設問である。 FIG. 4 is a diagram showing an example of an unfilled manuscript and a filled-in manuscript.
FIG. 4A shows a questionnaire form before filling in, that is, an unfilled manuscript 51. Here, there are three questions (1) to (3) as the contents of the questionnaire, and of these three questions, the answer to the questions (1) and (2) is any of the numbers 1 to 5. This is a method in which one number is selected by enclosing it with a circle. Question (3) is a question in the form of having the answer freely entered in the blank answer column 511.

また、図４（Ｂ）は、図４（Ａ）に示したアンケート用紙と同一様式のアンケート用紙上に回答者が回答を記入した記入済原稿５２を表している。記入済原稿は１枚とは限らず、スキャナ２０で順次読み込まれた複数枚の原稿のうちの２枚目以降の原稿の１枚１枚それぞれが記入済原稿として取り扱われる。 Further, FIG. 4B shows a completed manuscript 52 in which the respondent entered the answer on the questionnaire form having the same format as the questionnaire form shown in FIG. 4A. The completed manuscript is not limited to one, and each of the second and subsequent manuscripts out of the plurality of manuscripts sequentially read by the scanner 20 is treated as a filled-in manuscript.

この図４（Ｂ）に示されている１枚の記入済原稿では、（１）の設問に関しては、数字の「３」が○印５２１で囲まれている。また、（２）の設問に関しては、数字の「１」が○印５２２で囲まれている。さらに、（３）の設問に関しては、空白だった回答欄に回答者が記入した文字列５２３が並んでいる。 In the one completed manuscript shown in FIG. 4 (B), the number "3" is surrounded by a circle 521 for the question (1). Regarding the question (2), the number "1" is surrounded by a circle 522. Further, regarding the question (3), the character string 523 entered by the respondent is lined up in the blank answer column.

図３に戻って説明を続ける。 The explanation will be continued by returning to FIG.

一連の画像取得を終了すると（ステップＳ０５）、次に、ステップＳ０４で一時保存しておいた記入済原稿のうちの１枚を取り出す（ステップＳ０６）。ただし、ステップＳ０８における文字認識処理が済んでいる記入済原稿は取出しの対象からは外している。そして、未処理の記入済原稿が有ったときは、すなわち、未処理の記入済原稿を取り出せたときは（ステップＳ０７）、その取り出した１枚の未処理の記入済原稿について、文字認識処理を実行する（ステップＳ０８）。文字認識処理の詳細については、後述する。 When the series of image acquisition is completed (step S05), then one of the completed documents temporarily saved in step S04 is taken out (step S06). However, the completed manuscript that has undergone the character recognition process in step S08 is excluded from the extraction target. Then, when there is an unprocessed written manuscript, that is, when the unprocessed written manuscript can be taken out (step S07), the character recognition process is performed on the one unprocessed written manuscript taken out. Is executed (step S08). The details of the character recognition process will be described later.

未処理の記入済原稿を取り出せなかったとき、すなわち、全ての記入済原稿について文字認識処理（ステップＳ０８）が終了したときは（ステップＳ０７）、次に、一定条件下にある文字１つずつについて（ステップＳ０９）、再認識処理を実行する（ステップＳ１０）。ステップＳ０９の条件および再認識処理（ステップＳ１０）については後述する。ステップＳ０９の条件を満たす文字が存在しないとき、あるいは、再認識処理（ステップＳ１０）によってステップＳ０９の条件を満たす文字が存在しなくなったときは、今回の文字認識ルーチンを終了する。 When the unprocessed completed manuscript cannot be taken out, that is, when the character recognition process (step S08) for all the filled-in manuscripts is completed (step S07), then for each character under certain conditions. (Step S09), the re-recognition process is executed (step S10). The condition of step S09 and the re-recognition process (step S10) will be described later. When the character satisfying the condition of step S09 does not exist, or when the character satisfying the condition of step S09 does not exist due to the re-recognition process (step S10), the current character recognition routine is terminated.

図５は、図３に１つのステップ（ステップＳ０８）で示した文字認識処理の第１例についての詳細フローを示した図である。 FIG. 5 is a diagram showing a detailed flow of the first example of the character recognition process shown in one step (step S08) in FIG.

ここでは先ず、図３のステップＳ０６で取り出した１枚の記入済原稿とステップＳ０３で一時保存しておいた未記入原稿との間の差分の画像を生成する（ステップＳ２１）。 Here, first, an image of the difference between the one filled-in manuscript taken out in step S06 of FIG. 3 and the unfilled manuscript temporarily saved in step S03 is generated (step S21).

図６は、差分画像の一例を示した図である。 FIG. 6 is a diagram showing an example of a difference image.

この図６に示す差分画像５３は、図４（Ａ）に示す未記入原稿５１と、図４（Ｂ）に示す記入済原稿５２のうちの一番上の１枚の記入済原稿との間の差分画像である。この差分画像５３には、回答者によって記入された、２つの○印５２１，５２２と文字列５２３とからなる「追加記録画像」が抽出される。この差分画像上に現れた追加記録画像は、本発明にいう追加記録画像の一例に相当する。 The difference image 53 shown in FIG. 6 is between the unfilled manuscript 51 shown in FIG. 4 (A) and the top one of the filled-in manuscripts 52 shown in FIG. 4 (B). It is a difference image of. An "additional recorded image" composed of two circles 521 and 522 and a character string 523 entered by the respondent is extracted from the difference image 53. The additional recorded image appearing on the difference image corresponds to an example of the additional recorded image referred to in the present invention.

図５に戻って説明を続ける。 The explanation will be continued by returning to FIG.

図６に例示するような差分画像を生成すると（ステップＳ２１）、次に、その差分画像上に現れた追加記録画像を、個別の記録ごとの画像である「個別追加記録画像」に分離する（ステップＳ２２）。ここで、「個別追加記録画像」とは、回答者が１つの文字あるいは１つの図形として認識する程度にまとまった画像の各々をいう。具体的には、図６に示す例では、２つの○印５２１，５２２の各々と、文字列５２３を構成するひと文字ひと文字が、各個別追加記録画像である。したがって、記入時の掠れ等により複数に分離した図形や複数の部位に分離した文字であっても、複数に分離した図形あるいは複数の部位に分離した文字を１つの個別追加記録画像として認識すべき場面も存在する。 When a difference image as illustrated in FIG. 6 is generated (step S21), the additional recorded image appearing on the difference image is then separated into "individual additional recorded images" which are images for each individual recording (step S21). Step S22). Here, the "individual additional recorded image" refers to each of the images collected to the extent that the respondent recognizes it as one character or one figure. Specifically, in the example shown in FIG. 6, each of the two circles 521 and 522 and each character constituting the character string 523 are each individual additional recorded image. Therefore, even if the figure is separated into a plurality of figures or the characters are separated into a plurality of parts due to blurring at the time of writing, the figure separated into a plurality of parts or the characters separated into a plurality of parts should be recognized as one individual additional recorded image. There are also scenes.

図７は、２つに分離した図形を１つの個別追加記録画像として認識するための処理を示した模式図である。 FIG. 7 is a schematic diagram showing a process for recognizing a figure separated into two as one individual additional recorded image.

図７（Ａ）は、差分画像上に現れた、回答者によって描かれた○印の１つである。この図７（Ａ）に示された○印は、途中が掠れて２つに分離した図形となっている。 FIG. 7A is one of the circles drawn by the respondents that appeared on the difference image. The circles shown in FIG. 7 (A) are figures that are separated into two by being blurred in the middle.

ここでは、図７（Ａ）に示す○印を構成する各画素の周りを、予め定められた範囲に亘ってその○印を構成する画素として埋めていくことで、図７（Ｂ）に示すように、○印を構成している線を太らせる。 Here, by filling the periphery of each pixel constituting the ○ mark shown in FIG. 7 (A) as a pixel constituting the ○ mark over a predetermined range, it is shown in FIG. 7 (B). As shown, thicken the lines that make up the circle.

図７（Ｃ）は、画素を升目で表現した模式図である。１つの升目が１つの画素を表わしている。中央の画素Ｐは、図７（Ａ）の○印を構成する線上の多数の画素を代表させて１つだけ示した画素である。 FIG. 7C is a schematic diagram in which pixels are represented by squares. One square represents one pixel. The central pixel P is a pixel showing only one on behalf of a large number of pixels on the line forming the circle in FIG. 7 (A).

線を太らせるにあたっては、具体的には、この図７（Ｃ）に示すように、○印を構成している１つの画素Ｐが存在したときに、その画素Ｐの周りの予め定められた範囲内（ここに示す例では、５画素×５画素の範囲内）にある画素を、○印を構成する画素として塗り潰す。ここでは、代表的に１つの画素Ｐについて示したが、○印を構成している全ての画素について同様の処理を行なって、図７（Ｂ）に示すような、太線の丸印を生成する。このようにして線を太らせた結果、繋がった図形を、１つの個別追加記録画像として認識する。本実施形態では、このような処理により、差分画像上に現れた追加記録画像が各個別追加記録画像に分離される。図６に示す差分画像５３の例では、上記の処理により、２つの○印５２１，５２２の１つずつと、文字列５２３を構成している文字１つずつに分離され、それらの１つずつが、各個別追加記録画像として認識される。
Specifically, in thickening the line, as shown in FIG. 7 (C), when one pixel P constituting the circle is present, it is predetermined around the pixel P. Pixels within the range (in the example shown here, within the range of 5 pixels × 5 pixels) are filled as pixels constituting the circles. Here, one pixel P is typically shown, but the same processing is performed for all the pixels constituting the circles to generate thick circles as shown in FIG. 7 (B). .. As a result of thickening the line in this way, the connected figures are recognized as one individual additional recorded image. In the present embodiment, by such processing, the additional recorded image appearing on the difference image is separated into each individual additional recorded image. In the example of the difference image 53 shown in FIG. 6, by the above processing, each of the two circles 521 and 522 and one of the characters constituting the character string 523 are separated, and each of them is separated. Is recognized as each individual additional recorded image.

なお、本実施形態では、線を太らせて互いに繋がる図形を個別追加記録画像とする処理を採用しているが、この処理は、互いに離れた図形が互いに予め定められた距離以内に近接しているか否かを判定する処理の１つである。すなわち、ここでは、互いに離れた図形が互いに予め定められた距離以内に近接している場合に、１つの個別追加記録画像として認識される。 In the present embodiment, a process of thickening a line to make a figure connected to each other as an individual additional recorded image is adopted, but in this process, figures separated from each other are close to each other within a predetermined distance. This is one of the processes for determining whether or not there is a presence. That is, here, when the figures separated from each other are close to each other within a predetermined distance, they are recognized as one individual additional recorded image.

再び図５に戻って説明を続ける。 The explanation will be continued by returning to FIG.

差分画像上に現れた追加記録画像を、上記のようにして個々の個別追加記録画像に分離した後（ステップＳ２２）、差分画像を左上から右下に向かって検査していき（ステップＳ２３）、個別追加記録画像を見つけたら、その見つけた１つの個別追加記録画像を取り出す（ステップＳ２４）。そして、今回対象としている差分画像上に未処理の個別追加記録画像が無くなるまで（ステップＳ２５）、以下の処理を繰り返す。 After the additional recorded image appearing on the difference image is separated into individual additional recorded images as described above (step S22), the difference image is inspected from the upper left to the lower right (step S23). When the individual additional recorded image is found, one of the found individual additional recorded images is taken out (step S24). Then, the following processing is repeated until there is no unprocessed individual additional recorded image on the difference image targeted this time (step S25).

ここでは先ず、未記入原稿上の、今回取り出した１つの個別追加記録画像に対応する領域が、空白か否かを判定する（ステップＳ２６）。ここで説明している第１例の場合、空白か否かの判定方法として、２値化処理を行ない、白側に傾いたことをもって空白としている。空白か否かの判定方法の他の例については、後述する。
今回取り出した１つの個別追加記録画像に対応する未記入原稿上の領域が空白ではなかったときは（ステップＳ２６）、次に、その領域に追加記録されている画像が閾値以上の寸法の画像か否かが判定される（ステップＳ２７）。そして、その領域に閾値以上の寸法の画像が記録されていたときは、未記入原稿上に文字認識領域を設定する（ステップＳ２８）。一方、その個別追加記録画像に対応する未記入原稿上の領域が空白だったときは（ステップＳ２６）、本実施形態では、差分画像上に文字認識領域を設定する（ステップＳ２９）。また、その個別追加記録画像に対応する未記入原稿上の画像が空白ではないものの閾値に満たない寸法の画像、すなわちノイズ画像であったときも（ステップＳ２７）、差分画像上に文字認識領域を設定する（ステップＳ２９）。なお、その個別追加記録画像に対応する未記入原稿上の領域が空白あるいは閾値に満たない寸法の画像だったときは、差分画像上ではなく、今回処理を行なっている記入済原稿上に文字認識領域を設定してもよい。 Here, first, it is determined whether or not the area corresponding to the one individually added recorded image taken out this time on the unfilled manuscript is blank (step S26). In the case of the first example described here, as a method of determining whether or not it is blank, binarization processing is performed, and the white side is defined as blank. Another example of the method of determining whether or not it is blank will be described later.
When the area on the unfilled manuscript corresponding to the one individually added recorded image taken out this time is not blank (step S26), then, is the image additionally recorded in that area an image having dimensions equal to or larger than the threshold value? Whether or not it is determined (step S27). Then, when an image having a dimension equal to or larger than the threshold value is recorded in that area, a character recognition area is set on the unfilled document (step S28). On the other hand, when the area on the unfilled document corresponding to the individually added recorded image is blank (step S26), in the present embodiment, the character recognition area is set on the difference image (step S29). Further, even when the image on the unfilled manuscript corresponding to the individually added recorded image is an image having dimensions that are not blank but does not meet the threshold value, that is, a noise image (step S27), the character recognition area is formed on the difference image. Set (step S29). If the area on the unfilled manuscript corresponding to the individually added recorded image is blank or an image with dimensions less than the threshold value, character recognition is performed not on the difference image but on the filled-in manuscript being processed this time. The area may be set.

図８は、文字認識領域設定方法の説明図である。ここでは、未記入原稿と差分画像が互いに重ねられているものとし、未記入原稿上の文字認識領域と差分画像上の文字認識領域とを区別せずに説明する。 FIG. 8 is an explanatory diagram of a character recognition area setting method. Here, it is assumed that the unfilled manuscript and the difference image are overlapped with each other, and the character recognition area on the unfilled manuscript and the character recognition area on the difference image will be described without distinction.

図８（Ａ）は、１つの個別追加記録画像（一例として○印）を示している。ここでは、図８（Ｂ）に示すような、この個別追加記録画像が内接する長方形を考えて、その長方形の中心部に予め定められた最小面積の文字認識領域Ｄを設定する。ただし、長方形を考えることなく、この個別追加記録画像の重心点に文字認識領域を設定するなど、個別追加記録画像のほぼ中央を見つける他の手法を採用してもよい。 FIG. 8A shows one individual additional recorded image (marked with a circle as an example). Here, considering a rectangle inscribed by the individually added recorded image as shown in FIG. 8B, a character recognition area D having a predetermined minimum area is set at the center of the rectangle. However, instead of considering the rectangle, another method of finding the substantially center of the individually added recorded image, such as setting the character recognition area at the center of gravity of the individually added recorded image, may be adopted.

このようにして文字認識領域を設定して（ステップＳ２８，Ｓ２９）、その文字認識領域内について文字認識処理を実行する（ステップＳ３０）。文字認識処理自体は既存の技術であり、ここでの説明は省略する。この文字認識処理では、認識した文字についての「確からしさ」についても算出される。 In this way, the character recognition area is set (steps S28 and S29), and the character recognition process is executed in the character recognition area (step S30). The character recognition process itself is an existing technique, and the description thereof is omitted here. In this character recognition process, the "certainty" of the recognized character is also calculated.

そして、この文字認識処理（ステップＳ３０）は、停止条件を満足するまで（ステップＳ３１）、文字認識領域を再設定しながら（ステップＳ３２）、繰り返される。 Then, this character recognition process (step S30) is repeated while resetting the character recognition area (step S32) until the stop condition is satisfied (step S31).

図９は、文字認識領域の再設定方法を示した図である。 FIG. 9 is a diagram showing a method of resetting the character recognition area.

最初は、図８（Ｃ）に示すように、面積最小の文字認識領域Ｄが設定され（図５、ステップＳ３０）、その文字認識領域の面積を、図９に示すＤ１→Ｄ２→Ｄ３→・・・のように徐々に拡大しながら（ステップＳ３２）、停止条件を満足するまで（ステップＳ３１）、文字認識処理を実行する（ステップＳ３０）。 Initially, as shown in FIG. 8C, the character recognition area D having the smallest area is set (FIG. 5, step S30), and the area of the character recognition area is set to D1 → D2 → D3 → ... The character recognition process is executed (step S30) until the stop condition is satisfied (step S31) while gradually expanding as in (step S32).

ここで、ステップＳ３１の停止条件としては、
（ａ）文字認識の確からしさが予め定められた閾値を越えたこと
（ｂ）文字認識の確からしさが極大値に達したこと
（ｃ）文字認識領域の面積を徐々に拡大しながらの文字認識処理を、予め定められた回数繰り返したこと
などが採用される。ここで、停止条件として上記の（ａ）または（ｂ）を採用したときも、文字認識処理が無限に続かないように、上記の（ｃ）を併用することが望ましい。 Here, as a stop condition in step S31,
(A) The certainty of character recognition has exceeded a predetermined threshold value (b) The certainty of character recognition has reached the maximum value (c) Character recognition while gradually expanding the area of the character recognition area It is adopted that the process is repeated a predetermined number of times. Here, even when the above (a) or (b) is adopted as the stop condition, it is desirable to use the above (c) together so that the character recognition process does not continue indefinitely.

なお、ここでは、文字認識領域の面積を徐々に拡大しながら文字認識処理を繰り返す旨、説明したが、初期の文字認識領域として、例えば、図８（Ｂ）に示す長方形の枠に近似した大面積の文字認識領域を設定し、その文字認識領域の面積を徐々に縮小しながら、文字認識処理を繰り返してもよい。 Here, it has been explained that the character recognition process is repeated while gradually expanding the area of the character recognition area, but as the initial character recognition area, for example, a large size similar to the rectangular frame shown in FIG. 8 (B). The character recognition process may be repeated while setting the character recognition area of the area and gradually reducing the area of the character recognition area.

文字認識の停止条件を満足すると（ステップＳ３１）、今回認識されたひと文字が、その「確からしさ」とともに保存される（ステップＳ３３）。 When the character recognition stop condition is satisfied (step S31), the character recognized this time is saved together with the "certainty" (step S33).

以上の処理が１枚の差分画像上の個別追加記録画像の１つ１つについて実行され（ステップＳ２３，Ｓ２４）、その１枚の差分画像上の全ての個別追加記録画像についての処理が終了すると（ステップＳ２５）、その１枚の差分画像についての、図５に示す処理、すなわち、図３にステップＳ０８として示す文字認識処理が終了し、未処理の次の記入済原稿に関する文字認識処理に移行する（ステップＳ０６）。そして、全ての記入済原稿に関する文字認識処理が終了すると（ステップＳ０７）、次に再認識処理（ステップＳ１０）に移る。
ここでは、再認識処理（ステップＳ１０）の説明に移る前に、ステップＳ０８における文字認識処理の第２例について説明する。
図１０は、図３に１つのステップ（ステップＳ０８）で示した第２例としての文字認識処理の第２例についての詳細フローを示した図である。この図１０に示した第２例としての文字認識処理は、図５に示した第１例としての文字認識処理に代えて採用することのできる文字認識処理である。
この図１０に示した文字認識処理のステップＳ２１〜Ｓ２５は、図５に示した文字認識処理の同じステップＳ２１〜Ｓ２５とそれぞれ同一の処理であり、ここでの重複説明は省略する。
ステップＳ２５において個別追加記録画像有り、と判定されると、図５に示した第１例の場合、２値化処理により、空白か否かが判定され（図５、ステップＳ２６）、空白でなかったときは、そこに記録されている画像が閾値以上の寸法の画像か否かが判定される（ステップＳ２７）。これに対し、この図１０に示した第２例の場合、それらステップＳ２７，Ｓ２８は存在せず、いきなり、未記入原稿上に文字認識領域を設定し（ステップＳ２１１）、その文字認識領域について文字認識処理を実行する（ステップＳ２１２）。そして、その文字認識処理（ステップＳ２１２）を、停止条件を満足するまで（ステップＳ２１３）、文字認識領域を図９に示すＤ１→Ｄ２→Ｄ３→・・・のように再設定しながら（ステップＳ２１４）、繰り返す。これらステップＳ２１２〜Ｓ２１４は、未記入原稿のみを処理対象としている点を除き、図５のステップＳ３０〜Ｓ３２とそれぞれ同一の処理である。
そして、停止条件を満足すると（ステップＳ２１３）、その未記入原稿上に設定（再設定）された文字認識領域から文字が認識されたか否かが判定される（ステップＳ２１５）。文字が認識されたか否かのここでの判定は、単なるノイズと区別するための判定であり、閾値としての確からしさは、かなり低いレベル（例えば、確からしさ２０％）に設定されている。そして、文字が認識されたと判定されると（ステップＳ２１５）、その認識された情報が保存される（ステップＳ２１６）。
一方、ステップＳ２１５において、文字が認識されない、あるいは確からしさが閾値以下と判定されると、今度は差分画像上に文字認識領域が設定されて（ステップＳ２１７）、文字認識処理が実行される（ステップＳ２１８）。ステップＳ２１７〜Ｓ２２１の処理は、差分原稿のみを処理対象としている点を除き、図５のステップＳ３０〜Ｓ３３とそれぞれ同一の処理である。
図５に示した第１例の場合、未記入原稿上の対象の領域が空白か否かを直接に判定しているが（図５、スッテップＳ２６）、それに代えて、この図１０示した処理のように、未記入原稿上に文字が存在することを仮定して、未記入原稿上から文字を認識しようとし、文字が認識できない場合を空白とし、あるいは低い確からしさでしか認識できない場合をノイズとして、差分画像上からの文字認識処理に移ってもよい。
次に、再認識処理（図３、ステップＳ１０）について説明する。
When the above processing is executed for each of the individually added recorded images on one difference image (steps S23 and S24), and the processing for all the individually added recorded images on the one difference image is completed. (Step S25), the process shown in FIG. 5 for the one difference image, that is, the character recognition process shown as step S08 in FIG. 3 is completed, and the process proceeds to the character recognition process for the next unprocessed completed manuscript. (Step S06). Then, when the character recognition process for all the completed manuscripts is completed (step S07), the process proceeds to the re-recognition process (step S10).
Here, a second example of the character recognition process in step S08 will be described before moving on to the description of the re-recognition process (step S10).
FIG. 10 is a diagram showing a detailed flow of the second example of the character recognition process as the second example shown in one step (step S08) in FIG. The character recognition process as the second example shown in FIG. 10 is a character recognition process that can be adopted in place of the character recognition process as the first example shown in FIG.
The characters recognition process steps S21 to S25 shown in FIG. 10 are the same processes as the same steps S21 to S25 of the character recognition process shown in FIG. 5, and duplicate description thereof will be omitted here.
When it is determined in step S25 that there is an individual additional recorded image, in the case of the first example shown in FIG. 5, it is determined whether or not it is blank by the binarization process (FIG. 5, step S26), and it is not blank. If so, it is determined whether or not the image recorded there is an image having dimensions equal to or greater than the threshold value (step S27). On the other hand, in the case of the second example shown in FIG. 10, those steps S27 and S28 do not exist, and suddenly a character recognition area is set on the unfilled manuscript (step S211), and the character recognition area is set as a character. The recognition process is executed (step S212). Then, the character recognition process (step S212) is reset as shown in FIG. 9 as D1 → D2 → D3 → ... Until the stop condition is satisfied (step S213) (step S214). ),repeat. These steps S212 to S214 are the same processes as steps S30 to S32 of FIG. 5, except that only unfilled documents are processed.
Then, when the stop condition is satisfied (step S213), it is determined whether or not the character is recognized from the character recognition area set (reset) on the unfilled document (step S215). The determination here as to whether or not the character is recognized is a determination for distinguishing from mere noise, and the certainty as a threshold value is set to a considerably low level (for example, 20% certainty). Then, when it is determined that the character is recognized (step S215), the recognized information is saved (step S216).
On the other hand, in step S215, when it is determined that the character is not recognized or the certainty is equal to or less than the threshold value, the character recognition area is set on the difference image (step S217), and the character recognition process is executed (step S217). S218). The processing of steps S217 to S221 is the same as that of steps S30 to S33 of FIG. 5, except that only the difference document is processed.
In the case of the first example shown in FIG. 5, it is directly determined whether or not the target area on the blank manuscript is blank (FIG. 5, step S26), but instead, the process shown in FIG. 10 is performed. Assuming that characters exist on the unfilled manuscript, it tries to recognize the characters on the unfilled manuscript, and if the characters cannot be recognized, it is left blank, or if it can be recognized only with low certainty, it is noise. As a result, the character recognition process from the difference image may be started.
Next, the re-recognition process (FIG. 3, step S10) will be described.

図１１は、回答としての○印が認識対象の文字からずれた位置に記入された場合の、文字認識領域を示した模式図である。 FIG. 11 is a schematic diagram showing a character recognition area when a circle as an answer is written at a position deviated from the character to be recognized.

回答としての○印が認識対象の文字からずれた位置に記入されると、この図１１に示すように文字認識領域の中心点が認識対象の文字（ここでは数字の「３」）からずれた位置に設定され、その数字「３」の下に記録されている罫線を文字として誤認識し、確からしさが閾値未満のまま、その文字についての文字認識処理が終了することが有り得る。 When the ○ mark as an answer is entered at a position deviated from the character to be recognized, the center point of the character recognition area is deviated from the character to be recognized (here, the number “3”) as shown in FIG. It is possible that the ruled line set at the position and recorded under the number "3" is erroneously recognized as a character, and the character recognition process for that character is completed while the certainty remains less than the threshold value.

図３のステップＳ１０の再認識処理は、このような場面での認識率の低さを救うための処理である。 The re-recognition process of step S10 in FIG. 3 is a process for saving the low recognition rate in such a situation.

この再認識処理は、ステップＳ０９の再認識処理条件に適合する文字ひと文字ひと文字について実行される。このステップＳ０９の再認識処理条件は、以下の（ｄ）〜（ｆ）の全てを満足することである。 This re-recognition process is executed for each character that meets the re-recognition process condition of step S09. The re-recognition processing condition in step S09 is that all of the following (d) to (f) are satisfied.

（ｄ）この再認識処理は、未記入原稿上の文字を認識する場合を対象としている。差分画像あるいは記入済原稿上の文字については、以下の再認識処理の対象としても確からしさの向上には大きく資することは期待できないため、ここでは、差分画像あるいは記入済原稿上の文字については、再認識処理の対象とはしない。 (D) This re-recognition process is intended for recognizing characters on an unfilled manuscript. Since the characters on the difference image or the filled-in manuscript cannot be expected to greatly contribute to the improvement of the certainty even if they are the targets of the following re-recognition processing, here, the characters on the difference image or the filled-in manuscript are referred to. It is not subject to re-recognition processing.

（ｅ）また、この再認識処理は、文字認識処理（ステップＳ０８）において、確からしさが予め定められた閾値（例えば、確からしさ８０％）未満の確からしさしか得られなかった文字を対象としている。 (E) Further, this re-recognition process targets characters whose certainty is less than a predetermined threshold value (for example, 80% certainty) in the character recognition process (step S08). ..

（ｆ）さらに、この再認識処理は、未記入原稿上の文字認識における確からしさ８０％未満の文字のうちの、再認識処理を未だ実行していない文字を対象としている。 (F) Further, this re-recognition process targets characters for which the re-recognition process has not yet been executed, among the characters with less than 80% certainty in character recognition on the unfilled manuscript.

ここでは、以上の（ｄ）〜（ｆ）の再認識処理条件を満たす文字が存在する場合に（ステップＳ０９）、その再認識処理条件を満たす文字１つ１つについて、再認識処理（ステップＳ１０）が実行される。 Here, when there are characters satisfying the above re-recognition processing conditions (d) to (f) (step S09), the re-recognition process (step S10) is performed for each character satisfying the re-recognition processing conditions. ) Is executed.

図１２は、図３に１つのステップとして示した再認識処理の第１例についての詳細フロ―を示した図である。
ここでは、文字認識処理の中心点（図１１に示す面積最小の文字認識領域Ｄ）を予め定められた領域内（例えば上下左右４ピクセルずつの領域内）で移動させながら、図９を参照して説明した文字認識処理が繰り返される。具体的には、図１２に示したフローの通りである。
ここでは先ず、予め定められた領域内（例えば上下左右４ピクセルずつの領域内）における、ある１つのずれた位置に面積最小の文字認識領域を設定し（ステップＳ４１）、その文字認識領域について文字認識処理を実行する（ステップＳ４２）。そして、この文字認識処理を、停止条件を満足するまで（ステップＳ４３）、文字認識領域を再設定しながら（すなわち、図９に示すＤ１→Ｄ２→Ｄ３→・・・のように文字認識領域を徐々に広げながら）（ステップＳ４４）、繰り返す。
停止条件を満足すると（ステップＳ４３）、予め定められた領域内（例えば上下左右４ピクセルずつの領域内）の全てについて中心点（面積最小の文字認識領域Ｄ）をずらして文字認識処理を行なったか否かが判定され（ステップＳ４５）、中心点をずらすべき位置がその領域内に未だ残っているときは、ステップＳ４１に戻って、その残っている、中心点をずらすべき位置のうちの１つに中心点をずらして、ステップＳ４２〜Ｓ４４の処理を実行する。一方、ステップＳ４５において、中心点をずらすべき領域内の全ての位置に中心点をずらし終えたことが判定されると、今度は、今回の一連の文字認識処理の結果、確からしさがアップしたか否かが判定され（ステップＳ４６）、確からしさがアップしたときは、それまで保存しておいた同じ対象の認識結果が、今回の再認識結果に置き換えられる（ステップＳ４７）。
以上の再認識処理が、上記の（ｄ）〜（ｆ）の条件を満たす各文字について実行されて（図３、ステップＳ０９，Ｓ１０）、この文字認識処理ルーチンの実行を終了する。
次に、再認識処理の第２例について説明する。
図１３は、図３に１つのステップとして示した再認識処理の第２例についての詳細フローを示した図である。この第２例の再認識処理は、図１２を参照して説明した第１例としての再認識処理に代えて採用することのできる処理である。
FIG. 12 is a diagram showing a detailed flow of the first example of the re-recognition process shown as one step in FIG.
Here, refer to FIG. 9 while moving the center point of the character recognition process (the character recognition area D having the smallest area shown in FIG. 11) within a predetermined area (for example, within an area of 4 pixels each in the vertical and horizontal directions). The character recognition process described above is repeated. Specifically, the flow is as shown in FIG.
Here, first, a character recognition area having the smallest area is set at a certain offset position within a predetermined area (for example, within an area of 4 pixels each on the top, bottom, left, and right) (step S41), and the character recognition area is set as a character. The recognition process is executed (step S42). Then, in this character recognition process, the character recognition area is reset as shown in FIG. 9 (that is, D1 → D2 → D3 → ...) Until the stop condition is satisfied (step S43). While gradually expanding) (step S44), repeat.
When the stop condition is satisfied (step S43), is the character recognition process performed by shifting the center point (character recognition area D having the smallest area) in all of the predetermined areas (for example, in the areas of 4 pixels each in the vertical and horizontal directions)? If it is determined whether or not (step S45) and the position where the center point should be shifted still remains in the area, the process returns to step S41 and one of the remaining positions where the center point should be shifted remains. The process of steps S42 to S44 is executed by shifting the center point to. On the other hand, in step S45, when it is determined that the center point has been shifted to all the positions in the region where the center point should be shifted, this time, as a result of this series of character recognition processing, has the certainty improved? Whether or not it is determined (step S46), and when the certainty is improved, the recognition result of the same target saved up to that point is replaced with the current re-recognition result (step S47).
The above re-recognition process is executed for each character satisfying the above conditions (d) to (f) (FIG. 3, steps S09 and S10), and the execution of this character recognition process routine ends.
Next, a second example of the re-recognition process will be described.
FIG. 13 is a diagram showing a detailed flow of the second example of the re-recognition process shown as one step in FIG. The re-recognition process of the second example is a process that can be adopted in place of the re-recognition process as the first example described with reference to FIG.

この第２例の再認識処理では、先ず、認識対象の文字を認識するための文字認識領域をずらす位置およびその文字認識領域の面積を決定する（ステップＳ４１１）。このために、この文字の認識の際に用いられた差分画像（ここでは、この差分画像を、「対象の差分画像」と称する）とは別の差分画像（ここでは、この差分画像を、「対象以外の差分画像」と称する）を参照し、対象の差分画像と対象以外の差分画像とを重ねたときに、この文字の認識の基になった個別追加記録画像の中心点（図８（Ｃ）参照）と比べ、中心点同士が予め定められた距離以内にあって、対象以外の差分画像に関して確からしさ８０％以上の文字認識結果が得られたときの文字認識領域設定の基になった個別追加記録画像を探し出す。 In the re-recognition process of the second example, first, the position where the character recognition area for recognizing the character to be recognized is shifted and the area of the character recognition area are determined (step S411). For this reason, a difference image different from the difference image used in recognizing this character (here, this difference image is referred to as a "target difference image") (here, this difference image is referred to as "this difference image". When the target difference image and the non-target difference image are superimposed with reference to the non-target difference image), the center point of the individually added recorded image on which the recognition of this character is based (FIG. 8 (Fig. 8). Compared to C)), it is the basis for setting the character recognition area when the center points are within a predetermined distance and a character recognition result with a certainty of 80% or more is obtained for a difference image other than the target. Find individual additional recorded images.

図１４は、対象の差分画像上の１つの個別追加記録画像と、それに対応する、対象以外の差分画像上の１つの個別追加記録画像を、未記入原稿上に重ねて示した図である。 FIG. 14 is a diagram showing one individually added recorded image on the target difference image and the corresponding one individually added recorded image on the non-target difference image superimposed on the unfilled manuscript.

対象の差分画像上の個別追加記録画像５５１は、認識対象の文字（ここでは数字の「３」）から少しずれた位置にある。そして、この個別追加記録画像５５１を基にして設定した文字認識領域からは、確からしさ８０％未満の認識結果しか得られなかったものとする。 The individually added recorded image 551 on the target difference image is located at a position slightly deviated from the character to be recognized (here, the number “3”). Then, it is assumed that only the recognition result with a certainty of less than 80% can be obtained from the character recognition area set based on the individually added recorded image 551.

一方、対象以外の差分画像上の個別追加記録画像５５２は、認識対象の文字（ここでは数字の「３」）をきれいに取り巻くように描かれている。そして、この個別追加記録画像５５２を基にして設定した文字認識領域からは確からしさが８０％を越える認識結果が得られたものとする。 On the other hand, the individually added recorded image 552 on the difference image other than the target is drawn so as to neatly surround the character to be recognized (here, the number "3"). Then, it is assumed that a recognition result having a certainty of more than 80% is obtained from the character recognition area set based on the individually added recorded image 552.

この場合、確からしさ８０％未満の認識結果しか得られなかった文字について再認識処理を実行するにあたっては、その再認識処理を実行するための文字認識領域を、確からしさが８０％を越える認識結果が得られた、対象以外の差分画像上の個別追加記録画像５５２を基にして設定した文字認識領域と同じ位置にずらす。また、文字認識領域の面積に関しては、対象以外の差分画像上の個別追加記録画像５５２を基にして設定した文字認識領域であって、確からしさ８０％を越えたときの面積が設定される。例えば、図９に示す面積Ｄｘの文字認識領域が、対象以外の差分画像に関して確からしさ８０％を越えた確からしさが得られたときの文字認識領域であったときは、確からしさ８０％未満の認識結果しか得られなかった文字についての再認識処理にあたっては、その面積Ｄｘの文字認識領域が採用される。 In this case, when executing the re-recognition process for a character for which a recognition result of less than 80% certainty is obtained, a recognition result having a certainty of more than 80% is set in the character recognition area for executing the re-recognition process. Is shifted to the same position as the character recognition area set based on the individually added recorded image 552 on the difference image other than the target obtained. Further, regarding the area of the character recognition area, it is a character recognition area set based on the individually added recorded image 552 on the difference image other than the target, and the area when the certainty exceeds 80% is set. For example, when the character recognition area of the area Dx shown in FIG. 9 is the character recognition area when the certainty of more than 80% is obtained for the difference image other than the target, the certainty is less than 80%. In the re-recognition process for a character for which only the recognition result has been obtained, the character recognition area having the area Dx is adopted.

図１３のステップＳ４１１では、以上のようにして、文字認識領域のずらす位置および文字認識領域の面積を決定し、その決定した文字認識領域について文字認識処理を実行する（ステップＳ４１２）。 In step S411 of FIG. 13, the shift position of the character recognition area and the area of the character recognition area are determined as described above, and the character recognition process is executed for the determined character recognition area (step S412).

そして、その文字認識処理（ステップＳ４１２）の結果、確からしさがアップしたか否かが判定され（ステップＳ４１３）、確からしさがアップしたときは、それまで保存しておいた同じ対象の認識結果が、今回の再認識結果に置き換えられる（ステップＳ４１４）。 Then, as a result of the character recognition process (step S412), it is determined whether or not the certainty is improved (step S413), and when the certainty is improved, the recognition result of the same target saved up to that point is displayed. , It is replaced with the result of this re-recognition (step S414).

以上の再認識処理が上記の（ｄ）〜（ｆ）の条件を満たす各文字について実行されて（図３、ステップＳ０９，Ｓ１０）、この文字認識処理ルーチンの実行を終了する。 The above re-recognition process is executed for each character satisfying the above conditions (d) to (f) (FIG. 3, steps S09 and S10), and the execution of this character recognition process routine ends.

以上に説明したように、本実施形態によれば、マークシートのマークの各位置ごとに、その位置のマークが何を意味しているか、という情報を予めインプットしておくといったような事前設定なしに、回答者の回答を認識することができる。 As described above, according to the present embodiment, there is no need to pre-set information such as what the mark at that position means for each position of the mark on the mark sheet. , Can recognize the respondent's answer.

ここで、本実施形態の場合、スキャナ２０で複数枚の原稿を連続的に読み取り、それら複数枚の原稿のうちの１枚目の原稿を未記入原稿とし、２枚目以降の原稿を記入済原稿とするというルールが定められている。この場合、未記入原稿の画像データを容易かつ確実に取得することができる。しかしながら、本発明においては、未記入原稿を１枚目などの特定の位置に配置するというルールは必ずしも必要ではない。未記入原稿を、例えば複数枚積み重ねた記入済原稿の途中位置に挟みこんでおいてもよい。その場合、画像取得部の中に未記入原稿を複数枚の原稿から見つけ出す処理を実施すればよい。未記入原稿を見つけ出す処理の一例としては、１枚目の原稿と２枚目以降の原稿との差分を抽出する処理を順次行い、１枚目の原稿にのみ差分が出た原稿を未記入原稿とすればよい。また、未記入原稿であるか記入済原稿であるかを問わずに読取により得られたテータ上の複数枚の原稿の共通部分を抽出した画像を作成し、その作成した画像と読み込んだ各原稿とのパターンマッチングを行い、一致度が最も高かった原稿を未記入原稿としてもよい。 Here, in the case of the present embodiment, a plurality of originals are continuously read by the scanner 20, the first original among the plurality of originals is regarded as an unfilled original, and the second and subsequent originals have been filled. There is a rule that it should be a manuscript. In this case, the image data of the unfilled manuscript can be easily and surely acquired. However, in the present invention, the rule of arranging the unfilled manuscript at a specific position such as the first sheet is not always necessary. The unfilled manuscript may be sandwiched in the middle of the filled-in manuscripts in which a plurality of sheets are stacked, for example. In that case, a process of finding an unfilled manuscript from a plurality of manuscripts may be performed in the image acquisition unit. As an example of the process of finding the unfilled manuscript, the process of extracting the difference between the first manuscript and the second and subsequent manuscripts is sequentially performed, and the manuscript in which the difference appears only in the first manuscript is the unfilled manuscript. And it is sufficient. In addition, an image obtained by extracting the common parts of a plurality of manuscripts on the data obtained by scanning regardless of whether the manuscript is an unfilled manuscript or a filled manuscript is created, and the created image and each read manuscript are created. The manuscript with the highest degree of matching may be regarded as an unfilled manuscript by performing pattern matching with.

あるいは、記入済原稿の読み込みが複数回に分かれていても、同種の原稿についての未記入原稿の読み込みは１回のみとし、一旦読み込んだ未記入原稿を記憶しておいて、今回読み込んだ記入済原稿とのパターンマッチングや特徴点抽出、あるいは直線で囲まれた領域の一致度を使ったフォーム認識により、今回読み込んだ記入済原稿に対応する未記入原稿を特定してもよい。 Alternatively, even if the completed manuscript is read multiple times, the unfilled manuscript for the same type of manuscript is read only once, the unfilled manuscript once read is stored, and the filled-in manuscript read this time is completed. The unfilled manuscript corresponding to the filled-in manuscript read this time may be specified by pattern matching with the manuscript, feature point extraction, or form recognition using the degree of coincidence of the area surrounded by the straight line.

さらには、本発明では、未記入原稿を読み込むことすら必ずしも必要ではない。例えば、複数枚の記入済原稿から、それら複数枚の記入済原稿の共通部分を抽出することにより、データ上で未記入原稿を作成してもよい。この場合、共通部分を抽出することにより作成されたデータ上での未記入原稿が第１の画像を表す第１の画像データに対応する。 Furthermore, in the present invention, it is not always necessary to even read an unfilled manuscript. For example, an unfilled manuscript may be created on the data by extracting the common portion of the plurality of filled-in manuscripts from the plurality of filled-in manuscripts. In this case, the unfilled manuscript on the data created by extracting the common portion corresponds to the first image data representing the first image.

また、ここでは、図１に示すように、通信ケーブル４０でスキャナ２０と接続されたノートＰＣ３０からなる文字認識装置について説明したが、本発明における文字認識装置は必ずしもこの形態である必要はない。例えば、スキャナとプリンタとが合体した形態のコピー機ないしはさらに機能が増えた複合機に、本発明の文字認識装置の機能を組み込んでもよい。さらには、カメラ機能を備えた携帯型端末に本発明の文字認識装置の機能を組み込んでもよい。その場合、カメラ機能で原稿を撮影することにより得られた画像が文字認識の対象となる。 Further, as shown in FIG. 1, the character recognition device including the notebook PC 30 connected to the scanner 20 by the communication cable 40 has been described here, but the character recognition device in the present invention does not necessarily have to be in this form. For example, the function of the character recognition device of the present invention may be incorporated into a copier in which a scanner and a printer are combined, or a multifunction device having more functions. Further, the function of the character recognition device of the present invention may be incorporated into a portable terminal having a camera function. In that case, the image obtained by shooting the original with the camera function is the target of character recognition.

１０文字認識システム
１１画像取得部
１２追加記録画像抽出部
１３追加記録画像分離部
１４文字認識処理実行部
２０スキャナ
２１原稿トレイ
２２排紙トレイ
２３上蓋
３０ノート型パーソナルコンピュータ（ノートＰＣ）
３１表示画面
３２キーボード
５１未記入原稿
５２記入済原稿
５３差分画像
５１１回答欄
５２１，５２２ ○印
５２３文字列
５５１，５５２個別追加記録画像 10 Character recognition system 11 Image acquisition unit 12 Additional recording image extraction unit 13 Additional recording image separation unit 14 Character recognition processing execution unit 20 Scanner 21 Document tray 22 Paper ejection tray 23 Top lid 30 Notebook type personal computer (notebook PC)
31 Display screen 32 Keyboard 51 Unfilled manuscript 52 Filled manuscript 53 Difference image 511 Answer column 521,522 ○ mark 523 Character string 551,552 Individual additional recorded image

Claims

An image acquisition unit that acquires a first image data representing a first image and a second image data representing a second image in which additional recording is performed on the first image data.
An additional recorded image extraction unit that extracts an additional recorded image, which is an image additionally recorded with respect to the first image, from the second image.
A character recognition area for executing character recognition processing is set in the area of the first image corresponding to the additional recorded image, and the character recognition processing is repeatedly executed for the character recognition area while changing the area of the character recognition area. Equipped with a character recognition processing execution unit
The character recognition processing execution unit executes character recognition processing including calculation of recognition certainty, and recognizes characters with certainty exceeding a predetermined threshold in the area corresponding to the additional recorded image. If not, the position of the character recognition area set in the area corresponding to the additional recorded image is shifted to a position determined by referring to the position of the additional recorded image of the other second image, and the character is displayed. A character recognition device characterized by executing recognition processing.

The character recognition processing execution unit shifts the position of the character recognition area set in the area corresponding to the additional recording image, and has a predetermined threshold value corresponding to the additional recording image of the other second image. The first aspect of claim 1 , wherein a character recognition area having an area corresponding to the area of the character recognition area when the certainty exceeding the above is obtained is set, and the character recognition process is executed for the character recognition area. Character recognition device.

Even if the additional recorded image extraction unit constitutes the additional recorded image and is separated from each other, the plurality of figures are the same for the plurality of figures that are close to each other within a predetermined threshold distance. The character recognition device according to claim 1 or 2, wherein the character recognition device is considered to belong to the additional recorded image of the above.

The character recognition processing execution unit sets the character recognition area on the first image, and makes the character on the first image a recognition target, according to any one of claims 1 to 3. The character recognition device according to any one of the items.

The character recognition processing execution unit sets the character recognition area on the second image or on the image of the difference between the first image and the second image, and on the second image or the difference. The character recognition device according to any one of claims 1 to 4, wherein the characters on the image of the above are to be recognized.

When the region corresponding to the additional recorded image on the first image is a blank region, the character recognition processing execution unit is on the second image or on the first image and the second image. The character recognition area is set on the image of the difference from the image of the above, the character on the second image or the image of the difference is targeted for recognition, and corresponds to the additional recorded image on the first image. Claim 1 is characterized in that when the area is not a blank area, the character recognition area is set on the first image and the characters on the first image are to be recognized. The character recognition device according to any one of 5 to 5.

The information processing device is executed in the information processing device that executes the program.
An image acquisition unit that acquires a first image data representing a first image and a second image data representing a second image in which additional recording is performed on the first image data.
An additional recorded image extraction unit that extracts an additional recorded image, which is an image additionally recorded with respect to the first image, from the second image.
A character recognition processing execution unit that sets a character recognition area for executing character recognition processing in an area corresponding to the additional recorded image, and executes character recognition processing for the character recognition area while changing the area of the character recognition area. With
The character recognition processing execution unit executes character recognition processing including calculation of recognition certainty, and recognizes characters with certainty exceeding a predetermined threshold in an area corresponding to the additional recorded image. If not, the position of the character recognition area set in the area corresponding to the additional recorded image is shifted to a position determined by referring to the position of the additional recorded image of the other second image, and the character is displayed. A character recognition program characterized by operating as a character recognition device that executes recognition processing.