JP7452120B2

JP7452120B2 - Image processing method, image processing program, and image processing device

Info

Publication number: JP7452120B2
Application number: JP2020043072A
Authority: JP
Inventors: 大助村山; 雅基石川; 陽子朝倉; 響時尾; 康成石塚; 恭平大塚; 央樹一宮; 尭央田村; 海浅葉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2020-03-12
Filing date: 2020-03-12
Publication date: 2024-03-19
Anticipated expiration: 2040-03-12
Also published as: JP2021144501A

Description

本発明は、画像処理方法等に関する。 The present invention relates to an image processing method and the like.

契約書には、契約を締結した人物に関する情報として、住所や氏名等が記載され、各種の処理に利用されている。以下の説明では、適宜、契約を締結した人物に関する情報を「締結者情報」と表記する。 A contract contains information about the person who has concluded the contract, such as the address and name, and is used for various processing. In the following explanation, information regarding the person who concluded the contract will be referred to as "contractor information" as appropriate.

ここで、紙媒体の契約書に記載された締結者情報を検出する場合には、スキャナを用いて、契約書を画像データとして取り込み、画像データに対して文字検出の画像処理を実行する。なお、契約書の画像データ全体に対して、文字検出の画像処理を実行すると、締結者情報を含まない領域に対しても、画像処理が行われ、検出効率が低下する。 Here, when detecting the conclusion party information written in a paper contract, a scanner is used to capture the contract as image data, and image processing for character detection is performed on the image data. Note that if image processing for character detection is performed on the entire image data of the contract, the image processing will also be performed on areas that do not include signatory information, reducing detection efficiency.

なお、契約書の書式が予め決まっている場合には、契約書の画像データの所定の領域に対して、文字検出の画像処理を実行することができ、締結者情報の検出効率を向上させることができる。 In addition, if the format of the contract is determined in advance, image processing for character detection can be performed on a predetermined area of the image data of the contract, improving the efficiency of detecting information on the signatory. Can be done.

特開２００１－１８４４８１号公報Japanese Patent Application Publication No. 2001-184481

しかしながら、上述した従来技術では、締結者情報が記載された領域を効率よく抽出することができないという問題がある。 However, the above-described conventional technology has a problem in that it is not possible to efficiently extract the region in which the conclusion party information is written.

たとえば、契約書の書式が未知である場合には、締結者情報が記載された領域を事前に特定することができず、契約書の画像データ全体に対して、文字検出の画像処理を実行することになり、検出効率が低下してしまう。 For example, if the format of the contract is unknown, it is not possible to identify in advance the area where the signatory information is written, and image processing for character detection is performed on the entire image data of the contract. As a result, detection efficiency decreases.

１つの側面では、本発明は、締結者情報が記載された領域を効率よく抽出することができる画像処理方法、画像処理プログラムおよび画像処理装置を提供することを目的とする。 In one aspect, an object of the present invention is to provide an image processing method, an image processing program, and an image processing device that can efficiently extract a region in which conclusion person information is written.

第１の案では、コンピュータに次の処理を実行させる。コンピュータは、押印済の契約書の書面が電子的に読み込まれることにより生成された画像データを処理対象として、画像データから、印面に相当する領域を特定する。コンピュータは、予め記憶装置に記憶された、契約書の印面位置に対応する契約書内の第１の領域と、契約を締結した人物に関する情報が記載される契約書内の第２の領域との位置関係を参照し、処理対象である画像データから特定した印面の領域と位置関係とに基づいて特定される領域を、処理対象である画像データから抽出する。コンピュータは、抽出した領域を出力する。 In the first plan, the computer executes the following process. The computer processes image data generated by electronically reading a sealed contract document, and identifies an area corresponding to the stamp surface from the image data. The computer stores a first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information about the person who concluded the contract is written, which are stored in advance in the storage device. With reference to the positional relationship, a region specified based on the positional relationship with the region of the stamp surface specified from the image data to be processed is extracted from the image data to be processed. The computer outputs the extracted area.

締結者情報が記載された領域を効率よく抽出することができる。 It is possible to efficiently extract the area in which the conclusion party information is written.

図１は、本実施例に係る画像処理装置の処理を説明するための図である。FIG. 1 is a diagram for explaining the processing of the image processing apparatus according to the present embodiment. 図２は、本実施例に係る画像処理装置の構成を示す機能ブロック図である。FIG. 2 is a functional block diagram showing the configuration of the image processing apparatus according to this embodiment. 図３は、学習データのデータ構造の一例を示す図である。FIG. 3 is a diagram showing an example of the data structure of learning data. 図４は、パターンデータのデータ構造の一例を示す図である。FIG. 4 is a diagram showing an example of the data structure of pattern data. 図５は、第１ｙ座標差分のヒストグラムを示す図である。FIG. 5 is a diagram showing a histogram of the first y-coordinate difference. 図６は、第１パターンの検出例を示す図である。FIG. 6 is a diagram showing an example of detection of the first pattern. 図７は、第２パターンの検出例を示す図である。FIG. 7 is a diagram showing an example of detection of the second pattern. 図８は、本実施例に係る画像処理装置の学習処理の処理手順を示すフローチャートである。FIG. 8 is a flowchart showing the processing procedure of the learning process of the image processing apparatus according to the present embodiment. 図９は、本実施例に係る画像処理装置の画像処理の処理手順を示すフローチャートである。FIG. 9 is a flowchart showing the procedure of image processing by the image processing apparatus according to this embodiment. 図１０は、第１パターンと第２パターンとの比較結果を示す図である。FIG. 10 is a diagram showing a comparison result between the first pattern and the second pattern. 図１１は、実施例の画像処理装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。FIG. 11 is a diagram illustrating an example of the hardware configuration of a computer that implements the same functions as the image processing apparatus of the embodiment.

以下に、本願の開示する画像処理方法、画像処理プログラムおよび画像処理装置の実施例を図面に基づいて詳細に説明する。なお、この実施例によりこの発明が限定されるものではない。 Embodiments of an image processing method, an image processing program, and an image processing apparatus disclosed in the present application will be described in detail below based on the drawings. Note that the present invention is not limited to this example.

図１は、本実施例に係る画像処理装置の処理を説明するための図である。画像処理装置は、学習データ１４１を基にして、パターンデータ１４２を学習しておく。パターンデータ１４２は、印面１０の中心座標を基準とする検出領域を示す情報である。たとえば、検出領域は、契約書の領域のうち、契約を締結した人物に関する情報（締結者情報）が記述されるであろう領域に対応する。印面１０の中心座標（中心座標を含む印面の領域）が「第１の領域」に対応する。 FIG. 1 is a diagram for explaining the processing of the image processing apparatus according to the present embodiment. The image processing device learns pattern data 142 based on learning data 141. The pattern data 142 is information indicating a detection area based on the center coordinates of the stamp face 10. For example, the detection area corresponds to an area of the contract where information about the person who concluded the contract (contractor information) is written. The center coordinates of the stamp face 10 (the region of the stamp face including the center coordinates) correspond to the “first region”.

パターンデータ１４２は、印面１０の中心座標を基準とする第１ｙ座標差分（ｙ_１）と、第２ｙ座標差分（ｙ_２）とを定義する。第１ｙ座標差分は、検出領域の上端１１ａと、印面１０の中心座標との差分である。第２ｙ座標差分は、検出領域の下端１１ｂと、印面１０の中心座標との差分である。上端１１ａ、下端１１ｂのｙ座標に対応する領域が「第２の領域」に対応する。 The pattern data 142 defines a first y-coordinate difference (y ₁ ) and a second y-coordinate difference (y ₂ ) based on the center coordinates of the stamp face 10. The first y-coordinate difference is the difference between the upper end 11a of the detection area and the center coordinate of the stamp surface 10. The second y-coordinate difference is the difference between the lower end 11b of the detection area and the center coordinate of the stamp face 10. The area corresponding to the y coordinates of the upper end 11a and the lower end 11b corresponds to the "second area".

画像処理装置が、パターンデータ１４２を用いて、契約書の画像データ２０に含まれる検出領域２５を抽出する場合には、画像データ２０の印面２０ａを特定する。画像処理装置は、印面２０ａの中心座標と、第１ｙ座標差分（ｙ_１）とを基にして、検出領域２５の上端２５ａを特定する。画像処理装置は、印面２０ａの中心座標と、第２ｙ座標差分（ｙ_２）とを基にして、検出領域２５の下端２５ｂを特定する。なお、画像処理装置は、検出領域２５のｘ座標方向の幅ｘ２５を、画像データ２０の幅ｘ２０と同一とする。画像処理装置は、検出領域２５に対して、文字認識を実行することで、締結者情報の文字列を認識する。 When the image processing device uses the pattern data 142 to extract the detection area 25 included in the image data 20 of the contract, the stamp face 20a of the image data 20 is specified. The image processing device specifies the upper end 25a of the detection area 25 based on the center coordinates of the stamp surface 20a and the first y-coordinate difference (y ₁ ). The image processing device specifies the lower end 25b of the detection area 25 based on the center coordinates of the stamp surface 20a and the second y-coordinate difference (y ₂ ). Note that the image processing device makes the width x25 of the detection area 25 in the x-coordinate direction the same as the width x20 of the image data 20. The image processing device recognizes the character string of the conclusion party information by performing character recognition on the detection area 25.

上記のように、本実施例に係る画像処理装置は、契約書の画像データ２０から印面２０ａを特定し、パターンデータ１４２に定義された印面の中心座標を基準とした検出領域の位置関係を基にして、画像データ２０の検出領域２５を抽出する。すなわち、画像処理装置は、契約書の書式が未知でも、押印位置と締結者情報の記載位置との位置関係が変わらないことに着目することで、締結者情報が記載された検出領域を効率的に抽出することができる。また、検出領域を効率的に抽出することで、検出領域に記載されているであろう締結者情報を適切に認識することもできる。 As described above, the image processing device according to the present embodiment identifies the stamp face 20a from the image data 20 of the contract, and based on the positional relationship of the detection area with respect to the central coordinates of the stamp face defined in the pattern data 142. Then, the detection area 25 of the image data 20 is extracted. In other words, even if the format of the contract is unknown, the image processing device can efficiently detect the detection area where the signatory information is written by focusing on the fact that the positional relationship between the seal position and the position where the signatory information is written does not change. can be extracted to Further, by efficiently extracting the detection area, it is also possible to appropriately recognize the conclusion party information that may be written in the detection area.

次に、本実施例に係る画像処理装置の構成について説明する。図２は、本実施例に係る画像処理装置の構成を示す機能ブロック図である。図２に示すように、この画像処理装置１００は、スキャナ５０に接続される。画像処理装置１００は、通信部１１０と、入力部１２０と、表示部１３０と、記憶部１４０と、制御部１５０とを有する。 Next, the configuration of the image processing apparatus according to this embodiment will be explained. FIG. 2 is a functional block diagram showing the configuration of the image processing apparatus according to this embodiment. As shown in FIG. 2, this image processing device 100 is connected to a scanner 50. The image processing device 100 includes a communication section 110, an input section 120, a display section 130, a storage section 140, and a control section 150.

スキャナ５０は、契約書の表面を電子的に読み込むことで、契約書の画像データを生成する装置である。本実施例では、スキャナ５０は、押印済みの契約書の画像データを生成するものとする。スキャナ５０は、画像データを、画像処理装置１００の通信部１１０に出力する。 The scanner 50 is a device that generates image data of a contract by electronically reading the front surface of the contract. In this embodiment, it is assumed that the scanner 50 generates image data of a sealed contract. The scanner 50 outputs image data to the communication unit 110 of the image processing device 100.

通信部１１０は、スキャナ５０と接続し、画像データを取得する装置である。通信部１１０は、通信装置の一例である。図示を省略するが、通信部１１０は、ネットワークを介して、他の外部装置と接続し、データ通信を実行してもよい。後述する制御部１５０は、通信部１１０を解して、スキャナ５０、外部装置等とデータをやり取りする。 The communication unit 110 is a device that connects to the scanner 50 and acquires image data. Communication unit 110 is an example of a communication device. Although not shown, the communication unit 110 may connect to another external device via a network and perform data communication. A control unit 150, which will be described later, exchanges data with the scanner 50, external devices, etc. via the communication unit 110.

入力部１２０は、各種のデータを画像処理装置１００の制御部１５０に入力するための入力装置である。入力部１２０は、キーボードやマウス、タッチパネル等に対応する。 The input unit 120 is an input device for inputting various data to the control unit 150 of the image processing apparatus 100. The input unit 120 corresponds to a keyboard, a mouse, a touch panel, etc.

表示部１３０は、表示部１３０は、制御部１５０から出力される情報を表示する表示装置である。表示部１３０は、有機ＥＬ（Electro Luminescence）ディスプレイ、液晶ディスプレイ、タッチパネル等に対応する。 The display unit 130 is a display device that displays information output from the control unit 150. The display unit 130 corresponds to an organic EL (Electro Luminescence) display, a liquid crystal display, a touch panel, or the like.

記憶部１４０は、学習データ１４１、パターンデータ１４２を有する。記憶部１４０は、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）などの半導体メモリ素子や、ＨＤＤ（Hard Disk Drive）などの記憶装置に対応する。 The storage unit 140 has learning data 141 and pattern data 142. The storage unit 140 corresponds to a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory, or a storage device such as an HDD (Hard Disk Drive).

学習データ１４１は、パターンデータ１４２を生成するために用いられるデータである。図３は、学習データのデータ構造の一例を示す図である。図３に示すように、この学習データ１４１は、サンプル番号と、画像データと、上端ｙ座標と、下端ｙ座標と、印面中心ｘ座標と、印面中心ｙ座標と、第１ｙ座標差分と、第２ｙ座標差分とを有する。 Learning data 141 is data used to generate pattern data 142. FIG. 3 is a diagram showing an example of the data structure of learning data. As shown in FIG. 3, this learning data 141 includes a sample number, image data, upper end y-coordinate, lower end y-coordinate, stamp face center x-coordinate, stamp face center y-coordinate, first y-coordinate difference, and first y-coordinate. 2y coordinate difference.

サンプル番号は、サンプルとなる画像データを識別する情報である。画像データは、スキャナなどによって電子化されたサンプルとなる契約書の画像データである。サンプルとなる契約書には、締結者情報が記載されており、押印済みとする。サンプルとなる契約書の画像データは、「第二の画像データ」に対応する。以下の説明では、特に区別する場合、学習データの画像データを、第二の画像データと表記する。 The sample number is information that identifies image data serving as a sample. The image data is image data of a sample contract that has been digitized using a scanner or the like. The sample contract contains the information of the signatory and is stamped. The image data of the sample contract corresponds to "second image data." In the following description, the image data of the learning data will be referred to as second image data, especially when distinguished.

上端ｙ座標は、第二の画像データにおける検出領域の上端のｙ座標を示す。下端ｙ座標は、第二の画像データにおける検出領域の下端のｙ座標を示す。印面中心ｘ座標は、第二の画像データにおける印面の中心ｘ座標である。印面中心ｙ座標は、第二の画像データにおける印面の中心ｙ座標である。たとえば、第二の画像データの左上隅の座標を原点とする。 The upper end y-coordinate indicates the y-coordinate of the upper end of the detection area in the second image data. The lower end y-coordinate indicates the y-coordinate of the lower end of the detection area in the second image data. The seal surface center x-coordinate is the center x-coordinate of the stamp surface in the second image data. The stamp surface center y-coordinate is the center y-coordinate of the stamp surface in the second image data. For example, the coordinates of the upper left corner of the second image data are set as the origin.

第１ｙ座標差分は、印面中心ｙ座標から、上端ｙ座標を減算した差分値である。第２ｙ座標は、下端ｙ座標から、印面中心座標を減算した差分値である。 The first y-coordinate difference is a difference value obtained by subtracting the top y-coordinate from the stamp face center y-coordinate. The second y-coordinate is a difference value obtained by subtracting the stamp face center coordinate from the lower end y-coordinate.

図２の説明に戻る。パターンデータ１４２は、学習データ１４１を基にして生成される情報であって、印面の中心座標を基準とする検出領域を示す情報である。パターンデータ１４２の検出座標は、新規の契約書の画像データにおいて、締結者情報が記載されている可能性の高い領域を示す。図４は、パターンデータのデータ構造の一例を示す図である。図４に示すように、パターンデータ１４２には、第１パターンと、第２パターンとが含まれる。 Returning to the explanation of FIG. 2. The pattern data 142 is information generated based on the learning data 141, and is information indicating a detection area based on the central coordinates of the stamp face. The detected coordinates of the pattern data 142 indicate an area in the image data of the new contract where the signatory information is likely to be written. FIG. 4 is a diagram showing an example of the data structure of pattern data. As shown in FIG. 4, the pattern data 142 includes a first pattern and a second pattern.

第１パターンには、第１－１ｙ座標差分と、第２－１ｙ座標差分とが設定される。第１－１ｙ座標差分は、学習データ１４１に格納された各第１ｙ座標差分のうち、最大の第１ｙ座標差分となる。第２－１ｙ座標差分は、学習データ１４１に格納された各第２ｙ座標差分のうち、最大の第２ｙ座標差分となる。 In the first pattern, a 1st-1y coordinate difference and a 2nd-1y coordinate difference are set. The 1-1st y-coordinate difference is the largest first y-coordinate difference among the first y-coordinate differences stored in the learning data 141. The 2-1st y-coordinate difference is the largest second y-coordinate difference among the second y-coordinate differences stored in the learning data 141.

第２パターンには、第１－２ｙ座標差分と、第２－２ｙ座標差分とが設定される。第１－２ｙ座標差分は、学習データ１４１に格納された各１ｙ座標差分から外れ値を除き、外れ値以外の第１ｙ座標差分のうち、最大の第１ｙ座標差分となる。第２－２ｙ座標差分は、学習データ１４１に格納された各第２ｙ座標差分から外れ値を除き、外れ値以外の第２ｙ座標差分のうち、最大の第２ｙ座標差分となる。 In the second pattern, a 1st-2nd y coordinate difference and a 2nd-2nd y coordinate difference are set. The 1st-2nd y-coordinate difference is the largest first y-coordinate difference among the first y-coordinate differences other than the outliers, excluding outliers from each 1y-coordinate difference stored in the learning data 141. The 2-2nd y-coordinate difference is the largest second y-coordinate difference among the second y-coordinate differences other than the outliers, excluding outliers from each second y-coordinate difference stored in the learning data 141.

第２パターンは、外れ値を除外する。このため、第１パターンによって特定される検出領域は、第２パターンによって特定される検出領域よりも広くなる。 The second pattern excludes outliers. Therefore, the detection area specified by the first pattern is wider than the detection area specified by the second pattern.

図２の説明に戻る。制御部１５０は、取得部１５１と、学習部１５２と、特定部１５３と、抽出部１５４と、文字認識部１５５とを有する。制御部１５０は、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）などによって実現できる。また、制御部１５０は、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）などのハードワイヤードロジックによっても実現できる。 Returning to the explanation of FIG. 2. The control unit 150 includes an acquisition unit 151 , a learning unit 152 , a specification unit 153 , an extraction unit 154 , and a character recognition unit 155 . The control unit 150 can be realized by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like. Further, the control unit 150 can also be realized by hardwired logic such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

取得部１５１は、外部装置等から学習データ１４１を取得する処理部である。取得部１５１は、取得した学習データ１４１を記憶部１４０に格納する。また、取得部１５１は、スキャナ５０から、押印済みの新規の契約書の画像データを取得した場合に、取得した画像データを、特定部１５３に出力する。押印済みの新規の契約書の画像データは、「第一の画像データ」に対応する。以下の説明では、特に区別する場合、押印済みの新規の契約書の画像データを、第一の画像データと表記する。 The acquisition unit 151 is a processing unit that acquires the learning data 141 from an external device or the like. The acquisition unit 151 stores the acquired learning data 141 in the storage unit 140. Further, when the acquisition unit 151 acquires image data of a new contract with a stamp from the scanner 50, the acquisition unit 151 outputs the acquired image data to the identification unit 153. The image data of the new signed contract corresponds to the "first image data." In the following description, the image data of a new contract that has been stamped will be referred to as first image data, especially if it is to be distinguished.

学習部１５２は、学習データ１４１を基にして、パターンデータ１４２を生成する処理部である。なお、学習データ１４１に含まれる印面中心ｙ座標（印面中心ｘ座標）は、予め設定されていてもよいし、第二の画像データから印面を特定して、自動で設定してもよい。学習部１５２は、印面中心ｙ座標を自動で特定した場合には、上端ｙ座標および下端ｙ座標を基にして、第１ｙ座標差分、第２ｙ座標差分を自動で算出してもよい。 The learning unit 152 is a processing unit that generates pattern data 142 based on the learning data 141. Note that the seal surface center y-coordinate (seal surface center x-coordinate) included in the learning data 141 may be set in advance, or may be automatically set by specifying the stamp surface from the second image data. When the learning unit 152 automatically specifies the stamp face center y-coordinate, it may automatically calculate the first y-coordinate difference and the second y-coordinate difference based on the upper end y-coordinate and the lower end y-coordinate.

学習部１５２が、パターンデータ１４２の「第１パターン」を学習する処理について説明する。学習部１５２は、学習データ１４１の各第１ｙ座標差分を比較し、最大の第１ｙ座標差分を特定する。学習部１５２は、特定した最大の第１ｙ座標差分を、第１パターンの第１－１ｙ座標差分として学習する。 A process in which the learning unit 152 learns the "first pattern" of the pattern data 142 will be described. The learning unit 152 compares each first y-coordinate difference of the learning data 141 and identifies the largest first y-coordinate difference. The learning unit 152 learns the identified maximum first y coordinate difference as the 1-1 y coordinate difference of the first pattern.

学習部１５２は、学習データ１４１の各第２ｙ座標差分を比較し、最大の第２ｙ座標差分を特定する。学習部１５２は、特定した最大の第２ｙ座標差分を、第１パターンの第２－１ｙ座標差分として学習する。 The learning unit 152 compares each second y-coordinate difference of the learning data 141 and identifies the maximum second y-coordinate difference. The learning unit 152 learns the identified maximum second y-coordinate difference as the 2-1 y-coordinate difference of the first pattern.

学習部１５２は、第１－１ｙ座標差分および第２－１ｙ座標差分を、第１パターンとして、パターンデータ１４２に登録する。 The learning unit 152 registers the 1st-1y coordinate difference and the 2nd-1y coordinate difference in the pattern data 142 as a first pattern.

続いて、学習部１５２が、パターンデータ１４２の「第２パターン」を学習する処理について説明する。学習部１５２は、学習データ１４１の全ての第１ｙ座標差分を基にして、第１ｙ座標差分の平均値μ_１を算出する。また、学習部１５２は、学習データ１４１の全ての第１ｙ座標差分を基にして、第１ｙ座標差分の標準偏差σ_１を算出する。 Next, a process in which the learning unit 152 learns the "second pattern" of the pattern data 142 will be described. The learning unit 152 calculates the average value μ ₁ of the first y-coordinate differences based on all the first y-coordinate differences of the learning data 141. Furthermore, the learning unit 152 calculates the standard deviation σ ₁ of the first y-coordinate differences based on all the first y-coordinate differences of the learning data 141.

学習部１５２は、学習データ１４１の全ての第１ｙ座標差分のうち、「μ_１－３×σ_１」未満となる第１ｙ座標差分と、「μ_１＋３×σ_１」以上となる第１ｙ座標差分とを外れ値として特定する。学習部１５２は、学習データ１４１の全ての第１ｙ座標差分から外れ値を除外した第１ｙ座標差分のうち、最大の第１ｙ座標差分を、第２パターンの第２－１ｙ座標差分として学習する。 Of all the first y-coordinate differences of the learning data 141, the learning unit 152 selects a first y-coordinate difference that is less than “μ ₁ −3×σ ₁ ” and a first y-coordinate difference that is greater than or equal to “μ ₁ +3×σ ₁ ”. The difference is identified as an outlier. The learning unit 152 learns the largest first y-coordinate difference among all the first y-coordinate differences of the learning data 141 excluding outliers as the 2-1 y-coordinate difference of the second pattern.

図５は、第１ｙ座標差分のヒストグラムを示す図である。図５に示すヒストグラムの横軸は、第１ｙ座標差分の値に対応する。ヒストグラムの縦軸は、頻度に対応する。たとえば、範囲３０に値が含まれる第１ｙ座標差分の値が、外れ値となる。たとえば、３１ａに示す値が、第１パターンの第１－１ｙ座標差分の値となる。３１ｂに示す値が、第２パターンの第１－２ｙ座標差分の値となる。 FIG. 5 is a diagram showing a histogram of the first y-coordinate difference. The horizontal axis of the histogram shown in FIG. 5 corresponds to the value of the first y-coordinate difference. The vertical axis of the histogram corresponds to frequency. For example, the value of the first y-coordinate difference whose value is included in the range 30 becomes an outlier. For example, the value shown at 31a is the value of the 1-1y coordinate difference of the first pattern. The value shown in 31b is the value of the 1st-2nd y coordinate difference of the second pattern.

学習部１５２は、学習データ１４１の全ての第２ｙ座標差分を基にして、第２ｙ座標差分の平均値μ_２を算出する。また、学習部１５２は、学習データ１４１の全ての第２ｙ座標差分を基にして、第１ｙ座標差分の標準偏差σ_２を算出する。 The learning unit 152 calculates the average value μ ₂ of the second y-coordinate differences based on all the second y-coordinate differences of the learning data 141. Further, the learning unit 152 calculates the standard deviation σ ₂ of the first y-coordinate difference based on all the second y-coordinate differences of the learning data 141.

学習部１５２は、学習データ１４１の全ての第２ｙ座標差分のうち、「μ_２－３×σ_２」未満となる第２ｙ座標差分と、「μ_２＋３×σ_２」以上となる第２ｙ座標差分とを外れ値として特定する。学習部１５２は、学習データ１４１の全ての第２ｙ座標差分から外れ値を除外した第２ｙ座標差分のうち、最大の第２ｙ座標差分を、第２パターンの第２－２ｙ座標差分として学習する。第１ｙ座標差分について、ヒストグラムの説明を省略する。 The learning unit 152 selects a second y-coordinate difference that is less than “μ ₂ −3×σ ₂ ” and a second y-coordinate difference that is equal to or more than “μ ₂ +3×σ ₂ ” out of all the second y-coordinate differences of the learning data 141. The difference is identified as an outlier. The learning unit 152 learns the largest second y-coordinate difference among all the second y-coordinate differences of the learning data 141 excluding outliers as the 2-2nd y-coordinate difference of the second pattern. Regarding the first y-coordinate difference, explanation of the histogram will be omitted.

学習部１５２は、第１－２ｙ座標差分および第２－２ｙ座標差分を、第２パターンとして、パターンデータ１４２に登録する。 The learning unit 152 registers the 1st-2nd y coordinate difference and the 2nd-2nd y coordinate difference in the pattern data 142 as a second pattern.

学習部１５２は、上記処理を実行することで、パターンデータ１４２を学習し、学習したパターンデータ１４２を、記憶部１４０に格納する。 The learning unit 152 learns the pattern data 142 by performing the above processing, and stores the learned pattern data 142 in the storage unit 140.

図２の説明に戻る。特定部１５３は、第一の画像データから、印面に相当する領域座標を特定する処理部である。たとえば、特定部１５３は、印鑑の色や形状を定義したテンプレートと、第一の画像データと比較して、印面の領域を特定する。特定部１５３は、特定した印面の領域の中心座標（ｘ_０、ｙ_０）を特定し、中心座標（ｘ_０、ｙ_０）の情報を、抽出部１５４に出力する。ただし、領域座標は必ずしも中心座標に限定されるわけではなく、例えば、印面の頂点に対応する座標でも良く、複数の座標を領域座標としても良い。 Returning to the explanation of FIG. 2. The specifying unit 153 is a processing unit that specifies the area coordinates corresponding to the stamp surface from the first image data. For example, the specifying unit 153 specifies the area of the stamp face by comparing the first image data with a template that defines the color and shape of the stamp. The specifying unit 153 specifies the center coordinates (x ₀ , y ₀ ) of the specified region of the seal face, and outputs information on the center coordinates (x ₀ , y ₀ ) to the extracting unit 154 . However, the area coordinates are not necessarily limited to the center coordinates; for example, they may be coordinates corresponding to the vertices of the stamp face, or a plurality of coordinates may be used as the area coordinates.

抽出部１５４は、パターンデータ１４２を用いて、第一の画像データに含まれる検出領域を抽出する処理部である。抽出部１５４は、パターンデータ１４２に含まれる第１パターンまたは第２パターンのいずれか一方を用いて、検出領域を抽出する。第１パターンを用いるのか、第２パターンを用いるのかは、予め設定されているものとする。 The extraction unit 154 is a processing unit that uses the pattern data 142 to extract a detection area included in the first image data. The extraction unit 154 extracts a detection area using either the first pattern or the second pattern included in the pattern data 142. It is assumed that whether to use the first pattern or the second pattern is set in advance.

抽出部１５４が、第１パターンを用いて、検出領域を抽出する処理の一例について説明する。抽出部１５４は、印面の中心座標（ｘ_０、ｙ_０）と、第１－１ｙ座標差分とを基にして、検出領域２５の上端を特定する。抽出部１５４は、印面の中心座標（ｘ_０、ｙ_０）と、第２－１ｙ座標差分とを基にして、検出領域の下端を特定する。なお、抽出部１５４は、検出領域のｘ座標方向の幅を、第一の画像データの幅と同一とする。 An example of a process in which the extraction unit 154 extracts a detection area using the first pattern will be described. The extraction unit 154 specifies the upper end of the detection area 25 based on the central coordinates (x ₀ , y ₀ ) of the stamp face and the 1-1y coordinate difference. The extraction unit 154 specifies the lower end of the detection area based on the central coordinates (x ₀ , y ₀ ) of the stamp face and the 2-1y coordinate difference. Note that the extraction unit 154 sets the width of the detection area in the x-coordinate direction to be the same as the width of the first image data.

抽出部１５４は、第１パターンを用いて、第一の画像データに対して特定した検出領域の情報を、文字認識部１５５に出力する。 The extraction unit 154 outputs information on the detection area specified for the first image data to the character recognition unit 155 using the first pattern.

図６は、第１パターンの検出例を示す図である。図６に示すように、印面４０の中心座標と、第１パターンとを基にして、検出領域４０ａが設定される。 FIG. 6 is a diagram showing an example of detection of the first pattern. As shown in FIG. 6, a detection area 40a is set based on the center coordinates of the stamp surface 40 and the first pattern.

抽出部１５４が、第２パターンを用いて、検出領域を抽出する処理の一例について説明する。抽出部１５４は、印面の中心座標（ｘ_０、ｙ_０）と、第１－２ｙ座標差分とを基にして、検出領域２５の上端を特定する。抽出部１５４は、印面の中心座標（ｘ_０、ｙ_０）と、第２－２ｙ座標差分とを基にして、検出領域の下端を特定する。なお、抽出部１５４は、検出領域のｘ座標方向の幅を、第一の画像データの幅と同一とする。 An example of a process in which the extraction unit 154 extracts a detection area using the second pattern will be described. The extraction unit 154 specifies the upper end of the detection area 25 based on the center coordinates (x ₀ , y ₀ ) of the stamp face and the 1st-2nd y coordinate difference. The extraction unit 154 specifies the lower end of the detection area based on the central coordinates (x ₀ , y ₀ ) of the stamp face and the 2-2nd y coordinate difference. Note that the extraction unit 154 sets the width of the detection area in the x-coordinate direction to be the same as the width of the first image data.

抽出部１５４は、第２パターンを用いて、第一の画像データに対して特定した検出領域の情報を、文字認識部１５５に出力する。 The extraction unit 154 outputs information on the detection area specified for the first image data to the character recognition unit 155 using the second pattern.

図７は、第２パターンの検出例を示す図である。図７に示すように、印面４０の中心座標と、第２パターンとを基にして、検出領域４０ｂが設定される。 FIG. 7 is a diagram showing an example of detection of the second pattern. As shown in FIG. 7, a detection area 40b is set based on the center coordinates of the stamp surface 40 and the second pattern.

文字認識部１５５は、抽出部１５４によって抽出された検出領域の画像に対して、文字認識を実行することで、締結者情報の文字列を認識する処理部である。文字認識部１５５は、認識した文字列を、表示部１３０に出力する。文字認識部１５５は、ネットワークを介して、文字列の認識結果を、外部装置等に通知してもよい。 The character recognition unit 155 is a processing unit that recognizes the character string of the conclusion party information by performing character recognition on the image of the detection area extracted by the extraction unit 154. The character recognition unit 155 outputs the recognized character string to the display unit 130. The character recognition unit 155 may notify an external device or the like of the recognition result of the character string via the network.

次に、本実施例に係る画像処理装置１００の処理手順の一例について説明する。図８は、本実施例に係る画像処理装置の学習処理の処理手順を示すフローチャートである。図８に示すように、画像処理装置１００の取得部１５１は、学習データ１４１を取得する（ステップＳ１０１）。 Next, an example of the processing procedure of the image processing apparatus 100 according to the present embodiment will be described. FIG. 8 is a flowchart showing the processing procedure of the learning process of the image processing apparatus according to the present embodiment. As shown in FIG. 8, the acquisition unit 151 of the image processing device 100 acquires learning data 141 (step S101).

画像処理装置１００の学習部１５２は、学習データ１４１の各第１ｙ座標差分を比較し、最大となる第１ｙ座標差分を、第１－１座標差分に設定する（ステップＳ１０２）。学習部１５２は、学習データ１４１の各第２ｙ座標差分を比較し、最大となる第２ｙ座標差分を、第２－１座標差分に設定する（ステップＳ１０３）。 The learning unit 152 of the image processing device 100 compares each first y-coordinate difference of the learning data 141, and sets the largest first y-coordinate difference as the 1-1st coordinate difference (step S102). The learning unit 152 compares each second y-coordinate difference of the learning data 141, and sets the largest second y-coordinate difference as the 2-1st coordinate difference (step S103).

学習部１５２は、第１－１ｙ座標差分および第２－１ｙ座標差分を、第１パターンとして、パターンデータ１４２に登録する（ステップＳ１０４）。 The learning unit 152 registers the first-1y coordinate difference and the second-1y coordinate difference as the first pattern in the pattern data 142 (step S104).

学習部１５２は、学習データ１４１の各第１座標差分を基にして、平均値μ_１および標準偏差σ_１を算出する（ステップＳ１０５）。学習部１５２は、平均値μ_１および標準偏差σ_１を基にして、学習データ１４１の各第１ｙ座標差分から外れ値を除外する（ステップＳ１０６）。 The learning unit 152 calculates the average value μ ₁ and the standard deviation σ ₁ based on each first coordinate difference of the learning data 141 (step S105). The learning unit 152 excludes outliers from each first y-coordinate difference of the learning data 141 based on the average value μ ₁ and the standard deviation σ ₁ (step S106).

学習部１５２は、外れ値を除外した第１ｙ座標差分のうち、最大となる第１ｙ座標差分を、第１－２ｙ座標差分に設定する（ステップＳ１０７）。 The learning unit 152 sets the largest first y-coordinate difference among the first y-coordinate differences excluding outliers as the 1-2nd y-coordinate difference (step S107).

学習部１５２は、学習データ１４１の各第２座標差分を基にして、平均値μ_２および標準偏差σ_２を算出する（ステップＳ１０８）。学習部１５２は、平均値μ_２および標準偏差σ_２を基にして、学習データ１４１の各第２ｙ座標差分から外れ値を除外する（ステップＳ１０９）。 The learning unit 152 calculates the average value μ ₂ and the standard deviation σ ₂ based on each second coordinate difference of the learning data 141 (step S108). The learning unit 152 excludes outliers from each second y-coordinate difference of the learning data 141 based on the average value μ ₂ and standard deviation σ ₂ (step S109).

学習部１５２は、外れ値を除外した第２ｙ座標差分のうち、最大となる第２ｙ座標差分を、第２－２ｙ座標差分に設定する（ステップＳ１１０）。 The learning unit 152 sets the largest second y-coordinate difference among the second y-coordinate differences excluding outliers as the 2-2nd y-coordinate difference (step S110).

学習部１５２は、第１－２ｙ座標差分および第２－２ｙ座標差分を、第２パターンとして、パターンデータ１４２に登録する（ステップＳ１１１）。 The learning unit 152 registers the 1st-2nd y coordinate difference and the 2nd-2nd y coordinate difference in the pattern data 142 as a second pattern (step S111).

図９は、本実施例に係る画像処理装置の画像処理の処理手順を示すフローチャートである。図９に示すように、画像処理装置１００の取得部１５１は、スキャナ５０から第一の画像データを取得する（ステップＳ２０１）。 FIG. 9 is a flowchart showing the procedure of image processing by the image processing apparatus according to this embodiment. As shown in FIG. 9, the acquisition unit 151 of the image processing device 100 acquires first image data from the scanner 50 (step S201).

画像処理装置１００の特定部１５３は、第一の画像データから、印面の領域を特定する（ステップＳ２０２）。特定部１５３は、印面の中心座標を特定する（ステップＳ２０３）。 The specifying unit 153 of the image processing device 100 specifies the region of the seal face from the first image data (step S202). The specifying unit 153 specifies the center coordinates of the seal face (step S203).

画像処理装置１００の抽出部１５４は、第１パターンを選択する場合には（ステップＳ２０４，Ｙｅｓ）、ステップＳ２０５に移行する。一方、抽出部１５４は、第１パターンを選択しない場合には（ステップＳ２０４，Ｎｏ）、ステップＳ２０７に移行する。 If the extraction unit 154 of the image processing device 100 selects the first pattern (step S204, Yes), the process proceeds to step S205. On the other hand, when the extraction unit 154 does not select the first pattern (step S204, No), the extraction unit 154 moves to step S207.

ステップＳ２０５について説明する。抽出部１５４は、第１パターンの第１－１ｙ座標差分、第２－１ｙ座標差分をパターンデータ１４２から取得する（ステップＳ２０５）。抽出部１５４は、第１パターンと印面の中心座標とを基にして、検出領域を抽出する（ステップＳ２０６）。 Step S205 will be explained. The extraction unit 154 acquires the 1st-1y coordinate difference and the 2nd-1y coordinate difference of the first pattern from the pattern data 142 (step S205). The extraction unit 154 extracts a detection area based on the first pattern and the center coordinates of the seal face (step S206).

画像処理装置１００の文字認識部１５５は、検出領域の画像から、文字列を認識する（ステップＳ２０９）。文字認識部１５５は、認識結果を表示部１３０に出力する（ステップＳ２１０）。 The character recognition unit 155 of the image processing device 100 recognizes a character string from the image of the detection area (step S209). The character recognition unit 155 outputs the recognition result to the display unit 130 (step S210).

ステップＳ２０７について説明する。抽出部１５４は、第２パターンの第１－２ｙ座標差分、第２－２ｙ座標差分をパターンデータ１４２から取得する（ステップＳ２０７）。抽出部１５４は、第２パターンと印面の中心座標とを基にして、検出領域を抽出し（ステップＳ２０８）、ステップＳ２０９に移行する。 Step S207 will be explained. The extraction unit 154 acquires the 1st-2nd y coordinate difference and the 2nd-2nd y coordinate difference of the second pattern from the pattern data 142 (step S207). The extraction unit 154 extracts a detection area based on the second pattern and the center coordinates of the seal face (step S208), and proceeds to step S209.

次に、本実施例１に係る画像処理装置１００の効果について説明する。画像処理装置１００は、契約書の画像データから印面を特定し、パターンデータに定義された印面の中心座標を基準とした検出領域の位置関係を基にして、画像データの検出領域を抽出する。すなわち、画像処理装置は、契約書の書式が未知でも、押印位置と締結者情報の記載位置との位置関係が変わらないことに着目することで、締結者情報が記載された検出領域を効率的に抽出することができる。また、検出領域を効率的に抽出することで、検出領域に記載されているであろう締結者情報を適切に認識することもできる。 Next, the effects of the image processing apparatus 100 according to the first embodiment will be explained. The image processing device 100 identifies the seal face from the image data of the contract, and extracts the detection area of the image data based on the positional relationship of the detection area with respect to the central coordinates of the seal face defined in the pattern data. In other words, even if the format of the contract is unknown, the image processing device can efficiently detect the detection area where the signatory information is written by focusing on the fact that the positional relationship between the seal position and the position where the signatory information is written does not change. can be extracted to Further, by efficiently extracting the detection area, it is also possible to appropriately recognize the conclusion party information that may be written in the detection area.

画像処理装置１００は、学習データ１４１に含まれる各第１ｙ座標差分の最大値と、第２ｙ座標差分の最大値とを基にして、第１パターンを学習する。これによって、画像データから印面を抽出することで、締結者情報の検出領域を特定することが可能となる。 The image processing device 100 learns the first pattern based on the maximum value of each first y-coordinate difference and the maximum value of the second y-coordinate difference included in the learning data 141. With this, by extracting the seal face from the image data, it becomes possible to specify the detection area of the signatory information.

画像処理装置１００は、学習データ１４１に含まれる各第１ｙ座標差分から外れ値を除外した残りの第１ｙ座標差分と、各第２ｙ座標差分から外れ値を除外した残りの第２ｙ座標差分とを基にして、第２パターンを学習する。これによって、画像データから印面を抽出することで、締結者情報の検出領域を特定することが可能となる。また、第１パターンの検出領域よりも、検出領域を絞り込むことができる。 The image processing device 100 calculates the remaining first y-coordinate differences after excluding outliers from each first y-coordinate difference included in the learning data 141 and the remaining second y-coordinate differences after excluding outliers from each second y-coordinate difference. Based on this, a second pattern is learned. With this, by extracting the seal face from the image data, it becomes possible to specify the detection area of the signatory information. Furthermore, the detection area can be narrowed down more than the detection area of the first pattern.

ここで、第１パターンによって特定した検出領域と、第２パターンによって特定した検出領域とに対して文字認識を実行した場合の認識結果の一例について説明する。図１０は、第１パターンと第２パターンとの比較結果を示す図である。 Here, an example of recognition results when character recognition is performed on the detection area specified by the first pattern and the detection area specified by the second pattern will be described. FIG. 10 is a diagram showing a comparison result between the first pattern and the second pattern.

図１０に示す例では、印面４５の中心座標と、第１パターンとを基にして検出領域４５ａが設定される。印面４６の中心座標と、第１パターンとを基にして検出領域４６ａが設定される。図１０に示す例では、第１パターンでは、締結者情報を検出しているが、第２パターンでは、締結者情報の一部を検出（検出漏れが発生している）できていない。第１パターンの検出領域は、第２パターンの検出領域よりも大きいため、検出漏れを防ぐことができる。 In the example shown in FIG. 10, the detection area 45a is set based on the center coordinates of the stamp surface 45 and the first pattern. A detection area 46a is set based on the center coordinates of the stamp surface 46 and the first pattern. In the example shown in FIG. 10, in the first pattern, the conclusion party information is detected, but in the second pattern, part of the conclusion party information cannot be detected (some detection is omitted). Since the detection area of the first pattern is larger than the detection area of the second pattern, detection omissions can be prevented.

たとえば、発明者が実験を行ったところ、第１パターンの検出領域は、帳票の約４割であり、締結者情報の検出率は１００％であった。一方、第２パターンの検出領域は、帳票の約１．５割であり、締結者情報の検出率は９７．８％であった。 For example, when the inventor conducted an experiment, the detection area of the first pattern was approximately 40% of the form, and the detection rate of the conclusion party information was 100%. On the other hand, the detection area of the second pattern was approximately 1.50% of the form, and the detection rate of the conclusion party information was 97.8%.

次に、上記実施例に示した画像処理装置１００と同様の機能を実現するコンピュータのハードウェア構成の一例について説明する。図１１は、実施例の画像処理装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。 Next, an example of the hardware configuration of a computer that implements the same functions as the image processing apparatus 100 shown in the above embodiment will be described. FIG. 11 is a diagram illustrating an example of the hardware configuration of a computer that implements the same functions as the image processing apparatus of the embodiment.

図１１に示すように、コンピュータ２００は、各種演算処理を実行するＣＰＵ２０１と、ユーザからのデータの入力を受け付ける入力装置２０２と、ディスプレイ２０３とを有する。また、コンピュータ２００は、記憶媒体からプログラム等を読み取る読み取り装置２０４と、有線または無線ネットワークを介して、スキャナ５０、外部装置等との間でデータの授受を行う通信装置２０５とを有する。また、コンピュータ２００は、各種情報を一時記憶するＲＡＭ２０６と、ハードディスク装置２０７とを有する。そして、各装置２０１～２０７は、バス２０８に接続される。 As shown in FIG. 11, the computer 200 includes a CPU 201 that executes various calculation processes, an input device 202 that receives data input from a user, and a display 203. The computer 200 also includes a reading device 204 that reads programs and the like from a storage medium, and a communication device 205 that exchanges data with the scanner 50, external devices, etc. via a wired or wireless network. The computer 200 also includes a RAM 206 that temporarily stores various information and a hard disk device 207. Each device 201-207 is then connected to a bus 208.

ハードディスク装置２０７は、取得プログラム２０７ａ、学習プログラム２０７ｂ、特定プログラム２０７ｃ、抽出プログラム２０７ｄ、文字認識プログラム２０７ｅを有する。また、ＣＰＵ２０１は、各プログラム２０７ａ～２０７ｅを読み出してＲＡＭ２０６に展開する。 The hard disk device 207 includes an acquisition program 207a, a learning program 207b, a specific program 207c, an extraction program 207d, and a character recognition program 207e. Further, the CPU 201 reads each program 207a to 207e and expands it into the RAM 206.

取得プログラム２０７ａは、取得プロセス２０６ａとして機能する。学習プログラム２０７ｂは、学習プロセス２０６ｂとして機能する。特定プログラム２０７ｃは、特定プロセス２０６ｃとして機能する。抽出プログラム２０７ｄは、抽出プロセス２０６ｄとして機能する。文字認識プログラム２０７ｅは、文字認識プロセス２０６ｅとして機能する。 The acquisition program 207a functions as an acquisition process 206a. The learning program 207b functions as a learning process 206b. The specific program 207c functions as a specific process 206c. The extraction program 207d functions as an extraction process 206d. Character recognition program 207e functions as character recognition process 206e.

取得プロセス２０６ａの処理は、取得部１５１の処理に対応する。学習プロセス２０６ｂの処理は、学習部１５２の処理に対応する。特定プロセス２０６ｃの処理は、特定部１５３の処理に対応する。抽出プロセス２０６ｄの処理は、抽出部１５４の処理に対応する。文字認識プロセス２０６ｅの処理は、文字認識部１５５の処理に対応する。 The processing of the acquisition process 206a corresponds to the processing of the acquisition unit 151. The processing of the learning process 206b corresponds to the processing of the learning unit 152. The processing of the identification process 206c corresponds to the processing of the identification unit 153. The processing of the extraction process 206d corresponds to the processing of the extraction unit 154. The processing of the character recognition process 206e corresponds to the processing of the character recognition unit 155.

なお、各プログラム２０７ａ～２０７ｅについては、必ずしも最初からハードディスク装置２０７に記憶させておかなくても良い。例えば、コンピュータ２００に挿入されるフレキシブルディスク（ＦＤ）、ＣＤ－ＲＯＭ、ＤＶＤ、光磁気ディスク、ＩＣカードなどの「可搬用の物理媒体」に各プログラムを記憶させておく。そして、コンピュータ２００が各プログラム２０７ａ～２０７ｅを読み出して実行するようにしてもよい。 Note that each of the programs 207a to 207e does not necessarily have to be stored in the hard disk device 207 from the beginning. For example, each program is stored in a "portable physical medium" such as a flexible disk (FD), CD-ROM, DVD, magneto-optical disk, or IC card that is inserted into the computer 200. Then, the computer 200 may read and execute each program 207a to 207e.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 Regarding the embodiments including each of the above examples, the following additional notes are further disclosed.

（付記１）コンピュータに、
押印済の契約書の書面が電子的に読み込まれることにより生成された画像データを処理対象として、前記画像データから、印面に相当する領域を特定し、
予め記憶装置に記憶された、契約書の印面位置に対応する契約書内の第１の領域と、契約を締結した人物に関する情報が記載される契約書内の第２の領域との位置関係を参照し、処理対象である前記画像データから特定した前記印面の領域と前記位置関係とに基づいて特定される領域を、処理対象である前記画像データから抽出し、
抽出した前記領域を出力する
処理を実行させることを特徴とする画像処理方法。 (Additional note 1) On the computer,
Targeting image data generated by electronically reading a sealed contract document, identifying an area corresponding to the stamp surface from the image data,
The positional relationship between a first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information regarding the person who concluded the contract is written, which is stored in advance in the storage device. extracting a region specified based on the region of the seal face and the positional relationship identified from the image data to be processed by referring to the image data to be processed;
An image processing method characterized by executing a process of outputting the extracted region.

（付記２）押印済の契約書の書面が電子的に読み込まれることにより生成された複数の画像データを有する学習データを基にして、前記第１の領域と、前記第２の領域との位置関係を学習し、学習結果を前記記憶装置に記憶させる処理を更に実行させることを特徴とする付記１に記載の画像処理方法。 (Additional Note 2) The positions of the first area and the second area are determined based on learning data having multiple image data generated by electronically reading a sealed contract document. The image processing method according to appendix 1, further comprising the step of learning the relationship and storing the learning result in the storage device.

（付記３）前記学習する処理は、前記第１の領域として特定される座標と前記第２の領域として特定される座標との差分値を、画像データ毎に算出し、算出した差分値のうち、最大の差分値を、前記位置関係として学習することを特徴とする付記２に記載の画像処理方法。 (Additional note 3) The learning process calculates a difference value between the coordinates specified as the first area and the coordinates specified as the second area for each image data, and among the calculated difference values. , the maximum difference value is learned as the positional relationship, the image processing method according to appendix 2.

（付記４）前記学習する処理は、前記第１の領域として特定される座標と前記第２の領域として特定される座標との差分値を、画像データ毎に算出し、算出した複数の差分値を基にして、差分値の外れ値を算出し、前記複数の差分値から前記外れ値を除外した差分値を基にして、前記位置関係を学習することを特徴とする付記２に記載の画像処理方法。 (Additional note 4) The learning process calculates a difference value between the coordinates specified as the first area and the coordinates specified as the second area for each image data, and calculates a plurality of calculated difference values. The image according to appendix 2, wherein an outlier of the difference values is calculated based on the difference value, and the positional relationship is learned based on the difference value obtained by excluding the outlier from the plurality of difference values. Processing method.

（付記５）コンピュータに、
押印済の契約書の書面が電子的に読み込まれることにより生成された画像データを処理対象として、前記画像データから、印面に相当する領域を特定し、
予め記憶装置に記憶された、契約書の印面位置に対応する契約書内の第１の領域と、契約を締結した人物に関する情報が記載される契約書内の第２の領域との位置関係を参照し、処理対象である前記画像データから特定した前記印面の領域と前記位置関係とに基づいて特定される領域を、処理対象である前記画像データから抽出し、
抽出した前記領域を出力する
処理を実行させることを特徴とする画像処理プログラム。 (Additional note 5) On the computer,
Targeting image data generated by electronically reading a sealed contract document, identifying an area corresponding to the stamp surface from the image data,
The positional relationship between a first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information regarding the person who concluded the contract is written, which is stored in advance in the storage device. extracting a region specified based on the region of the seal face and the positional relationship identified from the image data to be processed by referring to the image data to be processed;
An image processing program that causes a process to output the extracted area.

（付記６）押印済の契約書の書面が電子的に読み込まれることにより生成された複数の画像データを有する学習データを基にして、前記第１の領域と、前記第２の領域との位置関係を学習し、学習結果を前記記憶装置に記憶させる処理を更に実行させることを特徴とする付記５に記載の画像処理プログラム。 (Additional Note 6) The positions of the first area and the second area are determined based on learning data having multiple image data generated by electronically reading a sealed contract document. The image processing program according to appendix 5, further comprising the step of learning a relationship and storing the learning result in the storage device.

（付記７）前記学習する処理は、前記第１の領域として特定される座標と前記第２の領域として特定される座標との差分値を、画像データ毎に算出し、算出した差分値のうち、最大の差分値を、前記位置関係として学習することを特徴とする付記６に記載の画像処理プログラム。 (Additional Note 7) The learning process calculates a difference value between the coordinates specified as the first area and the coordinates specified as the second area for each image data, and among the calculated difference values. , the maximum difference value is learned as the positional relationship, the image processing program according to appendix 6.

（付記８）前記学習する処理は、前記第１の領域として特定される座標と前記第２の領域として特定される座標との差分値を、画像データ毎に算出し、算出した複数の差分値を基にして、差分値の外れ値を算出し、前記複数の差分値から前記外れ値を除外した差分値を基にして、前記位置関係を学習することを特徴とする付記６に記載の画像処理プログラム。 (Additional note 8) The learning process calculates a difference value between the coordinates specified as the first area and the coordinates specified as the second area for each image data, and calculates a plurality of calculated difference values. The image according to appendix 6, wherein an outlier of the difference values is calculated based on the difference value, and the positional relationship is learned based on the difference value obtained by excluding the outlier from the plurality of difference values. Processing program.

（付記９）押印済の契約書の書面が電子的に読み込まれることにより生成された画像データを処理対象として、前記画像データから、印面に相当する領域を特定する特定部と、
予め記憶装置に記憶された、契約書の印面位置に対応する契約書内の第１の領域と、契約を締結した人物に関する情報が記載される契約書内の第２の領域との位置関係を参照し、処理対象である前記画像データから特定した前記印面の領域と前記位置関係とに基づいて特定される領域を処理対象である前記画像データから抽出し、抽出した前記領域を出力する抽出部と
を有することを特徴とする画像処理装置。 (Additional note 9) A specifying unit that specifies an area corresponding to the stamp surface from the image data, with image data generated by electronically reading a sealed contract document as a processing target;
The positional relationship between a first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information regarding the person who concluded the contract is written, which is stored in advance in the storage device. an extraction unit that refers to and extracts a region specified from the image data that is a processing target based on the area of the stamp face specified from the image data that is a processing target and the positional relationship, and outputs the extracted region; An image processing device comprising: and

（付記１０）押印済の契約書の書面が電子的に読み込まれることにより生成された複数の画像データを有する学習データを基にして、前記第１の領域と、前記第２の領域との位置関係を学習し、学習結果を前記記憶装置に記憶させる学習部を更に有することを特徴とする付記９に記載の画像処理装置。 (Additional Note 10) The positions of the first area and the second area are determined based on learning data having a plurality of image data generated by electronically reading a sealed contract document. The image processing device according to appendix 9, further comprising a learning section that learns the relationship and stores the learning result in the storage device.

（付記１１）前記学習部は、前記第１の領域として特定される座標と前記第２の領域として特定される座標との差分値を、画像データ毎に算出し、算出した差分値のうち、最大の差分値を、前記位置関係として学習することを特徴とする付記１０に記載の画像処理装置。 (Additional Note 11) The learning unit calculates a difference value between the coordinates specified as the first region and the coordinates specified as the second region for each image data, and among the calculated difference values, The image processing device according to appendix 10, wherein the maximum difference value is learned as the positional relationship.

（付記１２）前記学習部は、前記第１の領域として特定される座標と前記第２の領域として特定される座標との差分値を、画像データ毎に算出し、算出した複数の差分値を基にして、差分値の外れ値を算出し、前記複数の差分値から前記外れ値を除外した差分値を基にして、前記位置関係を学習することを特徴とする請求項１０に記載の画像処理装置。 (Additional Note 12) The learning unit calculates a difference value between the coordinates specified as the first region and the coordinates specified as the second region for each image data, and calculates a plurality of calculated difference values. 11. The image according to claim 10, wherein an outlier of difference values is calculated based on the difference value, and the positional relationship is learned based on the difference value obtained by excluding the outlier from the plurality of difference values. Processing equipment.

５０スキャナ
１００画像処理装置
１１０通信部
１２０入力部
１３０表示部
１４０記憶部
１４１学習データ
１４２パターンデータ
１５０制御部
１５１取得部
１５２学習部
１５３特定部
１５４抽出部
１５５文字認識部 50 Scanner 100 Image processing device 110 Communication unit 120 Input unit 130 Display unit 140 Storage unit 141 Learning data 142 Pattern data 150 Control unit 151 Acquisition unit 152 Learning unit 153 Specification unit 154 Extraction unit 155 Character recognition unit

Claims

An image processing method performed by a computer, the method comprising:
Targeting image data generated by electronically reading a sealed contract document, identifying an area corresponding to the stamp surface from the image data,
A first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information regarding the person who concluded the contract is written, which are stored in advance in the storage device, The positional relationship between the area of the stamp surface and the area identified from the image data to be processed by referring to the positional relationship with the second area defined by two coordinate differences in the vertical direction with respect to the first area. extracting a region identified based on the above from the image data to be processed;
An image processing method characterized by executing a process of outputting the extracted region.

The positional relationship between the first area and the second area is learned based on learning data having a plurality of image data generated by electronically reading a sealed contract document. 2. The image processing method according to claim 1, further comprising: storing the learning results in the storage device.

The learning process calculates the difference value between the coordinates specified as the first area and the coordinates specified as the second area for each image data, and calculates the maximum difference among the calculated difference values. 3. The image processing method according to claim 2, wherein a value is learned as the positional relationship.

The learning process calculates a difference value between the coordinates specified as the first region and the coordinates specified as the second region for each image data, and based on the plurality of calculated difference values. 3. The image processing method according to claim 2, further comprising calculating an outlier among the difference values, and learning the positional relationship based on the difference value obtained by excluding the outlier from the plurality of difference values.

Processing target is image data generated by electronically reading a sealed contract document into a computer, and identifying an area corresponding to the stamp surface from the image data,
a first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information regarding the person who concluded the contract is written, which are stored in advance in the storage device; The positional relationship between the area of the stamp surface and the area identified from the image data to be processed by referring to the positional relationship with the second area defined by two coordinate differences in the vertical direction with respect to the first area. extracting a region identified based on from the image data to be processed;
An image processing program that causes a process to output the extracted area.

a specifying unit for processing image data generated by electronically reading a sealed contract document, and identifying an area corresponding to the stamp surface from the image data;
a first area in the contract corresponding to the stamp position of the contract and a second area in the contract in which information regarding the person who concluded the contract is written, which are stored in advance in the storage device; The positional relationship between the area of the stamp surface and the area identified from the image data to be processed by referring to the positional relationship with the second area defined by two coordinate differences in the vertical direction with respect to the first area. an extraction unit that extracts a region specified based on the image data to be processed from the image data to be processed, and outputs the extracted region.