JP2018092459A

JP2018092459A - Image analyzing device, image analyzing method, and image analyzing program

Info

Publication number: JP2018092459A
Application number: JP2016236433A
Authority: JP
Inventors: 伸弘勝本; Nobuhiro Katsumoto; 隆司小池; Takashi Koike
Original assignee: Infordio Co Ltd; IRRC CORP
Current assignee: Infordio Co Ltd; IRRC CORP
Priority date: 2016-12-06
Filing date: 2016-12-06
Publication date: 2018-06-14

Abstract

PROBLEM TO BE SOLVED: To improve accuracy in extraction of character information from an image as an analysis subject such as an insurance policy.SOLUTION: An image analyzing device 10 for analyzing an object image, comprises an image analysis part 23 and a structure analysis part 25. The image analysis part 23 extracts character information from the object image. The structure analysis part 25 associates the character information with character information pairing with the character information. The association of the character information is performed on the basis of a structure distance obtained as a function of an image of a peripheral region in which the distance in the object image of two pieces of character information and the character information of the object image are extracted. The image analyzing device may further construct a mapping part that displays a combination of the character information associated by the image analysis part 23 in accordance with the object image, and receives an input of correction by a user in a case of including an error in the combination of the character information.SELECTED DRAWING: Figure 1

Description

本発明は、保険証券などの分析対象画像を分析する画像分析装置、画像分析方法および画像分析プログラムに関する。 The present invention relates to an image analysis apparatus, an image analysis method, and an image analysis program for analyzing an analysis target image such as an insurance policy.

生命保険や損害保険を提供する保険会社は、複数存在し、また、それぞれの保険会社は複数の保険商品を販売している。それぞれの保険商品には、保障額や保険期間などのパラメータがある。さらに、主たる保険契約に付随して特約が締結されることも多い。このため、保険の取り扱いにそれほど慣れていない保険契約人は、自分が契約した保険が自分にとって適切かどうかを判断することが難しい。 There are multiple insurance companies that provide life insurance and non-life insurance, and each insurance company sells multiple insurance products. Each insurance product has parameters such as the amount of insurance and the insurance period. In addition, special agreements are often signed with the main insurance contract. For this reason, it is difficult for policyholders who are not so used to handling insurance to determine whether or not the insurance he / she has contracted is appropriate for him / her.

複数の保険会社の保険商品を取り扱う保険代理店は、保険証券に基づいて顧客がすでに契約している保険の内容を把握し、顧客に適切な保険を提案する場合がある。 An insurance agent that handles insurance products of a plurality of insurance companies may grasp the contents of insurance that the customer has already contracted based on the insurance policy and propose appropriate insurance to the customer.

特開２０１５−４６０２７号公報JP 2015-46027 A

保険代理店による顧客に対する適切な保険の提案は、担当者のノウハウに依存していることが多い。しかし、保険商品を提供する会社や保険の種類によっても保険証券のフォーマットや使われる用語などが多岐にわたり、すべてを記憶しておくことは難しい。そこで、保険の分析や提案の作成には、コンピュータを利用して品質の均一化および向上が望まれる。 Insurers are often dependent on the know-how of the person in charge to offer appropriate insurance to customers. However, depending on the company that provides insurance products and the type of insurance, the format of insurance policies and terms used are diverse, and it is difficult to memorize all of them. Therefore, for insurance analysis and proposal creation, it is desirable to use a computer to equalize and improve quality.

しかし、生命保険や損害保険の契約の内容は、１枚程度の保険証券に記載されることが多い。保険証券は、一般に、紙媒体に印刷されており、契約内容は保険会社が電子データとして保管しているとしても、保険会社の相手方の契約者は電子データとして保険内容を有していることはほとんどない。このため、契約内容を計算機に入力するためには、担当者などが手入力しなければならず、契約の分析や提案の作成に割く時間が削られる場合がある。 However, the contents of life insurance and non-life insurance contracts are often written on about one insurance policy. Insurance policy is generally printed on paper media, and even if the contract contents are stored as electronic data by the insurance company, the contractor of the other party of the insurance company has the insurance contents as electronic data. rare. For this reason, in order to input the contract contents to the computer, the person in charge or the like has to input it manually, and the time required for contract analysis and proposal creation may be reduced.

そこで、本発明は、保険証券などの分析対象画像から文字情報を抽出する精度を向上させることを目的とする。 Accordingly, an object of the present invention is to improve the accuracy of extracting character information from an analysis target image such as an insurance policy.

上述の目的を達成するため、本発明は、対象画像を分析する画像分析装置において、前記対象画像から文字情報を抽出する画像解析部と、前記文字情報の前記対象画像中の位置に基づいてその文字情報と対をなす文字情報とを紐づけする構造解析部と、を有することを特徴とする。 In order to achieve the above-described object, the present invention provides an image analysis apparatus that analyzes a target image, an image analysis unit that extracts character information from the target image, and a position of the character information in the target image. And a structure analysis unit that associates character information with a pair of character information.

また、本発明は、コンピュータに対象画像を分析させる画像分析プログラムにおいて、コンピュータに、前記対象画像から文字情報を抽出する画像解析機能と、前記文字情報の前記対象画像中の位置に基づいてその文字情報と対をなす文字情報とを紐づけする構造解析機能と、を実現させる。 According to another aspect of the present invention, there is provided an image analysis program for causing a computer to analyze a target image. The computer analyzes an image analysis function for extracting character information from the target image, and the character information based on the position of the character information in the target image. And a structure analysis function for associating the character information paired with the information.

また、本発明は、対象画像を分析する画像分析方法において、前記対象画像から文字情報を抽出する画像解析ステップと、前記文字情報の前記対象画像中の位置に基づいてその文字情報と対をなす文字情報とを紐づけする構造解析ステップと、を有することを特徴とする。 According to another aspect of the present invention, there is provided an image analysis method for analyzing a target image, wherein an image analysis step for extracting character information from the target image is paired with the character information based on a position of the character information in the target image. And a structural analysis step for linking the character information.

本発明によれば、保険証券などの分析対象画像から文字情報を抽出する精度を向上させることができる。 ADVANTAGE OF THE INVENTION According to this invention, the precision which extracts character information from analysis object images, such as insurance policy, can be improved.

本発明に係る画像分析装置の一実施の形態におけるブロック図である。It is a block diagram in one embodiment of an image analysis device concerning the present invention. 本発明に係る画像分析装置の一実施の形態の分析対象の保険証券画像の平面図である。It is a top view of the insurance policy image of the analysis object of one embodiment of the image analysis device concerning the present invention. 本発明に係る画像分析装置の一実施の形態の分析の流れを示すフローチャートである。It is a flowchart which shows the flow of the analysis of one Embodiment of the image analyzer which concerns on this invention. 本発明に係る画像分析装置の一実施の形態における保険証券画像の一部分の例である。It is an example of a part of insurance policy image in one embodiment of an image analysis device concerning the present invention. 本発明に係る画像分析装置の一実施の形態における前処理済画像の一部分の例である。It is an example of a part of a preprocessed image in an embodiment of an image analysis apparatus according to the present invention. 本発明に係る画像分析装置の一実施の形態における保険証券画像の一部分の例である。It is an example of a part of insurance policy image in one embodiment of an image analysis device concerning the present invention. 本発明に係る画像分析装置の一実施の形態における保険証券画像の一部分の例である。It is an example of a part of insurance policy image in one embodiment of an image analysis device concerning the present invention. 本発明に係る画像分析装置の一実施の形態におけるマッピング画面である。It is a mapping screen in one embodiment of the image analysis apparatus according to the present invention.

本発明に係る画像分析装置の一実施の形態を、図面を参照して説明する。なお、この実施の形態は単なる例示であり、本発明はこれに限定されない。同一または類似の構成には同一の符号を付し、重複する説明は省略する。 An image analysis apparatus according to an embodiment of the present invention will be described with reference to the drawings. This embodiment is merely an example, and the present invention is not limited to this. The same or similar components are denoted by the same reference numerals, and redundant description is omitted.

図１は、本発明に係る画像分析装置の一実施の形態におけるブロック図である。 FIG. 1 is a block diagram of an image analysis apparatus according to an embodiment of the present invention.

本実施の形態の画像分析装置１０は、画像記憶部２０と画像解析前処理部２１と前処理済画像記憶部２２と画像解析部２３と画像解析後処理部２４と構造解析部２５とマッピング部２６とパターン分類部２７とパターン記憶部２８と抽出文字列記憶部２９と辞書３０とスキャナー１２とディスプレイ１３とキーボード１４とマウス１５とを有している。画像分析装置１０は、たとえば１台のコンピュータ上に構成される。画像分析装置１０の一部、たとえば画像記憶部２０、画像解析前処理部２１、前処理済画像記憶部２２、画像解析部２３、画像解析後処理部２４、構造解析部２５、マッピング部２６、パターン分類部２７、パターン記憶部２８、抽出文字列記憶部２９、および、辞書３０の一部または全部は、ネットワークで互いに接続された複数のコンピュータ上に分散して配置されていてもよい。 The image analysis apparatus 10 according to the present embodiment includes an image storage unit 20, an image analysis preprocessing unit 21, a preprocessed image storage unit 22, an image analysis unit 23, an image analysis post-processing unit 24, a structure analysis unit 25, and a mapping unit. 26, a pattern classification unit 27, a pattern storage unit 28, an extracted character string storage unit 29, a dictionary 30, a scanner 12, a display 13, a keyboard 14, and a mouse 15. The image analysis apparatus 10 is configured on a single computer, for example. A part of the image analysis device 10, for example, an image storage unit 20, an image analysis pre-processing unit 21, a pre-processed image storage unit 22, an image analysis unit 23, an image analysis post-processing unit 24, a structure analysis unit 25, a mapping unit 26, Part or all of the pattern classification unit 27, the pattern storage unit 28, the extracted character string storage unit 29, and the dictionary 30 may be distributed on a plurality of computers connected to each other via a network.

図２は、本実施の形態の画像分析装置の分析対象の保険証券画像の平面図である。 FIG. 2 is a plan view of an insurance policy image to be analyzed by the image analysis apparatus of the present embodiment.

本実施の形態の画像分析装置１０は、たとえば紙に印刷された保険証券の画像（保険証券画像９０）を分析する。また、紙に印刷されたものだけではなく、スマートフォンやコンピュータのディスプレイに表示された内容を分析することもできる。 The image analysis apparatus 10 according to the present embodiment analyzes, for example, an insurance policy image (insurance policy image 90) printed on paper. Moreover, not only what was printed on paper but the content displayed on the display of a smart phone or a computer can also be analyzed.

次に、本実施の形態の画像分析装置１０を用いた保険証券画像９０の分析処理の流れを説明する。 Next, a flow of analysis processing of the insurance policy image 90 using the image analysis apparatus 10 of the present embodiment will be described.

図３は、本実施の形態の画像分析装置の分析の流れを示すフローチャートである。 FIG. 3 is a flowchart showing an analysis flow of the image analysis apparatus according to the present embodiment.

保険証券画像９０は、たとえばスキャナー１２でスキャンされて画像分析装置１０の画像記憶部２０に記憶される（Ｓ１）。スキャナー１２の代わりに、スマートフォンなどのカメラで撮像してもよい。カメラで撮像する場合には、画像に台形補正や水平補正を施すなどしてもよい。 The insurance policy image 90 is scanned by, for example, the scanner 12 and stored in the image storage unit 20 of the image analysis apparatus 10 (S1). You may image with a camera, such as a smart phone, instead of the scanner 12. FIG. When imaging with a camera, trapezoidal correction or horizontal correction may be performed on the image.

次に、画像解析前処理部２１は、画像記憶部２０に記憶された保険証券画像９０に対して前処理を施す（Ｓ２）。 Next, the image analysis preprocessing unit 21 performs preprocessing on the insurance policy image 90 stored in the image storage unit 20 (S2).

具体的には、まず、画像解析前処理部２１は、保険証券画像９０から、表組みの領域９１を検出する。表組みの領域は、たとえば所定の長さ以上の直線およびそれらの直線をつなぐ曲線、すなわち罫線９２で囲まれた領域を探すことにより検出することができる。 Specifically, first, the image analysis preprocessing unit 21 detects a table-set area 91 from the insurance policy image 90. The table area can be detected by searching for a straight line having a predetermined length or more and a curve connecting the straight lines, that is, an area surrounded by the ruled line 92.

図４は、本実施の形態における保険証券画像の一部分の例である。図５は、本実施の形態における前処理済画像の一部分の例である。 FIG. 4 is an example of a part of an insurance policy image in the present embodiment. FIG. 5 is an example of a part of the preprocessed image in the present embodiment.

次に、画像解析前処理部２１は、表組みの領域９１から罫線９２を除去し、前処理済画像９３を生成する。前処理済画像９３は、画像解析前処理部２１から出力され、前処理済画像記憶部２２に記憶される。 Next, the image analysis preprocessing unit 21 removes the ruled line 92 from the table region 91 to generate a preprocessed image 93. The preprocessed image 93 is output from the image analysis preprocessing unit 21 and stored in the preprocessed image storage unit 22.

図６は、本実施の形態における前処理済画像の一部分の他の例である。 FIG. 6 is another example of a part of the preprocessed image in the present embodiment.

さらに、画像解析前処理部２１は、図６に示すように、前処理済画像９３に含まれる画像の並べ替えを行ってもよい。画像の並べ替えにおいては、文字列が含まれる画像を塊ごとに分割する。文字列が含まれる画像の塊をたとえば上下に並べる。 Further, the image analysis preprocessing unit 21 may rearrange the images included in the preprocessed image 93 as shown in FIG. In rearranging images, an image including a character string is divided into chunks. For example, image blocks including character strings are arranged vertically.

この画像の並べ替えは、表組みの領域９１の各セル９４を、項目名に対して項目の内容が横に並ぶようにしてもよい。あるいは、それぞれのセル９４を縦に並べてもよい。並べ替えを行う際には、並べ替えを行う前のそれぞれのセル９４の位置をセル９４の画像とともに記憶させておく。 The rearrangement of the images may be such that the contents of the items are arranged side by side with respect to the item names in the cells 94 of the table region 91. Alternatively, the cells 94 may be arranged vertically. When rearrangement is performed, the position of each cell 94 before the rearrangement is stored together with the image of the cell 94.

画像解析前処理部２１が前処理済画像９３に含まれる画像の並べ替えを行う際、それぞれのセル９４の間に、「＝＝＝＝＝＝」などの区切り文字（デリミタ）を画像として挿入してもよい。 When the image analysis preprocessing unit 21 rearranges the images included in the preprocessed image 93, a delimiter (delimiter) such as “======” is inserted between the cells 94 as an image. May be.

次に、画像解析部２３は、前処理済画像９３を画像処理して、文字情報を抽出する（Ｓ３）。画像解析部２３は、一般的なＯＣＲ（光学式文字認識：ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ）のエンジンを用いることができる。ＯＣＲエンジンとして、インターネット上で公開されているものを用いてもよい。 Next, the image analysis unit 23 performs image processing on the preprocessed image 93 and extracts character information (S3). The image analysis unit 23 can use a general OCR (Optical Character Recognition) engine. As the OCR engine, an OCR engine published on the Internet may be used.

画像解析前処理部２１が罫線９２を除去しておくことにより文字認識の精度が向上する。また、画像解析前処理部２１が画像をたとえばセル９４を単位として並べ替えておくことにより、一群とすべき文字を認識しやすくなるため、文字認識の精度が全体として向上する。 Since the image analysis preprocessing unit 21 removes the ruled lines 92, the accuracy of character recognition is improved. In addition, since the image analysis preprocessing unit 21 rearranges the images in units of cells 94, for example, it becomes easier to recognize characters that should be grouped, so that the accuracy of character recognition is improved as a whole.

画像解析部２３が抽出した文字情報は、その文字情報の保険証券画像９０上の位置とともに、抽出文字列記憶部２９に記憶される。画像解析前処理部２１が前処理済画像９３に含まれる画像の並べ替えを行っていた場合には、前処理済画像９３上の文字情報の位置に基づいて記憶しておいたセル９４の位置と紐づけることにより、文字情報の保険証券画像９０上の位置がわかる。 The character information extracted by the image analysis unit 23 is stored in the extracted character string storage unit 29 together with the position of the character information on the insurance policy image 90. When the image analysis preprocessing unit 21 has rearranged the images included in the preprocessed image 93, the position of the cell 94 stored based on the position of the character information on the preprocessed image 93 , The position of the character information on the insurance policy image 90 is known.

前処理済画像９３に区切り文字（デリミタ）を画像として挿入していた場合には、その区切り文字も文字情報として抽出される。その結果、区切り文字で文字情報を分割していくことにより、各セル９４に含まれる文字列を容易に取り出すことができる。 When a delimiter (delimiter) is inserted as an image in the preprocessed image 93, the delimiter is also extracted as character information. As a result, the character string included in each cell 94 can be easily extracted by dividing the character information by the delimiter characters.

次に、画像解析後処理部２４は、抽出した文字情報に対して後処理を施す（Ｓ４）。後処理では、たとえば抽出した文字情報の認識の正誤を辞書３０を用いてチェックする。 Next, the image analysis post-processing unit 24 performs post-processing on the extracted character information (S4). In the post-processing, for example, the dictionary 30 is used to check whether the extracted character information is recognized correctly.

図７は、本実施の形態における保険証券画像の一部分の例である。 FIG. 7 is an example of a part of an insurance policy image in the present embodiment.

たとえば図７に示される画像をＯＣＲした結果、「入院拾付金日額」と認識した場合について考える。この結果文字列「入院拾付金日額」を、連続するたとえば２文字ずつ、先頭文字が結果文字列の先頭から１文字ずつ右に進むように複数のサブ文字列に分割する。つまり、「入院」、「院拾」、「拾付」、「付金」、「金日」、「日額」、という６つのサブ文字列に分割する。したがって、結果文字列の中で先頭文字が隣り合うサブ文字列は、たとえば１文字の重複文字を有することになる。 For example, consider the case where the image shown in FIG. 7 is recognized as “daily amount for hospitalization” as a result of OCR. The result character string “daily amount for hospitalization pick-up” is divided into a plurality of sub character strings so that the first character advances to the right one character at a time from the beginning of the result character string, for example, two consecutive characters. That is, it is divided into six sub-character strings of “hospitalization”, “hospital pick-up”, “pick-up”, “payment”, “gold day”, and “daily amount”. Therefore, the sub character string in which the first character is adjacent in the result character string has, for example, one overlapping character.

次に、それぞれの分割文字列「入院」、「院拾」、「拾付」、「付金」、「金日」および「日額」について、辞書３０に記憶された辞書での出現頻度を調べる。その結果、たとえば、出現頻度が、「入院」については２００、「院拾」については０、「拾付」については５、「付金」については１００、「金日」については５０、「日額」については１００となる。このように、ＯＣＲでの誤りを含む文字列である「院拾」および「拾付」については、出現頻度が極端に小さくなる。そこで、所定の閾値よりも出現頻度が小さい文字列に共通した部分（重複文字）、すなわち３番目の文字である「拾」は、ＯＣＲの誤りであると判定する。 Next, with respect to each divided character string “hospital”, “hospital pick-up”, “pick-up”, “payment”, “gold day” and “daily amount”, the appearance frequency in the dictionary stored in the dictionary 30 is determined. Investigate. As a result, for example, the appearance frequency is 200 for “hospitalization”, 0 for “hospital pick-up”, 5 for “pickup”, 100 for “payment”, 50 for “Friday”, “day “Forehead” is 100. As described above, the appearance frequency of the character string including the error in the OCR, that is, “pick-up” and “pick-up” becomes extremely small. Therefore, a portion (duplicate character) common to character strings whose appearance frequency is lower than a predetermined threshold, that is, “pickup” that is the third character is determined to be an OCR error.

さらに、「院」の後に１文字付加した文字列、および「付」の前に１文字付加した文字列について辞書で出現頻度を調べて、付加すべき文字が共通する文字列であって、出現頻度が最も高い文字列を抽出する。抽出された文字列に共通する文字（付加すべき文字）が正しい文字であるとする。 Furthermore, the appearance frequency of the character string added with one character after “in” and the character string added with one character before “attached” is checked in the dictionary, and the character string to be added is a common character string. Extract the string with the highest frequency. It is assumed that a character common to the extracted character string (a character to be added) is a correct character.

このように、画像解析後処理部２４が画像解析によって得られた文字列をＮ−ｇｒａｍ方式で分割して辞書と突き合わせることにより、文字情報の認識精度が向上する。特に、日本語、韓国語、中国語などの単語の区切りがない言語が記述された文書の画像の分析に有効である。再帰型ニューラルネットワーク（ＲＮＮ：ＲｅｃｕｒｒｅｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）を用いて、文字認識の正誤を学習し、文字の認識の正誤を判定してもよい。 In this way, the character analysis recognition accuracy is improved by the image analysis post-processing unit 24 dividing the character string obtained by the image analysis by the N-gram method and matching it with the dictionary. In particular, it is effective for analyzing images of documents in which languages such as Japanese, Korean, Chinese, and the like that do not have word breaks are described. A recursive neural network (RNN) may be used to learn the correctness of character recognition and determine the correctness of character recognition.

また、後処理において、抽出した文字情報の形態素解析を行ってもよい。形態素解析によって、抽出した文字情報に含まれる文字列の品詞を分析することができる。分析した結果は、文字列とともに記憶しておく。品詞だけではなく、固有名詞に該当するか否か、日付に該当するか否か、数量に該当するか否かなどを分析してもよい。 In post-processing, morphological analysis of the extracted character information may be performed. By morphological analysis, the part of speech of the character string included in the extracted character information can be analyzed. The analysis result is stored together with the character string. You may analyze not only a part of speech but whether it corresponds to a proper noun, whether it corresponds to a date, whether it corresponds to a quantity, etc.

次に、パターン分類部２７は、保険証券画像９０のパターンを判定する（Ｓ５）。具体的には、保険証券画像９０が、パターン記憶部２８に記憶された書類のパターンのうち、どのパターンに該当するかを判定する。 Next, the pattern classification unit 27 determines the pattern of the insurance policy image 90 (S5). Specifically, it is determined which of the document patterns stored in the pattern storage unit 28 the insurance policy image 90 corresponds to.

ここでパターンとは、特定の保険会社や保険商品などの種類ごとに共通に用いられる文書の形式である。保険証券画像９０の特徴とは、たとえば罫線・枠線などの配置や、会社名・保険商品名が保険証券に記載されているか否かなどである。たとえば、保険証券画像９０のそれぞれの画素の値を横方向に足し合わせたものを縦方向の位置ごとに並べた横方向画素濃度スペクトル、および、保険証券画像９０のそれぞれの画素の値を縦方向に足し合わせたものを横方向の位置ごとに並べた縦方向画素濃度スペクトルを用いてフォーマットを特定することができる。横方向画素濃度スペクトルおよび縦方向画素濃度スペクトルを、パターン記憶部２８にあらかじめ記憶しておいた基準となるパターンのものと比較し、類似度を算出することにより、パターンを特定することができる。パターンの特定あるいは罫線・枠線の認識には、カスケード型分類器（Ｈａａｒ分類器）を用いてもよい。パターンの判定に、会社名や商品名などの特定の文字列が特定の位置に記載されていることを用いてもよい。 Here, the pattern is a document format commonly used for each type of a specific insurance company or insurance product. The features of the insurance policy image 90 include, for example, the arrangement of ruled lines and frame lines, and whether or not the company name / insurance product name is described in the insurance policy. For example, the horizontal pixel density spectrum in which values obtained by adding the pixel values of the insurance policy image 90 in the horizontal direction are arranged for each position in the vertical direction, and the pixel values of the insurance policy image 90 are set in the vertical direction. The format can be specified by using the vertical pixel density spectrum in which the sum of the values is arranged for each position in the horizontal direction. The pattern can be specified by comparing the horizontal pixel density spectrum and the vertical pixel density spectrum with the reference pattern stored in advance in the pattern storage unit 28 and calculating the similarity. A cascade type classifier (Haar classifier) may be used for pattern identification or ruled line / frame line recognition. For pattern determination, it may be used that a specific character string such as a company name or a product name is described at a specific position.

次に、構造解析部２５は、画像解析部２３が抽出し、必要に応じて画像解析後処理部２４が修正した文字情報のそれぞれについて、項目名のキーと、その項目名の項目の値のバリューとを組として組み合わせる（紐づけする）（Ｓ６）。 Next, the structure analysis unit 25 extracts the item name key and the value of the item name item for each piece of character information extracted by the image analysis unit 23 and corrected by the image analysis post-processing unit 24 as necessary. The value is combined (linked) as a set (S6).

項目名となりうる文字列を記憶したデータベースに対象とする文字列が含まれるか否かを検索することにより、キーを特定することができる。あるいは、表組の領域９１の左側あるいは上側のセル９４に記載された文字列をキーとしてもよい。 A key can be specified by searching whether or not a target character string is included in a database storing character strings that can be item names. Or it is good also considering the character string described in the cell 94 of the left side or the upper side of the area | region 91 of a table | surface group as a key.

キーとバリューの紐づけには、構造化距離を用いる。構造化距離とは、実際の記載位置間の距離に、文書上の語句の記載位置の制約を加味した仮想的な距離である。文書上の語句の記載位置の制約とは、日本語の横書きの場合には、文字は、左から右に向かって、上から下に向かって記載されること、罫線が存在する場合には罫線に沿って記載されること、枠線で囲まれている領域内に存在する文字は一群の意味を形成していること、などである。 A structured distance is used to associate a key and a value. The structured distance is a virtual distance obtained by adding a restriction on a description position of a word on a document to a distance between actual description positions. What is the restriction on the position of words in a document? In Japanese horizontal writing, characters are written from left to right and from top to bottom, and when there are ruled lines, ruled lines And the characters existing in the area surrounded by the frame form a group of meanings.

構造化距離は、物理的距離（文字列間の距離）、文字列の位置および文字列周辺の画像の関数である。たとえば構造化距離は、物理的距離に構造化係数を乗じたものである。構造化距離を物理的距離に構造化バイアスを加えた値としてもよい。物理的距離とは、第１文字列の最後、すなわち、横書きの場合には右端の文字と、第２文字列の最初、すなわち、横書きの場合には左端の文字との間の距離である。 The structured distance is a function of the physical distance (distance between character strings), the position of the character string, and the image around the character string. For example, the structured distance is the physical distance multiplied by the structuring factor. The structured distance may be a physical distance plus a structured bias. The physical distance is the distance between the end of the first character string, that is, the rightmost character in the case of horizontal writing, and the first character of the second character string, that is, the leftmost character in the case of horizontal writing.

構造化係数は、第１文字列と第２文字列との間に水平方向の罫線９２があると大きくなるようにする。また、構造化係数は、項目と考えられる第１文字列が上下に配列されている場合には、第１文字列と第２文字列との間に垂直方向の罫線９２が複数存在すると大きくなるようにしてもよい。構造化係数は、項目と考えられる第１文字列が左右に配列されている場合には、第１文字列と第２文字列との間に水平方向の罫線９２が複数存在すると大きくなるようにしてもよい。また、右側に向かう場合には、構造化係数を正とし、左側に向かう場合には構造化係数を負としてもよい。 The structuring coefficient is increased when there is a horizontal ruled line 92 between the first character string and the second character string. In addition, when the first character strings considered as items are arranged one above the other, the structuring coefficient increases when there are a plurality of vertical ruled lines 92 between the first character string and the second character string. You may do it. When the first character string considered as an item is arranged on the left and right, the structuring coefficient is increased when there are a plurality of horizontal ruled lines 92 between the first character string and the second character string. May be. In addition, when going to the right side, the structuring coefficient may be positive, and when going to the left side, the structuring coefficient may be negative.

キーとバリューの組み合わせには、第１文字列から第２文字列へ向かうベクトルを用いてもよい。 For the combination of key and value, a vector from the first character string to the second character string may be used.

また、項目名が記載されていないバリューの候補リストを、「商品名」などの記載されない項目名とともに、予め記憶しておいてもよい。抽出した文字情報が、この候補リストの中の一つに該当する場合には、その抽出した文字情報をバリューとし、対応する項目名をキーとして記憶してもよい。 In addition, a value candidate list in which no item name is described may be stored in advance together with an item name not described such as “product name”. When the extracted character information corresponds to one of the candidate lists, the extracted character information may be stored as a value and the corresponding item name may be stored as a key.

後処理（Ｓ４）において形態素解析などを行っていた場合には、品詞などの文字列の特徴を紐づけに用いてもよい。たとえば、特定のキーに対するバリューとしては数量しか対応しないなどのルールを用いて紐づけすることもできる。 When morphological analysis or the like is performed in post-processing (S4), character string features such as part of speech may be used for association. For example, it can be linked using a rule that only a quantity corresponds to a value for a specific key.

キーとバリューとは、単純な一対一の関係でなくてもよい。たとえば、一つのキーに対して複数のバリューを組み合わせてもよい。キーは、メインキーにサブキーを従属させたような階層構造（ツリー構造）になっていてもよい。この場合、階層は３以上であってもよい。 Keys and values do not have to be a simple one-to-one relationship. For example, a plurality of values may be combined for one key. The key may have a hierarchical structure (tree structure) in which a sub key is subordinate to a main key. In this case, the number of hierarchies may be three or more.

次に、マッピング部２６は、元の保険証券画像９０上に抽出した文字列をマッピングする（Ｓ７）。 Next, the mapping unit 26 maps the extracted character string on the original insurance policy image 90 (S7).

図８は、本実施の形態の画像分析装置におけるマッピング画面である。 FIG. 8 is a mapping screen in the image analysis apparatus of the present embodiment.

マッピング画面４０は、ディスプレイ１３に表示される。マッピング画面４０は、画像表示部４１と対応表示部４２とを備えている。画像表示部４１には、保険証券画像９０が表示される。対応表示部４２には、キーとバリューの組み合わせが表示される。 The mapping screen 40 is displayed on the display 13. The mapping screen 40 includes an image display unit 41 and a correspondence display unit 42. An insurance policy image 90 is displayed on the image display unit 41. The correspondence display unit 42 displays combinations of keys and values.

抽出した文字列は、キーおよびバリューの組として表示される。マッピング部２６は、さらに、構造解析部２５が組み合わせたキーおよびバリューの対応関係の確認をユーザーに促し、誤りがある場合には修正できるようにする。 The extracted character string is displayed as a set of key and value. The mapping unit 26 further prompts the user to confirm the correspondence relationship between the key and the value combined by the structure analysis unit 25 so that it can be corrected if there is an error.

具体的には、まず、たとえば保険証券画像９０および抽出した文字情報をディスプレイ１３の画像表示部４１に表示する。抽出した文字情報は、たとえば保険証券画像９０の左側の対応表示部４２に、キーおよびバリューの組として表示される。ここで最初に表示されたキーおよびバリューの組は、構造解析部２５が作成したものである。 Specifically, first, for example, the insurance policy image 90 and the extracted character information are displayed on the image display unit 41 of the display 13. The extracted character information is displayed, for example, as a key and value pair on the correspondence display section 42 on the left side of the insurance policy image 90. Here, the key and value pairs displayed first are created by the structure analysis unit 25.

また、文字情報を抽出した部分に色付けをする。色付けされた部分にマウスの操作などでポインターを移動させてクリックした場合には、キーおよびバリューの組が表示された部分においてその部分に該当するキーまたはバリューの文字情報が他と区別できるように、たとえば色を変化させることにより表示される。この際、保険証券画像９０上では、選択された部分に対応するキーまたはバリューと紐づけられている文字情報が読み取られた部分が他と区別できるように、たとえば色を変化させることにより表示される。ユーザーは、この対応関係に間違いがない場合には、放置し、あるいは、間違いがないことを入力する。この対応関係に間違いがある場合には、キーおよびバリューの組が表示された部分において、キーまたはバリューの値を修正する。 In addition, the portion where the character information is extracted is colored. When the pointer is moved to the colored part by mouse operation etc. and clicked, the character information of the key or value corresponding to that part can be distinguished from the part where the key and value pair is displayed. For example, it is displayed by changing the color. At this time, on the insurance policy image 90, for example, it is displayed by changing the color so that the portion from which the character information associated with the key or value corresponding to the selected portion is read can be distinguished from the others. The If there is no mistake in this correspondence, the user leaves it or inputs that there is no mistake. If there is an error in the correspondence, the key or value value is corrected in the portion where the key and value pair is displayed.

修正に際しては、文字認識の誤りであれば、キーボードなどから入力する。対応付け（紐づけ）に誤りがある場合には、対応するキーまたはバリューが記されている部分を保険証券画像９０上でクリックするなどして、対応関係を修正してもよい。 When correcting, if there is an error in character recognition, input from the keyboard or the like. If there is an error in the association (association), the correspondence relationship may be corrected by clicking on the portion of the insurance policy image 90 where the corresponding key or value is written.

このように、本実施の形態では、文字認識の結果をユーザーが修正することにより、誤りを低減することができる。 Thus, in the present embodiment, errors can be reduced by the user correcting the result of character recognition.

さらに、ユーザーによる修正を学習することにより、文字認識の精度およびマッピング（紐づけ）の精度を向上させることができる。たとえば、特定の文字を他の文字と誤認識していて、それをユーザーが修正する回数が増えていくことにより、その特定の文字を正しく認識させるようにすることができる。また、マッピングが誤っていた場合は、ユーザーが紐づけを修正することにより、正しくキーとバリューを組み合わせられるようになっていく。 Further, by learning the correction by the user, the accuracy of character recognition and the accuracy of mapping (linking) can be improved. For example, when a specific character is misrecognized as another character and the number of times the user corrects it increases, the specific character can be recognized correctly. Also, if the mapping is incorrect, the user can correctly combine the key and value by correcting the association.

保険証券画像９０がすでに記憶しているフォーマットと適合しているとパターン分類部２７が認識できた場合には、その保険証券画像９０上のキーの位置およびバリューの位置が把握されたことになる。したがって、この場合には、構造解析部２５では、キーとバリューの紐づけをすることが容易にできる。 If the pattern classification unit 27 can recognize that the insurance policy image 90 matches the format already stored, the key position and the value position on the insurance policy image 90 have been grasped. . Therefore, in this case, the structure analysis unit 25 can easily associate the key and the value.

保険証券画像９０がすでに記憶しているフォーマットと適合しているとパターン分類部２７が認識できない場合には、その保険証券画像９０上のキーの位置およびバリューの位置を把握する必要がある。しかし、マッピング部２６によるマッピングおよびマッピングされたキーおよびバリューの組み合わせをユーザーが必要に応じて修正することにより、キーとバリューの組み合わせの誤りをきわめて少なくすることができる。 If the pattern classification unit 27 cannot recognize that the insurance policy image 90 matches the format already stored, it is necessary to grasp the key position and the value position on the insurance policy image 90. However, the user corrects the mapping by the mapping unit 26 and the combination of the mapped key and value as necessary, so that errors in the combination of key and value can be extremely reduced.

このように本実施の形態の画像分析装置１０は、文字情報の保険証券画像９０中の位置に基づいてその文字情報と対をなす文字情報とを紐づけする構造解析部２５を有している。このため、分析対象画像である保険証券画像９０で表される文書の構造、すなわち、項目名とそれに対応する値との対応関係を把握することができる。その結果、項目名とそれに対応する値との対応関係に基づいて、文字情報の認識精度を向上させることができる。 As described above, the image analysis apparatus 10 according to the present embodiment has the structure analysis unit 25 that associates character information with a pair of character information based on the position of the character information in the insurance policy image 90. . Therefore, it is possible to grasp the structure of the document represented by the insurance policy image 90 that is the analysis target image, that is, the correspondence between the item name and the corresponding value. As a result, the recognition accuracy of character information can be improved based on the correspondence between the item name and the corresponding value.

また、構造解析部２５は、２つの文字情報の保険証券画像９０中の距離とその画像中の文字情報が抽出された周辺の領域の画像との関数として求められる構造化距離に基づいて紐づけを行っている。このため、単に文字列相互の関係だけでなく、２つの文字列の位置関係や、間に罫線があることなどの情報に基づいて、項目名とそれに対応する値との対応関係をより正確に把握することができる。 Further, the structure analysis unit 25 associates the two character information based on the structured distance obtained as a function of the distance in the insurance policy image 90 and the image of the surrounding area from which the character information in the image is extracted. It is carried out. Therefore, based on information such as not only the relationship between character strings but also the positional relationship between two character strings and the presence of ruled lines between them, the relationship between item names and corresponding values can be more accurately determined. I can grasp it.

さらに、本実施の形態において、マッピング部２６は、構造解析部２５が紐づけした文字情報の組を保険証券画像９０と対応付けて表示し、文字情報の組の組み合わせに誤りがある場合にユーザーによる修正の入力を受け付ける。このため、項目名とそれに対応する値との対応関係をより正確に把握することができる。さらに、ユーザーによる修正の履歴を学習していくことにより、項目名とそれに対応する値との対応関係の把握の精度を向上させることができる。 Further, in the present embodiment, the mapping unit 26 displays the character information set linked by the structure analysis unit 25 in association with the insurance policy image 90, and when there is an error in the combination of character information Accepts correction input by. For this reason, the correspondence between the item name and the value corresponding thereto can be grasped more accurately. Furthermore, by learning the correction history by the user, it is possible to improve the accuracy of grasping the correspondence relationship between the item name and the value corresponding thereto.

学習においては、キーおよびバリューのそれぞれの画像上の位置そのもの、あるいは、キーからバリューに向かうベクトルを学習しておくことにより、対応関係の把握精度を向上させることもできる。 In learning, it is possible to improve the accuracy of grasping the correspondence relationship by learning the position of each of the key and value on the image itself or the vector from the key to the value.

パターン分類部２７は、保険証券画像９０がパターン記憶部２８に記憶されたパターンのいずれに該当するか判定する。このため、保険証券画像９０がパターン記憶部２８に記憶されたパターンと適合する場合には、項目名とそれに対応する値との対応関係を容易に把握することができる。したがって、構造解析部２５は、項目名とそれに対応する値との対応関係を容易に紐づけすることができる。 The pattern classification unit 27 determines which of the patterns stored in the pattern storage unit 28 the insurance policy image 90 corresponds to. For this reason, when the insurance policy image 90 matches the pattern stored in the pattern storage unit 28, it is possible to easily grasp the correspondence between the item name and the value corresponding thereto. Therefore, the structure analysis unit 25 can easily associate the correspondence between the item name and the corresponding value.

また、本実施の形態は、保険証券を対象として分析するシステムを例として説明したが、ある程度定型で記載される用語がある程度限定された他の印刷物等に記載された内容を分析するシステムにも適用できる。たとえば、運転免許証の記載内容を分析したり、マンションのパンフレットを分析したり、預金通帳から預金の流れを分析したり、登記簿謄本の記載内容を分析したり、健康診断の結果を分析したりすることができる。 In addition, although the present embodiment has been described with respect to an example of a system that analyzes insurance policies, it is also applicable to a system that analyzes contents described in other printed materials, etc., in which terms that are described in a certain form are limited to some extent. Applicable. For example, you can analyze the contents of a driver's license, analyze a pamphlet of an apartment, analyze the flow of deposits from a bank passbook, analyze the contents of a registered copy, and analyze the results of a health checkup. Can be.

１０…画像分析装置、１２…スキャナー、１３…ディスプレイ、１４…キーボード、１５…マウス、２０…画像記憶部、２１…画像解析前処理部、２２…前処理済画像記憶部、２３…画像解析部、２４…画像解析後処理部、２５…構造解析部、２６…マッピング部、２７…パターン分類部、２８…パターン記憶部、２９…抽出文字列記憶部、３０…辞書、４０…マッピング画面、４１…画像表示部、４２…対応表示部、９０…保険証券画像、９１…表組みの領域、９２…罫線、９３…前処理済画像、９４…セル DESCRIPTION OF SYMBOLS 10 ... Image analyzer, 12 ... Scanner, 13 ... Display, 14 ... Keyboard, 15 ... Mouse, 20 ... Image storage part, 21 ... Image analysis pre-processing part, 22 ... Pre-processed image storage part, 23 ... Image analysis part , 24 ... Image analysis post-processing unit, 25 ... Structure analysis unit, 26 ... Mapping unit, 27 ... Pattern classification unit, 28 ... Pattern storage unit, 29 ... Extracted character string storage unit, 30 ... Dictionary, 40 ... Mapping screen, 41 ... Image display section, 42 ... Corresponding display section, 90 ... Insurance policy image, 91 ... Table layout area, 92 ... Ruled line, 93 ... Pre-processed image, 94 ... Cell

Claims

In an image analysis apparatus for analyzing a target image,
An image analysis unit for extracting character information from the target image;
A structure analysis unit for associating the character information paired with the character information based on the position of the character information in the target image;
An image analyzing apparatus comprising:

The structural analysis unit performs association based on a structured distance obtained as a function of a distance between the two pieces of character information in the target image and an image of a peripheral region from which the character information of the target image is extracted. The image analysis apparatus according to claim 1.

A mapping unit that displays the set of character information linked by the structural analysis unit in association with the target image, and that receives a correction input by a user when the combination of the set of character information includes an error; The image analysis apparatus according to claim 2.

The image analysis apparatus according to claim 3, wherein the structure analysis unit learns the correction input history received by the mapping unit and reflects it in the association.

A pattern storage unit that stores a plurality of patterns of the target image and positions where the pairs of character information are described in the respective patterns;
A pattern classification unit that determines which of the patterns stored in the pattern storage unit the target image corresponds to,
When it is determined that the target image corresponds to any one of the patterns stored in the pattern storage unit, the structure analysis unit performs the character information based on a position where the pair of character information is described in the pattern. The image analysis apparatus according to claim 2, wherein the linking is performed.

The target image includes characters and graphics other than characters,
The image analysis apparatus according to claim 1, further comprising an image analysis preprocessing unit that removes at least a part of a figure other than the characters from the target image.

A sub-character string having two or more divided characters that partially overlap each other is generated from at least a part of the target character string of the character information, an appearance frequency of the sub-character string in the dictionary is obtained, and the appearance frequency is predetermined The image analysis post-processing unit further determines that a duplicate character common to the sub-character string smaller than the value of the character string is an error in the character information by the image analysis unit. The image analysis apparatus according to any one of the above.

In an image analysis program for causing a computer to analyze a target image,
An image analysis function for extracting character information from the target image;
A structure analysis function for associating the character information paired with the character information based on the position of the character information in the target image;
An image analysis program that realizes

In an image analysis method for analyzing a target image,
An image analysis step of extracting character information from the target image;
A structural analysis step for associating the character information paired with the character information based on the position of the character information in the target image;
An image analysis method characterized by comprising: