JP2023016031A - Character recognition method for recognizing information contained in table and character recognition system

Info

Publication number: JP2023016031A
Application number: JP2022115078A
Authority: JP
Inventors: ギウクキム; Gee Wook Kim; スンシン; Xin Sun; ヨンミンペク; Yong-Min Baek; ヒョソンワン; Hyosun Wang; ジュングンキム; Jung Gun Kim; スンボムチェー; Seung Beom Choi
Original assignee: Line Corp; Naver Corp
Current assignee: Z Intermediate Global Corp; Naver Corp
Priority date: 2021-07-20
Filing date: 2022-07-19
Publication date: 2023-02-01
Anticipated expiration: 2042-07-19
Also published as: KR20230013849A; JP7398526B2

Abstract

PROBLEM TO BE SOLVED: To provide a method and system for analyzing an image to generate a table. SOLUTION: A character recognition method according to the present invention comprises: receiving an image that contains a table; recognizing the text contained in a plurality of cells constituting the table; specifying a to-be-corrected cell from the plurality of cells on the basis of a preset criterion; specifying at least one related cell, which is related to the to-be-corrected cell, from the plurality of cells; and correcting the text contained in the to-be-corrected cell using a vector of the to-be-corrected cell, the vector being calculated using the text contained in the to-be-corrected cell and the related cell. SELECTED DRAWING: Figure 1

Description

The present invention relates to a character recognition method and a character recognition system for recognizing information contained in a table.

Artificial intelligence, by its dictionary definition, is technology that implements human abilities such as learning, reasoning, perception, and natural language understanding in computer programs. It has advanced dramatically through deep learning, which adds neural networks modeled on the human brain to machine learning.

Deep learning is a technology that enables computers to judge and learn as humans do and thereby to cluster or classify objects and data. In recent years it has become able to analyze image data as well as text data, and it is actively used in a wide variety of industrial fields.

With these advances in artificial intelligence, various kinds of automation are also taking place in the field of office automation. In particular, considerable effort is going into digitizing content printed on paper on the basis of AI-based image data analysis technology. As part of this, paper documents are converted into images, and the content of the documents is converted into data by image analysis technology (or image data analysis technology) that analyzes the content contained in the images. This requires techniques that analyze an image according to the characteristics of the content contained in the document.

For example, when a document containing a table is converted into data, accurate analysis is required of the various elements related to the table, such as the format of the table, the content of the text contained in the table, and the position of that text.

Patent Document 1 (Table generation device and method for form automation) discloses a method of recognizing a table from an image and reproducing the recognized table. However, because that method reproduces the table on the basis of the line segments (lines) contained in the table, it is limited in how accurately it can analyze the content contained in the table.

Accordingly, there is a need for a character recognition method that can accurately generate even the content contained in a table.

Patent Document 1: Korean Registered Patent No. 10-1907029

An object of the present invention is to provide a character recognition method and a character recognition system that can convert the content (or information) contained in a table into data and that are robust against character recognition errors.

Another object of the present invention is to provide a character recognition method and a character recognition system that, when converting the content contained in a table into data, can capture that content accurately as data and that are robust against character recognition errors.

A further object of the present invention is to provide a character recognition method and a character recognition system that can convert the content contained in a table into data in consideration of the organic relationships among the items of content in the table and that are robust against character recognition errors.

A further object of the present invention is to provide a character recognition method and a character recognition system that can convert the content contained in a table into data while minimizing the amount of data processing and that are robust against character recognition errors.

A character recognition method according to the present invention may include: receiving an image containing a table; recognizing text contained in a plurality of cells constituting the table; identifying a correction target cell from among the plurality of cells on the basis of a preset criterion; identifying, from among the plurality of cells, at least one related cell related to the correction target cell; and correcting the text contained in the correction target cell using a vector of the correction target cell, the vector being calculated using the text contained in the correction target cell and the related cell.
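The steps of the method can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration and is not taken from this publication: the `Cell` class, the use of an OCR confidence threshold as the preset criterion, the choice of same-row and same-column cells as related cells, the character-bigram vector (a stand-in for whatever vector the actual system calculates), and the `reference` list of known texts with typical context.

```python
from collections import Counter
from dataclasses import dataclass
from math import sqrt


@dataclass
class Cell:
    row: int
    col: int
    text: str
    confidence: float  # hypothetical OCR confidence in [0, 1]


def char_bigrams(text):
    """Bag of character bigrams; a toy stand-in for a learned text vector."""
    padded = f" {text} "
    return Counter(padded[i:i + 2] for i in range(len(padded) - 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def correct_cells(cells, reference, threshold=0.8):
    """Return {(row, col): corrected_text} for every low-confidence cell."""
    corrected = {}
    for target in cells:
        # Assumed preset criterion: OCR confidence below a threshold.
        if target.confidence >= threshold:
            continue
        # Assumed notion of "related": cells sharing the target's row or column.
        related = [c for c in cells
                   if c is not target and (c.row == target.row or c.col == target.col)]
        context = " ".join(c.text for c in related)
        # Vector of the target cell, built from its own text and the related text.
        target_vec = char_bigrams(f"{target.text} {context}")
        best_text, _ = max(
            reference,
            key=lambda entry: cosine(target_vec, char_bigrams(f"{entry[0]} {entry[1]}")),
        )
        corrected[(target.row, target.col)] = best_text
    return corrected


# Made-up sample data: the OCR engine misread "l" as "1" in one cell.
cells = [
    Cell(0, 0, "Item", 0.99),
    Cell(0, 1, "Amount", 0.98),
    Cell(1, 0, "Consu1tation fee", 0.55),
    Cell(1, 1, "12000", 0.97),
]
# Hypothetical reference entries: (known text, typical surrounding text).
reference = [
    ("Consultation fee", "Item"),
    ("Prescription fee", "Item"),
    ("Patient name", "Resident registration number"),
]
result = correct_cells(cells, reference)
```

In this toy setup only the cell at row 1, column 0 falls below the threshold, so only that cell is compared against the reference entries; the other cells are left as recognized.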

A character recognition system according to the present invention includes a storage unit, a receiving unit that receives an image containing a table, and a control unit that recognizes text contained in a plurality of cells constituting the table contained in the image. The control unit may identify a correction target cell from among the plurality of cells on the basis of a preset criterion, identify, from among the plurality of cells, at least one related cell related to the correction target cell, and correct the text contained in the correction target cell using a vector of the correction target cell calculated using the text contained in the correction target cell and the related cell.

Further, a computer program according to the present invention includes a plurality of instructions that, when executed, may cause a computer to perform: receiving an image containing a table; recognizing text contained in a plurality of cells constituting the table; identifying a correction target cell from among the plurality of cells on the basis of a preset criterion; identifying, from among the plurality of cells, at least one related cell related to the correction target cell; and correcting the text contained in the correction target cell using a vector of the correction target cell calculated using the text contained in the correction target cell and the related cell.

As described above, the character recognition method and character recognition system according to the present invention recognize text from a table contained in an image and verify the recognized text, so that the content of the table contained in the image can be converted into data more accurately.

More specifically, when text in a table contained in an image has been recognized incorrectly, the character recognition method and character recognition system according to the present invention correct it, so that the content of the table contained in the image can be converted into data more accurately.

FIG. 1 is a conceptual diagram for explaining a character recognition system according to the present invention.
FIG. 2 is a conceptual diagram for explaining the components of a table.
FIG. 3 is a flowchart for explaining a character recognition method according to the present invention.
FIGS. 4, 5, 6A, 6B, 6C, 7, and 8 are conceptual diagrams for explaining a method by which the character recognition system recognizes information contained in a table.

Hereinafter, embodiments of the present invention are described in detail with reference to the accompanying drawings. The same or similar components are given the same reference numerals regardless of the drawing number, and redundant descriptions of them are omitted. The suffixes "module" and "unit" used for components in the following description are assigned or used interchangeably only for ease of drafting the specification and do not in themselves have distinct meanings or roles. In describing the embodiments, detailed descriptions of related known techniques are omitted where they would obscure the gist of the embodiments. Furthermore, the accompanying drawings are intended only to aid understanding of the embodiments; they do not limit the technical idea of the present invention, which should be understood to include all modifications, equivalents, and alternatives falling within its spirit and technical scope.

Terms including ordinal numbers, such as "first" and "second", are used to describe various components, but the components are not limited by these terms. The terms are used only to distinguish one component from another.

When a component is referred to as being "coupled" or "connected" to another component, it may be directly coupled or connected to that component, or other components may be present in between. In contrast, when a component is referred to as being "directly coupled" or "directly connected" to another component, it should be understood that no other component is present in between.

Singular expressions include plural expressions unless otherwise stated.

In this specification, terms such as "include" and "have" are intended to specify the presence of the features, numbers, steps, operations, components, parts, or combinations thereof described in the specification, and should be understood not to preclude the presence or possible addition of one or more other features, numbers, steps, operations, components, parts, or combinations thereof.

As described above, office automation is taking place in various forms as artificial intelligence develops, and, for the sake of work efficiency, there is a growing need to digitize tables contained in paper documents while preserving the format of those tables.

For example, many kinds of documents, such as receipts, contain tables, and organizations such as insurance companies and hospitals need to process the tables contained in large volumes of paper documents as digitized data.

In response to this need, techniques for generating or reproducing a table with the same structure as a table contained in an image are being actively developed.

In techniques that recognize the content (or information) of a table contained in an image and convert the recognized content into data, it is very important to recognize accurately the various elements related to the table (for example, the text, the position of the text, the structure of the cells, the positions of the cells, and the relationships among the cells).

To that end, various efforts are being made to recognize a table contained in an image accurately by accurately recognizing the various components constituting the table and correcting any elements that were recognized incorrectly.

Hereinafter, a method of correcting content (or information) that has been extracted differently from the actual content of a table contained in an image is described in more detail with reference to the accompanying drawings. FIG. 1 is a conceptual diagram for explaining a character recognition system according to the present invention, and FIG. 2 is a conceptual diagram for explaining the components of a table. FIG. 3 is a flowchart for explaining a character recognition method according to the present invention, and FIGS. 4, 5, 6A, 6B, 6C, 7, and 8 are conceptual diagrams for explaining a method by which the character recognition system recognizes information contained in a table.

The character recognition system 100 according to the present invention recognizes, from the table 1100 contained in the image 1000, the content contained in the table 1100 and the components constituting the table 1100, and can accurately convert the content contained in the table 1100 into data on the basis of the recognized content and the relationships among the components.

Here, the content contained in the table 1100 means any meaningful code system, such as letters, numbers, symbols, and operators. For convenience of description, the present invention refers to "any meaningful code system, such as letters, numbers, symbols, and operators" collectively as "text".

For example, as illustrated, the table 1100 contains text such as "Patient name", "Resident registration number", and "Breakdown of medical expenses (drug expenses)".

The components constituting the table 1100 consist of elements such as cells containing text and the lines that partition the cells. These elements have positional and/or semantic relationships with one another.

Here, a positional relationship indicates the mutual positions of the cells contained in the table 1100. For each cell, it means the positional (or arrangement) relationship that describes which cells are located around or next to it and which cells are located in the same or an adjacent row or column.
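The positional relationship described above can be sketched in Python as follows. The `CellPosition` class, the (row, column) indexing, and the grouping into "same row", "same column", and "adjacent" are illustrative assumptions, not a data model taken from this publication.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CellPosition:
    cell_id: str  # hypothetical identifier
    row: int
    col: int


def positional_relations(target, cells):
    """Group the other cells by their position relative to `target`."""
    others = [c for c in cells if c.cell_id != target.cell_id]
    return {
        "same_row": [c.cell_id for c in others if c.row == target.row],
        "same_col": [c.cell_id for c in others if c.col == target.col],
        # Directly above, below, left, or right of the target.
        "adjacent": [
            c.cell_id for c in others
            if abs(c.row - target.row) + abs(c.col - target.col) == 1
        ],
    }


# A 2 x 3 grid with made-up cell ids "r0c0" ... "r1c2".
grid = [CellPosition(f"r{r}c{c}", r, c) for r in range(2) for c in range(3)]
relations = positional_relations(grid[1], grid)  # target is "r0c1"
```

A table with merged cells would need row and column spans rather than a single index per cell; the sketch leaves that out.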

A semantic relationship means that at least some of the cells contained in the table 1100 contain content related to the content of other cells. Such a semantic relationship is linkage information based on the content (text) contained in each cell. For example, the content of one cell and the content of another cell may i) be the same or similar concepts, ii) belong to the same category or the same group, or iii) stand in a superordinate or subordinate relationship to each other; in such cases, the cells can be said to be linked.

The image 1000 may contain one or more tables, and the present invention can recognize all tables contained in the image 1000 regardless of their number.

In the present invention, the image 1000 is an image obtained by scanning a paper document, an image obtained by photographing, or an image obtained by various other methods.

As shown in FIG. 1, the character recognition system 100 according to the present invention includes at least one of a receiving unit 110, a storage unit 120, an OCR (Optical Character Reader) unit 130, and a control unit 140.

First, the receiving unit 110 is a means for receiving the image 1000 containing the table 1100. It may include at least one of a communication unit, a scanning unit, and an input unit, or may consist of any other means for receiving the image 1000.

The character recognition system 100 can recognize the table 1100 contained in the image 1000 received via the receiving unit 110 and convert the content of the table 1100 into data. As a result of that recognition, the character recognition system 100 can obtain data that matches the content contained in the table 1100.

The present invention can provide a method in which the character recognition system 100 recognizes the content (or information) of the table 1100 contained in the image 1000 and, if the recognized content contains an error, corrects it, so that the content of the table 1100 is converted into data accurately.

The content recognized from the table 1100 contained in the image 1000 according to the present invention can be used to generate table data having the same structure as, or a structure similar to, the table 1100 contained in the image 1000.

The table data may consist of data formatted or structured on the basis of at least one of the structure of the cells constituting the table 1100 contained in the image 1000 and the relationships among those cells (for example, the relationship between category cells and data cells).

The table data includes data corresponding to the content recognized by the character recognition system 100 according to the present invention.

In the present invention, the positional relationships among components include, in particular, the arrangement of the cells contained in the table and positional relationships according to row or column, and the semantic relationships among components include the relationships formed between the meanings of the text contained in the respective cells.

Next, the storage unit 120 may store various information according to the present invention. The storage unit 120 can take many forms, and at least part of it may be an external server 150 (at least one of a cloud server 151 and a database (DB) 152). That is, the storage unit 120 need only be a space in which information related to the present invention is stored, and it is not constrained to any particular physical space.

The storage unit 120 contains information about the various components constituting a table. The storage unit 120 stores at least one of: i) the image 1000 containing the table 1100 and data related to it; ii) cell information and components (for example, lines and corners) of the table 1100 recognized from the image 1000 and data related to them; iii) data on the content (for example, text and images) contained in the table 1100; and iv) a data set related to the content contained in the table 1100. Here, the data set may be data used to verify or correct the content contained in the table 1100.

Next, the OCR unit 130 is a means for recognizing the content (or information) contained in the image 1000, and it can do so using at least one of various content recognition algorithms. The OCR unit 130 can recognize content using an algorithm based on artificial intelligence (or a deep learning algorithm). Here, the content may include text (characters). The OCR unit may also be called an "OCR API".

The OCR unit 130 can recognize the text contained in the image 1000 and the position information of that text. Here, the position information of the text includes at least one of information about the position of the text within the image 1000 input via the receiving unit 110 and information about the position of the text within the table 1100.

The OCR unit 130 can recognize the text contained in each cell on the basis of the text contained in the image 1000.

The table 1100 consists of a plurality of cells, and the OCR unit 130 can recognize the text contained in each cell.

For example, as shown in FIG. 1, when the text "Patient name" is recognized from the first cell 1101 of the table 1100, the identification information of the first cell 1101 is matched with the text "Patient name", and this matching information is stored in the storage unit 120.

In the present invention, the OCR unit 130 can recognize the text contained in each cell separately. Accordingly, the identification information of each cell is matched with the corresponding text, so that it is known which cell contains which text. This matching information is stored in the storage unit 120.

In this way, the text recognized from each cell is stored in association with the identification information of that cell, and the identification information of a cell includes the position information of the cell with respect to at least one of the image 1000 and the table 1100.
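One way to picture this matching information is the Python sketch below. The publication does not state how text is assigned to cells; the rule used here (a text box belongs to the cell that contains its center point), the bounding-box format `(left, top, right, bottom)`, the identifier `cell_1102`, and all sample values are assumptions for illustration.

```python
def match_text_to_cells(cell_boxes, ocr_items):
    """Map each cell id to its position and the text whose box center lies inside it."""
    words = {cell_id: [] for cell_id in cell_boxes}
    for text, (x0, y0, x1, y1) in ocr_items:
        cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
        for cell_id, (left, top, right, bottom) in cell_boxes.items():
            if left <= cx < right and top <= cy < bottom:
                words[cell_id].append(text)
                break  # a text box is assigned to at most one cell
    # Matching information: cell identification (with position) paired with text.
    return {
        cell_id: {"bbox": cell_boxes[cell_id], "text": " ".join(found)}
        for cell_id, found in words.items()
    }


# Made-up coordinates in image pixels.
cell_boxes = {"cell_1101": (0, 0, 100, 40), "cell_1102": (100, 0, 300, 40)}
ocr_items = [
    ("Patient", (5, 10, 50, 30)),
    ("name", (55, 10, 95, 30)),
    ("Hong", (110, 10, 160, 30)),
]
matching = match_text_to_cells(cell_boxes, ocr_items)
```

Keeping the bounding box next to the text means a later step can look up both what a cell says and where it sits in the image or table.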

This matching information is generated by the OCR unit 130 in the course of recognizing text from the image 1000, or is generated under the control of the control unit 140.

Next, the control unit 140 controls the overall operation of the character recognition system 100 according to the present invention. The control unit 140 may include a processor (or artificial intelligence processor, deep learning processor) that processes an artificial intelligence algorithm (or deep learning algorithm). On the basis of the artificial intelligence algorithm, the control unit 140 can recognize the table 1100 from the image 1000 and recognize at least one cell constituting the table 1100.

The control unit 140 can also analyze the relationships among cells and the meaning of the text contained in the cells on the basis of the artificial intelligence algorithm.

Furthermore, the control unit 140 can correct at least part of the text recognized by the OCR unit 130 on the basis of the relationships among the cells and the meaning of the text contained in the cells.

Here, the control unit 140 can correct the text recognized by the OCR unit 130 on the basis of the data set stored in the storage unit 120.

A cell in the present invention refers to a rectangular box defined by a plurality of lines constituting a table.

Before describing the present invention further, the components of a table are described. A "table" (テーブル) in the present invention means content presented in a predetermined format or order, and is also called a 表 (hyō, "table"). Referring to FIG. 2, the table 200 includes at least one cell (or space, region) 210. That is, the table 200 includes at least one cell 210 and is configured to hold information within the cell 210.

A cell 210 contained in the table 200 may be defined by at least four line segments. The table 200 and the cells provided in the table 200 are rectangular.

That is, a cell contained in a table may be a quadrilateral enclosed by four line segments. A "line segment" in the present invention refers to a line or at least part of a line. Each line segment constituting a cell, when extended, forms a line.

Thus, at least four line segments a, b, c, and d are needed to form one rectangular cell, and these may consist of at least portions of two horizontal lines (or row-direction lines, edges) 201, 202 and two vertical lines (or column-direction lines, edges) 203, 204 constituting the table. More specifically, a rectangular cell may be formed where at least portions of the two horizontal lines 201, 202 and the two vertical lines 203, 204 intersect one another.

Accordingly, the table 200 includes at least four lines, and the number, size, and position of the cells are defined by the number of horizontal lines, the number of vertical lines, the spacing between the lines, and the positions at which the lines are arranged.
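How the lines determine the cells can be shown with a short Python sketch. It assumes the simplest case, in which every horizontal and vertical line spans the whole table (no merged cells); the function name and the dictionary layout are hypothetical.

```python
def cells_from_lines(horizontal_ys, vertical_xs):
    """Derive cells from the y-positions of horizontal lines and x-positions of vertical lines.

    Assumes every line spans the full table, so each pair of neighboring
    horizontal lines and neighboring vertical lines bounds exactly one cell.
    """
    ys = sorted(horizontal_ys)
    xs = sorted(vertical_xs)
    cells = []
    for row, (top, bottom) in enumerate(zip(ys, ys[1:])):
        for col, (left, right) in enumerate(zip(xs, xs[1:])):
            cells.append({
                "row": row,
                "col": col,
                "bbox": (left, top, right, bottom),
                "width": right - left,    # set by the spacing of the vertical lines
                "height": bottom - top,   # set by the spacing of the horizontal lines
            })
    return cells


# Three horizontal and three vertical lines give a 2 x 2 table.
cells = cells_from_lines(horizontal_ys=[0, 40, 80], vertical_xs=[0, 100, 300])
```

Under this assumption, H horizontal lines and V vertical lines yield (H - 1) × (V - 1) cells, and four lines yield the single-cell minimum mentioned above.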

Meanwhile, the lines forming the table 200 are divided into two different types, a first type and a second type, according to the direction in which each line extends.

As illustrated, the lines 201, 202 extending in the horizontal direction are defined as the first type, and the lines 203, 204 extending in the vertical direction are defined as the second type.

Lines of the first type may be referred to by various terms with the same or similar meaning, such as horizontal lines, transverse lines, row lines, or lines in the horizontal direction.

Likewise, lines of the second type may be referred to by various terms with the same or similar meaning, such as vertical lines, longitudinal lines, column lines, or lines in the vertical direction.

Meanwhile, the number, size, and position of the cells included in the table 200 may vary with the number and arrangement of the horizontal and vertical lines, and may also vary when cells are merged.

In addition, the components of a table in the present invention may include the lines forming the table (horizontal and vertical lines) and the corners (or vertices) formed where those lines intersect (see reference signs e, f, g, and h).

Every table has at least four corners, which may be the corners of the outermost frame of the table. The number of corners in a table varies with the number of cells in the table.
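The relationship between lines, cells, and corners described above can be illustrated with a minimal sketch. It assumes a regular grid in which every horizontal and vertical line spans the whole table and no cells are merged; the function names and coordinates are hypothetical and do not appear in the specification.

```python
from itertools import product


def grid_cells(h_lines, v_lines):
    """Return cells as (left, top, right, bottom) tuples.

    h_lines: y-coordinates of the horizontal (first-type) lines.
    v_lines: x-coordinates of the vertical (second-type) lines.
    Assumes a full grid with no merged cells.
    """
    ys = sorted(h_lines)
    xs = sorted(v_lines)
    rows = list(zip(ys, ys[1:]))  # consecutive horizontal lines bound a row
    cols = list(zip(xs, xs[1:]))  # consecutive vertical lines bound a column
    return [(left, top, right, bottom)
            for (top, bottom), (left, right) in product(rows, cols)]


def grid_corners(h_lines, v_lines):
    """Return every intersection of a horizontal and a vertical line."""
    return [(x, y) for y in sorted(h_lines) for x in sorted(v_lines)]


# Two horizontal and two vertical lines give one cell with four corners.
single = grid_cells([0, 40], [0, 100])
single_corners = grid_corners([0, 40], [0, 100])

# Three horizontal and four vertical lines give 2 x 3 cells and 12 corners.
cells = grid_cells([0, 40, 80], [0, 100, 200, 300])
corners = grid_corners([0, 40, 80], [0, 100, 200, 300])
```

Under these assumptions, adding a line in either direction increases both the cell count and the corner count, which matches the statement that the number of corners depends on the number of cells.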

Hereinafter, a method of generating a table is described in more detail, based on the configuration of the character recognition system according to the present invention described above.

As shown in FIG. 3, in the character recognition method according to the present invention, an image 1000 (see FIG. 1) containing a table 1100 is first received (S310).

As described above, the image 1000 containing the table 1100 may be received through various routes. For example, the image 1000 may be transmitted via a communication unit, scanned by a scanning unit, or input through an input unit.

Once the image 1000 is received, text is recognized from the table 1100 included in the image 1000.

More specifically, in the present invention, the text contained in the plurality of cells forming the table 1100 is recognized (S320). This recognition is performed by the OCR unit 130, which recognizes the information or content contained in the table 1100. Here, recognizing the information or content contained in the table 1100 means recognizing the text contained in the table 1100.

As a result of recognition by the OCR unit 130, data corresponding to the content of the table 1100 can be obtained. For example, the OCR unit 130 recognizes the text contained in the table 1100 and obtains as data the text corresponding to that content, such as "patient name", "resident registration number", and "breakdown of medical (drug) expenses". The OCR unit 130 may recognize text from the image 1000 using a text recognition model trained to recognize each piece of text (or each character) in the image 1000 and its position. Such a text recognition model may include an algorithm based on artificial intelligence (for example, a deep learning algorithm).

As shown in FIG. 4(a), the OCR unit 130 recognizes text from the table 1100, and as shown in FIG. 4(b), each recognized text is matched with the identification information (KEY1, KEY2, KEY3, ...) of the cell containing it and stored in the storage unit 120.

Once text has been recognized for the cells forming the table 1100, the recognized text may be verified. Here, "verification" means checking whether the text contained in each cell was recognized correctly. For example, as shown in FIG. 4(a), a specific cell 401 of the table 1100 contains the text "組合負担額" (association-borne amount), but as shown in FIG. 4(b), the OCR unit 130 may recognize it as "粗合負担額" 402, misreading the first character.

In this case, content that differs from the original document is recognized (misrecognized), which undermines the reliability and accuracy of the recognized data.

Therefore, the control unit 140 may verify the text recognized from the table 1100. In doing so, the present invention may correct the text in any cell found during verification to need correction, based on the mutual positional relationships and semantic relationships (or linkage relationships) between the cells.

To this end, in the present invention, a correction target cell is identified from among the plurality of cells (S330). The control unit 140 may identify at least one correction target cell from the plurality of cells based on a preset criterion. The preset criterion can be set in many different ways, and the correction target cell can accordingly be identified in many different ways.

For example, when text has been recognized for the plurality of cells forming the table 1100, the control unit 140 may verify every cell in which text was recognized.

In this case, every cell in which text was recognized is identified as a correction target cell.

Alternatively, the control unit 140 may verify only the text contained in the category cells among the plurality of cells forming the table 1100.

In this case, the control unit 140 identifies each category cell among the plurality of cells forming the table 1100 as a correction target cell.

Here, a category cell is also referred to as a key cell and is distinguished from a data cell (or value cell).

A category cell is also referred to as a title cell, and is understood as a cell containing text that defines the category, kind, affiliation, characteristics, or the like of the data (text) contained in the data cells.

Categories are divided according to identical or similar properties or meanings, and a category is also referred to as a class. Data belonging to the same category may be data of the same meaning, the same kind, or the same affiliation.

A category cell contains a category name or data related to the category name, and a data cell contains data belonging to the category name contained in the category cell. A category name is also referred to as a class name.

As one example, as shown in FIG. 4(a), in the table 1100, the specific cell 412 containing the text "date of treatment and dispensing (treatment period)" is a category cell, and the cells 403 located in the same column as the specific cell 412, each containing date information, are data cells related to the specific cell 412.

As another example, in the table 1100, the specific cell 411 containing the text "patient name" is a category cell, and the cell 405 located in the same row as the specific cell 411 and containing the name information "Hong Gil-dong" is a data cell related to the specific cell 411.

As shown in FIG. 4, the table 1100 includes one or more category cells 411, 412, 413, 414, 415, 416.

Based on the structure of the table 1100 and the meaning of the text contained in the cells, the control unit 140 may classify each of the plurality of cells forming the table 1100 as either a first type (category cell type) or a second type (data cell type).

The control unit 140 may then identify each category cell among the plurality of cells forming the table 1100 as a correction target cell.

As another example, the control unit 140 may identify only some of the category cells among the plurality of cells forming the table 1100 as correction target cells. Here, the preset criterion for identifying a correction target cell may relate to the recognition rate of the text contained in the plurality of cells. Among the category cells from which text was extracted, the control unit 140 may identify as a correction target cell a specific cell whose text recognition rate was low at the time of recognition and extraction.

As shown in FIG. 5, when the OCR unit 130 recognizes text for each cell, a recognition reliability can be determined for each recognized text.

This reliability is also referred to as a confidence score.

As shown in FIG. 5, the confidence score may be matched with at least one of the corresponding cell of the table 1100 and the text recognized from that cell, and stored in the storage unit 120.

As shown in FIG. 5, the control unit 140 may identify as a correction target cell a category cell whose text confidence satisfies a preset reference condition (for example, a cell whose text confidence is below a reference score, such as a reference probability). In FIG. 5, a confidence score closer to 1 means a higher probability that the text was recognized correctly, and a confidence score closer to 0 means a lower probability that it was recognized correctly.

In the present invention, the control unit 140 does not treat every recognized text as a correction target. Instead, based on the confidence score, it identifies as correction target text only the text that has a low probability of having been recognized correctly. Because the correction according to the present invention is applied only to that text, the amount of computation for data processing is lower than when every text is corrected, while the information contained in the table 1100 can still be extracted accurately.

For example, if the preset reference condition is set so that cells whose text confidence score is below 0.5 are corrected, the control unit 140 identifies the cell corresponding to "KEY7" in FIG. 5 as a correction target cell. In this case, the text "(1)粗合負担額" contained in the cell "KEY7" is corrected.
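The threshold-based selection described above can be sketched as follows. This is only an illustration: the record layout, the function name, and every confidence value are assumptions made for the example (the actual values in FIG. 5 are not reproduced here); only the key "KEY7", its misrecognized text, and the 0.5 threshold come from the description.

```python
from dataclasses import dataclass


@dataclass
class RecognizedCell:
    key: str            # cell identification information, e.g. "KEY7"
    text: str           # text recognized by OCR
    confidence: float   # confidence score in [0, 1]
    is_category: bool   # True for category (key) cells, False for data cells


def select_correction_targets(cells, threshold=0.5):
    """Return keys of category cells whose confidence is below the threshold."""
    return [c.key for c in cells if c.is_category and c.confidence < threshold]


# Hypothetical OCR results; the confidence values are invented for illustration.
recognized = [
    RecognizedCell("KEY1", "患者氏名", 0.98, True),
    RecognizedCell("KEY2", "住民登録番号", 0.95, True),
    RecognizedCell("KEY7", "(1)粗合負担額", 0.42, True),
    RecognizedCell("VAL1", "12,000", 0.30, False),  # data cell: not a candidate
]

targets = select_correction_targets(recognized, threshold=0.5)
```

With these illustrative values, only "KEY7" is selected, since the low-confidence data cell is excluded by the category-cell restriction.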

Once a correction target cell has been identified by any of these methods or criteria, the control unit 140 may correct the text contained in the correction target cell by using at least one related cell, that is, a cell related to the correction target cell among the plurality of cells included in the table 1100, together with the content (text) of that related cell.

To this end, in the present invention, a related cell that is related to the correction target cell is identified from among the plurality of cells included in the table 1100 (S340).

For convenience, the following description assumes that the correction target cell is the specific category cell 601 containing the text "(1)組合負担額" (association-borne amount) shown in FIG. 6A. The description applies equally to every category cell.

As shown in FIGS. 4, 6A, and 6B, the specific category cell 601 in the image 1000 contains the text "(1)組合負担額", but as described with reference to FIG. 5, an OCR recognition error may cause the text "(1)粗合負担額" to be recognized instead. The control unit 140 may therefore correct the misrecognized text (for example, "(1)粗合負担額") by using the specific category cell 601 containing the misrecognized text and at least one related cell related to the specific category cell 601.

The control unit 140 may identify at least one related cell with reference to the correction target cell 601.

The control unit 140 may identify a related cell based on at least one of the meaning of the text contained in the correction target cell 601 and the position of the correction target cell 601.

Here, a related cell of the correction target cell 601 may be a category cell. Of the category cells and data cells included in the table 1100, the control unit 140 may identify only category cells as related cells of the correction target cell 601.

In addition, among the category cells, the control unit 140 may identify as a related cell a category cell containing text that is semantically related to the correction target cell 601, or a category cell that is positionally adjacent to the correction target cell 601.

When identifying cells related to the correction target cell 601, the control unit 140 may proceed according to any one of the following cases.

In a first case, the control unit 140 may identify as a related cell a category cell containing text that is semantically related to the correction target cell 601.

In a second case, the control unit 140 may identify as a related cell a category cell that is positionally related to the correction target cell 601.

In a third case, the control unit 140 may combine the first and second cases, identifying as related cells both the category cells containing text semantically related to the correction target cell 601 and the category cells positionally related to it. In the third case, all of the cells identified as related cells in the first case and in the second case are identified as related cells of the correction target cell 601.

Hereinafter, as a specific example of the first case, a method of identifying related cells based on the meaning of the text contained in the correction target cell 601 is described.

In this method, the control unit 140 may identify as a related cell at least one category cell containing text whose meaning is related to the text contained in the correction target cell 601.

Based on an artificial intelligence algorithm, the control unit 140 may analyze the meaning of the text contained in the correction target cell 601 and the meaning of the text contained in the other category cells, and identify as a related cell at least one category cell containing text with a related meaning.

Text with a meaning related to the correction target cell 601 may be text that is a superordinate or subordinate concept of the text contained in the correction target cell 601.

Because the control unit 140 identifies both the correction target cell 601 and the related cells from among category cells, the nature of category cells means that the correction target cell 601 contains text that is a superordinate or subordinate concept relative to a related cell. For example, the correction target cell 601 and a related cell contain text whose meanings fall under the same or a related category (kind).

For example, as shown in FIG. 6A, the text contained in the first category cell 611 ("breakdown of medical (drug) expenses") is a superordinate concept of the text contained in the second category cell 612 ("total (1)+(2)+(3)"), the third category cell 613 ("給与", that is, covered benefits), the fourth category cell 615 ("(2) patient-borne amount"), the fifth category cell 618 ("total patient-borne amount (2)+(3)"), the sixth category cell 601 (the correction target cell, "(1)組合負担額", recognized by OCR as "(1)粗合負担額"), the seventh category cell 617 ("(3) patient-borne amount"), and the eighth category cell 616 ("非給与", that is, non-covered items).

A category cell that is subordinate to the first category cell 611 may itself be superordinate to other category cells. For example, the text "給与" in the third category cell 613 is a subordinate concept of the text "breakdown of medical (drug) expenses" in the first category cell 611, and a superordinate concept of the text "(2) patient-borne amount" in the fourth category cell 615 and of the text "(1)組合負担額" in the sixth category cell 601 (the correction target cell, recognized by OCR as "(1)粗合負担額").

As noted above, the control unit 140 may identify as a related cell a category cell containing text semantically related to the correction target cell 601, and more specifically a category cell containing text that is a superordinate or subordinate concept of the text contained in the correction target cell 601.

In the first case, the related cells of the correction target cell 601 are determined to be the first category cell 611 ("breakdown of medical (drug) expenses") and the third category cell 613 ("給与"), which contain text superordinate to that of the correction target cell 601.
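The first case can be illustrated with a minimal sketch that walks a concept hierarchy upward from the correction target cell. The specification leaves the semantic analysis to an artificial intelligence algorithm; here the hierarchy is simply supplied as a hand-written parent map, which is an assumption made for illustration. Only the relationships stated above (611 above 613, 613 above 601 and 615) are encoded, and the function name is hypothetical.

```python
def superordinate_cells(cell_id, parent):
    """Return the chain of superordinate cells, nearest first.

    parent maps a cell id to the id of its direct superordinate cell.
    A visited set guards against accidental cycles in the map.
    """
    chain = []
    seen = set()
    while cell_id in parent and cell_id not in seen:
        seen.add(cell_id)
        cell_id = parent[cell_id]
        chain.append(cell_id)
    return chain


# Hypothetical hierarchy derived from the example of FIG. 6A.
PARENT = {
    613: 611,  # "給与" is subordinate to "breakdown of medical (drug) expenses"
    601: 613,  # "(1)組合負担額" is subordinate to "給与"
    615: 613,  # "(2) patient-borne amount" is subordinate to "給与"
}

semantic_related = superordinate_cells(601, PARENT)
```

For cell 601 this yields cells 613 and 611, the two superordinate category cells named in the first case.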

Hereinafter, as a specific example of the second case, a method of identifying as related cells the category cells that are positionally related to the correction target cell 601 is described. More specifically, taking the characteristics of the table 1100 into account, the control unit 140 may judge a category cell that has a preset positional relationship (or arrangement relationship) with the correction target cell 601 to be a category cell related to the meaning of the text in the correction target cell 601, that is, a related cell.

Here, the preset positional relationship may mean being located adjacent to the correction target cell 601. For example, the second category cell 612 ("total (1)+(2)+(3)"), the third category cell 613 ("給与"), the fourth category cell 615 ("(2) patient-borne amount"), and a specific data cell 622 are located adjacent to the correction target cell 601 and can therefore be said to have the preset positional relationship with it.

A cell having the preset positional relationship with the correction target cell 601 is thus a cell that is aligned with the correction target cell 601 in the column direction a or the row direction b and is located adjacent to it.

Here, the cells aligned with the correction target cell 601 in the column direction a are the cells denoted by reference numerals 621, 611, 613, 601, and 622 in FIG. 6A, and the cells aligned with it in the row direction b are the cells denoted by reference numerals 641, 642, 612, 601, 615, 617, 618, 643, 644, and 645 in FIG. 6A.

Because the control unit 140 identifies only category cells as related cells of the correction target cell 601, the specific data cell 622 is excluded from the related cells, even though it is one of the cells having the preset positional relationship described above (the second category cell 612, the third category cell 613, the fourth category cell 615, and the specific data cell 622).

As a result, as shown in FIG. 6B, the control unit 140 may identify as related cells those cells having the preset positional relationship that are category cells: the second category cell 612 ("total (1)+(2)+(3)"), the third category cell 613 ("給与"), and the fourth category cell 615 ("(2) patient-borne amount").

According to the second case, the second category cell 612, the third category cell 613, and the fourth category cell 615 are thus identified as the related cells of the correction target cell 601.
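The exclusion of data cells in the second case amounts to a simple filter over the adjacent cells. The following sketch assumes that adjacency and cell types have already been determined; the type map and function name are hypothetical.

```python
def keep_category_cells(adjacent_ids, cell_type):
    """Keep only the adjacent cells that are category cells."""
    return [cid for cid in adjacent_ids if cell_type.get(cid) == "category"]


# Cells adjacent to correction target cell 601 in the example of FIG. 6A.
ADJACENT_TO_601 = [612, 613, 615, 622]

# Hypothetical type map: 622 is a data cell, the others are category cells.
CELL_TYPE = {612: "category", 613: "category", 615: "category", 622: "data"}

positional_related = keep_category_cells(ADJACENT_TO_601, CELL_TYPE)
```

Cells missing from the type map are dropped as well, which is a conservative choice made for this sketch rather than something the specification states.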

To identify related cells having a positional relationship with the correction target cell 601, the control unit 140 may take the median of the corners (vertices) of the correction target cell 601 as the position of the correction target cell 601 on the table 1100 or the image 1000, and identify as related cells 612, 613, 615 the category cells 612, 613, 615 whose own medians lie within a preset distance of that position.

Here, the preset distance may be based on L2 distance information, which is also known as Euclidean distance information.

In this way, for each of the left, right, up, and down directions in the table 1100, the control unit 140 may check whether there is a category cell whose median lies within the preset distance of the median of the corners (vertices) of the correction target cell 601. If such a category cell exists, it may be identified as a related cell.
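The distance-based selection described above can be sketched as follows. The cell coordinates, the distance threshold, and all function names are assumptions made for illustration; the sketch also simplifies the per-direction check into a single radius test around the target cell. For an axis-aligned rectangle, the coordinate-wise median of the four corners coincides with the geometric center of the cell.

```python
import math
import statistics


def rect(left, top, right, bottom):
    """Return the four corners of an axis-aligned rectangular cell."""
    return [(left, top), (right, top), (right, bottom), (left, bottom)]


def corner_median(corners):
    """Coordinate-wise median of the corners, used as the cell position."""
    return (statistics.median(x for x, _ in corners),
            statistics.median(y for _, y in corners))


def nearby_category_cells(target_id, cells, max_dist):
    """Return category cells whose position is within max_dist (L2 distance)."""
    tx, ty = corner_median(cells[target_id]["corners"])
    found = []
    for cid, cell in cells.items():
        if cid == target_id or cell["type"] != "category":
            continue
        cx, cy = corner_median(cell["corners"])
        if math.hypot(cx - tx, cy - ty) <= max_dist:
            found.append(cid)
    return found


# Hypothetical layout loosely following FIG. 6A; coordinates are invented.
CELLS = {
    601: {"type": "category", "corners": rect(100, 80, 200, 120)},
    612: {"type": "category", "corners": rect(0, 80, 100, 120)},    # left
    613: {"type": "category", "corners": rect(100, 40, 300, 80)},   # above
    615: {"type": "category", "corners": rect(200, 80, 300, 120)},  # right
    617: {"type": "category", "corners": rect(300, 80, 400, 120)},  # too far
    622: {"type": "data", "corners": rect(100, 120, 200, 160)},     # data cell
}

related = nearby_category_cells(601, CELLS, max_dist=110)
```

With this layout, cells 612, 613, and 615 fall within the radius, cell 617 is too far away, and data cell 622 is skipped despite being the closest.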

As described above, the control unit 140 may identify the related cells of the correction target cell 601 according to the first case or the second case.

The control unit 140 may also identify all of the related cells identified in the first case and in the second case as related cells of the correction target cell 601. This corresponds to the third case described above.

That is, among the category cells included in the table 1100, the control unit 140 may identify as related cells, under the first case, the category cells containing text semantically related to the correction target cell 601 and, under the second case, the category cells positionally adjacent to the correction target cell 601. As a result, as shown in FIGS. 6A and 6B, the related cells of the correction target cell 601 are the first category cell 611 ("breakdown of medical (drug) expenses"), the second category cell 612 ("total (1)+(2)+(3)"), the third category cell 613 ("給与"), and the fourth category cell 615 ("(2) patient-borne amount").

Here, at least one category cell (for example, the third category cell 613 ("給与")) may be identified as a related cell on both semantic and positional grounds.
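The third case can be viewed as a set union of the cells found by the first and second cases, with the overlap being the cells found by both. A minimal sketch, using the cell numbers from the example above and a hypothetical function name:

```python
def combine_related(semantic_ids, positional_ids):
    """Return (all related cells, cells found on both grounds), each sorted."""
    semantic = set(semantic_ids)
    positional = set(positional_ids)
    return sorted(semantic | positional), sorted(semantic & positional)


# First case: superordinate cells of 601. Second case: adjacent category cells.
all_related, found_by_both = combine_related([611, 613], [612, 613, 615])
```

Using sets means that a cell identified on both grounds, such as 613, appears only once in the combined result.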

The category cells 611, 612, 613, and 615 identified as related cells of the correction target cell 601 may of course also be called "related cells", and the same reference numerals are used for them.

In the following, as an illustration of the third case, a method of correcting the text contained in the correction target cell 601 using related cells identified by considering both semantic and positional relationships is described.

When the correction target cell 601 and at least one related cell have been identified according to the third case, the control unit 140 may, as shown in FIG. 6C, embed the correction target cell 601 using the text contained in the correction target cell 601 and in the related cells 611, 612, 613, 615, and use the resulting dense vector to correct the correction target cell 601. In the present invention, the text contained in the correction target cell and the related cells is embedded and represented as a vector in order to express the relationships among those texts.

Here, embedding is a method of representing the information to be embedded as a vector. In the present invention, the control unit 140 may use embedding to represent the information of the correction target cell 601 and the related cells 611, 612, 613, 615 in the form of a dense vector.

More specifically, the control unit 140 may compute a dense vector representing the correction target cell 601 using the text contained in each of the correction target cell 601 and the related cells 611, 612, 613, 615.

The control unit 140 may then use the dense vector computed from the correction target cell 601 and the related cells 611, 612, 613, 615 to correct the correction target cell 601.

In the present invention, the embedding algorithm is not particularly limited; any algorithm capable of representing text as a dense vector can be used. For example, the control unit 140 can use an algorithm such as FastText or BERT to calculate a dense vector representing the correction target cell 601 from the text contained in the correction target cell 601 and the related cells 611, 612, 613, and 615.

Meanwhile, to perform the embedding, the control unit 140 can arrange the texts 631, 632, 633, 634, and 635 contained in the correction target cell 601 and the related cells 611, 612, 613, and 615 in a sequence, as shown in FIG. 6C. Here, the control unit 140 can determine the order of the texts 631, 632, 633, 634, and 635 based on a preset criterion.

For example, the control unit 140 can place the texts 632 and 634 contained in the semantically related cells 611 and 613 closer to the text 631 contained in the correction target cell 601. Conversely, it can place the texts 633, 634, and 635 contained in the positionally related cells 612, 613, and 615 closer to the text 631 contained in the correction target cell 601.

The control unit 140 can also place the text 634 contained in the related cell 613, which has both semantic and positional relevance, closest to the text 631 contained in the correction target cell 601.
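The ordering rule described above can be illustrated with a minimal Python sketch. It assumes that each related cell carries two hypothetical flags, `semantic` and `positional`, and it places the text of a cell having both kinds of relevance nearest to the target text. The names `RelatedCell` and `order_texts`, the `prefer` parameter, and the placeholder strings `t631` to `t635` (standing in for texts 631 to 635) are illustrative assumptions, not part of the disclosure.

```python
from dataclasses import dataclass


@dataclass
class RelatedCell:
    text: str
    semantic: bool    # related by meaning (assumed flag)
    positional: bool  # related by position in the table (assumed flag)


def order_texts(target_text, related, prefer="semantic"):
    """Return the target text followed by related texts, nearest first."""
    def rank(cell):
        both = cell.semantic and cell.positional
        preferred = cell.semantic if prefer == "semantic" else cell.positional
        # Smaller tuples sort first: cells with both relations come first,
        # then cells with the preferred relation, then the rest.
        return (not both, not preferred)

    # sorted() is stable, so cells with equal rank keep their input order.
    return [target_text] + [c.text for c in sorted(related, key=rank)]


# Relations follow the description: 611 semantic, 612 and 615 positional,
# 613 both. The text values are placeholders.
related_cells = [
    RelatedCell("t632", semantic=True, positional=False),   # cell 611
    RelatedCell("t633", semantic=False, positional=True),   # cell 612
    RelatedCell("t634", semantic=True, positional=True),    # cell 613
    RelatedCell("t635", semantic=False, positional=True),   # cell 615
]
ordered = order_texts("t631", related_cells, prefer="semantic")
```

Passing `prefer="positional"` instead moves the texts of the positionally related cells ahead of the purely semantic one, which corresponds to the converse arrangement mentioned above.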

In this way, through embedding, the control unit 140 can represent the correction target cell 601 as a vector based on the text contained in the correction target cell 601 and the related cells 611, 612, 613, and 615. Once the embedding of the correction target cell 601 based on the texts 631, 632, 633, 634, and 635 contained in the correction target cell 601 and the related cells 611, 612, 613, and 615 is complete, the present invention proceeds to the step of correcting the correction target cell (S350).
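As a rough illustration of turning an ordered sequence of cell texts into a single vector, the following Python sketch hashes character n-grams into a fixed-size, normalized vector. This is a simple stand-in chosen so that the example runs with the standard library only; it is not FastText or BERT, and unlike a model such as BERT it is largely insensitive to the order of the texts. The function names, the dimension of 64, and the placeholder texts are assumptions made for illustration.

```python
import math
import zlib

DIM = 64  # vector size chosen arbitrarily for this sketch


def char_ngrams(text, n=2):
    """Split text into overlapping character n-grams, with boundary marks."""
    padded = f"<{text}>"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def embed_sequence(ordered_texts, dim=DIM):
    """Map an ordered list of cell texts to one L2-normalized vector."""
    joined = " ".join(ordered_texts)
    vec = [0.0] * dim
    for gram in char_ngrams(joined):
        # crc32 gives a deterministic bucket index for each n-gram.
        vec[zlib.crc32(gram.encode("utf-8")) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


# Placeholder strings standing in for texts 631 to 635 of FIG. 6C.
cell_vector = embed_sequence(["t631", "t634", "t632", "t633", "t635"])
```

In practice the hashing step would be replaced by whichever embedding model is actually used; the sketch only shows the shape of the data flow from arranged texts to one vector for the correction target cell.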

The control unit 140 can correct the text contained in the correction target cell 601 using the vector of the correction target cell calculated from the texts 631, 632, 633, 634, and 635 contained in the correction target cell 601 and the related cells. Here, the vector of the correction target cell 601 may correspond to the dense vector described above.

More specifically, the control unit 140 can extract, from a pre-specified data set, the word whose vector representation is most similar to the vector of the correction target cell 601 (for example, the word corresponding to the specific vector located at the shortest distance from the vector of the correction target cell) as the correction target text, that is, the text to which the content of the correction target cell is to be changed.

The control unit 140 can extract the correction target text from the pre-specified data set based on the distance between the vector of the correction target cell, calculated using the correction target cell 601 and the related cells 611, 612, 613, and 615, and the vectors contained in the data set. Here, the pre-specified data set may include candidate texts that can appear in the plurality of cells of the table 1100, in particular in the category cells, together with the vectors corresponding to those candidate texts. Such a data set may be constructed in advance, and candidate texts may be added to or removed from the data set even after it has been constructed.

The data set may also include the result of embedding performed based on at least one of the positional relationship among the plurality of cells forming the table 1100 and the semantic relationship among the candidate texts contained in the respective cells. That is, the data set may include vector information for each of the candidate texts.

The control unit 140 can extract, from the data set, the text 806 (see FIG. 7) corresponding to the specific vector closest to the vector of the correction target cell 601 as the correction target text.

The control unit 140 can also change the text contained in the correction target cell 601 to the text 806 (see FIG. 7) corresponding to the specific vector closest to the vector of the correction target cell 601.
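The nearest-vector lookup described above can be sketched in Python as follows. The sketch assumes cosine distance as the distance measure (the description does not name a specific metric) and represents the data set as a list of hypothetical (candidate text, vector) pairs; the three-dimensional vectors are made-up values for illustration only.

```python
import math


def cosine_distance(a, b):
    """Return 1 minus the cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def nearest_text(cell_vector, dataset):
    """Return the candidate text whose vector is closest to cell_vector."""
    text, _ = min(dataset, key=lambda item: cosine_distance(cell_vector, item[1]))
    return text


# Toy data set: candidate texts paired with made-up vectors.
dataset = [
    ("(1)組合負担額", [0.9, 0.1, 0.0]),
    ("選択診療料以外", [0.0, 0.2, 0.9]),
]
cell_vector = [0.8, 0.3, 0.1]  # made-up vector for the misrecognized cell
corrected = nearest_text(cell_vector, dataset)
```

With these toy values the first candidate is the closer one, so `corrected` holds the text that would replace the content of the correction target cell.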

In this way, the control unit 140 can extract the correction target text 806 from the data set using the vector of the correction target cell 601 derived from the texts 801 to 805 contained in the correction target cell 601 and the related cells 611, 612, 613, and 615.

As a result, the text misrecognized for the correction target cell 601, "(1)粗合負担額", is corrected to "(1)組合負担額" (roughly, "(1) amount borne by the association").

Once the correction target text 806 has been extracted from the data set in this way, the control unit 140 can change the text 801 contained in the correction target cell 601 to the correction target text 806.

Meanwhile, once the correction target text 806 has been extracted as described above, the control unit 140 can use the extracted correction target text 806 to convert the table 1100 contained in the image 1000 into data. The control unit 140 can include the correction target text 806 in the data obtained as a result of converting the table 1100 contained in the image 1000 into data.

In addition to converting the table 1100 into data, the control unit 140 can use the resulting data to generate a table having a structure corresponding to the table 1100 contained in the image 1000.
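A minimal Python sketch of this step is given below. It assumes the recognized table is held as a simple grid (a list of rows) without merged cells, replaces one cell with the extracted text, and serializes the result as CSV. The grid contents other than the misrecognized and corrected strings, and the function names, are hypothetical.

```python
import csv
import io


def apply_correction(grid, row, col, corrected_text):
    """Return a copy of the grid with one cell's text replaced."""
    new_grid = [list(r) for r in grid]
    new_grid[row][col] = corrected_text
    return new_grid


def to_csv(grid):
    """Serialize the grid as CSV text, one table row per line."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(grid)
    return buffer.getvalue()


# Made-up two-column table containing the misrecognized cell text.
grid = [
    ["項目", "金額"],
    ["(1)粗合負担額", "1000"],
]
corrected_grid = apply_correction(grid, 1, 0, "(1)組合負担額")
csv_text = to_csv(corrected_grid)
```

Because the corrected grid keeps the same row and column layout as the recognized one, it can serve as the basis for a table whose structure corresponds to the table in the image.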

For example, as shown in FIG. 8(a), when the text 930 ("撰沢診療科以外"), which corresponds to at least part of the data extracted from the table 910 contained in a scanned image, contains an error, the control unit 140 can perform the correction described above to obtain data containing the corrected text 940 ("選択診療料以外", roughly "other than selective treatment fees"), as shown in FIG. 8(b). If necessary, a table 920 containing the corrected text 940 can also be generated, and this table 920 has the same content as the scanned table 910.

More specifically, when text contained in a table in an image is misrecognized, the character recognition method and character recognition system according to the present invention correct it, so that the content of the table can be converted into data more accurately. To this end, the present invention corrects the text contained in a cell to be corrected by taking into account the arrangement of the cells forming the table and the semantic relationships among the texts contained in the cells, thereby providing a highly accurate character recognition method and system that can convert the content of the table in the image into data as it is. As a result, the character recognition method and system according to the present invention can generate a table containing the same content as the table contained in the image.

In addition, by performing correction only on specific cells with a low text recognition rate, the character recognition method and character recognition system according to the present invention can minimize the amount of data processing while accurately converting the content of the table contained in the image into data.
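Restricting correction to poorly recognized cells can be sketched in Python as a simple threshold on a per-cell recognition confidence. The threshold of 0.9, the confidence values, and the dictionary layout mapping a cell identifier to a (text, confidence) pair are assumptions made for illustration; the description only states that cells with a low text recognition rate are selected.

```python
def select_correction_targets(cells, threshold=0.9):
    """Return ids of cells whose recognition confidence is below threshold."""
    return [
        cell_id
        for cell_id, (_text, confidence) in cells.items()
        if confidence < threshold
    ]


# Made-up recognition results: cell id -> (recognized text, confidence).
cells = {
    601: ("(1)粗合負担額", 0.62),
    613: ("給与", 0.98),
}
targets = select_correction_targets(cells)
```

Only the cells returned by this filter would go through the embedding and lookup steps, which is how the amount of processing stays small.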

Meanwhile, the present invention described above can be implemented as a program that is executed by one or more processes on a computer and can be stored in a computer-readable medium (or recording medium).

The present invention described above can also be implemented as computer-readable code or instructions on a program recording medium. That is, the present invention can be provided in the form of a program.

The computer-readable medium includes any type of recording device that stores data readable by a computer system. Examples of the computer-readable medium include an HDD (Hard Disk Drive), an SSD (Solid State Disk), an SDD (Silicon Disk Drive), a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.

The computer-readable medium may also be a server or cloud storage that includes storage and that an electronic device can access through communication. In this case, the computer can download the program according to the present invention from the server or cloud storage via wired or wireless communication.

Furthermore, in the present invention, the computer described above is an electronic device equipped with a processor, that is, a central processing unit (CPU), and its type is not particularly limited.

The detailed description of the present invention is illustrative and should not be construed as restrictive in any respect. The scope of the present invention should be determined by a reasonable interpretation of the appended claims, and all changes within the equivalent scope of the present invention are included in the scope of the present invention.

100 Character recognition system
110 Receiving unit
120 Storage unit
130 OCR unit
140 Control unit
150 External server
151 Cloud server
152 Database (DB)
200 Table
201, 202 Horizontal lines
203, 204 Vertical lines
210 Cell
1000 Image
1100 Table

Claims

1. A character recognition method comprising:
receiving an image containing a table;
recognizing text contained in a plurality of cells forming the table;
identifying a correction target cell from the plurality of cells based on a preset criterion;
identifying, from the plurality of cells, at least one related cell related to the correction target cell; and
correcting the text contained in the correction target cell using a vector of the correction target cell calculated using the text contained in the correction target cell and the related cell.

2. The character recognition method according to claim 1, wherein, in the step of identifying the related cell, the related cell is identified based on at least one of the meaning of the text contained in the correction target cell and the position of the correction target cell.

3. The character recognition method according to claim 2, wherein at least some of the plurality of cells are identified as category cells and others are identified as data cells containing data corresponding to at least one of the category cells, and
wherein, in the step of identifying the related cell, a specific category cell among the category cells that is related to at least one of the meaning of the text contained in the correction target cell and the position of the correction target cell is identified as the related cell.

4. The character recognition method according to claim 3, wherein the specific category cell related to the meaning of the text contained in the correction target cell contains text whose meaning is a superordinate or subordinate concept of the text contained in the correction target cell.

5. The character recognition method according to claim 4, wherein the specific category cell related to the meaning of the text contained in the correction target cell is aligned with the correction target cell in the column direction or the row direction, with the correction target cell as a reference.

6. The character recognition method according to claim 3, wherein the specific category cell is, among the category cells, a cell located adjacent to the correction target cell in the row direction or the column direction on the table, with the correction target cell as a reference.

7. The character recognition method according to claim 1, wherein, in the correcting step, the text contained in the correction target cell is corrected based on the distance between the vector of the correction target cell and a vector contained in a pre-specified data set.

8. The character recognition method according to claim 7, wherein the correcting step includes:
extracting correction target text from the data set based on the distance between the vector of the correction target cell and a vector contained in the pre-specified data set; and
changing the text contained in the correction target cell to the correction target text.

Said data set includes:
Candidate texts that can be included in the plurality of cells constituting the table; and
8. Vector information about each of said candidate texts corresponding to a result of embedding based on at least one of a positional relationship of said plurality of cells and a semantic relationship between said candidate texts is included. The character recognition method described in .

10. The character recognition method according to claim 9, wherein, in the correcting step, the vector closest to the vector of the correction target cell is identified from the data set, and the text corresponding to the closest vector is extracted as the correction target text.

11. The character recognition method according to claim 10, further comprising generating a table that has a structure corresponding to the table contained in the image and that includes the correction target text in the correction target cell.

12. The character recognition method according to claim 1, wherein the preset criterion for identifying the correction target cell relates to the reliability of recognition of the text contained in the plurality of cells.

13. A character recognition system comprising:
a storage unit;
a receiving unit that receives an image containing a table; and
a control unit that recognizes text contained in a plurality of cells forming the table contained in the image,
wherein the control unit:
identifies a correction target cell from the plurality of cells based on a preset criterion;
identifies, from the plurality of cells, at least one related cell related to the correction target cell; and
corrects the text contained in the correction target cell using a vector of the correction target cell calculated using the text contained in the correction target cell and the related cell.

14. A computer program comprising a plurality of instructions, wherein the instructions, when executed, cause a computer to perform:
receiving an image containing a table;
recognizing text contained in a plurality of cells forming the table;
identifying a correction target cell from the plurality of cells based on a preset criterion;
identifying, from the plurality of cells, at least one related cell related to the correction target cell; and
correcting the text contained in the correction target cell using a vector of the correction target cell calculated using the text contained in the correction target cell and the related cell.