JP7398526B2

JP7398526B2 - Character recognition method and character recognition system for recognizing information contained in a table

Info

Publication number: JP7398526B2
Application number: JP2022115078A
Authority: JP
Inventors: ギウクキム; スンシン; ヨンミンペク; ヒョソンワン; ジュングンキム; スンボムチェー
Original assignee: Naver Corp
Current assignee: Naver Corp
Priority date: 2021-07-20
Filing date: 2022-07-19
Publication date: 2023-12-14
Anticipated expiration: 2042-07-19
Also published as: KR102697516B1; KR20230013849A; JP2023016031A

Description

The present invention relates to a character recognition method and a character recognition system for recognizing information contained in a table.

In its dictionary sense, artificial intelligence is technology that implements human abilities such as learning, reasoning, perception, and natural language understanding in computer programs. Artificial intelligence has advanced dramatically through deep learning, which adds neural networks modeled on the human brain to machine learning.

Deep learning is a technology that enables computers to judge and learn as humans do, and thereby to cluster or classify objects and data. In recent years it has become able to analyze image data as well as text data, and it is actively used in a very wide range of industries.

With these advances in artificial intelligence, various kinds of automation are also taking place in the field of office automation. In particular, much effort is going into converting content printed on paper into data using image data analysis technology based on artificial intelligence. As part of this, paper documents are converted into images, and the content of the documents is converted into data by image analysis technology (or image data analysis technology) that analyzes the content contained in the images. This requires technology that analyzes the images according to the characteristics of the content contained in the document.

For example, when a document containing a table is converted into data, the various elements related to the table must be analyzed accurately, such as the format of the table, the content of the text in the table, and the position of the text in the table.

Patent Document 1 (Table generation device and method for format automation) discloses a method of recognizing a table from an image and reproducing the recognized table. However, because that method reproduces the table on the basis of the line segments (lines) in the table, it is limited in how accurately it can analyze the content of the table.

Accordingly, there is a need for a character recognition method that can accurately generate even the content contained in a table.

Patent Document 1: Korean Registered Patent No. 10-1907029

An object of the present invention is to provide a character recognition method and a character recognition system that can convert the content (or information) contained in a table into data and that are robust against character recognition errors.

Another object of the present invention is to provide a character recognition method and a character recognition system that, when the content of a table is converted into data, can secure that content accurately as data and that are robust against character recognition errors.

A further object of the present invention is to provide a character recognition method and a character recognition system that can convert the content of a table into data while taking into account the organic relationships among the items of content in the table, and that are robust against character recognition errors.

A further object of the present invention is to provide a character recognition method and a character recognition system that can convert the content of a table into data while minimizing the amount of data processing, and that are robust against character recognition errors.

A character recognition method according to the present invention may include the steps of: receiving an image including a table; recognizing text contained in a plurality of cells constituting the table; identifying a correction target cell from the plurality of cells based on a preset criterion; identifying, from the plurality of cells, at least one related cell that is related to the correction target cell; and correcting the text contained in the correction target cell using a vector of the correction target cell calculated from the text contained in the correction target cell and the related cell.
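The final correction step can be made concrete with a toy sketch. This passage does not say how the vector of the correction target cell is computed, so everything below is an assumption chosen for illustration only: the vector is a bag of character bigrams, similarity is cosine similarity, and `REFERENCE_PAIRS` is a hypothetical list of valid (related-cell text, cell text) pairs. It is not the method claimed in the patent.

```python
import math
from collections import Counter

# Hypothetical reference data: (related-cell text, valid cell text) pairs.
REFERENCE_PAIRS = [
    ("Medical fees", "Injection fee"),
    ("Medical fees", "Examination fee"),
    ("Patient information", "Patient name"),
]


def bigram_vector(text):
    """Count character bigrams of the lower-cased, space-padded text."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + 2] for i in range(len(padded) - 1))


def cell_vector(target_text, related_texts):
    """Build one vector from the target cell text and its related cells' text."""
    vec = bigram_vector(target_text)
    for text in related_texts:
        vec.update(bigram_vector(text))  # Counter.update adds the counts
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[key] for key, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def correct_cell(target_text, related_texts):
    """Return the reference cell text whose context vector is closest."""
    query = cell_vector(target_text, related_texts)
    best = max(
        REFERENCE_PAIRS,
        key=lambda pair: cosine(query, cell_vector(pair[1], [pair[0]])),
    )
    return best[1]
```

Because the related cell's text is part of the vector, a misread such as `correct_cell("Injectlon fee", ["Medical fees"])` is matched against candidates in their table context rather than in isolation.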

A character recognition system according to the present invention includes a storage unit, a receiving unit that receives an image including a table, and a control unit that recognizes text contained in a plurality of cells constituting the table in the image. The control unit may identify a correction target cell from the plurality of cells based on a preset criterion, identify, from the plurality of cells, at least one related cell that is related to the correction target cell, and correct the text contained in the correction target cell using a vector of the correction target cell calculated from the text contained in the correction target cell and the related cell.

Further, a computer program according to the present invention includes a plurality of instructions that, when executed, may cause a computer to perform the steps of: receiving an image including a table; recognizing text contained in a plurality of cells constituting the table; identifying a correction target cell from the plurality of cells based on a preset criterion; identifying, from the plurality of cells, at least one related cell that is related to the correction target cell; and correcting the text contained in the correction target cell using a vector of the correction target cell calculated from the text contained in the correction target cell and the related cell.

As described above, the character recognition method and character recognition system according to the present invention recognize text from a table in an image and verify the recognized text, and can thereby convert the content of the table into data more accurately.

More specifically, when text in a table in an image is recognized incorrectly, the character recognition method and character recognition system according to the present invention correct it, and can thereby convert the content of the table into data more accurately.

FIG. 1 is a conceptual diagram for explaining a character recognition system according to the present invention.

FIG. 2 is a conceptual diagram for explaining the components of a table.

FIG. 3 is a flowchart for explaining a character recognition method according to the present invention.

FIGS. 4, 5, 6A, 6B, 6C, 7, and 8 are conceptual diagrams for explaining a method by which the character recognition system recognizes information contained in a table.

Hereinafter, embodiments of the present invention are described in detail with reference to the accompanying drawings. Identical or similar components are given the same reference numerals regardless of the drawing number, and redundant descriptions of them are omitted. The suffixes "module" and "unit" attached to components in the following description are assigned or used interchangeably only for ease of drafting the specification and do not in themselves carry distinct meanings or roles. In describing the embodiments, detailed descriptions of related known art are omitted where they would obscure the gist of the embodiments. Furthermore, the accompanying drawings are provided only to aid understanding of the embodiments; they do not limit the technical idea of the present invention, which should be understood to include all modifications, equivalents, and substitutes falling within its spirit and technical scope.

Terms that include ordinal numbers, such as "first" and "second", are used to describe various components, but the components are not limited by these terms. The terms are used only to distinguish one component from another.

When a component is said to be "coupled" or "connected" to another component, it may be directly coupled or connected to that component, or further components may be present in between. In contrast, when a component is said to be "directly coupled" or "directly connected" to another component, it should be understood that no other component is present in between.

Singular expressions include plural expressions unless otherwise specified.

In this specification, terms such as "include" and "have" specify the presence of the features, numbers, steps, operations, components, parts, or combinations thereof described in the specification. They should not be understood to exclude in advance the presence or possible addition of one or more other features, numbers, steps, operations, components, parts, or combinations thereof.

As described above, office automation is being carried out in various forms as artificial intelligence develops, and for the sake of work efficiency there is a growing need to digitize (convert into data) tables contained in paper documents while preserving the format they have in those documents.

For example, documents such as receipts of various kinds contain tables, and organizations such as insurance companies and hospitals need to process the tables contained in large volumes of paper documents as digitized data and computerize them.

In response to these needs, technology for generating or reproducing a table that has the same structure as a table contained in an image is being actively developed.

In technology that recognizes the content (or information) of a table in an image and converts the recognized content into data, it is very important to recognize accurately the various elements related to the table (for example, the text, the position of the text, the structure of the cells, the positions of the cells, and the relationships between cells).

To that end, various efforts are being made to recognize a table in an image accurately by recognizing the various components of the table accurately and correcting any elements that were recognized incorrectly.

A method of correcting extracted content (or information) when it differs from the actual content of the table in the image is described below in more detail with reference to the accompanying drawings. FIG. 1 is a conceptual diagram for explaining the character recognition system according to the present invention, and FIG. 2 is a conceptual diagram for explaining the components of a table. FIG. 3 is a flowchart for explaining the character recognition method according to the present invention, and FIGS. 4, 5, 6A, 6B, 6C, 7, and 8 are conceptual diagrams for explaining a method by which the character recognition system recognizes information contained in a table.

The character recognition system 100 according to the present invention recognizes, from the table 1100 in the image 1000, the content of the table 1100 and the components that make up the table 1100, and can accurately convert the content of the table 1100 into data based on the recognized content and the relationships between the components.

Here, the content of the table 1100 means any meaningful system of symbols, such as letters, numbers, symbols, and operators. For convenience of explanation, the present invention refers to all such meaningful systems of symbols collectively as "text".

For example, as illustrated, the table 1100 contains text such as "Patient name", "Resident registration number", and "Breakdown of medical fees (drug costs)".

The components of the table 1100 consist of elements such as cells that contain text and the lines that delimit the cells. These elements have positional and/or semantic relationships with one another.

Here, a positional relationship describes how the cells of the table 1100 are positioned relative to one another. For each cell, it is the positional (or arrangement) relationship that indicates which cells lie around or next to it and which cells lie in the same or an adjacent row or column.
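As a minimal sketch of such positional relationships, the snippet below assumes each cell is addressed by zero-based row and column indices and ignores merged cells. The `Cell` type and the helper names are hypothetical and do not appear in the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    row: int   # zero-based row index (assumed addressing scheme)
    col: int   # zero-based column index
    text: str


def same_row(cells, target):
    """Cells that share the target's row, excluding the target itself."""
    return [c for c in cells if c.row == target.row and c != target]


def same_column(cells, target):
    """Cells that share the target's column, excluding the target itself."""
    return [c for c in cells if c.col == target.col and c != target]


def adjacent(cells, target):
    """Cells directly above, below, left of, or right of the target."""
    return [
        c for c in cells
        if abs(c.row - target.row) + abs(c.col - target.col) == 1
    ]


grid = [
    Cell(0, 0, "Patient name"), Cell(0, 1, "Resident registration number"),
    Cell(1, 0, "Hong"), Cell(1, 1, "000000-0000000"),
]
```

With this layout, `same_column(grid, grid[2])` returns the header cell above the value "Hong", which is the kind of neighbor a correction step could consult.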

A semantic relationship means that at least some of the cells of the table 1100 contain content that is related to the content of other cells. It is linkage information based on the content (text) of each cell. For example, the content of one cell and the content of another cell may i) express the same or a similar concept, ii) belong to the same category or the same group, or iii) stand in a superordinate or subordinate relationship to each other. In such cases, the cells can be described as linked.

The image 1000 may contain one or more tables, and the present invention can recognize every table in the image 1000 regardless of how many it contains.

In the present invention, the image 1000 is an image obtained by scanning a paper document, an image obtained by photography, or an image obtained by any of various other methods.

As shown in FIG. 1, the character recognition system 100 according to the present invention includes at least one of a receiving unit 110, a storage unit 120, an OCR (Optical Character Reader) unit 130, and a control unit 140.

First, the receiving unit 110 is a means for receiving the image 1000 including the table 1100. It may include at least one of a communication unit, a scanning unit, and an input unit, or may consist of any other means for receiving the image 1000.

The character recognition system 100 can recognize the table 1100 in the image 1000 received via the receiving unit 110 and convert the content of the table 1100 into data. As a result of this recognition, the character recognition system 100 can secure data that matches the content of the table 1100.

The present invention can provide a method in which the character recognition system 100 recognizes the content (or information) of the table 1100 in the image 1000 and, if the recognized content contains an error, corrects it, so that the content of the table 1100 is accurately converted into data.

The content recognized from the table 1100 in the image 1000 according to the present invention can be used to generate table data that has the same structure as, or a structure similar to, the table 1100.

The table data may consist of data that is formatted or structured based on at least one of the structure of the cells constituting the table 1100 in the image 1000 and the relationships between those cells (such as the relationship between category cells and data cells).

The table data includes data corresponding to the content recognized by the character recognition system 100 according to the present invention.

In the present invention, the positional relationships between components include, in particular, the arrangement of the cells in the table and positional relationships based on rows or columns. The semantic relationships between components include the relationships formed between the meanings of the text contained in the respective cells.

Next, the storage unit 120 may store various information according to the present invention. The storage unit 120 may take a wide variety of forms, and at least part of it may be an external server 150 (at least one of a cloud server 151 and a database (DB) 152). In other words, the storage unit 120 may be any space in which information related to the present invention is stored, and it is not constrained to a particular physical space.

The storage unit 120 holds information about the various components of a table. The storage unit 120 stores at least one of: i) the image 1000 including the table 1100 and data related to it; ii) cell information and components (for example, lines and corners) of the table 1100 recognized from the image 1000, and data related to them; iii) data on the content (for example, text and images) of the table 1100; and iv) a data set related to the content of the table 1100. Here, the data set may be data used to verify or correct the content of the table 1100.

Next, the OCR unit 130 is a means for recognizing the content (or information) in the image 1000, and can do so using at least one of various content recognition algorithms. The OCR unit 130 can recognize content using an algorithm based on artificial intelligence (or a deep learning algorithm). Here, the content may include text (characters). The OCR unit may also be referred to as an "OCR API".

The OCR unit 130 can recognize the text in the image 1000 and the position information of that text. Here, the position information of the text includes at least one of information on the position of the text within the image 1000 input via the receiving unit 110 and information on the position of the text within the table 1100.

Based on the text in the image 1000, the OCR unit 130 can recognize the text contained in each cell.

The table 1100 consists of a plurality of cells, and the OCR unit 130 can recognize the text contained in each of them.

For example, as shown in FIG. 1, when the text "Patient name" is recognized from the first cell 1101 of the table 1100, the identification information of the first cell 1101 is matched with the text "Patient name", and this matching information is stored in the storage unit 120.

In the present invention, the OCR unit 130 can recognize the text of each cell separately. The identification information of each cell is therefore matched with its corresponding text, so that it is known which text belongs to which cell. This matching information is stored in the storage unit 120.

In this way, the text recognized from each cell is stored matched with the identification information of that cell. The identification information of a cell includes position information of the cell relative to at least one of the image 1000 and the table 1100.
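One way to picture this matching information is as a record per cell, keyed by the cell's identifier. The sketch below is illustrative only: the `CellRecord` fields, the `(left, top, right, bottom)` pixel bounding box, and the identifier format are assumptions, not details given in the patent.

```python
from dataclasses import dataclass


@dataclass
class CellRecord:
    cell_id: str   # hypothetical identifier, e.g. "cell-1101"
    bbox: tuple    # assumed (left, top, right, bottom) in image pixels
    text: str      # text recognized by OCR for this cell


# Stand-in for the storage unit: maps cell identifier to its record.
matching_info = {}


def store_match(cell_id, bbox, text):
    """Match recognized text to a cell and keep the pair for later lookup."""
    record = CellRecord(cell_id=cell_id, bbox=bbox, text=text)
    matching_info[cell_id] = record
    return record


store_match("cell-1101", (10, 10, 120, 40), "Patient name")
```

Keeping the position alongside the text lets later steps look up both what a cell says and where it sits in the image or table.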

This matching information is generated either by the OCR unit 130 in the course of recognizing text from the image 1000 or under the control of the control unit 140.

Next, the control unit 140 controls the overall operation of the character recognition system 100 according to the present invention. The control unit 140 may include a processor (or an artificial intelligence processor or deep learning processor) that runs an artificial intelligence algorithm (or deep learning algorithm). Based on the artificial intelligence algorithm, the control unit 140 can recognize the table 1100 from the image 1000 and recognize at least one cell constituting the table 1100.

The control unit 140 can also analyze the relationships between cells and the meaning of the text in the cells based on the artificial intelligence algorithm.

Further, the control unit 140 can correct at least part of the text recognized by the OCR unit 130 based on the relationships between cells and the meaning of the text in the cells.

Here, the control unit 140 can correct the text recognized by the OCR unit 130 based on the data set stored in the storage unit 120.
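A data-set-based correction can be sketched with Python's standard `difflib` module. This is a swapped-in fuzzy string match, not the patent's own procedure; `KNOWN_TERMS` is a hypothetical stand-in for the stored data set, and the 0.8 cutoff is an arbitrary assumption.

```python
import difflib

# Hypothetical data set of terms expected to appear in the table.
KNOWN_TERMS = [
    "Patient name",
    "Resident registration number",
    "Breakdown of medical fees",
]


def correct_with_dataset(recognized, known_terms=KNOWN_TERMS, cutoff=0.8):
    """Replace OCR output with the closest known term, if one is close enough.

    Returns the recognized text unchanged when no known term reaches
    the similarity cutoff.
    """
    matches = difflib.get_close_matches(recognized, known_terms, n=1, cutoff=cutoff)
    return matches[0] if matches else recognized
```

Text that resembles no known term, such as a numeric amount, falls below the cutoff and passes through unchanged, so the lookup only rewrites near misses.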

A cell in the present invention is a rectangular box defined by a plurality of the lines that make up a table.

Before the present invention is described, the components of a table are explained. A "table" in the present invention means content presented in a predetermined format or order, and is also referred to by the synonymous term 表 ("chart"). Referring to FIG. 2, the table 200 includes at least one cell (or space, region) 210 and is configured to hold information within the cell 210.

A cell 210 of the table 200 may be defined by at least four line segments. The table 200 and the cells within it are rectangular.

That is, a cell in a table may be a quadrilateral enclosed by four line segments. A "line segment" in the present invention means at least part of a line. The line segments that form each cell become lines when extended.

Thus, at least four line segments a, b, c, and d are needed to form one rectangular cell. These may consist of at least portions of two horizontal lines (or row-direction lines, edges) 201 and 202 and two vertical lines (or column-direction lines, edges) 203 and 204 of the table. More specifically, a rectangular cell may be formed where at least portions of the two horizontal lines 201 and 202 and the two vertical lines 203 and 204 intersect one another.

The table 200 therefore includes at least four lines, and the number, size, and position of its cells are determined by the number of horizontal lines, the number of vertical lines, the spacing between the lines, and the positions at which the lines are arranged.
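The way line positions determine the cells can be sketched for the simplest case. The function below assumes every line spans the whole table and that no cells are merged; the function name and the `(left, top, right, bottom)` tuple layout are illustrative choices, not taken from the patent.

```python
def cells_from_lines(horizontal_ys, vertical_xs):
    """Derive cell rectangles from full-span horizontal and vertical lines.

    horizontal_ys: y-coordinates of the horizontal lines.
    vertical_xs:   x-coordinates of the vertical lines.
    Returns a list of (left, top, right, bottom) tuples, row by row.
    """
    ys = sorted(horizontal_ys)
    xs = sorted(vertical_xs)
    return [
        (left, top, right, bottom)
        for top, bottom in zip(ys, ys[1:])     # consecutive horizontal lines
        for left, right in zip(xs, xs[1:])     # consecutive vertical lines
    ]
```

Under these assumptions, m horizontal lines and n vertical lines yield (m - 1) x (n - 1) cells, and the spacing between neighboring lines gives each cell's size.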

The lines that make up the table 200 are classified as either a first type or a second type, which differ from each other, according to the direction in which each line extends.

As illustrated, the lines 201 and 202 that extend horizontally are defined as the first type, and the lines 203 and 204 that extend vertically are defined as the second type.
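A small sketch of this classification follows. It assumes each detected line is given by its two endpoints and decides the type by comparing horizontal and vertical extent, which tolerates slight skew in a scanned image. That decision rule is an assumption for illustration; the patent only states that the type follows from the direction of the line.

```python
def line_type(x1, y1, x2, y2):
    """Classify a line by its dominant direction.

    Returns "first" for lines that run mostly horizontally and
    "second" for lines that run mostly vertically.
    """
    horizontal_extent = abs(x2 - x1)
    vertical_extent = abs(y2 - y1)
    return "first" if horizontal_extent >= vertical_extent else "second"
```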

第１タイプのラインは、水平ライン、横ライン、行ライン、横方向のラインなど、その意味が同一又は類似の用語で多様に命名される。 The first type of line may be referred to by various terms with the same or similar meaning, such as horizontal line, transverse line, row line, or line in the horizontal direction.

また、第２タイプのラインは、垂直ライン、縦ライン、列ライン、縦方向のラインなど、その意味が同一又は類似の用語で多様に命名される。 Likewise, the second type of line may be referred to by various terms with the same or similar meaning, such as vertical line, longitudinal line, column line, or line in the vertical direction.

一方、テーブル２００に含まれるセルの数、セルの大きさ又はセルの位置などは、横ライン及び縦ラインの数や配置関係などに基づいて様々に変形することができ、セルの結合によっても様々に変形することができる。 Meanwhile, the number, size, and position of the cells included in the table 200 can vary with the number and arrangement of the horizontal and vertical lines, and can also vary when cells are merged.

また、本発明におけるテーブルを構成する構成成分は、テーブルを構成するライン（横ライン及び縦ライン）と、テーブルを構成するラインが交差して形成されるコーナー（又は角、頂点）（符号「ｅ、ｆ、ｇ、ｈ」参照）とを含んでもよい。 In addition, the components constituting a table in the present invention may include the lines constituting the table (horizontal lines and vertical lines) and the corners (or vertices) formed where those lines intersect (see reference signs "e, f, g, h").

全てのテーブルが少なくとも４つのコーナーを有し、それはテーブルの最外枠に含まれるコーナーであり得る。また、テーブルに含まれるコーナーの数は、テーブルに含まれるセルの数に応じて異なる。 All tables have at least four corners, which may be the corners contained in the outermost frame of the table. Also, the number of corners included in the table varies depending on the number of cells included in the table.

以下、前述した本発明による文字認識システムの構成に基づいて、テーブルを生成する方法についてより具体的に説明する。 Hereinafter, a method for generating a table will be described in more detail based on the configuration of the character recognition system according to the present invention described above.

図３に示すように、本発明による文字認識方法においては、まず、テーブル１１００を含むイメージ１０００（図１参照）を受信する過程が行われる（Ｓ３１０）。 As shown in FIG. 3, in the character recognition method according to the present invention, first, a process of receiving an image 1000 (see FIG. 1) including a table 1100 is performed (S310).

前述したように、テーブル１１００を含むイメージ１０００は、様々なルートで受信することができる。例えば、イメージ１０００は、通信部により伝送される方式、スキャン部によりスキャンされる方式、入力部により入力される方式などで受信することができる。 As previously discussed, image 1000 including table 1100 may be received via a variety of routes. For example, the image 1000 can be received by being transmitted by a communication unit, scanned by a scanning unit, input by an input unit, etc.

イメージ１０００が受信されると、イメージ１０００に含まれるテーブル１１００からテキストを認識する過程が行われる。 Once image 1000 is received, a process of recognizing text from table 1100 contained in image 1000 is performed.

より具体的には、本発明においては、テーブル１１００を構成する複数のセルに含まれるテキストを認識する過程が行われる（Ｓ３２０）。このような認識はＯＣＲ部１３０で行われ、ＯＣＲ部１３０により、テーブル１１００に含まれる情報又は内容が認識される。ここで、テーブル１１００に含まれる情報又は内容が認識されるとは、テーブル１１００に含まれるテキストが認識されることを意味する。 More specifically, in the present invention, a process of recognizing text included in a plurality of cells constituting the table 1100 is performed (S320). Such recognition is performed by the OCR unit 130, and the information or content included in the table 1100 is recognized by the OCR unit 130. Here, "recognizing the information or contents included in table 1100" means that text included in table 1100 is recognized.

ＯＣＲ部１３０による認識の結果、テーブル１１００に含まれる内容に該当するデータを確保することができる。例えば、ＯＣＲ部１３０は、テーブル１１００に含まれるテキストを認識し、テーブル１１００に含まれる内容に該当するテキスト、例えば「患者氏名」、「住民登録番号」、「診療費（薬剤費）内訳」などのテキストをデータとして確保することができる。ＯＣＲ部１３０は、イメージ１０００に含まれる各テキスト（又は文字）及びテキストの位置を認識するために訓練されたテキスト認識モデルにより、イメージ１０００からテキストを認識することができる。このようなテキスト認識モデルは、人工知能に基づくアルゴリズム（例えば、ディープラーニングアルゴリズム）を含んでもよい。 As a result of recognition by the OCR unit 130, data corresponding to the contents of the table 1100 can be obtained. For example, the OCR unit 130 can recognize the text included in the table 1100 and obtain, as data, the text corresponding to its contents, such as "patient name", "resident registration number", and "Medical expenses (drug expenses) breakdown". The OCR unit 130 can recognize text from the image 1000 using a text recognition model trained to recognize each text (or character) in the image 1000 and its position. Such a text recognition model may include an artificial-intelligence-based algorithm (for example, a deep learning algorithm).

図４の（ａ）に示すように、ＯＣＲ部１３０は、テーブル１１００からテキストを認識し、図４の（ｂ）に示すように、認識されたテキストは、テキストが含まれるそれぞれのセルの識別情報ＫＥＹ１、ＫＥＹ２、ＫＥＹ３．．．と共にマッチングされて保存部１２０に保存される。 As shown in FIG. 4(a), the OCR unit 130 recognizes text from the table 1100, and, as shown in FIG. 4(b), each recognized text is matched with the identification information KEY1, KEY2, KEY3, ... of the cell that contains it and stored in the storage unit 120.

このように、テーブル１１００を構成するセルに対してテキストが認識されると、認識されたテキストの検証を行う過程が行われてもよい。ここで、「検証」とは、それぞれのセルに含まれるテキストが正確に認識されたか否かを確認する過程を意味する。例えば、図４の（ａ）に示すように、テーブル１１００の特定のセル４０１には「組合負担額」というテキストが含まれるが、図４の（ｂ）に示すように、ＯＣＲ部１３０により「粗合負担額」４０２というテキストが認識されることがある。 In this way, once text has been recognized for the cells constituting the table 1100, a process of verifying the recognized text may be performed. Here, "verification" means the process of checking whether the text included in each cell has been recognized correctly. For example, as shown in FIG. 4(a), a specific cell 401 of the table 1100 contains the text "組合負担額" (association contribution amount), but, as shown in FIG. 4(b), the OCR unit 130 may recognize it as "粗合負担額" 402, misreading the first character.

この場合、原本資料とは異なる内容が認識（誤認識）されることにより、認識されるデータの信頼度及び正確度の問題が生じる。 In this case, content different from the original document is recognized (erroneously recognized), causing problems with the reliability and accuracy of the recognized data.

よって、制御部１４０は、テーブル１１００から認識されたテキストの検証を行うようにしてもよい。この場合、本発明においては、それぞれのセル間の相互位置関係及び意味関係（又は連結関係）に基づいて、検証過程で補正が必要なセルに含まれるテキストの補正を行うようにしてもよい。 Therefore, the control unit 140 may verify the text recognized from the table 1100. In this case, in the present invention, the text included in the cells that require correction may be corrected in the verification process based on the mutual positional relationship and semantic relationship (or connection relationship) between the cells.

そのために、本発明においては、複数のセルから補正対象セルを特定する過程が行われる（Ｓ３３０）。制御部１４０は、予め設定された基準に基づいて、複数のセルから少なくとも１つの補正対象セルを特定するようにしてもよい。ここで、予め設定された基準は非常に多様に設定することができ、よって、本発明において、補正対象セルが特定される過程は非常に多様である。 To this end, in the present invention, a process of identifying a correction target cell from a plurality of cells is performed (S330). The control unit 140 may specify at least one correction target cell from a plurality of cells based on preset criteria. Here, the predetermined criteria can be set in a wide variety of ways, and therefore, in the present invention, the process for specifying the correction target cell is very diverse.

例えば、制御部１４０は、テーブル１１００を構成する複数のセルに対してテキストが認識されると、テキストの認識が行われた全てのセルの検証を行うようにしてもよい。 For example, when text is recognized for a plurality of cells forming the table 1100, the control unit 140 may verify all the cells in which text has been recognized.

この場合、テキストの認識が行われた全てのセルが補正対象セルとしてそれぞれ特定される。 In this case, all cells in which text has been recognized are identified as cells to be corrected.

それとは異なり、制御部１４０は、テーブル１１００を構成する複数のセルのうち、カテゴリーセルに含まれるテキストに対して検証を行うようにしてもよい。 On the other hand, the control unit 140 may verify text included in a category cell among a plurality of cells configuring the table 1100.

この場合、制御部１４０は、テーブル１１００を構成する複数のセルのうち、カテゴリーセルに含まれるテキストのそれぞれを補正対象セルとして特定する。 In this case, the control unit 140 specifies each of the texts included in the category cell among the plurality of cells configuring the table 1100 as a correction target cell.

ここで、カテゴリーセルは、キーセルとも命名され、データセル（又はバリューセル）とは区分されるものである。 Here, the category cell is also called a key cell and is distinguished from a data cell (or value cell).

カテゴリーセルは、タイトルセルとも命名され、データセルに含まれるデータ（テキスト）のカテゴリー、種類、所属、特徴などを定義する意味のテキストが含まれるセルとして理解される。 A category cell is also named a title cell, and is understood as a cell that includes text with a meaning that defines the category, type, affiliation, characteristics, etc. of data (text) included in the data cell.

カテゴリーは、同一又は類似の性質や意味を基準に分けられるものであり、範疇とも命名される。同一のカテゴリーに属するデータは、同一の意味、同一の種類、又は同一の所属に該当するデータであり得る。 A category is a grouping based on the same or similar properties or meanings, and is also referred to by the synonymous term 範疇. Data belonging to the same category may be data that has the same meaning, the same type, or the same affiliation.

カテゴリーセルには、カテゴリー名又はカテゴリー名に関連するデータが含まれ、データセルには、カテゴリーセルに含まれるカテゴリー名に属するデータが含まれる。ここで、カテゴリー名は範疇名とも命名される。 A category cell contains a category name or data related to a category name, and a data cell contains data belonging to the category name contained in the category cell. Here, the category name is also referred to by the synonymous term 範疇名.

一例として、図４の（ａ）に示すように、テーブル１１００において、「診療，調剤日付（診療期間）」というテキストが含まれる特定のセル４１２は、カテゴリーセルであり、特定のセル４１２と同じ列に位置し、日付情報がそれぞれ含まれるセル４０３は、特定のセル４１２に関連するデータセルである。 As an example, as shown in FIG. 4(a), in the table 1100, the specific cell 412 containing the text "Medical treatment, dispensing date (medical treatment period)" is a category cell, and the cells 403, which are located in the same column as the specific cell 412 and each contain date information, are data cells related to the specific cell 412.

他の例として、テーブル１１００において、「患者氏名」というテキストが含まれる特定のセル４１１は、カテゴリーセルであり、特定のセル４１１と同じ行に位置し、名前情報「ホン・ギルドン」が含まれるセル４０５は、特定のセル４１１に関連するデータセルである。 As another example, in the table 1100, a specific cell 411 that includes the text "Patient Name" is a category cell, is located in the same row as the specific cell 411, and includes name information "Hong Gil-dong." Cell 405 is a data cell associated with particular cell 411.

図４に示すように、テーブル１１００には、１つ又はそれ以上のカテゴリーセル４１１、４１２、４１３、４１４、４１５、４１６が含まれる。 As shown in FIG. 4, table 1100 includes one or more category cells 411, 412, 413, 414, 415, 416.

制御部１４０は、テーブル１１００の構造及びセルに含まれるテキストの意味に基づいて、テーブル１１００を構成する複数のセルのタイプを第１タイプ（カテゴリーセルタイプ）及び第２タイプ（データセルタイプ）のいずれかに特定することができる。 Based on the structure of the table 1100 and the meaning of the text included in the cells, the control unit 140 divides the types of the plurality of cells making up the table 1100 into a first type (category cell type) and a second type (data cell type). It can be specified as either.

また、制御部１４０は、テーブル１１００を構成する複数のセルのうち、カテゴリーセルに含まれるテキストのそれぞれを、補正対象セルとして特定することができる。 Further, the control unit 140 can specify each text included in a category cell among the plurality of cells configuring the table 1100 as a correction target cell.

他の例として、制御部１４０は、テーブル１１００を構成する複数のセルのうち、カテゴリーセルの少なくとも一部を、補正対象セルとして特定することができる。ここで、補正対象セルを特定するための予め設定された基準は、前記複数のセルに含まれるテキストに対する認識率に関連するものであってもよい。制御部１４０は、カテゴリーセルから抽出されたテキストのうち、テキストの認識及び抽出当時にテキスト認識率が低い特定のセルを、補正対象セルとして特定することができる。 As another example, the control unit 140 can specify at least some of the category cells among the plurality of cells making up the table 1100 as correction target cells. Here, the preset standard for identifying the cells to be corrected may be related to the recognition rate for text included in the plurality of cells. The control unit 140 can specify, as a correction target cell, a specific cell that has a low text recognition rate at the time of text recognition and extraction, among the texts extracted from the category cells.

図５に示すように、ＯＣＲ部１３０によりそれぞれのセルに対してテキストを認識する場合、それぞれのテキストの認識の信頼度を特定することができる。 As shown in FIG. 5, when text is recognized for each cell by the OCR unit 130, the reliability of recognition of each text can be specified.

このような信頼度は、信頼度スコア（ｃｏｎｆｉｄｅｎｃｅｓｃｏｒｅ）又は信頼スコアとも命名される。 Such reliability is also referred to as a reliability score (confidence score) or a trust score.

図５に示すように、このような信頼度スコアは、テーブル１１００を構成するそれぞれのセル及びセルから認識されたテキストの少なくとも一方とマッチングされて保存部１２０に保存されてもよい。 As shown in FIG. 5, the reliability score may be matched with at least one of each cell constituting the table 1100 and the text recognized from the cell and stored in the storage unit 120.

制御部１４０は、図５に示すように、テキストの信頼度が予め設定された基準条件を満たすカテゴリーセル（例えば、テキストの信頼度が基準スコア（基準確率など）未満であるセル）を補正対象セルとして特定することができる。図５によれば、信頼スコアが１に近いほど、認識されたテキストが正確に認識された確率が高く、信頼スコアが０に近いほど、認識されたテキストが正確に認識された確率が低いことを意味する。 As shown in FIG. 5, the control unit 140 can identify, as correction target cells, category cells whose text reliability satisfies a preset reference condition (for example, cells whose text reliability is below a reference score, such as a reference probability). In FIG. 5, a confidence score closer to 1 means a higher probability that the text was recognized correctly, and a confidence score closer to 0 means a lower probability.

本発明において、制御部１４０は、テキストが正確に認識された確率が低いセルに含まれるテキストの補正を行うことにより、データ処理の演算量を低減し、テーブル１１００に含まれる情報を正確に抽出することができる。このように、制御部１４０は、認識された全てのテキストを補正対象テキストとして特定するのではなく、信頼度に基づいてテキストが正確に認識された確率が低いテキストを補正対象テキストとして特定する。よって、制御部１４０は、補正対象テキストとして特定されたテキストに対してのみ本発明による補正を行うので、全てのテキストに対して補正を行う場合より、データ処理の演算量を低減することができる。 In the present invention, the control unit 140 corrects only the text in cells where the probability of correct recognition is low, which reduces the amount of computation for data processing while still extracting the information in the table 1100 accurately. That is, rather than treating all recognized text as text to be corrected, the control unit 140 uses the reliability to identify the text that is unlikely to have been recognized correctly as the text to be corrected. Because the correction according to the present invention is applied only to that text, the amount of computation is lower than when every text is corrected.

例えば、予め設定された基準条件が、テキストの信頼スコアが０．５未満であるセルを補正するように設定された場合、制御部１４０は、図５において、「ＫＥＹ７」に該当するセルを補正対象セルとして特定する。この場合、セル「ＫＥＹ７」に含まれる「(1)粗合負担額」に該当するテキストの補正を行う。 For example, if the preset reference condition is set so that cells whose text confidence score is less than 0.5 are corrected, the control unit 140 identifies the cell corresponding to "KEY7" in FIG. 5 as the correction target cell. In this case, the misrecognized text "(1)粗合負担額" included in cell "KEY7" is corrected.
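The threshold-based selection in this example can be sketched as follows. The class and function names are hypothetical, the selection is restricted to category cells as in the variant described above, and the confidence values are invented for illustration; they are not the values shown in FIG. 5.

```python
from dataclasses import dataclass


@dataclass
class RecognizedCell:
    key: str           # cell identification information, e.g. "KEY7"
    text: str          # text returned by OCR for this cell
    confidence: float  # confidence score in [0, 1]
    is_category: bool  # True for a category (key) cell, False for a data cell


def select_correction_targets(cells, threshold=0.5):
    """Return the category cells whose confidence score is below the threshold."""
    return [c for c in cells if c.is_category and c.confidence < threshold]


# Hypothetical OCR output; the scores are made up for this sketch.
cells = [
    RecognizedCell("KEY1", "患者氏名", 0.98, True),
    RecognizedCell("KEY7", "(1)粗合負担額", 0.41, True),
    RecognizedCell("KEY9", "12,000", 0.30, False),  # data cell: not selected
]

targets = select_correction_targets(cells)
```

With these inputs only the cell "KEY7" is selected, so the later correction steps run on one cell instead of every recognized cell.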

このように、様々な方法又は基準に基づいて補正対象セルが特定されると、制御部１４０は、テーブル１１００に含まれる複数のセルのうち補正対象セルに関連する少なくとも１つの関連セル及びそれに含まれるコンテンツ（テキスト）を用いて、補正対象セルに含まれるテキストの補正を行うことができる。 In this way, once a correction target cell has been identified by any of various methods or criteria, the control unit 140 can correct the text included in the correction target cell by using at least one related cell, among the plurality of cells included in the table 1100, that is related to the correction target cell, together with the content (text) of that related cell.

そのために、本発明においては、テーブル１１００に含まれる複数のセルから補正対象セルに関連する関連セルを特定する過程が行われる（Ｓ３４０）。 To this end, in the present invention, a process of identifying related cells related to the correction target cell from a plurality of cells included in the table 1100 is performed (S340).

以下、説明の便宜上、補正対象セルが図６Aに示す「(1)組合負担額」というテキストが含まれる特定のカテゴリーセル６０１であると仮定して説明する。以下の説明は、全てのカテゴリーセルに対して共通に適用することができる。 For convenience of explanation, the following description assumes that the correction target cell is the specific category cell 601 shown in FIG. 6A, which contains the text "(1)組合負担額". The description applies equally to all category cells.

図４、図６A及び図６Bに示すように、イメージ１０００において、特定のカテゴリーセル６０１に「(1)組合負担額」に該当するテキストが含まれるが、図５において説明したように、ＯＣＲ認識エラーにより「(1)粗合負担額」というテキストが認識されることがある。よって、制御部１４０は、誤って認識されたテキストが含まれる特定のカテゴリーセル６０１及び特定のカテゴリーセル６０１に関連する少なくとも１つの関連セルを用いて、誤って認識されたテキスト（例えば、(1)粗合負担額）の補正を行うことができる。 As shown in FIG. 4, FIG. 6A, and FIG. 6B, in the image 1000 the specific category cell 601 contains the text "(1)組合負担額", but, as explained with reference to FIG. 5, an OCR recognition error may cause the text "(1)粗合負担額" to be recognized instead. The control unit 140 can therefore correct the misrecognized text (for example, "(1)粗合負担額") by using the specific category cell 601 that contains the misrecognized text and at least one related cell related to the specific category cell 601.

制御部１４０は、補正対象セル６０１を基準として、少なくとも１つの関連セルを特定することができる。 The control unit 140 can specify at least one related cell using the correction target cell 601 as a reference.

制御部１４０は、補正対象セル６０１に含まれるテキストの意味及び補正対象セル６０１の位置の少なくとも一方に基づいて、関連セルを特定することができる。 The control unit 140 can identify related cells based on at least one of the meaning of the text included in the correction target cell 601 and the position of the correction target cell 601.

ここで、補正対象セル６０１の関連セルは、カテゴリーセルであってもよい。制御部１４０は、テーブル１１００に含まれるカテゴリーセル及びデータセルのうち、カテゴリーセルのみを補正対象セル６０１の関連セルとして特定することができる。 Here, the related cell of the correction target cell 601 may be a category cell. The control unit 140 can specify only the category cells among the category cells and data cells included in the table 1100 as cells related to the correction target cell 601.

また、制御部１４０は、カテゴリーセルのうち、意味的に補正対象セル６０１に関連するテキストが含まれるカテゴリーセルを関連セルとして特定するか、又は位置的に補正対象セル６０１に隣接するカテゴリーセルを関連セルとして特定することができる。 Furthermore, the control unit 140 specifies, among the category cells, a category cell that includes text that is semantically related to the correction target cell 601 as a related cell, or identifies a category cell that is positionally adjacent to the correction target cell 601. It can be identified as a related cell.

制御部１４０は、補正対象セル６０１に関連するセルを特定する際に、以下のケースのいずれかに基づいて関連セルを特定することができる。 When specifying a cell related to the correction target cell 601, the control unit 140 can specify the related cell based on any of the following cases.

第１ケースとして、制御部１４０は、意味的に補正対象セル６０１に関連するテキストが含まれるカテゴリーセルを関連セルとして特定することができる。 As a first case, the control unit 140 can specify a category cell that includes text that is semantically related to the correction target cell 601 as a related cell.

また、第２ケースとして、制御部１４０は、位置的に補正対象セル６０１に関連するカテゴリーセルを関連セルとして特定することができる。 Furthermore, as a second case, the control unit 140 can specify a category cell that is positionally related to the correction target cell 601 as a related cell.

さらに、第３ケースとして、制御部１４０は、第１ケースと第２ケースとを組み合わせ、意味的に補正対象セル６０１に関連するテキストが含まれるカテゴリーセルを関連セルとして特定し、かつ位置的に補正対象セル６０１に関連するカテゴリーセルを関連セルとして特定することができる。第３ケース、第１ケース及び第２ケースにおいて関連セルとして特定されたセルの全てを補正対象セル６０１の関連セルとして特定することができる。 Furthermore, as a third case, the control unit 140 can combine the first and second cases, identifying as related cells both the category cells that contain text semantically related to the correction target cell 601 and the category cells that are positionally related to the correction target cell 601. In the third case, all cells identified as related cells in the first case and the second case can be identified as related cells of the correction target cell 601.

以下、第１ケースの具体的な例として、補正対象セル６０１に含まれるテキストの意味に基づいて関連セルを特定する方法について説明する。 Hereinafter, as a specific example of the first case, a method of identifying related cells based on the meaning of the text included in the correction target cell 601 will be described.

まず、補正対象セル６０１に含まれるテキストの意味に基づいて関連セルを特定する方法について説明すると、制御部１４０は、補正対象セル６０１に含まれるテキストに関連する意味に該当するテキストが含まれる少なくとも１つのカテゴリーセルを関連セルとして特定することができる。 First, a method for identifying related cells based on the meaning of the text included in the correction target cell 601 will be explained. One category cell can be identified as a related cell.

制御部１４０は、人工知能アルゴリズムに基づいて、補正対象セル６０１に含まれるテキストの意味及び他のカテゴリーセルに含まれるテキストの意味を分析し、互いに関連する意味を有するテキストが含まれる少なくとも１つのカテゴリーセルを関連セルとして特定することができる。 The control unit 140 analyzes the meaning of the text included in the correction target cell 601 and the meanings of the texts included in other category cells based on an artificial intelligence algorithm, and selects at least one text that includes texts with mutually related meanings. Category cells can be identified as related cells.

補正対象セル６０１に関連する意味を有するテキストは、補正対象セル６０１に含まれるテキストの上位概念又は下位概念の意味を有するテキストであってもよい。 The text having a meaning related to the correction target cell 601 may be a text having a meaning of a superordinate concept or a subordinate concept of the text included in the correction target cell 601.

制御部１４０は、カテゴリーセルに対して補正対象セル６０１及び関連セルを特定するので、カテゴリーセルの特性上、補正対象セル６０１は、関連セルに対して上位概念又は下位概念に該当する意味を有するテキストを含む。例えば、補正対象セル６０１と関連セルとは、同一又は関連するカテゴリー（種類）に該当する意味を有するテキストで構成される。 Since the control unit 140 specifies the correction target cell 601 and related cells for the category cell, due to the characteristics of the category cell, the correction target cell 601 has a meaning that corresponds to a superordinate concept or a subordinate concept with respect to the related cell. Contains text. For example, the correction target cell 601 and related cells are composed of texts having meanings that fall under the same or related categories (types).

例えば、図６Aに示すように、第１カテゴリーセル６１１に含まれるテキスト（「診療費（薬剤費）内訳」）は、第２カテゴリーセル６１２（「総額(1)＋(2)＋(3)」）、第３カテゴリーセル６１３（「給与」）、第４カテゴリーセル６１５（「(2)患者負担額」）、第５カテゴリーセル６１８（「患者負担総額(2)＋(3)」）、第６カテゴリーセル６０１（補正対象セル、「(1)組合負担額」、ＯＣＲ認識時には「(1)粗合負担額」として認識される）、第７カテゴリーセル６１７（「(3)患者負担額」）、第８カテゴリーセル６１６（「非給与」）に含まれるテキストに対する上位概念の意味を有する。 For example, as shown in FIG. 6A, the text included in the first category cell 611 ("Medical expenses (drug expenses) breakdown") is superordinate in meaning to the text included in the second category cell 612 ("total amount (1)+(2)+(3)"), the third category cell 613 ("salary"), the fourth category cell 615 ("(2) patient burden amount"), the fifth category cell 618 ("total patient burden (2)+(3)"), the sixth category cell 601 (the correction target cell; "(1)組合負担額", recognized by OCR as "(1)粗合負担額"), the seventh category cell 617 ("(3) patient burden amount"), and the eighth category cell 616 ("non-salary").

また、第１カテゴリーセル６１１に対して下位概念のカテゴリーセルは、他のカテゴリーセルに対しては上位概念のカテゴリーセルでもある。例えば、第３カテゴリーセル６１３に含まれるテキスト「給与」は、第１カテゴリーセル６１１に含まれるテキスト「診療費（薬剤費）内訳」に対しては下位概念であり、第４カテゴリーセル６１５に含まれるテキスト「(2)患者負担額」及び第６カテゴリーセル６０１（補正対象セル）に含まれるテキスト「(1)組合負担額」（ＯＣＲ認識時には「(1)粗合負担額」として認識される）に対しては上位概念である。 Furthermore, a category cell that is subordinate to the first category cell 611 may itself be superordinate to other category cells. For example, the text "salary" in the third category cell 613 is subordinate to the text "Medical expenses (drug expenses) breakdown" in the first category cell 611, and is superordinate to the text "(2) patient burden amount" in the fourth category cell 615 and to the text "(1)組合負担額" (recognized by OCR as "(1)粗合負担額") in the sixth category cell 601 (the correction target cell).

一方、制御部１４０は、カテゴリーセルのうち、意味的に補正対象セル６０１に関連するテキストが含まれるカテゴリーセルを関連セルとして特定することができ、より具体的には、補正対象セル６０１に含まれるテキストの上位概念又は下位概念の意味を有するテキストが含まれるカテゴリーセルを関連セルとして特定することができる。 On the other hand, among the category cells, the control unit 140 can specify a category cell that includes text that is semantically related to the correction target cell 601 as a related cell. A category cell containing a text that has a meaning of a superordinate concept or a subordinate concept of the text can be specified as a related cell.

第１ケースにおいて、補正対象セル６０１の関連セルは、補正対象セル６０１の上位概念のテキストが含まれる第１カテゴリーセル６１１（「診療費（薬剤費）内訳」）及び第３カテゴリーセル６１３（「給与」）に決定される。 In the first case, the related cells of the correction target cell 601 are determined to be the first category cell 611 ("Medical expenses (drug expenses) breakdown") and the third category cell 613 ("salary"), each of which contains text that is superordinate to the text of the correction target cell 601.
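The first case can be illustrated with a small hierarchy lookup. This sketch assumes the superordinate/subordinate relations between category cells are already known and stored as child-to-parent links; how the control unit 140 derives them (the text refers only to an artificial intelligence algorithm) is outside its scope. Only relations stated in the description are encoded, the direct parent of cells 612 and 616 is an assumption, and the function names are hypothetical.

```python
# Child -> parent links between category cells, keyed by the reference
# numerals of FIG. 6A. Treating 611 as the direct parent of 612 and 616
# is an assumption; the text only says 611 is superordinate to them.
PARENT = {601: 613, 615: 613, 613: 611, 612: 611, 616: 611}


def ancestors(cell, parent=PARENT):
    """Cells holding superordinate concepts, nearest first."""
    chain = []
    while cell in parent:
        cell = parent[cell]
        chain.append(cell)
    return chain


def descendants(cell, parent=PARENT):
    """Cells holding subordinate concepts, at any depth."""
    found, frontier = [], [cell]
    while frontier:
        current = frontier.pop()
        children = sorted(c for c, p in parent.items() if p == current)
        found.extend(children)
        frontier.extend(children)
    return found


def semantic_related(cell):
    """Related cells under the first case: superordinate or subordinate cells."""
    return sorted(ancestors(cell) + descendants(cell))


related = semantic_related(601)
```

For the correction target cell 601 this yields cells 611 and 613, matching the result stated above; cell 601 has no subordinate cells in this hierarchy.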

以下、第２ケースの具体的な例として、位置的に補正対象セル６０１に関連するカテゴリーセルを関連セルとして特定する方法について説明する。より具体的には、制御部１４０は、テーブル１１００の特性を考慮して、補正対象セル６０１と予め設定された位置関係（又は配列関係）を有するカテゴリーセルを、補正対象セル６０１に含まれるテキストの意味に関連するカテゴリーセル、すなわち関連セルとして判断することができる。 Hereinafter, as a specific example of the second case, a method of identifying category cells that are positionally related to the correction target cell 601 as related cells will be described. More specifically, taking the characteristics of the table 1100 into account, the control unit 140 can judge a category cell that has a preset positional relationship (or arrangement relationship) with the correction target cell 601 to be a category cell related to the meaning of the text in the correction target cell 601, that is, a related cell.

ここで、予め設定された位置関係は、補正対象セル６０１に隣接して位置することを意味するものであってもよい。例えば、第２カテゴリーセル６１２（「総額(1)＋(2)＋(3)」）、第３カテゴリーセル６１３（「給与」）、第４カテゴリーセル６１５（「(2)患者負担額」）及び特定のデータセル６２２は、補正対象セル６０１に隣接して位置するものであり、補正対象セル６０１と予め設定された位置関係を有するといえる。 Here, the preset positional relationship may mean being located adjacent to the correction target cell 601. For example, the second category cell 612 (“total amount (1)+(2)+(3)”), the third category cell 613 (“salary”), and the fourth category cell 615 (“(2) patient burden amount”) The specific data cell 622 is located adjacent to the correction target cell 601, and can be said to have a preset positional relationship with the correction target cell 601.

このように、補正対象セル６０１と予め設定された位置関係を有するセルは、補正対象セル６０１を基準にして補正対象セル６０１と列方向ａ又は行方向ｂに並んで位置し、補正対象セル６０１に隣接して位置するセルに該当するものである。 In this way, a cell having the preset positional relationship with the correction target cell 601 is a cell that is aligned with the correction target cell 601 in the column direction a or the row direction b and is located adjacent to the correction target cell 601.

ここで、補正対象セル６０１の列方向ａに並んで位置するセルは、図６Aの符号６２１、６１１、６１３、６０１、６２２に該当するセルであり、補正対象セル６０１の行方向ｂに並んで位置するセルは、図６Aの符号６４１、６４２、６１２、６０１、６１５、６１７、６１８、６４３、６４４、６４５に該当するセルである。 Here, the cells located side by side in the column direction a of the correction target cell 601 are cells corresponding to symbols 621, 611, 613, 601, and 622 in FIG. 6A, and the cells located side by side in the row direction b of the correction target cell 601 The located cells are cells corresponding to 641, 642, 612, 601, 615, 617, 618, 643, 644, and 645 in FIG. 6A.

一方、制御部１４０は、カテゴリーセルのみを補正対象セル６０１の関連セルとして特定するので、前述した予め設定された位置関係を有するセル（第２カテゴリーセル６１２（「総額(1)＋(2)＋(3)」）、第３カテゴリーセル６１３（「給与」）、第４カテゴリーセル６１５（「(2)患者負担額」）及び特定のデータセル６２２）のうち特定のデータセル６２２は関連セルから除外される。 Meanwhile, since the control unit 140 identifies only category cells as related cells of the correction target cell 601, the specific data cell 622 is excluded from the related cells among the cells having the preset positional relationship described above (the second category cell 612 ("total amount (1)+(2)+(3)"), the third category cell 613 ("salary"), the fourth category cell 615 ("(2) patient burden amount"), and the specific data cell 622).

その結果、制御部１４０は、図６Bに示すように、予め設定された位置関係を有するセルから、カテゴリーセルに該当する第２カテゴリーセル６１２（「総額(1)＋(2)＋(3)」）、第３カテゴリーセル６１３（「給与」）、第４カテゴリーセル６１５（「(2)患者負担額」）を関連セルとして特定することができる。 As a result, as shown in FIG. 6B, the control unit 140 selects the second category cell 612 corresponding to the category cell from the cells having the preset positional relationship ("total amount (1) + (2) + (3) ”), the third category cell 613 (“salary”), and the fourth category cell 615 (“(2) patient burden amount”) can be specified as related cells.

第２ケースによれば、補正対象セル６０１の関連セルとして、第２カテゴリーセル６１２（「総額(1)＋(2)＋(3)」）、第３カテゴリーセル６１３（「給与」）、第４カテゴリーセル６１５（「(2)患者負担額」）が特定される。 According to the second case, the second category cell 612 ("total amount (1)+(2)+(3)"), the third category cell 613 ("salary"), and the fourth category cell 615 ("(2) patient burden amount") are identified as the related cells of the correction target cell 601.

一方、制御部１４０は、補正対象セル６０１と位置的関係性を有する関連セルを特定するために、補正対象セル６０１のコーナー（頂点）の中央値をテーブル１１００又はイメージ１０００上での補正対象セル６０１の位置として特定し、当該位置を基準として、予め設定された距離以内に位置する中央値を有するカテゴリーセル６１２、６１３、６１５を関連セル６１２、６１３、６１５として特定することができる。 Meanwhile, to identify related cells that have a positional relationship with the correction target cell 601, the control unit 140 can take the median value of the corners (vertices) of the correction target cell 601 as the position of that cell on the table 1100 or the image 1000, and can identify the category cells 612, 613, 615 whose median values lie within a preset distance of that position as the related cells 612, 613, 615.

ここで、予め設定された距離は、Ｌ２距離情報に基づくものであってもよく、Ｌ２距離情報は、ユークリッド距離（Ｅｕｃｌｉｄｅａｎｄｉｓｔａｎｃｅ）情報ともいえる。 Here, the preset distance may be based on L2 distance information, and the L2 distance information can also be called Euclidean distance information.

このように、制御部１４０は、テーブル１１００において、左、右、上、下方向毎に、補正対象セル６０１のコーナー（頂点）の中央値からそれぞれ予め設定された距離以内に位置する中央値を有するカテゴリーセルが存在するか否かを確認することができる。また、確認の結果、予め設定された距離以内に位置する中央値を有するカテゴリーセルが存在する場合、それを関連セルとして特定することができる。 In this way, the control unit 140 can check, for each of the left, right, up, and down directions in the table 1100, whether there is a category cell whose median value lies within the preset distance of the median value of the corners (vertices) of the correction target cell 601. If such a category cell exists, it can be identified as a related cell.
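The distance test described above can be sketched with the standard-library `math.dist`, which computes the Euclidean (L2) distance. Several assumptions are made here: a cell's position is taken to be the mean of its four corner coordinates (which coincides with the center of a rectangle), a single radius is used instead of a separate check per direction, and the coordinates, the 110-pixel limit, and the function names are all invented for illustration.

```python
import math


def rect(x0, y0, x1, y1):
    """Four corners of an axis-aligned cell."""
    return [(x0, y0), (x1, y0), (x1, y1), (x0, y1)]


def center(corners):
    """Position of a cell: mean of its corner coordinates (assumed reading)."""
    xs = [x for x, _ in corners]
    ys = [y for _, y in corners]
    return (sum(xs) / len(xs), sum(ys) / len(ys))


def positional_related(target_id, cells, max_distance):
    """Category cells whose center lies within max_distance (L2) of the target."""
    origin = center(cells[target_id]["corners"])
    return sorted(
        cell_id
        for cell_id, cell in cells.items()
        if cell_id != target_id
        and cell["is_category"]
        and math.dist(origin, center(cell["corners"])) <= max_distance
    )


# Hypothetical layout loosely following FIG. 6A; coordinates are made up.
cells = {
    601: {"corners": rect(100, 80, 200, 120), "is_category": True},   # target
    612: {"corners": rect(0, 40, 100, 120), "is_category": True},     # left
    613: {"corners": rect(100, 40, 300, 80), "is_category": True},    # above
    615: {"corners": rect(200, 80, 300, 120), "is_category": True},   # right
    622: {"corners": rect(100, 120, 200, 160), "is_category": False}, # below, data
    618: {"corners": rect(400, 40, 500, 120), "is_category": True},   # far away
}

related = positional_related(601, cells, max_distance=110)
```

In this layout the data cell 622 is the nearest neighbour but is skipped because only category cells qualify, and cell 618 is skipped because it lies beyond the distance limit.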

前述したように、制御部１４０は、第１ケース又は第２ケースによって、補正対象セル６０１に関連する関連セルを特定することができる。 As described above, the control unit 140 can specify the related cells related to the correction target cell 601 according to the first case or the second case.

また、制御部１４０は、第１ケースにより特定された関連セル及び第２ケースにより特定された関連セルの全てを、補正対象セル６０１の関連セルとして特定することができる。この場合は前述した第３ケースに該当する。 Further, the control unit 140 can specify all of the related cells specified by the first case and the related cells specified by the second case as related cells of the correction target cell 601. This case corresponds to the third case mentioned above.

前述したように、制御部１４０は、テーブル１１００に含まれるカテゴリーセルのうち、第１ケースにより、意味的に補正対象セル６０１に関連するテキストが含まれるカテゴリーセルを関連セルとして特定し、また、第２ケースにより、位置的に補正対象セル６０１に隣接するカテゴリーセルを関連セルとして特定することができる。その結果、補正対象セル６０１の関連セルは、図６A及び図６Bに示すように、第１カテゴリーセル６１１（「診療費（薬剤費）内訳」）、第２カテゴリーセル６１２（「総額(1)＋(2)＋(3)」）、第３カテゴリーセル６１３（「給与」）、第４カテゴリーセル６１５（「(2)患者負担額」）となる。 As described above, the control unit 140 specifies, among the category cells included in the table 1100, a category cell that includes text that is semantically related to the correction target cell 601 in the first case as a related cell, and In the second case, a category cell that is located adjacent to the correction target cell 601 can be specified as a related cell. As a result, as shown in FIGS. 6A and 6B, the cells related to the correction target cell 601 are the first category cell 611 (“Medical expenses (drug expenses) breakdown”) and the second category cell 612 (“Total amount (1) +(2)+(3)''), the third category cell 613 (``salary''), and the fourth category cell 615 (``(2) patient burden amount'').

ここで、少なくとも１つのカテゴリーセル（例えば、第３カテゴリーセル６１３（「給与」））は、意味的及び位置的に重複して関連セルとして特定される。 Here, at least one category cell (for example, the third category cell 613 ("salary")) is identified as a related cell on both semantic and positional grounds.
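The combination used in the third case amounts to a set union, with the doubly identified cells given by the intersection. The sketch below uses the reference numerals from the description; the variable names are hypothetical.

```python
# Related cells found by each case for the correction target cell 601.
semantic_related = {611, 613}         # first case: superordinate concepts
positional_related = {612, 613, 615}  # second case: adjacent category cells

# Third case: every cell found by either case, counted once.
related = sorted(semantic_related | positional_related)

# Cells identified on both semantic and positional grounds.
overlap = sorted(semantic_related & positional_related)
```

Using sets means a cell found by both cases, such as cell 613, appears only once in the combined result.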

補正対象セル６０１の関連セルとして特定されたカテゴリーセル６１１、６１２、６１３、６１５は、「関連セル」とも命名されてもよいことは言うまでもなく、同一の符号を用いる。 It goes without saying that the category cells 611, 612, 613, and 615 identified as cells related to the correction target cell 601 may also be named "related cells," and the same reference numerals are used.

以下では、第３ケースの例示として、意味的関係及び位置的関係の両方を考慮して特定された関連セルを用いて補正対象セル６０１に含まれるテキストの補正を行う方法を例に挙げて説明する。 In the following, as an example of the third case, a method of correcting the text included in the correction target cell 601 using related cells identified in consideration of both semantic relationships and positional relationships will be explained. do.

前述したように、第３ケースにより、補正対象セル６０１及び少なくとも１つの関連セルが特定された場合、制御部１４０は、図６Cに示すように、補正対象セル６０１及び関連セル６１１、６１２、６１３、６１５に含まれるテキストを用いて補正対象セル６０１の埋め込み（ｅｍｂｅｄｄｉｎｇ）を行って計算（又は算出、導出）された密ベクトル（ｄｅｎｓｅｖｅｃｔｏｒ）を、補正対象セル６０１の補正に活用することができる。本発明においては、補正対象セル及び関連セルに含まれるテキスト間の関係性を示すために、補正対象セル及び関連セルに含まれるテキストの埋め込みを行い、補正対象セル及び関連セルに含まれるテキストをベクトルで示すことができる。 As described above, when the correction target cell 601 and at least one related cell have been identified according to the third case, the control unit 140 can, as shown in FIG. 6C, embed the correction target cell 601 using the text included in the correction target cell 601 and the related cells 611, 612, 613, 615, and use the resulting (computed or derived) dense vector to correct the correction target cell 601. In the present invention, to represent the relationships between the texts included in the correction target cell and the related cells, those texts are embedded and thereby expressed as vectors.

ここで、埋め込みは、埋め込みの対象となる情報をベクトルで表現する方法であり、本発明において、制御部１４０は、埋め込みにより、補正対象セル６０１及び関連セル６１１、６１２、６１３、６１５の情報を密ベクトルの形で表現することができる。 Here, embedding is a method of expressing information to be embedded as a vector, and in the present invention, the control unit 140 uses embedding to express information of the correction target cell 601 and related cells 611, 612, 613, and 615. It can be expressed in the form of a dense vector.

より具体的には、制御部１４０は、補正対象セル６０１及び関連セル６１１、６１２、６１３、６１５のそれぞれに含まれるテキストを用いて、補正対象セル６０１を示す密ベクトルを計算することができる。 More specifically, the control unit 140 can calculate a dense vector indicating the correction target cell 601 using text included in each of the correction target cell 601 and related cells 611, 612, 613, and 615.

また、制御部１４０は、補正対象セル６０１及び関連セル６１１、６１２、６１３、６１５を用いて計算された密ベクトルを、補正対象セル６０１の補正に用いることができる。 Further, the control unit 140 can use the dense vector calculated using the correction target cell 601 and the related cells 611, 612, 613, and 615 to correct the correction target cell 601.

In the present invention, the embedding algorithm is not particularly limited; any algorithm that can represent text as a dense vector can be used. For example, the control unit 140 can use an algorithm such as FastText or BERT to calculate a dense vector representing the correction target cell 601 from the text included in the correction target cell 601 and the related cells 611, 612, 613, and 615.
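As a rough illustration of turning the correction target cell and its related cells into a single vector, the sketch below uses hashed character bigrams as a toy stand-in for a learned model such as FastText or BERT. The names `embed_text` and `embed_cell`, the dimension `DIM`, the weighting of the target text, and the element-wise sum are all assumptions made for illustration; they are not taken from the patent.

```python
import hashlib
import math

DIM = 64  # illustrative dimensionality (assumption)

def char_ngrams(text, n=2):
    """Split text into overlapping character n-grams, with boundary markers."""
    padded = f"<{text}>"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

def embed_text(text, dim=DIM):
    """Toy fixed-length vector: hashed character-bigram counts, L2-normalized.

    A real system would call a trained embedding model here instead.
    """
    vec = [0.0] * dim
    for gram in char_ngrams(text):
        digest = hashlib.sha256(gram.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec

def embed_cell(target_text, related_texts, target_weight=2.0):
    """Combine the target text and the related texts into one cell vector."""
    total = [target_weight * v for v in embed_text(target_text)]
    for text in related_texts:
        total = [a + b for a, b in zip(total, embed_text(text))]
    norm = math.sqrt(sum(v * v for v in total))
    return [v / norm for v in total] if norm else total

# Texts of the correction target cell 601 and related cells 611, 612, 613, 615.
cell_vector = embed_cell(
    "(1)粗合負担額",
    ["診療費（薬剤費）内訳", "総額(1)＋(2)＋(3)", "給与", "(2)患者負担額"],
)
```

Because the related texts contribute to the vector, two cells with the same misrecognized text but different surrounding categories would receive different vectors, which is the property the correction step relies on.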

To perform the embedding, the control unit 140 can arrange the texts 631, 632, 633, 634, and 635 included in the correction target cell 601 and the related cells 611, 612, 613, and 615 in a sequence, as shown in FIG. 6C. Here, the control unit 140 can determine the order of the texts 631, 632, 633, 634, and 635 based on a preset criterion.

For example, the control unit 140 can place the texts 632 and 634 included in the semantically related cells 611 and 613 closer to the text 631 included in the correction target cell 601. Conversely, it can place the texts 633, 634, and 635 included in the positionally related cells 612, 613, and 615 closer to the text 631.

また、制御部１４０は、意味的関連性及び位置的関連性の両方を有する関連セル６１３に含まれるテキスト６３４を補正対象セル６０１に含まれるテキスト６３１に最も近い位置に配置することもできる。 Further, the control unit 140 can also place the text 634 included in the related cell 613 having both semantic and positional relationships at the position closest to the text 631 included in the correction target cell 601.
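One possible preset ordering criterion can be sketched as follows. This is only an illustration: the `RelatedCell` record, the `order_texts` function, and the `prefer_semantic` switch are hypothetical names, and the ranking (both relations first, then the preferred relation, then the other) is one way of combining the options described above.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class RelatedCell:
    text: str
    semantic: bool    # related by meaning (first case)
    positional: bool  # adjacent in the table (second case)

def order_texts(target_text, related, prefer_semantic=True):
    """Return the texts with the most strongly related ones nearest the target."""
    def rank(cell):
        if cell.semantic and cell.positional:
            return 0  # both relations: placed closest to the target text
        preferred = cell.semantic if prefer_semantic else cell.positional
        return 1 if preferred else 2
    # sorted() is stable, so cells of equal rank keep their input order.
    return [target_text] + [c.text for c in sorted(related, key=rank)]

related = [
    RelatedCell("診療費（薬剤費）内訳", semantic=True, positional=False),  # cell 611
    RelatedCell("総額(1)＋(2)＋(3)", semantic=False, positional=True),    # cell 612
    RelatedCell("給与", semantic=True, positional=True),                 # cell 613
    RelatedCell("(2)患者負担額", semantic=False, positional=True),        # cell 615
]
ordered = order_texts("(1)粗合負担額", related)
```

With `prefer_semantic=True`, the text of cell 613 comes directly after the target text, followed by the text of cell 611 and then the positionally related cells 612 and 615.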

In this way, through embedding, the control unit 140 can represent the correction target cell 601 as a vector based on the text included in the correction target cell 601 and the related cells 611, 612, 613, and 615. Once the embedding of the correction target cell 601 based on the texts 631, 632, 633, 634, and 635 is complete, the process of correcting the correction target cell is performed (S350).

The control unit 140 can correct the text included in the correction target cell 601 using the vector of the correction target cell, which is calculated from the texts 631, 632, 633, 634, and 635 included in the correction target cell 601 and the related cells. Here, the vector of the correction target cell 601 may correspond to the dense vector described above.

More specifically, the control unit 140 can extract, from a pre-specified data set, the word whose vector representation is most similar to the vector of the correction target cell 601 (for example, the word corresponding to the specific vector located closest to the vector of the correction target cell) as the correction target text, that is, the text to which the cell's content will be corrected.

The control unit 140 can extract the correction target text from the pre-specified data set based on the distance between the vector of the correction target cell, calculated using the correction target cell 601 and the related cells 611, 612, 613, and 615, and the vectors included in the data set. Here, the pre-specified data set may include candidate texts that can appear in the cells of the table 1100, in particular in category cells, together with the vectors corresponding to those candidate texts. Such a data set may be built in advance, and candidate texts can be added to or removed from the data set after it has been built.

Such a data set may also include the results of embedding performed based on at least one of the positional relationship between the cells constituting the table 1100 and the semantic relationship between the candidate texts included in those cells. That is, the data set may include vector information for each of the candidate texts.

制御部１４０は、データセットから、補正対象セル６０１のベクトルに最も近い特定のベクトルに対応するテキスト８０６（図７参照）を、補正対象テキストとして抽出することができる。 The control unit 140 can extract the text 806 (see FIG. 7) corresponding to the specific vector closest to the vector of the correction target cell 601 from the data set as the correction target text.

また、制御部１４０は、補正対象セル６０１に含まれるテキストを、補正対象セル６０１のベクトルに最も近い特定のベクトルに対応するテキスト８０６（図７参照）に変更することができる。 Further, the control unit 140 can change the text included in the correction target cell 601 to the text 806 (see FIG. 7) corresponding to a specific vector closest to the vector of the correction target cell 601.

In this way, the control unit 140 can extract the correction target text 806 from the data set using the vector of the correction target cell 601, which is derived from the texts 801 to 805 included in the correction target cell 601 and the related cells 611, 612, 613, and 615.

As a result, the text "(1)粗合負担額", which was misrecognized for the correction target cell 601, is corrected to "(1)組合負担額" ("(1) Union contribution amount").

このように、データセットから補正対象テキスト８０６が抽出されると、制御部１４０は、補正対象セル６０１に含まれるテキスト８０１を補正対象テキスト８０６に変更することができる。 In this manner, when the correction target text 806 is extracted from the data set, the control unit 140 can change the text 801 included in the correction target cell 601 to the correction target text 806.
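The lookup and replacement described above can be sketched as a nearest-neighbor search. The three-dimensional vectors below are made-up values chosen only to make the example readable, the choice of Euclidean distance is an assumption (the text only refers to "distance"), and `nearest_candidate` is a hypothetical helper name.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_candidate(cell_vector, dataset):
    """Return (text, distance) for the candidate vector closest to cell_vector."""
    if not dataset:
        raise ValueError("dataset must contain at least one candidate")
    text, vector = min(dataset.items(),
                       key=lambda item: euclidean(cell_vector, item[1]))
    return text, euclidean(cell_vector, vector)

# Hypothetical pre-built data set: candidate text -> vector (values invented).
dataset = {
    "(1)組合負担額": [0.9, 0.1, 0.2],
    "(2)患者負担額": [0.1, 0.8, 0.3],
    "給与": [0.2, 0.2, 0.9],
}

# Correction target cell with its misrecognized text and an invented vector.
cell = {"text": "(1)粗合負担額", "vector": [0.8, 0.2, 0.2]}

corrected_text, distance = nearest_candidate(cell["vector"], dataset)
cell["text"] = corrected_text  # replace the misrecognized text
```

In practice a distance threshold could be added so that a cell is left unchanged when no candidate is sufficiently close; the source does not specify such a threshold.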

一方、前述したように、補正対象テキスト８０６が抽出されると、制御部１４０は、抽出された補正対象テキスト８０６を、イメージ１０００に含まれるテーブル１１００をデータ化するのに活用することができる。制御部１４０は、イメージ１０００に含まれるテーブル１１００のデータ化の結果として得られたデータ上に、補正対象テキスト８０６を含めることができる。 On the other hand, as described above, when the correction target text 806 is extracted, the control unit 140 can utilize the extracted correction target text 806 to convert the table 1100 included in the image 1000 into data. The control unit 140 can include the correction target text 806 on the data obtained as a result of converting the table 1100 included in the image 1000 into data.

また、制御部１４０は、テーブル１１００をデータ化するだけでなく、データ化の結果として得られたデータを用いて、イメージ１０００に含まれるテーブル１１００に対応する構造を有するテーブルを生成することができる。 Further, the control unit 140 can not only convert the table 1100 into data, but also use the data obtained as a result of data conversion to generate a table having a structure corresponding to the table 1100 included in the image 1000. .

For example, as shown in (a) of FIG. 8, when the text 930 ("撰沢診療科以外"), which corresponds to at least part of the data extracted from the table 910 included in the scanned image, contains an error, the control unit 140 can correct it by the method described above and thereby obtain data containing the corrected text 940 ("選択診療料以外", "other than selected medical fees"), as shown in (b) of FIG. 8. If necessary, a table 920 containing the corrected text 940 can also be generated; such a table 920 has the same content as the scanned table 910.

More specifically, when text in a table included in an image is misrecognized, the character recognition method and character recognition system according to the present invention correct it, so that the content of the table can be converted into data more accurately. To this end, the present invention corrects the text in the correction target cell by taking into account the arrangement of the cells constituting the table and the semantic relationships between the texts in those cells, thereby providing a highly accurate character recognition method and system that can convert the content of the table into data as it is. As a result, the character recognition method and system according to the present invention can generate a table with the same content as the table included in the image.

また、本発明による文字認識方法及び文字認識システムは、テキスト認識率が低い特定のセルに対してのみ補正を行うことにより、イメージに含まれるテーブルの内容を正確にデータ化しながらも、データ処理量を最小限に抑えることができる。 In addition, the character recognition method and character recognition system according to the present invention corrects only specific cells with a low text recognition rate, thereby reducing the amount of data processing while accurately converting the contents of the table included in the image into data. can be minimized.
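Restricting correction to cells with low recognition reliability can be sketched as a simple filter. The `Cell` record, its `confidence` field, and the threshold value 0.8 are assumptions for illustration; the source states only that cells with a low text recognition rate are selected according to a preset criterion.

```python
from dataclasses import dataclass

@dataclass
class Cell:
    text: str
    confidence: float  # hypothetical OCR recognition confidence in [0, 1]

def select_correction_targets(cells, threshold=0.8):
    """Return the indices of cells whose confidence falls below the threshold."""
    return [i for i, cell in enumerate(cells) if cell.confidence < threshold]

cells = [
    Cell("給与", 0.98),
    Cell("(1)粗合負担額", 0.41),  # misrecognized text with low confidence
    Cell("(2)患者負担額", 0.93),
]
targets = select_correction_targets(cells)
```

Only the cells returned here would go through the embedding and lookup steps, which is how the amount of processing is kept small.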

一方、前述した本発明は、コンピュータで１つ以上のプロセスにより実行され、コンピュータ可読媒体（又は記録媒体）に格納可能なプログラムとして実現することができる。 On the other hand, the present invention described above can be implemented as a program that is executed by one or more processes on a computer and can be stored on a computer-readable medium (or recording medium).

また、前述した本発明は、プログラム記録媒体にコンピュータ可読コード又はコマンドとして実現することができる。すなわち、本発明は、プログラムの形態で提供することができる。 Further, the present invention described above can be implemented as computer readable codes or commands on a program recording medium. That is, the present invention can be provided in the form of a program.

The computer-readable medium includes any type of recording device on which data readable by a computer system is recorded. Examples of computer-readable media include an HDD (Hard Disk Drive), SSD (Solid State Disk), SDD (Silicon Disk Drive), ROM, RAM, CD-ROM, magnetic tape, floppy disk, and optical data storage device.

また、コンピュータ可読媒体は、ストレージを含み、電子機器が通信によりアクセスできるサーバ又はクラウドストレージであり得る。この場合、コンピュータは、有線又は無線通信により、サーバ又はクラウドストレージから本発明によるプログラムをダウンロードすることができる。 The computer-readable medium also includes storage and can be a server or cloud storage that the electronic device can access via communication. In this case, the computer can download the program according to the invention from the server or cloud storage by wired or wireless communication.

さらに、本発明において、前述したコンピュータは、プロセッサ、すなわち中央処理装置（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ，ＣＰＵ）が搭載された電子機器であり、その種類は特に限定されない。 Furthermore, in the present invention, the computer described above is an electronic device equipped with a processor, that is, a central processing unit (Central Processing Unit, CPU), and its type is not particularly limited.

一方、本発明の詳細な説明は例示的なものであり、あらゆる面で制限的に解釈されてはならない。本発明の範囲は添付の特許請求の範囲の合理的解釈により定められるべきであり、本発明の等価的範囲内でのあらゆる変更が本発明の範囲に含まれる。 On the other hand, the detailed description of the present invention is illustrative and should not be construed as restrictive in any respect. The scope of the present invention should be determined by reasonable interpretation of the appended claims, and all modifications within the range of equivalency of the present invention are included within the scope of the present invention.

100 Character recognition system
110 Receiving unit
120 Storage unit
130 OCR unit
140 Control unit
150 External server
151 Cloud server
152 Database (DB)
200 Table
201, 202 Horizontal line
203, 204 Vertical line
210 Cell
1000 Image
1100 Table

Claims

1. A character recognition method performed by a character recognition system including a receiving unit and a control unit, the method comprising:
receiving, by the receiving unit, an image including a table;
recognizing, by the control unit, text included in a plurality of cells constituting the table;
identifying, by the control unit, a correction target cell from the plurality of cells based on a preset criterion;
identifying, by the control unit, at least one related cell related to the correction target cell from the plurality of cells based on a relationship between the meanings of the texts included in the respective cells; and
correcting, by the control unit, the text included in the correction target cell using a vector of the correction target cell calculated using the texts included in the correction target cell and the related cell.

2. The character recognition method according to claim 1, wherein, in the step of identifying the related cell, the related cell is identified based on the meaning of the text included in the correction target cell.

3. The character recognition method according to claim 2, wherein at least some of the plurality of cells are identified as category cells and the others are identified as data cells containing data corresponding to at least one of the category cells, and
wherein, in the step of identifying the related cell, a specific category cell related to the meaning of the text included in the correction target cell is identified, among the category cells, as the related cell.

4. The character recognition method according to claim 3, wherein the specific category cell related to the meaning of the text included in the correction target cell includes text whose meaning is a higher-level concept or a lower-level concept of the text included in the correction target cell.

5. The character recognition method according to claim 4, wherein the specific category cell related to the meaning of the text included in the correction target cell is arranged side by side with the correction target cell in a column direction or a row direction, with the correction target cell as a reference.

6. The character recognition method according to claim 3, wherein the specific category cell is, among the category cells, a cell located adjacent to the correction target cell in the row direction or the column direction of the table, with the correction target cell as a reference.

7. The character recognition method according to claim 1, wherein, in the step of performing the correction, the text included in the correction target cell is corrected based on a distance between the vector of the correction target cell and a vector included in a pre-specified data set.

8. The character recognition method according to claim 7, wherein the step of performing the correction includes:
extracting a correction target text from the data set based on the distance between the vector of the correction target cell and the vector included in the pre-specified data set; and
changing the text included in the correction target cell to the correction target text.

9. The character recognition method according to claim 8, wherein the data set includes:
candidate texts that can be included in the plurality of cells constituting the table; and
vector information for each of the candidate texts, corresponding to a result of embedding performed based on at least one of a positional relationship between the plurality of cells and a semantic relationship between the candidate texts.

10. The character recognition method according to claim 9, wherein, in the step of performing the correction, the vector closest to the vector of the correction target cell is identified from the data set, and
the text corresponding to the closest vector is extracted as the correction target text.

11. The character recognition method according to claim 10, further comprising generating a table that has a structure corresponding to the table included in the image and in which the correction target cell contains the correction target text.

12. The character recognition method according to claim 1, wherein the preset criterion for identifying the correction target cell relates to the reliability of recognition of the text included in the plurality of cells.

13. A character recognition system comprising:
a storage unit;
a receiving unit that receives an image including a table; and
a control unit that recognizes text included in a plurality of cells constituting the table included in the image,
wherein the control unit:
identifies a correction target cell from the plurality of cells based on a preset criterion;
identifies at least one related cell related to the correction target cell from the plurality of cells based on a relationship between the meanings of the texts included in the respective cells; and
corrects the text included in the correction target cell using a vector of the correction target cell calculated using the texts included in the correction target cell and the related cell.

14. A computer program including a plurality of instructions, wherein the instructions, when executed, cause the following steps to be performed:
receiving an image including a table;
recognizing text included in a plurality of cells constituting the table;
identifying a correction target cell from the plurality of cells based on a preset criterion;
identifying at least one related cell related to the correction target cell from the plurality of cells based on a relationship between the meanings of the texts included in the respective cells; and
correcting the text included in the correction target cell using a vector of the correction target cell calculated using the texts included in the correction target cell and the related cell.