JP2020123273A

JP2020123273A - Information processing device and information processing program

Info

Publication number: JP2020123273A
Application number: JP2019016252A
Authority: JP
Inventors: クリシュナンラガワ; Krishnan Raghava
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2019-01-31
Filing date: 2019-01-31
Publication date: 2020-08-13
Anticipated expiration: 2039-01-31
Also published as: JP7279382B2

Abstract

To allow for accurately identifying character strings contained in an image without referring to a predefined reference list.SOLUTION: An information processing device 10 is provided, comprising: an acquisition unit 22 configured to acquire character strings recognized from an image by OCR processing; an extraction unit 24 configured to extract character string candidates from the character strings acquired by the acquisition unit 22 using an extraction pattern expressed in regular expression; a correction unit 26 configured to correct the character string candidates using false recognition patterns in which the OCR processing tends to make false recognition; and an output unit 28 configured to output character string candidates identified from among the character string candidates corrected by the correction unit 26 using the regular expression.SELECTED DRAWING: Figure 2

Description

本発明は、情報処理装置及び情報処理プログラムに関する。 The present invention relates to an information processing device and an information processing program.

特許文献１には、情報を複数保持する情報保持手段中から所定の情報を検索する情報検索装置において、文書画像入力手段と、文字認識手段と、文字認識の確信度から正規化レベルを決定する手段と、前記正規化レベルに基づき代表文字に置き換える正規化手段と、前記情報保持手段中から類似したテキストを含む情報を検索する検索手段と、前記検索結果を出力する検索結果出力手段とを備えたことを特徴とする情報検索装置が開示されている。 In Patent Document 1, in an information retrieval device that retrieves predetermined information from information retention means that retains a plurality of information, a normalization level is determined from a document image input means, a character recognition means, and a certainty factor of character recognition. Means, a normalization means for replacing with a representative character based on the normalization level, a search means for searching the information holding means for information including similar texts, and a search result output means for outputting the search result. An information search device characterized by the above is disclosed.

特許文献２には、文字列から所与の辞書に含まれる特定用語を検索する情報処理装置であって、ｎ−ｇｒａｍ方式を利用して前記所与の辞書に対して前記文字列の部分文字列の検索を行い、前記所与の辞書から該文字列に含まれる１以上の特定用語候補を抽出する抽出手段と、前記１以上の特定用語候補の各々に対して、ＤＰマッチング方式を利用して特定用語候補と該特定用語候補に対応する前記文字列内の部分文字列との編集距離を導出する導出手段と、前記編集距離が所定の閾値より短い前記１以上の特定用語候補に対応する前記文字列内の部分文字列を出力する出力手段と、を有することを特徴とする情報処理装置が開示されている。 Patent Document 2 is an information processing apparatus that searches a character string for a specific term included in a given dictionary, and uses an n-gram method to perform partial characters of the character string for the given dictionary. Extraction means for performing a column search and extracting one or more specific term candidates included in the character string from the given dictionary, and a DP matching method for each of the one or more specific term candidates are used. Deriving means for deriving an editing distance between a specific term candidate and a partial character string in the character string corresponding to the specific term candidate, and the one or more specific term candidates in which the editing distance is shorter than a predetermined threshold value. And an output unit that outputs a partial character string in the character string.

特開２００７−３２３４１５号公報JP, 2007-323415, A 特開２０１８−８１４５１号公報JP, 2008-81451, A

本発明は、予め定めた参照リストを用いなくても、精度良く画像に含まれる文字列を特定することができる情報処理装置及び情報処理プログラムを提供することを目的とする。 It is an object of the present invention to provide an information processing device and an information processing program capable of accurately specifying a character string included in an image without using a predetermined reference list.

第１態様に係る情報処理装置は、文字認識処理によって画像から認識された文字列を取得する取得部と、前記取得部で取得された文字列から、正規表現で表された抽出パターンを用いて文字列候補を抽出する抽出部と、前記文字認識処理が誤認識する誤認識パターンを用いて前記文字列候補を修正する修正部と、前記正規表現を用いて、前記修正部で修正された文字列候補の中から特定した文字列候補を出力する出力部と、を備える。 An information processing apparatus according to a first aspect uses an extraction unit that acquires a character string recognized from an image by a character recognition process and an extraction pattern represented by a regular expression from the character string acquired by the acquisition unit. An extraction unit that extracts a character string candidate, a correction unit that corrects the character string candidate using an erroneous recognition pattern that is erroneously recognized by the character recognition process, and a character that is corrected by the correction unit using the regular expression. An output unit that outputs the character string candidates specified from the column candidates.

第２態様に係る情報処理装置は、第１態様に係る情報処理装置において、前記抽出部は、前記誤認識パターンを用いて前記正規表現を修正した修正正規表現で表された修正抽出パターンを用いて前記文字列候補を抽出する。 An information processing apparatus according to a second aspect is the information processing apparatus according to the first aspect, wherein the extraction unit uses a modified extraction pattern represented by a modified regular expression obtained by modifying the regular expression using the misrecognition pattern. To extract the character string candidates.

第３態様に係る情報処理装置は、第１態様又は第２態様に係る情報処理装置において、前記取得部は前記画像を取得し、前記修正部は、特定の文字を対象として前記画像の文字認識処理を行う文字認識処理部と、前記文字認識処理部により認識された文字列と同型の文字列候補である誤認識文字列候補を生成する生成部と、を含み、前記出力部は、前記誤認識文字列候補の中から特定した文字列候補を出力する。 An information processing apparatus according to a third aspect is the information processing apparatus according to the first or second aspect, wherein the acquisition unit acquires the image, and the correction unit performs character recognition of the image targeting a specific character. The output unit includes a character recognition processing unit that performs processing, and a generation unit that generates a misrecognized character string candidate that is a character string candidate of the same type as the character string recognized by the character recognition processing unit. The specified character string candidate is output from the recognized character string candidates.

第４態様に係る情報処理装置は、第３態様に係る情報処理装置において、前記生成部は、ユーザーによって指定された誤認識パターンを用いて生成した文字列を前記誤認識文字列候補に追加する。 An information processing apparatus according to a fourth aspect is the information processing apparatus according to the third aspect, wherein the generation unit adds a character string generated using an erroneous recognition pattern designated by a user to the erroneously recognized character string candidates. ..

第５態様に係る情報処理装置は、第３態様又は第４態様に係る情報処理装置において、前記生成部は、前記抽出パターンで表される文字列候補を除いて前記誤認識文字列候補を生成する。 An information processing apparatus according to a fifth aspect is the information processing apparatus according to the third aspect or the fourth aspect, wherein the generation unit generates the misrecognized character string candidates except for the character string candidates represented by the extraction pattern. To do.

第６態様に係る情報処理装置は、第３〜第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部は、文字認識の精度を表すスコアを算出し、前記出力部は、前記スコアが予め定めた閾値以上の場合に、前記特定した文字列候補を出力する。 An information processing apparatus according to a sixth aspect is the information processing apparatus according to any one of the third to fifth aspects, wherein the character recognition processing unit calculates a score indicating accuracy of character recognition, and the output unit is If the score is equal to or greater than a predetermined threshold value, the specified character string candidate is output.

第７態様に係る情報処理装置は、第３〜第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部は、文字認識の精度を表すスコアを算出し、前記出力部は、前記スコアが予め定めた閾値未満の場合に、前記特定した文字列候補を推定文字列候補として出力する。 An information processing apparatus according to a seventh aspect is the information processing apparatus according to any one of the third to fifth aspects, wherein the character recognition processing unit calculates a score indicating accuracy of character recognition, and the output unit is If the score is less than a predetermined threshold value, the specified character string candidate is output as an estimated character string candidate.

第８態様に係る情報処理装置は、第３〜第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部は、文字認識の精度を表すスコアを算出し、前記出力部は、前記スコアが予め定めた閾値以上で且つ前記特定した文字列候補が複数の場合に、複数の前記特定した文字列候補を推定文字列候補として出力する。 An information processing apparatus according to an eighth aspect is the information processing apparatus according to any one of the third to fifth aspects, wherein the character recognition processing unit calculates a score indicating accuracy of character recognition, and the output unit is When the score is equal to or higher than a predetermined threshold and the specified character string candidates are plural, the plural specified character string candidates are output as estimated character string candidates.

第９態様に係る情報処理装置は、第３〜第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部により文字認識された文字列から必要な文字列である必要文字列を抽出する必要文字列抽出部を更に備え、前記生成部は、前記必要文字列抽出部により抽出された前記必要文字列と同型の文字列候補である誤認識文字列候補を生成する。 An information processing apparatus according to a ninth aspect is the information processing apparatus according to any one of the third to fifth aspects, in which a necessary character string that is a necessary character string from a character string recognized by the character recognition processing unit. The generating unit further includes a necessary character string extracting unit that extracts a character string candidate that is a character string candidate of the same type as the necessary character string extracted by the necessary character string extracting unit.

第１０態様に係る情報処理装置は、第１〜第５態様の何れかの態様に係る情報処理装置において、前記出力部は、前記正規表現で表される文字列に前記修正部で修正された文字列候補が存在しない場合、前記正規表現で表される文字列のうち、前記修正部で修正された文字列候補に近い文字列を推定文字列候補として出力する。 An information processing apparatus according to a tenth aspect is the information processing apparatus according to any one of the first to fifth aspects, wherein the output unit is corrected by the correction unit to a character string represented by the regular expression. When there is no character string candidate, a character string that is close to the character string candidate corrected by the correction unit among the character strings represented by the regular expression is output as an estimated character string candidate.

第１１態様に係る情報処理装置は、第１０態様に係る情報処理装置において、前記出力部は、前記正規表現で表される文字列のうち、前記修正部で修正された文字列候補と異なる文字の数が最も少ない文字列を推定文字列候補として出力する。 An information processing apparatus according to an eleventh aspect is the information processing apparatus according to the tenth aspect, wherein the output unit is a character string that is different from the character string candidate corrected by the correction unit in the character string represented by the regular expression. The character string with the smallest number of is output as an estimated character string candidate.

第１２態様に係る情報処理プログラムは、コンピュータを、第１〜第１１態様の何れか１つの態様に記載の情報処理装置の各部として機能させるための情報処理プログラムである。 The information processing program according to the twelfth aspect is an information processing program for causing a computer to function as each unit of the information processing apparatus according to any one of the first to eleventh aspects.

第１及び第１２態様によれば、予め定めた参照リストを用いなくても、精度良く画像に含まれる文字列を特定することができる、という効果を有する。 According to the first and twelfth aspects, there is an effect that a character string included in an image can be accurately specified without using a predetermined reference list.

第２態様によれば、誤認識パターンを用いて正規表現を修正した修正正規表現で表された修正抽出パターンを用いずに文字列候補を抽出する場合と比較して、本来抽出すべき文字列の一部の文字が誤認識された文字列も抽出することができる、という効果を有する。 According to the second aspect, as compared with the case of extracting the character string candidates without using the modified extraction pattern represented by the modified regular expression in which the regular expression is modified by using the misrecognition pattern, the character string that should be originally extracted It is possible to extract a character string in which some of the characters are erroneously recognized.

第３態様によれば、誤認識文字列候補を生成しない場合と比較して、精度良く画像に含まれる文字列を特定することができる、という効果を有する。 According to the third aspect, there is an effect that the character string included in the image can be specified more accurately than in the case where the erroneously recognized character string candidate is not generated.

第４態様によれば、ユーザーによって指定された誤認識パターンを用いずに誤認識文字列候補を生成する場合と比較して、精度良く画像に含まれる文字列を特定することができる、という効果を有する。 According to the fourth aspect, it is possible to accurately specify the character string included in the image, as compared with the case where the misrecognized character string candidate is generated without using the misrecognized pattern specified by the user. Have.

第５態様によれば、抽出パターンで表される文字列候補を除かずに誤認識文字列候補を生成する場合と比較して、画像に含まれる文字列を特定するまでの時間を短縮することができる、という効果を有する。 According to the fifth aspect, it is possible to shorten the time until the character string included in the image is specified, as compared with the case where the misrecognized character string candidate is generated without removing the character string candidate represented by the extraction pattern. It has the effect that

第６態様によれば、文字認識の精度を表すスコアに関係なく特定した文字列候補を出力する場合と比較して、特定した文字列候補が出力されたことについての信頼性が高まる、という効果を有する。 According to the sixth aspect, the reliability that the specified character string candidate is output is increased as compared with the case where the specified character string candidate is output regardless of the score indicating the accuracy of character recognition. Have.

第７態様によれば、文字認識の精度を表すスコアに関係なく特定した文字列候補を推定文字列候補として出力する場合と比較して、推定文字列候補として出力されたことについての信頼性が高まる、という効果を有する。 According to the seventh aspect, the reliability of being output as the estimated character string candidate is higher than that in the case of outputting the identified character string candidate as the estimated character string candidate regardless of the score indicating the accuracy of character recognition. It has the effect of increasing.

第８態様によれば、文字認識の精度を表すスコアに関係なく複数の特定した文字列候補を推定文字列候補として出力する場合と比較して、複数の推定文字列候補として出力されたことについての信頼性が高まる、という効果を有する。 According to the eighth aspect, as compared with the case where a plurality of specified character string candidates are output as estimated character string candidates regardless of the score indicating the accuracy of character recognition, the output as a plurality of estimated character string candidates is performed. Has the effect of increasing the reliability of.

第９態様によれば、文字認識された文字列から必要文字列を抽出せずに誤認識文字列候補を生成する場合と比較して、適切に誤認識文字列候補を生成することができる、という効果を有する。 According to the ninth aspect, it is possible to appropriately generate the erroneously recognized character string candidate, as compared with the case where the erroneously recognized character string candidate is generated without extracting the necessary character string from the character string subjected to character recognition. Has the effect.

第１０態様によれば、正規表現で表される文字列に文字列候補が存在しない場合に何も出力しない場合と比較して、画像に含まれる可能性のある文字列を把握することができる、という効果を有する。 According to the tenth aspect, when there is no character string candidate in the character string represented by the regular expression, as compared with the case where nothing is output, the character string that may be included in the image can be grasped. Has the effect of.

第１１態様によれば、正規表現で表される文字列と文字列候補との異なる文字の数に関係なく推定文字列候補を特定する場合と比較して、推定文字列候補の信頼性が高まる、という効果を有する。 According to the eleventh aspect, the reliability of the estimated character string candidate is increased as compared with the case where the estimated character string candidate is specified regardless of the number of different characters in the character string represented by the regular expression and the character string candidate. Has the effect of.

画像形成装置の構成図である。It is a block diagram of an image forming apparatus. 第１実施形態に係るＣＰＵの機能ブロック図である。FIG. 3 is a functional block diagram of a CPU according to the first embodiment. 第１実施形態に係る情報処理のフローチャートである。3 is a flowchart of information processing according to the first embodiment. 図面を読み取った画像の一例を示す図である。It is a figure which shows an example of the image which read the drawing. 再認識処理の対象となる領域について説明するための図である。It is a figure for demonstrating the area|region used as the object of a re-recognition process. 誤認識文字情報の一例を示す図である。It is a figure which shows an example of misrecognized character information. 誤認識パターン情報の一例を示す図である。It is a figure which shows an example of misrecognition pattern information. 第２実施形態に係るＣＰＵの機能ブロック図である。It is a functional block diagram of CPU which concerns on 2nd Embodiment. 第２実施形態に係る情報処理のフローチャートである。It is a flow chart of information processing concerning a 2nd embodiment.

以下、図面を参照して、本発明を実施するための形態例を詳細に説明する。 Embodiments for carrying out the present invention will be described in detail below with reference to the drawings.

（第１実施形態） (First embodiment)

図１は、本実施形態に係る画像形成装置１の構成図である。図１に示すように、画像形成装置１は、スキャナ部２、画像形成部３、操作部４、表示部５、通信部６、記憶部７、及び情報処理装置１０を備える。なお、図１では、用紙等の記録媒体を供給する供給機構、記録媒体を搬送する搬送機構、記録媒体を排出する排出機構等の図示を省略している。 FIG. 1 is a configuration diagram of an image forming apparatus 1 according to the present embodiment. As shown in FIG. 1, the image forming apparatus 1 includes a scanner unit 2, an image forming unit 3, an operation unit 4, a display unit 5, a communication unit 6, a storage unit 7, and an information processing device 10. Note that, in FIG. 1, a supply mechanism that supplies a recording medium such as paper, a transport mechanism that transports the recording medium, and a discharge mechanism that discharges the recording medium are not shown.

画像形成装置１は、原稿を読み取るスキャン機能、原稿を複写するコピー機能、印刷データを印刷する印刷機能、原稿をファクシミリ送信するファクシミリ送信機能等の複数の機能を有する複合機として機能する。 The image forming apparatus 1 functions as a multifunction peripheral having a plurality of functions such as a scan function of reading a document, a copy function of copying a document, a print function of printing print data, and a facsimile transmission function of facsimile transmission of a document.

スキャナ部２は、画像が形成された紙等の記録媒体を光学的に読み取って読み取り画像を生成し、情報処理装置１０に出力する。 The scanner unit 2 optically reads a recording medium such as paper on which an image is formed, generates a read image, and outputs the read image to the information processing apparatus 10.

画像形成部３は、例えば電子写真方式又はインクジェット記録方式等により紙等の記録媒体に画像を形成する機能を有し、情報処理装置１０からの指示により画像を記録媒体に形成する。 The image forming unit 3 has a function of forming an image on a recording medium such as paper by, for example, an electrophotographic system or an inkjet recording system, and forms an image on the recording medium according to an instruction from the information processing device 10.

操作部４は、各種操作を受け付ける操作キーを含んで構成される。 The operation unit 4 is configured to include operation keys that receive various operations.

表示部５は、液晶ディスプレイ等で構成され、例えば画面に指等で触れることで各種操作を受け付けるタッチパネルで構成される。 The display unit 5 is composed of a liquid crystal display or the like, and is composed of, for example, a touch panel that receives various operations by touching the screen with a finger or the like.

通信部６は、外部装置等とデータ通信を行うためのインターフェースである。 The communication unit 6 is an interface for performing data communication with an external device or the like.

記憶部７は、ハードディスク等の不揮発性の記憶装置で構成され、後述する情報処理プログラム等を記憶する。 The storage unit 7 is composed of a non-volatile storage device such as a hard disk, and stores an information processing program described later.

情報処理装置１０は、ＣＰＵ（Central Processing Unit）１０Ａ、ＲＯＭ（Read Only Memory）１０Ｂ、ＲＡＭ（Random Access Memory）１０Ｃ、不揮発性メモリ１０Ｄ、及び入出力インターフェース（Ｉ／Ｏ）１０Ｅを備える。そして、ＣＰＵ１０Ａ、ＲＯＭ１０Ｂ、ＲＡＭ１０Ｃ、不揮発性メモリ１０Ｄ、及びＩ／Ｏ１０Ｅがバス１０Ｆを介して各々接続されている。Ｉ／Ｏ１０Ｅには、スキャナ部２、画像形成部３、操作部４、表示部５、通信部６、記憶部７が接続されている。ＣＰＵ１０Ａは、記憶部７に記憶された情報処理プログラムを読み込んで実行する。 The information processing device 10 includes a CPU (Central Processing Unit) 10A, a ROM (Read Only Memory) 10B, a RAM (Random Access Memory) 10C, a nonvolatile memory 10D, and an input/output interface (I/O) 10E. The CPU 10A, the ROM 10B, the RAM 10C, the non-volatile memory 10D, and the I/O 10E are connected to each other via the bus 10F. The scanner unit 2, the image forming unit 3, the operation unit 4, the display unit 5, the communication unit 6, and the storage unit 7 are connected to the I/O 10E. The CPU 10A reads and executes the information processing program stored in the storage unit 7.

画像形成装置１は、前述したコピー機能等の複合機の基本機能の他に、図面情報抽出機能を有する。図面情報抽出機能は、製造業及び設備・建設業等で使用する図面を読み取り、読み取った図面の画像から、図面に記載されている指定の文字列を図面情報として抽出する機能である。図面に記載されている指定の文字列には、例えば品番、品名、及び版等を含む基本属性、図面に表された製品のスペックを表すスペック情報、材料情報、設計標準書番号等の各種情報を表す文字列が含まれる。図面から抽出した図面情報は、紙等の記録媒体に印刷したり、データファイルとして記憶部７に記憶したり、通信部６を介して外部装置に送信したり、表示部５に表示したりする。 The image forming apparatus 1 has a drawing information extraction function in addition to the basic functions of the multi-function peripheral such as the copy function described above. The drawing information extraction function is a function of reading a drawing used in the manufacturing industry, equipment/construction industry, and the like, and extracting a specified character string described in the drawing as drawing information from an image of the read drawing. In the specified character string shown in the drawing, for example, various information such as basic attributes including product number, product name, and version, specification information showing the specifications of the product shown in the drawing, material information, design standard document number, etc. Contains a string that represents. The drawing information extracted from the drawing is printed on a recording medium such as paper, stored in the storage unit 7 as a data file, transmitted to an external device via the communication unit 6, or displayed on the display unit 5. ..

図面情報抽出機能が利用されるケースとしては以下のようなケースが挙げられる。例えば図面に記載された文字列が或る技術標準に基づいて記載されている場合において、技術標準が更新されて図面に記載された文字列を修正する必要がある場合に、旧技術標準で記載された文字列を含む図面を探す場合等である。 Cases in which the drawing information extraction function is used include the following cases. For example, if a character string shown in the drawing is written based on a certain technical standard and the technical standard needs to be updated and the character string shown in the drawing needs to be corrected, the old technical standard should be used. This is the case, for example, when searching for a drawing that includes the specified character string.

以下、画像形成装置１が図面情報抽出機能を実行する場合におけるＣＰＵ１０Ａの機能構成について説明する。 The functional configuration of the CPU 10A when the image forming apparatus 1 executes the drawing information extraction function will be described below.

図２に示すように、ＣＰＵ１０Ａは、機能的には、ＯＣＲ処理部２０、取得部２２、抽出部２４、修正部２６、及び出力部２８を備える。 As shown in FIG. 2, the CPU 10A functionally includes an OCR processing unit 20, an acquisition unit 22, an extraction unit 24, a correction unit 26, and an output unit 28.

ＯＣＲ処理部２０は、スキャナ部２が原稿から読み取った画像に対して文字認識処理、すなわちＯＣＲ（Optical Character Recognition）処理を実行し、読み取った画像に含まれる文字列を取得部２２に出力する。なお、ＯＣＲ処理部２０が実行するＯＣＲ処理は、一例としてルールベースのＯＣＲ処理である。すなわち、予め定めた規則に基づいて、読み取った画像に含まれる文字列を全て抽出する。 The OCR processing unit 20 performs character recognition processing, that is, OCR (Optical Character Recognition) processing on the image read by the scanner unit 2 from the document, and outputs the character string included in the read image to the acquisition unit 22. The OCR processing executed by the OCR processing unit 20 is, for example, rule-based OCR processing. That is, all character strings included in the read image are extracted based on a predetermined rule.

取得部２２は、ＯＣＲ処理部２０のＯＣＲ処理によって画像から認識された文字列を取得する。取得部２２は、取得した文字列を修正部２６へ出力する。 The acquisition unit 22 acquires the character string recognized from the image by the OCR processing of the OCR processing unit 20. The acquisition unit 22 outputs the acquired character string to the correction unit 26.

抽出部２４は、取得部２２で取得された文字列から、正規表現で表された抽出パターンを用いて文字列候補を抽出する。なお、ＯＣＲ処理部２０によるＯＣＲ処理によって認識された文字列には、誤認識された文字が含まれる場合が多い。このため、抽出部２４は、ＯＣＲ処理部２０のＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現で表された抽出パターンを用いて文字列候補を抽出する。 The extraction unit 24 extracts character string candidates from the character string acquired by the acquisition unit 22, using an extraction pattern represented by a regular expression. It should be noted that the character string recognized by the OCR processing by the OCR processing unit 20 often includes a character that is erroneously recognized. Therefore, the extraction unit 24 extracts the character string candidates using the extraction pattern represented by the modified regular expression that is modified by using the misrecognized pattern that is misrecognized by the OCR processing of the OCR processing unit 20.

修正部２６は、ＯＣＲ処理が誤認識する誤認識パターンを用いて、抽出部２４が抽出した文字列候補を修正する。 The correction unit 26 corrects the character string candidates extracted by the extraction unit 24 using the erroneous recognition pattern erroneously recognized by the OCR process.

具体的には、修正部２６は、再認識処理部３０及び生成部３２を備える。 Specifically, the correction unit 26 includes a re-recognition processing unit 30 and a generation unit 32.

再認識処理部３０は、特定の文字を対象として画像のＯＣＲ処理を再度行う。再認識処理部３０は、例えばニューラルネットワーク等の機械学習を用いて画像のＯＣＲ処理を実行する。なお、再認識処理部３０は、文字認識処理部の一例である。 The re-recognition processing unit 30 performs the OCR processing of the image again for the specific character. The re-recognition processing unit 30 executes the OCR processing of the image using machine learning such as a neural network. The re-recognition processing unit 30 is an example of a character recognition processing unit.

生成部３２は、再認識処理部３０により認識された文字列候補と同型の文字列候補である誤認識文字列候補を生成する。例えば、誤認識パターンを用いて正規表現を修正した修正正規表現で表された修正抽出パターンを用いて誤認識文字列候補を生成する。 The generation unit 32 generates a misrecognized character string candidate that is a character string candidate of the same type as the character string candidate recognized by the re-recognition processing unit 30. For example, the misrecognized character string candidate is generated using the modified extraction pattern represented by the modified regular expression in which the regular expression is modified using the misrecognized pattern.

ここで、誤認識文字列候補とは、再認識処理部３０により認識された文字列候補であって、認識された文字と同型であり認識が難しい文字列候補、又は、再認識処理部３０により認識された文字列であって、認識された文字が誤っている可能性の高い文字列候補である。なお、以下では、再認識処理部３０により認識された文字列も誤認識文字列候補に含まれるものとする。 Here, the erroneously recognized character string candidate is a character string candidate recognized by the re-recognition processing unit 30 and is a character string candidate having the same type as the recognized character and difficult to recognize, or the re-recognition processing unit 30. The recognized character string is a character string candidate in which the recognized character is highly likely to be incorrect. In the following, it is assumed that the character string recognized by the re-recognition processing unit 30 is also included in the erroneously recognized character string candidates.

出力部２８は、正規表現を用いて、修正部２６で修正された文字列候補、すなわち生成部３２で生成された誤認識文字列候補の中から特定した文字列候補を出力する。 The output unit 28 outputs the character string candidates corrected by the correction unit 26, that is, the character string candidates identified from the misrecognized character string candidates generated by the generation unit 32, using the regular expression.

次に、図３を参照して、本実施の形態に係る情報処理装置１０の作用を説明する。図２に示すように、情報処理プログラム４０は記憶部７に記憶されている。ＣＰＵ１０Ａが情報処理プログラム４０を読み出して実行することにより、図３に示す情報処理が実行される。なお、図３に示す情報処理は、例えば、ユーザーがスキャナ部２に図面をセットし、操作部４を操作して図面情報抽出処理の実行を指示した場合に実行される。 Next, the operation of the information processing device 10 according to the present embodiment will be described with reference to FIG. As shown in FIG. 2, the information processing program 40 is stored in the storage unit 7. The information processing shown in FIG. 3 is executed by the CPU 10A reading and executing the information processing program 40. The information processing illustrated in FIG. 3 is executed, for example, when the user sets a drawing on the scanner unit 2 and operates the operation unit 4 to instruct execution of the drawing information extraction process.

ステップＳ１００では、ＣＰＵ１０Ａが、スキャナ部２に図面の読み取りを指示し、スキャナ部２が読み取った図面の画像を取得する。 In step S100, the CPU 10A instructs the scanner unit 2 to read the drawing, and acquires the image of the drawing read by the scanner unit 2.

ステップＳ１０２では、ＣＰＵ１０Ａが、ステップＳ１００で取得した画像に対してルールベースのＯＣＲ処理を実行する。これにより、読み取った画像に含まれる全ての文字列が取得される。また、読み取った画像に含まれる全ての文字列の位置、すなわち画像の中の座標も取得される。ＯＣＲ処理により認識された文字列は、例えばＸＭＬ形式の文字列として得られるが、文字列の形式はＸＭＬ形式に限られるものではない。 In step S102, the CPU 10A executes rule-based OCR processing on the image acquired in step S100. As a result, all the character strings included in the read image are acquired. Further, the positions of all the character strings included in the read image, that is, the coordinates in the image are also acquired. The character string recognized by the OCR process is obtained as a character string in the XML format, for example, but the format of the character string is not limited to the XML format.

ステップＳ１０４では、ＣＰＵ１０Ａが、ステップＳ１０２のＯＣＲ処理によって得られた文字列から、ステップＳ１００で取得した画像に含まれる文字列候補を抽出する。具体的には、正規表現で表された抽出パターンを用いて画像に含まれる文字列候補を抽出する。本実施形態では、例えば図２に示すように、正規表現が定義された正規表現情報４２が予め記憶部７に記憶されており、この正規表現情報４２で定義された正規表現で表される抽出パターンを用いて文字列候補を抽出する。なお、正規表現情報４２を予め記憶部７に記憶しておく場合に限らず、通信部６を介して外部装置から正規表現情報４２を取得してもよい。 In step S104, CPU 10A extracts a character string candidate included in the image acquired in step S100 from the character string obtained by the OCR processing in step S102. Specifically, the character string candidates included in the image are extracted using the extraction pattern represented by the regular expression. In the present embodiment, for example, as shown in FIG. 2, regular expression information 42 in which a regular expression is defined is stored in advance in the storage unit 7, and extraction represented by the regular expression defined in the regular expression information 42 is performed. Extract character string candidates using the pattern. The regular expression information 42 is not limited to being stored in the storage unit 7 in advance, and the regular expression information 42 may be acquired from an external device via the communication unit 6.

ここで、正規表現とは、文字列の集合を一つの文字列で表現する方法である。そして、正規表現は、図面情報の抽出対象となる図面に含まれ得る文字列に対応したものである。すなわち、記憶部７に予め記憶された正規表現情報４２には、図面情報の抽出対象となる図面に含まれ得る文字列に対応した正規表現が定義されている。 Here, the regular expression is a method of expressing a set of character strings by one character string. The regular expression corresponds to the character string that can be included in the drawing from which the drawing information is extracted. That is, the regular expression information 42 stored in advance in the storage unit 7 defines the regular expressions corresponding to the character strings that can be included in the drawing from which the drawing information is extracted.

例えば、図面情報の抽出対象となる図面に含まれ得る文字列が４個の文字から構成され、先頭の文字が“Ａ”、２番目の文字が“Ｂ”、３番目の文字が“Ｃ”、４番目の文字が“０”〜“９”の１０種類の数字の何れかの文字である文字列であるとする。この場合、図面情報の抽出対象となる図面に含まれ得る文字列の正規表現は、“［Ａ］［Ｂ］［Ｃ］［０−９］”という一つの文字列で表される。この場合、正規表現情報４２には、正規表現として“［Ａ］［Ｂ］［Ｃ］［０−９］”が定義される。なお、正規表現“［Ａ］［Ｂ］［Ｃ］［０−９］”で表された抽出パターンの文字列は、“ＡＢＣ０”〜“ＡＢＣ９”までの１０パターンとなる。 For example, a character string that can be included in a drawing from which drawing information is extracted is composed of four characters, the first character is "A", the second character is "B", and the third character is "C". It is assumed that the fourth character is a character string in which any of the 10 types of numbers "0" to "9" is a character. In this case, the regular expression of the character string that can be included in the drawing from which the drawing information is extracted is represented by one character string “[A][B][C][0-9]”. In this case, the regular expression information 42 defines “[A][B][C][0-9]” as a regular expression. The character string of the extracted pattern represented by the regular expression “[A][B][C][0-9]” is 10 patterns from “ABC0” to “ABC9”.

ところで、ステップＳ１０２のＯＣＲ処理によって認識された文字列には、誤認識された文字が含まれる場合が多い。このため、上記の例において、正規表現“［Ａ］［Ｂ］［Ｃ］［０−９］”で表される抽出パターンを用いて文字列候補を抽出したのでは、本来抽出すべき文字列の一部の文字が誤認識された文字列が抽出されなくなってしまう。このため、ステップＳ１０４では、ＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現で表された抽出パターンを用いて文字列候補を抽出する。すなわち、正規表現情報４２には、ＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現を定義しておく。 By the way, in many cases, the character string recognized by the OCR processing in step S102 includes a character that is erroneously recognized. Therefore, in the above example, if the character string candidates are extracted using the extraction pattern represented by the regular expression “[A][B][C][0-9]”, A character string in which some of the characters are erroneously recognized will not be extracted. Therefore, in step S104, the character string candidates are extracted using the extraction pattern represented by the corrected regular expression corrected using the erroneous recognition pattern that is erroneously recognized in the OCR process. That is, the regular expression information 42 defines a modified regular expression that is corrected by using an erroneous recognition pattern that is erroneously recognized by the OCR process.

例えば、上記の例の場合において、“Ａ”は、例えば“Ｈ”と誤認識されやすい。また、“Ｂ”は、例えば“８”又は“日”と誤認識されやすい。また、“Ｃ”は、“［”又は“（”と誤認識されやすい。また、“０”は、“ｏ”又は“Ｏ”と誤認識されやすい。この場合、ＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現は、“［ＡＨ］［Ｂ８日］［Ｃ［（］［０ｏＯ−９］”となる。このような修正正規表現で表される抽出パターンを用いて文字列候補を抽出することにより、本来抽出すべき文字列の一部の文字が誤認識された文字列も抽出される。 For example, in the case of the above example, “A” is likely to be erroneously recognized as “H”. Also, "B" is easily misrecognized as "8" or "day", for example. Further, "C" is apt to be erroneously recognized as "[" or "(", and "0" is apt to be erroneously recognized as "o" or "O". The modified regular expression modified using the recognition pattern is “[AH][B8 days][C[(][0oO-9]”. By using the extraction pattern represented by such modified regular expression By extracting the character string candidates, a character string in which some characters of the character string to be originally extracted are erroneously recognized is also extracted.

例えばステップＳ１００で得られた画像が図４に示す画像５０であったとする。画像５０には、文字列“ＡＢＣ０”を表す画像、文字列“ＡＢＣ８”を表す画像、文字列“ＸＹＺ１”を表す画像が含まれている。 For example, assume that the image obtained in step S100 is the image 50 shown in FIG. The image 50 includes an image representing the character string “ABC0”, an image representing the character string “ABC8”, and an image representing the character string “XYZ1”.

この場合、ステップＳ１０２のＯＣＲ処理では、画像５０に含まれる全ての文字列が認識されるので、文字列“ＡＢＣ０”、文字列“ＡＢＣ８”、文字列“ＸＹＺ１”の全てが何らかの文字列として全て認識される。そして、ステップＳ１０２のＯＣＲ処理では少なくとも一部の文字が誤認識される場合があるので、全ての文字列を正しく認識するとは限らない。例えば、文字列“ＡＢＣ０”については、例えば“ＡＢＣｏ”又は“ＡＢＣＯ”と誤認識される場合がある。また、文字列“ＡＢＣ８”については、“ＡＢＣＢ”と誤認識される場合がある。これに対して、ステップＳ１０４では、修正正規表現で表された抽出パターンを用いて文字列候補を抽出するので、例えば“ＡＢＣｏ”又は“ＡＢＣＯ”と誤認識されても文字列候補として抽出される。 In this case, since all the character strings included in the image 50 are recognized in the OCR processing of step S102, all of the character strings “ABC0”, the character string “ABC8”, and the character string “XYZ1” are all regarded as some character strings. Be recognized. Then, in the OCR processing of step S102, at least some characters may be erroneously recognized, so that not all character strings are recognized correctly. For example, the character string "ABC0" may be erroneously recognized as "ABCo" or "ABCO". Further, the character string "ABC8" may be erroneously recognized as "ABCB". On the other hand, in step S104, since the character string candidates are extracted using the extraction pattern represented by the modified regular expression, even if the character strings are erroneously recognized as "ABCo" or "ABCO", they are extracted as character string candidates. ..

ステップＳ１０６では、ＣＰＵ１０Ａが、ステップＳ１０４で抽出された文字列候補について学習モデルを用いたＯＣＲ処理（以下、再認識処理と称する）を実行する。すなわち、ステップＳ１００で取得した画像全体について再認識処理を実行するのではなく、ステップＳ１０４で抽出された文字列候補についてのみ再認識処理を実行する。なお、再認識処理を実行する対象の領域は、ステップＳ１０２のルールベースのＯＣＲ処理で得られた文字列候補の位置に基づいて設定すればよい。 In step S106, the CPU 10A executes an OCR process (hereinafter, referred to as a re-recognition process) using the learning model for the character string candidates extracted in step S104. That is, the re-recognition process is not executed for the entire image acquired in step S100, but the re-recognition process is executed only for the character string candidates extracted in step S104. Note that the target region for executing the re-recognition process may be set based on the position of the character string candidate obtained by the rule-based OCR process in step S102.

例えば図５に示すように、画像５０全体について再認識処理を実行するのではなく、文字列候補“ＡＢＣ０”を含む領域Ｒ１と、文字列候補“ＡＢＣ８”を含む領域Ｒ２と、について再認識処理を実行する。 For example, as shown in FIG. 5, the re-recognition process is not performed on the entire image 50, but the re-recognition process is performed on the region R1 including the character string candidate “ABC0” and the region R2 including the character string candidate “ABC8”. To execute.

なお、学習モデルを用いたＯＣＲ処理としては、本実施形態では一例として畳み込みニューラルネットワーク（ＣＮＮ：Convolutional Neural Network）等の機械学習を用いたＯＣＲ処理が挙げられるが、これに限られるものではない。 As an example of the OCR process using the learning model, an OCR process using machine learning such as a convolutional neural network (CNN) is used in the present embodiment, but the present invention is not limited to this.

学習モデルは、特定の文字について学習されている。ここで、特定の文字とは、図面情報抽出機能を実行する対象の図面に含まれ得る文字であり、例えば、アルファベット、数字、及び記号等である。 The learning model has been learned for a particular character. Here, the specific character is a character that can be included in the drawing for which the drawing information extraction function is executed, and is, for example, an alphabet, a number, or a symbol.

このように、特定の文字について学習された学習モデルを用いたＯＣＲ処理は、ステップＳ１０２で実行されるルールベースのＯＣＲ処理と比較して文字列の認識率は高くなる。 As described above, the OCR processing using the learning model learned for a specific character has a higher character string recognition rate than the rule-based OCR processing executed in step S102.

例えば画像５０に含まれる文字列“ＡＢＣ０”が、ステップＳ１０２のＯＣＲ処理で“ＡＢＣｏ”と誤認識され、文字列“ＡＢＣ８”が“ＡＢＣＢ”と誤認識されたとする。このような場合でも、ステップＳ１０６の再認識処理で図５に示す領域Ｒ１、Ｒ２について再認識処理を実行した結果、“ＡＢＣ０”、“ＡＢＣ８”とそれぞれ正しく認識される場合があり得る。 For example, it is assumed that the character string “ABC0” included in the image 50 is erroneously recognized as “ABCo” and the character string “ABC8” is erroneously recognized as “ABCB” in the OCR process of step S102. Even in such a case, as a result of performing the re-recognition process on the regions R1 and R2 shown in FIG. 5 in the re-recognition process of step S106, “ABC0” and “ABC8” may be correctly recognized, respectively.

なお、ステップＳ１０６で実行される再認識処理では、文字認識の精度を表すスコアが算出される。スコアが高いほど文字認識の精度が高く、スコアが低いほど文字認識の精度が低い。例えばスコアは０〜１の範囲の値であり、数値が高いほど文字認識の精度が高い。 In the re-recognition process executed in step S106, a score representing the accuracy of character recognition is calculated. The higher the score, the higher the accuracy of character recognition, and the lower the score, the lower the accuracy of character recognition. For example, the score is a value in the range of 0 to 1, and the higher the numerical value, the higher the accuracy of character recognition.

以下のステップＳ１０８以降の処理は、再認識処理で認識された文字列毎に実行される。 The processing from step S108 below is executed for each character string recognized in the re-recognition processing.

ステップＳ１０８では、ＣＰＵ１０Ａが、ステップＳ１０６の再認識処理により認識された文字列と同型の文字列候補である誤認識文字列候補を生成する。 In step S108, the CPU 10A generates a misrecognized character string candidate that is a character string candidate of the same type as the character string recognized by the re-recognition process of step S106.

具体的には、例えば図６に示すように、誤認識文字が定義された誤認識文字情報４４が記憶部７に予め記憶されており、この誤認識文字情報４４を用いて誤認識文字列候補を生成する。誤認識文字情報４４は、ステップＳ１０２のＯＣＲ処理で誤認識する誤認識パターンを定義したものである。図６の例では、“Ａ”に対して誤認識されやすい“Ｈ”が定義されている。また、“Ｂ”に対して誤認識されやすい“８”が定義されている。また、“０”に対して誤認識されやすい“ｏ”及び“Ｏ”が定義されている。 Specifically, for example, as shown in FIG. 6, misrecognized character information 44 in which misrecognized characters are defined is stored in advance in the storage unit 7, and the misrecognized character string candidate is used by using this misrecognized character information 44. To generate. The misrecognized character information 44 defines a misrecognized pattern that is misrecognized in the OCR process of step S102. In the example of FIG. 6, “H” that is easily misrecognized for “A” is defined. Further, “8” is defined which is easily misrecognized for “B”. Further, “o” and “O” that are easily misrecognized with respect to “0” are defined.

これにより、例えばステップＳ１０６の再認識処理により認識された文字列が“ＡＢＣ０”であった場合、“ＨＢＣ０”、“Ａ８Ｃ０”、“ＡＢＣｏ”、“ＡＢＣＯ”、“Ｈ８Ｃ０”、“Ｈ８Ｃｏ”、“Ｈ８ＣＯ”等が誤認識文字列候補として生成される。 Thereby, for example, when the character string recognized by the re-recognition process of step S106 is "ABC0", "HBC0", "A8C0", "ABCo", "ABCO", "H8C0", "H8Co", " H8CO" or the like is generated as a misrecognized character string candidate.

ステップＳ１１０では、ＣＰＵ１０Ａが、ユーザーによって指定された誤認識パターンを用いて生成した文字列を誤認識文字列候補に追加する。ここで、ユーザーによって指定された誤認識パターンとは、例えば図面に含まれる文字のフォントに応じて設定された誤認識パターンである。例えば“０”に対して誤認識されやすい文字の代表的な例は“ｏ”及び“Ｏ”であるが、図面で使用されている文字のフォントによっては、“ｏ”及び“Ｏ”以外の他の文字と誤認識されやすい場合もある。この場合、例えば図７に示すように、ユーザー指定の誤認識パターンを定義した誤認識パターン情報４６を記憶部７に予め記憶しておき、この誤認識パターン情報４６を用いて生成した文字列を誤認識文字列候補に追加する。 In step S110, the CPU 10A adds the character string generated using the erroneous recognition pattern designated by the user to the erroneously recognized character string candidates. Here, the erroneous recognition pattern designated by the user is, for example, an erroneous recognition pattern set according to the font of characters included in the drawing. For example, typical examples of characters that are easily erroneously recognized as "0" are "o" and "O", but depending on the font of the characters used in the drawings, characters other than "o" and "O" may be used. It may be easily misrecognized as another character. In this case, for example, as shown in FIG. 7, misrecognition pattern information 46 defining a user-specified misrecognition pattern is stored in the storage unit 7 in advance, and a character string generated using this misrecognition pattern information 46 is stored. Add to the misrecognized character string candidates.

図７の誤認識パターン情報４６の例では、“Ｂ”に対して誤認識されやすい文字として“日”が、“０”に対して誤認識されやすい文字として“Ｄ”が設定されている。例えばステップＳ１０６の再認識処理により認識された文字列が“ＡＢＣ０”であった場合、“Ａ日Ｃ０”、“ＡＢＣＤ”が誤認識文字列候補として追加される。 In the example of the misrecognized pattern information 46 of FIG. 7, "day" is set as a character that is easily misrecognized for "B", and "D" is set as a character that is easily misrecognized for "0". For example, when the character string recognized by the re-recognition process in step S106 is "ABC0", "A date C0" and "ABCD" are added as the misrecognized character string candidates.

なお、ステップＳ１０８及びステップＳ１１０で生成した誤認識文字列候補のうち、正規表現で表される文字列以外の誤認識文字列候補については、画像に含まれる文字列候補を特定する上で不要である。 Note that among the misrecognized character string candidates generated in step S108 and step S110, the misrecognized character string candidates other than the character string represented by the regular expression are not necessary for specifying the character string candidates included in the image. is there.

そこで、ステップＳ１１２では、ＣＰＵ１０Ａが、ステップＳ１０８及びステップＳ１１０で生成した誤認識文字列候補のうち、正規表現で表される抽出パターンの文字列以外の誤認識文字列候補を除外する。 Therefore, in step S112, the CPU 10A excludes the misrecognized character string candidates other than the character string of the extraction pattern represented by the regular expression from the misrecognized character string candidates generated in step S108 and step S110.

例えば、上記の例で正規表現が“［Ａ］［Ｂ］［Ｃ］［０−９］”の場合において、ステップＳ１０８及びステップＳ１１０で生成した誤認識文字列候補として“ＨＢＣ０”、“Ａ８Ｃ０”、“ＡＢＣｏ”、“ＡＢＣＯ”、“Ｈ８Ｃ０”、“Ｈ８Ｃｏ”、“Ｈ８ＣＯ”、“Ａ日Ｃ０”、“ＡＢＣＤ”が含まれていたとする。正規表現で表される文字列は先頭から３文字目まではアルファベット、先頭から４文字目は数字なので、上記の誤認識文字列候補のうち、先頭から４文字目がアルファベットである“ＡＢＣｏ”、“ＡＢＣＯ”、“Ｈ８Ｃｏ”、“Ｈ８ＣＯ”、“ＡＢＣＤ”、先頭から２文字目が漢字である“Ａ日Ｃ０”は誤認識文字列候補から除外される。 For example, in the above example, when the regular expression is “[A][B][C][0-9]”, “HBC0” and “A8C0” are set as the misrecognized character string candidates generated in step S108 and step S110. , "ABCo", "ABCO", "H8C0", "H8Co", "H8CO", "A day C0", "ABCD" are included. Since the character string represented by the regular expression is the alphabet from the first to the third character, and the fourth character from the first is a number, "ABCo" in which the fourth character from the first is alphabet among the above misrecognized character string candidates, “ABCO”, “H8Co”, “H8CO”, “ABCD”, and “A day C0” in which the second character from the beginning is a Chinese character are excluded from the misrecognized character string candidates.

ステップＳ１１４では、ＣＰＵ１０Ａが、正規表現を用いて、誤認識文字列候補の中から文字列候補を特定する。すなわち、誤認識文字列候補のうち、正規表現で表される文字列に含まれる誤認識文字列候補を文字列候補として特定する。例えば正規表現が“［Ａ］［Ｂ］［Ｃ］［０−９］”の場合、正規表現で表される文字列は“ＡＢＣ０”〜“ＡＢＣ９”までの１０パターンである。従って、誤認識文字列候補のうち、“ＡＢＣ０”〜“ＡＢＣ９”の何れかの文字列と一致する誤認識文字列候補を文字列候補として特定する。一方、正規表現で表される文字列に何れの誤認識文字列候補も含まれていない場合は、文字列候補は特定されない。 In step S114, the CPU 10A identifies a character string candidate from the misrecognized character string candidates using the regular expression. That is, among the misrecognized character string candidates, the misrecognized character string candidates included in the character string represented by the regular expression are specified as the character string candidates. For example, when the regular expression is “[A][B][C][0-9]”, the character string represented by the regular expression is 10 patterns from “ABC0” to “ABC9”. Therefore, among the misrecognized character string candidates, the misrecognized character string candidates that match any of the character strings “ABC0” to “ABC9” are specified as the character string candidates. On the other hand, if the character string represented by the regular expression does not include any erroneously recognized character string candidates, the character string candidates are not specified.

ステップＳ１１６では、ＣＰＵ１０Ａが、ステップＳ１１４で文字列候補が特定されたか否かを判定する。そして、文字列候補が特定された場合、すなわち正規表現で表される文字列の何れかと一致する誤認識文字列候補が存在する場合はステップＳ１１８へ移行する。一方、文字列候補が特定されなかった場合、すなわち正規表現で表される文字列と一致する誤認識文字列候補が存在しなかった場合はステップＳ１２４へ移行する。 In step S116, the CPU 10A determines whether the character string candidate is identified in step S114. Then, if the character string candidate is specified, that is, if there is a misrecognized character string candidate that matches any of the character strings represented by the regular expression, the process proceeds to step S118. On the other hand, if no character string candidate is specified, that is, if there is no misrecognized character string candidate that matches the character string represented by the regular expression, the process proceeds to step S124.

ステップＳ１１８では、ＣＰＵ１０Ａが、ステップＳ１０６の再認識処理において算出されたスコアが予め定めた閾値以上であるか否かを判定する。なお、閾値は、スコアが閾値以上であれば、ＯＣＲ処理の認識結果の信頼性が許容レベル以上とされる値に設定される。例えば、スコアが取り得る値の最大値の９割以上の値に設定される。具体的には、スコアが取り得る値の範囲が０〜１の場合、閾値は一例として０．９５に設定される。 In step S118, the CPU 10A determines whether or not the score calculated in the re-recognition process of step S106 is equal to or higher than a predetermined threshold value. It should be noted that the threshold value is set to a value at which the reliability of the recognition result of the OCR process is equal to or higher than the allowable level if the score is equal to or higher than the threshold value. For example, the score is set to 90% or more of the maximum possible values. Specifically, when the range of values that the score can take is 0 to 1, the threshold is set to 0.95 as an example.

そして、ステップＳ１０６の再認識処理において算出されたスコアが予め定めた閾値以上である場合、すなわちステップＳ１０６の再認識処理の認識結果の信頼性が許容レベル以上である場合はステップＳ１１９へ移行する。一方、ステップＳ１０６の再認識処理において算出されたスコアが予め定めた閾値未満である場合、すなわちステップＳ１０６の再認識処理の認識結果の信頼性が許容レベル未満である場合はステップＳ１２２へ移行する。 Then, when the score calculated in the re-recognition process of step S106 is equal to or higher than a predetermined threshold, that is, when the reliability of the recognition result of the re-recognition process of step S106 is equal to or higher than the allowable level, the process proceeds to step S119. On the other hand, if the score calculated in the re-recognition process of step S106 is less than the predetermined threshold, that is, if the reliability of the recognition result of the re-recognition process of step S106 is less than the allowable level, the process proceeds to step S122.

ステップＳ１１９では、ステップＳ１１４で特定された文字列候補が１つであるか否かを判定する。そして、特定された文字列候補が１つの場合はステップＳ１２０へ移行し、特定された文字列候補が複数の場合はステップＳ１２２へ移行する。 In step S119, it is determined whether the number of character string candidates identified in step S114 is one. Then, if there is one identified character string candidate, the process proceeds to step S120, and if there are multiple identified character string candidates, the process proceeds to step S122.

ステップＳ１２０では、ＣＰＵ１０Ａが、ステップＳ１１４で特定した１つの文字列候補を出力する。すなわち、特定した文字列候補を紙等の記録媒体に印刷したり、データファイルとして記憶部７に記憶したり、通信部６を介して外部装置に送信したり、表示部５に表示したりする。 In step S120, CPU 10A outputs the one character string candidate identified in step S114. That is, the specified character string candidates are printed on a recording medium such as paper, stored in the storage unit 7 as a data file, transmitted to an external device via the communication unit 6, or displayed on the display unit 5. ..

一方、ステップＳ１２２が実行されるのは、ステップＳ１１８で再認識処理の認識結果の信頼性が許容レベル未満と判定された又はステップＳ１１９で複数の文字列候補が特定された場合である。このため、ステップＳ１１４で特定された文字列候補を、あくまで図面に含まれると推定される文字列候補である推定文字列候補として出力する。 On the other hand, step S122 is executed when the reliability of the recognition result of the re-recognition processing is determined to be less than the allowable level in step S118 or when a plurality of character string candidates are specified in step S119. Therefore, the character string candidates identified in step S114 are output as estimated character string candidates that are character string candidates estimated to be included in the drawing.

また、ステップＳ１２４が実行されるのは、正規表現で表される文字列と一致する誤認識文字列候補が存在しない場合である。このため、誤認識文字列候補に近い文字列を正規表現で表される文字列から検索する。具体的には、正規表現で表される文字列の中から、誤認識文字列候補と異なる文字の数が最も少ない文字列を検索する。そして、検索した文字列を推定文字列候補として出力する。 Further, step S124 is executed when there is no erroneously recognized character string candidate that matches the character string represented by the regular expression. Therefore, a character string close to the erroneously recognized character string candidate is searched from the character string represented by the regular expression. Specifically, the character string represented by the regular expression is searched for the character string having the smallest number of characters different from the erroneously recognized character string candidate. Then, the retrieved character string is output as an estimated character string candidate.

ステップＳ１２６では、ステップＳ１０６で認識された全ての文字列についてステップＳ１０８〜Ｓ１２４の処理を実行したか否かを判定する。そして、ステップＳ１０６で認識された全ての文字列についてステップＳ１０８〜Ｓ１２４の処理を実行した場合は本ルーチンを終了する。一方、ステップＳ１０６で認識された全ての文字列についてステップＳ１０８〜Ｓ１２４の処理を実行していない場合は、ステップＳ１０８へ移行し、全ての文字列の処理が終了するまでステップＳ１０８〜Ｓ１２４の処理を繰り返す。 In step S126, it is determined whether or not the processes of steps S108 to S124 have been executed for all the character strings recognized in step S106. Then, when the processes of steps S108 to S124 have been executed for all the character strings recognized in step S106, this routine ends. On the other hand, if the processes of steps S108 to S124 have not been executed for all the character strings recognized in step S106, the process proceeds to step S108, and the processes of steps S108 to S124 are performed until the processing of all the character strings is completed. repeat.

このように、本実施形態では、ＯＣＲ処理によって画像から認識された文字列から、正規表現で表された抽出パターンを用いて文字列候補を抽出する。そして、ＯＣＲ処理が誤認識する誤認識パターンを用いて文字列候補を修正し、正規表現を用いて、修正された文字列候補の中から特定した文字列候補を出力する。 As described above, in this embodiment, character string candidates are extracted from the character string recognized from the image by the OCR process using the extraction pattern represented by the regular expression. Then, the OCR process corrects the character string candidates using the erroneous recognition pattern, and the regular expression is used to output the character string candidates identified from the corrected character string candidates.

なお、第１実施形態では、図３のステップＳ１１０において、ユーザー指定の誤認識パターンを定義した誤認識パターン情報４６を用いて生成した文字列を誤認識文字列候補に追加しているが、ステップＳ１１０の処理を省略してもよい。 In the first embodiment, in step S110 of FIG. 3, the character string generated using the misrecognized pattern information 46 defining the user-specified misrecognized pattern is added to the misrecognized character string candidates. The process of S110 may be omitted.

次に、文字列候補を特定する場合の具体例について説明する。 Next, a specific example of specifying a character string candidate will be described.

・具体例１・Specific example 1

具体例１では、正規表現が[F][X][1][0][A-Z][A-Z][0-9][0-9][0-9]、図面に含まれる実際の文字列が“FX10AA106”である。 In Example 1, the regular expression is [F][X][1][0][AZ][AZ][0-9][0-9][0-9], the actual string included in the drawing. Is "FX10AA106".

上記の条件でＯＣＲ処理を実行すると、認識した文字列候補は“FXIOAAIO6”であった。すなわち、３、４文字目の“10”が“IO”に、７、８文字目の“10”が“IO”に誤認識されている。 When the OCR process was executed under the above conditions, the recognized character string candidate was "FXIOAAIO6". That is, the third and fourth characters "10" are mistakenly recognized as "IO", and the seventh and eighth characters "10" are mistakenly recognized as "IO".

次に、再認識処理の結果は、“FX10AA106”となり、図面に含まれる実際の文字列と完全に一致した。なお、再認識処理のスコアは閾値以上であった。また、誤認識文字列候補としては、再認識処理により認識された文字列も含めて、“FX10AA106”、“FX1OAA106”、“FX1oAA06”、“FX10AA1O6”、“FX10AA1o6”等が生成された。なお、具体例１では、ユーザー指定の誤認識パターン情報を用いて誤認識文字列候補を更に生成する処理は実行しなかった。 Next, the result of the re-recognition process was “FX10AA106”, which completely matched the actual character string included in the drawing. The score of the re-recognition process was equal to or higher than the threshold. In addition, as the erroneously recognized character string candidates, “FX10AA106”, “FX1OAA106”, “FX1oAA06”, “FX10AA1O6”, “FX10AA1o6”, etc. were generated including the character strings recognized by the re-recognition process. In addition, in the specific example 1, the process of further generating the misrecognized character string candidate by using the misrecognized pattern information designated by the user is not executed.

誤認識文字列候補のうち、正規表現を用いて不要な文字列候補を除外すると、“FX10AA106”のみが残る。この誤認識文字列候補は、正規表現で表される文字列である。このため、図面に記載された文字列候補として“FX10AA106”が特定され、出力される。この文字列は、図面に含まれる実際の文字列と完全に一致している。 If unnecessary character string candidates are excluded from the misrecognized character string candidates by using a regular expression, only "FX10AA106" remains. This erroneously recognized character string candidate is a character string represented by a regular expression. Therefore, “FX10AA106” is specified and output as the character string candidate described in the drawing. This string exactly matches the actual string contained in the drawing.

・具体例２・Specific example 2

具体例２では、再認識処理のスコアが閾値未満であること以外は具体例１と同じである。 The specific example 2 is the same as the specific example 1 except that the score of the re-recognition process is less than the threshold value.

具体例２では、具体例１と同様に、図面に記載された文字列候補として、図面に含まれる実際の文字列と完全に一致する“FX10AA106”が特定されるが、再認識処理のスコアが閾値未満であるため信頼性が低い。従って、文字列候補“FX10AA106”は、あくまで図面に含まれると推定される文字列候補である推定文字列候補として出力される。 In the second specific example, as in the first specific example, “FX10AA106” that exactly matches the actual character string included in the drawing is specified as the character string candidate described in the drawing, but the score of the re-recognition process is Since it is less than the threshold value, the reliability is low. Therefore, the character string candidate “FX10AA106” is output as an estimated character string candidate which is a character string candidate estimated to be included in the drawing.

・具体例３・Specific example 3

具体例３では、正規表現が[F][X][1][0][A-Z][A-Z][0-9][0-9][0-9,A-Z]、図面に含まれる実際の文字列が“FX10AA106”である。 In Example 3, the regular expression is [F][X][1][0][AZ][AZ][0-9][0-9][0-9,AZ] The character string is "FX10AA106".

上記の条件でＯＣＲ処理を実行すると、認識した文字列候補は“FXIOAAIOG”であった。すなわち、３、４文字目の“10”が“IO”に、７〜９文字目の“106”が“IOG”に誤認識されている。 When the OCR process was executed under the above conditions, the recognized character string candidate was "FXIOAAIOG". That is, the third and fourth characters “10” are erroneously recognized as “IO”, and the seventh to ninth characters “106” are erroneously recognized as “IOG”.

次に、再認識処理の結果は、“FX10AA10G”となり、９文字目の“6”が“G”に誤認識されている。なお、再認識処理のスコアは閾値以上であった。
また、誤認識文字列候補としては、再認識処理により認識された文字列も含めて、“FX10AA10G”、“FX1OAA10G”、“FX1oAA0G”、“FX10AA1OG”、“FX10AA1oG”等が生成された。 Next, the result of the re-recognition process is "FX10AA10G", and the ninth character "6" is erroneously recognized as "G". The score of the re-recognition process was equal to or higher than the threshold.
In addition, as the erroneously recognized character string candidates, "FX10AA10G", "FX1OAA10G", "FX1oAA0G", "FX10AA1OG", "FX10AA1oG", etc. were generated, including the character strings recognized by the re-recognition process.

また、具体例３では、ユーザー指定の誤認識パターン情報を用いて生成した文字列を誤認識文字列候補に追加した。ユーザー指定の誤認識パターン情報は、誤認識パターンとして“６：Ｇ”が定義されている。このため、誤認識文字列候補として“FX10AA106”、“FX10AA10G”を生成して追加した。 Moreover, in the specific example 3, the character string generated using the user-specified misrecognition pattern information is added to the misrecognized character string candidates. In the user-specified misrecognition pattern information, "6:G" is defined as the misrecognition pattern. Therefore, "FX10AA106" and "FX10AA10G" are generated and added as the misrecognized character string candidates.

誤認識文字列候補のうち、正規表現を用いて不要な文字列候補を除外すると、“FX10AA106”、“FX10AA10G”が残る。これらの誤認識文字列候補は、何れも正規表現で表される文字列であるため、推定文字列候補として出力される。 If unnecessary character string candidates are excluded from the misrecognized character string candidates using regular expressions, "FX10AA106" and "FX10AA10G" remain. Since all of these erroneously recognized character string candidates are character strings represented by regular expressions, they are output as estimated character string candidates.

（第２実施形態） (Second embodiment)

次に、第２実施形態について説明する。なお、第１実施形態と同一部分には同一符号を付し、詳細な説明は省略する。第２実施形態では、図面から抽出したい情報がプラスチック部品の材質を表す文字列である場合について説明する。 Next, a second embodiment will be described. The same parts as those in the first embodiment are designated by the same reference numerals, and detailed description thereof will be omitted. In the second embodiment, a case will be described in which the information to be extracted from the drawing is a character string representing the material of the plastic part.

プラスチック部品の材質の表示方法は、ＪＩＳ（日本工業規格）で規定されている。例えば、単一のポリマー等の部品の表示は、ＪＩＳＫ６８９９−１で規定された材料の略語を境界文字“＞”及び“＜”で挟むことになっている。例えば、材料がアクリロニトリル−ブタジエン−スチレンであり、この略語がＡＢＳである場合は、“＞ＡＢＳ＜”と表記される。 The method of displaying the material of plastic parts is specified by JIS (Japanese Industrial Standard). For example, in the display of a single polymer or the like, an abbreviation of a material defined by JIS K 6899-1 is to be sandwiched by boundary characters ">" and "<". For example, when the material is acrylonitrile-butadiene-styrene and this abbreviation is ABS, it is written as ">ABS<".

また、プラスチック部品の材質の表示では、材料の略語の他に、“−”、“＋”、“（”、“）”等が区切り文字として使用されたり、質量分率又は材料の含有率等を表す数字が使用されたりする。例えば“＞ＰＡ６６−（ＧＦ２５＋ＭＤ１５）＜”と表記されている場合、ガラス繊維（ＧＦ）２５質量％と鉱物粉末（ＭＤ）１５質量％の混合物を含むポリアミド６６を表す。 In addition, in the display of the material of the plastic parts, in addition to the material abbreviation, "-", "+", "(", ")", etc. are used as delimiters, and the mass fraction or the content rate of the material, etc. A number that represents is used. For example, the expression ">PA66-(GF25+MD15)<" indicates polyamide 66 containing a mixture of 25% by mass of glass fiber (GF) and 15% by mass of mineral powder (MD).

必要とされるのは材料の略語を含む文字列であり、境界文字は不要である。このため、第２実施形態では、境界文字で挟まれた文字列については、境界文字を除外した文字列を必要文字列として抽出する。 All that is needed is a string containing the material abbreviations, and no delimiters. Therefore, in the second embodiment, for the character string sandwiched by the boundary characters, the character string excluding the boundary characters is extracted as the necessary character string.

図８には、第２実施形態に係るＣＰＵ１０Ａの機能ブロック図を示した。図８に示す機能ブロック図は、必要文字列抽出部３１が設けられている点が図２に示す機能ブロック図と異なる。 FIG. 8 shows a functional block diagram of the CPU 10A according to the second embodiment. The functional block diagram shown in FIG. 8 is different from the functional block diagram shown in FIG. 2 in that a necessary character string extraction unit 31 is provided.

必要文字列抽出部３１は、再認識処理部３０による再認識処理により認識された文字列から必要文字列を抽出する。具体的には、再認識処理により認識された文字列に境界文字が含まれているか否かを判定する。そして、再認識処理により認識された文字列に境界文字が含まれている場合には、境界文字を除外した文字列を必要文字列として抽出する。 The required character string extraction unit 31 extracts a required character string from the character string recognized by the re-recognition processing by the re-recognition processing unit 30. Specifically, it is determined whether the character string recognized by the re-recognition process includes a boundary character. Then, when the character string recognized by the re-recognition processing includes a boundary character, the character string excluding the boundary character is extracted as a necessary character string.

次に、図９を参照して、本実施の形態に係る情報処理装置１０の作用を説明する。図９に示す情報処理が図３に示す情報処理と異なるのは、ステップＳ１０７の処理が追加されている点である。 Next, the operation of the information processing device 10 according to the present embodiment will be described with reference to FIG. The information processing shown in FIG. 9 is different from the information processing shown in FIG. 3 in that the processing of step S107 is added.

ステップＳ１０７では、ステップＳ１０６の再認識処理により認識された文字列の各々について、必要文字列を抽出する処理を行う。すなわち、再認識処理により認識された文字列のうち、境界文字“＞”及び“＜”で挟まれた文字列を抽出する。そして、境界文字“＞”及び“＜”で挟まれた文字列については、境界文字を除いた文字列を必要文字列として抽出する。例えば再認識処理で認識された文字列が“＞ＡＢＳ＜”の場合は、境界文字を除外した“ＡＢＳ”を必要文字列として抽出する。また、再認識処理で認識された文字列が“＞ＰＡ６６−（ＧＦ２５＋ＭＤ１５）＜”の場合は、境界文字を除外した“ＰＡ６６−（ＧＦ２５＋ＭＤ１５）”を必要文字列として抽出する。 In step S107, a process of extracting a necessary character string is performed for each of the character strings recognized by the re-recognition process of step S106. That is, of the character strings recognized by the re-recognition process, the character string sandwiched between the boundary characters “>” and “<” is extracted. Then, for the character string sandwiched between the boundary characters “>” and “<”, the character string excluding the boundary character is extracted as a required character string. For example, when the character string recognized in the re-recognition process is “>ABS<”, “ABS” excluding the boundary character is extracted as the necessary character string. If the character string recognized in the re-recognition process is ">PA66-(GF25+MD15)<", "PA66-(GF25+MD15)" excluding the boundary character is extracted as the necessary character string.

・具体例１・Specific example 1

具体例１では、正規表現が[>][][][][][][][][][][][][][][][][][][][][][][][][<] 、図面に含まれる実際の文字列が“>PA66-(GF25+MD15)<”である。 In Specific Example 1, the regular expression is [>][][][][][][][][][][][][][][][][][][][][][ ][][][][<], the actual string contained in the drawing is “>PA66-(GF25+MD15)<”.

上記の条件でＯＣＲ処理を実行すると、認識した文字列候補は“>PAGG-(GF2S+MDI5)<”であった。すなわち、４、５文字目の“66”が“GG”に、１０文字目の“5”が“S”に、１４文字目の“1”が“I”に誤認識されている。 When the OCR process was executed under the above conditions, the recognized character string candidate was ">PAGG-(GF2S+MDI5)<". That is, the fourth and fifth characters "66" are mistakenly recognized as "GG", the tenth character "5" as "S", and the fourteenth character "1" as "I".

次に、再認識処理の結果は、“>PA66-(GF25+MD15)<”となり、再認識処理のスコアは閾値以上であった。この文字列は、正規表現で表される文字列と一致する。このため、図面に記載された文字列候補として“PA66-(GF25+MD15)”が特定され、出力される。 Next, the result of the re-recognition process was “>PA66-(GF25+MD15)<”, and the score of the re-recognition process was not less than the threshold. This string matches the string represented by the regular expression. Therefore, "PA66-(GF25+MD15)" is specified and output as the character string candidate described in the drawing.

・具体例２・Specific example 2

具体例２は、再認識処理のスコアが閾値未満である点以外は具体例１と同じである。この場合、再認識処理の認識結果の信頼性が低いので、“PA66-(GF25+MD15)”は推定文字列候補として出力される。 The specific example 2 is the same as the specific example 1 except that the score of the re-recognition process is less than the threshold value. In this case, since the reliability of the recognition result of the re-recognition process is low, "PA66-(GF25+MD15)" is output as the estimated character string candidate.

以上、各実施形態を用いて本発明について説明したが、本発明は各実施形態に記載の範囲には限定されない。本発明の要旨を逸脱しない範囲で各実施形態に多様な変更または改良を加えることができ、当該変更または改良を加えた形態も本発明の技術的範囲に含まれる。 Although the present invention has been described above using each embodiment, the present invention is not limited to the scope described in each embodiment. Various modifications and improvements can be added to each embodiment without departing from the gist of the present invention, and the modes to which the modifications or improvements are added are also included in the technical scope of the present invention.

例えば上記各実施形態では、情報処理装置が複合機に適用される場合を例に説明したが、スマートフォン又はタブレット端末等に適用してもよい。この場合、スマートフォン又はタブレット端末に搭載されたカメラで図面を撮影し、撮影した画像に対して図３又は図９の処理を実行すればよい。 For example, in each of the above-described embodiments, the case where the information processing apparatus is applied to the multifunction peripheral has been described as an example, but the information processing apparatus may be applied to a smartphone, a tablet terminal, or the like. In this case, a drawing may be taken with a camera mounted on a smartphone or a tablet terminal, and the process of FIG. 3 or 9 may be executed on the taken image.

また、例えば、図３、９に示した情報処理をＡＳＩＣ（Application Specific Integrated Circuit）等のハードウェアで実現するようにしてもよい。この場合、ソフトウエアで実現する場合に比べて、処理の高速化が図られる。 Further, for example, the information processing shown in FIGS. 3 and 9 may be realized by hardware such as an ASIC (Application Specific Integrated Circuit). In this case, the processing speed can be increased as compared with the case of being realized by software.

また、各実施形態では、情報処理プログラムが記憶部７にインストールされている形態を説明したが、これに限定されるものではない。本実施形態に係る情報処理プログラムを、コンピュータ読取可能な記憶媒体に記録した形態で提供してもよい。例えば、本実施形態に係る情報処理プログラムを、ＣＤ(Compact Disc)−ＲＯＭ及びＤＶＤ(Digital Versatile Disc)−ＲＯＭ等の光ディスクに記録した形態、若しくはＵＳＢ(Universal Serial Bus)メモリ及びメモリカード等の半導体メモリに記録した形態で提供してもよい。また、本実施形態に係る情報処理プログラムを、通信部６に接続された通信回線を介して外部装置から取得するようにしてもよい。 Further, in each of the embodiments, the mode in which the information processing program is installed in the storage unit 7 has been described, but the invention is not limited to this. The information processing program according to the present embodiment may be provided in a form recorded in a computer-readable storage medium. For example, the information processing program according to the present embodiment is recorded in an optical disc such as a CD (Compact Disc)-ROM and a DVD (Digital Versatile Disc)-ROM, or a semiconductor such as a USB (Universal Serial Bus) memory and a memory card. You may provide in the form recorded on memory. Further, the information processing program according to the present embodiment may be acquired from an external device via a communication line connected to the communication unit 6.

１画像形成装置
２スキャナ部
３画像形成部
４操作部
５表示部
６通信部
７記憶部
１０情報処理装置
２０ＯＣＲ処理部
２２取得部
２４抽出部
２６修正部
２８出力部
３０再認識処理部
３１必要文字列抽出部
３２生成部
４０情報処理プログラム
４２正規表現情報
４４誤認識文字情報
４６誤認識パターン情報
５０画像 1 image forming apparatus 2 scanner section 3 image forming section 4 operation section 5 display section 6 communication section 7 storage section 10 information processing apparatus 20 OCR processing section 22 acquisition section 24 extraction section 26 correction section 28 output section 30 re-recognition processing section 31 required Character string extraction unit 32 Generation unit 40 Information processing program 42 Regular expression information 44 False recognition character information 46 False recognition pattern information 50 Image

Claims

An acquisition unit that acquires a character string recognized from an image by character recognition processing,
From the character string acquired by the acquisition unit, an extraction unit that extracts character string candidates using an extraction pattern represented by a regular expression,
A correction unit that corrects the character string candidates using an erroneous recognition pattern that the character recognition process erroneously recognizes,
An output unit that outputs the character string candidates identified from the character string candidates corrected by the correction unit using the regular expression,
Information processing device equipped with.

The information processing apparatus according to claim 1, wherein the extraction unit extracts the character string candidates using a modified extraction pattern represented by a modified regular expression obtained by modifying the regular expression using the misrecognition pattern.

The acquisition unit acquires the image,
The correction unit, a character recognition processing unit for performing character recognition processing of the image for a specific character,
A generation unit that generates a misrecognized character string candidate that is a character string candidate of the same type as the character string candidate recognized by the character recognition processing unit;
Including
The information processing apparatus according to claim 1, wherein the output unit outputs a character string candidate specified from the misrecognized character string candidates.

The information processing apparatus according to claim 3, wherein the generation unit adds a character string generated using an erroneous recognition pattern designated by a user to the erroneously recognized character string candidate.

The information processing apparatus according to claim 3, wherein the generation unit generates the misrecognized character string candidate by excluding the character string candidate represented by the extraction pattern.

The character recognition processing unit calculates a score representing the accuracy of character recognition,
The information processing apparatus according to claim 3, wherein the output unit outputs the specified character string candidate when the score is equal to or higher than a predetermined threshold value.

The character recognition processing unit calculates a score representing the accuracy of character recognition,
The information processing apparatus according to claim 3, wherein the output unit outputs the specified character string candidate as an estimated character string candidate when the score is less than a predetermined threshold value.

The character recognition processing unit calculates a score representing the accuracy of character recognition,
The output unit outputs a plurality of the specified character string candidates as an estimated character string candidate when the score is equal to or more than a predetermined threshold and the specified character string candidates are plural. The information processing apparatus according to item 1.

Further comprising a necessary character string extraction unit for extracting a necessary character string that is a necessary character string from the character string recognized by the character recognition processing unit,
The information processing according to any one of claims 3 to 5, wherein the generation unit generates a misrecognized character string candidate that is a character string candidate having the same type as the necessary character string extracted by the necessary character string extraction unit. apparatus.

When the character string represented by the regular expression does not have a character string candidate modified by the modifying unit, the output unit includes a character string modified by the modifying unit among the character strings represented by the regular expression. The information processing apparatus according to claim 1, wherein a character string close to a column candidate is output as an estimated character string candidate.

The output unit outputs, as an estimated character string candidate, a character string having the smallest number of characters different from the character string candidate corrected by the correction unit among the character strings represented by the regular expression. Information processing device.

An information processing program for causing a computer to function as each unit of the information processing apparatus according to claim 1.