JP7279381B2

JP7279381B2 - Information processing device and information processing program

Info

Publication number: JP7279381B2
Application number: JP2019016251A
Authority: JP
Inventors: クリシュナンラガワ
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2019-01-31
Filing date: 2019-01-31
Publication date: 2023-05-23
Anticipated expiration: 2039-01-31
Also published as: JP2020123272A

Description

本発明は、情報処理装置及び情報処理プログラムに関する。 The present invention relates to an information processing device and an information processing program.

特許文献１には、情報を複数保持する情報保持手段中から所定の情報を検索する情報検索装置において、文書画像入力手段と、文字認識手段と、文字認識の確信度から正規化レベルを決定する手段と、前記正規化レベルに基づき代表文字に置き換える正規化手段と、前記情報保持手段中から類似したテキストを含む情報を検索する検索手段と、前記検索結果を出力する検索結果出力手段とを備えたことを特徴とする情報検索装置が開示されている。 In Patent Document 1, in an information retrieval device for retrieving predetermined information from information holding means holding a plurality of information, a document image input means, a character recognition means, and a normalization level is determined from the certainty of character recognition. normalization means for replacing representative characters based on the normalization level; search means for searching information containing similar text from the information holding means; and search result output means for outputting the search results. An information retrieval device characterized by the following is disclosed.

特許文献２には、文字列から所与の辞書に含まれる特定用語を検索する情報処理装置であって、ｎ－ｇｒａｍ方式を利用して前記所与の辞書に対して前記文字列の部分文字列の検索を行い、前記所与の辞書から該文字列に含まれる１以上の特定用語候補を抽出する抽出手段と、前記１以上の特定用語候補の各々に対して、ＤＰマッチング方式を利用して特定用語候補と該特定用語候補に対応する前記文字列内の部分文字列との編集距離を導出する導出手段と、前記編集距離が所定の閾値より短い前記１以上の特定用語候補に対応する前記文字列内の部分文字列を出力する出力手段と、を有することを特徴とする情報処理装置が開示されている。 Patent Document 2 discloses an information processing device for searching a character string for a specific term contained in a given dictionary, wherein partial characters of the character string are searched for the given dictionary using the n-gram method. An extracting means for performing a string search and extracting one or more specific term candidates included in the character string from the given dictionary, and using a DP matching method for each of the one or more specific term candidates. derivation means for deriving an edit distance between a specific term candidate and a substring in the character string corresponding to the specific term candidate, and the edit distance corresponding to the one or more specific term candidates shorter than a predetermined threshold and output means for outputting a partial character string in the character string.

特開２００７－３２３４１５号公報JP 2007-323415 A 特開２０１８－８１４５１号公報JP 2018-81451 A

本発明は、予め定めた参照リストを用いずに、文字認識処理によって画像から認識された文字列から画像に含まれる文字列を特定する場合と比較して、精度良く画像に含まれる文字列を特定することができる情報処理装置及び情報処理プログラムを提供することを目的とする。 According to the present invention, a character string included in an image can be identified with high accuracy as compared with the case of identifying the character string included in the image from the character string recognized from the image by character recognition processing without using a predetermined reference list. An object of the present invention is to provide an information processing device and an information processing program that can be specified.

第１態様に係る情報処理装置は、文字認識処理によって画像から認識された文字列を取得する取得部と、前記取得部で取得された文字列から、正規表現で表された抽出パターンを用いて文字列候補を抽出する抽出部と、前記文字認識処理が誤認識する誤認識パターンを用いて前記文字列候補を修正する修正部と、予め定めた参照リストを用いて、前記修正部で修正された文字列候補の中から特定した文字列候補を出力する出力部と、を備える。 An information processing apparatus according to a first aspect includes an acquisition unit that acquires a character string recognized from an image by character recognition processing; An extraction unit for extracting a character string candidate, a correction unit for correcting the character string candidate using an erroneous recognition pattern recognized by the character recognition process, and a predetermined reference list that is corrected by the correction unit. an output unit for outputting a character string candidate specified from among the character string candidates obtained.

第２態様に係る情報処理装置は、第１態様に係る情報処理装置において、前記抽出部は、前記誤認識パターンを用いて前記正規表現を修正した修正正規表現で表された修正抽出パターンを用いて前記文字列候補を抽出する。 An information processing apparatus according to a second aspect is the information processing apparatus according to the first aspect, wherein the extraction unit uses a modified extraction pattern represented by a modified regular expression obtained by modifying the regular expression using the misrecognition pattern. to extract the character string candidate.

第３態様に係る情報処理装置は、第１態様又は第２態様に係る情報処理装置において、前記取得部は前記画像を取得し、前記修正部は、特定の文字を対象として前記画像の文字認識処理を行う文字認識処理部と、前記文字認識処理部により認識された文字列と同型の文字列候補である誤認識文字列候補を生成する生成部と、を含み、前記出力部は、前記誤認識文字列候補の中から特定した文字列候補を出力する。 An information processing apparatus according to a third aspect is the information processing apparatus according to the first aspect or the second aspect, wherein the acquisition unit acquires the image, and the correction unit performs character recognition of the image on a specific character. a character recognition processing unit that performs processing; and a generation unit that generates misrecognized character string candidates that are character string candidates of the same type as the character string recognized by the character recognition processing unit. Output the character string candidates specified from among the recognized character string candidates.

第４態様に係る情報処理装置は、第３態様に係る情報処理装置において、前記生成部は、ユーザーによって指定された誤認識パターンを用いて生成した文字列を前記誤認識文字列候補に追加する。 An information processing apparatus according to a fourth aspect is the information processing apparatus according to the third aspect, wherein the generation unit adds a character string generated using an erroneous recognition pattern specified by a user to the erroneously recognized character string candidates. .

第５態様に係る情報処理装置は、第３態様又は第４態様に係る情報処理装置において、前記生成部は、前記抽出パターンで表される文字列候補を除いて前記誤認識文字列候補を生成する。 An information processing apparatus according to a fifth aspect is the information processing apparatus according to the third aspect or the fourth aspect, wherein the generating unit generates the misrecognized character string candidates by excluding the character string candidates represented by the extraction patterns. do.

第６態様に係る情報処理装置は、第３～第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部は、文字認識の精度を表すスコアを算出し、前記出力部は、前記スコアが予め定めた閾値以上の場合に、前記特定した文字列候補を出力する。 An information processing device according to a sixth aspect is the information processing device according to any one of the third to fifth aspects, wherein the character recognition processing unit calculates a score representing accuracy of character recognition, and the output unit and outputting the identified character string candidate when the score is equal to or greater than a predetermined threshold.

第７態様に係る情報処理装置は、第３～第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部は、文字認識の精度を表すスコアを算出し、前記出力部は、前記スコアが予め定めた閾値未満の場合に、前記特定した文字列候補を推定文字列候補として出力する。 An information processing device according to a seventh aspect is the information processing device according to any one of the third to fifth aspects, wherein the character recognition processing unit calculates a score representing accuracy of character recognition, and the output unit and outputting the specified character string candidate as an estimated character string candidate when the score is less than a predetermined threshold.

第８態様に係る情報処理装置は、第３～第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部は、文字認識の精度を表すスコアを算出し、前記出力部は、前記スコアが予め定めた閾値以上で且つ前記特定した文字列候補が複数の場合に、複数の前記特定した文字列候補を推定文字列候補として出力する。 An information processing device according to an eighth aspect is the information processing device according to any one of the third to fifth aspects, wherein the character recognition processing unit calculates a score representing accuracy of character recognition, and the output unit and outputting the plurality of specified character string candidates as estimated character string candidates when the score is equal to or greater than a predetermined threshold value and the number of the specified character string candidates is plural.

第９態様に係る情報処理装置は、第３～第５態様の何れかの態様に係る情報処理装置において、前記文字認識処理部により文字認識された文字列から必要な文字列である必要文字列を抽出する必要文字列抽出部を更に備え、前記生成部は、前記必要文字列抽出部により抽出された前記必要文字列と同型の文字列候補である誤認識文字列候補を生成する。 An information processing device according to a ninth aspect is the information processing device according to any one of the third to fifth aspects, wherein a required character string is a necessary character string from character strings recognized by the character recognition processing unit. and the generation unit generates a misrecognized character string candidate, which is a character string candidate of the same type as the necessary character string extracted by the necessary character string extraction unit.

第１０態様に係る情報処理装置は、第１～第５態様の何れかの態様に係る情報処理装置において、前記出力部は、前記参照リストに前記修正部で修正された文字列候補が存在しない場合、前記参照リストに含まれる文字列のうち、前記修正部で修正された文字列候補に近い文字列を推定文字列候補として出力する。 The information processing apparatus according to a tenth aspect is the information processing apparatus according to any one of the first to fifth aspects, wherein the output unit includes a character string candidate corrected by the correction unit that does not exist in the reference list. In this case, out of the character strings included in the reference list, character strings close to the character string candidates corrected by the correction unit are output as estimated character string candidates.

第１１態様に係る情報処理装置は、第１０態様に係る情報処理装置において、前記出力部は、前記参照リストに含まれる文字列のうち、前記修正部で修正された文字列候補と異なる文字の数が最も少ない文字列を推定文字列候補として出力する。 An information processing apparatus according to an eleventh aspect is the information processing apparatus according to the tenth aspect, wherein the output unit generates a character string that is different from the character string candidate corrected by the correction unit, out of the character strings included in the reference list. Output the character string with the smallest number as the estimated character string candidate.

第１２態様に係る情報処理プログラムは、コンピュータを、第１～第１１態様の何れか１つの態様に記載の情報処理装置の各部として機能させるための情報処理プログラムである。 An information processing program according to a twelfth aspect is an information processing program for causing a computer to function as each part of the information processing apparatus according to any one of the first to eleventh aspects.

第１及び第１２態様によれば、予め定めた参照リストを用いずに、文字認識処理によって画像から認識された文字列から画像に含まれる文字列を特定する場合と比較して、精度良く画像に含まれる文字列を特定することができる、という効果を有する。 According to the first and twelfth aspects, compared to the case of specifying a character string included in an image from a character string recognized from the image by character recognition processing without using a predetermined reference list, the image can be obtained with high accuracy. has the effect of being able to identify the character string contained in .

第２態様によれば、誤認識パターンを用いて正規表現を修正した修正正規表現で表された修正抽出パターンを用いずに文字列候補を抽出する場合と比較して、本来抽出すべき文字列の一部の文字が誤認識された文字列も抽出することができる、という効果を有する。 According to the second aspect, compared to the case of extracting a character string candidate without using a modified extraction pattern represented by a modified regular expression obtained by modifying a regular expression using an erroneously recognized pattern, the character string that should be extracted It is possible to extract a character string in which some of the characters are erroneously recognized.

第３態様によれば、誤認識文字列候補を生成しない場合と比較して、精度良く画像に含まれる文字列を特定することができる、という効果を有する。 According to the third aspect, there is an effect that a character string included in an image can be specified with high accuracy as compared with the case where no erroneously recognized character string candidate is generated.

第４態様によれば、ユーザーによって指定された誤認識パターンを用いずに誤認識文字列候補を生成する場合と比較して、精度良く画像に含まれる文字列を特定することができる、という効果を有する。 According to the fourth aspect, it is possible to specify a character string included in an image with high accuracy as compared with the case of generating incorrectly recognized character string candidates without using an incorrectly recognized pattern specified by the user. have

第５態様によれば、抽出パターンで表される文字列候補を除かずに誤認識文字列候補を生成する場合と比較して、画像に含まれる文字列を特定するまでの時間を短縮することができる、という効果を有する。 According to the fifth aspect, it is possible to shorten the time required to specify a character string included in an image, compared to the case where misrecognized character string candidates are generated without excluding character string candidates represented by extraction patterns. has the effect of being able to

第６態様によれば、文字認識の精度を表すスコアに関係なく特定した文字列候補を出力する場合と比較して、特定した文字列候補が出力されたことについての信頼性が高まる、という効果を有する。 According to the sixth aspect, there is an effect that the reliability of output of the specified character string candidate is increased compared to the case where the specified character string candidate is output regardless of the score representing the accuracy of character recognition. have

第７態様によれば、文字認識の精度を表すスコアに関係なく特定した文字列候補を推定文字列候補として出力する場合と比較して、推定文字列候補として出力されたことについての信頼性が高まる、という効果を有する。 According to the seventh aspect, the reliability of being output as an estimated character string candidate is higher than when the specified character string candidate is output as an estimated character string candidate regardless of the score representing the character recognition accuracy. It has the effect of increasing

第８態様によれば、文字認識の精度を表すスコアに関係なく複数の特定した文字列候補を推定文字列候補として出力する場合と比較して、複数の推定文字列候補として出力されたことについての信頼性が高まる、という効果を有する。 According to the eighth aspect, output as a plurality of estimated character string candidates compared to outputting a plurality of specified character string candidates as estimated character string candidates irrespective of the score representing character recognition accuracy has the effect of increasing the reliability of

第９態様によれば、文字認識された文字列から必要文字列を抽出せずに誤認識文字列候補を生成する場合と比較して、適切に誤認識文字列候補を生成することができる、という効果を有する。 According to the ninth aspect, it is possible to appropriately generate misrecognized character string candidates compared to generating misrecognized character string candidates without extracting necessary character strings from character-recognized character strings. has the effect of

第１０態様によれば、参照リストに文字列候補が存在しない場合に何も出力しない場合と比較して、画像に含まれる可能性のある文字列を把握することができる、という効果を有する。 According to the tenth aspect, there is an effect that it is possible to grasp the character strings that may be included in the image, compared to the case where nothing is output when no character string candidate exists in the reference list.

第１１態様によれば、参照リストに含まれる文字列と文字列候補との異なる文字の数に関係なく推定文字列候補を特定する場合と比較して、推定文字列候補の信頼性が高まる、という効果を有する。 According to the eleventh aspect, the reliability of the estimated character string candidate is increased compared to the case of specifying the estimated character string candidate regardless of the number of different characters between the character string and the character string candidate included in the reference list. has the effect of

画像形成装置の構成図である。1 is a configuration diagram of an image forming apparatus; FIG. 第１実施形態に係るＣＰＵの機能ブロック図である。3 is a functional block diagram of a CPU according to the first embodiment; FIG. 第１実施形態に係る情報処理のフローチャートである。4 is a flowchart of information processing according to the first embodiment; 図面を読み取った画像の一例を示す図である。It is a figure which shows an example of the image which read drawing. 再認識処理の対象となる領域について説明するための図である。FIG. 10 is a diagram for explaining a region to be re-recognized; 誤認識文字情報の一例を示す図である。It is a figure which shows an example of misrecognition character information. 誤認識パターン情報の一例を示す図である。It is a figure which shows an example of misrecognition pattern information. 参照リストの一例を示す図である。FIG. 10 is a diagram showing an example of a reference list; FIG. 参照リストの一例を示す図である。FIG. 10 is a diagram showing an example of a reference list; FIG. 参照リストの一例を示す図である。FIG. 10 is a diagram showing an example of a reference list; FIG. 参照リストの一例を示す図である。FIG. 10 is a diagram showing an example of a reference list; FIG. 第２実施形態に係るＣＰＵの機能ブロック図である。FIG. 7 is a functional block diagram of a CPU according to the second embodiment; FIG. 第２実施形態に係る情報処理のフローチャートである。9 is a flowchart of information processing according to the second embodiment; 参照リストの一例を示す図である。FIG. 10 is a diagram showing an example of a reference list; FIG. 参照リストの一例を示す図である。FIG. 10 is a diagram showing an example of a reference list; FIG.

以下、図面を参照して、本発明を実施するための形態例を詳細に説明する。 Embodiments for carrying out the present invention will be described in detail below with reference to the drawings.

（第１実施形態） (First embodiment)

図１は、本実施形態に係る画像形成装置１の構成図である。図１に示すように、画像形成装置１は、スキャナ部２、画像形成部３、操作部４、表示部５、通信部６、記憶部７、及び情報処理装置１０を備える。なお、図１では、用紙等の記録媒体を供給する供給機構、記録媒体を搬送する搬送機構、記録媒体を排出する排出機構等の図示を省略している。 FIG. 1 is a configuration diagram of an image forming apparatus 1 according to this embodiment. As shown in FIG. 1, the image forming apparatus 1 includes a scanner section 2, an image forming section 3, an operation section 4, a display section 5, a communication section 6, a storage section 7, and an information processing device . Note that FIG. 1 omits illustration of a supply mechanism for supplying a recording medium such as paper, a conveying mechanism for conveying the recording medium, an ejection mechanism for ejecting the recording medium, and the like.

画像形成装置１は、原稿を読み取るスキャン機能、原稿を複写するコピー機能、印刷データを印刷する印刷機能、原稿をファクシミリ送信するファクシミリ送信機能等の複数の機能を有する複合機として機能する。 The image forming apparatus 1 functions as a multifunction device having multiple functions such as a scanning function for reading an original, a copying function for copying an original, a printing function for printing print data, and a facsimile transmission function for facsimile transmission of an original.

スキャナ部２は、画像が形成された紙等の記録媒体を光学的に読み取って読み取り画像を生成し、情報処理装置１０に出力する。 The scanner unit 2 optically reads a recording medium such as paper on which an image is formed, generates a read image, and outputs the read image to the information processing apparatus 10 .

画像形成部３は、例えば電子写真方式又はインクジェット記録方式等により紙等の記録媒体に画像を形成する機能を有し、情報処理装置１０からの指示により画像を記録媒体に形成する。 The image forming unit 3 has a function of forming an image on a recording medium such as paper by, for example, an electrophotographic method or an inkjet recording method, and forms an image on the recording medium according to instructions from the information processing device 10 .

操作部４は、各種操作を受け付ける操作キーを含んで構成される。 The operation unit 4 includes operation keys for receiving various operations.

表示部５は、液晶ディスプレイ等で構成され、例えば画面に指等で触れることで各種操作を受け付けるタッチパネルで構成される。 The display unit 5 is composed of a liquid crystal display or the like, and is composed of, for example, a touch panel that accepts various operations by touching the screen with a finger or the like.

通信部６は、外部装置等とデータ通信を行うためのインターフェースである。 The communication unit 6 is an interface for performing data communication with an external device or the like.

記憶部７は、ハードディスク等の不揮発性の記憶装置で構成され、後述する情報処理プログラム等を記憶する。 The storage unit 7 is configured by a non-volatile storage device such as a hard disk, and stores information processing programs and the like, which will be described later.

情報処理装置１０は、ＣＰＵ（Central Processing Unit）１０Ａ、ＲＯＭ（Read Only Memory）１０Ｂ、ＲＡＭ（Random Access Memory）１０Ｃ、不揮発性メモリ１０Ｄ、及び入出力インターフェース（Ｉ／Ｏ）１０Ｅを備える。そして、ＣＰＵ１０Ａ、ＲＯＭ１０Ｂ、ＲＡＭ１０Ｃ、不揮発性メモリ１０Ｄ、及びＩ／Ｏ１０Ｅがバス１０Ｆを介して各々接続されている。Ｉ／Ｏ１０Ｅには、スキャナ部２、画像形成部３、操作部４、表示部５、通信部６、記憶部７が接続されている。ＣＰＵ１０Ａは、記憶部７に記憶された情報処理プログラムを読み込んで実行する。 The information processing apparatus 10 includes a CPU (Central Processing Unit) 10A, a ROM (Read Only Memory) 10B, a RAM (Random Access Memory) 10C, a nonvolatile memory 10D, and an input/output interface (I/O) 10E. A CPU 10A, a ROM 10B, a RAM 10C, a nonvolatile memory 10D, and an I/O 10E are connected via a bus 10F. The scanner section 2, the image forming section 3, the operation section 4, the display section 5, the communication section 6, and the storage section 7 are connected to the I/O 10E. CPU 10A reads and executes an information processing program stored in storage unit 7 .

画像形成装置１は、前述したコピー機能等の複合機の基本機能の他に、図面情報抽出機能を有する。図面情報抽出機能は、製造業及び設備・建設業等で使用する図面を読み取り、読み取った図面の画像から、図面に記載されている指定の文字列を図面情報として抽出する機能である。図面に記載されている指定の文字列には、例えば品番、品名、及び版等を含む基本属性、図面に表された製品のスペックを表すスペック情報、材料情報、設計標準書番号等の各種情報を表す文字列が含まれる。図面から抽出した図面情報は、紙等の記録媒体に印刷したり、データファイルとして記憶部７に記憶したり、通信部６を介して外部装置に送信したり、表示部５に表示したりする。 The image forming apparatus 1 has a drawing information extracting function in addition to the basic functions of a multifunction machine such as the above-described copying function. The drawing information extraction function is a function that reads drawings used in the manufacturing industry, equipment/construction industry, etc., and extracts specified character strings described in the drawings from the read drawing images as drawing information. Designated character strings described in drawings include basic attributes including product number, product name, plate, etc., various information such as specification information representing specifications of the product shown in the drawing, material information, design standard number, etc. Contains a string representing The drawing information extracted from the drawing is printed on a recording medium such as paper, stored as a data file in the storage unit 7, transmitted to an external device via the communication unit 6, or displayed on the display unit 5. .

図面情報抽出機能が利用されるケースとしては以下のようなケースが挙げられる。例えば図面に記載された文字列が或る技術標準に基づいて記載されている場合において、技術標準が更新されて図面に記載された文字列を修正する必要がある場合に、旧技術標準で記載された文字列を含む図面を探す場合等である。 Examples of cases where the drawing information extraction function is used include the following cases. For example, if the character strings described in the drawings are based on a certain technical standard, and the technical standards are updated and it is necessary to correct the character strings described in the drawings, it is described in the old technical standard For example, when searching for a drawing containing a specified character string.

以下、画像形成装置１が図面情報抽出機能を実行する場合におけるＣＰＵ１０Ａの機能構成について説明する。 The functional configuration of the CPU 10A when the image forming apparatus 1 executes the drawing information extraction function will be described below.

図２に示すように、ＣＰＵ１０Ａは、機能的には、ＯＣＲ処理部２０、取得部２２、抽出部２４、修正部２６、及び出力部２８を備える。 As shown in FIG. 2, the CPU 10A functionally includes an OCR processing section 20, an acquisition section 22, an extraction section 24, a correction section 26, and an output section .

ＯＣＲ処理部２０は、スキャナ部２が原稿から読み取った画像に対して文字認識処理、すなわちＯＣＲ（Optical Character Recognition）処理を実行し、読み取った画像に含まれる文字列を取得部２２に出力する。なお、ＯＣＲ処理部２０が実行するＯＣＲ処理は、一例としてルールベースのＯＣＲ処理である。すなわち、予め定めた規則に基づいて、読み取った画像に含まれる文字列を全て抽出する。 The OCR processing unit 20 performs character recognition processing, that is, OCR (Optical Character Recognition) processing on the image read from the document by the scanner unit 2 , and outputs character strings included in the read image to the acquisition unit 22 . Note that the OCR processing executed by the OCR processing unit 20 is, for example, rule-based OCR processing. That is, all character strings included in the read image are extracted based on a predetermined rule.

取得部２２は、ＯＣＲ処理部２０のＯＣＲ処理によって画像から認識された文字列を取得する。取得部２２は、取得した文字列を修正部２６へ出力する。 The acquisition unit 22 acquires character strings recognized from the image by the OCR processing of the OCR processing unit 20 . Acquisition unit 22 outputs the acquired character string to correction unit 26 .

抽出部２４は、取得部２２で取得された文字列から、正規表現で表された抽出パターンを用いて文字列候補を抽出する。なお、ＯＣＲ処理部２０によるＯＣＲ処理によって認識された文字列には、誤認識された文字が含まれる場合が多い。このため、抽出部２４は、ＯＣＲ処理部２０のＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現で表された抽出パターンを用いて文字列候補を抽出する。 The extraction unit 24 extracts character string candidates from the character strings acquired by the acquisition unit 22 using extraction patterns represented by regular expressions. Character strings recognized by OCR processing by the OCR processing unit 20 often include erroneously recognized characters. Therefore, the extraction unit 24 extracts the character string candidate using the extraction pattern represented by the modified regular expression corrected using the erroneous recognition pattern that is erroneously recognized in the OCR processing of the OCR processing unit 20 .

修正部２６は、ＯＣＲ処理が誤認識する誤認識パターンを用いて、抽出部２４が抽出した文字列候補を修正する。 The correcting unit 26 corrects the character string candidate extracted by the extracting unit 24 by using an erroneously recognized pattern that is erroneously recognized by the OCR process.

具体的には、修正部２６は、再認識処理部３０及び生成部３２を備える。 Specifically, the correction unit 26 includes a re-recognition processing unit 30 and a generation unit 32 .

再認識処理部３０は、特定の文字を対象として画像のＯＣＲ処理を再度行う。再認識処理部３０は、例えばニューラルネットワーク等の機械学習を用いて画像のＯＣＲ処理を実行する。なお、再認識処理部３０は、文字認識処理部の一例である。 The re-recognition processing unit 30 re-performs the OCR processing of the image for a specific character. The re-recognition processing unit 30 executes OCR processing of the image using machine learning such as a neural network. Note that the re-recognition processing unit 30 is an example of a character recognition processing unit.

生成部３２は、再認識処理部３０により認識された文字列候補と同型の文字列候補である誤認識文字列候補を生成する。 The generation unit 32 generates misrecognized character string candidates that are character string candidates of the same type as the character string candidates recognized by the re-recognition processing unit 30 .

ここで、誤認識文字列候補とは、再認識処理部３０により認識された文字列候補であって、認識された文字と同型であり認識が難しい文字列候補、又は、再認識処理部３０により認識された文字列であって、認識された文字が誤っている可能性の高い文字列候補である。なお、以下では、再認識処理部３０により認識された文字列も誤認識文字列候補に含まれるものとする。 Here, the erroneously recognized character string candidate is a character string candidate recognized by the re-recognition processing unit 30, which has the same type as the recognized character and is difficult to recognize, or a character string candidate that is difficult to recognize by the re-recognition processing unit 30. It is a recognized character string, and it is a character string candidate with a high possibility that the recognized characters are incorrect. In the following, it is assumed that the character string recognized by the re-recognition processing unit 30 is also included in the erroneously recognized character string candidates.

出力部２８は、予め定めた参照リストを用いて、修正部２６で修正された文字列候補、すなわち生成部３２で生成された誤認識文字列候補の中から特定した文字列候補を出力する。 The output unit 28 uses a predetermined reference list to output character string candidates specified from the character string candidates corrected by the correction unit 26, that is, the incorrectly recognized character string candidates generated by the generation unit 32.

次に、図３を参照して、本実施の形態に係る情報処理装置１０の作用を説明する。図２に示すように、情報処理プログラム４０は記憶部７に記憶されている。ＣＰＵ１０Ａが情報処理プログラム４０を読み出して実行することにより、図３に示す情報処理が実行される。なお、図３に示す情報処理は、例えば、ユーザーがスキャナ部２に図面をセットし、操作部４を操作して図面情報抽出処理の実行を指示した場合に実行される。 Next, operation of the information processing apparatus 10 according to the present embodiment will be described with reference to FIG. As shown in FIG. 2, the information processing program 40 is stored in the storage section 7 . The information processing shown in FIG. 3 is executed by reading and executing the information processing program 40 by the CPU 10A. The information processing shown in FIG. 3 is executed, for example, when the user sets a drawing on the scanner unit 2 and operates the operation unit 4 to instruct execution of drawing information extraction processing.

ステップＳ１００では、ＣＰＵ１０Ａが、スキャナ部２に図面の読み取りを指示し、スキャナ部２が読み取った図面の画像を取得する。 In step S100, the CPU 10A instructs the scanner unit 2 to read the drawing, and acquires the image of the drawing read by the scanner unit 2. FIG.

ステップＳ１０２では、ＣＰＵ１０Ａが、ステップＳ１００で取得した画像に対してルールベースのＯＣＲ処理を実行する。これにより、読み取った画像に含まれる全ての文字列が取得される。また、読み取った画像に含まれる全ての文字列の位置、すなわち画像の中の座標も取得される。ＯＣＲ処理により認識された文字列は、例えばＸＭＬ形式の文字列として得られるが、文字列の形式はＸＭＬ形式に限られるものではない。 In step S102, CPU 10A executes rule-based OCR processing on the image acquired in step S100. As a result, all character strings included in the read image are obtained. Also, the positions of all character strings contained in the read image, that is, the coordinates within the image are acquired. A character string recognized by OCR processing is obtained as a character string in XML format, for example, but the format of the character string is not limited to XML format.

ステップＳ１０４では、ＣＰＵ１０Ａが、ステップＳ１０２のＯＣＲ処理によって得られた文字列から、ステップＳ１００で取得した画像に含まれる文字列候補を抽出する。具体的には、正規表現で表された抽出パターンを用いて画像に含まれる文字列候補を抽出する。本実施形態では、例えば図２に示すように、正規表現が定義された正規表現情報４２が予め記憶部７に記憶されており、この正規表現情報４２で定義された正規表現で表される抽出パターンを用いて文字列候補を抽出する。なお、正規表現情報４２を予め記憶部７に記憶しておく場合に限らず、通信部６を介して外部装置から正規表現情報４２を取得してもよい。 In step S104, CPU 10A extracts character string candidates included in the image acquired in step S100 from the character strings obtained by the OCR processing in step S102. Specifically, character string candidates included in the image are extracted using extraction patterns represented by regular expressions. In this embodiment, for example, as shown in FIG. 2, regular expression information 42 in which regular expressions are defined is stored in the storage unit 7 in advance. Use patterns to extract string candidates. It should be noted that the regular expression information 42 may be obtained from an external device via the communication section 6 without being limited to storing the regular expression information 42 in advance in the storage section 7 .

ここで、正規表現とは、文字列の集合を一つの文字列で表現する方法である。そして、正規表現は、図面情報の抽出対象となる図面に含まれ得る文字列に対応したものである。すなわち、記憶部７に予め記憶された正規表現情報４２には、図面情報の抽出対象となる図面に含まれ得る文字列に対応した正規表現が定義されている。 Here, a regular expression is a method of expressing a set of character strings with one character string. A regular expression corresponds to a character string that can be included in a drawing from which drawing information is to be extracted. That is, the regular expression information 42 pre-stored in the storage unit 7 defines regular expressions corresponding to character strings that can be included in the drawing from which the drawing information is to be extracted.

例えば、図面情報の抽出対象となる図面に含まれ得る文字列が４個の文字から構成され、先頭の文字が“Ａ”、２番目の文字が“Ｂ”、３番目の文字が“Ｃ”、４番目の文字が“０”～“９”の１０種類の数字の何れかの文字である文字列であるとする。この場合、図面情報の抽出対象となる図面に含まれ得る文字列の正規表現は、“［Ａ］［Ｂ］［Ｃ］［０－９］”という一つの文字列で表される。この場合、正規表現情報４２には、正規表現として“［Ａ］［Ｂ］［Ｃ］［０－９］”が定義される。なお、正規表現“［Ａ］［Ｂ］［Ｃ］［０－９］”で表された抽出パターンの文字列は、“ＡＢＣ０”～“ＡＢＣ９”までの１０パターンとなる。 For example, a character string that can be included in a drawing from which drawing information is to be extracted consists of four characters: the first character is "A", the second character is "B", and the third character is "C". , the fourth character is a character string of any one of 10 types of numerals "0" to "9". In this case, a regular expression of character strings that can be included in a drawing from which drawing information is to be extracted is represented by a single character string “[A][B][C][0-9]”. In this case, the regular expression information 42 defines "[A][B][C][0-9]" as a regular expression. Note that the character string of the extraction pattern represented by the regular expression “[A][B][C][0-9]” is 10 patterns from “ABC0” to “ABC9”.

ところで、ステップＳ１０２のＯＣＲ処理によって認識された文字列には、誤認識された文字が含まれる場合が多い。このため、上記の例において、正規表現“［Ａ］［Ｂ］［Ｃ］［０－９］”で表される抽出パターンを用いて文字列候補を抽出したのでは、本来抽出すべき文字列の一部の文字が誤認識された文字列が抽出されなくなってしまう。このため、ステップＳ１０４では、ＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現で表された抽出パターンを用いて文字列候補を抽出する。すなわち、正規表現情報４２には、ＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現を定義しておく。 By the way, the character string recognized by the OCR processing in step S102 often includes erroneously recognized characters. Therefore, in the above example, if the character string candidate is extracted using the extraction pattern represented by the regular expression "[A][B][C][0-9]", the character string that should be extracted A character string in which some characters are incorrectly recognized will not be extracted. Therefore, in step S104, a character string candidate is extracted using an extraction pattern represented by a modified regular expression corrected using an erroneous recognition pattern that is erroneously recognized in OCR processing. That is, in the regular expression information 42, a corrected regular expression corrected using an erroneous recognition pattern that is erroneously recognized in OCR processing is defined.

例えば、上記の例の場合において、“Ａ”は、例えば“Ｈ”と誤認識されやすい。また、“Ｂ”は、例えば“８”又は“日”と誤認識されやすい。また、“Ｃ”は、“［”又は“（”と誤認識されやすい。また、“０”は、“ｏ”又は“Ｏ”と誤認識されやすい。この場合、ＯＣＲ処理で誤認識する誤認識パターンを用いて修正された修正正規表現は、“［ＡＨ］［Ｂ８日］［Ｃ［（］［０ｏＯ－９］”となる。このような修正正規表現で表される抽出パターンを用いて文字列候補を抽出することにより、本来抽出すべき文字列の一部の文字が誤認識された文字列も抽出される。 For example, in the case of the above example, "A" is likely to be erroneously recognized as "H", for example. Also, "B" is likely to be erroneously recognized as, for example, "8" or "day". "C" is likely to be erroneously recognized as "[" or "(". Also, "0" is likely to be erroneously recognized as "o" or "O." A modified regular expression modified using the recognition pattern is "[AH] [B8 days] [C [(] [0oO-9]". By extracting the character string candidates, a character string in which some of the characters of the character string that should be extracted is erroneously recognized is also extracted.

例えばステップＳ１００で得られた画像が図４に示す画像５０であったとする。画像５０には、文字列“ＡＢＣ０”を表す画像、文字列“ＡＢＣ８”を表す画像、文字列“ＸＹＺ１”を表す画像が含まれている。 For example, assume that the image obtained in step S100 is the image 50 shown in FIG. The image 50 includes an image representing the character string "ABC0", an image representing the character string "ABC8", and an image representing the character string "XYZ1".

この場合、ステップＳ１０２のＯＣＲ処理では、画像５０に含まれる全ての文字列が認識されるので、文字列“ＡＢＣ０”、文字列“ＡＢＣ８”、文字列“ＸＹＺ１”の全てが何らかの文字列として全て認識される。そして、ステップＳ１０２のＯＣＲ処理では少なくとも一部の文字が誤認識される場合があるので、全ての文字列を正しく認識するとは限らない。例えば、文字列“ＡＢＣ０”については、例えば“ＡＢＣｏ”又は“ＡＢＣＯ”と誤認識される場合がある。また、文字列“ＡＢＣ８”については、“ＡＢＣＢ”と誤認識される場合がある。これに対して、ステップＳ１０４では、修正正規表現で表された抽出パターンを用いて文字列候補を抽出するので、例えば“ＡＢＣｏ”又は“ＡＢＣＯ”と誤認識されても文字列候補として抽出される。 In this case, in the OCR processing in step S102, all character strings included in the image 50 are recognized, so all of the character strings "ABC0", "ABC8", and "XYZ1" are all recognized as some character strings. Recognized. Since at least some characters may be erroneously recognized in the OCR processing in step S102, not all character strings are necessarily recognized correctly. For example, the character string "ABC0" may be erroneously recognized as "ABCo" or "ABCO". Also, the character string "ABC8" may be erroneously recognized as "ABCB". On the other hand, in step S104, character string candidates are extracted using the extraction pattern represented by the modified regular expression. .

ステップＳ１０６では、ＣＰＵ１０Ａが、ステップＳ１０４で抽出された文字列候補について学習モデルを用いたＯＣＲ処理（以下、再認識処理と称する）を実行する。すなわち、ステップＳ１００で取得した画像全体について再認識処理を実行するのではなく、ステップＳ１０４で抽出された文字列候補についてのみ再認識処理を実行する。なお、再認識処理を実行する対象の領域は、ステップＳ１０２のルールベースのＯＣＲ処理で得られた文字列候補の位置に基づいて設定すればよい。 In step S106, CPU 10A executes OCR processing (hereinafter referred to as re-recognition processing) using a learning model for the character string candidates extracted in step S104. That is, the re-recognition process is performed only for the character string candidates extracted in step S104, instead of performing the re-recognition process on the entire image acquired in step S100. Note that the target area for re-recognition processing may be set based on the positions of the character string candidates obtained by the rule-based OCR processing in step S102.

例えば図５に示すように、画像５０全体について再認識処理を実行するのではなく、文字列候補“ＡＢＣ０”を含む領域Ｒ１と、文字列候補“ＡＢＣ８”を含む領域Ｒ２と、について再認識処理を実行する。 For example, as shown in FIG. 5, the re-recognition process is not performed on the entire image 50, but the re-recognition process is performed on a region R1 containing the character string candidate "ABC0" and a region R2 containing the character string candidate "ABC8". to run.

なお、学習モデルを用いたＯＣＲ処理としては、本実施形態では一例として畳み込みニューラルネットワーク（ＣＮＮ：Convolutional Neural Network）等の機械学習を用いたＯＣＲ処理が挙げられるが、これに限られるものではない。 In this embodiment, an example of OCR processing using a learning model is OCR processing using machine learning such as a convolutional neural network (CNN), but is not limited to this.

学習モデルは、特定の文字について学習されている。ここで、特定の文字とは、図面情報抽出機能を実行する対象の図面に含まれ得る文字であり、例えば、アルファベット、数字、及び記号等である。 The learning model is trained on specific characters. Here, the specific characters are characters that can be included in the target drawing for which the drawing information extraction function is to be executed, such as alphabets, numbers, and symbols.

このように、特定の文字について学習された学習モデルを用いたＯＣＲ処理は、ステップＳ１０２で実行されるルールベースのＯＣＲ処理と比較して文字列の認識率は高くなる。 Thus, OCR processing using a learning model trained on specific characters has a higher character string recognition rate than rule-based OCR processing executed in step S102.

例えば画像５０に含まれる文字列“ＡＢＣ０”が、ステップＳ１０２のＯＣＲ処理で“ＡＢＣｏ”と誤認識され、文字列“ＡＢＣ８”が“ＡＢＣＢ”と誤認識されたとする。このような場合でも、ステップＳ１０６の再認識処理で図５に示す領域Ｒ１、Ｒ２について再認識処理を実行した結果、“ＡＢＣ０”、“ＡＢＣ８”とそれぞれ正しく認識される場合があり得る。 For example, assume that the character string "ABC0" included in the image 50 is erroneously recognized as "ABCo" in the OCR processing in step S102, and the character string "ABC8" is erroneously recognized as "ABCB". Even in such a case, as a result of performing the re-recognition processing on the regions R1 and R2 shown in FIG. 5 in the re-recognition processing in step S106, "ABC0" and "ABC8" may be correctly recognized, respectively.

なお、ステップＳ１０６で実行される再認識処理では、文字認識の精度を表すスコアが算出される。スコアが高いほど文字認識の精度が高く、スコアが低いほど文字認識の精度が低い。例えばスコアは０～１の範囲の値であり、数値が高いほど文字認識の精度が高い。 In the re-recognition process executed in step S106, a score representing the accuracy of character recognition is calculated. The higher the score, the higher the accuracy of character recognition, and the lower the score, the lower the accuracy of character recognition. For example, the score is a value in the range of 0 to 1, and the higher the value, the higher the accuracy of character recognition.

以下のステップＳ１０８以降の処理は、再認識処理で認識された文字列毎に実行される。 The processing after step S108 below is executed for each character string recognized in the re-recognition processing.

ステップＳ１０８では、ＣＰＵ１０Ａが、ステップＳ１０６の再認識処理により認識された文字列と同型の文字列候補である誤認識文字列候補を生成する。 In step S108, CPU 10A generates a misrecognized character string candidate, which is a character string candidate of the same type as the character string recognized by the re-recognition process in step S106.

具体的には、例えば図６に示すように、誤認識文字が定義された誤認識文字情報４４が記憶部７に予め記憶されており、この誤認識文字情報４４を用いて誤認識文字列候補を生成する。誤認識文字情報４４は、ステップＳ１０２のＯＣＲ処理で誤認識する誤認識パターンを定義したものである。図６の例では、“Ａ”に対して誤認識されやすい“Ｈ”が定義されている。また、“Ｂ”に対して誤認識されやすい“８”が定義されている。また、“０”に対して誤認識されやすい“ｏ”及び“Ｏ”が定義されている。 Specifically, as shown in FIG. 6, misrecognized character information 44 in which misrecognized characters are defined is stored in advance in the storage unit 7, and misrecognized character string candidates are identified using this misrecognized character information 44. to generate The erroneously recognized character information 44 defines erroneously recognized patterns to be erroneously recognized in the OCR processing of step S102. In the example of FIG. 6, "H" is defined which is likely to be erroneously recognized as opposed to "A". Also, "8", which is likely to be erroneously recognized with respect to "B", is defined. Also, "o" and "O" which are likely to be erroneously recognized with respect to "0" are defined.

これにより、例えばステップＳ１０６の再認識処理により認識された文字列が“ＡＢＣ０”であった場合、“ＨＢＣ０”、“Ａ８Ｃ０”、“ＡＢＣｏ”、“ＡＢＣＯ”、“Ｈ８Ｃ０”、“Ｈ８Ｃｏ”、“Ｈ８ＣＯ”等が誤認識文字列候補として生成される。 As a result, for example, when the character string recognized by the re-recognition process in step S106 is "ABC0", "HBC0", "A8C0", "ABCo", "ABCO", "H8C0", "H8Co", " H8CO” and the like are generated as erroneously recognized character string candidates.

ステップＳ１１０では、ＣＰＵ１０Ａが、ユーザーによって指定された誤認識パターンを用いて生成した文字列を誤認識文字列候補に追加する。ここで、ユーザーによって指定された誤認識パターンとは、例えば図面に含まれる文字のフォントに応じて設定された誤認識パターンである。例えば“０”に対して誤認識されやすい文字の代表的な例は“ｏ”及び“Ｏ”であるが、図面で使用されている文字のフォントによっては、“ｏ”及び“Ｏ”以外の他の文字と誤認識されやすい場合もある。この場合、例えば図７に示すように、ユーザー指定の誤認識パターンを定義した誤認識パターン情報４６を記憶部７に予め記憶しておき、この誤認識パターン情報４６を用いて生成した文字列を誤認識文字列候補に追加する。 In step S110, CPU 10A adds the character string generated using the misrecognition pattern specified by the user to the misrecognition character string candidates. Here, the erroneous recognition pattern specified by the user is, for example, an erroneous recognition pattern set according to the font of characters included in the drawing. For example, typical examples of characters that are likely to be misrecognized as "0" are "o" and "O", but depending on the font used in the drawing, characters other than "o" and "O" may be used. In some cases, it is easy to be misrecognized as other characters. In this case, for example, as shown in FIG. 7, misrecognition pattern information 46 defining a user-specified misrecognition pattern is stored in advance in the storage unit 7, and a character string generated using this misrecognition pattern information 46 is generated. Add to misrecognized string candidates.

図７の誤認識パターン情報４６の例では、“Ｂ”に対して誤認識されやすい文字として“日”が、“０”に対して誤認識されやすい文字として“Ｄ”が設定されている。例えばステップＳ１０６の再認識処理により認識された文字列が“ＡＢＣ０”であった場合、“Ａ日Ｃ０”、“ＡＢＣＤ”が誤認識文字列候補として追加される。 In the example of the erroneous recognition pattern information 46 in FIG. 7, "day" is set as a character likely to be erroneously recognized for "B", and "D" is set as a character likely to be erroneously recognized for "0". For example, if the character string recognized by the re-recognition process in step S106 is "ABC0", "A day C0" and "ABCD" are added as erroneously recognized character string candidates.

なお、ステップＳ１０８及びステップＳ１１０で生成した誤認識文字列候補のうち、正規表現で表される文字列以外の誤認識文字列候補については、画像に含まれる文字列候補を特定する上で不要である。 Of the erroneously recognized character string candidates generated in steps S108 and S110, the erroneously recognized character string candidates other than character strings represented by regular expressions are unnecessary for specifying character string candidates included in the image. be.

そこで、ステップＳ１１２では、ＣＰＵ１０Ａが、ステップＳ１０８及びステップＳ１１０で生成した誤認識文字列候補のうち、正規表現で表される抽出パターンの文字列以外の誤認識文字列候補を除外する。 Therefore, in step S112, the CPU 10A excludes, from among the erroneously recognized character string candidates generated in steps S108 and S110, erroneously recognized character string candidates other than the character string of the extraction pattern represented by the regular expression.

例えば、上記の例で正規表現が“［Ａ］［Ｂ］［Ｃ］［０－９］”の場合において、ステップＳ１０８及びステップＳ１１０で生成した誤認識文字列候補として“ＨＢＣ０”、“Ａ８Ｃ０”、“ＡＢＣｏ”、“ＡＢＣＯ”、“Ｈ８Ｃ０”、“Ｈ８Ｃｏ”、“Ｈ８ＣＯ”、“Ａ日Ｃ０”、“ＡＢＣＤ”が含まれていたとする。正規表現で表される文字列は先頭から３文字目まではアルファベット、先頭から４文字目は数字なので、上記の誤認識文字列候補のうち、先頭から４文字目がアルファベットである“ＡＢＣｏ”、“ＡＢＣＯ”、“Ｈ８Ｃｏ”、“Ｈ８ＣＯ”、“ＡＢＣＤ”、先頭から２文字目が漢字である“Ａ日Ｃ０”は誤認識文字列候補から除外される。 For example, when the regular expression is "[A][B][C][0-9]" in the above example, the misrecognized character string candidates generated in steps S108 and S110 are "HBC0" and "A8C0". , "ABCo", "ABCO", "H8C0", "H8Co", "H8CO", "A day C0", and "ABCD". Since the first three characters of a character string represented by a regular expression are alphabetic characters, and the fourth character is a number, among the above misrecognized character string candidates, "ABCo" whose fourth character is an alphabetic character, "ABCO", "H8Co", "H8CO", "ABCD", and "A day C0" whose second character is a Chinese character are excluded from the erroneously recognized character string candidates.

ステップＳ１１４では、ＣＰＵ１０Ａが、誤認識文字列候補の中から文字列候補を特定する。例えば図８に示すように、記憶部７に予め記憶された参照リスト４８を用いて文字列候補を特定する。参照リスト４８は、図面情報を抽出する対象の図面に含まれ得る文字列のリストである。図８に示すように、参照リスト４８は、一例として５個の文字列“ＡＢＣ０”、“ＡＢＣ１”、“ＡＢＣ２”、“ＡＢＣ５”、“ＡＢＣ８”を含んでいる。 In step S114, CPU 10A identifies a character string candidate from the erroneously recognized character string candidates. For example, as shown in FIG. 8, character string candidates are identified using a reference list 48 pre-stored in the storage unit 7 . The reference list 48 is a list of character strings that can be included in the drawing from which drawing information is to be extracted. As shown in FIG. 8, the reference list 48 contains, by way of example, five character strings "ABC0", "ABC1", "ABC2", "ABC5" and "ABC8".

ここで、誤認識文字列候補のうち、参照リストに含まれる文字列を文字列候補として特定する。一方、参照リストに何れの誤認識文字列候補も含まれていない場合は、文字列候補は特定されない。 Here, among the erroneously recognized character string candidates, character strings included in the reference list are specified as character string candidates. On the other hand, if the reference list does not contain any misrecognized character string candidates, no character string candidates are identified.

ステップＳ１１６では、ＣＰＵ１０Ａが、ステップＳ１１４で文字列候補が特定されたか否かを判定する。そして、文字列候補が特定された場合、すなわち参照リストに誤認識文字列候補が存在する場合はステップＳ１１８へ移行する。一方、文字列候補が特定されなかった場合、すなわち参照リストに誤認識文字列候補が存在しなかった場合はステップＳ１２４へ移行する。 In step S116, CPU 10A determines whether or not a character string candidate has been identified in step S114. Then, if a character string candidate is specified, that is, if a misrecognized character string candidate exists in the reference list, the process proceeds to step S118. On the other hand, if no character string candidate is specified, that is, if there is no erroneously recognized character string candidate in the reference list, the process proceeds to step S124.

ステップＳ１１８では、ＣＰＵ１０Ａが、ステップＳ１０６の再認識処理において算出されたスコアが予め定めた閾値以上であるか否かを判定する。なお、閾値は、スコアが閾値以上であれば、ＯＣＲ処理の認識結果の信頼性が許容レベル以上とされる値に設定される。例えば、スコアが取り得る値の最大値の９割以上の値に設定される。具体的には、スコアが取り得る値の範囲が０～１の場合、閾値は一例として０．９５に設定される。 In step S118, CPU 10A determines whether or not the score calculated in the re-recognition process in step S106 is equal to or greater than a predetermined threshold. Note that the threshold is set to a value at which, if the score is equal to or higher than the threshold, the reliability of the recognition result of OCR processing is equal to or higher than the allowable level. For example, it is set to a value that is 90% or more of the maximum value that the score can take. Specifically, when the range of values that the score can take is 0 to 1, the threshold is set to 0.95 as an example.

そして、ステップＳ１０６の再認識処理において算出されたスコアが予め定めた閾値以上である場合、すなわちステップＳ１０６の再認識処理の認識結果の信頼性が許容レベル以上である場合はステップＳ１１９へ移行する。一方、ステップＳ１０６の再認識処理において算出されたスコアが予め定めた閾値未満である場合、すなわちステップＳ１０６の再認識処理の認識結果の信頼性が許容レベル未満である場合はステップＳ１２２へ移行する。 Then, when the score calculated in the re-recognition process of step S106 is equal to or higher than a predetermined threshold value, that is, when the reliability of the recognition result of the re-recognition process of step S106 is equal to or higher than the allowable level, the process proceeds to step S119. On the other hand, if the score calculated in the re-recognition process of step S106 is less than the predetermined threshold, that is, if the reliability of the recognition result of the re-recognition process of step S106 is less than the allowable level, the process proceeds to step S122.

ステップＳ１１９では、ステップＳ１１４で特定された文字列候補が１つであるか否かを判定する。そして、特定された文字列候補が１つの場合はステップＳ１２０へ移行し、特定された文字列候補が複数の場合はステップＳ１２２へ移行する。 In step S119, it is determined whether or not there is one character string candidate identified in step S114. If there is one specified character string candidate, the process proceeds to step S120, and if there are a plurality of specified character string candidates, the process proceeds to step S122.

ステップＳ１２０では、ＣＰＵ１０Ａが、ステップＳ１１４で特定した１つの文字列候補を出力する。すなわち、特定した文字列候補を紙等の記録媒体に印刷したり、データファイルとして記憶部７に記憶したり、通信部６を介して外部装置に送信したり、表示部５に表示したりする。 In step S120, CPU 10A outputs one character string candidate specified in step S114. That is, the specified character string candidate is printed on a recording medium such as paper, stored as a data file in the storage unit 7, transmitted to an external device via the communication unit 6, or displayed on the display unit 5. .

一方、ステップＳ１２２が実行されるのは、ステップＳ１１８で再認識処理の認識結果の信頼性が許容レベル未満と判定された又はステップＳ１１９で複数の文字列候補が特定された場合である。このため、ステップＳ１１４で特定された文字列候補を、あくまで図面に含まれると推定される文字列候補である推定文字列候補として出力する。 On the other hand, step S122 is executed when it is determined in step S118 that the reliability of the recognition result of the re-recognition process is below the allowable level or when a plurality of character string candidates are identified in step S119. Therefore, the character string candidate specified in step S114 is output as an estimated character string candidate, which is a character string candidate that is estimated to be included in the drawing.

また、ステップＳ１２４が実行されるのは、参照リストに誤認識文字列候補が存在しない場合である。このため、誤認識文字列候補に近い文字列を参照リストから検索する。具体的には、誤認識文字列候補と異なる文字の数が最も少ない文字列を、誤認識文字列候補に近い文字列として参照リストから検索する。そして、検索した文字列を推定文字列候補として出力する。 Further, step S124 is executed when there is no erroneously recognized character string candidate in the reference list. Therefore, the reference list is searched for a character string close to the erroneously recognized character string candidate. Specifically, the reference list is searched for a character string having the smallest number of characters different from the misrecognized character string candidate as a character string close to the misrecognized character string candidate. Then, the retrieved character string is output as an estimated character string candidate.

ステップＳ１２６では、ステップＳ１０６で認識された全ての文字列についてステップＳ１０８～Ｓ１２４の処理を実行したか否かを判定する。そして、ステップＳ１０６で認識された全ての文字列についてステップＳ１０８～Ｓ１２４の処理を実行した場合は本ルーチンを終了する。一方、ステップＳ１０６で認識された全ての文字列についてステップＳ１０８～Ｓ１２４の処理を実行していない場合は、ステップＳ１０８へ移行し、全ての文字列の処理が終了するまでステップＳ１０８～Ｓ１２４の処理を繰り返す。 In step S126, it is determined whether or not the processing of steps S108 to S124 has been executed for all the character strings recognized in step S106. Then, when the processing of steps S108 to S124 has been executed for all the character strings recognized in step S106, this routine ends. On the other hand, if the processing of steps S108 to S124 has not been executed for all the character strings recognized in step S106, the process proceeds to step S108, and the processing of steps S108 to S124 is continued until the processing of all character strings is completed. repeat.

このように、本実施形態では、ＯＣＲ処理によって画像から認識された文字列から、正規表現で表された抽出パターンを用いて文字列候補を抽出する。そして、ＯＣＲ処理が誤認識する誤認識パターンを用いて文字列候補を修正し、予め定めた参照リストを用いて、修正された文字列候補の中から特定した文字列候補を出力する。 As described above, in the present embodiment, character string candidates are extracted from character strings recognized from an image by OCR processing using extraction patterns represented by regular expressions. Then, the character string candidate is corrected using the erroneously recognized pattern that is erroneously recognized by the OCR processing, and the character string candidate specified from the corrected character string candidates is output using a predetermined reference list.

なお、第１実施形態では、図３のステップＳ１１０において、ユーザー指定の誤認識パターンを定義した誤認識パターン情報４６を用いて生成した文字列を誤認識文字列候補に追加しているが、ステップＳ１１０の処理を省略してもよい。 In the first embodiment, in step S110 of FIG. 3, the character string generated using the misrecognition pattern information 46 defining the misrecognition pattern specified by the user is added to the misrecognition character string candidates. The processing of S110 may be omitted.

次に、文字列候補を特定する場合の具体例について説明する。 Next, a specific example of identifying character string candidates will be described.

・具体例１・Specific example 1

具体例１では、正規表現が[F][X][1][0][A-Z][A-Z][0-9][0-9][0-9]、図面に含まれる実際の文字列が“FX10OG106”であり、参照リストには図９に示すように９個の文字列が含まれている。 In example 1, the regular expression is [F][X][1][0][A-Z][A-Z][0-9][0-9][0-9], and the actual character string contained in the drawing is "FX10OG106" and the reference list contains 9 strings as shown in FIG.

上記の条件でＯＣＲ処理を実行すると、認識した文字列候補は“FXIOO6IO6”であった。すなわち、３文字目の“１”が“I”に、４文字目の“０”が“Ｏ”に、６～８文字目の“G10”が“6IO”に誤認識されている。 When OCR processing was executed under the above conditions, the recognized character string candidate was "FXIOO6IO6". That is, the third character "1" is erroneously recognized as "I", the fourth character "0" as "O", and the sixth to eighth characters "G10" as "6IO".

次に、再認識処理の結果は、“FX100G106”となり、５文字目の“Ｏ”が“０”に誤認識されている。なお、再認識処理のスコアは閾値以上であった。また、誤認識文字列候補としては、“FX1o0G106”、“FX10OG106”、“FX10oG106”等が生成された。なお、具体例１では、ユーザー指定の誤認識パターン情報を用いて誤認識文字列候補を更に生成する処理は実行しなかった。 Next, the result of the re-recognition processing is "FX100G106", and the fifth character "O" is erroneously recognized as "0". Note that the score of the re-recognition process was equal to or higher than the threshold. In addition, "FX1o0G106", "FX10OG106", "FX10oG106", etc. were generated as misrecognized character string candidates. Note that in Specific Example 1, the process of further generating misrecognition character string candidates using user-specified misrecognition pattern information was not executed.

誤認識文字列候補のうち、正規表現を用いて不要な文字列候補を除外すると、“FX10OG106”のみが残る。この誤認識文字列候補は、図９の参照リストに含まれている。このため、図面に記載された文字列候補として“FX10OG106”が特定され、出力される。この文字列は、図面に含まれる実際の文字列と完全に一致している。 If unnecessary character string candidates are excluded from the incorrectly recognized character string candidates using regular expressions, only "FX10OG106" remains. This misrecognized character string candidate is included in the reference list in FIG. Therefore, "FX10OG106" is identified and output as a character string candidate described in the drawing. This string exactly matches the actual string contained in the drawing.

・具体例２・Specific example 2

具体例２では、正規表現、図面に実際に含まれる文字列、参照リスト、及びＯＣＲ処理の結果は具体例１と同じである。 In specific example 2, the regular expression, the character strings actually included in the drawing, the reference list, and the result of OCR processing are the same as in specific example 1. FIG.

再認識処理の結果は、“FX1006106”となり、５文字目の“Ｏ”が“０”に、６文字目の“Ｇ”が“６”に誤認識されている。なお、再認識処理のスコアは閾値以上であった。 The result of the re-recognition processing is "FX1006106", in which the fifth character "O" is erroneously recognized as "0" and the sixth character "G" is erroneously recognized as "6". Note that the score of the re-recognition process was equal to or higher than the threshold.

また、誤認識文字列候補としては、“FX1O06106”、“FX1oO6106”、“FX10O606”、 “FX10o6106”等が生成された。 In addition, "FX1O06106", "FX1oO6106", "FX10O606", "FX10o6106", etc. were generated as erroneously recognized character string candidates.

また、具体例２では、ユーザー指定の誤認識パターン情報を用いて生成した文字列を誤認識文字列候補に追加した。ユーザー指定の誤認識パターン情報は、誤認識パターンとして“６：Ｇ”が定義されている。このため、誤認識文字列候補として“FX10OG106”等を生成して追加した。 Further, in Specific Example 2, a character string generated using misrecognition pattern information specified by the user is added to the misrecognition character string candidates. In the user-specified recognition error pattern information, "6:G" is defined as the recognition error pattern. Therefore, "FX10OG106" etc. was generated and added as an erroneously recognized character string candidate.

再認識処理で認識された文字列及び誤認識文字列候補のうち、正規表現を用いて不要な誤認識文字列候補を除外すると、“FX10OG106”のみが残る。この誤認識文字列候補は、図９の参照リストに含まれている。このため、図面に記載された文字列候補として“FX10OG106”が特定され、出力される。この文字列は、図面に含まれる実際の文字列と完全に一致している。 Of the character strings recognized in the re-recognition process and the erroneously recognized character string candidates, if unnecessary erroneously recognized character string candidates are excluded using regular expressions, only "FX10OG106" remains. This misrecognized character string candidate is included in the reference list in FIG. Therefore, "FX10OG106" is identified and output as a character string candidate described in the drawing. This string exactly matches the actual string contained in the drawing.

・具体例３・Specific example 3

具体例３では、再認識処理のスコアが閾値未満であること以外は具体例２と同じである。 Specific example 3 is the same as specific example 2 except that the score of the re-recognition process is less than the threshold.

具体例３では、具体例２と同様に、図面に記載された文字列候補として、図面に含まれる実際の文字列と完全に一致する“FX10OG106”が特定されるが、再認識処理のスコアが閾値未満であるため信頼性が低い。従って、文字列候補“FX10OG106”は、あくまで図面に含まれると推定される文字列候補である推定文字列候補として出力される。 In Concrete Example 3, as in Concrete Example 2, "FX10OG106", which completely matches the actual character string included in the drawing, is specified as a character string candidate described in the drawing, but the re-recognition processing score is Reliability is low because it is below the threshold. Therefore, the character string candidate "FX10OG106" is output as an estimated character string candidate, which is a character string candidate presumed to be included in the drawing.

・具体例４・Specific example 4

具体例４では、図１０に示すように、参照リストに１０個の文字列が含まれている点以外は具体例１と同じである。 Specific example 4 is the same as specific example 1 except that the reference list includes 10 character strings, as shown in FIG.

具体例４では、具体例１と同様に文字列候補として“FX10OG106”が特定されるが、この文字列は図１０の参照リストに含まれていない。このため、図１０に参照リストに含まれる文字列のうち、“FX10OG106”と異なる文字の数が最も少ない文字列、すなわち“FX10OG106”と末尾の１文字が異なるだけの文字列である“FX10OG101”、“FX10OG102”、“FX10OG103”、“FX10OG104”、“FX10OG105”が推定文字列候補として出力される。 In Concrete Example 4, "FX10OG106" is specified as a character string candidate as in Concrete Example 1, but this character string is not included in the reference list of FIG. For this reason, among the character strings included in the reference list shown in FIG. , “FX10OG102”, “FX10OG103”, “FX10OG104”, and “FX10OG105” are output as estimated character string candidates.

・具体例５・Specific example 5

具体例５では、正規表現が[F][X][1][0][A-Z][0-9,A-Z][0-9][0-9][0-9]、図面に含まれる実際の文字列が“FX10OG104”であり、参照リストには図１１に示すように８個の文字列が含まれている。 In example 5, the regular expression is [F][X][1][0][A-Z][0-9,A-Z][0-9][0-9][0-9], which is included in the drawing The actual string is "FX10OG104" and the reference list contains 8 strings as shown in FIG.

上記の条件でＯＣＲ処理を実行すると、認識した文字列は“FXIOO6IO4”であった。すなわち、３、４文字目の“10”が“IO”に、６～９文字目の“G104”が“6IO4”に誤認識されている。 When OCR processing was performed under the above conditions, the recognized character string was "FXIOO6IO4". That is, the 3rd and 4th characters "10" are erroneously recognized as "IO", and the 6th to 9th characters "G104" are erroneously recognized as "6IO4".

次に、再認識処理の結果は、“FX1006104”となり、５、６文字目の“ＯＧ”が“０６”に誤認識されている。なお、再認識処理のスコアは閾値以上であった。また、誤認識文字列候補としては、“FX1O06104”、“FX1o06104”、“FX10O6104”、“FX10o6104”等が生成された。 Next, the result of the re-recognition processing is "FX1006104", and the 5th and 6th characters "OG" are erroneously recognized as "06". Note that the score of the re-recognition process was equal to or higher than the threshold. In addition, "FX1O06104", "FX1o06104", "FX10O6104", "FX10o6104", etc. were generated as misrecognized character string candidates.

また、ユーザー指定の誤認識パターン情報を用いて生成した文字列を誤認識文字列候補に追加した。ユーザー指定の誤認識パターン情報は、誤認識パターンとして“６：Ｇ”が定義されている。このため、誤認識文字列候補として“FX10OG104”等を生成して追加した。 In addition, character strings generated using misrecognition pattern information specified by the user are added to misrecognition character string candidates. In the user-specified recognition error pattern information, "6:G" is defined as the recognition error pattern. For this reason, "FX10OG104" etc. was generated and added as an erroneously recognized character string candidate.

誤認識文字列候補のうち、正規表現を用いて不要な誤認識文字列候補を除外すると、“FX10OG104”及び“FX10O6104”が残る。これらの文字列候補は、何れも図１１に示す参照リストに含まれているため、１つの文字列候補に絞り込めない。従って、これら２つの文字列候補を推定文字列候補として出力する。 When unnecessary misrecognized character string candidates are excluded from the misrecognized character string candidates using regular expressions, "FX10OG104" and "FX10O6104" remain. Since these character string candidates are all included in the reference list shown in FIG. 11, they cannot be narrowed down to one character string candidate. Therefore, these two character string candidates are output as estimated character string candidates.

（第２実施形態） (Second embodiment)

次に、第２実施形態について説明する。なお、第１実施形態と同一部分には同一符号を付し、詳細な説明は省略する。第２実施形態では、図面から抽出したい情報がプラスチック部品の材質を表す文字列である場合について説明する。 Next, a second embodiment will be described. In addition, the same code|symbol is attached|subjected to the same part as 1st Embodiment, and detailed description is abbreviate|omitted. In the second embodiment, the case where the information to be extracted from the drawing is a character string representing the material of the plastic part will be described.

プラスチック部品の材質の表示方法は、ＪＩＳ（日本工業規格）で規定されている。例えば、単一のポリマー等の部品の表示は、ＪＩＳＫ６８９９－１で規定された材料の略語を境界文字“＞”及び“＜”で挟むことになっている。例えば、材料がアクリロニトリル－ブタジエン－スチレンであり、この略語がＡＢＳである場合は、“＞ＡＢＳ＜”と表記される。 JIS (Japanese Industrial Standards) stipulates the method of indicating the material of plastic parts. For example, the designation of a part such as a single polymer is to enclose the abbreviation of the material specified in JIS K 6899-1 between the boundary characters ">" and "<". For example, if the material is acrylonitrile-butadiene-styrene and the abbreviation is ABS, it is written as ">ABS<".

また、プラスチック部品の材質の表示では、材料の略語の他に、“－”、“＋”、“（”、“）”等が区切り文字として使用されたり、質量分率又は材料の含有率等を表す数字が使用されたりする。例えば“＞ＰＡ６６－（ＧＦ２５＋ＭＤ１５）＜”と表記されている場合、ガラス繊維（ＧＦ）２５質量％と鉱物粉末（ＭＤ）１５質量％の混合物を含むポリアミド６６を表す。 In addition to the material abbreviations, "-", "+", "(", ")", etc. are used as delimiters, and mass fractions or material content rates are used to indicate the materials of plastic parts. A number representing is used. For example, the notation ">PA66-(GF25+MD15)<" denotes polyamide 66 containing a mixture of 25% by weight glass fiber (GF) and 15% by weight mineral powder (MD).

このように、境界文字で挟まれた文字列には、複数の材料の略語が含まれる場合があるが、必要とされるのは材料の略語であり、区切り文字は不要である。このため、第２実施形態では、境界文字で挟まれた文字列に区切り文字が含まれる場合は、文字列を区切り文字で分割し、材料の略語及び数字で構成される文字列を必要文字列として抽出する。 Thus, a string between boundary characters may contain multiple material abbreviations, but only the material abbreviations are required and no delimiters are required. For this reason, in the second embodiment, when a character string sandwiched between boundary characters includes a delimiter, the character string is divided by the delimiter, and a character string composed of material abbreviations and numbers is converted into a required character string. Extract as

図１２には、第２実施形態に係るＣＰＵ１０Ａの機能ブロック図を示した。図１２に示す機能ブロック図は、必要文字列抽出部３１が設けられている点が図２に示す機能ブロック図と異なる。 FIG. 12 shows a functional block diagram of the CPU 10A according to the second embodiment. The functional block diagram shown in FIG. 12 differs from the functional block diagram shown in FIG. 2 in that a required character string extraction unit 31 is provided.

必要文字列抽出部３１は、再認識処理部３０による再認識処理により認識された文字列から必要文字列を抽出する。具体的には、再認識処理により認識された文字列に境界文字が含まれているか否かを判定する。そして、再認識処理により認識された文字列に境界文字が含まれている場合には、境界文字で挟まれた文字列を区切り文字で分割し、材料の略語及び数字で構成される文字列を必要文字列として抽出する。 The required character string extraction unit 31 extracts a required character string from the character strings recognized by the re-recognition processing by the re-recognition processing unit 30 . Specifically, it is determined whether or not the character string recognized by the re-recognition process includes a boundary character. Then, if the character string recognized by the re-recognition process contains boundary characters, the character string sandwiched between the boundary characters is divided by the delimiter, and the character string composed of the material abbreviations and numbers is Extract as a required string.

次に、図１３を参照して、本実施の形態に係る情報処理装置１０の作用を説明する。図１０に示す情報処理が図３に示す情報処理と異なるのは、ステップＳ１０７の処理が追加されている点及びステップＳ１１４の処理が若干異なる点である。 Next, operation of the information processing apparatus 10 according to the present embodiment will be described with reference to FIG. The information processing shown in FIG. 10 differs from the information processing shown in FIG. 3 in that the processing of step S107 is added and the processing of step S114 is slightly different.

ステップＳ１０７では、ステップＳ１０６の再認識処理により認識された文字列の各々について、必要文字列を抽出する処理を行う。すなわち、再認識処理により認識された文字列のうち、境界文字“＞”及び“＜”で挟まれた文字列を抽出する。そして、境界文字“＞”及び“＜”で挟まれた文字列のうち、“－”、“＋”、“（”、“）”等の区切り文字が含まれていない文字列については、境界文字を除いた文字列を必要文字列として抽出する。例えば再認識処理で認識された文字列が“＞ＡＢＳ＜”の場合は、境界文字を除外した“ＡＢＳ”を必要文字列として抽出する。 In step S107, processing for extracting necessary character strings is performed for each of the character strings recognized by the re-recognition processing in step S106. That is, the character strings sandwiched between the boundary characters “>” and “<” are extracted from the character strings recognized by the re-recognition process. Among the character strings sandwiched between boundary characters ">" and "<", character strings that do not contain delimiters such as "-", "+", "(", and ")" Extract the string excluding characters as the required string. For example, if the character string recognized in the re-recognition process is ">ABS<", then "ABS" excluding the boundary characters is extracted as the required character string.

一方、境界文字“＞”及び“＜”で挟まれた文字列のうち、“－”、“＋”、“（”、“）”等の区切り文字が含まれている場合は、区切り文字で文字列を分割し、材料の略語及び数字で構成される文字列を必要文字列として抽出する。例えば再認識処理で認識された文字列が“＞ＰＡ６６－（ＧＦ２５＋ＭＤ１５）＜”の場合は、境界文字及び区切り文字を除外した“ＰＡ６６”、“ＧＦ２５”、及び“ＭＤ１５”を必要文字列として抽出する。このため、ステップＳ１０８～Ｓ１２６の処理は、必要文字列毎に実行される。すなわち、ステップＳ１０８の誤認識文字列候補の生成も必要文字列毎に生成される。 On the other hand, if a character string sandwiched between boundary characters “>” and “<” contains delimiters such as “-”, “+”, “(”, “)”, The character string is divided and character strings composed of material abbreviations and numbers are extracted as required character strings. For example, if the character string recognized in the re-recognition process is ">PA66-(GF25+MD15)<", extract "PA66", "GF25", and "MD15" excluding boundary characters and delimiters as required character strings. do. Therefore, the processes of steps S108 to S126 are executed for each required character string. That is, generation of erroneously recognized character string candidates in step S108 is also generated for each required character string.

また、ステップＳ１１４では、例えば図１４に示すような参照リスト４８Ａを用いて、必要文字列を含む誤認識文字列候補の中から文字列候補を特定する。図１４に示すように、参照リスト４８Ａは、材料の略語のみが定義されたリストであるが、必要文字列を含む誤認識文字列候補は材料の略語及び数字を含む文字列となる。このため、必要文字列を含む誤認識文字列候補のうち、参照リスト４８Ａに定義された文字列と材料の略語が部分的に一致する文字列が存在する場合は、その文字列を画像に含まれる文字列候補として特定する。 Further, in step S114, for example, using a reference list 48A as shown in FIG. 14, character string candidates are identified from the misrecognized character string candidates including the necessary character string. As shown in FIG. 14, the reference list 48A is a list in which only material abbreviations are defined, but misrecognized character string candidates including necessary character strings are character strings including material abbreviations and numbers. Therefore, if there is a character string partially matching the character string defined in the reference list 48A and the abbreviation of the material among the erroneously recognized character string candidates including the necessary character string, the character string is included in the image. Identifies string candidates that can be

例えば必要文字列が“ＰＡ６６”の場合、参照リスト４８Ａに定義された“ＰＡ”と前方一致するので、“ＰＡ”が画像に含まれる文字列候補として特定される。 For example, if the required character string is "PA66", it matches "PA" defined in the reference list 48A, so "PA" is identified as a character string candidate included in the image.

・具体例１・Specific example 1

具体例１では、正規表現が[>][][][][][][][][][][][][][][][][][][][][][][][][<] 、図面に含まれる実際の文字列が“>PA66-(GF25+MD15)<”であり、図１５に示すような参照リストに１１個の文字列が含まれている。 In example 1, the regular expression is [>][][][][][][][][][][][][][][][][][][][][ ][][][][<] , the actual string contained in the drawing is ">PA66-(GF25+MD15)<" and the reference list contains 11 strings as shown in Figure 15 is

上記の条件でＯＣＲ処理を実行すると、認識した文字列候補は“>PAGG-(GF2S+MDI5)<”であった。すなわち、４、５文字目の“66”が“GG”に、１０文字目の“5”が“S”に、１４文字目の“1”が“I”に誤認識されている。 When OCR processing was performed under the above conditions, the recognized character string candidate was ">PAGG-(GF2S+MDI5)<". That is, the 4th and 5th characters "66" are erroneously recognized as "GG", the 10th character "5" as "S", and the 14th character "1" as "I".

次に、再認識処理の結果は、“>PA66-(GF25+MD15)<”となり、再認識処理のスコアは閾値以上であった。また、必要文字抽出処理により“PA66”、“GF25”、“MD15”が抽出された。これらの文字列は、材料の略語を表す文字列が図１５に示す参照リストに含まれている。このため、図面に記載された文字列候補として“PA”、“GF”、“MD”が特定され、出力される。 Next, the result of the re-recognition process was ">PA66-(GF25+MD15)<", and the score of the re-recognition process was above the threshold. Also, "PA66", "GF25", and "MD15" were extracted by the necessary character extraction processing. These strings are included in the reference list shown in FIG. 15, which strings represent material abbreviations. Therefore, "PA", "GF", and "MD" are identified and output as character string candidates described in the drawing.

・具体例２・Specific example 2

具体例２は、再認識処理のスコアが閾値未満である点以外は具体例１と同じである。この場合、再認識処理の認識結果の信頼性が低いので、“PA”、“GF”、“MD”は推定文字列候補として出力される。 Specific example 2 is the same as specific example 1 except that the score of the re-recognition process is less than the threshold. In this case, since the reliability of the recognition result of the re-recognition process is low, "PA", "GF", and "MD" are output as estimated character string candidates.

以上、各実施形態を用いて本発明について説明したが、本発明は各実施形態に記載の範囲には限定されない。本発明の要旨を逸脱しない範囲で各実施形態に多様な変更または改良を加えることができ、当該変更または改良を加えた形態も本発明の技術的範囲に含まれる。 Although the present invention has been described above using each embodiment, the present invention is not limited to the scope described in each embodiment. Various changes or improvements can be made to each embodiment without departing from the gist of the present invention, and forms with such changes or improvements are also included in the technical scope of the present invention.

例えば上記各実施形態では、情報処理装置が複合機に適用される場合を例に説明したが、スマートフォン又はタブレット端末等に適用してもよい。この場合、スマートフォン又はタブレット端末に搭載されたカメラで図面を撮影し、撮影した画像に対して図３又は図１０の処理を実行すればよい。 For example, in each of the above-described embodiments, the case where the information processing apparatus is applied to a multi-function peripheral has been described as an example, but it may be applied to a smart phone, a tablet terminal, or the like. In this case, the drawing may be photographed with a camera mounted on a smartphone or tablet terminal, and the processing of FIG. 3 or FIG. 10 may be performed on the photographed image.

また、例えば、図３、１３に示した情報処理をＡＳＩＣ（Application Specific Integrated Circuit）等のハードウェアで実現するようにしてもよい。この場合、ソフトウエアで実現する場合に比べて、処理の高速化が図られる。 Further, for example, the information processing shown in FIGS. 3 and 13 may be realized by hardware such as ASIC (Application Specific Integrated Circuit). In this case, the speed of processing can be increased as compared with the case of realizing by software.

また、各実施形態では、情報処理プログラムが記憶部７にインストールされている形態を説明したが、これに限定されるものではない。本実施形態に係る情報処理プログラムを、コンピュータ読取可能な記憶媒体に記録した形態で提供してもよい。例えば、本実施形態に係る情報処理プログラムを、ＣＤ(Compact Disc)－ＲＯＭ及びＤＶＤ(Digital Versatile Disc)－ＲＯＭ等の光ディスクに記録した形態、若しくはＵＳＢ(Universal Serial Bus)メモリ及びメモリカード等の半導体メモリに記録した形態で提供してもよい。また、本実施形態に係る情報処理プログラムを、通信部６に接続された通信回線を介して外部装置から取得するようにしてもよい。 Further, in each embodiment, a form in which the information processing program is installed in the storage unit 7 has been described, but the present invention is not limited to this. The information processing program according to this embodiment may be provided in a form recorded on a computer-readable storage medium. For example, the information processing program according to the present embodiment is recorded in an optical disk such as CD (Compact Disc)-ROM and DVD (Digital Versatile Disc)-ROM, or a semiconductor such as USB (Universal Serial Bus) memory and memory card It may be provided in a form recorded in memory. Also, the information processing program according to the present embodiment may be acquired from an external device via a communication line connected to the communication section 6 .

１画像形成装置
２スキャナ部
３画像形成部
４操作部
５表示部
６通信部
７記憶部
１０情報処理装置
２０ＯＣＲ処理部
２２取得部
２４抽出部
２６修正部
２８出力部
３０再認識処理部
３１必要文字列抽出部
３２生成部
４０情報処理プログラム
４２正規表現情報
４４誤認識文字情報
４６誤認識パターン情報
４８、４８Ａ参照リスト
５０画像 1 Image forming device 2 Scanner unit 3 Image forming unit 4 Operation unit 5 Display unit 6 Communication unit 7 Storage unit 10 Information processing device 20 OCR processing unit 22 Acquisition unit 24 Extraction unit 26 Correction unit 28 Output unit 30 Re-recognition processing unit 31 Necessary Character string extraction unit 32 Generation unit 40 Information processing program 42 Regular expression information 44 Misrecognition character information 46 Misrecognition pattern information 48, 48A Reference list 50 Image

Claims

an acquisition unit that acquires a character string recognized from an image by character recognition processing;
an extraction unit that extracts a character string candidate from the character string acquired by the acquisition unit using an extraction pattern represented by a regular expression;
a correction unit that corrects the character string candidate using an erroneous recognition pattern that is erroneously recognized by the character recognition process;
an output unit configured to output character string candidates identified from the character string candidates corrected by the correction unit using a predetermined reference list;
with
The acquisition unit acquires the image,
a character recognition processing unit in which the correction unit performs character recognition processing again on only the image of the character string candidate extracted by the extraction unit using a learning model learned about a specific character;
a generating unit that generates misrecognized character string candidates that are character string candidates of the same type as the character string candidates recognized by the character recognition processing unit;
including
The output unit outputs a character string candidate identified from the misrecognized character string candidates.
Information processing equipment.

The information processing apparatus according to claim 1, wherein the extraction unit extracts the character string candidate using a modified extraction pattern represented by a modified regular expression obtained by modifying the regular expression using the misrecognition pattern.

3. The information processing apparatus according to claim 1 , wherein the generation unit adds a character string generated using an erroneous recognition pattern specified by a user to the erroneously recognized character string candidates.

The information processing apparatus according to any one of claims 1 to 3, wherein the generation unit generates the misrecognized character string candidates by excluding character string candidates represented by the extraction patterns.

The character recognition processing unit calculates a score representing accuracy of character recognition,
The information processing apparatus according to any one of claims 1 to 4 , wherein the output unit outputs the identified character string candidate when the score is equal to or greater than a predetermined threshold.

The character recognition processing unit calculates a score representing accuracy of character recognition,
The information processing apparatus according to any one of claims 1 to 4 , wherein the output unit outputs the specified character string candidate as an estimated character string candidate when the score is less than a predetermined threshold.

The character recognition processing unit calculates a score representing accuracy of character recognition,
5. The output unit outputs a plurality of the specified character string candidates as estimated character string candidates when the score is equal to or greater than a predetermined threshold and the number of the specified character string candidates is plural . 1. The information processing apparatus according to 1.

an acquisition unit that acquires a character string recognized from an image by character recognition processing;
an extraction unit that extracts a character string candidate from the character string acquired by the acquisition unit using an extraction pattern represented by a regular expression;
a correction unit that corrects the character string candidate using an erroneous recognition pattern that is erroneously recognized by the character recognition process;
an output unit configured to output character string candidates identified from the character string candidates corrected by the correction unit using a predetermined reference list;
with
The acquisition unit acquires the image,
The correction unit includes a character recognition processing unit that performs character recognition processing on the image with a specific character as a target;
a required character string extraction unit for extracting a required character string, which is a required character string, from the character string recognized by the character recognition processing unit;
a generating unit that generates misrecognized character string candidates that are character string candidates of the same type as the necessary character string extracted by the necessary character string extracting unit;
including
The output unit outputs a character string candidate identified from the misrecognized character string candidates.
Information processing equipment.

If the reference list does not contain the character string candidate corrected by the correction unit, the output unit selects a character string similar to the character string candidate corrected by the correction unit from among the character strings included in the reference list. The information processing apparatus according to any one of claims 1 to 4 , wherein the estimated character string candidate is output.

10. The information according to claim 9 , wherein the output unit outputs, among the character strings included in the reference list, a character string having the smallest number of characters different from the character string candidates corrected by the correction unit as an estimated character string candidate. processing equipment.

An information processing program for causing a computer to function as each part of the information processing apparatus according to any one of claims 1 to 10 .