JP2015201077A

JP2015201077A - Image processor, image processing method, and image processing program

Info

Publication number: JP2015201077A
Application number: JP2014080155A
Authority: JP
Inventors: 堤　隆弘; Takahiro Tsutsumi; 隆弘堤
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2014-04-09
Filing date: 2014-04-09
Publication date: 2015-11-12
Anticipated expiration: 2034-04-09
Also published as: JP6303742B2

Abstract

PROBLEM TO BE SOLVED: To provide an image processor capable of efficiently utilizing an OCR function while ensuring security.SOLUTION: The image processor performs a layout analysis (#3), and extracts an area on the basis of the result when receiving an input of a scan image (#1). The image processor determines the degree of a correlation between a plurality of areas (#4). In the case that the degree of the correlation between the areas is equal to or more than a prescribed degree, the image processor uses a different OCR function to each area (#5).

Description

この発明は画像処理装置、画像処理方法、および画像処理プログラムに関し、特に、ＯＣＲ（Optical Character Reader）機能を利用可能な画像処理装置、画像処理方法、および画像処理プログラムに関する。 The present invention relates to an image processing device, an image processing method, and an image processing program, and more particularly to an image processing device, an image processing method, and an image processing program that can use an OCR (Optical Character Reader) function.

画像処理の１つに、スキャン画像に対するＯＣＲ（Optical Character Reader）処理が挙げられる。ＯＣＲ処理を行なうとスキャン画像が文字認識されるため、ＯＣＲ機能の提供元の信頼度に応じては、元の原稿の情報の漏えいの危険性がある。そのため、たとえば外部装置の提供するＯＣＲ機能を利用する場合など、ＯＣＲ機能の提供元の信用度が高くない場合には、セキュリティーを確保する方法として、スキャン画像を分割して異なる提供元から提供されるＯＣＲ機能を利用する、などが考えられる。 One example of image processing is OCR (Optical Character Reader) processing on a scanned image. When the OCR process is performed, the scanned image is recognized, so that there is a risk of leakage of information on the original document depending on the reliability of the OCR function provider. Therefore, for example, when the reliability of the OCR function provider is not high, such as when using the OCR function provided by an external device, the scan image is divided and provided from different providers as a method for ensuring security. It is possible to use the OCR function.

特開２００８−２９３３５４号公報JP 2008-293354 A 特開２００６−２６０１１５号公報JP 2006-260115 A 特開２０１０−１０２７０９号公報JP 2010-102709 A 特開２００７−３２３４１５号公報JP 2007-323415 A 特開２００５−００７３１５号公報JP 2005-007315 A

しかしながら、スキャン画像を小エリアに分断してそれぞれ別個にＯＣＲ処理を行なうと、分断された小エリア間での意味解析や単語解析ができなくなる。そのため、ＯＣＲ精度の低下につながるという問題がある。 However, if the scanned image is divided into small areas and OCR processing is performed separately, it becomes impossible to perform semantic analysis and word analysis between the divided small areas. Therefore, there is a problem that the OCR accuracy is reduced.

また、スキャン画像から１文字ずつ切り出す処理は難しいため、小エリアのサイズを小さく設定したり、一部分ずつ重複させて小エリアに分断したりすることが考えられる。しかしながら、そのようにすると、ＯＣＲの処理量が増加してしまうという問題もある。 In addition, since it is difficult to cut out characters one by one from the scanned image, it is conceivable to set the size of the small area to be small or to divide it into small areas by overlapping each portion. However, when doing so, there is also a problem that the amount of OCR processing increases.

本発明はこのような問題に鑑みてなされたものであって、セキュリティーを確保しつつ効率的にＯＣＲ機能を利用可能な画像処理装置、画像処理方法、および画像処理プログラムを提供することを目的としている。 SUMMARY An advantage of some aspects of the invention is that it provides an image processing apparatus, an image processing method, and an image processing program that can efficiently use an OCR function while ensuring security. Yes.

上記目的を達成するために、本発明のある局面に従うと、画像処理装置はスキャン画像を処理可能な画像処理装置であって、スキャン画像の入力を受け付けるための入力手段と、ＯＣＲ（Optical Character Reader）機能を利用して、スキャン画像に対してＯＣＲ処理することで得られるテキストデータを取得するためのＯＣＲ処理手段と、ＯＣＲ処理手段でのＯＣＲ機能の利用を制御するための制御手段とを備える。制御手段は、スキャン画像のレイアウトに基づいてスキャン画像から抽出された第１のエリアと第２のエリアとの関連性の度合を判断するための判断手段を含み、第１のエリアと第２のエリアとの関連性の度合が所定度合以上である場合に、制御手段は、第１のエリアと第２のエリアとのそれぞれに対して異なるＯＣＲ機能を利用してＯＣＲ処理を行なうよう制御する。 In order to achieve the above object, according to an aspect of the present invention, an image processing apparatus is an image processing apparatus capable of processing a scanned image, an input means for receiving an input of the scanned image, an OCR (Optical Character Reader) ) OCR processing means for acquiring text data obtained by performing OCR processing on the scanned image using the function, and control means for controlling use of the OCR function in the OCR processing means. . The control means includes a determination means for determining the degree of relevance between the first area and the second area extracted from the scan image based on the layout of the scan image, the first area and the second area When the degree of association with the area is equal to or greater than the predetermined degree, the control unit controls the OCR process to be performed on each of the first area and the second area using different OCR functions.

好ましくは、ＯＣＲ処理手段がそれぞれ異なる提供元から提供される複数のＯＣＲ機能を利用可能である場合、制御手段は、第１のエリアと第２のエリアとのそれぞれに対して、それぞれ異なる提供元から提供されるＯＣＲ機能を利用してＯＣＲ処理を行なうよう制御する。 Preferably, when the OCR processing means can use a plurality of OCR functions provided from different providers, the control means provides different providers for the first area and the second area, respectively. Control is performed to perform OCR processing using the OCR function provided from.

好ましくは、制御手段は、第１のエリアと第２のエリアとのそれぞれに対して、同一の提供元から提供される同一のＯＣＲ機能を異なるタイミングで利用してＯＣＲ処理を行なうよう制御する。 Preferably, the control means controls the OCR process for each of the first area and the second area by using the same OCR function provided from the same provider at different timings.

好ましくは、判断手段は、第１のエリアおよび第２のエリアに含まれるキーワードに基づいて関連性の度合を判断する。 Preferably, the determination unit determines the degree of relevance based on keywords included in the first area and the second area.

好ましくは、判断手段は、スキャン画像における第１のエリアと第２のエリアとのそれぞれの配置に基づいて関連性の度合を判断する。 Preferably, the determination unit determines the degree of relevance based on the arrangement of the first area and the second area in the scan image.

好ましくは、判断手段において第１のエリアの第２のエリアに対する関連性が判断不能な場合に、制御手段は第１のエリアをさらに複数のエリアに分割し、それぞれ異なるＯＣＲ機能を利用してＯＣＲ処理を行なうよう制御する。 Preferably, when the determination unit cannot determine the relevance of the first area to the second area, the control unit further divides the first area into a plurality of areas, and uses OCR functions different from each other. Control to perform processing.

好ましくは、画像処理装置は、レイアウトごとに、当該レイアウトに含まれるエリアごとに用いるＯＣＲ機能を記憶しておくための記憶領域を有するメモリーをさらに備え、制御手段は、スキャン画像のレイアウトが記憶領域に記憶されているレイアウトに該当する場合に、記憶領域に記憶されているＯＣＲ機能を利用してスキャン画像のＯＣＲ処理を行なうように制御する。 Preferably, the image processing apparatus further includes a memory having a storage area for storing an OCR function to be used for each area included in the layout for each layout, and the control unit has a storage area for the layout of the scanned image. When the layout corresponds to the layout stored in the storage area, the OCR function stored in the storage area is used to perform OCR processing of the scanned image.

好ましくは、制御手段は、スキャン画像に設定されているセキュリティーレベルが所定範囲である場合に、第１のエリアと第２のエリアとのそれぞれに対して異なるＯＣＲ機能を利用してＯＣＲ処理を行なうよう制御する。 Preferably, the control means performs OCR processing using different OCR functions for the first area and the second area when the security level set in the scanned image is within a predetermined range. Control as follows.

より好ましくは、入力手段は、スキャン画像に対してセキュリティーレベルを設定するための設定手段を含む。 More preferably, the input means includes setting means for setting a security level for the scanned image.

好ましくは、制御手段は、さらに、スキャン画像から抽出されるエリアに対して設定されているセキュリティーレベルと、ＯＣＲ処理手段で利用可能なＯＣＲ機能のセキュリティーレベルとに基づいて、エリアに対して用いるＯＣＲ機能を決定する。 Preferably, the control unit further uses an OCR used for the area based on a security level set for the area extracted from the scanned image and a security level of an OCR function available in the OCR processing unit. Determine function.

好ましくは、スキャン画像はページ単位で分けられ、制御手段は、スキャン画像のページごとのレイアウトに基づいて、第１のエリアと第２のエリアとのそれぞれに対して異なるＯＣＲ機能を利用してＯＣＲ処理を行なうよう制御する。 Preferably, the scanned image is divided in units of pages, and the control unit uses an OCR function that is different for each of the first area and the second area based on the layout of each page of the scanned image. Control to perform processing.

本発明の他の局面に従うと、画像処理方法はスキャン画像の処理方法であって、スキャン画像の入力を受け付けるステップと、スキャン画像のレイアウトに基づいてスキャン画像から抽出された第１のエリアと第２のエリアとの関連性の度合を判断するステップと、ＯＣＲ機能を利用して、スキャン画像に対してＯＣＲ処理することで得られるテキストデータを取得するステップとを備え、テキストデータを取得するステップは、第１のエリアと第２のエリアとの関連性の度合が所定度合以上である場合に第１のエリアと第２のエリアとのそれぞれに対して異なるＯＣＲ機能を利用してＯＣＲ処理を行なうステップを含む。 According to another aspect of the present invention, the image processing method is a scan image processing method, the step of receiving an input of the scan image, the first area extracted from the scan image based on the scan image layout, and the first area A step of determining the degree of relevance with the area 2 and a step of acquiring text data obtained by performing OCR processing on a scanned image using an OCR function, and acquiring text data Uses the different OCR function for each of the first area and the second area when the degree of association between the first area and the second area is a predetermined degree or more. Including performing steps.

本発明のさらに他の局面に従うと、画像処理プログラムはコンピューターにスキャン画像の処理を実行させるプログラムであって、スキャン画像の入力を受け付けるステップと、スキャン画像のレイアウトに基づいてスキャン画像から抽出された第１のエリアと第２のエリアとの関連性の度合を判断するステップと、ＯＣＲ機能を利用して、スキャン画像に対してＯＣＲ処理することで得られるテキストデータを取得するステップとをコンピューターに実行させ、テキストデータを取得するステップは、第１のエリアと第２のエリアとの関連性の度合が所定度合以上である場合に第１のエリアと第２のエリアとのそれぞれに対して異なるＯＣＲ機能を利用してＯＣＲ処理を行なうステップを含む。 According to still another aspect of the present invention, the image processing program is a program that causes a computer to execute processing of a scanned image, and is extracted from the scanned image based on a step of receiving an input of the scanned image and a layout of the scanned image The step of determining the degree of relevance between the first area and the second area and the step of acquiring text data obtained by performing OCR processing on the scanned image using the OCR function The step of executing and acquiring the text data is different for each of the first area and the second area when the degree of association between the first area and the second area is equal to or greater than a predetermined degree. And a step of performing OCR processing using the OCR function.

この発明によると、画像亜処理装置において、セキュリティーを確保しつつ効率的にＯＣＲ機能を利用することができる。 According to the present invention, in the image sub-processing apparatus, it is possible to efficiently use the OCR function while ensuring security.

実施の形態にかかる画像処理装置の一例としてのＭＦＰ（Multi-Functional Peripheral）の装置構成の具体例を示すブロック図である。1 is a block diagram illustrating a specific example of an apparatus configuration of an MFP (Multi-Functional Peripheral) as an example of an image processing apparatus according to an embodiment; ＭＦＰでの動作概要を表わした図である。FIG. 3 is a diagram illustrating an outline of operation in an MFP. レイアウト解析を説明するための図である。It is a figure for demonstrating layout analysis. ＭＦＰの機能構成の具体例を表わしたブロック図である。2 is a block diagram illustrating a specific example of a functional configuration of an MFP. FIG. ＭＦＰでの動作の流れを表わすフローチャートである。3 is a flowchart showing an operation flow in the MFP.

以下に、図面を参照しつつ、本発明の実施の形態について説明する。以下の説明では、同一の部品および構成要素には同一の符号を付してある。それらの名称および機能も同じである。したがって、これらの説明は繰り返さない。 Embodiments of the present invention will be described below with reference to the drawings. In the following description, the same parts and components are denoted by the same reference numerals. Their names and functions are also the same. Therefore, these descriptions will not be repeated.

＜装置構成＞
図１は、本実施の形態にかかる画像処理装置の一例としてのＭＦＰ（Multi-Functional Peripheral）１００の装置構成の具体例を示すブロック図である。図１を参照して、ＭＦＰ１００は、装置全体を制御するためのＣＰＵ（Central Processing Unit）１０と、ＣＰＵ１０で実行されるプログラムを記憶するためのメモリーであるＲＯＭ（Read Only Memory）１１と、ＣＰＵ１０でプログラムを実行する際の作業領域となったり計算値を記憶したりするためのメモリーであるＲＡＭ（Random Access Memory）１２と、画像データなどを記憶するための大容量の記憶装置の一例としてのＨＤＤ（Hard Disk Drive）１３と、スキャナー１４と、プリンター１５と、操作パネル１６と、他の装置と通信するためのネットワークコントローラー１７とを含む。 <Device configuration>
FIG. 1 is a block diagram showing a specific example of the apparatus configuration of an MFP (Multi-Functional Peripheral) 100 as an example of an image processing apparatus according to the present embodiment. Referring to FIG. 1, MFP 100 includes a CPU (Central Processing Unit) 10 for controlling the entire apparatus, a ROM (Read Only Memory) 11 that is a memory for storing a program executed by CPU 10, and a CPU 10. As an example of a RAM (Random Access Memory) 12 that is a memory for storing a calculation value or a work area when executing a program, and a large-capacity storage device for storing image data and the like An HDD (Hard Disk Drive) 13, a scanner 14, a printer 15, an operation panel 16, and a network controller 17 for communicating with other devices are included.

ＭＦＰ１００は、他の装置の一例としてサーバー３００と通信可能である。ＭＦＰ１００は、サーバー３００と通信することによって、サーバー３００が提供するＯＣＲ（Optical Character Reader）機能を利用可能である。サーバー３００は、複数の種類のＯＣＲ機能を提供してもよい。または、ＭＦＰ１００は複数のサーバー３００と通信可能であって、それぞれのサーバー３００が提供する複数のＯＣＲ機能を利用可能であってもよい。 MFP 100 can communicate with server 300 as an example of another device. The MFP 100 can use an OCR (Optical Character Reader) function provided by the server 300 by communicating with the server 300. The server 300 may provide a plurality of types of OCR functions. Alternatively, MFP 100 may be able to communicate with a plurality of servers 300 and use a plurality of OCR functions provided by each server 300.

＜動作概要＞
図２は、ＭＦＰ１００での動作概要を表わした図である。図２を参照して、ユーザーは処理対象のドキュメントに対してスキャンを実行する（ステップ＃１）。好ましくは、ユーザーは、スキャン画像に対してセキュリティーレベルを設定する（ステップ＃２）。セキュリティーレベルの設定は、解像度の設定と同様に属性としてスキャン画像情報に付加されるものであってもよいし、セキュリティーレベルを表わす、たとえば「社外秘」などの予め規定された画像やコード（たとえばＱＲコード（登録商標）等）がスキャン画像自体に付加されるものであってもよい。 <Overview of operation>
FIG. 2 is a diagram showing an outline of operation in MFP 100. Referring to FIG. 2, the user scans the document to be processed (step # 1). Preferably, the user sets a security level for the scanned image (step # 2). The security level setting may be added to the scanned image information as an attribute in the same manner as the resolution setting, or a predetermined image or code (for example, QR) indicating the security level, such as “confidential”. A code (registered trademark) or the like may be added to the scanned image itself.

図３は、レイアウト解析を説明するための図であって、一例として「請求書」とタイトルされたドキュメントをスキャンすることで得られたスキャン画像が示されている。図３にも示されたように、ＯＣＲ処理の対象として、請求書や履歴書など、特定のフォーマットで作成されたドキュメントのスキャン画像が一般的に多いと考えられる。ＭＦＰ１００は、このようなフォーマットを有するスキャン画像に対してＯＣＲ処理を行なうことで当該ドキュメントに含まれる文字情報（テキスト）を取得することができる。 FIG. 3 is a diagram for explaining the layout analysis. As an example, a scan image obtained by scanning a document titled “Bill” is shown. As shown in FIG. 3, it is generally considered that there are many scanned images of a document created in a specific format such as an invoice or a resume as a target of OCR processing. MFP 100 can acquire character information (text) included in the document by performing OCR processing on the scanned image having such a format.

図３のようなスキャン画像の入力を受け付けたＭＦＰ１００は、スキャン画像に対してレイアウト解析を実行する（ステップ＃３）。レイアウト解析は、画像の特徴量から当該画像に含まれる情報を解析、分析する技術であって、文字を含む画像内の情報の配置を検出する技術を指す。図３のスキャン画像をレイアウト解析した場合、一例としてＭＦＰ１００は、文字を含む（または含んでいる可能性のある、以下、同様である）エリアとしてエリア５０１〜５１７を抽出する。ＭＦＰ１００は文字をエリア以外のエリアとして、罫線で構成された表や、写真、その他の図形を含んだエリアを抽出してもよい。ＭＦＰ１００は、各エリアの配置や内容を表わす情報をレイアウト情報として保持する。ＭＦＰ１００は、フォーマットごとにレイアウト情報を記憶してもよい。 Upon receiving the input of the scan image as shown in FIG. 3, the MFP 100 performs layout analysis on the scan image (step # 3). The layout analysis is a technique for analyzing and analyzing information included in an image from the feature amount of the image, and refers to a technique for detecting the arrangement of information in the image including characters. When the layout analysis of the scanned image of FIG. 3 is performed, for example, the MFP 100 extracts areas 501 to 517 as areas including (or possibly including) characters. The MFP 100 may extract an area including a table composed of ruled lines, a photograph, and other figures by using characters as areas other than the area. MFP 100 holds information representing the arrangement and contents of each area as layout information. MFP 100 may store layout information for each format.

ＭＦＰ１００は、さらに、抽出したエリア間での関連性を判断し（ステップ＃４）、その判断結果に基づいて各エリアのＯＣＲ処理に利用するＯＣＲ機能を決定する（ステップ＃５）。 The MFP 100 further determines the relationship between the extracted areas (step # 4), and determines an OCR function to be used for the OCR processing of each area based on the determination result (step # 5).

上記ステップ＃４でＭＦＰ１００は各エリアに対してキーワード検索を行なって、予め規定しているキーワードを含むか否かで各エリアを分類する。たとえば、「株式会社」や「有限会社」などの予め規定しているキーワードを含むエリア５０４，５０７，５１７について、ＭＦＰ１００はエリア５０４，５０７，５１７に含まれるテキストが社名を表わすとしてエリア５０４，５０７，５１７の分類を「社名」とすることができる。同様に、「部」や「部長」、「課長」などの予め規定しているキーワードを含むエリア５０６について、ＭＦＰ１００はエリア５０６に含まれるテキストが担当者を表わすとしてエリア５０６の分類を「担当者」とすることができる。予め規定しているいずれのキーワードも含まないエリア５１５については、ＭＦＰ１００は、キーワードを含まない「特殊文字列」として分類する。 In step # 4, MFP 100 performs a keyword search for each area, and classifies each area according to whether or not it includes a predefined keyword. For example, regarding areas 504, 507, and 517 including keywords such as “corporation” and “limited company”, MFP 100 assumes that the text included in areas 504, 507, and 517 represents the company name, and areas 504 and 507. , 517 can be “company name”. Similarly, for an area 506 that includes predetermined keywords such as “department”, “department manager”, and “section manager”, the MFP 100 classifies the area 506 as “person in charge” assuming that the text included in the area 506 represents the person in charge. ". For the area 515 that does not include any predefined keyword, the MFP 100 classifies the area 515 as a “special character string” that does not include the keyword.

ＭＦＰ１００は、上記ステップ＃４で各エリアの分類結果に基づいて、各エリアの関連性を判断する。たとえば、ＭＦＰ１００は、「社名」に分類されたエリア５０４と「担当者」に分類されたエリア５０６とを関連性ある組み合わせと特定することができる。好ましくは、ＭＦＰ１００は、さらに、エリアの配置にも基づいて関連性を判断する。たとえば、ＭＦＰ１００は、「社名」に分類されたエリア５０４と、その近傍に配置された「担当者」に分類されたエリア５０６とを関連性ある組み合わせと特定することができる。 In step # 4, the MFP 100 determines the relevance of each area based on the classification result of each area. For example, the MFP 100 can identify the area 504 classified as “company name” and the area 506 classified as “person in charge” as a relevant combination. Preferably, MFP 100 further determines relevance based on the arrangement of areas. For example, the MFP 100 can specify an area 504 classified as “company name” and an area 506 classified as “person in charge” arranged in the vicinity thereof as a relevant combination.

ＭＦＰ１００は、関連性あるエリア（この例ではエリア５０４およびエリア５０６）に対してタグを付与するなどしてグループ化する。そして、ＭＦＰ１００は、関連性があるとされた複数のエリアそれぞれのＯＣＲに用いるＯＣＲ機能を異なるものとするように、ＯＣＲ機能を決定し（ステップ＃５）、ＯＣＲ処理を実行させる（ステップ＃６）。たとえば、上記の例では、ＭＦＰ１００はエリア５０４およびエリア５０６それぞれのＯＣＲ処理に利用するＯＣＲ機能を異なるＯＣＲ機能と決定する。異なるＯＣＲ機能とは、エリア５０４とエリア５０６とのそれぞれに対して異なる提供元である第１のサーバーの提供するＯＣＲ機能と第２のサーバーの提供するＯＣＲ機能とを利用することであってもよいし、第１のサーバーの提供する同一のＯＣＲ機能をエリア５０４とエリア５０６とのそれぞれに対して異なるタイミングで利用することであってもよい。 The MFP 100 groups related areas (in this example, area 504 and area 506) by adding tags or the like. Then, the MFP 100 determines the OCR function so that the OCR function used for the OCR of each of the plurality of areas that are related to each other is different (step # 5), and executes the OCR process (step # 6). ). For example, in the above example, MFP 100 determines that the OCR function used for the OCR processing in each of area 504 and area 506 is a different OCR function. The different OCR function is to use the OCR function provided by the first server and the OCR function provided by the second server which are different providers for the areas 504 and 506, respectively. Alternatively, the same OCR function provided by the first server may be used for the areas 504 and 506 at different timings.

このように、ＭＦＰ１００は、関連性があると判断されたエリアに対して異なるＯＣＲ機能を利用するので、たとえば社名と担当者とに対して異なるＯＣＲ機能を利用することになる。そのため、複数の情報の間の関連性が分断される。つまり、同一の機会に同一のサーバーの提供するＯＣＲ機能において、たとえば社名および担当者などの関連性のある情報がＯＣＲ処理されることがない。そのため、当該ドキュメントの内容の漏えいを効果的に防止することができる。 Thus, since MFP 100 uses different OCR functions for areas that are determined to be related, for example, different OCR functions are used for company names and persons in charge. Therefore, the relevance between a plurality of information is divided. That is, in the OCR function provided by the same server at the same opportunity, relevant information such as a company name and a person in charge is not subjected to OCR processing. Therefore, it is possible to effectively prevent leakage of the contents of the document.

なお、ＭＦＰ１００は、関連性の有無のみならず、関連性の度合を判断してもよい。すなわち、ＭＦＰ１００は予め分類ごとに関連性の度合を記憶しておき、レイアウト分析の結果である各エリアの分類に応じて、エリア間の関連性の度合を判断することができる。この場合、ＭＦＰ１００は、エリア間の関連性の度合に応じて各エリアに用いるＯＣＲ機能を決定してもよい。すなわち、ＭＦＰ１００は、エリア間の関連性が所定度以上である場合に異なるＯＣＲ機能を利用すると決定し、上記所定度に満たない場合には同一のＯＣＲ機能を利用してもよいと決定してもよい。または、ＯＣＲ機能にセキュリティーレベルが設定されている場合、ＭＦＰ１００は、エリア間の関連性が所定度以上である場合にはこれらエリアそれぞれに対してセキュリティーレベルの高い、それぞれ異なるＯＣＲ機能を利用する、と決定してもよい。 Note that the MFP 100 may determine not only the presence / absence of relevance but also the degree of relevance. That is, MFP 100 can store the degree of relevance for each classification in advance, and can determine the degree of relevance between areas according to the classification of each area as a result of layout analysis. In this case, MFP 100 may determine an OCR function to be used for each area according to the degree of association between the areas. That is, MFP 100 determines to use a different OCR function when the relationship between areas is greater than or equal to a predetermined level, and determines that the same OCR function may be used when the predetermined level is not reached. Also good. Alternatively, when a security level is set for the OCR function, the MFP 100 uses a different OCR function having a high security level for each area when the relationship between the areas is equal to or greater than a predetermined level. May be determined.

さらに、ＭＦＰ１００は、スキャン画像のエリアごとにセキュリティーレベルが設定されている場合、当該エリアのセキュリティーレベルもさらに考慮して利用するＯＣＲ機能を決定してもよい。つまり、好ましくはＭＦＰ１００は、エリアに設定されたセキュリティーレベルが高い場合には、セキュリティーレベルの高いＯＣＲ機能を利用すると決定する。 Further, when a security level is set for each area of the scanned image, MFP 100 may determine an OCR function to be used in consideration of the security level of the area. In other words, MFP 100 preferably determines to use the OCR function with a high security level when the security level set for the area is high.

エリアごとのセキュリティーレベルは、予めドキュメントの作成者が設定してもよいし、フォーマットに関連付けてメモリーに登録されていてもよいし、予めＭＦＰ１００がキーワードに応じたセキュリティーレベルを記憶しておき、当該エリアから検出されたキーワードに基づいて判断してもよい。 The security level for each area may be set in advance by the document creator, or may be registered in the memory in association with the format, or the MFP 100 may store the security level corresponding to the keyword in advance, and You may judge based on the keyword detected from the area.

一方、上記ステップ＃３のレイアウト解析によって「特殊文字列」と分類されたエリア５０４については、ＭＦＰ１００はさらに小エリアに分割し、各小エリアの処理に利用するＯＣＲ機能を異なるＯＣＲ機能に決定する。または、ＯＣＲ機能にセキュリティーレベルが設定されている場合、ＭＦＰ１００は「特殊文字列」と分類されたエリア５０４に対して、特殊文字列に分類されていないエリアに対して利用するＯＣＲ機能よりもセキュリティーレベルの高いＯＣＲ機能を利用すると決定する。 On the other hand, for the area 504 classified as “special character string” by the layout analysis in step # 3, the MFP 100 further divides the area 504 into smaller areas, and determines the OCR function used for the processing of each small area as a different OCR function. . Alternatively, when the security level is set for the OCR function, the MFP 100 is more secure than the OCR function used for the area 504 classified as “special character string” for the area not classified as the special character string. It is decided to use a high-level OCR function.

「特殊文字列」と分類されたエリアに記載された内容はレイアウト解析では不明であるため、他のエリアに対する関連性が判断不能である。ＭＦＰ１００が「特殊文字列」と分類されたエリアに対して上記のように利用するＯＣＲ機能を決定することで、当該エリアの内容の漏えいをより効果的に防止することができる。 Since the contents described in the area classified as “special character string” are unknown in the layout analysis, the relevance to other areas cannot be determined. By determining the OCR function to be used as described above for the area classified as “special character string” by the MFP 100, it is possible to more effectively prevent leakage of the contents of the area.

また、他の例として上記のようにして「住所」と分類されたエリア５０５，５０８についても、ＭＦＰ１００は、さらに小エリアに分類し、各小エリアの処理に利用するＯＣＲ機能を異なるＯＣＲ機能に決定するようにしてもよい。すなわち、ＭＦＰ１００は、特定の分類に対しては、さらに小エリアに分類し、各小エリアの処理に利用するＯＣＲ機能を異なるＯＣＲ機能に決定するようにしてもよい。このようにすることで、当該エリアの内容の漏えいをより効果的に防止することができる。 As another example, the areas 505 and 508 classified as “address” as described above are further classified into smaller areas by the MFP 100, and the OCR function used for processing of each small area is changed to a different OCR function. It may be determined. That is, MFP 100 may further classify into a small area for a specific classification, and determine the OCR function used for the processing of each small area as a different OCR function. By doing in this way, the leakage of the content of the said area can be prevented more effectively.

ＭＦＰ１００は、各ＯＣＲ機能からＯＣＲ処理の結果であるテキスト情報を受け取り、それらを統合することで（ステップ＃７）、ドキュメントの内容を得ることができる。 The MFP 100 receives the text information that is the result of the OCR processing from each OCR function and integrates them (step # 7), thereby obtaining the contents of the document.

なお、ＭＦＰ１００は、ＯＣＲ機能にセキュリティーレベルが設定されている場合、さらに、スキャン画像に対して設定されているセキュリティーレベルもさらに考慮してＯＣＲ機能を決定してもよい。すなわち、スキャン画像に対して高いセキュリティーレベルが設定されている場合、ＭＦＰ１００は、最も高いセキュリティーレベルが設定されているＯＣＲ機能を用いると決定することができる。最もセキュリティーレベルの高いＯＣＲ機能としては、当該ＭＦＰ１００自身に搭載されているＯＣＲ機能が挙げられる。一方、スキャン画像に対して設定されているセキュリティーレベルが低い場合には、ＭＦＰ１００は、上記のようにエリア間の関連性の度合を考慮することなく、スキャン画像全体に対していずれかのＯＣＲ機能を利用すると決定してもよい。このようにすることで、処理を効率化、高速化することができる。従って、ＦＭＰ１００は、スキャン画像に対して設定されているセキュリティーレベルが所定範囲の場合にのみ、上記のようにレイアウト分析の結果抽出されたエリア間の関連性の度合に応じて利用するＯＣＲ機能を決定するようにしてもよい。 When the security level is set for the OCR function, the MFP 100 may determine the OCR function by further considering the security level set for the scanned image. That is, when a high security level is set for the scanned image, MFP 100 can determine to use the OCR function for which the highest security level is set. As the OCR function having the highest security level, an OCR function installed in the MFP 100 itself can be cited. On the other hand, when the security level set for the scanned image is low, the MFP 100 does not consider the degree of association between the areas as described above, and any one of the OCR functions for the entire scanned image. You may decide to use. By doing so, the processing can be made more efficient and faster. Therefore, the FMP 100 uses an OCR function that is used according to the degree of relevance between areas extracted as a result of layout analysis as described above only when the security level set for the scanned image is within a predetermined range. It may be determined.

好ましくは、ＭＦＰ１００は、エリアごとに決定した利用するＯＣＲ機能をＯＣＲ情報として、ドキュメントのフォーマットに関連付けてメモリーに記憶する。たとえば、図３の例の場合、ＭＦＰ１００は、ドキュメントのフォーマット「請求書」に関連付けて、エリア５０１〜５１７それぞれの分類や利用するＯＣＲ機能をメモリーに記憶する。そして、次回の処理でレイアウト解析の結果、処理対象のスキャン画像がフォーマット「請求書」のものであると判断されると、ＭＦＰ１００はメモリーから当該フォーマット「請求書」に関連付けられたＯＣＲ情報を読み出して処理対象のスキャン画像に適用する。このようにすることで、２回目以降の処理において利用するＯＣＲ機能を決定する処理を高速化することができる。従って、好ましくは、ＭＦＰ１００は、スキャン画像がページ単位で分けられたものである場合、つまり、ドキュメントが複数ページからなるものである場合、ページ単位でレイアウト解析を実行して当該ページのフォーマットを特定し、当該フォーマットに関連付けられたＯＣＲ情報がメモリーに登録されているか否かを確認する。 Preferably, MFP 100 stores the OCR function to be used determined for each area as OCR information in association with a document format in a memory. For example, in the example of FIG. 3, the MFP 100 stores the classification of each of the areas 501 to 517 and the OCR function to be used in a memory in association with the document format “invoice”. If it is determined that the scanned image to be processed is of the format “invoice” as a result of layout analysis in the next process, the MFP 100 reads out the OCR information associated with the format “invoice” from the memory. Applied to the scanned image to be processed. By doing in this way, the process which determines the OCR function utilized in the process after the 2nd time can be sped up. Therefore, preferably, when the scanned image is divided into pages, that is, when the document is composed of a plurality of pages, MFP 100 performs layout analysis for each page and specifies the format of the page. Then, it is confirmed whether or not the OCR information associated with the format is registered in the memory.

＜機能構成＞
図４は、ＭＦＰ１００が上記の動作を行なうための機能構成の具体例を表わしたブロック図である。図４の各機能は、ＭＦＰ１００のＣＰＵ１０がＲＯＭ１１に記憶されているプログラムをＲＡＭ１２上に読み出して実行することで、主に、ＣＰＵ１０で実現される。しかしながら、少なくとも一部機能が、図１に表わされた他の構成、または図示されない電気回路などの他のハードウェア構成によって実現されてもよい。 <Functional configuration>
FIG. 4 is a block diagram showing a specific example of a functional configuration for causing MFP 100 to perform the above-described operation. Each function of FIG. 4 is realized mainly by the CPU 10 when the CPU 10 of the MFP 100 reads the program stored in the ROM 11 onto the RAM 12 and executes it. However, at least a part of the functions may be realized by another configuration shown in FIG. 1 or another hardware configuration such as an electric circuit (not shown).

図４を参照して、メモリーの一例としてのＨＤＤ１３は、ドキュメントのフォーマットに関連付けてエリアごとに利用すると決定されたＯＣＲ機能を表わした情報であるＯＣＲ情報を記憶するための記憶領域であるＯＣＲ情報記憶部１３１を含む。 Referring to FIG. 4, HDD 13 as an example of memory is OCR information which is a storage area for storing OCR information which is information representing an OCR function determined to be used for each area in association with a document format. A storage unit 131 is included.

さらに図４を参照して、ＣＰＵ１０は、スキャナー１４やネットワークコントローラー１７を介して他の装置からスキャン画像の入力を受け付けるための入力部１０１と、入力されたスキャン画像に対してＯＣＲ処理することで得られるテキストデータを取得するためのＯＣＲ処理部１０２と、スキャン画像に対してＯＣＲ処理を実行するためのＯＣＲ実行部１０３と、ＯＣＲ処理部１０２でのＯＣＲ機能の利用を制御するための制御部１０４とを含む。好ましくは、入力部１０１は、スキャン画像に対してセキュリティーレベルを設定するための設定部１０９を含む。 Further, referring to FIG. 4, the CPU 10 performs OCR processing on the input scan image and the input unit 101 for receiving the input of the scan image from another device via the scanner 14 or the network controller 17. An OCR processing unit 102 for acquiring obtained text data, an OCR execution unit 103 for executing OCR processing on a scanned image, and a control unit for controlling use of the OCR function in the OCR processing unit 102 104. Preferably, the input unit 101 includes a setting unit 109 for setting a security level for the scanned image.

制御部１０４は、スキャン画像のレイアウト解析を行なうための解析部１０５と、解析されたレイアウトに基づいてスキャン画像から文字（または文字列）を含むエリアを抽出するための抽出部１０６と、抽出されたエリア間の関連性の度合（関連性の有無を含む）を判断するための判断部１０７と、判断されたエリア間の関連性の度合に基づいて各エリアのＯＣＲ処理に利用するＯＣＲ機能を決定するための決定部１０８とを含む。 The control unit 104 is extracted with an analysis unit 105 for performing layout analysis of the scanned image, an extraction unit 106 for extracting an area including characters (or character strings) from the scanned image based on the analyzed layout. A determination unit 107 for determining the degree of relevance between areas (including the presence or absence of relevance), and an OCR function used for OCR processing of each area based on the determined degree of relevance between areas And a determination unit 108 for determination.

決定部１０８で決定したエリアごとに利用するＯＣＲ機能は、当該ドキュメントのレイアウトに関連付けてＯＣＲ情報としてＯＣＲ情報記憶部１３１に記憶される。また、解析部１０５で解析されたレイアウトから特定されるドキュメントのフォーマットに関連付けられたＯＣＲ情報がＯＣＲ情報記憶部１３１に記憶されている場合、制御部１０４はＯＣＲ情報記憶部１３１から街頭するＯＣＲ情報を読み出す。この場合、決定部１０８は、ＯＣＲ情報に基づいて各エリアに利用するＯＣＲ機能を決定する。 The OCR function used for each area determined by the determination unit 108 is stored in the OCR information storage unit 131 as OCR information in association with the layout of the document. When the OCR information associated with the document format specified from the layout analyzed by the analysis unit 105 is stored in the OCR information storage unit 131, the control unit 104 stores the OCR information on the street from the OCR information storage unit 131. Is read. In this case, the determination unit 108 determines an OCR function to be used for each area based on the OCR information.

判断部１０７は、各エリアに含まれるキーワードに基づいて関連性の度合を判断する。好ましくは、判断部１０７は、さらに、スキャン画像における各エリアの配置も考慮して関連性の度合を判断する。 The determination unit 107 determines the degree of relevance based on the keywords included in each area. Preferably, the determination unit 107 further determines the degree of relevance in consideration of the arrangement of each area in the scan image.

決定部１０８は、エリア間の関連性の度合が所定度合以上である場合に、各エリアに対して異なるＯＣＲ機能を利用すると決定する。異なるＯＣＲ機能とは、それぞれ異なる提供元から提供されるＯＣＲ機能であってもよいし、同一のＯＣＲ機能を異なるタイミングで利用することであってもよい。 The determination unit 108 determines to use a different OCR function for each area when the degree of association between the areas is equal to or greater than a predetermined degree. Different OCR functions may be OCR functions provided from different providers, or may use the same OCR function at different timings.

なお、判断部１０７において関連性の度合が判断不能であったエリアについては、好ましくは決定部１０８は、当該エリアをさらに複数のエリアに分割してそれぞれ異なるＯＣＲ機能を利用すると決定する。 For areas where the degree of relevance cannot be determined by the determination unit 107, the determination unit 108 preferably determines that the area is further divided into a plurality of areas and uses different OCR functions.

＜動作フロー＞
図５は、ＭＦＰ１００での動作の流れを表わすフローチャートである。図５のフローチャートに表わされた動作は、ＭＦＰ１００のＣＰＵ１０がＲＯＭ１１に記憶されているプログラムをＲＡＭ１２上に読み出して実行し、図４の各機能を発揮させることによって実現される。 <Operation flow>
FIG. 5 is a flowchart showing an operation flow in MFP 100. The operation shown in the flowchart of FIG. 5 is realized by causing the CPU 10 of the MFP 100 to read out and execute a program stored in the ROM 11 on the RAM 12 and exhibit the functions shown in FIG.

図５を参照して、ＣＰＵ１０は、スキャン画像の入力を受け付けると（ステップＳ１０１でＹＥＳ）、当該スキャン画像にセキュリティーレベルが設定されているか否かを確認する。入力されたスキャン画像に対して高いセキュリティーレベル、または低いセキュリティーレベルが設定されている場合、または、セキュリティーレベルの設定のない場合（ステップＳ１０３でＮＯ）、ＣＰＵ１０は、以降の、エリアごとに異なるＯＣＲ機能を利用するための処理を行なうことなく、所定のＯＣＲ機能を利用すると決定し（ステップＳ１２５）、ＯＣＲ処理を実行する（ステップＳ１２７）。すなわち、スキャン画像のセキュリティーレベルが所定レベルよりも高い場合には、ＣＰＵ１０は、当該スキャン画像のすべてのエリアに対して、たとえばＭＦＰ１００に搭載されているＯＣＲ機能など、セキュリティーレベルの高いＯＣＲ機能を利用すると決定する。スキャン画像のセキュリティーレベルが所定レベルよりも低い場合、またはセキュリティーレベルが設定されていない場合には、ＣＰＵ１０は、当該スキャン画像のすべてのエリアに対して、予め規定されたＯＣＲ機能を利用すると決定する。 Referring to FIG. 5, when receiving an input of a scanned image (YES in step S101), CPU 10 checks whether or not a security level is set for the scanned image. When a high security level or a low security level is set for the input scanned image, or when no security level is set (NO in step S103), the CPU 10 performs different OCR for each area thereafter. It is determined that the predetermined OCR function is used without performing the process for using the function (step S125), and the OCR process is executed (step S127). That is, when the security level of the scanned image is higher than the predetermined level, the CPU 10 uses an OCR function with a high security level such as an OCR function installed in the MFP 100 for all areas of the scanned image. Then decide. When the security level of the scanned image is lower than the predetermined level, or when the security level is not set, the CPU 10 determines to use a predetermined OCR function for all areas of the scanned image. .

入力されたスキャン画像に設定されているセキュリティーレベルが所定範囲である場合（ステップＳ１０３でＹＥＳ）、ＣＰＵ１０は、スキャン画像に対してレイアウト解析を行なって、当該スキャン画像の元のドキュメントのレイアウトを特定して、文字を含んでいるエリアを抽出する（ステップＳ１０５）。 If the security level set for the input scan image is within a predetermined range (YES in step S103), the CPU 10 performs a layout analysis on the scan image and specifies the original document layout of the scan image. Then, an area including characters is extracted (step S105).

特定されたレイアウトから得られるフォーマットに関連付けてＯＣＲ情報がメモリーに記憶されている場合（ステップＳ１０７でＹＥＳ）、ＣＰＵ１０は、以下の処理に替えて、メモリーから街頭するＯＣＲ情報を読み出すことで（ステップＳ１０９）、エリアごとの利用するＯＣＲ機能を決定し（ステップＳ１２５）、ＯＣＲ処理を実行する（ステップＳ１２７）。 When the OCR information is stored in the memory in association with the format obtained from the specified layout (YES in step S107), the CPU 10 reads the OCR information on the street from the memory instead of the following process (step S107). In step S109, an OCR function to be used for each area is determined (step S125), and an OCR process is executed (step S127).

メモリーにＯＣＲ情報が登録されていない場合（ステップＳ１０７でＮＯ）、ＣＰＵ１０は、エリアごとに利用するＯＣＲ機能を決定する処理を実行する。すなわち、ＣＰＵ１０は、上記ステップＳ１０５のレイアウト解析によって抽出されたエリアごとにＯＣＲ処理を実行して（ステップＳ１１１）、当該エリアに予め規定したキーワードが含まれているか否かを検索する（ステップＳ１１３）。その結果、当該エリアにキーワードが含まれる場合には（ステップＳ１１５でＹＥＳ）、ＣＰＵ１０は、含まれるキーワードに応じて当該エリアを分類する（ステップＳ１１７）。当該エリアにいずれのキーワードも含まれない場合には（ステップＳ１１５でＮＯ）、ＣＰＵ１０は、当該エリアを「特殊文字列」に分類する（ステップＳ１１９）。 When the OCR information is not registered in the memory (NO in step S107), the CPU 10 executes a process for determining an OCR function to be used for each area. That is, the CPU 10 executes OCR processing for each area extracted by the layout analysis in step S105 (step S111), and searches whether or not a predetermined keyword is included in the area (step S113). . As a result, when a keyword is included in the area (YES in step S115), the CPU 10 classifies the area according to the included keyword (step S117). If none of the keywords are included in the area (NO in step S115), the CPU 10 classifies the area as a “special character string” (step S119).

ＣＰＵ１０は、以上のステップＳ１１１〜Ｓ１１９の処理を抽出されたすべてのエリアに対して実行する。そして、すべてのエリアに対して処理が終了すると（ステップＳ１２１でＮＯ）、ＣＰＵ１０は、各エリアの分類結果に基づいて各エリアの関連性を判断する。すなわち、ＣＰＵ１０は、各エリアの分類結果に応じて関連性あるエリアのグループを特定して、タグを付与するなどしてグループ化する（ステップＳ１２３）。このとき、ＣＰＵ１０は、さらに各エリアの配置も考慮して関連性あるエリアのグループを特定してもよい。 CPU10 performs the process of the above step S111-S119 with respect to all the extracted areas. When the processing is completed for all areas (NO in step S121), the CPU 10 determines the relevance of each area based on the classification result of each area. That is, the CPU 10 identifies a group of related areas according to the classification result of each area, and groups them by adding a tag or the like (step S123). At this time, the CPU 10 may specify a group of related areas in consideration of the arrangement of each area.

ＣＰＵ１０は、関連性があるとグループ化された複数のエリアそれぞれに利用するＯＣＲ機能を異なるものとするように、各エリアに対して利用するＯＣＲ機能を決定する（ステップＳ１２５）。また、上記ステップＳ１１９で「特殊文字列」と分類されたエリアに対しては、ＣＰＵ１０はさらに当該エリアを小エリアに分割し、各小エリアそれぞれに利用するＯＣＲ機能を異なるものとするように、各小エリアに対して利用するＯＣＲ機能を決定する。 The CPU 10 determines the OCR function to be used for each area so that the OCR function to be used for each of the plurality of grouped areas is different if there is a relationship (step S125). In addition, for the area classified as “special character string” in step S119, the CPU 10 further divides the area into small areas and uses different OCR functions for each small area. The OCR function to be used for each small area is determined.

ＣＰＵ１０は、スキャン画像のうちの対象のエリアのみを、当該エリアのＯＣＲ処理に利用すると決定されたＯＣＲ機能を提供するサーバー３００に渡して、ＯＣＲ処理を実行させる（ステップＳ１２７）。 The CPU 10 passes only the target area of the scanned image to the server 300 that provides the OCR function determined to be used for the OCR process of the area, and executes the OCR process (step S127).

ＣＰＵ１０は、利用したＯＣＲ機能ごとに、当該ＯＣＲ機能を提供するサーバー３００から処理結果であるテキストを受け付ける。そして、それらをエリアの配置に基づいて統合することで（ステップＳ１２９）、スキャン画像の内容を得る。 For each used OCR function, the CPU 10 accepts a text as a processing result from the server 300 that provides the OCR function. Then, by integrating them based on the arrangement of the areas (step S129), the contents of the scanned image are obtained.

なお、上記ステップＳ１０１で入力されたスキャン画像がページ単位に分けられたものであった場合、ＣＰＵ１０は、ページごとに上記ステップＳ１０３〜Ｓ１２９を繰り返す。これによって、ＣＰＵ１０は、ページ単位でレイアウト解析を行ない、当該レイアウトから得られたフォーマットに関連付けられたＯＣＲ情報がメモリーに登録されている場合には、（以降のエリアごとにＯＣＲ機能を決定する処理を行なうことなく）登録されているＯＣＲ情報を利用してＯＣＲ機能を決定して処理を実行させることができる。 If the scanned image input in step S101 is divided into pages, the CPU 10 repeats steps S103 to S129 for each page. As a result, the CPU 10 performs layout analysis in units of pages, and when the OCR information associated with the format obtained from the layout is registered in the memory (the process of determining the OCR function for each subsequent area). It is possible to determine the OCR function using the registered OCR information and execute the process (without performing the process).

＜実施の形態の効果＞
以上の処理がＭＦＰ１００において実行されることで、ＭＦＰ１００では、セキュリティーを確保しつつ外部装置の提供するＯＣＲ機能も利用することができる。すなわち、本実施の形態にかかるＭＦＰ１００を利用すると、たとえば住所と社名、商品名と単価など、関連性の度合の高い内容を含むエリアがそれぞれ異なるＯＣＲ機能を利用してＯＣＲ処理がなされる。そのため、たとえ当該ＯＣＲ機能を提供するサーバー３００を介してＯＣＲ処理結果である当該エリアの内容が漏えいした場合であっても、関連性が分断されているため、その内容の意味の漏えいは防止される。つまり、上記の例であれば、商品名のみ、または単価のみが第三者に漏えいしたとしても、「その商品の単価」として意味を構成するような内容で第三者に漏えいすることがない。そのため、すべて自身のＯＣＲ機能を利用する場合と比較して効率的にＯＣＲ処理を行なうことができる。つまり、本実施の形態にかかるＭＦＰ１００では、セキュリティーを確保しつつ効率的にＯＣＲ機能を利用することができる。 <Effect of Embodiment>
By executing the above processing in MFP 100, MFP 100 can also use the OCR function provided by the external device while ensuring security. In other words, when MFP 100 according to the present embodiment is used, OCR processing is performed using OCR functions in which areas including highly relevant contents such as address and company name, product name and unit price are different. Therefore, even if the content of the area that is the result of the OCR processing is leaked through the server 300 that provides the OCR function, since the relationship is broken, the leakage of the meaning of the content is prevented. The In other words, in the above example, even if only the product name or only the unit price is leaked to a third party, it will not be leaked to a third party with a content that constitutes a meaning as “unit price of the product”. . Therefore, OCR processing can be performed more efficiently than when all of the OCR functions are used. That is, MFP 100 according to the present embodiment can efficiently use the OCR function while ensuring security.

さらに、本実施の形態にかかるＭＦＰ１００では、いったんエリアごとに利用するＯＣＲ機能を決定すると、その結果をＯＣＲ情報として、フォーマットに関連付けてメモリーに記憶する。そして、次回以降、対象のスキャン画像をレイアウト解析してフォーマットを特定すると、当該フォーマットに関連付けて記憶されているＯＣＲ情報を読み出して利用するＯＣＲ機能を特定する。一般的に、ドキュメントをＯＣＲ処理して用いる場面はオフィス環境であることが多く、多くの場合には「請求書」や「納品書」など、フォーマットに沿って作成されたドキュメントが処理対象となる。従って、フォーマットごとにエリアごとの利用するＯＣＲ機能を予め登録しておくことで、２回目以降、処理のつどＯＣＲ機能を決定するための処理を繰り返す必要がなく、処理を高速化することができる。 Further, in MFP 100 according to the present embodiment, once an OCR function to be used for each area is determined, the result is stored as OCR information in a memory in association with a format. After the next time, when the target scan image is subjected to layout analysis and the format is specified, the OCR function that reads and uses the OCR information stored in association with the format is specified. In general, scenes used for OCR processing of documents are often in an office environment, and in many cases, documents created in accordance with the format such as “invoice” and “invoice” are processed. . Therefore, by registering in advance the OCR function to be used for each area for each format, it is not necessary to repeat the process for determining the OCR function for each process after the second time, and the process can be speeded up. .

＜他の例１＞
上記の例では、ＭＦＰ１００が自身の搭載するＯＣＲ機能を利用してエリアに含まれるキーワードを特定し、分類するものとしている。エリアの分類はＯＣＲ機能を利用する方法に限定されない。他の例として、予め管理者等の特定のユーザーやドキュメントの作成者によってエリアごとに分類され、当該分類を表わすタグ等が付与されていてもよい。さらには、予め管理者等の特定のユーザーやドキュメントの作成者によってフォーマットごとのＯＣＲ情報が登録されていてもよい。さらには、そのＯＣＲ情報は、ＭＦＰ１００のメモリーのみならず、（サーバー３００等の）他の記憶装置に登録され、ＭＦＰ１００が当該記憶装置にアクセスしてＯＣＲ情報を読み出すようにしてもよい。または、スキャン画像にＯＣＲ情報が付加されていてもよい。 <Other example 1>
In the above example, the MFP 100 specifies and classifies the keywords included in the area using the OCR function installed in the MFP 100. The area classification is not limited to the method using the OCR function. As another example, a specific user such as an administrator or a document creator may classify each area in advance, and a tag representing the classification may be given. Furthermore, OCR information for each format may be registered in advance by a specific user such as an administrator or a document creator. Further, the OCR information may be registered not only in the memory of MFP 100 but also in another storage device (such as server 300), and MFP 100 may access the storage device and read the OCR information. Alternatively, OCR information may be added to the scanned image.

＜他の例２＞
なお、以上の例では、本実施の形態にかかる画像処理装置がＭＦＰであるものとし、さらに、当該ＭＦＰ自身もＯＣＲ機能を搭載するものとしている。しかしながら、画像処理装置はＭＦＰに限定されない。画像処理装置の他の例として、たとえば、汎用のコンピューターやいわゆるタブレットなどと呼ばれる端末装置であってもよい。つまり、ＯＣＲ機能を提供するサーバー３００と通信可能であって、画像データを処理対象のデータとして取扱い可能な装置であればいかなる装置であってもよい。 <Other example 2>
In the above example, the image processing apparatus according to the present embodiment is an MFP, and the MFP itself is also equipped with an OCR function. However, the image processing apparatus is not limited to the MFP. As another example of the image processing apparatus, for example, a terminal device called a general-purpose computer or a so-called tablet may be used. That is, any device can be used as long as it can communicate with the server 300 that provides the OCR function and can handle image data as data to be processed.

上記のように画像処理装置が自身がＯＣＲ機能を搭載しない場合には、画像処理装置は上記の例のように予めエリアに対して設定された分類を表わす情報に基づいて、当該エリアに対して利用するＯＣＲ機能を決定することができる。 When the image processing apparatus itself does not have the OCR function as described above, the image processing apparatus performs the processing for the area based on the information indicating the classification set in advance for the area as in the above example. The OCR function to be used can be determined.

＜他の例３＞
さらに、上述の処理をＭＦＰ１００や他の装置である画像処理装置に実行させるためのプログラムを提供することもできる。プログラムを提供することで、汎用のＭＦＰやコンピューターを本実施の形態にかかる画像処理装置として機能させることができる。 <Other example 3>
Furthermore, it is possible to provide a program for causing the MFP 100 or an image processing apparatus, which is another apparatus, to execute the above-described processing. By providing the program, a general-purpose MFP or computer can function as the image processing apparatus according to the present embodiment.

このようなプログラムは、コンピューターに付属するフレキシブルディスク、ＣＤ−ＲＯＭ（Compact Disk-Read Only Memory）、ＲＯＭ、ＲＡＭおよびメモリカードなどのコンピューター読取り可能な記録媒体にて記録させて、プログラム製品として提供することもできる。あるいは、コンピューターに内蔵するハードディスクなどの記録媒体にて記録させて、プログラムを提供することもできる。また、ネットワークを介したダウンロードによって、プログラムを提供することもできる。 Such a program is recorded on a computer-readable recording medium such as a flexible disk attached to the computer, a CD-ROM (Compact Disk-Read Only Memory), a ROM, a RAM, and a memory card, and provided as a program product. You can also Alternatively, the program can be provided by being recorded on a recording medium such as a hard disk built in the computer. A program can also be provided by downloading via a network.

なお、本発明にかかるプログラムは、コンピューターのオペレーティングシステム（ＯＳ）の一部として提供されるプログラムモジュールのうち、必要なモジュールを所定の配列で所定のタイミングで呼出して処理を実行させるものであってもよい。その場合、プログラム自体には上記モジュールが含まれずＯＳと協働して処理が実行される。このようなモジュールを含まないプログラムも、本発明にかかるプログラムに含まれ得る。 A program according to the present invention is a program module that is provided as a part of an operating system (OS) of a computer and that executes necessary processes by calling necessary modules in a predetermined arrangement at a predetermined timing. Also good. In that case, the program itself does not include the module, and the process is executed in cooperation with the OS. A program that does not include such a module can also be included in the program according to the present invention.

また、本発明にかかるプログラムは他のプログラムの一部に組込まれて提供されるものであってもよい。その場合にも、プログラム自体には上記他のプログラムに含まれるモジュールが含まれず、他のプログラムと協働して処理が実行される。このような他のプログラムに組込まれたプログラムも、本発明にかかるプログラムに含まれ得る。 The program according to the present invention may be provided by being incorporated in a part of another program. Even in this case, the program itself does not include the module included in the other program, and the process is executed in cooperation with the other program. Such a program incorporated in another program can also be included in the program according to the present invention.

提供されるプログラム製品は、ハードディスクなどのプログラム格納部にインストールされて実行される。なお、プログラム製品は、プログラム自体と、プログラムが記録された記録媒体とを含む。 The provided program product is installed in a program storage unit such as a hard disk and executed. The program product includes the program itself and a recording medium on which the program is recorded.

今回開示された実施の形態はすべての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The embodiment disclosed this time should be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the description above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.

１０ＣＰＵ、１１ＲＯＭ、１２ＲＡＭ、１３ＨＤＤ、１４スキャナー、１５プリンター、１６操作パネル、１７ネットワークコントローラー、１００ＭＦＰ、１０１入力部、１０２ＯＣＲ処理部、１０３ＯＣＲ実行部、１０４制御部、１０５解析部、１０６抽出部、１０７判断部、１０８決定部、１０９設定部、１３１ＯＣＲ情報記憶部、３００サーバー、５０１〜５１７エリア。 10 CPU, 11 ROM, 12 RAM, 13 HDD, 14 scanner, 15 printer, 16 operation panel, 17 network controller, 100 MFP, 101 input unit, 102 OCR processing unit, 103 OCR execution unit, 104 control unit, 105 analysis unit 106 extraction unit 107 determination unit 108 determination unit 109 setting unit 131 OCR information storage unit 300 server 501-517 area.

Claims

An image processing apparatus capable of processing a scanned image,
Input means for receiving input of the scanned image;
OCR processing means for acquiring text data obtained by performing OCR processing on the scanned image using an OCR (Optical Character Reader) function;
Control means for controlling the use of the OCR function in the OCR processing means,
The control means includes determination means for determining the degree of relevance between the first area and the second area extracted from the scan image based on the layout of the scan image,
When the degree of relevance between the first area and the second area is equal to or greater than a predetermined degree, the control means performs different OCR for each of the first area and the second area. An image processing apparatus that controls to perform OCR processing using a function.

When the OCR processing means can use a plurality of OCR functions provided by different providers, the control means provides different provisions for each of the first area and the second area. The image processing apparatus according to claim 1, wherein control is performed so that OCR processing is performed using an OCR function provided from the beginning.

The control means controls to perform OCR processing for each of the first area and the second area by using the same OCR function provided from the same provider at different timings. The image processing apparatus according to claim 1.

The image processing apparatus according to claim 1, wherein the determination unit determines the degree of relevance based on keywords included in the first area and the second area.

The image processing apparatus according to claim 1, wherein the determination unit determines the degree of relevance based on the arrangement of the first area and the second area in the scanned image. .

When the determination means cannot determine the relevance of the first area to the second area, the control means further divides the first area into a plurality of areas and uses different OCR functions. The image processing apparatus according to claim 1, wherein the image processing apparatus controls to perform OCR processing.

Each layout further includes a memory having a storage area for storing an OCR function used for each area included in the layout,
When the layout of the scanned image corresponds to the layout stored in the storage area, the control means performs OCR processing of the scanned image using the OCR function stored in the storage area. The image processing apparatus according to claim 1, which is controlled.

The control means performs OCR processing using different OCR functions for the first area and the second area when the security level set for the scanned image is within a predetermined range. The image processing apparatus according to claim 1, wherein the image processing apparatus is controlled to perform.

The image processing apparatus according to claim 8, wherein the input unit includes a setting unit for setting a security level for the scanned image.

The control means is further used for the area based on the security level set for the area extracted from the scanned image and the security level of the OCR function available in the OCR processing means. The image processing apparatus according to claim 1, wherein an OCR function is determined.

The scanned image is divided into pages,
The control means controls to perform OCR processing using a different OCR function for each of the first area and the second area based on a layout of each page of the scanned image. Item 11. The image processing apparatus according to any one of Items 1 to 10.

A method of processing a scanned image,
Receiving an input of the scanned image;
Determining the degree of relevance between the first area and the second area extracted from the scanned image based on the layout of the scanned image;
Using the OCR function to obtain text data obtained by performing OCR processing on the scanned image,
The step of acquiring the text data is performed for each of the first area and the second area when the degree of association between the first area and the second area is a predetermined degree or more. An image processing method including a step of performing OCR processing using different OCR functions.

A program that causes a computer to process scanned images,
Receiving an input of the scanned image;
Determining the degree of relevance between the first area and the second area extracted from the scanned image based on the layout of the scanned image;
Using the OCR function to cause the computer to execute text data obtained by performing OCR processing on the scanned image;
The step of acquiring the text data is performed for each of the first area and the second area when the degree of association between the first area and the second area is a predetermined degree or more. An image processing program including a step of performing OCR processing using different OCR functions.