
JP2015122796A - Image reading device, document management system, and image reading control program

Info

Publication number: JP2015122796A
Application number: JP2015028555A
Authority: JP
Inventors: 貴博南; Takahiro Minami; 修二山下; Shuji Yamashita
Original assignee: Kyocera Document Solutions Inc
Current assignee: Kyocera Document Solutions Inc
Priority date: 2015-02-17
Filing date: 2015-02-17
Publication date: 2015-07-02
Anticipated expiration: 2032-08-29
Also published as: JP5993969B2

Abstract

PROBLEM TO BE SOLVED: To ensure that, when a stamped image (a document image on which a stamp character image representing characters is superposed) and information on the character strings included in the stamped image are output as one set, the characters corresponding to the stamp character image are reliably output.

SOLUTION: Input of character stamp information L, which is information on a stamp character CS to be superposed as an image on a document image DB obtained by reading a document, is accepted (S1 to S4), and document character information M, which is information on character strings C1 to C3 included in the document image DB, is extracted (S5 to S6). A stamp character image representing the stamp character CS is then generated and superposed on the document image DB to generate a stamped image DA (S7), and information-attached image data is generated that combines, as one set, the stamped image DA and meta information N, in which the character stamp information L is added to the document character information M (S8 to S9).

Description

The present invention relates to an image reading apparatus, a document management system, and an image reading control program, and in particular to a technique for outputting, as one set, an image in which an image representing characters is superimposed on an image of a document and information on the character strings included in the superimposed image.

Conventionally, as described for example in Patent Document 1 below, a technique is known for outputting a so-called searchable PDF: image data representing an image of a document read by the scanner unit of an MFP (multifunction peripheral) is converted into PDF data, character recognition is performed to create text data representing the character strings included in the image represented by the image data, and that text data is pasted onto the PDF data as transparent text data. In addition, as described for example in Patent Document 2 below, a technique is known for superimposing a bitmap representing characters entered by a user on an image of a document read by a reading unit (scanner unit).

By using these techniques, it is possible to generate an image in which a stamp character image, that is, an image representing characters entered by the user such as "Confidential" or "Secret", is superimposed on the image of a document, to extract information on the character strings included in the generated image (document character information), and to output the generated image and the extracted document character information as one set.

Patent Document 1: JP 2010-73016 A
Patent Document 2: JP 2001-75431 A

However, when character strings are extracted from an image in which a stamp character image is superimposed on the image of a document, the characters represented by the stamp character image might not be extracted. For example, the characters may be recognized as a graphic pattern because the resolution of the stamp character image is low, or they may not be properly recognized as characters because they overlap characters included in the image of the document.

As a result, when an image in which a stamp character image is superimposed on the image of a document (a stamped image) and the document character information extracted from the stamped image are output as one set, the output document character information might not include the characters represented by the stamp character image, even though the user entered them explicitly.

The present invention has been made in view of these circumstances. Its object is to provide an image reading apparatus, a document management system, and an image reading control program that can reliably output the characters corresponding to a stamp character image when outputting, as one set, a stamped image (an image in which a stamp character image representing characters is superimposed on the image of a document) and information on the character strings included in the stamped image.

An image reading apparatus according to the present invention includes an image reading unit, a character stamp information receiving unit, a document character information extraction unit, a stamped image generation unit, and an information-attached image data generation unit, as follows.

The image reading unit reads an image of a document.

The character stamp information receiving unit receives input of character stamp information. Character stamp information is information on a stamp character, that is, a character to be superimposed as an image on the document image read by the image reading unit, and associates stamp character information indicating the stamp character with stamp position information indicating the position on the document image at which a stamp character image representing the stamp character is to be superimposed. The unit receives, as the stamp character information, character information selected from a plurality of pieces of character information predetermined as stamp character options, and receives, as the stamp position information, information indicating a position selected from information indicating a plurality of predetermined positions on the document image.

The document character information extraction unit extracts document character information. Document character information is information on the character strings included in the document image, and associates document character string information indicating a character string included in the document image with document position information indicating the position at which that character string is arranged on the document image. The unit extracts it by executing: a process of converting the document image into an extraction image, which is an image of a resolution predetermined so as to raise the accuracy of extracting the document character information; a process of extracting a character string from the extraction image by executing predetermined character recognition processing on the extraction image; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image that corresponds to the position on the extraction image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information.

The stamped image generation unit operates after the document character information has been extracted by the document character information extraction unit. It generates a stamp character image that represents the stamp character corresponding to the received character stamp information and has the same resolution as the document image, converts that stamp character image into a temporary stamp character image having the same resolution as the extraction image, superimposes the temporary stamp character image on the extraction image at the position corresponding to the stamp position information included in the received character stamp information, and converts the extraction image with the temporary stamp character image superimposed into an image of the same resolution as the document image. It thereby generates a stamped image, which is an image in which the stamp character image is visibly superimposed on the document image.

The information-attached image data generation unit generates information-attached image data, which is data combining as one set (i) meta information, obtained by adding the character stamp information received by the character stamp information receiving unit (the stamp character information associated with the stamp position information) to the document character information extracted by the document character information extraction unit (the document character string information associated with the document position information), and (ii) the stamped image generated by the stamped image generation unit.

Alternatively, the image reading apparatus according to the present invention may be configured without the extraction image. In this form, the image reading unit, the character stamp information receiving unit, and the information-attached image data generation unit are as described above. The document character information extraction unit extracts the document character information by executing: a process of extracting a character string by executing predetermined character recognition processing on the document image itself; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information. After the document character information has been extracted, the stamped image generation unit generates a stamp character image that represents the stamp character corresponding to the received character stamp information and has the same resolution as the document image, and superimposes it on the document image at the position corresponding to the stamp position information included in the received character stamp information, thereby generating the stamped image in which the stamp character image is visibly superimposed on the document image.
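The key point of both forms is that the stamp text enters the meta information from the user's selection rather than from character recognition. The following is a minimal Python sketch of that idea only; the names `TextEntry`, `build_meta_info`, and `make_information_attached_image` are hypothetical, the position format is assumed to be an (x, y) pixel pair, and character recognition and image compositing are deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextEntry:
    """A character string paired with its position on the document image."""
    text: str
    position: tuple[int, int]  # assumed (x, y) in document-image pixels


def build_meta_info(document_entries, stamp_entry):
    """Add the accepted stamp entry to the extracted document entries.

    The stamp entry comes from the user's selection, not from character
    recognition, so it is present however the stamp renders on the page.
    """
    return list(document_entries) + [stamp_entry]


def make_information_attached_image(stamped_image, document_entries, stamp_entry):
    """Bundle the stamped image and its meta information as one set."""
    return {
        "image": stamped_image,
        "meta": build_meta_info(document_entries, stamp_entry),
    }


# Example with made-up data: two recognized strings and one selected stamp.
document_entries = [
    TextEntry("Quarterly report", (120, 80)),
    TextEntry("Sales", (120, 400)),
]
stamp_entry = TextEntry("Confidential", (50, 50))
bundle = make_information_attached_image(b"image-bytes", document_entries, stamp_entry)
```

In this sketch the document entries are assumed to have been extracted before the stamp was drawn, mirroring the order stated in the text, so the stamp cannot interfere with recognition of the document's own strings.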

A document management system according to the present invention includes: the image reading apparatus; an information-attached image data storage unit that stores the information-attached image data generated by the information-attached image data generation unit; and a search server that searches the information-attached image data stored in the information-attached image data storage unit for information-attached image data whose meta information includes a search keyword specified by a user.

An image reading control program according to the present invention causes a computer built into an image reading apparatus having an image reading unit that reads an image of a document to function as a character stamp information receiving unit, a document character information extraction unit, a stamped image generation unit, and an information-attached image data generation unit, as follows.

The character stamp information receiving unit receives input of character stamp information. Character stamp information is information on a stamp character, that is, a character to be superimposed as an image on the document image read by the image reading unit, and associates stamp character information indicating the stamp character with stamp position information indicating the position on the document image at which a stamp character image representing the stamp character is to be superimposed. The unit receives, as the stamp character information, character information selected from a plurality of pieces of character information predetermined as stamp character options, and receives, as the stamp position information, information indicating a position selected from information indicating a plurality of predetermined positions on the document image.

The document character information extraction unit extracts document character information. Document character information is information on the character strings included in the document image, and associates document character string information indicating a character string included in the document image with document position information indicating the position at which that character string is arranged on the document image. The unit extracts it by executing: a process of converting the document image into an extraction image, which is an image of a resolution predetermined so as to raise the accuracy of extracting the document character information; a process of extracting a character string from the extraction image by executing predetermined character recognition processing on the extraction image; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image that corresponds to the position on the extraction image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information.

The stamped image generation unit operates after the document character information has been extracted by the document character information extraction unit. It generates a stamp character image that represents the stamp character corresponding to the received character stamp information and has the same resolution as the document image, converts that stamp character image into a temporary stamp character image having the same resolution as the extraction image, superimposes the temporary stamp character image on the extraction image at the position corresponding to the stamp position information included in the received character stamp information, and converts the extraction image with the temporary stamp character image superimposed into an image of the same resolution as the document image. It thereby generates a stamped image, which is an image in which the stamp character image is visibly superimposed on the document image.

The information-attached image data generation unit generates information-attached image data, which is data combining as one set (i) meta information, obtained by adding the character stamp information received by the character stamp information receiving unit (the stamp character information associated with the stamp position information) to the document character information extracted by the document character information extraction unit (the document character string information associated with the document position information), and (ii) the stamped image generated by the stamped image generation unit.

Alternatively, the image reading control program according to the present invention may cause the computer to function without the extraction image. In this form, the character stamp information receiving unit and the information-attached image data generation unit are as described above. The document character information extraction unit extracts the document character information by executing: a process of extracting a character string by executing predetermined character recognition processing on the document image itself; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information. After the document character information has been extracted, the stamped image generation unit generates a stamp character image that represents the stamp character corresponding to the received character stamp information and is an image of the same resolution as the document image, and superimposes it on the document image at the position corresponding to the stamp position information included in the received character stamp information, thereby generating the stamped image in which the stamp character image is visibly superimposed on the document image.
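In the form that uses an extraction image, positions have to be carried between two resolutions: recognized strings are located on the extraction image but recorded in document-image coordinates, and the stamp position is given for the document image but applied on the extraction image. A minimal Python sketch of that mapping follows. It assumes both images cover the same page area and differ only by a uniform resolution ratio; the function names and the dpi values in the example are hypothetical and are not taken from the text.

```python
def to_document_position(extraction_pos, document_dpi, extraction_dpi):
    """Map an (x, y) pixel position on the extraction image to the document image."""
    scale = document_dpi / extraction_dpi
    x, y = extraction_pos
    return (round(x * scale), round(y * scale))


def to_extraction_position(document_pos, document_dpi, extraction_dpi):
    """Map an (x, y) pixel position on the document image to the extraction image."""
    scale = extraction_dpi / document_dpi
    x, y = document_pos
    return (round(x * scale), round(y * scale))


# Example with assumed resolutions: 600 dpi document image, 300 dpi extraction image.
string_pos_on_document = to_document_position((100, 250), 600, 300)
stamp_pos_on_extraction = to_extraction_position((200, 500), 600, 300)
```

Rounding to whole pixels is a simplification; a real implementation might track bounding boxes rather than single points.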

According to the present invention, it is possible to provide an image reading apparatus, a document management system, and an image reading control program that can reliably output the characters corresponding to a stamp character image when outputting, as one set, a stamped image (an image in which a stamp character image representing characters is superimposed on the image of a document) and information on the character strings included in the stamped image.

FIG. 1 is a system configuration diagram showing an example of a document management system according to the present invention.
FIG. 2 is a schematic structural diagram showing an example of a multifunction peripheral equipped with an image reading apparatus according to the present invention.
FIG. 3 is a block diagram showing an example of the electrical configuration of the multifunction peripheral.
FIG. 4 is a flowchart showing an example of the operation of generating information-attached image data.
FIG. 5 is an explanatory diagram showing an example of character stamp information.
FIG. 6 is an explanatory diagram showing an example of a document image.
FIG. 7 is an explanatory diagram showing an example of the relationship between a document image and an extraction image.
FIG. 8 is an explanatory diagram showing an example of an extraction image.
FIG. 9 is an explanatory diagram showing an example of information on character strings extracted from an extraction image.
FIG. 10 is an explanatory diagram showing an example of document character information.
FIG. 11 is an explanatory diagram showing an example of a stamped image.
FIG. 12 is an explanatory diagram showing an example of meta information.

Embodiments of the present invention are described below with reference to the drawings. FIG. 1 is a system configuration diagram showing an example of a document management system 900 according to the present invention.

As shown in FIG. 1, a multifunction peripheral 1, which is an example of the image reading apparatus according to the present invention, a search server 2, and a personal computer (hereinafter, PC) 3 are connected to a network 90 such as a LAN (local area network) so that they can exchange various data with one another.

The multifunction peripheral 1 prints the image of a document read by an image reading unit 5 described later. The multifunction peripheral 1 also receives, via the network 90, image data transmitted from the search server 2 or the PC 3, and prints the image represented by the received image data.

The multifunction peripheral 1 also transmits, to the search server 2 via the network 90, information-attached image data generated by an information-attached image data generation unit 84 described later. Information-attached image data is data that combines, as one set, an image and meta information, which is information on the character strings included in that image.

The search server 2 stores the received information-attached image data in an information-attached image storage unit 21 provided in the search server 2. When the search server 2 receives, via the network 90, a search keyword specified through a user's operation of the multifunction peripheral 1 or the PC 3, it searches the information-attached image data stored in the information-attached image storage unit 21 for information-attached image data whose meta information includes the received search keyword. The search server 2 then returns the search result, namely the information-attached image data whose meta information includes the search keyword, to the sender of the search keyword.
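The store-then-search behavior of the search server can be sketched in Python as follows. This is an illustration under stated assumptions, not the server's actual design: the class name `InformationAttachedImageStore`, the identifiers, and the in-memory list are hypothetical, and "includes the keyword" is interpreted here as a plain substring match against each meta information string.

```python
class InformationAttachedImageStore:
    """In-memory stand-in for the information-attached image storage unit."""

    def __init__(self):
        self._items = []  # list of (image_id, list of meta information strings)

    def store(self, image_id, meta_strings):
        """Keep one piece of information-attached image data."""
        self._items.append((image_id, list(meta_strings)))

    def search(self, keyword):
        """Return ids of stored data whose meta information contains the keyword."""
        return [
            image_id
            for image_id, meta_strings in self._items
            if any(keyword in text for text in meta_strings)
        ]


# Example with made-up data: only the first scan carries a stamp string.
store = InformationAttachedImageStore()
store.store("scan-001", ["Quarterly report", "Sales", "Confidential"])
store.store("scan-002", ["Meeting minutes"])
hits = store.search("Confidential")
```

Because the stamp string is part of the meta information, a search for the stamp text finds the scan even if the stamp itself would be hard to recognize in the image.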

The document management system 900 is thus made up of the multifunction peripheral 1, the search server 2, and the PC 3. The configuration of the document management system 900 is not limited to this; for example, it may be a simplified configuration without the PC 3.

The multifunction peripheral 1 is described in detail below. FIG. 2 is a schematic structural diagram showing an example of the multifunction peripheral 1 equipped with the image reading apparatus according to the present invention. FIG. 3 is a block diagram showing an example of the electrical configuration of the multifunction peripheral 1.

As shown in FIG. 2, the multifunction peripheral 1 includes a main body 10, an image reading unit 5, and a document feeding unit 6.

An operation panel unit 7 is provided on the front of the multifunction peripheral 1. The operation panel unit 7 includes a display unit 73 and an operation key unit 76.

画像読取部５は、露光ランプ５１１及びＣＣＤ（Charge Coupled Device）５１２等（図３）からなるスキャナー部５１と、ガラス等の透明部材により構成された原稿台５２及び原稿読取スリット５３と、を備えている。 The image reading unit 5 includes a scanner unit 51 made up of an exposure lamp 511, a CCD (Charge Coupled Device) 512, and the like (FIG. 3), as well as a document table 52 and a document reading slit 53, both formed of a transparent member such as glass.

スキャナー部５１は、図略の駆動部によって移動可能に構成され、原稿台５２に載置された原稿の画像を読み取るときは、原稿台５２に対向する位置で原稿面に沿って移動され、読み取った原稿の画像（以下、原稿画像）を示す画像データを制御部８へ出力する。また、原稿給送部６により給送された原稿の画像を読み取るときは、原稿読取スリット５３と対向する位置に移動され、原稿読取スリット５３を介して原稿給送部６による原稿の搬送動作と同期して原稿の画像を読み取り、原稿画像を示す画像データを制御部８へ出力する。 The scanner unit 51 is configured to be movable by a driving unit (not shown). When reading an image of a document placed on the document table 52, the scanner unit 51 is moved along the document surface at a position facing the document table 52 and outputs image data representing the image read from the document (hereinafter, the document image) to the control unit 8. When reading an image of a document fed by the document feeding unit 6, the scanner unit 51 is moved to a position facing the document reading slit 53, reads the image of the document through the document reading slit 53 in synchronization with the document conveying operation of the document feeding unit 6, and outputs image data representing the document image to the control unit 8.

原稿給送部６は、原稿を載置するための原稿載置部６１と、読み取り済みの原稿が排出される原稿排出部６２と、原稿載置部６１に載置された原稿を１枚ずつ繰り出して原稿読取スリット５３に対向する位置へ搬送し、原稿排出部６２へ排出する原稿搬送機構６３と、を備えている。 The document feeding unit 6 includes a document placement unit 61 on which documents are placed, a document discharge unit 62 to which read documents are discharged, and a document transport mechanism 63 that feeds the documents placed on the document placement unit 61 one sheet at a time, conveys each sheet to a position facing the document reading slit 53, and discharges it to the document discharge unit 62.

本体部１０は、複数の給紙カセット４６１と、給紙カセット４６１から用紙を画像形成部４へ搬送する給紙ローラー４１２と、搬出されてきた用紙に画像を形成する画像形成部４と、画像が形成された用紙が排出される排出トレイ４８と、ネットワーク通信部９と、装置全体の動作制御を司る制御部８と、を備えている。 The main body unit 10 includes a plurality of paper feed cassettes 461, a paper feed roller 412 that conveys paper from a paper feed cassette 461 to the image forming unit 4, the image forming unit 4 that forms an image on the conveyed paper, a discharge tray 48 to which the paper bearing the formed image is discharged, a network communication unit 9, and a control unit 8 that governs the operation of the entire apparatus.

画像形成部４は、用紙搬送部４１と、光走査装置４２と、感光体ドラム４３と、現像部４４と、転写部４５と、定着部４６と、を備えている。 The image forming unit 4 includes a paper transport unit 41, an optical scanning device 42, a photosensitive drum 43, a developing unit 44, a transfer unit 45, and a fixing unit 46.

用紙搬送部４１は、画像形成部４内の用紙搬送路中に設けられ、給紙ローラー４１２によって搬送されてきた用紙を感光体ドラム４３に供給する搬送ローラー４１３や、用紙を本体部１０の左方に配設されたスタックトレイ４９まで搬送する搬送ローラー４１４や、用紙を排出トレイ４８まで搬送する搬送ローラー４１５等を備えている。 The paper transport unit 41 is provided in the paper transport path in the image forming unit 4 and includes, among others, a transport roller 413 that supplies the paper conveyed by the paper feed roller 412 to the photosensitive drum 43, a transport roller 414 that conveys the paper to a stack tray 49 disposed on the left side of the main body unit 10, and a transport roller 415 that conveys the paper to the discharge tray 48.

光走査装置４２は、画像データに基づいてレーザー光を出力し、当該レーザー光によって感光体ドラム４３を露光することで、感光体ドラム４３上に静電潜像を形成する。 The optical scanning device 42 outputs a laser beam based on the image data and exposes the photosensitive drum 43 with the laser beam, thereby forming an electrostatic latent image on the photosensitive drum 43.

現像部４４は、トナー像を形成する。転写部４５は、感光体ドラム４３上のトナー像を用紙に転写する。定着部４６は、加熱してトナー像を用紙に定着させる。 The developing unit 44 forms a toner image. The transfer unit 45 transfers the toner image on the photosensitive drum 43 to a sheet. The fixing unit 46 heats and fixes the toner image on the sheet.

ネットワーク通信部９は、ネットワーク９０に接続された検索サーバー２やＰＣ３との間で各種データを送受信するための通信インターフェイス回路である。 The network communication unit 9 is a communication interface circuit for transmitting / receiving various data to / from the search server 2 and the PC 3 connected to the network 90.

制御部８は、例えば、所定の演算処理を実行するＣＰＵ（Central Processing Unit）、所定の制御プログラムが記憶されたＲＯＭ（Read Only Memory）やデータを一時的に記憶するＲＡＭ（Random Access Memory）等のメモリー、画像処理等の所定の処理を高速処理可能に構成された専用ハードウェアであるＡＳＩＣ（Application Specific Integrated Circuits）、及び、これらの周辺回路等を備えたマイクロコンピューター（コンピューター）によって構成されている。 The control unit 8 is constituted by, for example, a microcomputer (computer) that includes a CPU (Central Processing Unit) that executes predetermined arithmetic processing; memory such as a ROM (Read Only Memory) in which a predetermined control program is stored and a RAM (Random Access Memory) in which data is temporarily stored; an ASIC (Application Specific Integrated Circuits), which is dedicated hardware configured to perform predetermined processing such as image processing at high speed; and their peripheral circuits.

制御部８は、ＲＯＭ等に記憶された制御プログラムをＣＰＵによって実行することによって各種処理を実行し、複合機１内の各部の動作を制御させる。これによって、制御部８は、複合機１に備えられた複数の機能をそれぞれ実行する。 The control unit 8 executes various processes by causing the CPU to execute a control program stored in a ROM or the like, and controls the operation of each unit in the multifunction machine 1. As a result, the control unit 8 executes a plurality of functions provided in the multifunction machine 1.

制御部８は、複数の機能を実行する処理部として、本実施形態では、特に、文字スタンプ情報受付部８１、原稿文字情報抽出部８２、スタンプ済画像生成部８３、及び情報付画像データ生成部８４を構成する。 In the present embodiment, the control unit 8 constitutes, as processing units that execute these functions, in particular a character stamp information receiving unit 81, a document character information extraction unit 82, a stamped image generation unit 83, and an information-added image data generation unit 84.

以下では、文字スタンプ情報受付部８１、原稿文字情報抽出部８２、スタンプ済画像生成部８３、及び情報付画像データ生成部８４によって、情報付画像データを生成する動作について説明する。当該説明の中で各処理部について詳述する。図４は、情報付画像データを生成する動作の一例を示すフローチャートである。 Hereinafter, an operation of generating image data with information by the character stamp information reception unit 81, the document character information extraction unit 82, the stamped image generation unit 83, and the information-added image data generation unit 84 will be described. Each processing unit will be described in detail in the description. FIG. 4 is a flowchart illustrating an example of an operation for generating image data with information.

図４に示すように、先ず、文字スタンプ情報受付部８１は、ステップＳ１〜ステップＳ３を実行することによって、原稿画像ＤＢに画像として重畳させる文字であるスタンプ文字ＣＳに関する情報である文字スタンプ情報Ｌの入力を受け付ける。 As shown in FIG. 4, the character stamp information receiving unit 81 first executes steps S1 to S3, thereby accepting input of character stamp information L, which is information on the stamp character CS, the character to be superimposed as an image on the document image DB.

具体的には、例えば、ユーザーが操作パネル部７（図３）を操作して、スタンプ機能を選択すると、制御部８は、スタンプ文字ＣＳの選択肢として、「社外秘」や「秘」等の選択メニューを表示部７３に表示する。そして、ユーザーが例えば「秘」の選択メニューを選択すると、文字スタンプ情報受付部８１は、例えばテキストデータで表された「秘」の文字情報を、スタンプ文字ＣＳを示す情報であるスタンプ文字情報ＬＳとして受け付ける（ステップＳ１）。尚、各選択メニューに対応するテキストデータは、予めＲＯＭ等に記憶されている。 Specifically, for example, when the user operates the operation panel unit 7 (FIG. 3) and selects the stamp function, the control unit 8 displays on the display unit 73 a selection menu offering choices for the stamp character CS, such as “confidential” (社外秘) and “secret” (秘). When the user selects, for example, the “secret” item, the character stamp information receiving unit 81 accepts the character information “secret”, represented for example as text data, as stamp character information LS, which is information indicating the stamp character CS (step S1). The text data corresponding to each menu item is stored in advance in the ROM or the like.

次に、制御部８は、ステップＳ１で受け付けられたスタンプ文字情報ＬＳに対応するスタンプ文字ＣＳを表す画像であるスタンプ文字画像ＤＳを、原稿画像ＤＢに重畳させるときの位置として予め定められた、例えば、「左上」、「右上」、「左下」、及び「右下」等の複数の位置を示す選択メニューを表示部７３に表示する。そして、ユーザーが例えば「右下」の選択メニューを選択すると、文字スタンプ情報受付部８１は、「右下」の位置を示す情報を、原稿画像ＤＢにおいてスタンプ文字画像ＤＳを重畳させる位置を示す情報であるスタンプ位置情報ＬＰとして受け付ける（ステップＳ２）。 Next, the control unit 8 displays on the display unit 73 a selection menu showing a plurality of positions, for example “upper left”, “upper right”, “lower left”, and “lower right”, that are predetermined as positions at which the stamp character image DS, an image representing the stamp character CS corresponding to the stamp character information LS received in step S1, can be superimposed on the document image DB. When the user selects, for example, “lower right”, the character stamp information receiving unit 81 accepts information indicating the “lower right” position as stamp position information LP, which is information indicating the position at which the stamp character image DS is superimposed on the document image DB (step S2).

図５は、文字スタンプ情報Ｌの一例を示す説明図である。そして、文字スタンプ情報受付部８１は、例えば図５に示すように、ステップＳ１で受け付けたスタンプ文字ＣＳを示す情報であるスタンプ文字情報ＬＳと、ステップＳ２で受け付けた原稿画像ＤＢにおいてスタンプ文字画像ＤＳを重畳させる位置を示す情報であるスタンプ位置情報ＬＰと、を対応付けた情報である文字スタンプ情報Ｌを生成して、ＲＡＭに記憶する（ステップＳ３）。 FIG. 5 is an explanatory diagram showing an example of the character stamp information L. As shown for example in FIG. 5, the character stamp information receiving unit 81 generates character stamp information L, which associates the stamp character information LS received in step S1 (information indicating the stamp character CS) with the stamp position information LP received in step S2 (information indicating the position at which the stamp character image DS is superimposed on the document image DB), and stores it in the RAM (step S3).
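
Steps S1 to S3 amount to validating two menu selections and pairing them. The sketch below is one possible way to model that pairing; the class name `CharacterStampInfo`, the function `accept_stamp_info`, and the English position labels are hypothetical, and the menu contents are limited to the examples the text mentions.

```python
from dataclasses import dataclass

# Menu choices named in the description; a real device may offer more (assumption).
STAMP_CHOICES = ("社外秘", "秘")
POSITION_CHOICES = ("upper-left", "upper-right", "lower-left", "lower-right")


@dataclass(frozen=True)
class CharacterStampInfo:
    """Character stamp information L: stamp text (LS) paired with a position (LP)."""
    stamp_text: str  # stamp character information LS
    position: str    # stamp position information LP


def accept_stamp_info(stamp_text, position):
    """Steps S1 to S3: accept both menu selections and associate them as L."""
    if stamp_text not in STAMP_CHOICES:
        raise ValueError(f"unknown stamp text: {stamp_text!r}")
    if position not in POSITION_CHOICES:
        raise ValueError(f"unknown position: {position!r}")
    return CharacterStampInfo(stamp_text, position)


info = accept_stamp_info("秘", "lower-right")
```

Keeping the stamp text as plain text data at this stage is what later allows it to be added to the meta information without running character recognition on the stamped image.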

そして、例えば、ユーザーが操作パネル部７（図３）に備えられたスタートキーを押下して、情報付画像データの送信機能の開始指示を入力すると、制御部８は、当該入力を受け付けて、情報付画像データを生成する動作を開始する。情報付画像データを生成する動作が開始されると、制御部８は、画像読取部５によって、原稿の画像を読み取らせて、読み取られた画像である原稿画像ＤＢを示す画像データを生成させる（ステップＳ４）。 Then, for example, when the user presses the start key provided on the operation panel unit 7 (FIG. 3) and thereby inputs an instruction to start the transmission function for image data with information, the control unit 8 accepts the input and starts the operation of generating image data with information. Once this operation has started, the control unit 8 causes the image reading unit 5 to read the image of the document and to generate image data representing the document image DB, which is the image that was read (step S4).

ここで、以下の説明に具体例として用いる原稿画像ＤＢについて説明する。図６は、原稿画像ＤＢの一例を示す説明図である。例えば図６に示すように、原稿画像ＤＢには、左上の画素の座標Ｓ１と右下の画素の座標Ｅ１によって定まる矩形領域に「最新の・・・について」と記載された文字列Ｃ１が含まれている。また、原稿画像ＤＢには、左上の画素の座標Ｓ２と右下の画素の座標Ｅ２によって定まる矩形領域に「今回、・・・です。」と記載された文字列Ｃ２が含まれている。また、原稿画像ＤＢには、左上の画素の座標Ｓ３と右下の画素の座標Ｅ３によって定まる矩形領域に「右図のように、・・・います。」と記載された文字列Ｃ３が含まれている。尚、原稿画像ＤＢには、符号Ｇ１に示す画像も含まれている。 Here, the document image DB used as a specific example in the following description will be described. FIG. 6 is an explanatory diagram showing an example of the document image DB. For example, as shown in FIG. 6, the document image DB includes a character string C1 in which “about the latest ...” is described in a rectangular area determined by the coordinates S1 of the upper left pixel and the coordinates E1 of the lower right pixel. It is. In addition, the document image DB includes a character string C2 in which "This time is ..." is included in a rectangular area determined by the coordinates S2 of the upper left pixel and the coordinates E2 of the lower right pixel. In addition, the document image DB includes a character string C3 in which “as shown in the right figure” is described in a rectangular area determined by the coordinates S3 of the upper left pixel and the coordinates E3 of the lower right pixel. It is. Note that the document image DB also includes an image indicated by reference numeral G1.

図４に戻り、次に、原稿文字情報抽出部８２は、ＯＣＲ（Optical Character Recognition）等の公知の文字認識処理を実行することによって、ステップＳ４で生成された画像データによって示される原稿画像ＤＢに含まれている文字列に関する情報を抽出する（ステップＳ５）。 Returning to FIG. 4, the document character information extraction unit 82 next executes a known character recognition process such as OCR (Optical Character Recognition), thereby extracting information on the character strings contained in the document image DB represented by the image data generated in step S4 (step S5).

具体的には、原稿文字情報抽出部８２は、ステップＳ５において、ステップＳ４で生成された画像データによって示される原稿画像ＤＢの解像度を、原稿画像ＤＢに含まれている文字列に関する情報の抽出精度を高めることができるように予め定められた解像度に変換する。これによって、原稿文字情報抽出部８２は、原稿画像ＤＢを、前記予め定められた解像度の画像である抽出用画像ＤＴに変換する。 Specifically, in step S5, the document character information extraction unit 82 converts the resolution of the document image DB represented by the image data generated in step S4 to a resolution predetermined so as to improve the accuracy with which information on the character strings contained in the document image DB is extracted. The document character information extraction unit 82 thereby converts the document image DB into an extraction image DT, which is an image having the predetermined resolution.

図７は、原稿画像ＤＢと抽出用画像ＤＴとの関係の一例を示す説明図である。例えば、図７に示すように、原稿文字情報抽出部８２は、ステップＳ４で生成された、水平方向にＷｂ個の画素と垂直方向にＨｂ個の画素によって表される解像度の原稿画像ＤＢを、原稿画像ＤＢに含まれている文字列に関する情報の抽出精度を高めることができるように予め定められた、水平方向にＷｔ個の画素と垂直方向にＨｔ個の画素によって表される解像度の抽出用画像ＤＴに変換する。尚、図７では、説明の便宜上、原稿画像ＤＢの大きさと抽出用画像ＤＴの大きさを異ならせて図示している。 FIG. 7 is an explanatory diagram showing an example of the relationship between the document image DB and the extraction image DT. For example, as shown in FIG. 7, the manuscript character information extraction unit 82 generates the manuscript image DB having a resolution represented by Wb pixels in the horizontal direction and Hb pixels in the vertical direction generated in step S4. For extracting the resolution represented by Wt pixels in the horizontal direction and Ht pixels in the vertical direction, which are set in advance so as to improve the accuracy of extracting information related to the character string included in the document image DB. Convert to image DT. In FIG. 7, for the convenience of explanation, the size of the document image DB and the size of the extraction image DT are illustrated differently.

以下、水平方向に左からＸ番目であって垂直方向に上からＹ番目の画素の座標を（Ｘ，Ｙ）と示す。つまり、抽出用画像ＤＴにおける座標（Ｘｔ，Ｙｔ）に対応する原稿画像ＤＢにおける座標（Ｘｂ，Ｙｂ）は、抽出用画像ＤＴの水平方向の画素数Ｗｔ及び垂直方向の画素数Ｈｔと、抽出用画像ＤＴにおける座標（Ｘｔ，Ｙｔ）とを用いて、以下の式（１）、（２）によって算出することができる。
Ｘｂ＝Ｘｔ＊Ｗｂ／Ｗｔ・・・（１）
Ｙｂ＝Ｙｔ＊Ｈｂ／Ｈｔ・・・（２）
尚、「＊」は、掛け算（乗算）を示し、「／」は割り算（除算）を示す。すなわち、原稿画像ＤＢの座標位置と抽出用画像ＤＴの座標位置とは、式（１）、（２）によって対応付けられている。 Hereinafter, the coordinates of the pixel that is X-th from the left in the horizontal direction and Y-th from the top in the vertical direction are denoted (X, Y). The coordinates (Xb, Yb) in the document image DB that correspond to the coordinates (Xt, Yt) in the extraction image DT can then be calculated from the horizontal pixel count Wt and vertical pixel count Ht of the extraction image DT and the coordinates (Xt, Yt) in the extraction image DT, using the following equations (1) and (2).
Xb = Xt * Wb / Wt ... (1)
Yb = Yt * Hb / Ht ... (2)
Here, “*” denotes multiplication and “/” denotes division. In other words, coordinate positions in the document image DB and coordinate positions in the extraction image DT are related to each other by equations (1) and (2).
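
Equations (1) and (2) are a per-axis rescaling, which a short function can illustrate. This is a sketch under assumptions: the function name `to_document_coords` and the sample image sizes are made up, and the use of integer floor division is an assumption, since the description does not state how fractional results are rounded.

```python
def to_document_coords(xt, yt, wt, ht, wb, hb):
    """Map (Xt, Yt) in the extraction image DT to (Xb, Yb) in the document image DB.

    Implements Xb = Xt * Wb / Wt (1) and Yb = Yt * Hb / Ht (2).
    Floor division is an assumption; the source does not specify rounding.
    """
    xb = xt * wb // wt
    yb = yt * hb // ht
    return xb, yb


# Hypothetical sizes: DT is 1000 x 1400 pixels, DB is 2000 x 2800 pixels.
point_in_db = to_document_coords(100, 50, wt=1000, ht=1400, wb=2000, hb=2800)
```

With these sample sizes each axis is scaled by a factor of two, so the point (100, 50) in DT corresponds to (200, 100) in DB.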

次に、原稿文字情報抽出部８２は、抽出用画像ＤＴに対してＯＣＲ等の公知の文字認識処理を実行して、抽出用画像ＤＴに含まれている文字列に関する情報を抽出する。 Next, the document character information extraction unit 82 performs a known character recognition process such as OCR on the extraction image DT to extract information on the character string included in the extraction image DT.

図８は、抽出用画像ＤＴの一例を示す説明図である。図９は、抽出用画像ＤＴから抽出した文字列に関する情報の一例を示す説明図である。例えば図８に示すように、原稿文字情報抽出部８２は、抽出用画像ＤＴに対して文字認識処理を実行して、左上の座標（ＳＸｔ１，ＳＹｔ１）と右下の座標（ＥＸｔ１，ＥＹｔ１）によって定まる矩形領域に「最新の・・・について」と記載された文字列Ｃｔ１が存在することを認識する。そして、原稿文字情報抽出部８２は、例えば図９に示すように、文字列Ｃｔ１を示す情報と、文字列Ｃｔ１が配置されている領域を定める左上の座標（ＳＸｔ１，ＳＹｔ１）と右下の座標（ＥＸｔ１，ＥＹｔ１）を示す情報とを対応付けてＲＡＭに記憶する。 FIG. 8 is an explanatory diagram illustrating an example of the extraction image DT. FIG. 9 is an explanatory diagram showing an example of information related to the character string extracted from the extraction image DT. For example, as shown in FIG. 8, the document character information extraction unit 82 executes character recognition processing on the extraction image DT, and uses the upper left coordinates (SXt1, SYt1) and the lower right coordinates (EXt1, EYt1). It is recognized that the character string Ct1 described as “about the latest ...” exists in the fixed rectangular area. Then, as shown in FIG. 9, for example, the manuscript character information extraction unit 82 has information indicating the character string Ct1, upper left coordinates (SXt1, SYt1) and lower right coordinates that define an area where the character string Ct1 is arranged. The information indicating (EXt1, EYt1) is associated with and stored in the RAM.

これと同様にして、原稿文字情報抽出部８２は、「今回、・・・です。」と記載された文字列Ｃｔ２を示す情報と、文字列Ｃｔ２が配置されている領域を定める左上の座標（ＳＸｔ２，ＳＹｔ２）と右下の座標（ＥＸｔ２，ＥＹｔ２）を示す情報とを対応付けてＲＡＭに記憶する。また、原稿文字情報抽出部８２は、「右図のように、・・・います。」と記載された文字列Ｃｔ３を示す情報と、文字列Ｃｔ３が配置されている領域を定める左上の座標（ＳＸｔ３，ＳＹｔ３）と右下の座標（ＥＸｔ３，ＥＹｔ３）を示す情報とを対応付けてＲＡＭに記憶する。 Similarly, the document character information extraction unit 82 stores in the RAM, in association with each other, information indicating the character string Ct2 reading “This time, ...” and information indicating the upper left coordinates (SXt2, SYt2) and lower right coordinates (EXt2, EYt2) that define the area in which the character string Ct2 is located. The document character information extraction unit 82 likewise stores in the RAM, in association with each other, information indicating the character string Ct3 reading “As shown in the right figure, ...” and information indicating the upper left coordinates (SXt3, SYt3) and lower right coordinates (EXt3, EYt3) that define the area in which the character string Ct3 is located.

そして、原稿文字情報抽出部８２は、ＲＡＭに記憶した抽出用画像ＤＴに含まれている文字列Ｃｔ１〜Ｃｔ３を示す情報を例えばテキストデータとして取得して、原稿画像ＤＢに含まれている文字列Ｃ１〜Ｃ３を示す情報である文字列情報ＭＳとする。 Then, the document character information extraction unit 82 acquires information indicating the character strings Ct1 to Ct3 included in the extraction image DT stored in the RAM as, for example, text data, and the character string included in the document image DB. It is assumed that the character string information MS is information indicating C1 to C3.

また、原稿文字情報抽出部８２は、ＲＡＭに記憶した抽出用画像ＤＴにおいて文字列Ｃｔ１〜Ｃｔ３のそれぞれが配置されている領域を定める左上の座標と右下の座標を示す情報を取得し、取得した情報が示す座標を、上記式（１）、（２）を用いて、原稿画像ＤＢに対応する座標にそれぞれ変換する。そして、原稿文字情報抽出部８２は、当該変換後の座標を示す情報を、原稿画像ＤＢに含まれている文字列Ｃ１〜Ｃ３のそれぞれが配置されている位置を示す位置情報ＭＰとする。 The document character information extraction unit 82 also acquires the information stored in the RAM that indicates the upper left and lower right coordinates defining the area in which each of the character strings Ct1 to Ct3 is located in the extraction image DT, and converts the coordinates indicated by that information into the corresponding coordinates in the document image DB using equations (1) and (2) above. The document character information extraction unit 82 then uses the information indicating the converted coordinates as position information MP, which indicates the position at which each of the character strings C1 to C3 contained in the document image DB is located.

具体的には、原稿文字情報抽出部８２は、抽出用画像ＤＴにおいて文字列Ｃｔ１が配置されている領域を定める左上の座標（ＳＸｔ１，ＳＹｔ１）を、上記の式（１）、（２）を用いて、原稿画像ＤＢにおける座標（ＳＸｔ１＊Ｗｂ／Ｗｔ，ＳＹｔ１＊Ｈｂ／Ｈｔ）に変換する。そして、当該変換後の座標を、原稿画像ＤＢにおいて文字列Ｃ１が配置されている領域の左上の座標Ｓ１（図６）とする。また、原稿文字情報抽出部８２は、抽出用画像ＤＴにおいて文字列Ｃｔ１が配置されている領域を定める右下の座標（ＥＸｔ１，ＥＹｔ１）を、上記の式（１）、（２）を用いて、原稿画像ＤＢにおける座標（ＥＸｔ１＊Ｗｂ／Ｗｔ，ＥＹｔ１＊Ｈｂ／Ｈｔ）に変換する。そして、当該変換後の座標を、原稿画像ＤＢにおいて文字列Ｃ１が配置されている領域の右下の座標Ｅ１とする。尚、Ｗｂは、原稿画像ＤＢの水平方向の画素数、Ｗｔは、抽出用画像ＤＴの水平方向の画素数を示す。また、Ｈｂは、原稿画像ＤＢの垂直方向の画素数、Ｈｔは、抽出用画像ＤＴの垂直方向の画素数を示す。 Specifically, the document character information extraction unit 82 converts the upper left coordinates (SXt1, SYt1), which define the area in which the character string Ct1 is located in the extraction image DT, into the coordinates (SXt1 * Wb / Wt, SYt1 * Hb / Ht) in the document image DB using equations (1) and (2) above. The converted coordinates are taken as the upper left coordinates S1 (FIG. 6) of the area in which the character string C1 is located in the document image DB. The document character information extraction unit 82 likewise converts the lower right coordinates (EXt1, EYt1), which define the area in which the character string Ct1 is located in the extraction image DT, into the coordinates (EXt1 * Wb / Wt, EYt1 * Hb / Ht) in the document image DB using equations (1) and (2). The converted coordinates are taken as the lower right coordinates E1 of the area in which the character string C1 is located in the document image DB. Here, Wb is the number of pixels in the horizontal direction of the document image DB, and Wt is the number of pixels in the horizontal direction of the extraction image DT. Hb is the number of pixels in the vertical direction of the document image DB, and Ht is the number of pixels in the vertical direction of the extraction image DT.

これと同様にして、原稿文字情報抽出部８２は、文字列Ｃｔ２が配置されている領域を定める左上の座標（ＳＸｔ２，ＳＹｔ２）と右下の座標（ＥＸｔ２，ＥＹｔ２）を、原稿画像ＤＢにおける座標（ＳＸｔ２＊Ｗｂ／Ｗｔ，ＳＹｔ２＊Ｈｂ／Ｈｔ）と座標（ＥＸｔ２＊Ｗｂ／Ｗｔ，ＥＹｔ２＊Ｈｂ／Ｈｔ）に変換する。そして、原稿文字情報抽出部８２は、当該変換後の座標を、原稿画像ＤＢにおいて文字列Ｃ２が配置されている領域の左上の座標Ｓ２と右下の座標Ｅ２（図６）とする。 Similarly, the document character information extraction unit 82 uses the upper left coordinates (SXt2, SYt2) and the lower right coordinates (EXt2, EYt2) that define the area in which the character string Ct2 is arranged, as coordinates in the document image DB. (SXt2 * Wb / Wt, SYt2 * Hb / Ht) and coordinates (EXt2 * Wb / Wt, EYt2 * Hb / Ht) are converted. Then, the document character information extraction unit 82 sets the converted coordinates as the upper left coordinate S2 and the lower right coordinate E2 (FIG. 6) of the area where the character string C2 is arranged in the document image DB.

また、原稿文字情報抽出部８２は、文字列Ｃｔ３が配置されている領域を定める左上の座標（ＳＸｔ３，ＳＹｔ３）と右下の座標（ＥＸｔ３，ＥＹｔ３）を、原稿画像ＤＢに対応する座標（ＳＸｔ３＊Ｗｂ／Ｗｔ，ＳＹｔ３＊Ｈｂ／Ｈｔ）と座標（ＥＸｔ３＊Ｗｂ／Ｗｔ，ＥＹｔ３＊Ｈｂ／Ｈｔ）に変換する。そして、原稿文字情報抽出部８２は、当該変換後の座標を、原稿画像ＤＢにおいて文字列Ｃ３が配置されている領域の左上の座標Ｓ３と右下の座標Ｅ３（図６）とする。 Further, the document character information extraction unit 82 uses the upper left coordinates (SXt3, SYt3) and the lower right coordinates (EXt3, EYt3) that define the area where the character string Ct3 is arranged, as coordinates (SXt3) corresponding to the document image DB. * Wb / Wt, SYt3 * Hb / Ht) and coordinates (EXt3 * Wb / Wt, EYt3 * Hb / Ht). Then, the document character information extraction unit 82 sets the converted coordinates as the upper left coordinate S3 and the lower right coordinate E3 (FIG. 6) of the area where the character string C3 is arranged in the document image DB.

このようにして、原稿文字情報抽出部８２は、ステップＳ５において、原稿画像ＤＢに含まれている文字列Ｃ１〜Ｃ３を示す情報である文字列情報ＭＳと、原稿画像ＤＢに含まれている文字列Ｃ１〜Ｃ３が配置されている位置を示す位置情報ＭＰと、を原稿画像ＤＢから抽出する。 In this manner, in step S5, the document character information extraction unit 82 extracts from the document image DB both the character string information MS, which is information indicating the character strings C1 to C3 contained in the document image DB, and the position information MP, which indicates the positions at which the character strings C1 to C3 are located in the document image DB.

図１０は、原稿文字情報Ｍの一例を示す説明図である。そして、原稿文字情報抽出部８２は、例えば図１０に示すように、ステップＳ５において抽出した原稿画像ＤＢに含まれている文字列Ｃ１〜Ｃ３を示す情報である文字列情報ＭＳと、原稿画像ＤＢに含まれている文字列Ｃ１〜Ｃ３が配置されている位置を示す位置情報ＭＰと、を対応付けた情報である原稿文字情報Ｍを生成して、ＲＡＭに記憶する（ステップＳ６）。 FIG. 10 is an explanatory diagram showing an example of the document character information M. As shown for example in FIG. 10, the document character information extraction unit 82 generates document character information M, which associates the character string information MS extracted in step S5 (information indicating the character strings C1 to C3 contained in the document image DB) with the position information MP (information indicating the positions at which the character strings C1 to C3 are located in the document image DB), and stores it in the RAM (step S6).
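
Steps S5 and S6 can be summarized as: take each recognized string with its bounding box in the extraction image DT, rescale the box into the document image DB with equations (1) and (2), and store text and box together. The sketch below assumes the OCR output is already available as a list of tuples; the function name, the dictionary keys `"MS"` and `"MP"`, the sample values, and the floor-division rounding are all illustrative assumptions.

```python
def build_document_character_info(ocr_results, wt, ht, wb, hb):
    """Build document character information M from OCR results on DT.

    ocr_results: iterable of (text, (sx, sy), (ex, ey)) in DT coordinates.
    Returns a list of entries pairing the string (MS) with its DB region (MP).
    """
    info = []
    for text, (sx, sy), (ex, ey) in ocr_results:
        # Equations (1) and (2), applied to both corners of the bounding box.
        start = (sx * wb // wt, sy * hb // ht)
        end = (ex * wb // wt, ey * hb // ht)
        info.append({"MS": text, "MP": (start, end)})
    return info


# Hypothetical OCR output for one string in a 1000 x 1400 extraction image.
ocr_results = [("About the latest ...", (100, 50), (900, 120))]
document_info = build_document_character_info(
    ocr_results, wt=1000, ht=1400, wb=2000, hb=2800
)
```

The positions stored in M refer to the document image DB rather than to DT, so they remain valid for the stamped image DA, which has the same pixel dimensions as DB.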

図４に戻り、スタンプ済画像生成部８３は、ステップＳ１で受け付けられたスタンプ文字情報ＬＳに対応するスタンプ文字ＣＳ（図５において「秘」）を表す、原稿画像ＤＢと同じ解像度の画像であるスタンプ文字画像ＤＳを生成する。そして、スタンプ済画像生成部８３は、原稿画像ＤＢにおけるステップＳ２で受け付けられたスタンプ位置情報ＬＰに対応する位置（図５において「右下」）に、生成したスタンプ文字画像ＤＳを重畳する。これによって、スタンプ済画像生成部８３は、原稿画像ＤＢにスタンプ文字ＣＳを表すスタンプ文字画像ＤＳが重畳された画像であるスタンプ済画像ＤＡを生成する（ステップＳ７）。 Returning to FIG. 4, the stamped image generation unit 83 generates a stamp character image DS, which is an image of the same resolution as the document image DB and represents the stamp character CS (“secret” in FIG. 5) corresponding to the stamp character information LS received in step S1. The stamped image generation unit 83 then superimposes the generated stamp character image DS on the document image DB at the position (“lower right” in FIG. 5) corresponding to the stamp position information LP received in step S2. The stamped image generation unit 83 thereby generates a stamped image DA, which is an image in which the stamp character image DS representing the stamp character CS is superimposed on the document image DB (step S7).

図１１は、スタンプ済画像ＤＡの一例を示す説明図である。具体的には、スタンプ済画像生成部８３は、ステップＳ７において、例えば図１１に示すように、スタンプ位置情報ＬＰに対応する位置が、原稿画像ＤＢにおける「右下」の位置を示す場合には、生成したスタンプ文字画像ＤＳの右下の画素の座標Ｚ４が、原稿画像ＤＢの右下の画素の座標Ｑ４に重なるように、スタンプ文字画像ＤＳを原稿画像ＤＢに重畳することによって、スタンプ済画像ＤＡを生成する。尚、右下の画素とは、水平方向の終端の画素であって垂直方向の終端の画素を示す。 FIG. 11 is an explanatory diagram showing an example of the stamped image DA. Specifically, in step S7, when the position corresponding to the stamp position information LP indicates the “lower right” position in the document image DB, as shown for example in FIG. 11, the stamped image generation unit 83 generates the stamped image DA by superimposing the stamp character image DS on the document image DB so that the coordinates Z4 of the lower right pixel of the generated stamp character image DS coincide with the coordinates Q4 of the lower right pixel of the document image DB. The lower right pixel is the pixel that is last in the horizontal direction and last in the vertical direction.

同様にして、スタンプ済画像生成部８３は、ステップＳ７において、スタンプ位置情報ＬＰに対応する位置が、例えば、原稿画像ＤＢにおける「左上」の位置を示す場合には、生成したスタンプ文字画像ＤＳの左上の画素の座標Ｚ１が、原稿画像ＤＢの左上の画素の座標Ｑ１に重なるようにして、スタンプ文字画像ＤＳを原稿画像ＤＢに重畳する。尚、左上の画素とは、水平方向の先頭の画素であって垂直方向の先頭の画素を示す。 Similarly, when the position corresponding to the stamp position information LP indicates, for example, the “upper left” position in the document image DB in step S7, the stamped image generation unit 83 determines the position of the generated stamp character image DS. The stamp character image DS is superimposed on the document image DB so that the coordinate Z1 of the upper left pixel overlaps the coordinate Q1 of the upper left pixel of the document image DB. The upper left pixel is the first pixel in the horizontal direction and the first pixel in the vertical direction.

また、スタンプ済画像生成部８３は、ステップＳ７において、スタンプ位置情報ＬＰに対応する位置が、例えば、原稿画像ＤＢにおける「右上」の位置を示す場合には、生成したスタンプ文字画像ＤＳの右上の画素の座標Ｚ２が、原稿画像ＤＢの右上の画素の座標Ｑ２に重なるようにして、スタンプ文字画像ＤＳを原稿画像ＤＢに重畳する。尚、右上の画素とは、水平方向の終端の画素であって垂直方向の先頭の画素を示す。 Further, in step S7, the stamped image generation unit 83, when the position corresponding to the stamp position information LP indicates the position of “upper right” in the document image DB, for example, the upper right of the generated stamp character image DS. The stamp character image DS is superimposed on the document image DB so that the pixel coordinate Z2 overlaps the coordinate Q2 of the upper right pixel of the document image DB. Note that the upper right pixel is a pixel at the end in the horizontal direction and the top pixel in the vertical direction.

また、スタンプ済画像生成部８３は、ステップＳ７において、スタンプ位置情報ＬＰに対応する位置が、例えば、原稿画像ＤＢにおける「左下」の位置を示す場合には、生成したスタンプ文字画像ＤＳの左下の画素の座標Ｚ３が、原稿画像ＤＢの左下の画素の座標Ｑ３に重なるようにして、スタンプ文字画像ＤＳを原稿画像ＤＢに重畳する。尚、左下の画素とは、水平方向の先頭の画素であって垂直方向の終端の画素を示す。 Also, in step S7, the stamped image generation unit 83, when the position corresponding to the stamp position information LP indicates, for example, the “lower left” position in the document image DB, the lower left corner of the generated stamp character image DS. The stamp character image DS is superimposed on the document image DB so that the pixel coordinate Z3 overlaps the coordinate Q3 of the lower left pixel of the document image DB. The lower left pixel is the first pixel in the horizontal direction and the last pixel in the vertical direction.
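
The four corner cases of step S7 differ only in where the upper left pixel of the stamp character image DS lands in the document image DB. The following sketch illustrates that alignment on images held as nested lists. It uses 0-based list indices, whereas the description counts pixels from 1; the function names, the English position labels, and the convention that a pixel value of 0 in the stamp is transparent are assumptions made for illustration.

```python
POSITIONS = ("upper-left", "upper-right", "lower-left", "lower-right")


def stamp_origin(position, wb, hb, wd, hd):
    """Return the 0-based (x, y) in DB where the upper left pixel of DS is placed."""
    if position not in POSITIONS:
        raise ValueError(f"unknown position: {position!r}")
    x = 0 if position.endswith("left") else wb - wd   # align left or right edges
    y = 0 if position.startswith("upper") else hb - hd  # align top or bottom edges
    return x, y


def superimpose(document, stamp, position):
    """Return a stamped copy of `document` with `stamp` overlaid at the given corner."""
    hb, wb = len(document), len(document[0])
    hd, wd = len(stamp), len(stamp[0])
    x0, y0 = stamp_origin(position, wb, hb, wd, hd)
    stamped = [row[:] for row in document]  # leave the document image untouched
    for dy, stamp_row in enumerate(stamp):
        for dx, pixel in enumerate(stamp_row):
            if pixel:  # assumption: 0 means "transparent" in the stamp image
                stamped[y0 + dy][x0 + dx] = pixel
    return stamped


document = [[0] * 4 for _ in range(3)]  # tiny 4 x 3 stand-in for DB
stamp = [[1, 1], [1, 1]]                # tiny 2 x 2 stand-in for DS
stamped = superimpose(document, stamp, "lower-right")
```

Copying the document rows before writing keeps the original document image available, which matters here because character recognition is performed on the unstamped image.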

図４に戻り、情報付画像データ生成部８４は、ステップＳ６において原稿文字情報抽出部８２によって生成された原稿文字情報Ｍ（図１０）に、ステップＳ３において生成された文字スタンプ情報Ｌ（図５）を付加して、メタ情報Ｎを生成する（ステップＳ８）。 Returning to FIG. 4, the image data with information generation unit 84 adds the character stamp information L (FIG. 5) generated in step S3 to the document character information M (FIG. 10) generated by the document character information extraction unit 82 in step S6. ) Is added to generate meta information N (step S8).

図１２は、メタ情報Ｎの一例を示す説明図である。情報付画像データ生成部８４は、例えば図１２に示すように、スタンプ位置情報ＬＰ（図５）が原稿画像ＤＢにおける「右下」の位置を示す情報である場合には、スタンプ位置情報ＬＰを、スタンプ文字画像ＤＳが原稿画像ＤＢにおいて重畳されている領域を定める左上の座標Ｚ１（図１１）と右下の座標Ｚ４（図１１）を示す情報に変換する。具体的には、スタンプ文字画像ＤＳの水平方向の画素数Ｗｄ及び垂直方向の画素数Ｈｄとし、原稿画像ＤＢの水平方向の画素数Ｗｂ及び垂直方向の画素数Ｈｂとすると、座標Ｚ１は（Ｗｂ−Ｗｄ＋１，Ｈｂ−Ｈｄ＋１）で表され、座標Ｚ４は、（Ｗｂ，Ｈｂ）で表される。 FIG. 12 is an explanatory diagram showing an example of the meta information N. As shown for example in FIG. 12, when the stamp position information LP (FIG. 5) is information indicating the “lower right” position in the document image DB, the information-added image data generation unit 84 converts the stamp position information LP into information indicating the upper left coordinates Z1 (FIG. 11) and the lower right coordinates Z4 (FIG. 11) that define the area in which the stamp character image DS is superimposed on the document image DB. Specifically, where Wd and Hd are the numbers of pixels of the stamp character image DS in the horizontal and vertical directions, and Wb and Hb are the numbers of pixels of the document image DB in the horizontal and vertical directions, the coordinates Z1 are expressed as (Wb−Wd+1, Hb−Hd+1) and the coordinates Z4 are expressed as (Wb, Hb).

同様にして、情報付画像データ生成部８４は、スタンプ位置情報ＬＰが原稿画像ＤＢにおける「左上」の位置を示す情報である場合には、スタンプ位置情報ＬＰを、原稿画像ＤＢにおいてスタンプ文字画像ＤＳが重畳されている領域を定める左上の座標（１，１）と右下の座標（Ｗｄ，Ｈｄ）を示す情報に変換する。また、情報付画像データ生成部８４は、スタンプ位置情報ＬＰが原稿画像ＤＢにおける「右上」の位置を示す情報である場合には、スタンプ位置情報ＬＰを、原稿画像ＤＢにおいてスタンプ文字画像ＤＳが重畳されている領域を定める左上の座標（Ｗｂ−Ｗｄ＋１，１）と右下の座標（Ｗｂ，Ｈｄ）を示す情報に変換する。また、情報付画像データ生成部８４は、スタンプ位置情報ＬＰが原稿画像ＤＢにおける「左下」の位置を示す情報である場合には、スタンプ位置情報ＬＰを、原稿画像ＤＢにおいてスタンプ文字画像ＤＳが重畳されている領域を定める左上の座標（１，Ｈｂ−Ｈｄ＋１）と右下の座標（Ｗｄ，Ｈｂ）を示す情報に変換する。 Similarly, when the stamp position information LP is information indicating the “upper left” position in the document image DB, the information-added image data generation unit 84 converts the stamp position information LP into information indicating the upper left coordinates (1, 1) and the lower right coordinates (Wd, Hd) that define the area in which the stamp character image DS is superimposed on the document image DB. When the stamp position information LP is information indicating the “upper right” position in the document image DB, the information-added image data generation unit 84 converts it into information indicating the upper left coordinates (Wb−Wd+1, 1) and the lower right coordinates (Wb, Hd) of that area. When the stamp position information LP is information indicating the “lower left” position in the document image DB, the information-added image data generation unit 84 converts it into information indicating the upper left coordinates (1, Hb−Hd+1) and the lower right coordinates (Wd, Hb) of that area.
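
The conversion of the stamp position information LP into a pair of corner coordinates can be written as a lookup over the four cases. The sketch below follows the 1-based pixel coordinates used in the description; the function name `stamp_region`, the English position labels, and the sample sizes are hypothetical.

```python
def stamp_region(position, wb, hb, wd, hd):
    """Return ((x1, y1), (x2, y2)): 1-based upper left and lower right corners
    of the area of DB covered by the stamp character image DS.

    wb, hb: pixel counts of the document image DB.
    wd, hd: pixel counts of the stamp character image DS.
    """
    regions = {
        "upper-left": ((1, 1), (wd, hd)),
        "upper-right": ((wb - wd + 1, 1), (wb, hd)),
        "lower-left": ((1, hb - hd + 1), (wd, hb)),
        "lower-right": ((wb - wd + 1, hb - hd + 1), (wb, hb)),
    }
    return regions[position]


# Hypothetical sizes: DB is 2000 x 2800 pixels, DS is 300 x 100 pixels.
region = stamp_region("lower-right", wb=2000, hb=2800, wd=300, hd=100)
```

In every case the region spans exactly Wd by Hd pixels, which is a quick consistency check on the four coordinate pairs.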

尚、情報付画像データ生成部８４は、上記のスタンプ位置情報ＬＰを、原稿画像ＤＢにおける座標を示す情報に変換することに代えて、スタンプ位置情報ＬＰを、原稿画像ＤＢにおける予め定められた位置（例えば、「右下」）を示すテキストデータに変換してもよい。 The information-added image data generation unit 84 converts the stamp position information LP into information indicating coordinates in the document image DB, instead of converting the stamp position information LP into a predetermined position in the document image DB. You may convert into the text data which shows (for example, "lower right").

そして、情報付画像データ生成部８４は、ステップＳ６において原稿文字情報抽出部８２によって生成された、原稿画像ＤＢに含まれる文字列Ｃ１〜Ｃ３に関する情報である原稿文字情報Ｍ（図１０）に、上記変換後のスタンプ位置情報ＬＰを含む文字スタンプ情報Ｌを付加した情報であるメタ情報Ｎを生成する。 The information-added image data generation unit 84 then generates the meta information N by adding the character stamp information L, including the stamp position information LP converted as described above, to the document character information M (FIG. 10), which is the information on the character strings C1 to C3 contained in the document image DB that was generated by the document character information extraction unit 82 in step S6.
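
Step S8 can be pictured as appending one stamp entry to the list of entries extracted from the document. The sketch below is only one way to represent this: the function name `build_meta_information`, the dictionary keys, and the sample values are assumptions, and the actual layout of the meta information N (FIG. 12) may differ.

```python
def build_meta_information(document_character_info, character_stamp_info):
    """Return meta information N: the entries of M followed by the stamp entry L.

    Neither input is modified; N is a new list.
    """
    return list(document_character_info) + [dict(character_stamp_info)]


# Hypothetical contents of M (one recognized string) and L (the stamp).
document_character_info = [
    {"text": "About the latest ...", "region": ((200, 100), (1800, 240))},
]
character_stamp_info = {"text": "秘", "region": ((1701, 2701), (2000, 2800))}

meta_information = build_meta_information(
    document_character_info, character_stamp_info
)
```

The stamp text reaches N directly from the accepted input, so it is present in the meta information even if character recognition of the stamped image would have missed it.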

Returning to FIG. 4, the information-attached image data generation unit 84 generates information-attached image data, which combines into one set the meta information N generated in step S8 and the stamped image DA generated by the stamped image generation unit 83 in step S7 (step S9).

Specifically, in step S9, the information-attached image data generation unit 84 generates, for example, text data (so-called transparent text) representing the character strings C1 to C3 corresponding to the document character information M included in the meta information N, and text data (so-called transparent text) representing the stamp character CS corresponding to the character stamp information L included in the meta information N. The unit then overlays the generated text data on the stamped image DA, thereby generating, as the information-attached image data, data in the so-called searchable PDF format in which the meta information N and the stamped image DA form one set.
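As a rough illustration of how transparent text can be laid over a page image, the Python sketch below emits PDF content-stream operators that draw each string in text rendering mode 3 (neither filled nor stroked, hence invisible but still searchable). It is not taken from the patent. The function names, the font resource name `F1`, and the use of a resolution in dots per inch to map pixels to PDF points are assumptions, and the sketch handles ASCII text only; Japanese strings would need a suitable composite font and encoding.

```python
from typing import Iterable, Tuple

# ((left, top), (right, bottom)) in 1-based, inclusive pixel coordinates
Box = Tuple[Tuple[int, int], Tuple[int, int]]


def escape_pdf_string(text: str) -> str:
    """Escape the characters that are special inside a PDF literal string."""
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def invisible_text_ops(items: Iterable[Tuple[str, Box]],
                       page_height_px: int, dpi: int, font: str = "F1") -> str:
    """Build content-stream operators that place invisible text over an image."""
    scale = 72.0 / dpi  # pixels -> PDF points (1 point = 1/72 inch)
    lines = []
    for text, ((left, top), (right, bottom)) in items:
        x = (left - 1) * scale                 # left edge of the box
        y = (page_height_px - bottom) * scale  # PDF origin is at the bottom left
        size = (bottom - top + 1) * scale      # font size from the box height
        lines.append(
            f"BT /{font} {size:.2f} Tf 3 Tr {x:.2f} {y:.2f} Td "
            f"({escape_pdf_string(text)}) Tj ET"
        )
    return "\n".join(lines)
```

The same routine would be called once with the document strings and their positions and once with the stamp character and its region, so that both become searchable text on the page.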

The information-attached image data generation unit 84 then causes the network communication unit 9 to transmit the information-attached image data generated in step S9 to the search server 2 connected to the network 90 (step S10).

In other words, the image reading unit 5 and the control unit 8 constitute an example of the image reading apparatus according to the present invention.

According to the configuration of the above embodiment, the meta information N included in the information-attached image data contains the document character information M, which is information about the character strings included in the document image DB, and the character stamp information L accepted by the character stamp information receiving unit 81. The meta information N therefore contains the input character stamp information L, that is, information about the input stamp character CS, without any need to extract the character strings in the stamped image DA by character recognition processing such as OCR. This avoids the risk that information about the stamp character CS fails to be extracted when character string information is extracted from the stamped image DA, so information-attached image data that includes information about the stamp character CS can be output reliably.

Further, according to the configuration of the above embodiment, the meta information N (FIG. 12) included in the information-attached image data contains, as the document character information M, character string information MS indicating a character string included in the document image DB and position information MP indicating the position at which that character string is arranged in the document image DB. The meta information N also contains, as the character stamp information L (FIG. 5), stamp character information LS, which indicates the stamp character CS, and stamp position information LP, which indicates the position in the document image DB at which the stamp character image DS corresponding to the stamp character CS is superimposed.

The meta information N (FIG. 12) included in the information-attached image data can therefore be used to identify the character strings included in the document image DB and the stamp character CS. It can also be used to determine where in the document image DB each of those character strings and the stamp character CS is located. In addition, because the stamp character image DS has the same resolution as the document image DB, it can easily be superimposed at the position in the document image DB indicated by the stamp position information LP.

Further, according to the configuration of the above embodiment, by using an extraction image DT whose resolution is predetermined so as to increase the extraction accuracy of the document character information M, information indicating the character strings included in the extraction image DT (FIG. 9) and information indicating the positions at which the extracted character strings exist in the extraction image DT (FIG. 9) can be extracted accurately. Generating the document character information M from this extracted information (step S6) therefore increases the accuracy with which the document character information M is extracted from the document image DB.
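Because recognition runs on the extraction image DT while the position information MP refers to the document image DB, positions found in DT have to be mapped back to DB. A minimal Python sketch of that mapping follows. It assumes the two images differ only by a uniform resolution ratio and that coordinates are 1-based and inclusive; the function name and the dpi parameters are hypothetical and are not given in the source.

```python
import math
from typing import Tuple

# ((left, top), (right, bottom)) in 1-based, inclusive pixel coordinates
Box = Tuple[Tuple[int, int], Tuple[int, int]]


def to_document_box(box: Box, extraction_dpi: int, document_dpi: int) -> Box:
    """Map a region found on the extraction image onto the document image."""
    scale = document_dpi / extraction_dpi
    (left, top), (right, bottom) = box
    # Pixel k covers the interval [k - 1, k); scale the interval, then
    # round outward so the mapped region fully covers the original one.
    return (
        (math.floor((left - 1) * scale) + 1, math.floor((top - 1) * scale) + 1),
        (math.ceil(right * scale), math.ceil(bottom * scale)),
    )
```

Rounding outward is a design choice of this sketch: it keeps every recognized character inside the mapped region at the cost of a region that may be one pixel larger than strictly necessary.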

Further, according to the configuration of the above embodiment, the information-attached image data is data in the so-called searchable PDF format in which the meta information N and the stamped image DA form one set. The stamped image DA can therefore be displayed with the same layout on a wide variety of computers running operating systems (OS) such as Windows (registered trademark), Mac OS (registered trademark), and Linux (registered trademark), using software that can handle searchable PDF data. The characters represented by the data overlaid on the stamped image DA can also be searched.

Further, according to the configuration of the above embodiment, a character string indicating confidential information, such as "Strictly Confidential", can be used as the stamp character CS, and the search server 2 can then be made to search for information-attached image data containing a stamped image DA on which the corresponding stamp character image DS is superimposed, by specifying that same string as the search keyword. In other words, the stamp character CS represented by the stamp character image DS can be specified as a search keyword, which improves the convenience of the stamped image DA.
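The keyword search described above can be sketched as a simple substring match over stored meta information. This is an illustrative assumption about how a search server might behave, not the patent's implementation; the function name and the dictionary layout (file name mapped to the texts in its meta information) are hypothetical.

```python
from typing import Dict, List


def find_documents(store: Dict[str, List[str]], keyword: str) -> List[str]:
    """Return the names of stored items whose meta information contains the keyword.

    `store` maps a file name to every text held in its meta information:
    the document strings and the stamp character.
    """
    return sorted(
        name
        for name, texts in store.items()
        if any(keyword in text for text in texts)
    )
```

Because the stamp character is stored as text rather than recovered by OCR, a query for the stamp text matches every stamped item even when the stamp overlaps other content in the image.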

The configurations shown in FIGS. 1 to 12 for the above embodiment are merely examples and are not intended to limit the present invention to that embodiment.

For example, the configuration may be simplified so that step S10 is not executed. In that case, the information-attached image data generated by the information-attached image data generation unit 84 may be stored in the RAM of the multifunction peripheral 1 so that it can be used only on the multifunction peripheral 1.

The format of the information-attached image data generated by the information-attached image data generation unit 84 in step S9 was described as the so-called searchable PDF format, in which the meta information N and the stamped image DA form one set, but the format is not limited to this. For example, the information-attached image data generation unit 84 may place a character string indicating the meta information N and the stamped image DA in a single data file in a data format for Microsoft Word, Excel, or PowerPoint (registered trademark), and generate that file as the information-attached image data.

In step S7 as described above, the stamped image generation unit 83 generates a stamp character image DS having the same resolution as the document image DB and superimposes it at the position in the document image DB corresponding to the stamp position information LP (for example, "lower right" in FIG. 5) to generate the stamped image DA.

Alternatively, in step S7, the stamped image generation unit 83 may generate a stamp character image DS having the same resolution as the document image DB and then convert it into a temporary stamp character image, which is an image having the same resolution as the extraction image DT used by the document character information extraction unit 82 in step S5. The stamped image generation unit 83 may then superimpose the temporary stamp character image at the position in the extraction image DT corresponding to the stamp position information LP, thereby generating an extraction image DT on which the temporary stamp character image is superimposed. For example, when the stamp position information LP indicates the "lower right" position in the document image DB, that position is the one at which the lower-right pixel of the temporary stamp character image coincides with the lower-right pixel of the extraction image DT. Finally, the stamped image generation unit 83 may generate the stamped image DA by converting the extraction image DT on which the temporary stamp character image is superimposed into an image having the same resolution as the document image DB.
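The alternative flow (scale the stamp down to the extraction resolution, superimpose it, then scale the result back up) can be sketched with plain nested lists standing in for images. Everything here is an assumption made for illustration: nearest-neighbor resampling, superimposition by overwriting pixels, the "lower right" placement only, and all function names.

```python
from typing import List

Image = List[List[int]]  # rows of pixel values


def resize(img: Image, width: int, height: int) -> Image:
    """Nearest-neighbor resampling to the given size."""
    src_h, src_w = len(img), len(img[0])
    return [
        [img[y * src_h // height][x * src_w // width] for x in range(width)]
        for y in range(height)
    ]


def paste_lower_right(base: Image, stamp: Image) -> Image:
    """Overwrite the lower-right corner of `base` with `stamp`."""
    out = [row[:] for row in base]
    off_y = len(base) - len(stamp)
    off_x = len(base[0]) - len(stamp[0])
    for y, row in enumerate(stamp):
        for x, value in enumerate(row):
            out[off_y + y][off_x + x] = value
    return out


def stamp_via_extraction_image(extraction: Image, stamp: Image,
                               doc_w: int, doc_h: int) -> Image:
    """Stamp at extraction resolution, then convert to document resolution."""
    ext_h, ext_w = len(extraction), len(extraction[0])
    # Temporary stamp image: the stamp scaled by the same ratio as the page.
    tmp_w = max(1, len(stamp[0]) * ext_w // doc_w)
    tmp_h = max(1, len(stamp) * ext_h // doc_h)
    temporary = resize(stamp, tmp_w, tmp_h)
    stamped_extraction = paste_lower_right(extraction, temporary)
    return resize(stamped_extraction, doc_w, doc_h)
```

With a 2 × 2 blank extraction image, a 2 × 2 stamp of ones, and a 4 × 4 document size, the stamp ends up occupying the lower-right 2 × 2 block of the 4 × 4 result.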

In step S5 as described above, the document character information extraction unit 82 converts the document image DB generated in step S4 into the extraction image DT and executes character recognition processing on the extraction image DT to extract information about the character strings included in the extraction image DT (FIG. 9). Alternatively, in step S5, the document character information extraction unit 82 may execute character recognition processing on the document image DB itself, without converting it into the extraction image DT, and directly extract character string information MS indicating a character string included in the document image DB together with position information MP consisting of the upper-left and lower-right coordinates that define the region of the document image DB in which that character string is arranged.

In the above configuration, the character stamp information L associates the stamp character information LS with the stamp position information LP. Alternatively, the character stamp information L may consist of the stamp character information LS alone. In that case, however, the position at which the stamp character image DS is superimposed on the document image DB must be fixed in advance according to the resolution of the document image DB.

Likewise, the document character information M was described as associating the character string information MS, which indicates a character string included in the document image DB, with the position information MP, which indicates the position at which that character string is arranged in the document image DB. Alternatively, the document character information M may consist only of the character string information MS indicating the character strings included in the document image DB.

The above configuration describes an example in which the image reading apparatus according to the present invention is applied to the multifunction peripheral 1, but the invention is not limited to this and may also be applied to, for example, a copier or a scanner device.

1: Multifunction peripheral
2: Search server
3: Personal computer (PC)
4: Image forming unit
5: Image reading unit (image reading apparatus)
8: Control unit (image reading apparatus)
9: Network communication unit
21: Information-attached image storage unit
73: Display unit
76: Operation key unit
81: Character stamp information receiving unit
82: Document character information extraction unit
83: Stamped image generation unit
84: Information-attached image data generation unit
90: Network
900: Document management system
CS: Stamp character
DA: Stamped image
DB: Document image
DS: Stamp character image
DT: Extraction image
L: Character stamp information
LP: Stamp position information
LS: Stamp character information
M: Document character information
MP: Position information
MS: Character string information
N: Meta information

Claims

An image reading apparatus comprising:
an image reading unit that reads an image of a document;
a character stamp information receiving unit that accepts input of character stamp information, the character stamp information being information about a stamp character, which is a character to be superimposed as an image on a document image read by the image reading unit, and associating stamp character information indicating the stamp character with stamp position information indicating a position in the document image at which a stamp character image representing the stamp character is to be superimposed, wherein the character stamp information receiving unit accepts, as the stamp character information, character information selected from a plurality of pieces of character information predetermined as options for the stamp character, and accepts, as the stamp position information, information indicating a position selected from information indicating a plurality of predetermined positions in the document image;
a document character information extraction unit that extracts document character information, the document character information being information about a character string included in the document image and associating document character string information indicating the character string with document position information indicating a position at which the character string is arranged in the document image, by executing: a process of converting the document image into an extraction image, which is an image having a resolution predetermined so as to increase the extraction accuracy of the document character information; a process of extracting a character string from the extraction image by executing predetermined character recognition processing on the extraction image; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image that corresponds to the position on the extraction image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information;
a stamped image generation unit that, after the document character information has been extracted by the document character information extraction unit, generates the stamp character image, which represents the stamp character corresponding to the character stamp information accepted by the character stamp information receiving unit and has the same resolution as the document image, converts the stamp character image into a temporary stamp character image having the same resolution as the extraction image, superimposes the temporary stamp character image at the position in the extraction image corresponding to the stamp position information included in the accepted character stamp information, and converts the extraction image on which the temporary stamp character image is superimposed into an image having the same resolution as the document image, thereby generating a stamped image in which the stamp character image is visibly superimposed on the document image; and
an information-attached image data generation unit that generates information-attached image data, which is data combining into one set (i) meta information obtained by adding, to the document character information in which the document character string information and the document position information extracted by the document character information extraction unit are associated, the character stamp information in which the stamp character information and the stamp position information accepted by the character stamp information receiving unit are associated, and (ii) the stamped image generated by the stamped image generation unit.

An image reading apparatus comprising:
an image reading unit that reads an image of a document;
a character stamp information receiving unit that accepts input of character stamp information, the character stamp information being information about a stamp character, which is a character to be superimposed as an image on a document image read by the image reading unit, and associating stamp character information indicating the stamp character with stamp position information indicating a position in the document image at which a stamp character image representing the stamp character is to be superimposed, wherein the character stamp information receiving unit accepts, as the stamp character information, character information selected from a plurality of pieces of character information predetermined as options for the stamp character, and accepts, as the stamp position information, information indicating a position selected from information indicating a plurality of predetermined positions in the document image;
a document character information extraction unit that extracts document character information, the document character information being information about a character string included in the document image and associating document character string information indicating the character string with document position information indicating a position at which the character string is arranged in the document image, by executing: a process of extracting a character string by executing predetermined character recognition processing on the document image; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information;
a stamped image generation unit that, after the document character information has been extracted by the document character information extraction unit, generates the stamp character image, which represents the stamp character corresponding to the character stamp information accepted by the character stamp information receiving unit and has the same resolution as the document image, and superimposes the stamp character image at the position in the document image corresponding to the stamp position information included in the accepted character stamp information, thereby generating a stamped image in which the stamp character image is visibly superimposed on the document image; and
an information-attached image data generation unit that generates information-attached image data, which is data combining into one set (i) meta information obtained by adding, to the document character information in which the document character string information and the document position information extracted by the document character information extraction unit are associated, the character stamp information in which the stamp character information and the stamp position information accepted by the character stamp information receiving unit are associated, and (ii) the stamped image generated by the stamped image generation unit.

The image reading apparatus according to claim 1, wherein the information-attached image data is data in a searchable PDF format in which the meta information and the stamped image are combined into one set, and includes data representing the character string corresponding to the document character information included in the meta information and data representing the stamp character corresponding to the character stamp information included in the meta information.

A document management system comprising:
the image reading apparatus according to any one of claims 1 to 3;
an information-attached image data storage unit that stores the information-attached image data generated by the information-attached image data generation unit; and
a search server that searches the information-attached image data stored in the information-attached image data storage unit for information-attached image data whose meta information includes a search keyword specified by a user.

An image reading control program that causes a computer built into an image reading apparatus having an image reading unit that reads an image of a document to function as:
a character stamp information receiving unit that accepts input of character stamp information, the character stamp information being information about a stamp character, which is a character to be superimposed as an image on a document image read by the image reading unit, and associating stamp character information indicating the stamp character with stamp position information indicating a position in the document image at which a stamp character image representing the stamp character is to be superimposed, wherein the character stamp information receiving unit accepts, as the stamp character information, character information selected from a plurality of pieces of character information predetermined as options for the stamp character, and accepts, as the stamp position information, information indicating a position selected from information indicating a plurality of predetermined positions in the document image;
a document character information extraction unit that extracts document character information, the document character information being information about a character string included in the document image and associating document character string information indicating the character string with document position information indicating a position at which the character string is arranged in the document image, by executing: a process of converting the document image into an extraction image, which is an image having a resolution predetermined so as to increase the extraction accuracy of the document character information; a process of extracting a character string from the extraction image by executing predetermined character recognition processing on the extraction image; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image that corresponds to the position on the extraction image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information;
a stamped image generation unit that, after the document character information has been extracted by the document character information extraction unit, generates the stamp character image, which represents the stamp character corresponding to the character stamp information accepted by the character stamp information receiving unit and has the same resolution as the document image, converts the stamp character image into a temporary stamp character image having the same resolution as the extraction image, superimposes the temporary stamp character image at the position in the extraction image corresponding to the stamp position information included in the accepted character stamp information, and converts the extraction image on which the temporary stamp character image is superimposed into an image having the same resolution as the document image, thereby generating a stamped image in which the stamp character image is visibly superimposed on the document image; and
an information-attached image data generation unit that generates information-attached image data, which is data combining into one set (i) meta information obtained by adding, to the document character information in which the document character string information and the document position information extracted by the document character information extraction unit are associated, the character stamp information in which the stamp character information and the stamp position information accepted by the character stamp information receiving unit are associated, and (ii) the stamped image generated by the stamped image generation unit.

An image reading control program that causes a computer built into an image reading apparatus having an image reading unit that reads an image of a document to function as:
a character stamp information receiving unit that accepts input of character stamp information, the character stamp information being information about a stamp character, which is a character to be superimposed as an image on a document image read by the image reading unit, and associating stamp character information indicating the stamp character with stamp position information indicating a position in the document image at which a stamp character image representing the stamp character is to be superimposed, wherein the character stamp information receiving unit accepts, as the stamp character information, character information selected from a plurality of pieces of character information predetermined as options for the stamp character, and accepts, as the stamp position information, information indicating a position selected from information indicating a plurality of predetermined positions in the document image;
a document character information extraction unit that extracts document character information, the document character information being information about a character string included in the document image and associating document character string information indicating the character string with document position information indicating a position at which the character string is arranged in the document image, by executing: a process of extracting a character string by executing predetermined character recognition processing on the document image; a process of generating information indicating the extracted character string as the document character string information; a process of acquiring, as the document position information, information indicating the position on the document image from which the character string was extracted; and a process of generating the document character information by associating the document character string information with the document position information;
a stamped image generation unit that, after the document character information has been extracted by the document character information extraction unit, generates the stamp character image, which represents the stamp character corresponding to the character stamp information accepted by the character stamp information receiving unit and has the same resolution as the document image, and superimposes the stamp character image at the position in the document image corresponding to the stamp position information included in the accepted character stamp information, thereby generating a stamped image in which the stamp character image is visibly superimposed on the document image; and
an information-attached image data generation unit that generates information-attached image data, which is data combining into one set (i) meta information obtained by adding, to the document character information in which the document character string information and the document position information extracted by the document character information extraction unit are associated, the character stamp information in which the stamp character information and the stamp position information accepted by the character stamp information receiving unit are associated, and (ii) the stamped image generated by the stamped image generation unit.