JP2011257946A - Image processing apparatus, image processing method, and computer program

Info

Publication number: JP2011257946A
Application number: JP2010131284A
Authority: JP
Inventors: Minoru Taguchi; 穂田口
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2010-06-08
Filing date: 2010-06-08
Publication date: 2011-12-22

Abstract

PROBLEM TO BE SOLVED: To provide an image processing apparatus, an image processing method, and a computer program that extract only the articles a user wants from a document, such as a newspaper or magazine, that carries various articles. SOLUTION: An image processing apparatus 1 includes: reading means 11 for reading a placed document; image processing means 12 for performing image processing on document image data 23g; character recognition means 15 for obtaining document character data 24m by performing character recognition; layout recognition means 16 for recognizing the layout of the document; area data generation means 17 for generating area data 24r on the basis of the recognition result of the layout recognition means 16; search keyword input means 18 for accepting input of a search keyword 24k; and keyword search means 19 for searching for the search keyword 24k. The image processing means 12 extracts an area containing the search keyword 24k from the document image data 23g as extracted image data 23t.

Description

The present invention relates to an image processing apparatus and an image processing method that read a placed document and perform image processing on it, and to a computer program that causes a computer to perform the image processing.

Conventionally, a wide variety of scanner devices that read a flat document such as a sheet of paper as image data, and of copying devices that copy a document, have been devised, and such scanner and copying devices are used increasingly often in offices, homes, and elsewhere.

Against this background, the documents read with scanner and copying devices have become more diverse. For example, when a document carrying various articles, such as a newspaper or magazine, is read, a conventional scanner or copying device reads every article on the document and cannot extract only the articles the user needs. When articles the user does not need are read along with the rest, printing them wastes resources such as toner and paper.

In recent years, as an image editing technique, a technique has been proposed in which the user designates a required area of a virtual document for which no document image data or other image data exists, such as an Internet web page, and only the designated area is output as image data (see, for example, Patent Document 1).

A technique has also been proposed that searches the character information of a document for a keyword and presents the user with priorities based on the extent of the document that contains the keyword (see, for example, Patent Document 2).

Patent Document 1: JP 2008-60677 A
Patent Document 2: JP H08-272782 A

The printer control device described in Patent Document 1 displays a target image on preview display means, and the user designates the required area on the displayed preview image. The user must therefore decide which of the articles shown in the preview image is needed and designate its area. In other words, the printer control device of Patent Document 1 cannot itself determine the area of the article the user needs.

In the document search device described in Patent Document 2, search means searches a database on the basis of a keyword entered by the user and extracts the matching document information. Boundary detection means then scans the content of the extracted document information, detects boundaries such as periods and line breaks, and identifies the ranges into which the document is divided in predetermined units such as sentences and paragraphs. Prioritizing means assigns a priority to the document according to how often the keyword appears in a sentence or paragraph. The document search device of Patent Document 2 can grasp the content of a document, but it cannot grasp whether the document is visually divided, for example by large headlines or by the arrangement of articles.

The present invention has been made to solve the above problems. Its object is to provide an image processing apparatus, an image processing method, and a computer program that can extract only the articles a user wants from a document carrying various articles, such as a newspaper or magazine, by extracting only the areas that contain a search keyword.

An image processing apparatus according to the present invention comprises reading means for reading a placed document as document image data and image processing means for performing image processing on the document image data, and further comprises: character recognition means for performing character recognition on the document image data to obtain document character data corresponding to the characters of the document; layout recognition means for analyzing the document image data to recognize the layout of the document; area data generation means for generating, on the basis of the recognition result of the layout recognition means, area data defining areas corresponding to the layout of the document; search keyword input means for accepting input of a search keyword used for a character search; and keyword search means for searching the document character data for the search keyword, wherein the image processing means extracts, on the basis of the search result of the keyword search means, an area containing the search keyword from the document image data as extracted image data.

With this configuration, extracting only the areas that contain the search keyword makes it possible to extract only the articles the user wants from a document carrying various articles, such as a newspaper or magazine.

In the image processing apparatus according to the present invention, the layout recognition means recognizes the layout by detecting frame lines in the document.

With this configuration, optimal area data can be generated for a document divided by frame lines.

In the image processing apparatus according to the present invention, the layout recognition means recognizes the layout by detecting the spacing between characters in the document.

With this configuration, optimal area data can be generated for a document divided by wide blank lines.

In the image processing apparatus according to the present invention, the layout recognition means recognizes the layout on the basis of a preset value for the character spacing.

With this configuration, the layout of the document can be recognized more clearly.

The image processing apparatus according to the present invention comprises image display means for displaying the extracted image data and image selection means for selecting the extracted image data to be displayed on the image display means.

With this configuration, the optimal extracted image data can be selected and the user can visually check the article extracted from the document.

The image processing apparatus according to the present invention comprises image printing means for printing the extracted image data on paper and size determination means for determining the paper size on the basis of the extracted image data.

With this configuration, the extracted article can be printed on paper of a size suited to the article, which saves paper and toner.

An image processing method according to the present invention causes an image processing apparatus to execute a reading step of reading a placed document as document image data and an image processing step of performing image processing on the document image data, and further causes the image processing apparatus to execute: a character recognition step of performing character recognition on the document image data to obtain document character data corresponding to the characters of the document; a layout recognition step of analyzing the document image data to recognize the layout of the document; an area data generation step of generating, on the basis of the recognition result of the layout recognition step, area data defining areas corresponding to the layout of the document; a search keyword input step of accepting input of a search keyword used for a character search; and a keyword search step of searching the document character data for the search keyword, wherein the image processing step extracts, on the basis of the search result of the keyword search step, an area containing the search keyword from the document image data as extracted image data.

With this configuration, the same functions and effects as those of the image processing apparatus according to the present invention can be achieved.

A computer program according to the present invention causes a computer to execute, in an image processing apparatus, a reading step of reading a placed document as document image data and an image processing step of performing image processing on the document image data, and further causes the computer to execute: a character recognition step of performing character recognition on the document image data to obtain document character data corresponding to the characters of the document; a layout recognition step of analyzing the document image data to recognize the layout of the document; an area data generation step of generating, on the basis of the recognition result of the layout recognition step, area data defining areas corresponding to the layout of the document; a search keyword input step of accepting input of a search keyword used for a character search; and a keyword search step of searching the document character data for the search keyword, wherein the image processing step extracts, on the basis of the search result of the keyword search step, an area containing the search keyword from the document image data as extracted image data.

With the image processing apparatus, image processing method, and computer program according to the present invention, extracting only the areas that contain the search keyword makes it possible to extract only the articles the user wants from a document carrying various articles, such as a newspaper or magazine.

FIG. 1 is a block diagram showing a schematic configuration of an image processing apparatus according to an embodiment of the present invention.
FIG. 2 is an explanatory diagram showing an example of a UI display that combines the operation input means and the image display means of the image processing apparatus according to the embodiment.
FIG. 3 is an explanatory diagram of a document read by the image processing apparatus according to the embodiment, in which (a) shows an example of a document divided by frame lines, (b) shows the areas that divide (a), and (c) shows an article extracted from (a).
FIG. 4 is an explanatory diagram of a document read by the image processing apparatus according to the embodiment, in which (a) shows an example of a document divided by wide blank lines, (b) shows the areas that divide (a), (c) shows an article extracted from (a), and (d) shows another article extracted from (a).
FIG. 5 is a flowchart showing the processing flow when printing is executed by the image processing apparatus according to the embodiment.
FIG. 6 is a schematic sectional view showing the configuration of a digital multifunction peripheral.

An image processing apparatus according to an embodiment of the present invention is described below with reference to the drawings.

FIG. 1 is a block diagram showing a schematic configuration of the image processing apparatus according to the embodiment of the present invention.

The image processing apparatus 1 according to this embodiment includes reading means 11 for reading a placed document as document image data 23g and image processing means 12 for performing image processing on the document image data 23g. The image processing apparatus 1 further includes: character recognition means 15 for performing character recognition on the document image data 23g to obtain document character data 24m corresponding to the characters of the document; layout recognition means 16 for analyzing the document image data 23g to recognize the layout of the document; area data generation means 17 for generating, on the basis of the recognition result of the layout recognition means 16, area data 24r defining areas corresponding to the layout of the document; search keyword input means 18 for accepting input of a search keyword 24k used for a character search; and keyword search means 19 for searching the document character data 24m for the search keyword 24k. On the basis of the search result of the keyword search means 19, the image processing means 12 extracts an area containing the search keyword 24k from the document image data 23g as extracted image data 23t.

With this configuration, extracting only the areas that contain the search keyword 24k makes it possible to extract only the articles the user wants from a document carrying various articles, such as a newspaper or magazine.
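The extraction step described above can be sketched as follows. This is only an illustration under stated assumptions, not the implementation of the apparatus: the `Region` class, the `extract_matching_regions` function, the representation of an image as a list of pixel rows, and the idea that the recognized text of each area is already available are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    # Hypothetical stand-in for one entry of the area data 24r:
    # a rectangle in image pixel coordinates (right/bottom exclusive).
    left: int
    top: int
    right: int
    bottom: int


def extract_matching_regions(image, region_texts, keyword):
    """Return a cropped sub-image for every area whose text contains keyword.

    image        -- document image as a list of pixel rows (assumed format)
    region_texts -- list of (Region, recognized text) pairs
    keyword      -- search keyword entered by the user
    """
    extracted = []
    for region, text in region_texts:
        if keyword in text:
            # Crop the rectangle out of the full document image.
            crop = [row[region.left:region.right]
                    for row in image[region.top:region.bottom]]
            extracted.append(crop)
    return extracted
```

An empty result list corresponds to the case where no area contains the keyword, and a list with several entries to the case where several areas match.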

The image processing apparatus 1 further includes operation input means 13, an image storage area 23, and a search data storage area 24.

The operation input means 13 consists of buttons, a keyboard, a touch panel, and the like with which the user enters instructions. Operations such as reading a document are carried out in response to input on the operation input means 13. The search keyword 24k can also be entered from the operation input means 13. In this embodiment, the operation input means 13 is provided on the image processing apparatus 1 so that it can be operated from outside.

The image storage area 23 stores the document image data 23g and the extracted image data 23t. An ordinary storage device can be used for it, for example a memory composed of volatile DRAM (Dynamic Random Access Memory).

The search data storage area 24 stores the document character data 24m, the area data 24r, and the search keyword 24k. An ordinary storage device can be used for it, for example a memory composed of volatile DRAM. The storage device used for the search data storage area 24 may be shared with the image storage area 23.

The reading means 11 reads the document and stores the generated document image data 23g in the image storage area 23.

The character recognition means 15 applies, for example, optical character recognition (OCR) to the document image data 23g stored in the image storage area 23, recognizes the characters of the document, and generates the document character data 24m.

The layout recognition means 16 recognizes the layout by detecting frame lines in the document. With this configuration, optimal area data 24r can be generated for a document divided by frame lines.
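The source does not say how frame lines are detected. As one possible sketch, assume a binarized image (1 for ink, 0 for background) in which every frame line runs across the full width or height of the page; rows and columns that consist entirely of ink are then treated as frame lines, and the cells between them become the areas. The function names and the full-span assumption are hypothetical.

```python
def find_rule_positions(image, axis):
    """Return indices of rows (axis=0) or columns (axis=1) made up entirely of ink."""
    if axis == 0:
        return [y for y, row in enumerate(image) if all(row)]
    width = len(image[0])
    return [x for x in range(width) if all(row[x] for row in image)]


def spans_between(rules, length):
    """Return half-open (start, end) spans of the positions lying between rules."""
    spans = []
    start = 0
    for pos in rules + [length]:
        if pos > start:
            spans.append((start, pos))
        start = pos + 1  # skip the rule itself; consecutive rules form a thick line
    return spans


def regions_from_frame_lines(image):
    """Return (left, top, right, bottom) cells bounded by full-span frame lines."""
    rows = spans_between(find_rule_positions(image, 0), len(image))
    cols = spans_between(find_rule_positions(image, 1), len(image[0]))
    return [(left, top, right, bottom)
            for top, bottom in rows
            for left, right in cols]
```

Frame lines that cover only part of the page, as is common in newspapers, would need a more general line detector than this sketch provides.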

The layout recognition means 16 may instead recognize the layout by detecting the spacing between characters in the document. With this configuration, optimal area data can be generated for a document divided by wide blank lines.

When detecting the character spacing, the layout recognition means 16 recognizes the layout on the basis of a preset value for the character spacing. With this configuration, the layout of the document can be recognized more clearly. A plurality of setting values may be registered in the image processing apparatus 1. For example, the character spacing may be detected in two orthogonal directions, such as the height direction and the width direction. Further, different setting values may be made registrable for cases where the layout is not recognized correctly.
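A threshold-based split of this kind can be sketched as follows, assuming the vertical extent of each text line is already known. The function name `split_by_gap`, the `(top, bottom)` line representation, and the pixel units are hypothetical; the threshold plays the role of the preset setting value.

```python
def split_by_gap(line_boxes, gap_threshold):
    """Group text lines into blocks, starting a new block at every wide gap.

    line_boxes    -- list of (top, bottom) extents of text lines, sorted top to bottom
    gap_threshold -- preset value; a gap larger than this separates two blocks
    """
    blocks = []
    current = []
    previous_bottom = None
    for top, bottom in line_boxes:
        if previous_bottom is not None and top - previous_bottom > gap_threshold:
            blocks.append(current)
            current = []
        current.append((top, bottom))
        previous_bottom = bottom
    if current:
        blocks.append(current)
    return blocks
```

The same function could be applied to horizontal extents to handle the second, orthogonal direction, and calling it again with a different threshold corresponds to switching to another registered setting value.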

The area data generation means 17 generates the area data 24r on the basis of the recognition result of the layout recognition means 16 and stores it in the search data storage area 24.

The area data 24r indicates, for example, coordinates in the document image data 23g, and an area can be defined by a plurality of coordinates. By associating the area data 24r with the document character data 24m, the characters located in each area can be identified.
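One way to picture this association, assuming each area is a rectangle given by two corner coordinates and each recognized word carries a position, is the sketch below. The function name and both data layouts are hypothetical; the source only states that an area is defined by a plurality of coordinates.

```python
def group_text_by_region(regions, words):
    """Map each area name to the words whose position falls inside it.

    regions -- dict of name -> (left, top, right, bottom), right/bottom exclusive
    words   -- list of (text, x, y) tuples from character recognition
    """
    grouped = {name: [] for name in regions}
    for text, x, y in words:
        for name, (left, top, right, bottom) in regions.items():
            if left <= x < right and top <= y < bottom:
                grouped[name].append(text)
                break  # a word belongs to at most one area
    return grouped
```

Words that fall outside every area are simply left out of the result.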

The search keyword input means 18 stores the search keyword 24k entered through the operation input means 13 in the search data storage area 24. The search keyword input means 18 may accept a plurality of search keywords 24k. Searching with a plurality of search keywords 24k narrows down the target articles and improves search accuracy. Alternatively, a match with any one of the search keywords 24k may be used as the condition, which makes the search more flexible.
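The two ways of combining several keywords can be sketched as a single helper. The function name `matches` and the `mode` parameter are hypothetical; "all" corresponds to narrowing the search and "any" to the more flexible condition.

```python
def matches(text, keywords, mode="all"):
    """Return True if text satisfies the keywords.

    mode="all" -- every keyword must appear (narrows the target articles)
    mode="any" -- one matching keyword is enough (more flexible search)
    """
    if mode == "all":
        return all(k in text for k in keywords)
    if mode == "any":
        return any(k in text for k in keywords)
    raise ValueError(f"unknown mode: {mode!r}")
```

For an article containing both "DEF" and "GHI", `matches(text, ["DEF", "GHI"])` is true in either mode, whereas an article containing only "GHI" passes only with `mode="any"`.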

The keyword search means 19 searches for whether the characters of the document include characters that match the search keyword 24k. If no matching characters exist, it outputs, as the search result, that no area contains the search keyword 24k. It may also output, as the search result, that a plurality of areas contain the search keyword 24k.

The image processing apparatus 1 includes image display means 14 for displaying the extracted image data 23t and image selection means 21 for selecting the extracted image data 23t to be displayed on the image display means 14. With this configuration, the optimal extracted image data 23t can be selected and the user can visually check the article extracted from the document.

The image display means 14 displays the extracted image data 23t and other data and is, for example, a liquid crystal display. In this embodiment, the image display means 14 is provided on the surface of the image processing apparatus 1. The image display means 14 and the operation input means 13 may be integrated into a touch panel.

The image processing apparatus 1 includes image printing means 20 for printing the extracted image data 23t on paper and size determination means 22 for determining the paper size on the basis of the extracted image data 23t. With this configuration, the extracted article can be printed on paper of a size suited to the article, which saves paper and toner.

The image printing means 20 prints image data (for example, documents and photographs) and functions, for example, as the image forming section of a digital multifunction peripheral that has a copy function, a scanner function, and so on. The digital multifunction peripheral is described in detail later.

The size determination means 22 can determine paper of an appropriate size from the area data 24r. That is, the paper size is determined so that the extracted article fits on the paper.
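A minimal sketch of such a size decision is shown below. It assumes that the size of the extracted area has already been converted from pixels to millimetres (which requires the scan resolution), that only ISO 216 A-series sheets are available, and that the article may be rotated; the table and the function name `choose_paper` are hypothetical.

```python
# ISO 216 A-series sizes in millimetres (short side, long side), smallest first.
# The selection of sizes is an assumption; a real device would list its own trays.
PAPER_SIZES_MM = [("A5", 148, 210), ("A4", 210, 297), ("A3", 297, 420)]


def choose_paper(width_mm, height_mm):
    """Return the smallest listed sheet that holds the area in either orientation.

    Returns None when the area does not fit on any listed sheet.
    """
    short, long_ = sorted((width_mm, height_mm))
    for name, sheet_short, sheet_long in PAPER_SIZES_MM:
        if short <= sheet_short and long_ <= sheet_long:
            return name
    return None
```

Ordering the table from smallest to largest is what makes the first fitting sheet also the most economical one.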

The image processing means 12, the character recognition means 15, the layout recognition means 16, the area data generation means 17, the search keyword input means 18, the keyword search means 19, the image selection means 21, and the size determination means 22 are realized by a computer program installed in advance on a CPU (central processing unit, not shown) of the image processing apparatus 1.

FIG. 2 is an explanatory diagram showing an example of a UI display that combines the operation input means and the image display means of the image processing apparatus according to the embodiment of the present invention.

The UI (user interface) display 30 functions as the operation input means 13 and the image display means 14 and is provided on the image processing apparatus 1 so that it can be operated from outside. The user can enter operations for the image processing apparatus 1 by selecting the operation input means 13.

The UI display 30 includes a display area 31, a keyboard 32, a start key 33, a clear key 34, an extraction key 35, an image determination key 36, and an image selection key 37. In this embodiment, the display area 31 is a touch panel and therefore also functions as the operation input means 13, but the present invention is not limited to this. In this embodiment, the keyboard 32, the start key 33, the clear key 34, the extraction key 35, the image determination key 36, and the image selection key 37 also function as the operation input means 13.

The display area 31 displays image data such as the extracted image data 23t and the document image data 23g, and shows the image data selected by the image selection means 21 as a print preview.

With the keyboard 32, the user can enter, for example, the search keyword 24k.

The start key 33 is used to enter an instruction that causes the image printing means 20 to print. When the image processing means 12 has not performed image processing, the document image data 23g is printed.

The clear key 34 is used to enter an instruction that causes the image processing apparatus 1 to stop its operation.

The extraction key 35 is used to enter an instruction that causes the image processing means 12 to perform image processing.

The image determination key 36 is used to enter an instruction that confirms the extracted image data 23t. When the start key 33 is selected, the extracted image data 23t confirmed with the image determination key 36 is printed. In the display area 31, the message "OK with this preview screen?" is shown next to the image determination key 36.

The image selection key 37 is used to enter an instruction that selects the extracted image data 23t. That is, when an article the user does not want is displayed, the user can select the image selection key 37 to display the article the user does want. In the display area 31, the message "Show next selected document example" is shown next to the image selection key 37.
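The interplay of the image selection key 37 and the image determination key 36 can be pictured as stepping through a list of candidates and then fixing one of them. The sketch below is purely illustrative: the class name, the method names, and the wrap-around behaviour after the last candidate are assumptions not stated in the source.

```python
class CandidateSelector:
    """Cycles through extracted candidates and records the confirmed one."""

    def __init__(self, candidates):
        if not candidates:
            raise ValueError("no candidates to select from")
        self._candidates = list(candidates)
        self._index = 0
        self.confirmed = None

    @property
    def current(self):
        """Candidate currently shown in the preview."""
        return self._candidates[self._index]

    def next(self):
        """Image selection key: show the next candidate (wrap-around is assumed)."""
        self._index = (self._index + 1) % len(self._candidates)
        return self.current

    def confirm(self):
        """Image determination key: fix the candidate currently shown."""
        self.confirmed = self.current
        return self.confirmed
```

Pressing the start key would then print whatever `confirmed` holds.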

FIG. 3 is an explanatory diagram of a document read by the image processing apparatus according to the embodiment of the present invention, in which (a) shows an example of a document divided by frame lines, (b) shows the areas that divide (a), and (c) shows an article extracted from (a).

The document G1 carries an article K1 containing the word "ABC", an article K2 containing the word "DEF", an article K3 containing the word "GHI", and an article K4 containing the word "JKL", each separated from the others by frame lines.

By recognizing the layout from the frame lines of the document G1, an area R1 corresponding to the article K1, an area R2 corresponding to the article K2, an area R3 corresponding to the article K3, and an area R4 corresponding to the article K4 can be defined.

ユーザが単語「ＧＨＩ」を検索キーワード２４ｋとした場合、画像処理手段１２によって、領域Ｒ３が抽出画像データ２３ｔとして抽出される。つまり、画像表示手段１４には、図３（ｃ）に示す記事Ｋ３が表示される。このようにして、ユーザが必要とする記事だけを抽出することができる。また、記事Ｋ３のみを抽出することにより、原稿Ｇ１より小さいサイズの用紙に印刷することができる。 When the user sets the word “GHI” as the search keyword 24k, the image processing means 12 extracts the region R3 as the extracted image data 23t. That is, the article K3 shown in FIG. 3(c) is displayed on the image display means 14. In this way, only the article that the user needs can be extracted. Further, because only the article K3 is extracted, it can be printed on a sheet smaller than the document G1.
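The keyword-to-region lookup described for the document G1 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Region` class, the pixel boxes, the placeholder article texts, and the helper names `find_regions` and `crop` are all assumptions introduced here, and the image is modelled as a plain list of pixel rows.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """One layout region with the text recognized inside it (hypothetical type)."""
    name: str
    box: tuple  # (left, top, right, bottom) in pixels, right/bottom exclusive
    text: str


def find_regions(regions, keyword):
    """Return every region whose recognized text contains the keyword."""
    return [r for r in regions if keyword in r.text]


def crop(image, box):
    """Cut the rectangle `box` out of an image stored as a list of pixel rows."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


# Assumed geometry for G1: four framed articles on a tiny 8x4 "page".
G1_REGIONS = [
    Region("R1", (0, 0, 4, 2), "... ABC ..."),
    Region("R2", (4, 0, 8, 2), "... DEF ..."),
    Region("R3", (0, 2, 4, 4), "... GHI ..."),
    Region("R4", (4, 2, 8, 4), "... JKL ..."),
]

if __name__ == "__main__":
    page = [[y * 8 + x for x in range(8)] for y in range(4)]
    for region in find_regions(G1_REGIONS, "GHI"):
        print(region.name, crop(page, region.box))
```

Searching for "GHI" selects only the region that holds article K3, and the cropped rectangle is smaller than the page, which is what allows printing on a smaller sheet.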

図４は、本発明の実施の形態に係る画像処理装置で読み取られる原稿に関する説明図であり、（ａ）は、広い空白行で区切られた原稿の一例を示す説明図、（ｂ）は、（ａ）を区切る領域を示す説明図、（ｃ）は、（ａ）から抽出された記事を示す説明図、（ｄ）は、（ａ）から抽出された他の記事を示す説明図である。 FIG. 4 is a set of explanatory diagrams relating to a document read by the image processing apparatus according to the embodiment of the present invention: (a) shows an example of a document divided by wide blank lines, (b) shows the regions that divide (a), (c) shows an article extracted from (a), and (d) shows another article extracted from (a).

原稿Ｇ２には、単語「ＡＢＣ」を含む記事Ｋ５、単語「ＤＥＦ」と単語「ＧＨＩ」とを含む記事Ｋ６、単語「ＧＨＩ」を含む記事Ｋ７、および単語「ＪＫＬ」を含む記事Ｋ８が記載されており、それぞれが広い空白行で区切られている。つまり、空白行には文字が記載されていないので、空白行を間に挟む文字の間隔が特に広くなっている。 The document G2 contains an article K5 including the word “ABC”, an article K6 including the words “DEF” and “GHI”, an article K7 including the word “GHI”, and an article K8 including the word “JKL”, each separated from the others by a wide blank line. That is, since no characters are written in a blank line, the spacing between the characters on either side of the blank line is particularly wide.

原稿Ｇ２の空白行からレイアウトを認識することによって、記事Ｋ５に対応する領域Ｒ５、記事Ｋ６に対応する領域Ｒ６、記事Ｋ７に対応する領域Ｒ７、および記事Ｋ８に対応する領域Ｒ８を画定することができる。 By recognizing the layout from the blank lines of the document G2, a region R5 corresponding to the article K5, a region R6 corresponding to the article K6, a region R7 corresponding to the article K7, and a region R8 corresponding to the article K8 can be defined.

ユーザが単語「ＧＨＩ」を検索キーワード２４ｋとした場合、画像処理手段１２によって、領域Ｒ６と領域Ｒ７とが抽出画像データ２３ｔとして抽出される。つまり、画像表示手段１４には、図４（ｃ）に示す記事Ｋ７または図４（ｄ）に示す記事Ｋ６が表示される。複数の記事が抽出された場合、ユーザは、画像選択手段２１によって必要な記事を選択することができる。 When the user sets the word “GHI” as the search keyword 24k, the image processing means 12 extracts the region R6 and the region R7 as the extracted image data 23t. That is, the image display means 14 displays the article K7 shown in FIG. 4(c) or the article K6 shown in FIG. 4(d). When a plurality of articles are extracted, the user can select the required article with the image selection means 21.
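When several regions match, the user steps through the candidates one at a time. The sketch below shows one way this could work; the `ImageSelector` class is hypothetical, and wrapping around to the first candidate after the last one is an assumption, since the text only says that the next candidate is displayed.

```python
class ImageSelector:
    """Cycles through extracted candidates, one per press of the selection key."""

    def __init__(self, candidates):
        if not candidates:
            raise ValueError("at least one extracted image is required")
        self._candidates = list(candidates)
        self._index = 0

    @property
    def current(self):
        """The candidate currently shown on the preview screen."""
        return self._candidates[self._index]

    def next(self):
        """Advance to the next candidate; wrap-around is an assumption."""
        self._index = (self._index + 1) % len(self._candidates)
        return self.current


# Recognized text per region of G2 (placeholder strings, not real OCR output).
G2_TEXT = {"R5": "ABC", "R6": "DEF GHI", "R7": "GHI", "R8": "JKL"}

if __name__ == "__main__":
    hits = [name for name, text in G2_TEXT.items() if "GHI" in text]
    selector = ImageSelector(hits)
    print(selector.current)  # first candidate shown in the preview
    print(selector.next())   # candidate shown after pressing the selection key
```

With a single candidate, `next()` simply returns the same item, which matches the remark in the flow description that selection can be skipped when only one extracted image exists.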

図５は、本発明の実施の形態に係る画像処理装置で印刷を実行する場合の処理フローを示すフロー図である。 FIG. 5 is a flowchart showing a processing flow when printing is executed by the image processing apparatus according to the embodiment of the present invention.

ステップＳ０１では、ユーザによる抽出キー３５の選択を待つ。抽出キー３５が押下げられない状態で、原稿の読み取りが指示されると（ステップＳ０１：ＮＯ）、ステップＳ０２へ進む。抽出キー３５が押下げられたときは（ステップＳ０１：ＹＥＳ）、ステップＳ０４へ進む。 In step S01, the user waits for selection of the extraction key 35. If reading of the document is instructed in a state where the extraction key 35 is not depressed (step S01: NO), the process proceeds to step S02. When the extraction key 35 is depressed (step S01: YES), the process proceeds to step S04.

ステップＳ０２では、読取手段１１によって、原稿が読み取られる。 In step S02, the document is read by the reading unit 11.

ステップＳ０３では、画像印刷手段２０によって、原稿画像データ２３ｇが印刷され、処理が終了する。 In step S03, the original image data 23g is printed by the image printing means 20, and the process is terminated.

ステップＳ０４では、検索キーワード入力手段１８によって、検索キーワード２４ｋの入力を受け付ける。 In step S04, the search keyword input unit 18 receives input of the search keyword 24k.

ステップＳ０５では、読取手段１１によって、原稿が読み取られる。 In step S05, the document is read by the reading unit 11.

ステップＳ０６では、文字認識手段１５によって、原稿画像データ２３ｇに対する文字認識が施される。 In step S06, the character recognition unit 15 performs character recognition on the document image data 23g.

ステップＳ０７では、レイアウト認識手段１６によって、原稿のレイアウトが認識される。ここで、原稿の枠線か空白行のどちらでレイアウトを認識するかは、適宜選択すればよい。例えば、まず、枠線を検出するようにして、枠線が検出されなかったときには、空白行を検出してレイアウトを認識するようにしてもよい。生成された領域データ２４ｒと原稿文字データ２４ｍとを関連付けることで、原稿の文字がどの領域に記載されているかを把握できる。 In step S07, the layout recognition unit 16 recognizes the document layout. Here, it is only necessary to appropriately select whether the layout is recognized by a frame line or a blank line of the document. For example, first, a frame line may be detected, and when no frame line is detected, a blank line may be detected to recognize the layout. By associating the generated area data 24r with the original character data 24m, it is possible to grasp in which area the original characters are described.
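The fallback from frame lines to blank lines can be sketched as below. The sketch assumes frame detection has already produced a (possibly empty) list of boxes and that each pixel row has been reduced to a flag saying whether it contains any ink. The function names and the `min_gap` threshold are illustrative; `min_gap` plays the role of the preset value for character spacing mentioned later in the description.

```python
def split_by_blank_rows(row_has_ink, min_gap):
    """Split a page into (top, bottom) row ranges separated by wide blank bands.

    row_has_ink: one boolean per pixel row, True if the row contains any ink.
    min_gap: number of consecutive blank rows that counts as a separator.
    Narrower gaps (ordinary line spacing) stay inside one region.
    """
    regions = []
    start = None      # first inked row of the region being built
    last_ink = None   # most recent inked row seen so far
    for y, ink in enumerate(row_has_ink):
        if not ink:
            continue
        if start is None:
            start = y
        elif y - last_ink - 1 >= min_gap:
            regions.append((start, last_ink + 1))
            start = y
        last_ink = y
    if start is not None:
        regions.append((start, last_ink + 1))
    return regions


def recognize_layout(frame_boxes, row_has_ink, min_gap):
    """Prefer frame lines; fall back to blank rows when no frame was detected."""
    if frame_boxes:
        return "frame", list(frame_boxes)
    return "blank", split_by_blank_rows(row_has_ink, min_gap)


if __name__ == "__main__":
    rows = [True, True, False, True, True, False, False, False, True, True]
    print(recognize_layout([], rows, min_gap=3))
```

Choosing `min_gap` larger than the normal line spacing keeps the lines of one article together while still splitting on the wider gap between articles.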

ステップＳ０８では、キーワード検索手段１９によって、検索キーワード２４ｋの検索が行われる。 In step S08, the keyword search unit 19 searches for the search keyword 24k.

ステップＳ０９では、原稿の文字に検索キーワード２４ｋが含まれるかどうかが判定される。検索キーワード２４ｋが含まれるときは（ステップＳ０９：ＹＥＳ）、ステップＳ１０に進む。検索キーワード２４ｋが含まれないときは（ステップＳ０９：ＮＯ）、ステップＳ０３へ進む。なお、ステップＳ０３へ進まずに、検索キーワード２４ｋの入力を受け付けて、新たな検索キーワード２４ｋを用いた検索をやり直してもよい（ステップＳ０４→ステップＳ０８→ステップＳ０９の順に処理を行う。）。 In step S09, it is determined whether or not the search keyword 24k is included in the characters of the document. When the search keyword 24k is included (step S09: YES), the process proceeds to step S10. When the search keyword 24k is not included (step S09: NO), the process proceeds to step S03. Instead of proceeding to step S03, the input of the search keyword 24k may be accepted and the search using the new search keyword 24k may be performed again (processing is performed in the order of step S04 → step S08 → step S09).

ステップＳ１０では、画像処理手段１２によって、抽出画像データ２３ｔが生成される。なお、検索キーワード２４ｋを含む領域が複数存在する場合は、複数の抽出画像データ２３ｔが生成される。 In step S10, the image processing means 12 generates the extracted image data 23t. When there are a plurality of regions including the search keyword 24k, one piece of extracted image data 23t is generated for each of them.

ステップＳ１１では、画像選択手段２１によって選択された抽出画像データ２３ｔが、画像表示手段１４に表示される。 In step S11, the extracted image data 23t selected by the image selection means 21 is displayed on the image display means 14.

ステップＳ１２では、ユーザからの印刷の実行の指示を待つ。スタートキー３３が押下げられたときは（ステップＳ１２：ＹＥＳ）、ステップＳ１４へ進む。画像選択キー３７が押下げられたときは（ステップＳ１２：ＮＯ）、ステップＳ１３へ進む。 In step S12, the process waits for a print execution instruction from the user. When the start key 33 is pressed (step S12: YES), the process proceeds to step S14. When the image selection key 37 is pressed (step S12: NO), the process proceeds to step S13.

ステップＳ１３では、画像選択手段２１によって、画像表示手段１４に表示する抽出画像データ２３ｔが選択され、ステップＳ１１に戻って、選択された抽出画像データ２３ｔが表示される。 In step S13, the image selection means 21 selects the extracted image data 23t to be displayed on the image display means 14, and the process returns to step S11 to display the selected extracted image data 23t.

なお、画像処理手段１２によって生成された抽出画像データ２３ｔが１つであるときは、他の抽出画像データ２３ｔを選択することができないので、ステップＳ１３を省略してステップＳ１１へ進むことができる。 Note that when there is only one extracted image data 23t generated by the image processing means 12, it is not possible to select another extracted image data 23t, and therefore it is possible to skip step S13 and proceed to step S11.

ステップＳ１４では、サイズ決定手段２２によって、画像印刷手段２０で印刷する際の用紙のサイズが決定される。 In step S14, the size determining means 22 determines the size of the paper to be used for printing by the image printing means 20.

ステップＳ１５では、画像印刷手段２０によって、抽出画像データ２３ｔが印刷されて、処理が終了する。 In step S15, the extracted image data 23t is printed by the image printing means 20, and the process ends.

以上、本発明の実施の形態に係る画像処理装置で印刷を実行する場合の処理フローについて説明したが、本発明はこれに限定されず、例えば、スキャナ装置においては、ステップＳ１１の後で、処理を終了する処理フローとしてもよい。 The processing flow for printing with the image processing apparatus according to the embodiment of the present invention has been described above. However, the present invention is not limited to this; for example, in a scanner apparatus, the processing flow may end after step S11.
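The branch structure of steps S01 to S15 can be traced with a small sketch. It is a simplification under stated assumptions: reading, character recognition, and layout recognition (S05 to S07) are collapsed into a ready-made mapping from region name to recognized text, a press of the image selection key is modelled as a count, and the function name `run_copy_job` is hypothetical.

```python
def run_copy_job(extract_key_pressed, keyword, region_text, next_presses=0):
    """Return (visited steps, what gets printed) for one pass through the flow.

    region_text: mapping of region name to recognized text; stands in for the
    results of steps S05 to S07 (an assumption made for brevity).
    next_presses: how many times the image selection key is pressed at S12.
    """
    trace = ["S01"]
    if not extract_key_pressed:
        # Plain copy: read the document and print all of it.
        trace += ["S02", "S03"]
        return trace, "whole document"

    trace += ["S04", "S05", "S06", "S07", "S08", "S09"]
    hits = [name for name, text in region_text.items() if keyword in text]
    if not hits:
        # Keyword not found: fall back to printing the whole document.
        trace.append("S03")
        return trace, "whole document"

    trace += ["S10", "S11", "S12"]
    index = 0
    for _ in range(next_presses):
        # S13 picks another candidate, then the preview is shown again.
        index = (index + 1) % len(hits)
        trace += ["S13", "S11", "S12"]

    trace += ["S14", "S15"]
    return trace, hits[index]


if __name__ == "__main__":
    g2 = {"R5": "ABC", "R6": "DEF GHI", "R7": "GHI", "R8": "JKL"}
    steps, printed = run_copy_job(True, "GHI", g2, next_presses=1)
    print(" -> ".join(steps), "| printed:", printed)
```

The alternative mentioned for step S09, asking for a new keyword instead of printing the whole document, would replace the fallback branch with a loop back to S04.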

次に、本発明に係る画像処理装置としてのデジタル複合機について説明する。本発明は、例えば、次に示すようなデジタル複合機によって機能を実現できる。 Next, a digital multifunction machine as an image processing apparatus according to the present invention will be described. The present invention can realize the functions by, for example, the following digital multi-function peripheral.

図６は、デジタル複合機の構成を示す概略断面図である。 FIG. 6 is a schematic cross-sectional view showing the configuration of the digital multi-function peripheral.

デジタル複合機５０は、原稿読取部５１と、画像形成部５２とで構成され、コピー機能、スキャナ機能を有するものである。 The digital multi-function device 50 includes a document reading unit 51 and an image forming unit 52, and has a copy function and a scanner function.

画像処理手段１２、文字認識手段１５、レイアウト認識手段１６、領域データ生成手段１７、検索キーワード入力手段１８、キーワード検索手段１９、画像選択手段２１およびサイズ決定手段２２は、デジタル複合機５０が有するＣＰＵ（中央処理装置。不図示）に対して予め組み込まれたコンピュータプログラム（コンピュータ）によって実現される。 The image processing means 12, the character recognition means 15, the layout recognition means 16, the area data generation means 17, the search keyword input means 18, the keyword search means 19, the image selection means 21, and the size determining means 22 are realized by a computer program (computer) incorporated in advance in a CPU (central processing unit, not shown) of the digital multi-function peripheral 50.

原稿読取部５１は、載置された原稿を読み取るものであって、読取手段１１として機能し、原稿搬送部５１ａ、原稿走査部５１ｂおよび画像読取素子５１ｃを備える。 The document reading unit 51 reads a placed document, functions as the reading unit 11, and includes a document transport unit 51a, a document scanning unit 51b, and an image reading element 51c.

画像形成部５２は、原稿読取部５１により読み取られた原稿の画像または外部装置から受信した画像をカラーもしくは単色で記録用紙に印刷するものであって、画像印刷手段２０として機能する。画像形成部５２には、矢符Ａ方向に回転する感光体ドラム５３を備え、感光体ドラム５３の周囲には、帯電チャージャ５４、現像装置５５、転写チャージャ５６、クリーニングユニット５７およびレーザスキャナユニット５８が配置されている。また、画像形成部５２は、用紙トレイ６１、給紙ローラ６２、レジストローラ６３、搬送手段６４および定着ユニット６５を備える。 The image forming unit 52 prints an image of a document read by the document reading unit 51, or an image received from an external apparatus, on recording paper in color or in a single color, and functions as the image printing means 20. The image forming unit 52 includes a photosensitive drum 53 that rotates in the direction of arrow A; a charging charger 54, a developing device 55, a transfer charger 56, a cleaning unit 57, and a laser scanner unit 58 are arranged around the photosensitive drum 53. The image forming unit 52 also includes a paper tray 61, a paper feed roller 62, a registration roller 63, a conveying means 64, and a fixing unit 65.

デジタル複合機５０では、まず、原稿読取部５１によって、載置された原稿が読み取られる。つまり、原稿搬送部５１ａによって原稿が搬送され、光源（図示しない）から原稿に光が照射される。原稿走査部５１ｂによって、原稿からの反射光が画像読取素子５１ｃに導かれ、画像読取素子５１ｃは、反射光を受光して原稿の画像を読み取る。読み取られた原稿の画像は、画像データとしてレーザスキャナユニット５８に入力される。 In the digital multi-function peripheral 50, the placed document is first read by the document reading unit 51. That is, the document is transported by the document transport unit 51a, and light from a light source (not shown) is directed onto the document. The document scanning unit 51b guides the light reflected from the document to the image reading element 51c, which receives the reflected light and reads the image of the document. The read document image is input to the laser scanner unit 58 as image data.

また、感光体ドラム５３は、帯電チャージャ５４によって表面が一様に帯電される。レーザスキャナユニット５８は、帯電された感光体ドラム５３を画像データに応じて露光して、画像データに対応する静電潜像を感光体ドラム５３に形成する。現像装置５５は、静電潜像にトナーを付着して、トナー像を形成する。 The surface of the photosensitive drum 53 is uniformly charged by the charging charger 54. The laser scanner unit 58 exposes the charged photosensitive drum 53 according to the image data and forms an electrostatic latent image corresponding to the image data on the photosensitive drum 53. The developing device 55 applies toner to the electrostatic latent image to form a toner image.

上記の動作に合わせて、給紙ローラ６２は、給紙トレイ６１に収容された用紙をレジストローラ６３に送る。レジストローラ６３は、所定のタイミングで用紙を感光体ドラム５３の転写部位に送り、転写チャージャ５６は、用紙の表面にトナー像を転写する。クリーニングユニット５７は、トナー像が転写された後の感光体ドラム５３の表面をクリーニングして、次の帯電に備える。 In accordance with the above operation, the paper feed roller 62 sends the paper stored in the paper feed tray 61 to the registration roller 63. The registration roller 63 sends the sheet to the transfer portion of the photosensitive drum 53 at a predetermined timing, and the transfer charger 56 transfers the toner image onto the surface of the sheet. The cleaning unit 57 cleans the surface of the photosensitive drum 53 after the toner image is transferred, and prepares for the next charging.

搬送手段６４は、トナー像が転写された用紙を定着ユニット６５へ搬送する。定着ユニット６５は、用紙に転写されたトナー像を熱定着して、用紙を外部へ送出する。 The conveying unit 64 conveys the sheet on which the toner image is transferred to the fixing unit 65. The fixing unit 65 thermally fixes the toner image transferred to the paper and sends the paper to the outside.

本実施の形態に係る画像処理方法は、画像処理装置に、載置された原稿を原稿画像データとして読み取らせる読取ステップと、前記原稿画像データに対して画像処理を施させる画像処理ステップとを実行させる画像処理方法であって、画像処理装置に、前記原稿画像データに文字認識を施して、前記原稿の文字に対応する原稿文字データを取得させる文字認識ステップと、画像処理装置に、前記原稿画像データを解析して、前記原稿のレイアウトを認識させるレイアウト認識ステップと、画像処理装置に、前記レイアウト認識ステップの認識結果に基づいて、前記原稿のレイアウトに対応した領域を画定する領域データを生成させる領域データ生成ステップと、画像処理装置に、文字の検索に用いる検索キーワードの入力を受け付けさせる検索キーワード入力ステップと、画像処理装置に、前記原稿文字データに対して、前記検索キーワードの検索を行わせるキーワード検索ステップとを実行させ、前記画像処理ステップは、前記キーワード検索ステップの検索結果に基づいて、前記原稿画像データから前記検索キーワードを含む領域を抽出して抽出画像データとすることを特徴とする。この構成によると、本発明に係る画像処理装置と同様の機能、作用を実現することができる。 The image processing method according to the present embodiment causes an image processing apparatus to execute a reading step of reading a placed document as document image data and an image processing step of performing image processing on the document image data. The method further causes the image processing apparatus to execute: a character recognition step of performing character recognition on the document image data to obtain document character data corresponding to the characters of the document; a layout recognition step of analyzing the document image data to recognize the layout of the document; an area data generation step of generating, based on the recognition result of the layout recognition step, area data that defines areas corresponding to the layout of the document; a search keyword input step of accepting input of a search keyword used for character search; and a keyword search step of searching the document character data for the search keyword. In the image processing step, an area including the search keyword is extracted from the document image data as extracted image data, based on the search result of the keyword search step. With this configuration, the same functions and effects as those of the image processing apparatus according to the present invention can be achieved.

本実施の形態に係る画像処理方法では、前記レイアウト認識ステップは、前記原稿の枠線を検出してレイアウトを認識させることを特徴とする。 In the image processing method according to the present embodiment, the layout recognition step detects a frame line of the document and recognizes the layout.

本実施の形態に係る画像処理方法では、前記レイアウト認識ステップは、前記原稿の文字の間隔を検出してレイアウトを認識させることを特徴とする。 In the image processing method according to the present embodiment, in the layout recognition step, the layout is recognized by detecting the character spacing of the document.

本実施の形態に係る画像処理方法では、前記レイアウト認識ステップは、文字の間隔に対して予め設定された設定値に基づいて、レイアウトを認識させることを特徴とする。 In the image processing method according to the present embodiment, the layout recognition step is characterized in that a layout is recognized based on a preset value set for a character interval.

本実施の形態に係る画像処理方法では、画像処理装置に、前記抽出画像データを画像表示手段に表示させる画像表示ステップと、画像処理装置に、前記画像表示手段に表示する前記抽出画像データを選択させる画像選択ステップとを実行させることを特徴とする。 In the image processing method according to the present embodiment, an image display step for causing the image processing device to display the extracted image data on the image display means, and the image processing device to select the extracted image data to be displayed on the image display means. And an image selection step to be executed.

本実施の形態に係る画像処理方法では、画像処理装置に、前記抽出画像データに基づいて、抽出画像データを印刷するときの用紙のサイズを決定させるサイズ決定ステップを実行させることを特徴とする。 The image processing method according to the present embodiment is characterized by causing the image processing apparatus to execute a size determination step for determining the size of the paper when printing the extracted image data based on the extracted image data.
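The description does not say how the paper size is derived from the extracted image data. One plausible rule, shown here purely as an assumption, is to pick the smallest standard sheet that the extracted region fits on at its scanned size. The candidate list (ISO A-series dimensions in millimetres), the `dpi` parameter, and the function name `choose_paper` are illustrative; margins and scaling are ignored.

```python
MM_PER_INCH = 25.4

# ISO A-series sheets as (name, short side mm, long side mm), smallest first.
PAPER_SIZES_MM = [
    ("A6", 105, 148),
    ("A5", 148, 210),
    ("A4", 210, 297),
    ("A3", 297, 420),
]


def choose_paper(width_px, height_px, dpi):
    """Return the smallest listed sheet that holds the region, or None.

    The region may be rotated by 90 degrees, so only its short and long
    sides are compared with the short and long sides of each sheet.
    """
    width_mm = width_px / dpi * MM_PER_INCH
    height_mm = height_px / dpi * MM_PER_INCH
    short_side, long_side = sorted((width_mm, height_mm))
    for name, paper_short, paper_long in PAPER_SIZES_MM:
        if short_side <= paper_short and long_side <= paper_long:
            return name
    return None


if __name__ == "__main__":
    # A 2400 x 3000 pixel region scanned at 600 dpi is about 102 x 127 mm.
    print(choose_paper(2400, 3000, dpi=600))
```

A real device would also have to account for printable margins and for which trays are actually loaded, neither of which is modelled here.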

本実施の形態に係るコンピュータプログラムは、コンピュータに、載置された原稿を原稿画像データとして読み取らせる読取ステップと、前記原稿画像データに対して画像処理を施させる画像処理ステップとを画像処理装置で実行させるコンピュータプログラムであって、コンピュータに、前記原稿画像データに文字認識を施して、前記原稿の文字に対応する原稿文字データを取得させる文字認識ステップと、コンピュータに、前記原稿画像データを解析して、前記原稿のレイアウトを認識させるレイアウト認識ステップと、コンピュータに、前記レイアウト認識ステップの認識結果に基づいて、前記原稿のレイアウトに対応した領域を画定する領域データを生成させる領域データ生成ステップと、コンピュータに、文字の検索に用いる検索キーワードの入力を受け付けさせる検索キーワード入力ステップと、コンピュータに、前記原稿文字データに対して、前記検索キーワードの検索を行わせるキーワード検索ステップとを実行させ、前記画像処理ステップは、前記キーワード検索ステップの検索結果に基づいて、前記原稿画像データから前記検索キーワードを含む領域を抽出して抽出画像データとすることを特徴とする。この構成によると、本発明に係る画像処理装置と同様の機能、作用を実現することができる。 The computer program according to the present embodiment causes a computer to execute, in an image processing apparatus, a reading step of reading a placed document as document image data and an image processing step of performing image processing on the document image data. The program further causes the computer to execute: a character recognition step of performing character recognition on the document image data to obtain document character data corresponding to the characters of the document; a layout recognition step of analyzing the document image data to recognize the layout of the document; an area data generation step of generating, based on the recognition result of the layout recognition step, area data that defines areas corresponding to the layout of the document; a search keyword input step of accepting input of a search keyword used for character search; and a keyword search step of searching the document character data for the search keyword. In the image processing step, an area including the search keyword is extracted from the document image data as extracted image data, based on the search result of the keyword search step. With this configuration, the same functions and effects as those of the image processing apparatus according to the present invention can be achieved.

本実施の形態に係るコンピュータプログラムでは、前記レイアウト認識ステップは、前記原稿の枠線を検出してレイアウトを認識させることを特徴とする。 In the computer program according to the present embodiment, the layout recognition step detects a frame line of the document and recognizes the layout.

本実施の形態に係るコンピュータプログラムでは、前記レイアウト認識ステップは、前記原稿の文字の間隔を検出してレイアウトを認識させることを特徴とする。 The computer program according to the present embodiment is characterized in that the layout recognition step recognizes a layout by detecting an interval between characters of the document.

本実施の形態に係るコンピュータプログラムでは、前記レイアウト認識ステップは、文字の間隔に対して予め設定された設定値に基づいて、レイアウトを認識させることを特徴とする。 In the computer program according to the present embodiment, the layout recognition step is characterized in that the layout is recognized based on a preset value set for the character spacing.

本実施の形態に係るコンピュータプログラムでは、コンピュータに、前記抽出画像データを画像表示手段に表示させる画像表示ステップと、コンピュータに、前記画像表示手段に表示する前記抽出画像データを選択させる画像選択ステップとを実行させることを特徴とする。 In the computer program according to the present embodiment, an image display step for causing the computer to display the extracted image data on the image display means, and an image selection step for causing the computer to select the extracted image data to be displayed on the image display means; Is executed.

本実施の形態に係るコンピュータプログラムでは、コンピュータに、前記抽出画像データに基づいて、抽出画像データを印刷するときの用紙のサイズを決定させるサイズ決定ステップを実行させることを特徴とする。 The computer program according to the present embodiment is characterized in that the computer is caused to execute a size determination step for determining the size of the paper when printing the extracted image data based on the extracted image data.

１画像処理装置
１１読取手段
１２画像処理手段
１３操作入力手段
１４画像表示手段
１５文字認識手段
１６レイアウト認識手段
１７領域データ生成手段
１８検索キーワード入力手段
１９キーワード検索手段
２０画像印刷手段
２１画像選択手段
２２サイズ決定手段
２３画像記憶領域
２３ｇ原稿画像データ
２３ｔ抽出画像データ
２４検索データ記憶領域
２４ｋ検索キーワード
２４ｍ原稿文字データ
２４ｒ領域データ

DESCRIPTION OF SYMBOLS
1 Image processing apparatus
11 Reading means
12 Image processing means
13 Operation input means
14 Image display means
15 Character recognition means
16 Layout recognition means
17 Area data generation means
18 Search keyword input means
19 Keyword search means
20 Image printing means
21 Image selection means
22 Size determining means
23 Image storage area
23g Document image data
23t Extracted image data
24 Search data storage area
24k Search keyword
24m Document character data
24r Area data

Claims

An image processing apparatus comprising reading means for reading a placed document as document image data and image processing means for performing image processing on the document image data, the apparatus further comprising:
character recognition means for performing character recognition on the document image data to obtain document character data corresponding to characters of the document;
layout recognition means for analyzing the document image data and recognizing the layout of the document;
area data generation means for generating area data that defines an area corresponding to the layout of the document based on the recognition result of the layout recognition means;
search keyword input means for accepting input of a search keyword used for character search; and
keyword search means for searching the document character data for the search keyword,
wherein the image processing means extracts an area including the search keyword from the document image data as extracted image data, based on a search result of the keyword search means.

The image processing apparatus according to claim 1,
wherein the layout recognition means recognizes the layout by detecting a frame line of the document.

The image processing apparatus according to claim 1,
wherein the layout recognition means recognizes the layout by detecting the spacing between characters of the document.

The image processing apparatus according to claim 3,
wherein the layout recognition means recognizes the layout based on a value set in advance for the character spacing.

The image processing apparatus according to any one of claims 1 to 4, further comprising:
image display means for displaying the extracted image data; and
image selection means for selecting the extracted image data to be displayed on the image display means.

The image processing apparatus according to any one of claims 1 to 5, further comprising:
image printing means for printing the extracted image data on paper; and
size determining means for determining the size of the paper based on the extracted image data.

An image processing method for causing an image processing apparatus to execute a reading step of reading a placed document as document image data and an image processing step of performing image processing on the document image data,
A character recognition step for causing the image processing apparatus to perform character recognition on the document image data to obtain document character data corresponding to the characters of the document;
A layout recognition step for causing the image processing apparatus to analyze the document image data and recognize the layout of the document;
An area data generation step for causing the image processing apparatus to generate area data that defines an area corresponding to the layout of the document based on the recognition result of the layout recognition step;
A search keyword input step for causing the image processing apparatus to accept input of a search keyword used for character search;
A keyword search step for causing the image processing device to search for the search keyword with respect to the document character data;
wherein, in the image processing step, an area including the search keyword is extracted from the document image data as extracted image data, based on the search result of the keyword search step.

A computer program for causing an image processing apparatus to execute a reading step for causing a computer to read a placed document as document image data, and an image processing step for performing image processing on the document image data.
A character recognition step for causing a computer to perform character recognition on the document image data to obtain document character data corresponding to the characters of the document;
A layout recognition step for causing the computer to analyze the document image data and recognize the layout of the document;
An area data generation step for causing the computer to generate area data for defining an area corresponding to the layout of the document based on the recognition result of the layout recognition step;
A search keyword input step for causing a computer to accept input of a search keyword used for character search;
A keyword search step for causing the computer to search the document character data for the search keyword;
wherein, in the image processing step, an area including the search keyword is extracted from the document image data as extracted image data, based on the search result of the keyword search step.