JP4501731B2

JP4501731B2 - Image processing device

Info

Publication number: JP4501731B2
Application number: JP2005066724A
Authority: JP
Inventors: 勝也小柳; 林　　和夫; 真人齊藤; 正彦大津
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2005-03-10
Filing date: 2005-03-10
Publication date: 2010-07-14
Anticipated expiration: 2025-03-10
Also published as: JP2006253995A

Description

本発明は、例えばスキャナにて読み取った画像のドキュメントについて、サムネイル画像を生成する画像処理装置に関する。 The present invention relates to an image processing apparatus that generates a thumbnail image for an image document read by a scanner, for example.

コンピュータや、複合機等のディスク装置等に格納されたドキュメントの検索を容易にするため、ドキュメントのプレビュー画像として、いわゆるサムネイル画像を生成して表示することが広く行われている。例えば、静止画像であれば、その静止画像の表示画像の縮小画像をサムネイル画像とする。また、文書ドキュメントであれば、その先頭ページの表示画像から縮小画像を生成してサムネイルとして用いる。 In order to facilitate the search of documents stored in a disk device such as a computer or a multifunction peripheral, so-called thumbnail images are widely generated and displayed as document preview images. For example, if it is a still image, a reduced image of the display image of the still image is used as a thumbnail image. In the case of a document document, a reduced image is generated from the display image of the first page and used as a thumbnail.

特許文献１には、サムネイルに加えて、識別に使用できる付加情報を付して表示し、さらにドキュメントの検索を容易にする技術が開示されている。
特開２００１−３３７９９４号公報 Japanese Patent Application Laid-Open No. 2004-151561 discloses a technique for displaying additional information that can be used for identification in addition to thumbnails, and for facilitating document search.
JP 2001-337994 A

しかしながら、上記従来のサムネイルの生成においては、ドキュメントが絵柄のみを含むものであろうと、文字・絵柄を組み合わせたものであろうと、また文字だけであろうと、表示画像の縮小画像を生成してサムネイルとしていたので、例えば文字などがつぶれて見にくくなるなどの事情に配慮されていなかった。 However, in the above-described conventional thumbnail generation, a thumbnail image is generated by generating a reduced image of a display image regardless of whether a document includes only a picture, a combination of characters / pictures, or only a letter. For example, it was not considered due to circumstances such as characters being crushed and difficult to see.

また、特許文献１に開示されているように、付加情報を付して表示することも考えられるが、付加情報の取得が必ずしも容易でない。例えば利用者に入力させることとすれば、操作が煩雑となって利便性が低下し、また、ＯＣＲ（光学的文字認識）を用いて付加情報を抽出するとしても、その精度などの問題から必ずしも正しく文字認識が行われるとは限らない。 Further, as disclosed in Patent Document 1, it may be possible to display with additional information attached, but acquisition of the additional information is not always easy. For example, if the user inputs it, the operation becomes complicated and the convenience is reduced, and even if additional information is extracted using OCR (optical character recognition), it is not always possible due to problems such as accuracy. Character recognition is not always performed correctly.

本発明は上記実情に鑑みて為されたもので、ドキュメントにも多様な種類のものがあることに配慮して、ドキュメントの検索を容易にできる画像処理装置を提供することを、その目的の一つとする。 The present invention has been made in view of the above circumstances, and it is an object of the present invention to provide an image processing apparatus capable of easily searching a document in consideration of various types of documents. I will.

上記従来例の問題点を解決するための本発明は、処理の対象となるドキュメントについて、当該ドキュメントの縮小表示画像を生成する画像処理装置であって、前記処理の対象となったドキュメントに基づいて、互いに異なる複数の縮小表示画像を生成する生成手段と、前記生成された複数の縮小表示画像を表示する表示手段と、を含むことを特徴としている。 The present invention for solving the problems of the above conventional example is an image processing apparatus that generates a reduced display image of a document to be processed, based on the document to be processed. And generating means for generating a plurality of different reduced display images, and display means for displaying the generated reduced display images.

ここで、ドキュメントを、その内容に基づいて、予め定めた類型のいずれかに分類する分類手段をさらに含み、前記生成手段は、前記分類の結果に基づいて、生成する、複数の縮小表示画像の内容を決定し、当該決定された内容に係る複数の縮小表示画像を生成することとしてもよい。 Here, the image processing apparatus further includes a classifying unit that classifies the document into any of predetermined types based on the contents thereof, and the generation unit generates a plurality of reduced display images generated based on the classification result. The content may be determined, and a plurality of reduced display images related to the determined content may be generated.

これらにおいて、前記分類手段は、ドキュメントの内容として、ドキュメントの表示画像のうち、文字画像を含む領域の占める割合との関係において、ドキュメントを、予め定めた類型のいずれかに分類することとしてもよい。 In these, the classification means may classify the document into one of predetermined types in relation to the ratio of the area including the character image in the display image of the document as the content of the document. .

さらに、前記処理対象となるドキュメントに基づいて、キーワード文字列情報を生成する手段をさらに含んでもよい。 Furthermore, a means for generating keyword character string information based on the document to be processed may be further included.

また、本発明の一態様は、処理の対象となるドキュメントについて、当該ドキュメントの縮小表示画像を生成する画像処理方法であって、前記処理の対象となったドキュメントに基づいて、互いに異なる複数の縮小表示画像を生成する工程と、前記生成された複数の縮小表示画像を表示する工程と、を含むことを特徴としている。 According to another aspect of the present invention, there is provided an image processing method for generating a reduced display image of a document to be processed, the plurality of reductions being different from each other based on the document to be processed. The method includes a step of generating a display image and a step of displaying the generated plurality of reduced display images.

また、本発明の別の態様は、処理の対象となるドキュメントについて、当該ドキュメントの縮小表示画像を生成するプログラムであって、コンピュータに、前記処理の対象となったドキュメントに基づいて、互いに異なる複数の縮小表示画像を生成する手順と、前記生成された複数の縮小表示画像を表示する手順と、を実行させることを特徴としている。 According to another aspect of the present invention, there is provided a program for generating a reduced display image of a document to be processed, the computer having a plurality of different ones based on the document to be processed. And a procedure for generating a plurality of reduced display images generated and a procedure for displaying the generated plurality of reduced display images.

本発明の実施の形態について図面を参照しながら説明する。本発明の実施の形態に係る画像処理装置は、例えば複合機として実現でき、図１に示すように、スキャナ部１１と、制御部１２と、記憶部１３と、ストレージ部１４と、操作表示部１５と、プリンタ部１６と、通信部１７とを含んで構成される。 Embodiments of the present invention will be described with reference to the drawings. The image processing apparatus according to the embodiment of the present invention can be realized as, for example, a multifunction peripheral. As shown in FIG. 1, the scanner unit 11, the control unit 12, the storage unit 13, the storage unit 14, and the operation display unit are provided. 15, a printer unit 16, and a communication unit 17.

スキャナ部１１は、例えば紙媒体上に形成されている画像を、光学的に読み取って、画像データとして制御部１２に出力する。 The scanner unit 11 optically reads, for example, an image formed on a paper medium and outputs the image data to the control unit 12 as image data.

制御部１２は、ＣＰＵ等によって実現でき、記憶部１３に格納されているプログラムに基づいて動作する。この制御部１２は、サムネイル画像（縮小表示画像）を生成する処理や、サムネイル画像を表示する処理などを実行する。また、この制御部１２は、ドキュメントごとに割り当てたセルを配列し、各セル内に生成したサムネイル画像等を表示して、ドキュメントを選択する処理に供する。本実施の形態では、制御部１２がドキュメント１つについて、複数のサムネイル画像を生成する場合もあるが、この場合は、複数のサムネイル画像をセル内にレイアウトして表示することになる。この制御部１２の具体的な処理の内容については、後に詳しく述べる。 The control unit 12 can be realized by a CPU or the like, and operates based on a program stored in the storage unit 13. The control unit 12 executes processing for generating a thumbnail image (reduced display image), processing for displaying a thumbnail image, and the like. In addition, the control unit 12 arranges cells assigned for each document, displays thumbnail images generated in each cell, and performs processing for selecting a document. In the present embodiment, the control unit 12 may generate a plurality of thumbnail images for one document. In this case, the plurality of thumbnail images are laid out and displayed in a cell. The specific processing contents of the control unit 12 will be described in detail later.

記憶部１３は、ＲＡＭやＲＯＭ等の記憶素子等によって実現される。この記憶部１３には、制御部１２によって実行されるプログラムが格納され、記憶部１３は、制御部１２によって実行されるプログラムを保持するコンピュータ可読な記憶媒体として動作する。また、この記憶部１３は、制御部１２のワークメモリとしても動作する。 The storage unit 13 is realized by a storage element such as a RAM or a ROM. The storage unit 13 stores a program executed by the control unit 12, and the storage unit 13 operates as a computer-readable storage medium that holds the program executed by the control unit 12. The storage unit 13 also operates as a work memory for the control unit 12.

ストレージ部１４は、例えばハードディスク装置であり、画像データ等を格納する。操作表示部１５は、液晶ディスプレイ及びタッチパネル等を用いて実現される。この操作表示部１５は、タッチパネルにて行われた利用者の操作の内容を制御部１２に出力する。また、この操作表示部１５は、制御部１２から入力される指示に従って、情報を表示する。 The storage unit 14 is, for example, a hard disk device, and stores image data and the like. The operation display unit 15 is realized using a liquid crystal display, a touch panel, or the like. The operation display unit 15 outputs the content of the user operation performed on the touch panel to the control unit 12. Further, the operation display unit 15 displays information according to an instruction input from the control unit 12.

プリンタ部１６は、制御部１２から入力される指示に従って、紙媒体上に画像を形成する。通信部１７は、ネットワークインタフェース等であり、制御部１２から入力される指示に従って、ネットワークを介して、指定された宛先に対してデータを送信する。また、この通信部１７は、ネットワークを介して到来するデータを受信して、制御部１２に出力する。 The printer unit 16 forms an image on a paper medium in accordance with an instruction input from the control unit 12. The communication unit 17 is a network interface or the like, and transmits data to a specified destination via the network in accordance with an instruction input from the control unit 12. The communication unit 17 receives data that arrives via the network and outputs the data to the control unit 12.

本実施の形態においては、処理の対象となるドキュメントは、スキャナ部１１によって読み取られる画像データであるとして以下の説明を行う。尤も、実際にはこれに限られるものではなく、処理の対処となるドキュメントとしては、ネットワークを介して受信されたワードプロセッサの電子的ドキュメントや、動画像などであってもよい。 In the present embodiment, the following description will be given on the assumption that the document to be processed is image data read by the scanner unit 11. However, the present invention is not limited to this, and a document to be processed may be an electronic document of a word processor received via a network, a moving image, or the like.

次にここで、制御部１２の処理の内容について説明する。本実施の形態の制御部１２は、スキャナ部１１から画像データの入力を受けると、当該入力された画像データをストレージ部１４に格納するとともに、そのサムネイル画像を生成する処理を行う。 Next, the content of the process of the control part 12 is demonstrated here. When receiving the input of image data from the scanner unit 11, the control unit 12 according to the present embodiment stores the input image data in the storage unit 14 and performs processing for generating the thumbnail image.

すなわち、制御部１２は、入力された画像データを、処理の対象となったドキュメントとして、当該画像データに基づいて、互いに異なる複数の縮小表示画像を生成する。具体的な例としては、制御部１２は、解像度・階調数（ないし色の数）・縮小の対象となる領域等を異ならせて、複数の縮小表示画像を生成する。 That is, the control unit 12 uses the input image data as a document to be processed, and generates a plurality of different reduced display images based on the image data. As a specific example, the control unit 12 generates a plurality of reduced display images by changing the resolution, the number of gradations (or the number of colors), the area to be reduced, and the like.

本実施の形態では、制御部１２は、ドキュメントを、その内容に基づいて、予め定めた類型のいずれかに分類して、その分類の結果に基づいて、生成する、複数の縮小表示画像の内容を決定し、当該決定された内容に係る複数の縮小表示画像を生成することとする。 In the present embodiment, the control unit 12 classifies the document into one of predetermined types based on the contents thereof, and generates contents of a plurality of reduced display images generated based on the classification result. And a plurality of reduced display images related to the determined contents are generated.

つまり、本実施の形態の制御部１２によって実行される、サムネイル画像の生成のためのプログラムは、例えば機能的に、図２に示すように、表示画像生成部２１と、ドキュメント分類部２２と、サムネイル生成部２３とを含んでなる。また、ドキュメント分類部２２は、領域分離部３１と、比率演算部３２と、分類決定部３３とを含んで構成される。 That is, a program for generating thumbnail images executed by the control unit 12 according to the present embodiment is functionally, for example, as shown in FIG. 2, a display image generation unit 21, a document classification unit 22, And a thumbnail generation unit 23. The document classification unit 22 includes an area separation unit 31, a ratio calculation unit 32, and a classification determination unit 33.

ここで表示画像生成部２１は、処理対象となったドキュメントの表示画像を生成する。ここでは処理対象のドキュメントは、スキャナ部１１にて読み取られた画像データであり、表示画像は、その画像データそのものであることから、ここでは表示画像生成部２１は、処理の対象となるドキュメントとしての画像データをそのまま表示画像として、ドキュメント分類部２２と、サムネイル生成部２３とに出力する。 Here, the display image generation unit 21 generates a display image of the document to be processed. Here, the document to be processed is the image data read by the scanner unit 11 and the display image is the image data itself. Therefore, here, the display image generation unit 21 is the document to be processed. Are directly output as a display image to the document classification unit 22 and the thumbnail generation unit 23.

ドキュメント分類部２２の領域分離部３１は、広く知られた文字・絵柄分離技術を用いて、表示画像のうち、文字領域・絵柄領域を分離する。ここで文字・絵柄分離技術は、例えば、表示画像を二値化して、黒画素の連続領域のうち、予め定めたサイズ未満のものを文字、予め定めたサイズ以上のものを絵柄とするものや、ＯＣＲ等におけるレイアウト処理を利用したものなどがある。 The region separation unit 31 of the document classification unit 22 separates the character region / design region from the display image by using a well-known character / design separation technique. Here, the character / picture separation technique is, for example, binarizing a display image, and using a continuous area of black pixels having a character smaller than a predetermined size as a character and a picture having a predetermined size or larger as a picture, And those using layout processing in OCR or the like.

比率演算部３２は、領域分離部３１において分離された、各領域のうち、文字領域（すなわち文字画像を含む領域）のサイズ（面積）の合計を演算する。そして、表示画像のサイズ（面積）で、この文字領域のサイズの合計を除した値を演算し、当該演算した値（文字領域の割合の情報）を分類決定部３３に出力する。 The ratio calculation unit 32 calculates the sum of the sizes (areas) of character regions (that is, regions including character images) among the regions separated by the region separation unit 31. Then, a value obtained by dividing the total size of the character area by the size (area) of the display image is calculated, and the calculated value (character area ratio information) is output to the classification determining unit 33.

分類決定部３３は、比率演算部３２から入力される文字領域の割合の値が、設定されているしきい値を超えている場合に「文字あり」のドキュメントとして分類し、しきい値以下の場合に「絵柄主体」のドキュメントとして分類する。なお、ここではドキュメントの類型として「文字あり」または「絵柄主体」のいずれかへの分類を行うものとしているが、文字領域の割合を段階的に分けて、さらに多くの類型に分類してもよい。さらに、下地の色、絵柄領域の割合等を利用してさらに多くの類型に分類してもよい。 The classification determination unit 33 classifies the document as a “character present” document when the ratio value of the character area input from the ratio calculation unit 32 exceeds a set threshold value, and is equal to or lower than the threshold value. In some cases, it is classified as a “picture-based” document. In this example, the document type is classified as either “character present” or “picture subject”. However, the ratio of the character area is divided in stages and classified into more types. Good. Furthermore, it may be classified into more types using the background color, the ratio of the pattern area, and the like.

サムネイル生成部２３は、分類決定部３３による分類の結果に基づいて、サムネイル画像を生成する。サムネイル生成部２３は、分類決定部３３による分類の結果から、生成するサムネイル画像の種類を決定する。例えば予め、分類の結果ごとに生成するサムネイル画像の種類を関連付けたテーブルを記憶部１３に格納しておき、このテーブルを参照しながらどのサムネイル画像を生成するかを決定する。具体的な例として、「文字あり」と分類されたドキュメントについては、ドキュメントの表示画像全体を縮小した全体サムネイル画像と、文字領域の少なくとも一部の表示画像を縮小した文字部分サムネイル画像とを含む、複数のサムネイル画像を生成する。また「絵柄主体」と分類されたドキュメントについては、ドキュメントの表示画像全体を縮小した全体サムネイル画像を生成することとしておく。 The thumbnail generation unit 23 generates a thumbnail image based on the classification result by the classification determination unit 33. The thumbnail generation unit 23 determines the type of thumbnail image to be generated from the result of classification by the classification determination unit 33. For example, a table that associates the types of thumbnail images generated for each classification result is stored in the storage unit 13 in advance, and it is determined which thumbnail image to generate while referring to this table. As a specific example, a document classified as “with text” includes an entire thumbnail image obtained by reducing the entire display image of the document, and a character partial thumbnail image obtained by reducing at least a part of the display image of the character area. A plurality of thumbnail images are generated. For a document classified as “picture subject”, an entire thumbnail image obtained by reducing the entire display image of the document is generated.

本実施の形態の画像処理装置では、制御部１２が、ドキュメントごとに割り当てた矩形のセルを、図３に示すようにマトリクス状に配列し、各セル内に生成したサムネイル画像等を表示して、ドキュメントを選択する処理に供する。複数のサムネイル画像が生成されたドキュメントについては、当該ドキュメントに対応するセル内に、複数のサムネイル画像をレイアウトして表示することとなる。そこで、サムネイル生成部２３は、「文字あり」と分類されたドキュメント（複数のサムネイル画像を生成するドキュメント）については、全体サムネイル画像と文字部分サムネイル画像との双方のサイズを、セルのサイズに基づいて決定する。また、各セルには、セル内に表示されているサムネイルに係るドキュメントの生成・修正日時の情報や、ファイル名などを表示する欄（日付等表示欄）が関連づけられていてもよい。 In the image processing apparatus according to the present embodiment, the control unit 12 arranges rectangular cells assigned to each document in a matrix as shown in FIG. 3, and displays thumbnail images and the like generated in each cell. , Used to select a document. For a document in which a plurality of thumbnail images are generated, a plurality of thumbnail images are laid out and displayed in a cell corresponding to the document. Therefore, for the documents classified as “characters present” (documents for generating a plurality of thumbnail images), the thumbnail generation unit 23 determines the sizes of both the entire thumbnail image and the character partial thumbnail image based on the cell size. To decide. Also, each cell may be associated with a column (date display column) for displaying information on the generation / modification date and time of the document related to the thumbnail displayed in the cell, a file name, and the like.

例えば、サムネイル生成部２３は、セルサイズＳｘ×Ｓｙのとき、「文字あり」と分類されたドキュメントについては、全体サムネイル画像についてＳｘ／２×Ｓｙ／２、文字部分サムネイル画像についてＳｘ／２×Ｓｙ／２及びＳｘ×Ｓｙ／２の合計３つのサイズを決定する。また、「絵柄主体」と分類されたドキュメントについては、全体サムネイル画像のサイズとしてＳｘ×Ｓｙとする。 For example, when the cell size is Sx × Sy, for the document classified as “character present”, the thumbnail generation unit 23 performs Sx / 2 × Sy / 2 for the entire thumbnail image and Sx / 2 × Sy for the character partial thumbnail image. A total of three sizes of / 2 and Sx × Sy / 2 are determined. For a document classified as “picture subject”, the size of the entire thumbnail image is Sx × Sy.

そしてサムネイル生成部２３は、決定したサイズの全体サムネイル画像を生成する。この全体サムネイル画像は、従来のものと同様である。また、ドキュメントが「文字あり」と分類されている場合は、サイズＳｘ／２×Ｓｙ／２の文字部分サムネイル画像と、サイズＳｘ×Ｓｙ／２の文字部分サムネイル画像とを生成する。 Then, the thumbnail generation unit 23 generates an entire thumbnail image of the determined size. This whole thumbnail image is the same as the conventional one. If the document is classified as “with text”, a character part thumbnail image of size Sx / 2 × Sy / 2 and a character part thumbnail image of size Sx × Sy / 2 are generated.

ここで文字部分サムネイル画像を生成する方法例について述べる。サムネイル生成部２３は、まず、ドキュメント分類部２２の領域分離部３１から、文字領域の情報の入力を受け、処理の対象となったドキュメントの表示画像（ここでは入力された画像データそのもの）のうち、文字領域の情報によって示される領域部分を抽出する。そして、当該抽出した部分の画像（文字部分画像）を、予め定めた解像度（文字の判読が可能とされる程度の解像度、例えば２００ｄｐｉ程度であればよい）の画像に変換する。またこの画像の階調数を変換（例えば二値化）してもよい。サムネイル生成部２３は、さらに、当該変換後の文字部分画像のうち、予め定められた条件に基づいて、決定されたサイズＳｘ／２×Ｓｙ／２の第１領域と、サイズＳｘ×Ｓｙ／２の第２領域とを切り出す。 Here, an example of a method for generating a character partial thumbnail image will be described. First, the thumbnail generation unit 23 receives input of character region information from the region separation unit 31 of the document classification unit 22, and among the display images of the document to be processed (here, the input image data itself). The region portion indicated by the character region information is extracted. Then, the extracted image of the portion (character partial image) is converted into an image having a predetermined resolution (a resolution that enables reading of the character, for example, about 200 dpi). Further, the number of gradations of this image may be converted (for example, binarized). The thumbnail generation unit 23 further includes a first area having a size Sx / 2 × Sy / 2 determined based on a predetermined condition in the converted character partial image, and a size Sx × Sy / 2. The second region is cut out.

ここで予め定められた条件とは、例えば文字に対応する各連続画素について、外接矩形を画定し、画定した各外接矩形のうち、サイズの大きい外接矩形が含まれている部分を抽出する（つまり、比較的大きい文字で記述されている部分を抽出する）との条件や、文字部分画像のうち、左上隅の部分（つまり、文頭と考えられる部分）を抽出するとの条件などがある。 Here, the predetermined condition is, for example, that a circumscribed rectangle is defined for each continuous pixel corresponding to a character, and a portion including a circumscribed rectangle having a large size is extracted from each of the defined circumscribed rectangles (that is, And a condition for extracting the upper left corner portion (that is, the portion considered to be the beginning of a sentence) of the character partial image.

また、文字部分画像を操作表示部１５に表示し、利用者に決定されたサイズＳｘ／２×Ｓｙ／２の第１領域と、サイズＳｘ×Ｓｙ／２の第２領域とを指定させ、当該指定されたことを条件として、当該条件により決定されたサイズＳｘ／２×Ｓｙ／２の第１領域と、サイズＳｘ×Ｓｙ／２の第２領域とを切り出すこととしてもよい。 In addition, the character partial image is displayed on the operation display unit 15 and the user is allowed to specify the first area of the size Sx / 2 × Sy / 2 determined and the second area of the size Sx × Sy / 2. On the condition that it is specified, the first area of size Sx / 2 × Sy / 2 determined by the condition and the second area of size Sx × Sy / 2 may be cut out.

さらにサムネイル生成部２３は、全体サムネイル画像上に、ここで切り出した第１領域及び第２領域に対応する部分を強調する枠線（Ｌ１，Ｌ２）などを描画してもよい（図４）。ここでサムネイル生成部２３が、文字領域画像のうちから切り出した第１領域及び第２領域のそれぞれの画像が、文字部分サムネイル画像となる。 Furthermore, the thumbnail generation unit 23 may draw frame lines (L1, L2) that emphasize portions corresponding to the first area and the second area cut out on the entire thumbnail image (FIG. 4). Here, each image of the first area and the second area cut out from the character area image by the thumbnail generation unit 23 becomes a character partial thumbnail image.

サムネイル生成部２３は、こうして生成したサムネイル画像（「文字あり」のドキュメントについて全体サムネイル画像１つと文字部分サムネイル画像２つ、「絵柄主体」のドキュメントについて全体サムネイル画像１つ）を、処理の対象となったドキュメントに関連付けて、記憶部１３に格納する。 The thumbnail generation unit 23 processes the thumbnail images thus generated (one overall thumbnail image and two character partial thumbnail images for a document “with text”, and one overall thumbnail image for a “picture-based” document) as processing targets. The data is stored in the storage unit 13 in association with the document.

また、制御部１２は、処理対象となったドキュメントについて、上記サムネイル生成部２３が生成した文字部分画像（変換前のものであっても、変換後のものであってもよい）に対してＯＣＲ処理を行い、その結果として得られた文字列を、キーワード文字列情報として、処理の対象となったドキュメントに関連付けて、記憶部１３に格納してもよい。 In addition, the control unit 12 performs OCR on the character partial image generated by the thumbnail generation unit 23 (either before conversion or after conversion) for the document to be processed. Processing may be performed, and a character string obtained as a result may be stored in the storage unit 13 as keyword character string information in association with a document to be processed.

さらに制御部１２は、処理対象となったドキュメントについて、利用者に対してキーワードの入力を促す表示を操作表示部１５にて行い、利用者からキーワードが入力された場合には、当該キーワードの文字列を、キーワード文字列情報として、処理の対象となったドキュメントに関連付けて、記憶部１３に格納してもよい。 Further, the control unit 12 performs a display for prompting the user to input a keyword for the document to be processed on the operation display unit 15, and when the keyword is input from the user, the character of the keyword is displayed. The column may be stored as keyword character string information in the storage unit 13 in association with the document to be processed.

さらに制御部１２は、上記処理におけるドキュメントの分類結果を、処理の対象となったドキュメントに関連付けて、記憶部１３に格納してもよい。 Further, the control unit 12 may store the document classification result in the above processing in the storage unit 13 in association with the document to be processed.

以上の制御部１２の処理により、記憶部１３には、図５に示すように、例えば、ドキュメントの本体の情報（Ｉ）と、当該ドキュメントの分類結果の情報（Ｃ）と、生成されたサムネイル画像（Ｐ）と、キーワード文字列情報（Ｋ）とが互いに関連付けて、ドキュメントデータベースとして格納されているようになる。 As a result of the above-described processing of the control unit 12, the storage unit 13 stores, for example, document body information (I), document classification result information (C), and generated thumbnails, as shown in FIG. The image (P) and the keyword character string information (K) are associated with each other and stored as a document database.

制御部１２は、ドキュメントの一覧を表示する場合に、このドキュメントデータベースを参照する。以下、制御部１２におけるドキュメント一覧の表示処理の例について説明する。制御部１２は、利用者から、記憶部１３に格納されているドキュメントのうちから、例えば印刷の対象となるドキュメントを選択する旨の指示を受けて、ドキュメントの一覧表示の処理を開始する。 The control unit 12 refers to the document database when displaying a list of documents. Hereinafter, an example of document list display processing in the control unit 12 will be described. In response to an instruction from the user to select, for example, a document to be printed from among the documents stored in the storage unit 13, the control unit 12 starts a document list display process.

制御部１２は、まず、各ドキュメントのサムネイル画像を表示するためのセル領域の座標情報を生成する。このセル領域の座標情報は、例えば、各セルのサイズＳｘ×Ｓｙの矩形領域をマトリクス状に配列して得られるものである。つまり、最初のセルの座標情報が（０，０）から（Ｓｘ，Ｓｙ）まで、次のセルの座標情報が（ラインの方向に配列するものとして）、（Ｓｘ＋１、０）から（２Ｓｘ＋１，Ｓｙ）まで…といったようになり、操作表示部１５の表示可能領域に応じて、次のラインに移り、（０，Ｓｙ＋１）から（Ｓｘ，２Ｓｙ＋１）まで…というように次のラインにおいても同様に配列を行う。以下、すべてのドキュメントが配列されるまで、ラインごとの配列を繰り返すことになる。 First, the control unit 12 generates coordinate information of a cell area for displaying a thumbnail image of each document. The coordinate information of the cell area is obtained, for example, by arranging rectangular areas of each cell size Sx × Sy in a matrix. That is, the coordinate information of the first cell is (0,0) to (Sx, Sy), the coordinate information of the next cell (assuming that they are arranged in the line direction), (Sx + 1,0) to (2Sx + 1, Sy) ), And so on, depending on the displayable area of the operation display unit 15, the next line is moved, and from (0, Sy + 1) to (Sx, 2Sy + 1). I do. Thereafter, the arrangement for each line is repeated until all documents are arranged.

また制御部１２は、記憶部１３に格納されているドキュメントデータベースを参照し、ドキュメントデータベースに格納されている各ドキュメントを順次、注目ドキュメントとして選択しながら、次の処理を行う。 The control unit 12 refers to the document database stored in the storage unit 13 and performs the following process while sequentially selecting each document stored in the document database as a document of interest.

まず注目ドキュメントに係る分類結果の情報（Ｃ）を参照する。そして、分類結果が「絵柄主体」である場合には、全体サムネイルを一つ表示することとして、生成されている、注目ドキュメントのサムネイル画像（Ｐ）を読み出す。そして現在のセル（次にドキュメントのサムネイル画像を配置するべきセル）の座標情報を取得し、当該座標情報によって画定される領域内に、サムネイル画像（Ｐ）を表示する。そして、次のドキュメントを注目ドキュメントとして選択して処理を続ける。 First, the information (C) of the classification result relating to the document of interest is referred to. When the classification result is “picture subject”, the thumbnail image (P) of the generated document of interest is read out by displaying one whole thumbnail. Then, the coordinate information of the current cell (the cell in which the thumbnail image of the document is to be placed next) is acquired, and the thumbnail image (P) is displayed in the area defined by the coordinate information. Then, the next document is selected as the document of interest and the process is continued.

また、取得した分類結果が「文字あり」である場合は、キーワード文字列情報の文字列の長さの情報を取得する。ここで当該文字列の長さが、予め定めた文字列長しきい値未満である場合は、全体サムネイルと、文字部分サムネイルとの複数のサムネイル画像を表示することとして、注目ドキュメントについて生成されている、複数のサムネイル画像（Ｐ）を読み出す。そして現在のセル（次にドキュメントのサムネイル画像を配置するべきセル）の座標情報を取得し、当該座標情報によって画定される領域内に、複数のサムネイル画像（Ｐ）をレイアウトして表示する（その例として図６（ａ）を参照）。そして、次のドキュメントを注目ドキュメントとして選択して処理を続ける。 Further, when the obtained classification result is “with character”, information on the length of the character string of the keyword character string information is obtained. Here, when the length of the character string is less than a predetermined character string length threshold, a plurality of thumbnail images of the whole thumbnail and the character partial thumbnail are displayed and generated for the document of interest. A plurality of thumbnail images (P) are read out. Then, the coordinate information of the current cell (the cell in which the thumbnail image of the document is to be placed next) is acquired, and a plurality of thumbnail images (P) are laid out and displayed within the area defined by the coordinate information (then See FIG. 6 (a) as an example). Then, the next document is selected as the document of interest and the process is continued.

また、取得したキーワード文字列情報の文字列の長さが、予め定めた文字列長しきい値以上である場合は、全体サムネイルと、文字部分サムネイルとの複数のサムネイル画像、並びにキーワード文字列情報を表示することとして、注目ドキュメントについて生成されている、複数のサムネイル画像（Ｐ）とキーワード文字列情報（Ｋ）とを読み出す。そして現在のセル（次にドキュメントのサムネイル画像を配置するべきセル）の座標情報を取得し、当該座標情報によって画定される領域内に、複数のサムネイル画像（Ｐ）とキーワード文字列情報（Ｋ）とをレイアウトして表示する（その例として図６（ｂ）を参照）。そして、次のドキュメントを注目ドキュメントとして選択して処理を続ける。 Further, when the length of the character string of the acquired keyword character string information is equal to or greater than a predetermined character string length threshold, a plurality of thumbnail images of the whole thumbnail and the character partial thumbnail, and the keyword character string information Are displayed, a plurality of thumbnail images (P) and keyword character string information (K) generated for the document of interest are read. Then, the coordinate information of the current cell (the cell in which the document thumbnail image is to be placed next) is acquired, and a plurality of thumbnail images (P) and keyword character string information (K) are included in the area defined by the coordinate information. Are laid out and displayed (see FIG. 6B as an example). Then, the next document is selected as the document of interest and the process is continued.

制御部１２は、これらの処理を、すべてのドキュメントについてのサムネイル画像が配列されるまで繰り返して行う。利用者は、各ドキュメントについて、単数、又は複数表示されているサムネイル画像や、キーワード文字列情報を参照しながら、印刷の対象となるドキュメントを選択する。この選択操作は、例えば選択の対象となるセルの座標の領域をタッチパネルの操作で選択することによって行うことができる。 The control unit 12 repeats these processes until thumbnail images for all documents are arranged. For each document, the user selects a document to be printed while referring to one or more displayed thumbnail images and keyword character string information. This selection operation can be performed, for example, by selecting a coordinate area of a cell to be selected by an operation on the touch panel.

制御部１２は、利用者がドキュメントの一つを選択する指示操作を行うと、ドキュメントデータベースを参照して、選択されたドキュメントの本体の情報（Ｉ）を読み出し、プリンタ部１６に出力して印刷を行わせる。 When the user performs an instruction operation to select one of the documents, the control unit 12 refers to the document database, reads information (I) on the body of the selected document, and outputs the information to the printer unit 16 for printing. To do.

なお、ここでは、「文字あり」と分類されたドキュメントについてのみ、キーワード文字列の情報を表示することとしているが、これに限られず、例えば「絵柄主体」と分類されたドキュメントについてもキーワード文字列の情報を表示することとしてもよい。 Here, the keyword character string information is displayed only for documents classified as “character present”. However, the present invention is not limited to this. For example, the keyword character string is also displayed for documents classified as “picture-based”. It is good also as displaying the information of.

またここまでの説明において、制御部１２は、ドキュメントの表示画像全体に占める文字部分の割合に基づいてドキュメントの分類を行うこととしていたが、例えば、パンフレットなどにおいては、絵柄が主体的でありながら、文字部分が重要な情報を有している場合もあり得る。そこで、制御部１２のドキュメント分類部２２の処理においては、ドキュメントの表示画像を予め、例えば上側半分と、下側半分と、といったように所定のブロックに分割した上で、各ブロックごとに文字部分の領域が占める割合を演算し、いずれかのブロックにおいて、文字部分の領域が占める割合が、予め定めたしきい値を超えている場合には、「文字あり」と分類することとしてもよい。これにより、例えばドキュメントの上半分が絵柄で占められているが、下半分に重要な文字情報が含まれているといった場合にも、当該文字情報に係る文字部分サムネイル画像が生成されて表示され、利用者のドキュメントの識別に資することができる。 In the description so far, the control unit 12 classifies the document based on the ratio of the character portion in the entire display image of the document. For example, in a pamphlet or the like, the pattern is dominant. The character part may have important information. Therefore, in the processing of the document classification unit 22 of the control unit 12, the display image of the document is divided into predetermined blocks in advance, for example, the upper half and the lower half, and the character portion for each block. The ratio of the area occupied by the area may be calculated, and if the ratio occupied by the area of the character portion in any block exceeds a predetermined threshold value, it may be classified as “character present”. Thereby, for example, when the upper half of the document is occupied by a picture, but important character information is included in the lower half, a character part thumbnail image related to the character information is generated and displayed, It can contribute to the identification of the user's document.

さらにドキュメントデータベースにおいて、各ドキュメントにアクセス権の情報を関連付けてもよい。ここでアクセス権としては、例えばドキュメントにアクセス可能な利用者名を列挙して関連付けることとすればよい。 Further, in the document database, access right information may be associated with each document. Here, as the access right, for example, user names that can access the document may be listed and associated.

この場合、制御部１２は、ドキュメントの一覧を表示する際に、利用者に対して利用者名やパスワードの入力を求めて、利用者の認証を行い、認証した利用者の利用者名が、アクセス可能として列挙されたリストに含まれていないドキュメントについては、そのサムネイル画像を表示しないように制御してもよい。この場合、セル内に「アクセス権なし」といった文字列を表示してもよいし、そもそも当該ドキュメントについてはサムネイル画像の表示に関する処理を行わずに次のドキュメントについての処理に移行してもよい。 In this case, when displaying the list of documents, the control unit 12 requests the user to input a user name and password, authenticates the user, and the user name of the authenticated user is You may control not to display the thumbnail image about the document which is not contained in the list enumerated as accessible. In this case, a character string such as “no access right” may be displayed in the cell, or the process may be shifted to the process for the next document without performing the process for displaying the thumbnail image.

本実施の形態によると、必要に応じて複数のサムネイル画像を生成・表示し、利用者に提示することとしているので、多様な種類のドキュメントに配慮した処理を行うことができ、ドキュメントの検索を容易にできる。 According to the present embodiment, a plurality of thumbnail images are generated and displayed as required and presented to the user, so that various types of documents can be processed, and documents can be searched. Easy to do.

本発明の実施の形態に係る画像処理装置の一例を表す構成ブロック図である。1 is a configuration block diagram illustrating an example of an image processing apparatus according to an embodiment of the present invention. 本発明の実施の形態に係る画像処理装置の処理の例を表す機能ブロック図である。It is a functional block diagram showing the example of a process of the image processing apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る画像処理装置によって提示されるドキュメントの一覧の例を表す説明図である。It is explanatory drawing showing the example of the list of the documents shown by the image processing apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る画像処理装置によって提示されるサムネイル画像の例を表す説明図である。It is explanatory drawing showing the example of the thumbnail image shown by the image processing apparatus which concerns on embodiment of this invention. ドキュメントに関する情報の例を表す説明図である。It is explanatory drawing showing the example of the information regarding a document. 本発明の実施の形態に係る画像処理装置によって提示されるドキュメントの一覧の例を表す説明図である。It is explanatory drawing showing the example of the list of the documents shown by the image processing apparatus which concerns on embodiment of this invention.

Explanation of symbols

１１スキャナ部、１２制御部、１３記憶部、１４ストレージ部、１５操作表示部、１６プリンタ部、１７通信部、２１表示画像生成部、２２ドキュメント分類部、２３サムネイル生成部、３１領域分離部、３２比率演算部、３３分類決定部。
DESCRIPTION OF SYMBOLS 11 Scanner part, 12 Control part, 13 Storage part, 14 Storage part, 15 Operation display part, 16 Printer part, 17 Communication part, 21 Display image generation part, 22 Document classification part, 23 Thumbnail generation part, 31 Area separation part, 32 Ratio calculation part, 33 Classification determination part.

Claims

Classification means for classifying each document to be processed into one of predetermined types based on the contents thereof,
Based on the results of the classification, the contents of the thumbnail images generated determined for each document, a generation unit which had One each document as the object of processing, and generates the thumbnail image according to the content the determined ,
Display means for displaying a reduced display image generated for each document ;
An image processing apparatus comprising:

The image processing apparatus according to claim 1 ,
The classification means classifies the document into one of predetermined types based on the ratio of the area including the character image in the display image of the document as the content of the document. .

An image processing apparatus according to claim 1 or 2,
An image processing apparatus, further comprising means for generating keyword character string information based on the document to be processed.

For each document to be processed, a step of classifying the document into one of predetermined types based on the contents thereof;
Based on the results of the classification, the contents of the thumbnail images generated determined for each document, a generation step of have One each document as the object of processing, and generates the thumbnail image according to the content the determined ,
Displaying a reduced display image generated for each document ;
An image processing method comprising:

Computer
Classification means for classifying each document to be processed into one of predetermined types based on the contents thereof,
Based on the results of the classification, the contents of the thumbnail images generated determined for each document, a generation unit which had One each document as the object of processing, and generates the thumbnail image according to the content the determined ,
Display means for displaying a reduced display image generated for each document ;
A program characterized by functioning as