JP5312310B2

JP5312310B2 - SEARCH DEVICE, METHOD, AND PROGRAM

Info

Publication number: JP5312310B2
Application number: JP2009289787A
Authority: JP
Inventors: 章裕宮田; 考藤村
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc
Current assignee: Nippon Telegraph and Telephone Corp; NTT Inc
Priority date: 2009-12-21
Filing date: 2009-12-21
Publication date: 2013-10-09
Anticipated expiration: 2029-12-21
Also published as: JP2011129070A

Description

本発明は、検索装置及び方法及びプログラムに係り、特に、画像と文字列のレイアウトが確定しているドキュメント内の領域を検索クエリとして、該領域が出現するドキュメント及びドキュメント内の各位置のインデックスを作成し、検索を実行する検索装置及び方法及びプログラムに関する。 The present invention relates to a search apparatus, method, and program, and in particular, uses a region in a document in which the layout of an image and a character string is determined as a search query, and an index of the document in which the region appears and each position in the document. The present invention relates to a search apparatus, method, and program for creating and executing a search.

特に、画像と文字列のレイアウトが確定しているドキュメント内の領域を含む可能性があるドキュメント及びドキュメント内における位置を網羅意的に取得するのではなく、一意に特定したい場合に適用される検索装置及び方法及びプログラムに関する。 In particular, a search that is applied when you want to uniquely identify a document that may contain an area in the document where the layout of images and character strings has been determined, and the position within the document, not exhaustively. The present invention relates to an apparatus, a method, and a program.

ドキュメントの一部領域から、該領域がどのドキュメントに含まれているか、あるいは、どのドキュメントのどの位置に含まれているか一意に特定することが必要なシーンは少なくない。 There are not a few scenes in which it is necessary to uniquely identify from which document a part of a document is included in which document or at which position in which document.

例えば、手元にカタログの切り抜きがある場合、切り抜いた元のカタログを探して、切り抜きの続きを読みたいことがある。この場合、当該切り抜きがどのカタログの一部であったか一意に特定できる必要がある。 For example, if you have a catalog cutout at hand, you may want to find the original cutout catalog and read the continuation of the cutout. In this case, it is necessary to be able to uniquely identify which catalog the cutout is part of.

上記の事例は、ドキュメントの一部領域をクエリとし、膨大な量のドキュメント群の中から、該領域を含むドキュメント名、あるいはドキュメント名及びドキュメントにおける位置を問い合わせる検索システムと捉えることができる。 The above example can be regarded as a search system that uses a partial area of a document as a query and inquires about a document name including the area or a document name and a position in the document from a huge amount of documents.

上記の検索システムを実現する一例として、予めドキュメント内の各画像の全体領域の色分布情報をデータベースに格納しておき、ドキュメント内にある画像の全体領域の色分布情報を検索クエリとし、該画像と色分布情報が一致する画像を含むドキュメント名、あるいは、ドキュメント名及びドキュメントにおける位置をデータベースに問い合わせる検索システムが考えられる（例えば、非特許文献１参照）。 As an example for realizing the above search system, the color distribution information of the entire area of each image in the document is stored in a database in advance, and the color distribution information of the entire area of the image in the document is used as a search query. A search system that inquires a database for the name of a document including an image whose color distribution information matches the document name, or the position of the document in the document (for example, see Non-Patent Document 1).

あるいは、各画像の部分領域のマッチングのみで検索を行うシステムもある。例えば、予めドキュメント内の各画像を既定数の複数の部分領域として分割し、各部分領域の色分布情報をデータベースに格納しておく。次に、ドキュメント内のある画像を既定数の複数の部分領域として分割し、各部分領域の色分布情報を検索クエリとし、該部分領域と色分布情報が一致する部分領域を含むドキュメント名、あるいは、ドキュメント名及びドキュメントにおける位置をデータベースに問い合わせる検索システムが考えられる（例えば、非特許文献２参照）。 Alternatively, there is a system that performs a search only by matching a partial area of each image. For example, each image in the document is previously divided into a predetermined number of partial areas, and color distribution information of each partial area is stored in a database. Next, an image in the document is divided into a predetermined number of partial areas, and the color distribution information of each partial area is used as a search query, and the document name including the partial area where the partial area matches the color distribution information, or A search system that inquires a database for a document name and a position in the document can be considered (for example, see Non-Patent Document 2).

M. J. Swain and D. H. Ballard, "Color Indexing," International Journal of Computer Vision, Vol. 7, pp. 11-32, 1991M. J. Swain and D. H. Ballard, "Color Indexing," International Journal of Computer Vision, Vol. 7, pp. 11-32, 1991 WALRUS: A Similarity Retrieval Algorithm for Image Databases, Apostol Natsev, Rajeev Rastogi and Kyuseok Shim, IEEE Transaction on Knowledge and Data Engineering, Vol. 16, No.3, pp. 301-316, 2004.WALRUS: A Similarity Retrieval Algorithm for Image Databases, Apostol Natsev, Rajeev Rastogi and Kyuseok Shim, IEEE Transaction on Knowledge and Data Engineering, Vol. 16, No. 3, pp. 301-316, 2004.

各画像の全体領域の色分布情報をデータベースに格納して、ある画像の全体領域の色分布情報を検索クエリとする方法は、データベース内の情報と検索クエリのマッチング処理がシンプルである一方、画像の全体領域を検索クエリに含めなければならないという問題がある。 The method of storing the color distribution information of the entire area of each image in the database and using the color distribution information of the entire area of a certain image as a search query is simple while matching the information in the database and the search query, There is a problem that the entire area of must be included in the search query.

一方、各画像の部分領域の色分布情報をデータベースに格納して、ある画像の部分領域の色分布情報を検索クエリとする方法は、画像の全体領域を検索クエリに含めなくてもよい一方、データベース格納時及び検索実行時に画像を複数領域の部分領域に分割し、データベース内及び検索クエリの部分領域同士で色分布情報のマッチングをとるため、マッチングをとる回数が大幅に増加するという問題がある。 On the other hand, the method of storing the color distribution information of the partial area of each image in the database and using the color distribution information of the partial area of a certain image as a search query may not include the entire area of the image in the search query. There is a problem that the number of times of matching is greatly increased because the image is divided into a plurality of partial areas at the time of database storage and search execution, and color distribution information is matched between the partial areas of the database and the search query. .

本発明は、上記の点に鑑みなされたもので、撮影対象に対してインデックスを付与する際に、正しい検索結果が得られる確率が向上し、マッチングに要する計算コストの削減が可能な検索装置及び方法及びプログラムを提供することを目的とする。 The present invention has been made in view of the above points, and a search apparatus capable of improving the probability that a correct search result is obtained when an index is given to a shooting target and reducing the calculation cost required for matching, and An object is to provide a method and a program.

図１は、本発明の原理構成図である。 FIG. 1 is a principle configuration diagram of the present invention.

本発明（請求項１）は、改ページや改行位置が確定しており、文字列を含むドキュメント内の、文字列を含む一部領域を検索クエリとして、該領域が出現するドキュメント及び該ドキュメント内における位置を取得する検索要求に応えるための検索インデックスを作成し、検索を行う検索装置であって、
インデックス作成対象のドキュメント群の入力を受け付けるドキュメント群入力手段１０と、
文字領域抽出方法を用いてドキュメント内の文字領域を特定する文字領域特定手段１１と、
各文字領域の周囲に存在する領域である周辺領域を特定する周辺領域特定手段１２と、
画像特徴比較手法を用いて、各周辺領域とドキュメントの背景領域を比較し、該周辺領域の中で該背景領域と異なるものを部分画像領域と判定する部分画像特定手段１３と、
文字領域と、該文字領域の周囲にある部分画像領域と、該文字列が出現するドキュメントにおける出現位置を関連付けてインデックスＤＢ１５に出力するインデックス出力手段１４と、を有する。 According to the present invention (Claim 1), a page break or a line feed position is fixed, and a document including the character string in a document including the character string is used as a search query, and the document in which the region appears A search device for creating a search index for responding to a search request to acquire a position in
A document group input means 10 for receiving an input of a document group to be indexed;
Character area specifying means 11 for specifying a character area in a document using a character area extraction method;
Peripheral area specifying means 12 for specifying a peripheral area that is an area existing around each character area;
A partial image specifying means 13 for comparing each peripheral area with a background area of a document using an image feature comparison method, and determining a part of the peripheral area different from the background area as a partial image area;
A character region, a partial image region around the character region, and an index output unit 14 that associates an appearance position in a document in which the character string appears and outputs it to the index DB 15.

また、本発明（請求項２）は、請求項１の検索装置において、
ドキュメントは、画像も含み、
ドキュメント内の、画像及び文字列を含む一部領域を検索クエリとする。 Further, the present invention (Claim 2) is the search device according to Claim 1,
The document also includes images,
A partial area including an image and a character string in the document is set as a search query.

また、本発明（請求項３）は、請求項１または２の検索装置において、
画像と文字列を含むドキュメントの一部領域を撮影した画像の入力を受け付ける検索対象画像入力手段と、
文字領域抽出方法を用いて検索対象画像内の文字領域を特定する検索対象文字領域特定手段と、
各文字領域の周囲に存在する領域である周辺領域を特定する検索対象周辺領域特定手段と、
画像特定比較手法を用いて、各周辺領域と検索対象画像の背景領域を比較し、周辺領域の中で背景領域と異なるものを部分画像領域と判定する検索対象部分画像特定手段と、
文字領域と、該文字領域の周囲にある部分画像領域が出現するドキュメントまたはドキュメント内における位置を、インデックスＤＢに問い合わせる問い合わせ手段と、
問い合わせの結果を出力する検索結果出力手段と、を更に有する。 Further, the present invention (Claim 3) is the search device according to Claim 1 or 2,
A search target image input means for receiving an input of an image obtained by photographing a partial area of a document including an image and a character string;
Search target character area specifying means for specifying a character area in a search target image using a character area extraction method;
Search target peripheral area specifying means for specifying a peripheral area that is an area existing around each character area;
A search target partial image specifying unit that compares each peripheral region with a background region of the search target image using an image specific comparison method, and determines a partial image region that is different from the background region in the peripheral region;
Query means for querying the index DB for the character area and the document in which the partial image area around the character area appears or the position in the document;
Search result output means for outputting the result of the inquiry.

また、本発明（請求項４）は、請求項３の検索装置において、
問い合わせ手段は、
事前に各ドキュメントの位置とコンテンツを関連付けたデータベースにアクセスし、検索結果であるドキュメント位置に関連付けられたコンテンツを検索結果として得る手段を含む。 Further, the present invention (Claim 4) is the search device according to Claim 3,
Inquiry means are
Means for accessing a database in which the position and content of each document are associated in advance and obtaining the content associated with the document position as a search result as a search result;

図２は、本発明の原理を説明するための図である。 FIG. 2 is a diagram for explaining the principle of the present invention.

本発明（請求項５）は、改ページや改行位置が確定しており、文字列を含むドキュメント内の、文字列を含む一部領域を検索クエリとして、該領域が出現するドキュメント及び該ドキュメント内における位置を取得する検索要求に応えるための検索インデックスを作成し、検索を行う装置における検索方法であって、
ドキュメント群入力手段が、インデックス作成対象のドキュメント群の入力を受け付けるドキュメント群入力ステップ（ステップ１）と、
文字領域特定手段は、文字領域抽出方法を用いてドキュメント内の文字領域を特定する文字領域特定ステップ（ステップ２）と、
周辺領域特定手段が、各文字領域の周囲に存在する領域である周辺領域を特定する周辺領域特定ステップ（ステップ３）と、
部分画像特定手段が、画像特徴比較手法を用いて、各周辺領域とドキュメントの背景領域を比較し、該周辺領域の中で該背景領域と異なるものを部分画像領域と判定する部分画像特定ステップ（ステップ４）と、
インデックス出力手段が、文字領域と、該文字領域の周囲にある部分画像領域と、該文字列が出現するドキュメントにおける出現位置を関連付けてインデックスＤＢに出力するインデックス出力ステップ（ステップ５）と、を行う。 According to the present invention (Claim 5), a page break or a line feed position is fixed, and a document including the character string in a document including the character string is used as a search query, and the document in which the region appears and the document A search method for creating a search index for responding to a search request for acquiring a position in a device for performing a search,
A document group input step (step 1) in which the document group input means receives an input of a document group to be indexed;
A character area specifying means specifies a character area in the document using a character area extraction method (step 2),
A peripheral area specifying step (step 3) in which the peripheral area specifying means specifies a peripheral area that is an area existing around each character area;
The partial image specifying unit compares each peripheral area with the background area of the document using an image feature comparison method, and determines a partial image area that is different from the background area as a partial image area. Step 4) and
The index output means performs an index output step (step 5) of associating the character region, the partial image region around the character region, and the appearance position in the document in which the character string appears in association with the index DB. .

また、本発明（請求項６）は、請求項５の検索方法において、
ドキュメントは、画像も含み、
ドキュメント内の、画像及び文字列を含む一部領域を検索クエリとする。 Further, the present invention (Claim 6) is the search method of Claim 5,
The document also includes images,
A partial area including an image and a character string in the document is set as a search query.

また、本発明（請求項７）は、請求項５または６の検索方法において、
検索対象画像入力手段が、画像と文字列を含むドキュメントの一部領域を撮影した画像の入力を受け付ける検索対象画像入力ステップと、
検索対象文字領域特定手段が、文字領域抽出方法を用いて検索対象画像内の文字領域を特定する検索対象文字領域特定ステップと、
検索対象周辺領域特定手段が、各文字領域の周囲に存在する領域である周辺領域を特定する検索対象周辺領域特定ステップと、
検索対象部分画像特定手段が、画像特定比較手法を用いて、各周辺領域と検索対象画像の背景領域を比較し、周辺領域の中で背景領域と異なるものを部分画像領域と判定する検索対象部分画像特定ステップと、
問い合わせ手段が、文字領域と、該文字領域の周囲にある部分画像領域が出現するドキュメントまたはドキュメント内における位置を、インデックスＤＢに問い合わせる問い合わせステップと、
検索結果出力手段が、問い合わせの結果を出力する検索結果出力ステップと、を更に行う。 Further, the present invention (Claim 7) is the search method according to Claim 5 or 6,
A search target image input unit for receiving an input of an image obtained by photographing a partial region of a document including an image and a character string;
A search target character area specifying unit, wherein the search target character area specifying means specifies a character area in the search target image using a character area extraction method;
A search target peripheral region specifying means for specifying a peripheral region that is a region existing around each character region;
A search target part in which the search target partial image specifying unit compares each peripheral area with the background area of the search target image using an image specific comparison method, and determines a part of the peripheral area different from the background area as a partial image area. An image identification step;
An inquiry step for inquiring the index DB about the character region and the document in which the partial image region around the character region appears or the position in the document;
The search result output means further performs a search result output step of outputting the result of the inquiry.

また、本発明（請求項８）は、請求項７の検索方法において、
問い合わせステップにおいて、
事前に各ドキュメントの位置とコンテンツを関連付けたデータベースにアクセスし、検索結果であるドキュメント位置に関連付けられたコンテンツを検索結果として得るステップを更に行う。 The present invention (Claim 8) provides a search method according to Claim 7,
In the inquiry step,
A step of accessing a database in which the position and content of each document are associated in advance and obtaining the content associated with the document position as a search result as a search result is further performed.

本発明（請求項９）は、請求項１乃至４のいずれか１項に記載の検索装置を構成する各手段としてコンピュータを機能させるための検索プログラムである。 The present invention (Claim 9) is a search program for causing a computer to function as each means constituting the search device according to any one of Claims 1 to 4.

上記のように本発明によれば、インデックス作成時及び検索実行時に、画像の全体領域を用いるのではなく、文字列と文字列周辺の部分画像のみを用いることにより、ドキュメント群の中から特定ドキュメントの特定位置を一意に取得する検索要求に応じることができる。例えば、手元にカタログの一部の切り抜きしかない場合でも、その切り抜きがどの書籍のどの部分であったかを突き止めることができる。 As described above, according to the present invention, a specific document is selected from a document group by using only a character string and a partial image around the character string instead of using the entire area of the image at the time of index creation and search execution. It is possible to respond to a search request for uniquely acquiring a specific position. For example, even if there is only a part of the catalog cut out at hand, it is possible to determine which part of the book the cut out is.

また、上記処理を実現するための一般手法の一つである、各画像の全体領域の色分布情報をデータベースに格納して、ある画像の全体領域の色分布情報を検索クエリとする方法では、画像の全体領域を検索クエリに含めなければならないという問題があった。しかし、本発明の方法では、画像の一部分と文字列を含む領域のみを検索クエリに含めればよい。ハンディデバイスで撮影した領域を検索クエリとするシステムの場合、ドキュメントの狭い領域だけを撮影すればよいので、検索クエリ画像が鮮明に撮影できる。ハンディデバイスで撮影した領域に関連付けられたコンテンツを取得できるシステムの場合、ドキュメントの狭い部分ごとに問い合わせを行うことができるので、ドキュメントにコンテンツを関連付けるユーザは細かい粒度で関連付けを行うことができ、ドキュメントに関連付けられたコンテンツを取得するユーザは、細かい粒度でコンテンツを取得することができる。 Further, in a method for realizing the above processing, the color distribution information of the entire area of each image is stored in a database, and the color distribution information of the entire area of a certain image is used as a search query. There was a problem that the entire area of the image had to be included in the search query. However, in the method of the present invention, only a region including a part of an image and a character string need be included in the search query. In the case of a system in which an area photographed with a handy device is used as a search query, only a narrow area of a document needs to be photographed, so that a query image can be photographed clearly. In the case of a system that can acquire the content associated with the area captured by the handy device, it is possible to inquire for each narrow part of the document, so that the user who associates the content with the document can perform the association at a fine granularity. The user who acquires the content associated with can acquire the content with fine granularity.

また、上記処理を実現するための一般手法の一つである、各画像の部分領域の色分布情報をデータベースに格納して、ある画像の部分領域の色分布情報を検索クエリとする方法では、データベース格納時及び検索実行時に画像を複数領域の部分領域に分割し、データベース内及び検索クエリの部分領域同士で色分布情報等のマッチングをとるため、マッチングを取る回数が大幅に増加するという問題がった。しかし、本発明の方法では、文字領域の周辺の部分画像同士のマッチングのみを行っているため、マッチング回数を大幅に減らし、処理を高速にできる。 Further, in a method for realizing the above processing, which is a method for storing color distribution information of a partial area of each image in a database and using the color distribution information of a partial area of an image as a search query, There is a problem that the number of times of matching is greatly increased because the image is divided into a plurality of partial areas at the time of database storage and search execution, and color distribution information etc. is matched between the partial areas of the database and the search query. It was. However, in the method of the present invention, only the matching of the partial images around the character area is performed, so that the number of matching operations can be greatly reduced and the processing can be performed at high speed.

また、上記記載の画像を全体領域単位で扱う方法、画像を複数領域に分割する方法の両方に共通して、各画像の全領域の情報をデータベースに格納しているので、格納領域が大量に必要になるという問題があった。しかし、本発明の方法では、文字領域の周辺の部分画像と文字領域の情報のみをデータベースに格納しているため、格納領域を少なく抑えることができる。 In addition, the information on all areas of each image is stored in the database in common with both the method of handling the above-mentioned image in the whole area unit and the method of dividing the image into a plurality of areas. There was a problem that it was necessary. However, according to the method of the present invention, only the partial image around the character area and the information on the character area are stored in the database, so that the storage area can be reduced.

本発明の原理構成図である。It is a principle block diagram of this invention. 本発明の原理を説明するための図である。It is a figure for demonstrating the principle of this invention. 本発明の第１の実施の形態におけるインデックス作成部の構成図である。It is a block diagram of the index preparation part in the 1st Embodiment of this invention. 本発明の第１の実施の形態におけるＰＤＦファイル（書名：vegetable、ファイル名：vegetable3.pdf、ページ：３ページ目）の例である。It is an example of a PDF file (book name: vegetable, file name: vegetable3.pdf, page: third page) in the first embodiment of the present invention. 本発明の第１の実施の形態におけるインデックス作成の処理のフローチャートである。It is a flowchart of the process of the index creation in the 1st Embodiment of this invention. 本発明の第１の実施の形態における入力されたドキュメントのページのデータ構造化リストである。It is a data structured list of the page of the inputted document in the 1st embodiment of the present invention. 本発明の第１の実施の形態におけるＰＤＦファイル（書名：vegetable、ファイル名：vegetable3.pdf、ページ：３ページ目）に対する文字領域特定の例である。It is an example of character area specification with respect to the PDF file (Book name: vegetable, File name: vegetable3.pdf, Page: 3rd page) in the first embodiment of the present invention. 本発明の第１の実施の形態における特定された文字領域のデータ構造化リストである。It is a data structured list of the specified character area in the 1st embodiment of the present invention. 本発明の第１の実施の形態におけるＰＤＦファイル（書名：vegetable、ファイル名：vegetable3.pdf、ページ：３ページ目）に対する周辺領域特定の例である。It is an example of the peripheral region specification for the PDF file (book name: vegetable, file name: vegetable3.pdf, page: third page) in the first embodiment of the present invention. 本発明の第１の実施の形態における特定された周辺領域のデータ構造化リストである。It is a data structured list of the specified peripheral region in the first exemplary embodiment of the present invention. 本発明の第１の実施の形態における周辺領域特定の他の実現方法（文字領域が「め」の場合の一例）である。It is another implementation | achievement method (an example in case a character area is "me") of the surrounding area specification in the 1st Embodiment of this invention. 本発明の第１の実施の形態におけるＰＤＦファイル（書名：vegetable、ファイル名：vegetable3.pdf、ページ：３ページ目）に対する部分領域特定の例である。It is an example of partial area specification with respect to the PDF file (book name: vegetable, file name: vegetable3.pdf, page: third page) in the first embodiment of the present invention. 本発明の第１の実施の形態における特定された部分画像領域のデータ構造化リストである。It is a data structured list of the specified partial image area in the first embodiment of the present invention. 本発明の第１の実施の形態における部分画像領域の色分布情報のデータ構造化リストである。3 is a data structured list of color distribution information of a partial image area in the first embodiment of the present invention. 本発明の第１の実施の形態におけるインデックス出力部の処理のフローチャートである。It is a flowchart of the process of the index output part in the 1st Embodiment of this invention. 本発明の第１の実施の形態におけるインデックスＤＢに出力されるデータの例である。It is an example of the data output to index DB in the 1st Embodiment of this invention. 本発明の第２の実施の形態における検索装置の構成図である。It is a block diagram of the search device in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における検索処理のフローチャートである。It is a flowchart of the search process in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における検索対象のドキュメント（紙媒体）の例である。It is an example of the document (paper medium) of the search object in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における入力された検索対象画像（ＪＰＥＧ）の例である。It is an example of the input search object image (JPEG) in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における検索対象画像のファイル名の例である。It is an example of the file name of the search object image in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における検索対象画像（2009-05-20-18-00-00.jpg）の文字領域特定処理を示す図である。It is a figure which shows the character area specific process of the search object image (2009-05-20-18-00-00.jpg) in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における文字領域データの構造化リストである。It is a structured list of character area data in a 2nd embodiment of the present invention. 本発明の第２の実施の形態における検索対象画像（2009-05-20-18-00-00.jpg）から周辺領域を特定する例である。It is an example which specifies a peripheral area | region from the search object image (2009-05-20-18-00-00.jpg) in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における周辺領域データの構造化リストである。It is a structured list of peripheral area data in the second embodiment of the present invention. 本発明の第２の実施の形態における周辺領域特定の他の実現方法（文字領域が「ト」の場合の一例）である。It is another implementation | achievement method (an example in case a character area is "G") of the surrounding area specification in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における検索対象画像（2009-05-20-18-00-00.jpg）から部分画像領域を特定する例である。It is an example which specifies a partial image area | region from the search object image (2009-05-20-18-00-00.jpg) in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における部分画像領域のデータの構造化リストである。It is a structured list of partial image area data in the second embodiment of the present invention. 本発明の第２の実施の形態における部分画像領域の色分布情報のデータの構造化リストである。It is a structured list of data of color distribution information of a partial image area in the second embodiment of the present invention. 本発明の第２の実施の形態における問い合わせ処理のフローチャートである。It is a flowchart of the inquiry process in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における文字領域のデータに基づく検索結果の例である。It is an example of the search result based on the data of the character area in the 2nd Embodiment of this invention. 本発明の第２の実施の形態におけるインデックスＤＢ検索のフローチャートである。It is a flowchart of index DB search in the 2nd Embodiment of this invention. 本発明の第２の実施の形態におけるインデックスＤＢ検索結果リストである。It is an index DB search result list in the second embodiment of the present invention. 本発明の第２の実施の形態におけるインデックスＤＢ検索結果の集約処理のフローチャートである。It is a flowchart of the aggregation process of the index DB search result in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における集約リストである。It is an aggregation list in the 2nd embodiment of the present invention. 本発明の第２の実施の形態におけるコンテンツＤＢの例である。It is an example of content DB in the 2nd Embodiment of this invention. 本発明の第２の実施の形態におけるコンテンツＤＢ検索処理のフローチャートである。It is a flowchart of content DB search processing in the 2nd Embodiment of this invention.

以下、図面と共に本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

以下では、インデックスを作成する処理と、そのインデックスを用いて検索を実行する処理に分けて述べる。 In the following, the process of creating an index and the process of executing a search using the index will be described separately.

［第１の実施の形態］
本実施の形態では、検索装置内のインデックスを作成する処理について述べる。 [First Embodiment]
In the present embodiment, a process for creating an index in the search device will be described.

図３は、本発明の第１の実施の形態におけるインデックス作成部の構成を示す。 FIG. 3 shows the configuration of the index creation unit in the first embodiment of the present invention.

同図に示すインデックス作成部は、ドキュメント入力部１０、文字領域特定部１１、周辺領域特定部１２、部分画像領域特定部１３、インデックス出力部１４、インデックスＤＢ１５から構成される。 The index creation unit shown in FIG. 1 includes a document input unit 10, a character region specifying unit 11, a peripheral region specifying unit 12, a partial image region specifying unit 13, an index output unit 14, and an index DB 15.

ドキュメントは、文字列または画像、または文字列及び画像を含むページの集合からなる紙媒体の書籍とする。本実施の形態では、この書籍の各ページをスキャナ（一般装置）で読み取り、図４に示すように、異なる閲覧環境（ＯＳ，ＰＤＦ閲覧ソフト等）においても文字列、画像のレイアウトが変化しない電子ファイル（ＰＤＦファイル等）に変換する。なお、ドキュメントの背景色は白と判明しているものとする。 The document is a paper book composed of a character string or an image, or a set of pages including the character string and the image. In this embodiment, each page of the book is read by a scanner (general device), and as shown in FIG. 4, the character string and image layout do not change even in different browsing environments (OS, PDF browsing software, etc.). Convert to a file (such as a PDF file). It is assumed that the background color of the document is known as white.

また、文字領域特定部１１、周辺領域特定部１２、部分画像領域特定部１３は処理の過程で文字列等の情報を一時格納するメモリ（図示せず）を有するものとする。 In addition, the character region specifying unit 11, the peripheral region specifying unit 12, and the partial image region specifying unit 13 have a memory (not shown) that temporarily stores information such as a character string in the course of processing.

以下に、本実施の形態におけるインデックスを作成する処理フローを示す。 The processing flow for creating an index in the present embodiment is shown below.

図５は、本発明の第１の実施の形態におけるインデックス作成の処理のフローチャートである。 FIG. 5 is a flowchart of index creation processing according to the first embodiment of this invention.

以下では、インデックス作成対象となるドキュメントを入力する入力ステップ（ステップ１１０，１２０）、ドキュメント中のページ内から文字領域を特定する文字領域特定ステップ（ステップ１３０）、ドキュメント中のページ内から周辺領域（ページ内の各文字領域周辺の領域）を特定する周辺領域特定ステップ（ステップ１４０）、ドキュメント中のページ内から部分画像領域（ページ内の周辺領域のうち、インデックス作成に用いる部分画像の領域）を特定する部分画像領域特定ステップ（ステップ１５０）、検索を実行するためのインデックスを出力する出力ステップ（ステップ１６０）について説明する。 In the following, an input step for inputting a document to be indexed (steps 110 and 120), a character region specifying step for specifying a character region from the page in the document (step 130), a peripheral region (from the page in the document) A peripheral region specifying step (step 140) for specifying each character region in the page), and a partial image region (region of a partial image used for index creation among the peripheral regions in the page) from the page in the document. The partial image region specifying step (step 150) to be specified and the output step (step 160) for outputting an index for executing the search will be described.

ステップ１１０）ドキュメント入力部１０は、分析対象のドキュメント群の入力を受け付ける。なお、各ドキュメントの形式は図４に示すようなＰＤＦファイルであり、スキャン元である書名とページは判明しているものとする。 Step 110) The document input unit 10 receives input of a document group to be analyzed. It is assumed that the format of each document is a PDF file as shown in FIG. 4, and the book name and page as the scan source are known.

ステップ１２０）ドキュメント入力部１０は、該ドキュメント群に含まれるドキュメント、スキャン元である書名、ページを図６に示すデータ構造でリスト化する。リスト内の各要素は、各ドキュメントを一意に示すものとする。なお、説明の便宜上、ここでは、各ドキュメントを一意に示す情報としてファイル名を用いているが、ファイルのハッシュ値等、ドキュメントを一意に識別できる情報であれば他の情報を利用しても構わない。 Step 120) The document input unit 10 lists the documents included in the document group, the title of the scan source, and the pages in the data structure shown in FIG. Each element in the list shall uniquely indicate each document. For convenience of explanation, the file name is used as information uniquely identifying each document. However, other information may be used as long as the information can uniquely identify the document, such as a hash value of the file. Absent.

ステップ１３０）文字領域特定部１１は、図６のリストに記載されている各ドキュメントを読み込み、各ドキュメントの各ページについて、特開２０００−４８１２０号公報等の文字領域抽出方法を用いて、図７のように各文字を含む矩形領域（文字領域）を特定し、各文字領域の文字領域ＩＤ、文字、書名、ページ、座標、サイズを図８に示すデータ構造でリスト化する。なお、文字領域ＩＤは、説明の便宜上設けた各文字領域を一意に示す識別子である。また、本実施の形態では、座標は、ドキュメントの左上隅を原点とし、右方向にｘ軸、下方向にｙ軸をとった場合に、文字領域の左上の座標をピクセル単位で表現する。他の実現方法として、座標は、各頁における行・列等で表しても構わない。 Step 130) The character area specifying unit 11 reads each document listed in the list of FIG. 6, and uses the character area extraction method disclosed in Japanese Patent Laid-Open No. 2000-48120 for each page of each document, as shown in FIG. A rectangular area (character area) including each character is specified as described above, and the character area ID, character, book name, page, coordinates, and size of each character area are listed in the data structure shown in FIG. The character area ID is an identifier uniquely indicating each character area provided for convenience of explanation. In the present embodiment, the coordinates are expressed in units of pixels when the upper left corner of the document is the origin, the x axis is in the right direction, and the y axis is in the lower direction. As another implementation method, the coordinates may be represented by rows / columns in each page.

また、本実施の形態では、サイズは、文字領域のｘ軸方向の長さ、ｙ軸方向の長さをピクセル単位で表現する。 In this embodiment, the size represents the length of the character area in the x-axis direction and the length in the y-axis direction in units of pixels.

ステップ１４０）周辺領域特定部１２は、図８のリストに記載されている各レコードを読み込み、図９のように各レコードが表す文字領域に隣接している領域（周辺領域）を特定し、各周辺領域の周辺領域ＩＤ、座標、サイズ、文字領域ＩＤ、隣接方向を、図１０に示すデータ構造でリスト化する。 Step 140) The peripheral area specifying unit 12 reads each record described in the list of FIG. 8, specifies an area (peripheral area) adjacent to the character area represented by each record as shown in FIG. The peripheral area ID, coordinates, size, character area ID, and adjacent direction of the peripheral area are listed in the data structure shown in FIG.

なお、本実施の形態では、周辺領域は文字領域と上下左右いずれかの方向に隣接している領域であり、かつ、該文字領域と相似形状の領域であり、かつ、文字領域と周辺領域の相似比は１：Ｎ（ここでは、Ｎ＝１とする）であり、１つ以上の文字領域の全体または一部を含まない領域である。他の実現方法としては、図１１のように１つ以上の文字領域の全体または一部を含む領域としても構わない。 In the present embodiment, the peripheral area is an area that is adjacent to the character area in one of the upper, lower, left, and right directions, is an area that has a similar shape to the character area, and includes the character area and the peripheral area. The similarity ratio is 1: N (here, N = 1), and is an area that does not include all or part of one or more character areas. As another implementation method, an area including all or a part of one or more character areas may be used as shown in FIG.

また、本実施の形態では、周辺領域ＩＤは、説明の便宜上設けた各周辺領域を一意に示す識別子である。 In the present embodiment, the peripheral area ID is an identifier that uniquely indicates each peripheral area provided for convenience of explanation.

また、本実施の形態では、座標は、ドキュメント左上隅を原点とし、右方向にｘ軸、下方向にｙ軸をとった場合に、周辺矩形領域の左上の座標をピクセル単位で表現する。他の実現方法として、座標は各頁における行・列等で表しても構わない。 In the present embodiment, the coordinates are expressed in pixel units in the upper left corner of the surrounding rectangular area when the origin is the upper left corner of the document, the x axis is in the right direction, and the y axis is in the lower direction. As another realization method, the coordinates may be represented by rows and columns on each page.

また、本実施の形態では、サイズは、周辺領域のｘ軸方向の長さ、ｙ軸方向の長さをピクセル単位で表現する。 In the present embodiment, the size represents the length in the x-axis direction and the length in the y-axis direction of the peripheral region in units of pixels.

また、本実施の形態では、文字領域ＩＤは隣接している文字領域の文字領域ＩＤである。 In the present embodiment, the character area ID is a character area ID of an adjacent character area.

また、本実施の形態では、隣接方向は、文字領域から見た、該文字領域に隣接している周辺領域の方向を上下左右で表すものである。他の実現方法としては、文字領域の重心と周辺領域の重心を結ぶ線分が水平方向からどの程度傾いているか等で表現しても構わない。 Further, in the present embodiment, the adjacent direction represents the direction of the peripheral area adjacent to the character area as viewed from the character area in the vertical and horizontal directions. As another realization method, it may be expressed by how much the line segment connecting the centroid of the character area and the centroid of the peripheral area is inclined from the horizontal direction.

ステップ１５０）部分画像領域特定部１３は、図１０のリストに記載されている各レコードを読み込み、図１２のように各レコードが表す周辺領域の中から背景領域であるものを除いた領域（部分画像領域）を特定し、各部分画像領域の部分画像領域ＩＤ、座標、サイズ、文字領域ＩＤ、隣接方向を、図１３に示すデータ構造でリスト化し、各部分画像領域の色分布情報を図１４に示すデータ構造でリスト化する。 Step 150) The partial image area specifying unit 13 reads each record described in the list of FIG. 10, and removes the area (part) that is the background area from the peripheral area represented by each record as shown in FIG. Image area) is specified, the partial image area ID, coordinates, size, character area ID, and adjacent direction of each partial image area are listed in the data structure shown in FIG. 13, and the color distribution information of each partial image area is shown in FIG. List with the data structure shown in.

なお、本実施の形態では、周辺領域の中から背景領域であるものを除く際、及び、各部分画像領域の色分布情報をリスト化する際には、前述の非特許文献１の方法を用いる。前述の通り、事前に判明しているドキュメントの背景色である白が持つ色分布情報Ａを算出し、同じ方法で各周辺領域の色分布情報Ｂも算出し、ＡとＢが異なる場合、該周辺領域を部分画像領域と判定し、Ｂを図４の形式でリストに追加する。他の実現方法としては、周辺領域の色分散（例えば、Ｒ値、Ｇ値、Ｂ値の頻度の分散値）が一定値を超えた場合、つまり、複数の色が混じり合っている領域の場合は背景領域ではなく、部分画像領域と判定しても構わない。 In the present embodiment, the method described in Non-Patent Document 1 described above is used when removing the background region from the peripheral region and when listing the color distribution information of each partial image region. . As described above, the color distribution information A of white, which is the background color of the document that has been determined in advance, is calculated, and the color distribution information B of each peripheral region is calculated by the same method. The peripheral area is determined as a partial image area, and B is added to the list in the format of FIG. As another realization method, when the color dispersion (for example, the dispersion value of the frequency of the R value, the G value, and the B value) in a peripheral region exceeds a certain value, that is, in a region where a plurality of colors are mixed. May be determined not as a background area but as a partial image area.

また、本実施の形態では、部分画像領域ＩＤは説明の便宜上設けた各部分画像領域を一意に示す識別子である。 In the present embodiment, the partial image area ID is an identifier that uniquely indicates each partial image area provided for convenience of explanation.

また、本実施の形態では、座標、サイズ、文字領域ＩＤ、隣接方向は、周辺領域における定義と同様である。 In the present embodiment, the coordinates, size, character area ID, and adjacent direction are the same as the definitions in the peripheral area.

ステップ１６０）インデックス出力部１４は、図８、図１３、図１４のリストに記載されている各レコードを読み込み、図１５示すフローチャートに従って、図８と図１３を文字領域ＩＤで関連付け、図１３と図１４を部分画像領域ＩＤで関連付け、図１６に示す形式のインデックスとしてインデックスＤＢ１５に出力する。 Step 160) The index output unit 14 reads each record described in the list of FIGS. 8, 13, and 14 and associates FIG. 8 and FIG. 13 with the character area ID according to the flowchart shown in FIG. FIG. 14 is associated with the partial image area ID, and is output to the index DB 15 as an index having the format shown in FIG.

ステップ１６１）図８の文字領域のリストを全件読み込む。 Step 161) Read all the character area lists in FIG.

ステップ１６２）カウントｉ＝１とする。 Step 162) Set the count i = 1.

ステップ１６３）ｉ≦Ｘである場合にはステップ１６４に移行し、ｉ＞Ｘである場合は処理を終了する。 Step 163) If i ≦ X, the process proceeds to step 164. If i> X, the process ends.

ステップ１６４）以下の処理を行う。 Step 164) The following processing is performed.

ステップ１６４１）図８の文字領域のリストのｉ番目のレコードと同じ文字領域ＩＤを持つレコード群全Ｙ件を図１３の部分画像領域のリストから読み込む。 Step 1641) All Y records having the same character area ID as the i-th record in the character area list in FIG. 8 are read from the partial image area list in FIG.

ステップ１６４２）カウンタｊをｊ＝１とする。 Step 1642) The counter j is set to j = 1.

ステップ１６４３）ｊ≦Ｙであればステップ１６４４に移行し、ｊ＞Ｙであれば処理を終了する。 Step 1643) If j ≦ Y, the process proceeds to Step 1644, and if j> Y, the process ends.

ステップ１６４４）図１３の部分画像領域のリストのｊ番目のレコードと同じ部分画像領域ＩＤを持つレコード群全Ｚ件（レコード群ｚとする）を図１４の部分画像領域の色分布情報のリストから読み込む。 Step 1644) All Z records (record group z) having the same partial image area ID as the j-th record in the partial image area list of FIG. 13 are selected from the color distribution information list of the partial image area of FIG. Read.

ステップ１６４５）図８の文字領域のｉ番目のレコードの文字、書名、ページ、座標、図１３の部分画像領域のリストのｊ番目のレコードの隣接方向、図１４の部分画像領域の色分布情報のリストのレコード群ｚのＲ値、Ｇ値、Ｂ値、頻度を関連付けて、インデックスの１レコードとしてインデックスＤＢ１４に出力する。 Step 1645) The character, book name, page, coordinates of the i-th record in the character area in FIG. 8, the adjacent direction of the j-th record in the list of partial image areas in FIG. 13, and the color distribution information of the partial image area in FIG. The R value, G value, B value, and frequency of the record group z in the list are associated with each other and output to the index DB 14 as one record of the index.

ステップ１６４６）カウンタｊをｊ＝ｊ＋１として、ステップ１６４３に移行する。 Step 1646) The counter j is set to j = j + 1, and the process proceeds to Step 1643.

［第２の実施の形態］
本実施の形態では、第１の実施の形態で生成されたインデックスＤＢを用いて検索装置で検索を実行する処理の流れを示す。 [Second Embodiment]
In the present embodiment, a flow of processing for executing a search by a search device using the index DB generated in the first embodiment is shown.

図１７は、本発明の第２の実施の形態における検索装置の構成を示す。 FIG. 17 shows the configuration of the search device according to the second embodiment of the present invention.

同図に示す検索装置は、検索対象画像入力部２０、文字領域特定部２１、周辺領域特定部２２、部分画像領域特定部２３、問い合わせ部２４、検索結果出力部２５、コンテンツＤＢ２６、インデックスＤＢ１５、集約リスト記憶部２７、ハンディデバイス２００から構成される。 The search apparatus shown in FIG. 1 includes a search target image input unit 20, a character region specifying unit 21, a peripheral region specifying unit 22, a partial image region specifying unit 23, an inquiry unit 24, a search result output unit 25, a content DB 26, an index DB 15, The aggregate list storage unit 27 and the handy device 200 are included.

インデックスＤＢ１５は、第１の実施の形態において生成されたＤＢである。 The index DB 15 is a DB generated in the first embodiment.

ハンディデバイス２００は、カメラ、ディスプレイ、データ送受信機能を有する一般装置であり、例えば、カメラ付携帯電話がこれに該当する。 The handy device 200 is a general device having a camera, a display, and a data transmission / reception function. For example, the handy device 200 corresponds to a camera-equipped mobile phone.

ドキュメントは、文字列、または、画像、または、文字列または画像を含むページの集合からなる紙媒体の書籍とし、検索対象画像は該ドキュメントの一部を撮影したものとする。なお、ドキュメントの背景色は白と判明しているものとする。 It is assumed that the document is a book of paper medium consisting of a character string, an image, or a set of pages including the character string or image, and the search target image is a part of the document. It is assumed that the background color of the document is known as white.

また、文字領域特定部２１、周辺領域特定部２２、部分画像領域特定部２３、問い合わせ部２４は、処理の過程で文字列等の情報を一時格納するメモリ（図示せず）を有するものとする。 In addition, the character area specifying unit 21, the peripheral area specifying unit 22, the partial image area specifying unit 23, and the inquiry unit 24 have a memory (not shown) that temporarily stores information such as a character string in the course of processing. .

以下に、本実施の形態における検索を実行する処理のフローチャートを示す。 Below, the flowchart of the process which performs the search in this Embodiment is shown.

図１８は、本発明の第２の実施の形態における検索処理のフローチャートである。 FIG. 18 is a flowchart of search processing according to the second embodiment of the present invention.

以下では、検索の対象となる検索対象画像を入力する入力ステップ（ステップ２０１，２０３）、検索対象画像中から文字領域を特定する文字領域特定ステップ（ステップ２０４）、検索対象画像内から周辺領域を特定する周辺領域特定ステップ（ステップ２０５）、検索対象画像内から部分画像領域を特定する部分画像領域特定ステップ（ステップ２０６）、検索対象画像にマッチするドキュメント、ドキュメント内における位置、該位置に予め関連付けられたコンテンツを問い合わせる問い合わせステップ（ステップ２０７〜２１０）、問い合わせを実行した結果を出力する出力ステップ（ステップ２１１，２１２）について説明する。 In the following, an input step for inputting a search target image to be searched (steps 201 and 203), a character region specifying step for specifying a character region from the search target image (step 204), and a peripheral region from within the search target image. Step of specifying peripheral region (step 205), step of specifying partial image region (step 206) for specifying a partial image region from within the search target image, document matching the search target image, position in the document, and pre-association with the position An inquiry step (steps 207 to 210) for inquiring the requested content and an output step (steps 211 and 212) for outputting the result of the inquiry will be described.

ステップ２０１）ハンディデバイス２００で図１９のようなドキュメント（書籍）内のページ内の、画像と文字列の両方を含む部分を撮影して図２０のようなJPEG形式のファイルを作成し、これを検索対象画像とする。 Step 201) Using the handy device 200, a portion of the page in the document (book) as shown in FIG. 19 is photographed to create a JPEG format file as shown in FIG. The search target image is used.

なお、撮影された画像と文字列の情報を含んでいればファイル形式はJPEG以外でも構わない。 It should be noted that the file format may be other than JPEG as long as the captured image and character string information are included.

また、本実施の形態では、検索対象画像は１つのみであるが、検索対象画像を複数用意しても構わない。 In the present embodiment, there is only one search target image, but a plurality of search target images may be prepared.

ステップ２０２）検索対象画像入力部２０は、ハンディデバイス２００から、ネットワークを介して検索対象画像の入力を受け付ける。 Step 202) The search target image input unit 20 receives an input of the search target image from the handy device 200 via the network.

なお、ネットワークを介さずに、ＳＤカード等の記憶媒体を介して検索対象画像の入力を受け付けても構わない。 In addition, you may receive the input of a search object image via storage media, such as an SD card, not via a network.

また、本実施の形態では、検索対象画像は１つのみであるが、検索対象画像が複数ある場合は、それらを同時に受け付けても構わない。 In the present embodiment, there is only one search target image. However, when there are a plurality of search target images, they may be received simultaneously.

ステップ２０３）検索対象画像入力部２０は、検索対象画像（複数ある場合は検索対象画像群）にシステム内で唯一となるようなファイル名を付け、図２１に示すデータ構造でリスト化する。 Step 203) The search target image input unit 20 assigns a unique file name in the system to the search target images (or a search target image group if there are a plurality of search target images), and lists them in the data structure shown in FIG.

なお、説明の便宜上、ここでは、検索対象画像を一意に示すファイル名として、ファイルが入力された時刻を用いているが、ファイルのハッシュ値等、検索対象画像を一意に識別できる情報であれば他の情報を利用しても構わない。 For convenience of explanation, the file input time is used as the file name uniquely indicating the search target image. However, as long as the information such as a hash value of the file can uniquely identify the search target image. Other information may be used.

以降、本実施の形態では、入力された検索対象画像が１つである場合のみを説明するが、複数の検索対象画像が入力された場合は、それらを一意に区別できるようなファイル名を付け、リストに追加し、以降の処理の各検索対象画像について繰り返し行えばよい。 Hereinafter, in the present embodiment, only the case where there is one input search target image will be described. However, when a plurality of search target images are input, a file name that can uniquely distinguish them is assigned. These are added to the list and repeated for each search target image in the subsequent processing.

ステップ２０４）文字領域特定部２１は、図２１のリストに記載されている各検索対象画像を読み込み、各検索対象画像について、前述の特開２０００−４８１２０号公報等の文字領域抽出方法を用いて、図２２のように各文字を含む矩形領域（文字領域）を特定し、各文字領域の文字領域ＩＤ、文字、サイズを図２３に示すデータ構造でリスト化する。 Step 204) The character area specifying unit 21 reads each search target image listed in the list of FIG. 21, and uses the character area extraction method such as the above-mentioned Japanese Patent Laid-Open No. 2000-48120 for each search target image. As shown in FIG. 22, a rectangular area (character area) including each character is specified, and the character area ID, character, and size of each character area are listed in the data structure shown in FIG.

なお、本実施の形態では、文字領域ＩＤは説明の便宜上設けた各文字領域を一意に示す識別子である。 In the present embodiment, the character area ID is an identifier that uniquely indicates each character area provided for convenience of explanation.

ステップ２０５）周辺領域特定部２２は、図２３のリストに記載されている各レコードを読み込み、図２４のように各レコードが表す文字領域に隣接している領域（周辺領域）を特定し、各周辺領域の周辺領域ＩＤ、サイズ、文字領域ＩＤ、隣接方向を、図２５に示すデータ構造でリスト化する。 Step 205) The peripheral area specifying unit 22 reads each record listed in the list of FIG. 23, specifies an area (peripheral area) adjacent to the character area represented by each record as shown in FIG. The peripheral area ID, size, character area ID, and adjacent direction of the peripheral area are listed in the data structure shown in FIG.

なお、本実施の形態では、周辺領域は、文字領域と上下左右のいずれかの方向に隣接している領域であり、かつ、該文字領域と相似形状の領域であり、かつ、文字領域と周辺領域の相似比は１：Ｎ（ここではＮ＝１とする）であり、１つ以上の文字領域の全体または一部を含まない領域である。他の実現方法としては、図２６のように、１つ以上の文字領域の全体または一部を含む領域としても構わない。 In the present embodiment, the peripheral area is an area that is adjacent to the character area in any of the upper, lower, left, and right directions, is an area that is similar to the character area, and is adjacent to the character area. The similarity ratio of the areas is 1: N (N = 1 here), and is an area that does not include all or part of one or more character areas. As another implementation method, as shown in FIG. 26, an area including all or a part of one or more character areas may be used.

また、本実施の形態では、サイズは、周辺領域ｘ軸方向の長さ、ｙ軸方向の長さをピクセル単位で表現する。 In this embodiment, the size expresses the length in the peripheral region x-axis direction and the length in the y-axis direction in units of pixels.

また、本実施の形態では、文字領域ＩＤは、隣接している文字領域の文字領域ＩＤである。 In the present embodiment, the character area ID is a character area ID of an adjacent character area.

ステップ２０６）部分画像領域特定部２３は、図２５のリストに記載されている各レコードを読み込み、図２７のように各レコードが表す周辺領域の中から背景領域であるものを除いた領域（部分画像領域）を特定し、各部分画像領域の部分画像領域ＩＤ、サイズ、文字領域ＩＤ，隣接方向を、図２８に示すデータ構造でリスト化し、各部分画像領域の色分布情報を図２９に示すデータ構造でリスト化する。 Step 206) The partial image region specifying unit 23 reads each record described in the list of FIG. 25, and removes the region (part) that is the background region from the peripheral region represented by each record as shown in FIG. 28), the partial image area ID, size, character area ID, and adjacent direction of each partial image area are listed in the data structure shown in FIG. 28, and the color distribution information of each partial image area is shown in FIG. List by data structure.

なお、本実施の形態では、周辺領域の中から背景領域であるものを除く際、及び、各部分画像領域の色分布情報をリスト化する際は、第１の実施の形態のステップ１５０と同様の方法を用いる。 In the present embodiment, when the background area is excluded from the peripheral areas, and when the color distribution information of each partial image area is listed, it is the same as step 150 in the first embodiment. The method is used.

また、本実施の形態では、部分画像領域ＩＤは、説明の便宜上設けた各部分画像領域を一意に示す識別子である。 In the present embodiment, the partial image area ID is an identifier that uniquely indicates each partial image area provided for convenience of explanation.

また、本実施の形態では、サイズ、文字領域ＩＤ、隣接方向は、周辺領域における定義と同様である。 In the present embodiment, the size, the character area ID, and the adjacent direction are the same as the definitions in the peripheral area.

ステップ２０７）問い合わせ部２４は、図２３の文字領域のデータ、図２８の部分画像領域のデータ、図２９の部分画像領域の色分布情報のリストに記載されている各レコードを読み込み、図３０のフローチャートに従って図２３と図２８を文字領域ＩＤで関連付け、図２８と図２９を部分画像領域ＩＤで関連付け図３１に示す形式のリストを作成する。 Step 207) The inquiry unit 24 reads each record described in the list of the character area data in FIG. 23, the partial image area data in FIG. 28, and the color distribution information in the partial image area in FIG. According to the flowchart, FIG. 23 and FIG. 28 are associated with the character area ID, and FIG. 28 and FIG. 29 are associated with the partial image area ID to create a list in the format shown in FIG.

図３０は、本発明の第２の実施の形態における問い合わせ処理のフローチャートである。 FIG. 30 is a flowchart of inquiry processing in the second embodiment of this invention.

ステップ３１０）図２３の文字領域のリストを全Ｘ件読み込む。 Step 310) Read all X lists of character areas in FIG.

ステップ３２０）カウンタｉをｉ＝１とする。 Step 320) The counter i is set to i = 1.

ステップ３３０）ｉ≦Ｘであればステップ３４０に移行し、ｉ＞Ｘであれば当該問い合わせ処理を終了する。 Step 330) If i ≦ X, the process proceeds to Step 340, and if i> X, the inquiry process is terminated.

ステップ３４０）以下の処理を行う。 Step 340) The following processing is performed.

ステップ３４１）図２３の文字領域のリストのｉ番目のレコードと同じ文字領域を持つレコード群全Ｙ件を図２８の部分画像領域のリストから読み込む。 Step 341) All Y records having the same character area as the i-th record in the character area list in FIG. 23 are read from the partial image area list in FIG.

ステップ３４２）カウンタｊをｊ＝１とする。 Step 342) The counter j is set to j = 1.

ステップ３４３）ｊ≦Ｙであればステップ３４４に移行し、ｊ＞Ｙであれば当該処理を終了する。 Step 343) If j ≦ Y, the process proceeds to Step 344, and if j> Y, the process ends.

ステップ３４４）図２８の部分画像領域ＩＤを持つレコード群全Ｚ件（レコード群ｚとする）を図２９の部分画像領域の色分布情報のリストから読み込む。 Step 344) All Z records (record group z) having the partial image area ID of FIG. 28 are read from the color distribution information list of the partial image area of FIG.

ステップ３４５）図２３の文字領域のリストのｉ番目のレコードの文字、図２８のリストのｊ番目のレコードの隣接方向、図２９の部分画像領域の色分布情報のリストのレコード群ｚのＲ値、Ｇ値、Ｂ値、頻度を関連付けて、図３１の検索結果のリストの１レコードとして出力する。 Step 345) The character of the i-th record in the list of character areas in FIG. 23, the adjacent direction of the j-th record in the list of FIG. 28, and the R value of the record group z in the color distribution information list of the partial image area in FIG. , G value, B value, and frequency are associated and output as one record in the search result list of FIG.

ステップ３４６）カウンタｊをｊ＝ｊ＋１とし、ステップ３４３に移行する。 Step 346) Set the counter j to j = j + 1, and proceed to Step 343.

ステップ３５０）カウンタｉをｉ＝ｉ＋１とし、ステップ３３０に移行する。 Step 350) Set the counter i to i = i + 1, and proceed to Step 330.

ステップ２０８）問い合わせ部２４は、図１３のデータ構造を持つインデックスＤＢ１５に、図３２のフローチャートに従って問い合わせを行い、図３３に示す検索結果を得る。 Step 208) The inquiry unit 24 makes an inquiry to the index DB 15 having the data structure of FIG. 13 according to the flowchart of FIG. 32, and obtains the search result shown in FIG.

図３２は、本発明の第２の実施の形態におけるインデックスＤＢ検索のフローチャートである。 FIG. 32 is a flowchart of the index DB search in the second embodiment of this invention.

ステップ４０１）図３１の文字領域のリスト全Ｘ件を読み込む。 Step 401) Read all X items in the character area list of FIG.

ステップ４０２）カウンタｉをｉ＝１とする。 Step 402) Set the counter i to i = 1.

ステップ４０３）カウンタｉ≦Ｘである場合はステップ４０４に移行し、ｉ＞Ｘである場合は当該処理を終了する。 Step 403) If the counter i ≦ X, the process proceeds to step 404. If i> X, the process is terminated.

ステップ４０４）図２３の文字領域のリストのｉ番目のレコードと同じレコードと同じ文字、隣接方向、Ｒ値、Ｇ値、Ｂ値、頻度を持つレコード群全Ｙ件の書名、ページ、座標を図１６のインデックスＤＢ１５から読み込み、図３３のインデックスＤＢ検索結果リストに追加する。 Step 404) The name, page, and coordinates of all Y records in the record group having the same character, adjacent direction, R value, G value, B value, and frequency as the i-th record in the character area list of FIG. 16 is read from the index DB 15 and added to the index DB search result list of FIG.

ステップ４０５）カウンタｉをｉ＝ｉ＋１とし、ステップ４０３に移行する。 Step 405) Set the counter i to i = i + 1, and proceed to Step 403.

ステップ２０９）問い合わせ部２４は、図３３のリストに記載されているインデックスＤＢ１５を検索した結果の各レコードを読み込み、図３４のフローチャートに従って、座標が近い検索結果をグループに集約し、各グループに集約されている元レコードの数（集約数）を集計し、図３５に示すリストを集約リスト記憶部２７に出力する。 Step 209) The inquiry unit 24 reads each record of the search result of the index DB 15 listed in the list of FIG. 33, and aggregates the search results having similar coordinates into groups according to the flowchart of FIG. The number of collected original records (aggregation number) is aggregated, and the list shown in FIG. 35 is output to the aggregation list storage unit 27.

図３４は、本発明の第２の実施の形態におけるインデックスＤＢ検索結果の集約処理のフローチャートである。 FIG. 34 is a flowchart of index DB search result aggregation processing according to the second embodiment of this invention.

ステップ５１０）図３３のインデックスＤＢ１５の検索結果のリスト全Ｘ件を読み込む。 Step 510) Read all X search result lists in the index DB 15 of FIG.

ステップ５２０）カウンタｉをｉ＝１とする。 Step 520) The counter i is set to i = 1.

ステップ５３０）カウンタｉ≦Ｘであれば、ステップ５４０に移行し、ｉ＞Ｘであればステップ５６０に移行する。 Step 530) If the counter i ≦ X, go to Step 540, and if i> X, go to Step 560.

ステップ５４０）以下の処理を行う。 Step 540) The following processing is performed.

ステップ５４１）集約先レコードの中に図３３のｉ番目のレコードと同じ書名・ページで、座標が近い（例えば、ｘ軸方向・ｙ軸方向の２０ピクセル以内）レコードがあるかを判定し、ある場合はステップ５４２に移行し、ない場合はステップ５４３に移行する。 Step 541) It is determined whether there is a record having the same book title / page as the i-th record in FIG. 33 and having coordinates close to each other (for example, within 20 pixels in the x-axis direction and the y-axis direction). If yes, then go to Step 542; if not, go to Step 543.

ステップ５４２）図３３のリストのｉ番目のレコードを、座標が一番近い集約先レコード（複数存在する場合は座標が原点に最も近いもの）に集約し、該集約先レコードの集約数を１増やし、ステップ５５０に移行する。 Step 542) The i-th record in the list of FIG. 33 is aggregated to the aggregation destination record having the closest coordinates (when there are a plurality of coordinates, the coordinates are closest to the origin), and the aggregation number of the aggregation destination record is increased by one. The process proceeds to step 550.

ステップ５４３）図３３のリストのｉ番目のレコードを、新たな集約先レコードとし、当該集約先レコードの集約数を１として、書名・ページ・座標と共にメモリ（図示せず）上に保存し、ステップ５５０に移行する。 Step 543) The i-th record in the list of FIG. 33 is set as a new aggregation destination record, the aggregation number of the aggregation destination record is set to 1, and stored in a memory (not shown) together with the title, page, and coordinates. 550.

ステップ５５０）カウンタｉをｉ＝ｉ＋１とし、ステップ５３０に移行する。 Step 550) The counter i is set to i = i + 1, and the process proceeds to Step 530.

ステップ５６０）ステップ５３０において、ｉ＞Ｘである場合は、メモリ（図示せず）上に保存されている集約先レコードの書名・ページ・座標・集約数を図３５のリストとして集約リスト記憶部２７に出力する。 Step 560) In step 530, if i> X, the list, page, coordinates, and number of aggregations of the aggregation destination records stored in the memory (not shown) are used as the list of FIG. Output to.

なお、本実施の形態では、座標がｘ軸方向・ｙ軸方向の２０ピクセルの範囲内にあれば、座標が近いと判定するものとする。 In the present embodiment, it is determined that the coordinates are close if the coordinates are within a range of 20 pixels in the x-axis direction and the y-axis direction.

ステップ２１０）問い合わせ部２４は、図３５の集約リスト記憶部２７のリストの中で集約数が一番大きいレコードを読み込み、事前にシステム運用者等が作成した図３６のデータ構造を持つコンテンツＤＢ２６に対し、図３７のフローチャートに従って当該レコードに座標が近い位置に関連付けられたコンテンツを問い合わせ、得られたコンテンツを検索結果出力部２５に渡す。 Step 210) The inquiry unit 24 reads the record having the largest aggregation number in the list of the aggregation list storage unit 27 in FIG. 35, and stores it in the content DB 26 having the data structure in FIG. 36 created in advance by the system operator or the like. On the other hand, in accordance with the flowchart of FIG. 37, the content associated with the position whose coordinates are close to the record is inquired, and the obtained content is passed to the search result output unit 25.

図３７は、本発明の第２の実施の形態におけるコンテンツＤＢ検索処理のフローチャートである。 FIG. 37 is a flowchart of content DB search processing according to the second embodiment of the present invention.

ステップ６０１）図３５の集約リスト記憶部２７のリスト中で集約数が最大のレコードＡを読み込む。 Step 601) Read the record A having the maximum number of aggregations in the list of the aggregation list storage unit 27 in FIG.

ステップ６０２）コンテンツＤＢ２６の中に、レコードＡと書名・ページが同一のレコードＢが含まれている場合はステップ６０３に移行し、含まれていないばあいはステップ６０５に移行する。 Step 602) If the content DB 26 includes the record B having the same book title / page as the record A, the process proceeds to step 603. Otherwise, the process proceeds to step 605.

ステップ６０３）レコードＡとレコードＢの座標が近い（ｘ軸方向・ｙ軸方向に２０ピクセル以内）場合はステップ６０４に移行し、近くない場合はステップ６０５に移行する。 Step 603) If the coordinates of record A and record B are close (within 20 pixels in the x-axis direction and y-axis direction), the process proceeds to step 604, and if not, the process proceeds to step 605.

ステップ６０４）レコードＢのコンテンツを出力する。 Step 604) The contents of record B are output.

ステップ６０５）「コンテンツ無し」と判定する。 Step 605) Determine “no content”.

なお、本実施の形態では、座標がｘ軸方向、ｙ軸方向に２０ピクセルの範囲内にあれば、座標が近いと判定する。 In the present embodiment, if the coordinates are within 20 pixels in the x-axis direction and the y-axis direction, it is determined that the coordinates are close.

ステップ２１１）検索結果出力部２５は、問い合わせ部２４からコンテンツを渡されていれば、ハンディデバイス２００にネットワークを介してコンテンツを送信する。問い合わせ部２４が「コンテンツ無し」と判定していれば、ハンディデバイス２００には何も渡さない。 Step 211) If the content is received from the inquiry unit 24, the search result output unit 25 transmits the content to the handy device 200 via the network. If the inquiry unit 24 determines “no content”, nothing is handed to the handy device 200.

なお、検索結果出力部２５からハンディデバイス２００にコンテンツを渡す際は、ネットワークを介さず、ＳＤカード等の記憶媒体を介してコンテンツを渡しても構わない。 Note that when the content is transferred from the search result output unit 25 to the handy device 200, the content may be transferred via a storage medium such as an SD card, not via a network.

ステップ２１２）ハンディデバイス２００は、検索結果出力部２５から渡されたコンテンツを、予め定められた手段で表示する。 Step 212) The handy device 200 displays the content delivered from the search result output unit 25 by a predetermined means.

本実施の形態では、拡張子がｈｔｍｌで終わるコンテンツが渡された場合は、ハンディデバイス２００に内蔵のＷｅｂブラウザで該コンテンツを表示するものとする。 In the present embodiment, when a content whose extension ends with html is passed, the content is displayed by a Web browser built in the handy device 200.

なお、上記の図３、図１７に示す各構成要素の動作をプログラムとして構築し、インデックス作成部、検索装置として利用されるコンピュータにインストールして実行させる、または、ネットワークを介して流通させることが可能である。 The operation of each component shown in FIG. 3 and FIG. 17 is constructed as a program and can be installed and executed in a computer used as an index creation unit and a search device, or distributed via a network. Is possible.

また、構築されたプログラムをハードディスクや、フレキシブルディスク・ＣＤ−ＲＯＭ等の可搬記憶媒体に格納し、コンピュータにインストールする、または、配布することが可能である。 Further, the constructed program can be stored in a portable storage medium such as a hard disk, a flexible disk, or a CD-ROM, and can be installed or distributed in a computer.

なお、本発明は、上記の実施の形態及び実施例に限定されることなく、特許請求の範囲内において種々変更・応用が可能である。 The present invention is not limited to the above-described embodiments and examples, and various modifications and applications can be made within the scope of the claims.

１０ドキュメント入力部
１１文字領域特定部
１２周辺領域特定部
１３部分画像領域特定部
１４インデックス出力部
１５インデックスＤＢ
２０検索対象画像入力部
２１文字領域特定部
２２周辺領域特定部
２３部分画像領域特定部
２４問い合わせ部
２５検索結果出力部
２６コンテンツＤＢ
２７集約リスト記憶部 DESCRIPTION OF SYMBOLS 10 Document input part 11 Character area specific part 12 Peripheral area specific part 13 Partial image area specific part 14 Index output part 15 Index DB
20 Search target image input unit 21 Character region specifying unit 22 Peripheral region specifying unit 23 Partial image region specifying unit 24 Inquiry unit 25 Search result output unit 26 Content DB
27 Aggregation list storage

Claims

A page break or a line feed position has been determined, and a search request for acquiring a document in which the area appears and a position in the document using a partial area including the character string as a search query in the document including the character string is satisfied. A search device for creating a search index for performing a search,
A document group input means for accepting input of a document group to be indexed;
A character region specifying means for specifying a character region in the document using a character region extraction method;
A peripheral area specifying means for specifying a peripheral area which is an area existing around each character area;
A partial image specifying means for comparing each peripheral area with a background area of the document using an image feature comparison method, and determining a partial image area that is different from the background area among the peripheral areas;
Index output means for associating the character area, the partial image area around the character area, and the appearance position in the document in which the character string appears, and outputting the associated position to the index DB;
A search device comprising:

The document also includes an image,
The search device according to claim 1, wherein a partial area including an image and a character string in the document is used as a search query.

A search target image input means for receiving an input of an image obtained by photographing a partial area of a document including an image and a character string;
Search target character area specifying means for specifying a character area in the search target image using a character area extraction method;
Search target peripheral area specifying means for specifying a peripheral area that is an area existing around each character area;
A search target partial image specifying unit that compares each peripheral region with a background region of the search target image using an image specific comparison method, and determines a partial image region that is different from the background region among the peripheral regions;
Inquiry means for inquiring the index DB for the character area and a document in which the partial image area around the character area appears or a position in the document;
Search result output means for outputting the result of the inquiry;
The search device according to claim 1, further comprising:

The inquiry means includes
The search device according to claim 3, further comprising means for accessing a database in which the position and content of each document are associated in advance and obtaining the content associated with the document position as a search result as the search result.

A page break or a line feed position has been determined, and a search request for acquiring a document in which the area appears and a position in the document using a partial area including the character string as a search query in the document including the character string is satisfied. A search method for a device that creates a search index and performs a search,
A document group input step in which the document group input means accepts input of a document group to be indexed;
A character region specifying means for specifying a character region in the document using a character region extraction method;
A peripheral area specifying step for specifying a peripheral area that is an area existing around each of the character areas;
A partial image specifying unit compares each peripheral area with the background area of the document using an image feature comparison method, and determines a partial image area that is different from the background area as the partial image area. Steps,
An index output step, wherein the index output means associates the character area, the partial image area around the character area, and the appearance position in the document in which the character string appears, and outputs the associated position to the index DB;
The search method characterized by performing.

The document also includes an image,
The search method according to claim 5, wherein a partial area including an image and a character string in the document is used as a search query.

A search target image input unit for receiving an input of an image obtained by photographing a partial region of a document including an image and a character string;
A search target character region specifying unit, wherein a character region in the search target image is specified using a character region extraction method;
A search target peripheral region specifying unit for specifying a peripheral region that is a region existing around each character region;
A search target partial image specifying unit uses an image specific comparison method to compare each peripheral region with a background region of the search target image, and to determine a peripheral image different from the background region as a partial image region A target partial image identification step;
Inquiry means for inquiring the index DB about the character area and a document in which the partial image area around the character area appears or a position in the document,
A search result output means for outputting a result of the inquiry;
The search method according to claim 5 or 6, further comprising:

In the inquiry step,
8. The search method according to claim 7, further comprising the step of accessing a database in which the position and content of each document are associated in advance and obtaining the content associated with the document position as a search result as the search result.

The search program for functioning a computer as each means which comprises the search device of any one of Claims 1 thru | or 4.