JP5186863B2

JP5186863B2 - Image aggregation device and image aggregation program

Info

Publication number: JP5186863B2
Application number: JP2007254770A
Authority: JP
Inventors: 志華鍾; 幸代上堀
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2007-09-28
Filing date: 2007-09-28
Publication date: 2013-04-24
Anticipated expiration: 2027-09-28
Also published as: JP2009086935A

Description

本発明は、複数の画像をステンドグラス状に集約して表示させる画像集約装置に関し、特に、集約後の各画像内の文字の可読性を高める画像集約装置に関する。 The present invention relates to an image aggregating apparatus that aggregates and displays a plurality of images in a stained glass shape, and more particularly to an image aggregating apparatus that improves the readability of characters in each image after aggregation.

画像用の検索エンジン等で検索結果として抽出され、表示装置上に縮小されて表示される画像（以下、抽出画像という。）は、その抽出画像内の文字の可読性が劣る場合がある。例えば、特許文献１では、ＰＤＡや携帯電話等の小型移動装置上で画像や文字等を表示させるに際し、表示領域に多くの情報量を表示させるために、画像や文字等を透過的に重ね合わせることが開示されている。 An image extracted as a search result by an image search engine or the like and reduced and displayed on a display device (hereinafter referred to as an extracted image) may have poor readability of characters in the extracted image. For example, in Patent Document 1, when displaying an image or a character on a small mobile device such as a PDA or a mobile phone, the image or the character is transparently superimposed in order to display a large amount of information in the display area. It is disclosed.

また、特許文献２では、関心領域処理（ＲＯＩ（Region Of Interest）処理）に基づき、複数の抽出画像をステンドグラス状に集約することで、小さな表示領域であっても多くの情報量を表示できることが開示されている。 Further, in Patent Document 2, a large amount of information can be displayed even in a small display area by aggregating a plurality of extracted images into a stained glass shape based on a region of interest process (ROI (Region Of Interest) process). Is disclosed.

特開２００５−３２２２１３号公報JP 2005-322213 A 特開２００６−９２５５６号公報JP 2006-92556 A

ところで、上述したようなステンドグラス状に抽出画像を集約して生成される画像（以下、集約画像という。）の表示においては、その集約画像を構成する各画像内に含まれる文字が縮小されたり、他の抽出画像によって各画像内の文字が分断されたりすることで文字の可読性が劣る可能性がある。 By the way, in the display of an image (hereinafter referred to as an aggregated image) generated by aggregating extracted images in a stained glass shape as described above, characters included in each image constituting the aggregated image are reduced. The characters in each image may be divided by other extracted images, which may result in poor character readability.

本発明は、このような事情に鑑みてなされたものであり、集約画像が生成されたことにより、集約画像を構成する画像内の文字の可読性が劣ることを抑制する画像集約装置及び画像集約プログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and an image aggregating apparatus and an image aggregating program that suppress the deterioration of the readability of characters in images constituting the aggregated image due to the generation of the aggregated image. The purpose is to provide.

請求項１に記載の発明は、入力装置により入力される検索用文字情報に基づき、該検索用文字情報に関連する画像情報を抽出画像として複数抽出する抽出手段と、抽出手段により抽出された複数の抽出画像の少なくとも１つが文字列と図柄とを備えた抽出画像であり、かつ、該文字列を示す面積が該図柄を示す面積より大きい場合に、該抽出画像が文字列で構成されると判定する判定手段と、文字列を拡大して表示させるための文字列表示領域を抽出画像の数に応じて生成するとともに、関心領域処理に基づいて、抽出された複数の抽出画像から集約画像を生成する生成手段と、生成手段により生成された集約画像を表示させるとともに、生成手段により生成された文字列表示領域のそれぞれに、判定手段により文字列で構成されると判定された抽出画像と、抽出画像が備える文字列と、を当該抽出画像ごとに表示させる表示制御手段と、を有する画像集約装置である。 The invention described in claim 1 is based on the search character information input by the input device , and extracts a plurality of image information related to the search character information as an extracted image, and a plurality of pieces extracted by the extraction unit When at least one of the extracted images is an extracted image including a character string and a pattern, and the area indicating the character string is larger than the area indicating the pattern, the extracted image is configured by a character string. A determination means for determining and a character string display area for displaying the character string in an enlarged manner are generated according to the number of extracted images, and an aggregated image is extracted from a plurality of extracted images based on the region of interest processing. The generation unit to be generated and the aggregated image generated by the generation unit are displayed, and each of the character string display areas generated by the generation unit is determined to be composed of character strings by the determination unit. An extraction image, a character string included in the extracted image, and display control means for displaying each said extracted image to an image aggregation device having a.

請求項２に記載の発明は、コンピュータを、入力装置により入力される検索用文字情報に基づき、該検索用文字情報に関連する画像情報を抽出画像として複数抽出する抽出手段、抽出手段により抽出された複数の抽出画像の少なくとも１つが文字列と図柄とを備えた抽出画像であり、かつ、該文字列を示す面積が該図柄を示す面積より大きい場合に、該抽出画像が文字列で構成されると判定する判定手段、文字列を拡大して表示させるための文字列表示領域を抽出画像の数に応じて生成するとともに、関心領域処理に基づいて、抽出された複数の抽出画像から集約画像を生成する生成手段、生成手段により生成された集約画像を表示させるとともに、生成手段により生成された文字列表示領域のそれぞれに、判定手段により文字列で構成されると判定された抽出画像と、抽出画像が備える文字列と、を当該抽出画像ごとに表示させる表示制御手段、として機能させるための画像集約プログラムである。 According to the second aspect of the present invention, the computer is extracted by extraction means and extraction means for extracting a plurality of image information related to the search character information as extracted images based on the search character information input by the input device. In addition, when at least one of the plurality of extracted images is an extracted image having a character string and a pattern, and the area indicating the character string is larger than the area indicating the pattern, the extracted image is configured by a character string. A determination unit that determines that the character string display area for displaying the character string in an enlarged manner is generated according to the number of extracted images, and the aggregated image is extracted from the plurality of extracted images extracted based on the region of interest processing Generating means for displaying the aggregated image generated by the generating means, and each character string display area generated by the generating means is constituted by a character string by the determining means. And the determined extracted image and the character string included in the extracted image, which is the image-intensive program for functioning as display control means, to be displayed for each said extracted image.

請求項１に記載の発明によれば、集約画像を構成する画像内の文字の可読性が劣ることを抑制することができる。 According to invention of Claim 1, it can suppress that the readability of the character in the image which comprises an aggregated image is inferior.

請求項２に記載の発明によれば、集約画像を構成する画像内の文字の可読性が劣ることを抑制することができる。 According to invention of Claim 2 , it can suppress that the readability of the character in the image which comprises an aggregated image is inferior.

以下、本発明の最良の実施形態について図面を参照して説明する。
（第１実施形態）
図１は本発明の実施形態に係る画像集約装置の構成図である。
画像集約装置は、図１に示すように、検索部１１、結果分析部１２、データ管理部１３、データベース１４、テキスト生成部１５、ＳＧ（Stained Glass）生成部１６、表示制御部１７等から構成される。尚、テキスト生成部１５、ＳＧ生成部１６、表示制御部１７等から本発明の表示制御手段が構成される。 DESCRIPTION OF EXEMPLARY EMBODIMENTS Hereinafter, exemplary embodiments of the invention will be described with reference to the drawings.
(First embodiment)
FIG. 1 is a configuration diagram of an image aggregating apparatus according to an embodiment of the present invention.
As shown in FIG. 1, the image aggregating apparatus includes a search unit 11, a result analysis unit 12, a data management unit 13, a database 14, a text generation unit 15, an SG (Stained Glass) generation unit 16, a display control unit 17, and the like. Is done. The text generation unit 15, SG generation unit 16, display control unit 17 and the like constitute the display control means of the present invention.

また、画像集約装置は、いわゆるコンピュータ、すなわち、図２に示すように、ＣＰＵ１０ａ等の処理装置、ＳＲＡＭ(Static Random Access Memory)、ＤＲＡＭ(Dynamic RAM)やＳＤＲＡＭ（Synchronous DRAM）、ＮＶＲＡＭ（Non Volatile RAM）等のＲＡＭ１０ｂ、フラッシュメモリ等のＲＯＭ(Read Only Memory)１０ｃ、入力装置や表示装置等の外部機器との入出力を行うＩ／Ｆ１０ｄ、図示しないハードディスク等の磁気ディスク等がバス１０ｅにより接続されたハードウェア構成により実現される。 The image aggregation device is a so-called computer, that is, as shown in FIG. 2, a processing device such as a CPU 10a, SRAM (Static Random Access Memory), DRAM (Dynamic RAM), SDRAM (Synchronous DRAM), NVRAM (Non Volatile RAM). ) And the like, a ROM (Read Only Memory) 10c such as a flash memory, an I / F 10d that inputs and outputs to an external device such as an input device and a display device, and a magnetic disk such as a hard disk (not shown) are connected by a bus 10e. This is realized by the hardware configuration.

したがって、ＣＰＵ１０ａがＲＯＭ１０ｃやハードディスクに格納された所要のプログラムを読み込み、当該プログラムに従った演算を行うことにより、画像集約装置内の各機能が実現される。尚、このようなプログラムとしては後述するフローチャートに応じたプログラムとすることができる。 Therefore, the CPU 10a reads a required program stored in the ROM 10c or the hard disk and performs an operation according to the program, thereby realizing each function in the image aggregation device. In addition, as such a program, it can be set as the program according to the flowchart mentioned later.

検索部１１は、キーボードやマウス等の入力装置からの指示に基づき、集約用の画像をインターネットや社内ＬＡＮ等から検索する。より詳しくは、図３に示すように、液晶ディスプレイやＣＲＴディスプレイ等の表示装置に表示される表示画像のうち、画像検索用の入力ボックス２１の文字情報に応じた画像を検索する。表示画像としては例えばＷｅｂブラウザ２０等がある。図３においては例えば文字情報「ＧＣＣ」が入力ボックス２１に入力されているため、文字情報「ＧＣＣ」を有する画像又はこれに関連する画像を検索する。検索結果として抽出された抽出画像は、結果分析部１２に送信される。 The retrieval unit 11 retrieves an image for aggregation from the Internet, an in-house LAN, or the like based on an instruction from an input device such as a keyboard or a mouse. More specifically, as shown in FIG. 3, an image corresponding to character information in the image search input box 21 is searched from display images displayed on a display device such as a liquid crystal display or a CRT display. An example of the display image is a Web browser 20 or the like. In FIG. 3, for example, character information “GCC” is input to the input box 21, so an image having character information “GCC” or an image related thereto is searched. The extracted image extracted as the search result is transmitted to the result analysis unit 12.

結果分析部１２は、検索部１１から送信される抽出画像を受信し、当該抽出画像が文字のみから構成されるか、図柄のみから構成されるか、これらが混在するか否かを分析する。
データ管理部１３は、結果分析部１２による分析結果に応じて、抽出画像を区別してデータベース１４に格納する。 The result analysis unit 12 receives the extracted image transmitted from the search unit 11, and analyzes whether the extracted image is composed of only characters, only the symbols, or whether they are mixed.
The data management unit 13 distinguishes and stores the extracted images in the database 14 according to the analysis result by the result analysis unit 12.

テキスト生成部１５は、データ管理部１３に対しデータベース１４から文字のみから構成される抽出画像の送信要求を出力する。さらに、テキスト生成部１５は、当該要求に基づき抽出画像を取得すると、抽出画像内の文字列を取得して、文字列を表示するための文字列表示領域（以下、テキストレイヤという。）を生成し、当該テキストレイヤ内の各表示レイヤに文字列を追加していく。文字列が追加されたテキストレイヤは表示制御部１７に送信される。 The text generation unit 15 outputs a transmission request for an extracted image composed only of characters from the database 14 to the data management unit 13. Furthermore, when the text generation unit 15 acquires an extracted image based on the request, the text generation unit 15 acquires a character string in the extracted image and generates a character string display area (hereinafter referred to as a text layer) for displaying the character string. Then, a character string is added to each display layer in the text layer. The text layer to which the character string is added is transmitted to the display control unit 17.

ＳＧ生成部１６は、データ管理部１３に対しデータベース１４から図柄のみから構成される抽出画像の送信要求を出力する。さらに、ＳＧ生成部１５は、当該要求に基づき抽出画像を取得すると、抽出画像内の図柄を取得して、関心領域処理に基づいて、ステンドグラス状の画像（集約画像）を生成する。このような画像を生成する場合には、上述した特許文献２に開示される技術や、特開２００５−２９３５７６号公報、特開２００５−２９３５７７号公報等に開示される技術を利用することができる。 The SG generation unit 16 outputs a transmission request for an extracted image including only symbols from the database 14 to the data management unit 13. Further, when the SG generation unit 15 acquires the extracted image based on the request, the SG generation unit 15 acquires a pattern in the extracted image and generates a stained glass image (aggregated image) based on the region of interest processing. In the case of generating such an image, the technique disclosed in Patent Document 2 described above, the technique disclosed in Japanese Patent Laid-Open No. 2005-293576, Japanese Patent Laid-Open No. 2005-293577, or the like can be used. .

表示制御部１７は、テキスト生成部１５で生成されるテキストレイヤ、ＳＧ生成部１６で生成される集約画像を表示装置に表示させる。 The display control unit 17 causes the display device to display the text layer generated by the text generation unit 15 and the aggregated image generated by the SG generation unit 16.

続いて、画像集約装置の動作について図面を参照して説明する。
図４は画像集約装置の動作の一例を示すフローチャート、図５は実施形態に係る抽出画像と表示結果を説明するための図、図６は実施形態に係る抽出画像と表示結果を説明するための他の図、図７は実施形態に係る表示画像と比較例に係る表示画像の一例である。 Next, the operation of the image aggregation device will be described with reference to the drawings.
FIG. 4 is a flowchart showing an example of the operation of the image aggregation device, FIG. 5 is a diagram for explaining the extracted image and the display result according to the embodiment, and FIG. 6 is for explaining the extracted image and the display result according to the embodiment. FIG. 7 is another example of the display image according to the embodiment and the display image according to the comparative example.

画像集約装置は、図４に示すように、まず、検索用文字情報に応じたページ内のテキストボックスをすべて取得する（ステップＳ１）。より詳しくは、抽出画像が、図５（ａ）に示すような、文字のみから構成され、各文字列が所定の区画で区切られているようなページである場合に、入力ボックス２１に入力された文字情報、例えば「ＧＣＣ」を含むページ内のすべてのテキストボックス取得する。この結果、同図におけるテキストボックスＢｘ４、Ｂｘ６、Ｂｘ９、Ｂｘ１０を取得する。そして、取得したこれらのテキストボックスを取得した順に領域番号ｉ＝１、２、３、４を付与する。 As shown in FIG. 4, the image aggregating apparatus first acquires all the text boxes in the page corresponding to the search character information (step S1). More specifically, when the extracted image is a page composed of only characters as shown in FIG. 5A and each character string is divided by a predetermined section, the extracted image is input to the input box 21. All text boxes in the page including the character information, for example, “GCC” are acquired. As a result, the text boxes Bx4, Bx6, Bx9, and Bx10 in FIG. Then, region numbers i = 1, 2, 3, and 4 are assigned in the order in which these acquired text boxes are acquired.

画像集約装置は、次いで、取得したテキストボックスが１番目のテキストボックスであるか否かを判定する（ステップＳ２）。ここで、１番目のテキストボックスであるか否かは、取得したテキストボックスを左上から右下にかけて取得していった場合に、最初に取得対象となったテキストボックスを１番目のテキストボックスとする。この結果、図５に示すテキストボックスＢｘ４が１番目のテキストボックスとなる。 Next, the image aggregating apparatus determines whether or not the acquired text box is the first text box (step S2). Here, whether or not it is the first text box is determined based on whether or not the acquired text box is acquired from the upper left to the lower right, and the first text box to be acquired is the first text box. . As a result, the text box Bx4 shown in FIG. 5 becomes the first text box.

画像集約装置は、次いで、取得したテキストボックスが１番目である場合、当該テキストボックスの左上隅を最小座標値（Ｘｍｉｎ、Ｙｍｉｎ）、右下隅を最大座標値（Ｘｍａｘ、Ｙｍａｘ）とする（ステップＳ３）。したがって、１番目のテキストボックスを取得した時点では、Ｘｍｉｎ＝Ｘ_１０、Ｙｍｉｎ＝Ｙ_１０、Ｘｍａｘ＝Ｘ_１１、Ｙｍａｘ＝Ｙ_１１となる。 Next, when the acquired text box is the first, the image aggregating apparatus sets the upper left corner of the text box as the minimum coordinate value (Xmin, Ymin) and the lower right corner as the maximum coordinate value (Xmax, Ymax) (step S3). ). Therefore, when the first text box is acquired, Xmin = X ₁₀ , Ymin = Y ₁₀ , Xmax = X ₁₁ , and Ymax = Y ₁₁ .

画像集約装置は、次いで、取得したテキストボックスが最後のテキストボックスであるか否かを判定する。図５によれば、最後のテキストボックスは領域番号ｉ＝４のテキストボックスＢｘ１０であるため、次の領域番号ｉ＝２として、Ｘ_２０がＸｍｉｎより小さいか否かを判定する（ステップＳ５）。そして、Ｘ_２０がＸｍｉｎより小さい場合には、Ｘｍｉｎを新たにＸ_２０とし（ステップＳ６）、そうでない場合には、ステップＳ６の処理を行わない。 Next, the image aggregating apparatus determines whether or not the acquired text box is the last text box. According to FIG. 5, since the end of the text box is a text box Bx10 area number i = 4, as the next region number i = _{2, X 20} determines whether Xmin smaller (step S5). _{When X 20} is Xmin smaller than, the new _{X 20} and Xmin (step S6), and otherwise does not perform the processing of step S6.

画像集約装置は、同様に、Ｙ_２０がＹｍｉｎより小さいか否かを判定する（ステップＳ７）。そして、Ｙ_２０がＹｍｉｎより小さい場合には、Ｙｍｉｎを新たにＹ_２０とし（ステップＳ８）、そうでない場合には、ステップＳ８の処理を行わない。 Image centralizing device, _{likewise, Y 20} determines whether Ymin is smaller than (step S7). _{When Y 20} is Ymin smaller than, the new _{Y 20} and Ymin (step S8), and otherwise does not perform the processing of step S8.

画像集約装置は、次いで、Ｘ_２１がＸｍａｘより大きいか否かを判定する（ステップＳ９）。そして、Ｘ_２１がＸｍａｘより大きい場合には、Ｘｍａｘを新たにＸ２１とし（ステップＳ１０）、そうでない場合には、ステップＳ１０の処理を行わない。 Image centralizing device, _{then, X 21} determines whether Xmax greater (step S9). _{When X 21} is larger than Xmax is a new X21 to Xmax (step S10), and otherwise does not perform the processing of step S10.

画像集約装置は、同様に、Ｙ_２１がＹｍａｘより大きいか否かを判定する（ステップＳ１１）。そして、Ｙ_２１がＹｍａｘより大きい場合には、Ｙｍａｘを新たにＹ_２１とし（ステップＳ１２）、そうでない場合には、ステップＳ１２の処理を行わない。 Image centralizing device, _{likewise, Y 21} determines whether Ymax larger (step S11). _{When Y 21} is larger than Ymax is the new _{Y 21} to Ymax (step S12), the otherwise does not perform the processing in step S12.

画像集約装置は、ステップＳ４の処理において最後のテキストボックスであると判定した場合、すなわち、本実施形態によればｉ＝４が終了した場合に、後続の処理に移行する。このようにステップＳ５からステップＳ１２の処理を繰り返すことにより、図５においては、Ｂｘ９（ｉ＝３）の左上隅のＸ座標：Ｘ_３０がＸｍｉｎに、Ｂｘ４（ｉ＝１）の右下隅のＸ座標：Ｘ_１１がＸｍａｘに、Ｂｘ４（ｉ＝１）の左上隅のＹ座標：Ｙ_１９がＹｍｉｎに、Ｂｘ９（ｉ＝３）の右下隅のＹ座標：Ｙ_３１がＹｍａｘになる。この結果、座標（Ｘｍｉｎ，Ｙｍｉｎ）と座標（Ｘｍａｘ，Ｙｍａｘ）で構成される矩形の仮想領域ＡＲ内のテキストがテキストレイヤ生成対象となる。この仮想領域ＡＲはレイヤ生成対象として所定の区画を含む最大の矩形となっている。尚、当該仮想領域ＡＲをテキストＧＥＲＭという。 If the image aggregation device determines that it is the last text box in the process of step S4, that is, if i = 4 is completed according to the present embodiment, it proceeds to the subsequent process. By repeating the process in step S12 in this way from the step S5, in Figure 5, X coordinate of the upper left corner of Bx9 (i = _3): the _{X 30} is Xmin, Bx4 (i = 1) X in the lower-right corner of the _coordinates: the _{X 11} is Xmax, Y coordinate of the upper left corner of _{Bx4 (i = 1): Y} 19 is a Ymin, Y coordinates of the lower right corner of _{Bx9 (i = 3): Y} 31 is Ymax. As a result, the text in the rectangular virtual area AR composed of the coordinates (Xmin, Ymin) and the coordinates (Xmax, Ymax) becomes the text layer generation target. This virtual area AR is the largest rectangle including a predetermined section as a layer generation target. The virtual area AR is referred to as text GERM.

画像集約装置は、次いで、ページ内のすべてのテキストボックスを取得して左上から右下にかけて領域番号ｊ＝１，２，・・・を割り当てた上で、ｊ番目のテキストボックスを取得し（ステップＳ１３）、当該テキストボックスの座標がＸｍｉｎ≦Ｘｉ０かつＹｍｉｎ≦Ｙｉ０かつＸｍａｘ≧Ｘｉ１かつＹｍａｘ≧Ｙｉ１か否かを判定する（ステップＳ１４）。そして、これらの判定条件が満たされた場合には、ｊ番目のテキストを取得し、今まで取得したテキストがある場合に当該テキストと結合する（ステップＳ１５）。このような処理の結果、ｊ＝１となるテキストボックスＢｘ１は結合（集約）対象から除外され、ｊ＝３となるテキストボックスＢｘ３が結合対象となる。 Next, the image aggregating apparatus acquires all the text boxes in the page, assigns region numbers j = 1, 2,... From the upper left to the lower right, and then acquires the jth text box (step S13), it is determined whether or not the coordinates of the text box are Xmin ≦ Xi0, Ymin ≦ Yi0, Xmax ≧ Xi1, and Ymax ≧ Yi1 (step S14). If these determination conditions are satisfied, the j-th text is acquired, and if there is text acquired so far, it is combined with the text (step S15). As a result of such processing, the text box Bx1 with j = 1 is excluded from the objects to be combined (aggregated), and the text box Bx3 with j = 3 becomes the object to be combined.

画像集約装置は、ここで、結合したテキストが最後のテキストであるか否かを判定し（ステップＳ１６）、最後のテキストでない場合には、ステップＳ１３からステップＳ１５までの処理を繰り返す。画像集約装置は、最後のテキストを結合し終えると、結合したテキストをデータベースに保存し（ステップＳ１７）、処理を終了する。 Here, the image aggregating apparatus determines whether or not the combined text is the last text (step S16), and if it is not the last text, repeats the processing from step S13 to step S15. When the image aggregating apparatus finishes combining the last text, it stores the combined text in the database (step S17), and ends the process.

このような処理による結果、テキストレイヤでは、図５（ｂ）に示すように、１つの表示レイヤ３１内に結合されたテキストボックス内のテキストがテキストボックスごとに段落に分けられて表示される。そして、他の表示レイヤ３２等には、他の抽出画像で結合されたテキストボックス内のテキストが表示される。これにより文章全体の把握が容易になる。 As a result of such processing, in the text layer, as shown in FIG. 5B, the text in the text box combined in one display layer 31 is displayed divided into paragraphs for each text box. Then, the text in the text box combined with another extracted image is displayed on the other display layer 32 or the like. This makes it easy to grasp the entire sentence.

また、図６（ａ）に示すように、仮想領域ＡＲで囲わずに、取得したテキストボックスだけで構成されるテキストボックス内のテキストを結合対象として含めるようにしてもよい。同図によれば、上述したテキストボックスＢｘ４、６、９、１０に含まれるテキストをテキストレイヤ内の表示レイヤ３１に表示させるようにしてもよい。これにより、携帯電話やＰＤＡ等の情報携帯端末で制限された表示領域でより多くの情報を表示できる。 Further, as shown in FIG. 6A, the text in the text box composed only of the acquired text box may be included as a combination target without being surrounded by the virtual area AR. According to the figure, the text included in the text boxes Bx4, 6, 9, and 10 described above may be displayed on the display layer 31 in the text layer. As a result, more information can be displayed in a display area limited by an information portable terminal such as a cellular phone or a PDA.

尚、本実施形態においては、抽出画像（ページ）に検索用文字情報を含むテキストボックスが複数あったため、仮想領域ＡＲをテキストＧＥＲＭとしたが、テキストボックスが１つである場合には、当該テキストボックスをテキストＧＥＲＭとしてもよい。また、検索用文字情報を含まない抽出画像である場合には、フォントが一番大きいテキストボックス、上部に配置されるテキストボックス、中央に配置されるテキストボックス、他のページに含まれないテキストボックス等をテキストＧＥＲＭとしてもよい。 In this embodiment, since there are a plurality of text boxes including search character information in the extracted image (page), the virtual area AR is set as the text GERM. However, if there is only one text box, the text The box may be the text GERM. Also, if the extracted image does not include search character information, the text box with the largest font, the text box placed at the top, the text box placed in the center, and the text box not included on other pages Etc. may be the text GERM.

図７は本実施形態に係る表示画像と比較例に係る表示画像とを説明するための図である。本実施形態に係る表示画像は、図７（ａ）に示すように、上述したテキストレイヤ３３がＷｅｂブラウザ２０等に表示される。一方、比較例に係る表示画像は、図７（ｂ）に示すように、検索用文字情報に基づいて抽出された抽出画像が縮小されてＷｅｂブラウザ２０等に表示される。このように、比較例において抽出画像が縮小され、その結果抽出画像内の文字の可読性が劣るという事象が、本発明の実施形態に係る画像集約装置により抑制される。 FIG. 7 is a diagram for explaining a display image according to the present embodiment and a display image according to a comparative example. In the display image according to the present embodiment, as shown in FIG. 7A, the text layer 33 described above is displayed on the Web browser 20 or the like. On the other hand, as shown in FIG. 7B, the display image according to the comparative example is displayed on the Web browser 20 or the like by reducing the extracted image extracted based on the search character information. Thus, the phenomenon that the extracted image is reduced in the comparative example, and as a result, the readability of the characters in the extracted image is deteriorated is suppressed by the image aggregation device according to the embodiment of the present invention.

（第２実施形態）
続いて、本発明の第２実施形態について図面を参照して説明する。
図８は画像集約装置の動作の一例を示すフローチャート、図９は比較用の表示結果の一例である。 (Second Embodiment)
Next, a second embodiment of the present invention will be described with reference to the drawings.
FIG. 8 is a flowchart showing an example of the operation of the image aggregation device, and FIG. 9 is an example of a display result for comparison.

画像集約装置は、図８に示すように、まず、検索用文字情報に基づいてインターネットや社内ＬＡＮ等から検索処理を行い（ステップＳ２１）、検索用文字情報に応じた抽出画像を取得する（ステップＳ２２）。 As shown in FIG. 8, the image aggregating apparatus first performs a search process from the Internet, an in-house LAN, or the like based on the search character information (step S21), and acquires an extracted image corresponding to the search character information (step S21). S22).

画像集約装置は、次いで、抽出画像がテキスト（文字列）のみから構成されるか否かを判定する（ステップＳ２３）。画像集約装置は、抽出画像がテキストのみから構成されると判定した場合には、第１実施形態で説明したように、ページ順にページデータを取得し（ステップＳ２４）、テキストボックスに含まれるテキストを取得し（ステップＳ２５）、当該テキストを最後のテキストまで表示レイヤに追加していき（ステップＳ２６、Ｓ２７）、テキストの追加が終了すると表示装置等にテキストレイヤを表示させる（ステップＳ２８）。 Next, the image aggregating apparatus determines whether or not the extracted image is composed only of text (character string) (step S23). If it is determined that the extracted image is composed only of text, the image aggregating apparatus acquires page data in the order of pages as described in the first embodiment (step S24), and the text included in the text box is acquired. It is acquired (step S25), the text is added to the display layer up to the last text (steps S26 and S27), and when the addition of the text is completed, the text layer is displayed on the display device or the like (step S28).

一方、画像集約装置は、抽出画像がテキストのみから構成されていないと判定した場合には、次いで、抽出画像が画像情報（図柄）のみから構成されるか否かを判定する（ステップＳ２９）。画像集約装置は、抽出画像がテキストのみから構成されると判定した場合には、検索結果に基づくすべての抽出画像を取得し（ステップＳ３０）、これらの画像からＳＧ作成を行い（ステップＳ３１）、図９に示すように、表示装置等にステンドガラス状の画像（集約画像）を表示する（ステップＳ３２）。 On the other hand, if it is determined that the extracted image is not composed only of text, the image aggregating apparatus then determines whether or not the extracted image is composed only of image information (design) (step S29). If it is determined that the extracted image is composed only of text, the image aggregating apparatus acquires all the extracted images based on the search results (step S30), creates an SG from these images (step S31), As shown in FIG. 9, a stained glass-like image (aggregated image) is displayed on a display device or the like (step S32).

一方、画像集約装置は、抽出画像が画像情報のみから構成されていないと判定した場合、すなわち、テキスト情報と画像情報との混在であると判定した場合には、ページ順にページデータを取得する（ステップＳ３３）。そして、取得したページデータが画像ＧＥＲＭか否かを判定する（ステップＳ３４）。 On the other hand, if the image aggregating apparatus determines that the extracted image is not composed only of image information, that is, determines that the extracted image is a mixture of text information and image information, the image aggregating apparatus acquires page data in page order ( Step S33). Then, it is determined whether or not the acquired page data is an image GERM (step S34).

画像集約装置は、ページデータが画像ＧＥＲＭでないと判定した場合、すなわちテキストＧＥＲＭであると判定した場合には、当該テキストＧＥＲＭに含まれるテキストを取得する。ここで、取得したページデータが画像ＧＥＲＭであるか否かは、画像を示す面積をＰ、１ページ全体の画像面積Ａ、テキストを示す面積をＴとした場合に、（Ｐ／Ａ）＞（１／３）又はＰ≧Ｔを満たす場合に画像ＧＥＲＭと判定され、そうでない場合には、テキストＧＥＲＭと判定される。尚、画像を示す面積Ｐは、図１０（ａ）に示す数式で表され、テキストを示す面積Ｔは、図１０（ｂ）に示す数式で表される。また、ａｂｓは絶対値を示す記号である。 When it is determined that the page data is not the image GERM, that is, when it is determined that the page data is the text GERM, the image aggregation device acquires the text included in the text GERM. Here, whether or not the acquired page data is the image GERM is determined by assuming that the area indicating the image is P, the image area A of the entire page is T, and the area indicating the text is T, (P / A)> ( 1/3) or when P ≧ T is satisfied, the image is determined as GERM. Otherwise, it is determined as text GERM. The area P indicating the image is represented by the mathematical formula shown in FIG. 10A, and the area T representing the text is represented by the mathematical formula shown in FIG. Abs is a symbol indicating an absolute value.

画像集約装置は、ページデータがテキストＧＥＲＭであると判定した場合には、テキストＧＥＲＭに含まれるテキストを取得し（ステップＳ３５）、当該テキストを表示レイヤに追加していく（ステップＳ３６）。そして、画像集約装置は、当該ページデータが最後のページデータであったか否かを判定する（ステップＳ３７）。 When determining that the page data is the text GERM, the image aggregating apparatus acquires the text included in the text GERM (step S35) and adds the text to the display layer (step S36). Then, the image aggregating apparatus determines whether or not the page data is the last page data (step S37).

一方、画像集約装置は、ページデータが画像ＧＥＲＭであると判定した場合には、当該ページデータを画像ページとして一時保存し（ステップＳ３８）、表示レイヤに画像のサムネイルを追加する（ステップＳ３９）。 On the other hand, when it is determined that the page data is the image GERM, the image aggregating apparatus temporarily stores the page data as an image page (step S38), and adds an image thumbnail to the display layer (step S39).

画像集約装置は、取得したページデータのすべてに対しステップＳ３５からステップＳ３９の処理を終了すると、画像ページからＳＧ作成処理を行い（ステップＳ４０）、テキストレイヤを表示して（ステップＳ４１）、処理を終了する。 When the image aggregating apparatus completes the processing from step S35 to step S39 for all the acquired page data, it performs SG creation processing from the image page (step S40), displays the text layer (step S41), and performs processing. finish.

この結果、図１１に示すように、検索用文字情報「ＧＣＣ」に基づく集約画像３６がＷｅｂブラウザ２０等に表示されるとともに、併せて、テキストレイヤ３３が表示され、当該テキストレイヤ３３には、表示レイヤ３１内に集約画像３６を構成する画像のサムネイル画像と、構成する画像が備える文字列が表示される。一方、図１２に示すように、抽出画像２５をＷｅｂブラウザ２０等に縮小して表示させたままでは集約画像３６を構成する画像に含まれる文字の可読性は望ましくない。 As a result, as shown in FIG. 11, the aggregated image 36 based on the search character information “GCC” is displayed on the web browser 20 or the like, and the text layer 33 is also displayed. In the display layer 31, thumbnail images of images constituting the aggregated image 36 and character strings included in the constituting images are displayed. On the other hand, as shown in FIG. 12, if the extracted image 25 is reduced and displayed on the Web browser 20 or the like, the readability of the characters included in the images constituting the aggregated image 36 is not desirable.

また、図１３に示すように、集約画像３６を構成する一の画像を指示画像としてのポインタＰｔで指示すると、集約画像の元となった画像に含まれる文字がテキストレイヤ３３に拡大して表示される。このように、集約元の画像に含まれる文字が集約されたことで分断されたり、縮小されたりすることとなっても、集約画像３６に含まれる文字の可読性が落ちることを抑制する。 Further, as shown in FIG. 13, when one image constituting the aggregated image 36 is designated by the pointer Pt as the instruction image, characters included in the image that is the origin of the aggregated image are enlarged and displayed on the text layer 33. Is done. In this way, even if the characters included in the aggregation source image are segmented or reduced due to aggregation, the readability of the characters included in the aggregated image 36 is suppressed from decreasing.

尚、同図においてはテキストレイヤ３３と集約画像３６との重なる領域に関しては、テキストレイヤ３３の表示レイヤ３１，３２を不透明にすることで表示レイヤ３１，３２内のテキストの読み易くしているが、テキストレイヤ３３内の表示レイヤ３１，３２を透明にし、テキストレイヤ３３の下の画像をテキストレイヤ３３の上から視認できるようにしてもよく、さらに、その透明度が設定されるようにしてもよい。これによりテキストレイヤ３３により隠れる画像の視認性が向上する。 In the figure, regarding the region where the text layer 33 and the consolidated image 36 overlap, the display layers 31 and 32 of the text layer 33 are made opaque so that the text in the display layers 31 and 32 can be easily read. The display layers 31 and 32 in the text layer 33 may be transparent so that the image below the text layer 33 can be viewed from above the text layer 33, and the transparency thereof may be set. . Thereby, the visibility of the image hidden by the text layer 33 is improved.

（その他の実施形態）
以下において上述したテキストＧＥＲＭと画像ＧＥＲＭとが混在した場合における表示例について図面を参照して説明する。
図１４は画像集約装置における抽出画像の一例、図１５及び図１６は集約画像における表示レイヤの表示例、図１７、図１８及び図１９は集約画像における表示レイヤの他の表示例、図２０は画像集約装置における他の抽出画像の一例及び集約画像における表示レイヤの表示例である。 (Other embodiments)
A display example in the case where the above-described text GERM and image GERM are mixed will be described below with reference to the drawings.
14 is an example of an extracted image in the image aggregation device, FIGS. 15 and 16 are examples of display layer display in the aggregate image, FIGS. 17, 18 and 19 are other examples of display layers in the aggregate image, and FIG. It is an example of another extracted image in an image aggregation device, and a display example of a display layer in an aggregate image.

画像集約装置は、抽出画像の一例として図１４に示すような画像を取得し、他に抽出した画像ととともに図１５や図１６に示すような集約画像を生成して表示装置等に表示させた場合、それぞれの図に示すように、集約後における画像内であって、文字列が記載された文字列領域内の文字列をポインタＰｔで触れると、当該文字列を拡大して表示させる。尚、拡大された文字列は集約画像上であっても集約画像上以外の場所であってもよい。 The image aggregating apparatus acquires an image as shown in FIG. 14 as an example of the extracted image, and generates an aggregated image as shown in FIGS. 15 and 16 together with the other extracted images and displays the aggregated image on a display device or the like. In each case, as shown in each figure, when the character string in the character string area in which the character strings are described is touched with the pointer Pt in the image after aggregation, the character strings are enlarged and displayed. The enlarged character string may be on the aggregated image or at a place other than on the aggregated image.

また、画像集約装置は、抽出画像の一例として図１４に示すような画像を取得し、他に抽出した画像ととともに図１７、図１８や図１９に示すような集約画像を生成して表示装置等に表示させた場合、それぞれの図に示すように、文字列以外の指示位置にポインタＰｔで触れることより表示する文字列を変化させてもよい。 Further, the image aggregating apparatus acquires an image as shown in FIG. 14 as an example of the extracted image, generates an aggregated image as shown in FIGS. 17, 18, and 19 together with the other extracted images, and displays it. As shown in each figure, the character string to be displayed may be changed by touching the designated position other than the character string with the pointer Pt.

この場合、表示させる文字列は図１４に示す全体ページのレイアウトに対応させてもよく、例えば図１７であれば、指示位置に最も近い「住宅所有の関係別割合全国（平成１５年）」と表示させることができる。このような表示制御は、切り出し領域中心（図１７等では真ん中の画像の中心）から見て、上部に指示位置がある場合は、切り出し画像の物理的上部に配置される文字列を表示させてもよい。 In this case, the character string to be displayed may correspond to the layout of the entire page shown in FIG. 14. For example, in FIG. 17, “the ratio of home ownership by country (2003)” closest to the indicated position. Can be displayed. Such display control is performed by displaying a character string arranged at the physical upper part of the clipped image when the designated position is at the upper part as viewed from the center of the clipped area (the center of the middle image in FIG. 17 and the like). Also good.

また、画像集約装置は、抽出画像の一例として図２０（ａ）に示すような画像を取得し、他に抽出した画像ととともに図２０（ｂ）に示すような集約画像を生成して表示装置等に表示させた場合、同図に示すように指示位置に最も近い文字列を表示させてもよい。このような場合においても抽出画像２５の中心からその距離を基準にして最も近い文字列をテキストレイヤ３３に表示させてもよい。 Further, the image aggregating apparatus acquires an image as shown in FIG. 20A as an example of the extracted image, generates an aggregated image as shown in FIG. 20B together with the other extracted images, and displays it. For example, the character string closest to the designated position may be displayed as shown in FIG. Even in such a case, the closest character string based on the distance from the center of the extracted image 25 may be displayed on the text layer 33.

以上、本発明の好ましい実施形態について詳述したが、本発明に係る特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。例えば、本発明のプログラムを通信手段により提供することはもちろん、ＣＤ−ＲＯＭ等の記録媒体に格納して提供することも可能である。 The preferred embodiments of the present invention have been described in detail above, but the present invention is not limited to the specific embodiments according to the present invention, and various modifications are possible within the scope of the gist of the present invention described in the claims.・ Change is possible. For example, the program of the present invention can be provided not only by communication means but also stored in a recording medium such as a CD-ROM.

本発明によれば、集約画像中の抽出画像内の文字の可読性が劣ることを抑制することができ、産業上の利用可能性が高い。 ADVANTAGE OF THE INVENTION According to this invention, it can suppress that the readability of the character in the extraction image in an aggregate image is inferior, and industrial applicability is high.

画像集約装置の構成図である。It is a block diagram of an image aggregation apparatus. 画像集約装置のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of an image aggregation apparatus. 検索画面の一例である。It is an example of a search screen. 画像集約装置の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of an image aggregation apparatus. 実施形態に係る抽出画像と表示結果を説明するための図である。It is a figure for demonstrating the extraction image and display result which concern on embodiment. 実施形態に係る抽出画像と表示結果を説明するための他の図である。It is another figure for demonstrating the extraction image and display result which concern on embodiment. 実施形態に係る表示画像と比較例に係る表示画像の一例である。It is an example of the display image which concerns on embodiment, and the display image which concerns on a comparative example. 画像集約装置の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of an image aggregation apparatus. 集約画像の一例である。It is an example of an aggregate image. 画像ＧＥＲＭとテキストＧＥＲＭの面積を算出するための数式である。It is a mathematical formula for calculating the area of the image GERM and the text GERM. 実施形態に係る表示結果の一例である。It is an example of the display result which concerns on embodiment. 比較例に係る表示結果の一例である。It is an example of the display result concerning a comparative example. 実施形態に係る表示結果の他の一例である。It is another example of the display result which concerns on embodiment. 抽出画像の一例である。It is an example of an extracted image. 集約画像の一例である。It is an example of an aggregate image. 集約画像の一例である。It is an example of an aggregate image. 集約画像の一例である。It is an example of an aggregate image. 集約画像の一例である。It is an example of an aggregate image. 集約画像の一例である。It is an example of an aggregate image. 抽出画像と集約画像の一例である。It is an example of an extracted image and an aggregated image.

Explanation of symbols

１０ａＣＰＵ
１０ｂＲＡＭ
１０ｃＲＯＭ
１０ｄＩ／Ｆ
１０ｅバス
１１検索部
１２結果分析部
１３データ管理部
１４データベース
１５テキスト生成部
１６ＳＧ生成部
１７表示制御部
２０Ｗｅｂブラウザ
２１入力ボックス
２５抽出画像
３１、３２表示レイヤ
３３テキストレイヤ
３６集約画像
ＡＲ仮想領域
Ｐｔポインタ
Ｂｔ検索ボタン 10a CPU
10b RAM
10c ROM
10d I / F
10e Bus 11 Search unit 12 Result analysis unit 13 Data management unit 14 Database 15 Text generation unit 16 SG generation unit 17 Display control unit 20 Web browser 21 Input box 25 Extracted images 31, 32 Display layer 33 Text layer 36 Aggregated image AR Virtual region Pt Pointer Bt Search button

Claims

Extraction means for extracting a plurality of pieces of image information related to the search character information as extracted images based on the search character information input by the input device;
When at least one of the plurality of extracted images extracted by the extracting means is an extracted image including a character string and a design, and the area indicating the character string is larger than the area indicating the design, the extracted image Determining means for determining that is composed of a character string;
Generating a character string display area for enlarging and displaying the character string according to the number of the extracted images, and generating an aggregated image from the extracted extracted images based on a region of interest process Means,
The aggregated image generated by the generating unit is displayed, and the extracted image determined by the determining unit to be composed of the character string in each of the character string display areas generated by the generating unit, and the extracted image Display control means for displaying the character string included in each extracted image;
An image aggregating apparatus.

Computer
Extraction means for extracting a plurality of image information related to the search character information as extracted images based on the search character information input by the input device;
When at least one of the plurality of extracted images extracted by the extracting means is an extracted image including a character string and a design, and the area indicating the character string is larger than the area indicating the design, the extracted image Determining means for determining that is composed of a character string;
Generating a character string display area for enlarging and displaying the character string according to the number of the extracted images, and generating an aggregated image from the extracted extracted images based on a region of interest process means,
The aggregated image generated by the generating unit is displayed, and the extracted image determined by the determining unit to be composed of the character string in each of the character string display areas generated by the generating unit, and the extracted image Display control means for displaying a character string included in each extracted image,
Image aggregation program to function as