JP2007018174A

JP2007018174A - Information processing unit and control method therefor, computer program, and storage medium

Info

Publication number: JP2007018174A
Application number: JP2005197822A
Authority: JP
Inventors: Masahiro Matsushita; 昌弘松下
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2005-07-06
Filing date: 2005-07-06
Publication date: 2007-01-25

Abstract

<P>PROBLEM TO BE SOLVED: To provide a technique for discriminating an image data type, and a high-speed image retrieval technique using the above discrimination technique. <P>SOLUTION: An information processing unit includes an acquisition means for obtaining image data constituted of one or more tiles, each at least including a header having coding information in regard to the coding of the tile concerned, and a discrimination means for discriminating the image type of each tile based on the above coding information. The above tile is divided on the basis of a plurality of frequency components. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は画像データの種類を高速に判別する技術および当該技術を利用した高速な画像検索の技術に関する。 The present invention relates to a technique for quickly identifying the type of image data and a technique for high-speed image retrieval using the technique.

近年のコンピュータの高性能化、ハードディスクドライブ（ＨＤＤ）等の記憶装置やメモリの大容量化、デジタルカメラ、デジタルビデオ、スキャナ、プリンタ、複合機（ＭＦＰ）、デジタル複写機といった画像を入出力する装置の普及および高性能化に伴い、デジタル画像を扱う機会が多くなっている。また、高速インターネット回線の普及により、それらの画像を送受信することも稀ではなくなっている。ＨＤＤを備えるＭＦＰやデジタル複写機、コンピュータ等は、画像サーバとして使用されていることもある。 Recent high-performance computers, storage devices such as hard disk drives (HDDs) and large-capacity memories, digital cameras, digital videos, scanners, printers, multifunction peripherals (MFPs), digital copiers, etc. With the popularization and high performance of digital cameras, opportunities to handle digital images are increasing. Also, with the widespread use of high-speed Internet lines, it is not uncommon to send and receive these images. An MFP, a digital copying machine, a computer, or the like equipped with an HDD may be used as an image server.

そのような背景により、扱っているデジタル画像は、高精細化かつ大量化の一途をたどっている。高精細化により、１つ１つのデータサイズは大きくなる傾向にあり、ＪＰＥＧやＪＰＥＧ２０００といった高画質を保ったまま高い圧縮率を備えた画像圧縮伸張アルゴリズムが必須となり、標準化されている。また、大量化に伴う問題を解決するために、画像検索アルゴリズムが提案されている。 With such a background, the digital images that are handled are becoming increasingly high-definition and large-scale. As data definition increases, each data size tends to increase, and an image compression / decompression algorithm such as JPEG or JPEG2000 that has a high compression ratio while maintaining high image quality is essential and standardized. In addition, an image search algorithm has been proposed in order to solve the problems associated with the increase in volume.

また、特許文献１では、あらかじめ画像データを解析して画像特徴量を抽出しておき、画像特徴量の配置に基づいて特徴量同士を比較することにより、高速に類似画像の検索を行う構成が開示されている。 Japanese Patent Laid-Open No. 2004-133867 has a configuration in which image data is extracted in advance by analyzing image data, and similar images are searched at high speed by comparing the feature amounts based on the arrangement of the image feature amounts. It is disclosed.

また、画像検索を行う際に、画像の種類の判別を行う手法が知られている。特許文献２では、画像データのヒストグラムを作成し、その頻度の分布に偏りがある場合には２値画像と判別し、偏りが無い場合には自然画像と判別する手法が開示されている。また、特許文献３では、注目画素とその近傍の周辺画素との相関が強い場合にはコンピュータグラフィックス画像と判別し、注目画素と周辺画素との相関が弱い場合には自然画像と判別する手法が開示されている。また、特許文献４では、画像データの色数が１つの場合には単色画像と判別し、色数が少ない場合にはパレット画像と判別し、色数が多い場合には自然画像と判別する手法が開示されている。
特開平１１−２８８４１８号公報特開２０００−２２２５６４号公報特開２００２−１５３２７号公報特開平１１−８８７００号公報 Also, a technique for determining the type of image when performing an image search is known. Patent Document 2 discloses a method of creating a histogram of image data, discriminating a binary image when the frequency distribution is biased, and discriminating a natural image when there is no bias. Further, in Patent Document 3, when the correlation between the target pixel and the neighboring pixels in the vicinity thereof is strong, it is determined as a computer graphics image, and when the correlation between the target pixel and the peripheral pixels is weak, it is determined as a natural image. Is disclosed. In Patent Document 4, when the number of colors of image data is 1, it is determined as a single color image, when the number of colors is small, it is determined as a palette image, and when the number of colors is large, it is determined as a natural image. Is disclosed.
Japanese Patent Laid-Open No. 11-288418 JP 2000-222564 A JP 2002-15327 A JP-A-11-88700

しかしながら、上記特許文献１に開示された構成においては、あらかじめ検索対象の画像データから画像特徴量を抽出しておかなければならないため、事前処理や初回の検索に時間がかかるという問題があった。また、画像データベース管理外において、画像ファイルに対して、移動、削除、追加、編集等の操作を行った場合、その画像データについて再び画像特徴量を抽出し、事前に登録された画像特徴量を更新しなければならなかった。 However, the configuration disclosed in Patent Document 1 has a problem in that it takes time for preprocessing and initial search because image feature amounts must be extracted from image data to be searched in advance. In addition, when an operation such as movement, deletion, addition, or editing is performed on an image file outside the management of the image database, the image feature amount is extracted again for the image data, and the pre-registered image feature amount is obtained. Had to be updated.

また、上記特許文献２乃至４に開示された画像判別の手法では、いずれも画像の画素値を読み込み、解析して、画像の種類を判別していた。そのため、画像データを読み込むためのメモリを要し、かつ、画像データを解析するための時間を要していた。 Further, in the image discrimination methods disclosed in Patent Documents 2 to 4, all of the image pixel values are read and analyzed to determine the type of the image. Therefore, a memory for reading the image data is required, and a time for analyzing the image data is required.

本発明は上記の問題点に鑑みてなされたものであり、画像データの種類を判別する技術、及び、当該技術を利用した高速な画像検索の技術を提供することを目的とする。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a technique for discriminating the type of image data and a technique for high-speed image search using the technique.

本発明によれば、
１以上のタイルから構成された画像データを取得する取得手段であって、該タイルは当該タイルの符号化についての符号化情報を含むヘッダを少なくとも含む、取得手段と、
前記符号化情報に基づいて前記タイル毎の画像種類を判別する判別手段と、
を備え、
前記タイルは複数の周波数成分毎に分割されていることを特徴とする情報処理装置が提供される。 According to the present invention,
Acquisition means for acquiring image data composed of one or more tiles, wherein the tile includes at least a header including encoding information about encoding of the tile;
Discrimination means for discriminating the image type for each tile based on the encoding information;
With
An information processing apparatus is provided in which the tile is divided into a plurality of frequency components.

本発明によれば、画像データの種類を判別する技術、及び、当該技術を利用した高速な画像検索の技術を提供することができる。 ADVANTAGE OF THE INVENTION According to this invention, the technique which discriminate | determines the kind of image data, and the technique of the high-speed image search using the said technique can be provided.

以下、添付図面を参照して本発明に係る実施の形態を詳細に説明する。ただし、この実施の形態に記載されている構成要素はあくまでも例示であり、本発明の範囲をそれらのみに限定する趣旨のものではない。 Embodiments according to the present invention will be described below in detail with reference to the accompanying drawings. However, the constituent elements described in this embodiment are merely examples, and are not intended to limit the scope of the present invention only to them.

〔ＪＰＥＧ２０００フォーマットについて〕
本実施の形態は、階層化アルゴリズムにしたがって圧縮符号化された画像ファイルフォーマットの一つとして、ＪＰＥＧ２０００アルゴリズムを利用するものである。このため、まず、ＪＰＥＧ２０００フォーマットについて説明する。ＪＰＥＧ２０００アルゴリズムは各種文献により周知のものであるため、ここでは、本実施形態に関係のある部分の概要についてのみ説明する。 [About JPEG2000 format]
In the present embodiment, the JPEG2000 algorithm is used as one of the image file formats compressed and encoded according to the hierarchization algorithm. Therefore, first, the JPEG2000 format will be described. Since the JPEG2000 algorithm is well known from various documents, only the outline of the portion related to the present embodiment will be described here.

画像をＪＰＥＧ２０００フォーマットで保存する際は、タイルと呼ばれる同サイズの長方形に画像を分割できる。図３は、ＪＰＥＧ２０００画像のタイル分割を模式的に示した図である。 When saving an image in the JPEG2000 format, the image can be divided into rectangles of the same size called tiles. FIG. 3 is a diagram schematically showing tile division of a JPEG2000 image.

それぞれのタイルについて、図４に示すように、色変換、離散ウェーブレット変換、量子化、エントロピー符号化、ビットストリーム形成の順で圧縮符号化が行われる。図４は、ＪＰＥＧ２０００アルゴリズムに係るエンコード処理手順を示した図である。 For each tile, as shown in FIG. 4, compression coding is performed in the order of color transformation, discrete wavelet transformation, quantization, entropy coding, and bitstream formation. FIG. 4 is a diagram showing an encoding process procedure according to the JPEG2000 algorithm.

図４において、４０１の色変換は、Ｒ（赤）、Ｇ（緑）、Ｂ（青）からなるＲＧＢ等の表色系から、ＹＣｂＣｒ等の表色系へコンポーネントの変換を行う処理である。 In FIG. 4, the color conversion 401 is a process of converting a component from a color system such as RGB composed of R (red), G (green), and B (blue) to a color system such as YCbCr.

４０２の離散ウェーブレット変換は、４０１において色変換が行われた画像を、離散ウェーブレット変換を用いて周波数空間に分解する処理である。図５は、ＪＰＥＧ２０００方式に係る離散ウェーブレット変換を示した模式図である。（ａ）のタイル画像（０ＬＬ）に対して、離散ウェーブレット変換を施し、サブバンド１ＬＬ、１ＨＬ、１ＬＨ、１ＨＨに分解する（ｂ）。引き続き、低周波成分１ＬＬに対し、離散ウェーブレット変換を施し、サブバンド２ＬＬ、２ＨＬ、２ＬＨ、２ＨＨに分解する（ｃ）。本例では、２レベルの変換を行っているが、離散ウェーブレット変換の回数は特に制限されるものではない。 The discrete wavelet transform 402 is a process for decomposing the image subjected to color transformation in 401 into a frequency space using the discrete wavelet transform. FIG. 5 is a schematic diagram showing discrete wavelet transform according to the JPEG2000 system. The tile image (0LL) in (a) is subjected to discrete wavelet transform and decomposed into subbands 1LL, 1HL, 1LH, and 1HH (b). Subsequently, a discrete wavelet transform is applied to the low frequency component 1LL to decompose it into subbands 2LL, 2HL, 2LH, and 2HH (c). In this example, two-level transformation is performed, but the number of discrete wavelet transformations is not particularly limited.

４０３の量子化は、離散ウェーブレット変換の変換係数を線形量子化する処理である。 The quantization of 403 is a process of linearly quantizing the transform coefficient of the discrete wavelet transform.

４０４のエントロピー符号化は、図６に示すように、プレシンクト分割、コードブロック分割、ビットプレーン分割、コーディングパスへの分割、二値算術符号化の順で符号化を行う処理である。図６は、ＪＰＥＧ２０００方式に係るエントロピー符号化手順を示した図である。 As shown in FIG. 6, the entropy encoding 404 is a process of performing encoding in the order of precinct division, code block division, bit plane division, division into coding paths, and binary arithmetic coding. FIG. 6 is a diagram showing an entropy encoding procedure according to the JPEG2000 system.

図６に置いて、６０１のプレシンクト分割は、２ＨＬ、２ＬＨといったサブバンドの係数を、プレシンクトと呼ばれる領域に分割する部分である。図７はＪＰＥＧ２０００方式に係るプレシンクト分割を模式的に表した図である。図７において、７０１の部分は、元の画像において同領域の部分を周波数変換したものであり、これらの部分は同じプレシンクトに属しているという。 In FIG. 6, the precinct division 601 is a portion that divides the subband coefficients such as 2HL and 2LH into areas called precincts. FIG. 7 is a diagram schematically showing precinct division according to the JPEG2000 system. In FIG. 7, a portion 701 is obtained by frequency-converting the portion of the same region in the original image, and these portions belong to the same precinct.

６０２のコードブロック分割は、プレシンクトをコードブロックと呼ばれるさらに小さな領域に分割する処理である。図８はＪＰＥＧ２０００方式に係るコードブロック分割を表した模式図である。このコードブロック単位がエントロピー符号化を行う際の基本単位である。 The code block division 602 is a process of dividing the precinct into smaller areas called code blocks. FIG. 8 is a schematic diagram showing code block division according to the JPEG2000 system. This code block unit is a basic unit when entropy coding is performed.

６０３のビットプレーン分割は、各コードブロックについて、線形量子化された離散ウェーブレット変換の変換係数をビットごとに展開する処理である。図９はＪＰＥＧ２０００方式に係るビットプレーン分割を表した模式図である。 The bit plane division of 603 is a process of developing the linearly quantized discrete wavelet transform coefficients for each code block for each bit. FIG. 9 is a schematic diagram showing bit plane division according to the JPEG2000 system.

図９において、９０１は、あるコードブロックにおける、線形量子化された離散ウェーブレット変換の変換係数を例示したものである。９０２は、変換係数９０１の符号を表すビット列であり、値０は正値、値１は負値を意味する。９０３は、変換係数９０１の絶対値をＭＳＢ（Most Significant Bit）からＬＳＢ（Least Significant Bit）に２値展開したビットプレーンである。 In FIG. 9, reference numeral 901 exemplifies transform coefficients of the linear quantized discrete wavelet transform in a certain code block. Reference numeral 902 denotes a bit string representing the sign of the transform coefficient 901. A value 0 means a positive value and a value 1 means a negative value. Reference numeral 903 denotes a bit plane in which the absolute value of the transform coefficient 901 is binary-developed from MSB (Most Significant Bit) to LSB (Least Significant Bit).

例えば、値が＋１２の変換係数（９０４）は正値であるため、対応する符号ビットは０（９０５）である。また、＋１２の絶対値１２の２進数表現は（１１００）であるため、ビットプレーンの対応する箇所９０６ａ乃至９０６ｄの値はそれぞれ「１」「１」「０」「０」となる。同様に、値が−６の変換係数（９０７）は負値であるため、対応する符号ビットは１（９０８）である。また、−６の絶対値６の２進数表現は（０１１０）であるため、ビットプレーンの対応する箇所９０９ａ乃至９０９ｄの値はそれぞれ「０」「１」「１」「０」となる。 For example, since the transform coefficient (904) having a value of +12 is a positive value, the corresponding sign bit is 0 (905). Since the binary representation of the absolute value 12 of +12 is (1100), the values of the corresponding portions 906a to 906d of the bit plane are “1”, “1”, “0”, and “0”, respectively. Similarly, since the conversion coefficient (907) having a value of −6 is a negative value, the corresponding sign bit is 1 (908). Also, since the binary representation of the absolute value 6 of −6 is (0110), the values of the corresponding portions 909a to 909d of the bit plane are “0”, “1”, “1”, and “0”, respectively.

ＭＳＢ側ですべて０であるビットプレーンをゼロビットプレーンといい、データは保存されない一方、コードブロック毎に、後述のゼロビットプレーン枚数がカウントされる。 A bit plane that is all zeros on the MSB side is called a zero bit plane, and no data is stored. On the other hand, the number of zero bit planes described later is counted for each code block.

６０４のコーディングパスへの分割は、ビットプレーンをさらにｓｉｇｎｉｇｉｃａｎｃｅｐｒｏｐａｇａｔｉｏｎパスと、ｍａｇｎｉｔｕｄｅｒｅｆｉｎｅｍｅｎｔパスと、ｃｌｅａｎｕｐパスに分割する処理である。図１０はＪＰＥＧ２０００方式に係るコーディングパスへの分割を表した模式図である。 The division into the coding path 604 is a process of further dividing the bit plane into a signature propagation path, a magnesium refinement path, and a cleanup path. FIG. 10 is a schematic diagram showing division into coding passes according to the JPEG2000 system.

図１０のように、各ビットプレーン１００１ａ乃至１００１ｄ（以下、これらをまとめて１００１と称する）は、コーディングパスへの分割により、それぞれｓｉｇｎｉｆｉｃａｎｃｅｐｒｏｐａｇａｔｉｏｎパス１００２ｂ乃至１００２ｄ（以下、これらをまとめて１００２と称する）、ｍａｇｎｉｔｕｄｅｒｅｆｉｎｅｍｅｎｔパス１００３ｂ乃至１００３ｄ（以下、これらをまとめて１００３と称する）、ｃｌｅａｎｕｐパス（１００４ｂ乃至１００４ｄ）のコーディングパスに分割される。ただし、最上位ビット（ＭＳＢ側）のビットプレーン１００１ａは、ｃｌｅａｎｕｐパス（１００４ａ）にのみ対応させる。以下、ｃｌｅａｎｕｐパス１００４ａ乃至１００４ｄをまとめて１００４と称する。 As shown in FIG. 10, each bit plane 1001a to 1001d (hereinafter collectively referred to as 1001) is divided into coding propagation paths 1002b to 1002d (hereinafter collectively referred to as 1002) by dividing into coding paths. , Divided refinement paths 1003b to 1003d (hereinafter collectively referred to as 1003) and cleanup paths (1004b to 1004d). However, the most significant bit (MSB side) bit plane 1001a is made to correspond only to the cleanup path (1004a). Hereinafter, the cleanup paths 1004a to 1004d are collectively referred to as 1004.

各ビットプレーン１００１、及び、各コーディングパス１００２乃至１００４は、すべて縦横方向の座標長によるサイズが等しい。また、各コーディングパス１００２乃至１００４にはビット値が定義された位置と定義されていない位置とが存在する。図１０においては、例えば、１００６、１００７のように、ビット値が定義された位置には網掛け（斜線）が施されている。そして、コーディングパス１００２乃至１００４（例えば、１００２ｂ乃至１００４ｂ）の網掛けの部分に定義されたビット値は、分割前のビットプレーン１００１（例えば１００１ｂ）上の対応する位置におけるビット値と等しい。 Each bit plane 1001 and each coding pass 1002 to 1004 have the same size according to the coordinate length in the vertical and horizontal directions. Each coding pass 1002 to 1004 has a position where a bit value is defined and a position where it is not defined. In FIG. 10, for example, positions where bit values are defined, such as 1006 and 1007, are shaded (shaded). The bit values defined in the shaded portions of the coding paths 1002 to 1004 (for example, 1002b to 1004b) are equal to the bit values at corresponding positions on the bit plane 1001 (for example, 1001b) before the division.

ビットプレーンをコーディングパスに分割する処理は、以下の（処理１）（処理２）に基づいて実行する。 The process of dividing the bit plane into coding passes is executed based on the following (Process 1) and (Process 2).

（処理１）最上位ビット（ＭＳＢ側）のビットプレーンについて、対応するｃｌｅａｎｕｐパスを生成する。最上位ビットのビットプレーンに対応するｃｌｅａｎｕｐパスは、全ての位置において最上位ビットのビットプレーンと同じビット値が定義されている。例えば、ビットプレーン１００１ａについて、全ての位置においてビットプレーン１００１ａと同じビット値が定義されたｃｌｅａｎｕｐパス１００４ａを生成する。 (Processing 1) For the most significant bit (MSB side) bit plane, a corresponding cleanup path is generated. The cleanup path corresponding to the most significant bit plane has the same bit value as the most significant bit plane defined at all positions. For example, for the bit plane 1001a, a cleanup path 1004a in which the same bit value as that of the bit plane 1001a is defined at all positions is generated.

（処理２）２番目に上位のビットプレーンから順に、２番目以降の全てのビットプレーンについて以下の（処理ａ）乃至（処理ｃ）の処理を行う。 (Process 2) The following (Process a) to (Process c) are performed on all the second and subsequent bit planes in order from the second most significant bit plane.

（処理ａ）ｓｉｇｎｉｆｉｃａｎｃｅｐｒｏｐａｇａｔｉｏｎパスの生成
処理対象のビットプレーンよりも上位のビットプレーンのいずれかにおいて、ビット値１が定義されている位置の、周囲に対応する位置にビット値が定義された、ｓｉｇｎｉｆｉｃａｎｃｅｐｒｏｐａｇａｔｉｏｎパスを生成する。例えば、ビットプレーン１００１ｂについて（処理ａ）を行う場合、ビットプレーン１００１ｂよりも上位であるビットプレーン１００１ａにおいて、１００５の位置にビット値１が定義されているため、１００５の周囲に対応する位置１００６にビット値を定義したｓｉｇｎｉｆｉｃａｎｃｅｐｒｏｐａｇａｔｉｏｎパス１００２ｂを生成する。位置１００６に定義されるビット値は、ビットプレーン１００１ｂにおける同じ位置のビット値と等しい。 (Processing a) Generation of a signature propagation path A signature in which a bit value is defined at a position corresponding to the surrounding of a position where a bit value 1 is defined in any of the bit planes higher than the bit plane to be processed Generate a propagation path. For example, when (processing a) is performed on the bit plane 1001b, since the bit value 1 is defined at the position 1005 in the bit plane 1001a that is higher than the bit plane 1001b, the position 1006 corresponding to the periphery of the 1005 A significance propagation path 1002b in which a bit value is defined is generated. The bit value defined at position 1006 is equal to the bit value at the same position in bit plane 1001b.

（処理ｂ）ｍａｇｎｉｔｕｄｅｒｅｆｉｎｅｍｅｎｔパスの生成
処理対象のビットプレーンよりも上位のビットプレーンのいずれかにおいて、ビット値１が定義されている位置に対応する位置にビット値が定義された、ｍａｇｎｉｔｕｄｅｒｅｆｉｎｅｍｅｎｔパスを生成する。例えば、ビットプレーン１００１ｂについて（処理ｂ）を行う場合、ビットプレーン１００１ｂよりも上位であるビットプレーン１００１ａにおいて、１００５の位置にビット値１が定義されているため、１００５に対応する位置１００７にビット値を定義したｍａｇｎｉｔｕｄｅｒｅｆｉｎｅｍｅｎｔパス１００３ｂを生成する。位置１００７に定義されるビット値は、ビットプレーン１００１ｂにおける同じ位置のビット値と等しい。 (Processing b) Generation of a magnesium refinement path A magnesium refinement path in which a bit value is defined at a position corresponding to a position where a bit value 1 is defined in any bit plane higher than the bit plane to be processed. Generate. For example, when (processing b) is performed for the bit plane 1001b, since the bit value 1 is defined at the position 1005 in the bit plane 1001a that is higher than the bit plane 1001b, the bit value is set at the position 1007 corresponding to 1005. Generate a definition refinement path 1003b. The bit value defined at the position 1007 is equal to the bit value at the same position in the bit plane 1001b.

（処理ｃ）ｃｌｅａｎｕｐパスの生成
ｓｉｇｎｉｆｉｃａｎｃｅｐｒｏｐａｇａｔｉｏｎパス、及び、ｍａｇｎｉｔｕｄｅｒｅｆｉｎｅｍｅｎｔパスにおいて、ビット値が定義された位置以外の全ての位置にビット値が定義された、ｃｌｅａｎｕｐパスを生成する。例えば、ビットプレーン１００１ｂについて（処理ｃ）を行う場合、１００２ｂにおいてビット値が定義された位置１００６、及び、１００３ｂにおいてビット値が定義された位置１００７以外の全ての位置にビット値が定義された、ｃｌｅａｎｕｐパス１００４ｂを生成する。ｃｌｅａｎｕｐパス１００４ｂにおいて定義されるビット値は、ビットプレーン１００１ｂにおける同じ位置のビット値と等しい。 (Processing c) Generation of a cleanup path A cleanup path in which bit values are defined at all positions other than the positions where the bit values are defined is generated in the signature propagation path and the magnesium refinement path. For example, when performing (processing c) on the bit plane 1001b, bit values are defined at all positions other than the position 1006 where the bit value is defined at 1002b and the position 1007 where the bit value is defined at 1003b. A cleanup path 1004b is generated. The bit value defined in the cleanup path 1004b is equal to the bit value at the same position in the bit plane 1001b.

図１０の例において、ビットプレーン１００１ｃ、１００１ｄも（処理２）を繰り返すことによってコーディングパスに分割する。 In the example of FIG. 10, the bit planes 1001c and 1001d are also divided into coding passes by repeating (Process 2).

コーディングパスは１個以上のレイヤに分配される。図１１はＪＰＥＧ２０００方式に係るレイヤ分割を表した模式図である。図１１で示すように、レイヤは各コードブロックの各コーディングパスの境界で分割分配される。各コーディングパスを伝送した場合の符号量増加と画質改善度から効率を判断してレイヤ分割分配されるのが一般的である。 A coding pass is distributed to one or more layers. FIG. 11 is a schematic diagram showing layer division according to the JPEG2000 system. As shown in FIG. 11, the layers are divided and distributed at the boundary of each coding pass of each code block. In general, the efficiency is determined from the increase in the amount of code and the degree of image quality improvement when each coding path is transmitted, and layer division is performed.

６０５の二値算術符号化は、コーディングパス分割後のデータを算術符号化する処理である。 The binary arithmetic encoding 605 is a process of arithmetically encoding the data after the coding pass division.

以上のようにして、エントロピー符号化４０４を行った後、４０５にてファイルにデータを書き込むためのビットストリームの形成を行う。レイヤ階層、解像度レベル（離散ウェーブレット変換の分解レベル）、コンポーネント、位置（プレシンクト）が同じデータがまとまって、一つのパケットを構成する。更に、パケットには、パケット長がゼロ（空のパケット）か否かを表すゼロ長パケット、現コードブロックがすでにそれ以前のレイヤ内のパケットに包含されているかを表すコードブロックの包含、ゼロビットプレーンの数、コーディングパスの数、コードブロックの圧縮画像データの長さからなるパケットヘッダが含まれる。 After entropy coding 404 is performed as described above, a bit stream for writing data to a file is formed at 405. Data having the same layer hierarchy, resolution level (decomposition level of discrete wavelet transform), component, and position (precinct) are collected to form one packet. Further, the packet includes a zero length packet indicating whether the packet length is zero (empty packet), a code block inclusion indicating whether the current code block is already included in a packet in the previous layer, zero bit A packet header including the number of planes, the number of coding passes, and the length of the compressed image data of the code block is included.

レイヤ階層数×解像度レベル数×コンポーネント数×プレシンクト数個のパケットが集まることによって、一つのタイルの画像が表現される。それらのパケットを図１２（ａ）のようにすべてまとめて一つのタイルパートとしてもよいし、図１２（ｂ）のように複数のタイルパートに分割してもよい。図１２のようにタイルパートの先頭にはタイルパートヘッダが付く。 An image of one tile is expressed by collecting the number of layer layers × the number of resolution levels × the number of components × the number of precincts. All of these packets may be combined into one tile part as shown in FIG. 12A or may be divided into a plurality of tile parts as shown in FIG. As shown in FIG. 12, the tile part header is attached to the head of the tile part.

さらに図１３で示すように、タイルパートを並べることによって、一つの画像を表現する。図１３はＪＰＥＧ２０００方式に係るタイルパートの並びを示した模式図である。タイルパートは、図１３（ａ）のように画像のタイルの順序で並べてもよいし、図１３（ｂ）のように優先したいタイルから順に並べてもよい。また、タイルを複数のタイルパートに分割した場合は、タイルパートは図１３（ｃ）のように並べられる。 Furthermore, as shown in FIG. 13, one image is expressed by arranging tile parts. FIG. 13 is a schematic diagram showing an arrangement of tile parts according to the JPEG2000 system. The tile parts may be arranged in the order of the tiles of the image as shown in FIG. 13A, or may be arranged in order from the tile to be prioritized as shown in FIG. 13B. When a tile is divided into a plurality of tile parts, the tile parts are arranged as shown in FIG.

〔情報処理装置のハードウェア構成〕
本実施形態に係る情報処理装置は、パーソナルコンピュータ（ＰＣ）やワークステーション（ＷＳ）、或いは、携帯情報端末（ＰＤＡ）等の情報処理装置で実現される。次に、情報処理装置のハードウェア構成について、図１を参照して説明する。図１は本実施形態に係る情報処理装置のハードウェア構成を示したブロック図である。 [Hardware configuration of information processing device]
The information processing apparatus according to the present embodiment is realized by an information processing apparatus such as a personal computer (PC), a workstation (WS), or a personal digital assistant (PDA). Next, the hardware configuration of the information processing apparatus will be described with reference to FIG. FIG. 1 is a block diagram showing a hardware configuration of the information processing apparatus according to the present embodiment.

図１において、１０１はＣＰＵであり、本実施形態に係る情報処理装置の各種制御を実行する。１０２はＲＯＭであり、情報処理装置の立ち上げ時に実行されるブートプログラムや各種データを格納する。１０３はＲＡＭであり、ＣＰＵ１０１が処理するための制御プログラムを格納するとともに、ＣＰＵ１０１が各種制御を実行する際の作業領域を提供する。 In FIG. 1, reference numeral 101 denotes a CPU that executes various controls of the information processing apparatus according to the present embodiment. A ROM 102 stores a boot program executed when the information processing apparatus is started up and various data. Reference numeral 103 denotes a RAM which stores a control program for processing by the CPU 101 and provides a work area when the CPU 101 executes various controls.

１０４はキーボード、１０５はマウスであり、ユーザによる各種入力操作環境を提供する。１０６は外部記憶装置であり、ハードディスクやフロッピー（登録商標）ディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ等で構成される。 A keyboard 104 and a mouse 105 provide various input operation environments for the user. An external storage device 106 includes a hard disk, a floppy (registered trademark) disk, a CD-ROM, a DVD-ROM, and the like.

１０７は表示器であり、処理の内容や処理結果を表示してユーザに伝達する。１０８はネットワークインターフェースであり、ネットワーク上の各機器（不図示）との通信を可能とする。１０９はＩＥＥＥ１３９４、ＵＳＢなどのインターフェース（Ｉ／Ｆ）であり、スキャナ１１０やデジタルカメラ１１１などの機器と通信を行う。また、１１２は上記の各構成を接続するバスである。 Reference numeral 107 denotes a display that displays processing contents and processing results and transmits them to the user. A network interface 108 enables communication with each device (not shown) on the network. Reference numeral 109 denotes an interface (I / F) such as IEEE 1394 or USB, which communicates with devices such as the scanner 110 and the digital camera 111. Reference numeral 112 denotes a bus connecting the above-described components.

尚、上記の構成においてスキャナ１１０、デジタルカメラ１１１や外部記憶装置１０６は、ネットワーク上に配置されたもので代用してもよい。また、以上の各装置と同等の機能を実現するソフトウェアにより、ハードウェア装置の代替として構成することもできる。 In the above configuration, the scanner 110, the digital camera 111, and the external storage device 106 may be replaced with those arranged on the network. Moreover, it can also be comprised as an alternative of a hardware apparatus with the software which implement | achieves a function equivalent to the above each apparatus.

本実施形態では、外部記憶装置１０６から本実施形態に係るプログラム及び関連データを直接ＲＡＭ１０３にロードして実行させる例を示すが、これに限られない。例えば、本実施形態に係るプログラムをＲＯＭ１０２に記録しておき、これをメモリマップの一部をなすように構成し、直接ＣＰＵ１０１で実行することも可能である。 In the present embodiment, an example in which the program and related data according to the present embodiment are directly loaded into the RAM 103 from the external storage device 106 and executed is shown, but the present invention is not limited to this. For example, it is possible to record the program according to the present embodiment in the ROM 102, configure the program so as to form a part of the memory map, and execute the program directly by the CPU 101.

また、本実施形態では、説明の便宜のため、本実施形態に係る情報処理装置をそれぞれ１つの装置で実現した構成について述べるが、複数の装置にリソースを分散した構成によって実現してもよい。例えば、記憶や演算のリソースを複数の装置に分散した形に構成してもよい。或いは、情報処理装置上で仮想的に実現される構成要素毎にリソースを分散し、並列処理を行うようにしてもよい。 In the present embodiment, for convenience of explanation, a configuration in which the information processing apparatus according to the present embodiment is realized by a single device will be described. However, a configuration in which resources are distributed to a plurality of devices may be realized. For example, storage and calculation resources may be distributed in a plurality of devices. Alternatively, resources may be distributed for each component virtually realized on the information processing apparatus, and parallel processing may be performed.

〔情報処理装置の機能構成〕
図２は本実施形態に係る情報処理装置の機能構成を示したブロック図である。図２に示される各機能ブロックは、図１を参照して上述した情報処理装置のＣＰＵ１０１がＲＡＭ１０３にロードされたプログラムを実行し、図１に示される各ハードウェアと協働することによって実現される。もちろん機能ブロックの一部或いは全てが専用のハードウェアで実現されてもよい。 [Functional configuration of information processing device]
FIG. 2 is a block diagram showing a functional configuration of the information processing apparatus according to the present embodiment. Each functional block shown in FIG. 2 is realized by the CPU 101 of the information processing apparatus described above with reference to FIG. 1 executing a program loaded in the RAM 103 and cooperating with each hardware shown in FIG. The Of course, some or all of the functional blocks may be realized by dedicated hardware.

図２において、２０１は検索元画像であり、ユーザーにより指定される。２０２は画像蓄積部であり、外部記憶装置１０６上に存在し、１以上の画像データを格納する。本実施形態に係る情報処理装置は、後述の類似画像検索処理において、画像蓄積部２０２に格納された画像データのうち、検索元画像２０１に類似するものを検索する。 In FIG. 2, reference numeral 201 denotes a search source image, which is designated by the user. An image storage unit 202 exists on the external storage device 106 and stores one or more image data. The information processing apparatus according to the present embodiment searches for image data similar to the search source image 201 among the image data stored in the image storage unit 202 in a later-described similar image search process.

２０３は画像ヘッダ読込部であり、検索元画像２０１、及び、画像蓄積部２０２に格納された（階層符号化された）画像データについて、それぞれ画像ヘッダの読み込みを行う。後述するように、画像ヘッダにはコーディングパスの数、コードブロックの圧縮画像データの長さ、ゼロビットプレーンの数等の、符号化情報が記述されている。 An image header reading unit 203 reads the image header for each of the search source image 201 and the image data (hierarchically encoded) stored in the image storage unit 202. As will be described later, the image header describes coding information such as the number of coding passes, the length of the compressed image data of the code block, and the number of zero bit planes.

２０４は画像種類判別部であり、画像ヘッダ部を解析し、画像ヘッダに記述された符号化情報に基づいて画像の種類を判別する。２０５は画像種類ラベル行列化部であり、画像種類判別部２０４によって判別された画像の種類をラベルとして配列し、画像種類ラベル行列を作成する。画像種類判別部２０４および画像種類ラベル行列化部２０５が実行する処理（画像種類ラベル行列作成処理）の詳細は後述する。 An image type determination unit 204 analyzes the image header portion and determines the image type based on the encoded information described in the image header. An image type label matrixing unit 205 arranges the image types determined by the image type determining unit 204 as labels, and creates an image type label matrix. Details of processing (image type label matrix creation processing) executed by the image type determination unit 204 and the image type label matrixing unit 205 will be described later.

２０６は画像種類ラベル行列比較部である。画像種類ラベル行列比較部２０６が実行する処理（画像種類ラベル行列比較処理）の詳細は後述する。 Reference numeral 206 denotes an image type label matrix comparison unit. Details of the processing (image type label matrix comparison processing) executed by the image type label matrix comparison unit 206 will be described later.

２０７は画像データ読込部であり、検索元画像２０１、及び、画像蓄積部２０２から階層符号化された画像データの読み込みを行う。 An image data reading unit 207 reads the hierarchically encoded image data from the search source image 201 and the image storage unit 202.

２０８は画像特徴量抽出部である。２０９は特徴量ラベル行列化部であり、画像特徴量抽出部２０８によって得られた画像特徴量のラベルを配列し、特徴量ラベル行列を作成する。画像特徴量抽出部２０８および特徴量ラベル行列化部２０９が実行する処理（特徴量ラベル行列作成処理）の詳細は後述する。 Reference numeral 208 denotes an image feature amount extraction unit. Reference numeral 209 denotes a feature quantity label matrixing unit that arranges the image feature quantity labels obtained by the image feature quantity extraction unit 208 to create a feature quantity label matrix. Details of processing (feature amount label matrix creation processing) executed by the image feature amount extraction unit 208 and the feature amount label matrixing unit 209 will be described later.

２１０は特徴量ラベル行列比較部である。特徴量ラベル行列比較部が実行する処理（特徴量ラベル行列比較処理）の詳細は後述する。 Reference numeral 210 denotes a feature amount label matrix comparison unit. Details of processing (feature amount label matrix comparison processing) executed by the feature amount label matrix comparison unit will be described later.

以上のような構成を備えた本実施形態に係る情報処理装置の動作例を、以下に説明する。 An operation example of the information processing apparatus according to this embodiment having the above-described configuration will be described below.

〔類似画像検索処理〕
図１４のフローチャートに従って類似画像検索の処理を説明する。図１４は本実施形態に係る情報処理装置が実行する、類似画像検索処理の手順を示したフローチャートである。 [Similar image search processing]
The similar image search process will be described with reference to the flowchart of FIG. FIG. 14 is a flowchart showing a procedure of similar image search processing executed by the information processing apparatus according to the present embodiment.

まず、ステップＳ１４０１において、ユーザから類似検索元画像（検索元画像）２０１の指定を受け付ける。本実施形態に係る情報処理装置は、例えば、類似検索元画像２０１の識別情報を選択可能に表示器１０７に表示し、キーボード１０４、マウス１０５等からの入力を受け付ける。 First, in step S1401, designation of a similar search source image (search source image) 201 is received from the user. The information processing apparatus according to the present embodiment displays, for example, the identification information of the similar search source image 201 on the display unit 107 so as to be selectable, and accepts input from the keyboard 104, the mouse 105, and the like.

次に、ステップＳ１４０２において、後述の画像種類ラベル行列作成処理により、指定された類似検索元画像２０１について画像種類ラベル行列を作成する。 Next, in step S1402, an image type label matrix is created for the designated similar search source image 201 by an image type label matrix creation process described later.

次にステップＳ１４０３において、後述の特徴量ラベル行列作成処理により、当該類似検索元画像２０１について特徴量ラベル行列を作成する。 In step S1403, a feature amount label matrix is created for the similarity search source image 201 by a feature amount label matrix creation process described later.

次に、ステップＳ１４０４において、画像蓄積部２０２に格納された画像データから１つを類似比較先画像として選択し、この選択された類似比較先画像について、後述の画像種類ラベル行列作成処理により画像種類ラベル行列を作成する。 Next, in step S1404, one of the image data stored in the image storage unit 202 is selected as a similar comparison destination image, and the selected similar comparison destination image is subjected to image type label matrix creation processing to be described later. Create a label matrix.

次に、ステップＳ１４０５において、類似検索元画像２０１の画像種類ラベル行列と、ステップＳ１４０４で選択された類似比較先画像の画像種類ラベル行列との比較を行い、画像種類ラベル行列が不一致であるか否かを判定する。タイル形状の違い等で比較できなかった場合、または、比較の結果一致していた場合（ステップＳ１４０５でＮＯ）はステップＳ１４０６に進む。比較の結果不一致だった場合（ステップＳ１４０５でＹＥＳ）は、ステップＳ１４０８へ進む。画像種類ラベル行列の比較は、後述の画像種類ラベル行列比較処理に基づいて実行する。 Next, in step S1405, the image type label matrix of the similar search source image 201 is compared with the image type label matrix of the similar comparison target image selected in step S1404, and whether or not the image type label matrix does not match. Determine whether. If comparison is not possible due to a difference in tile shape or the like, or if the comparison results indicate a match (NO in step S1405), the process advances to step S1406. If the comparison results in a mismatch (YES in step S1405), the process advances to step S1408. The comparison of the image type label matrix is executed based on an image type label matrix comparison process described later.

ステップＳ１４０６では、後述の特徴量ラベル行列作成処理により、ステップＳ１４０４で選択された類似比較先画像について特徴量ラベル行列を作成する。 In step S1406, a feature amount label matrix is created for the similar comparison target image selected in step S1404 by a feature amount label matrix creation process described later.

次に、ステップＳ１４０７において、類似検索元画像２０１の特徴量ラベル行列と、ステップＳ１４０４で選択された類似比較先画像の特徴量ラベル行列との比較を行い、類似度を算出する。特徴量ラベル行列の比較は後述の特徴量ラベル行列比較処理に基づいて実行する。 In step S1407, the feature amount label matrix of the similar search source image 201 is compared with the feature amount label matrix of the similar comparison target image selected in step S1404 to calculate the similarity. The comparison of the feature amount label matrix is executed based on a feature amount label matrix comparison process described later.

類似画像検索処理においては、ステップＳ１４０４からステップＳ１４０７の処理を、画像蓄積部２０２に格納された全登録データ（画像データ）が類似比較先画像となるように繰り返し実行する。 In the similar image search processing, the processing from step S1404 to step S1407 is repeatedly executed so that all registered data (image data) stored in the image storage unit 202 becomes a similar comparison destination image.

ステップＳ１４０８では、ステップＳ１４０４からステップＳ１４０７の処理が画像蓄積部２０２に格納された全ての画像データについて実行されたか否かを判定する。実行されている場合（ステップＳ１４０８でＹＥＳ）はステップＳ１４０９へ進む。実行されていない場合（ステップＳ１４０８でＮＯ）は、ステップＳ１４０４へ戻り、画像蓄積部２０２から、まだステップＳ１４０４乃至Ｓ１４０７の処理を実行していない画像データを選択して、ステップＳ１４０４乃至Ｓ１４０７の処理を実行する。 In step S1408, it is determined whether or not the processing from step S1404 to step S1407 has been executed for all the image data stored in the image storage unit 202. If it is being executed (YES in step S1408), the process advances to step S1409. If not executed (NO in step S1408), the process returns to step S1404, image data that has not yet been processed in steps S1404 to S1407 is selected from the image storage unit 202, and the processes in steps S1404 to S1407 are performed. Execute.

ステップＳ１４０９では、ステップＳ１４０７において得られた類似度の高い順に、類似比較先画像の識別情報をソートし、ステップＳ１４１０へ進む。 In step S1409, the identification information of the similar comparison destination images is sorted in descending order of similarity obtained in step S1407, and the process proceeds to step S1410.

ステップＳ１４１０では、ソートされた類似比較先画像の識別情報の一覧を検索結果として表示器１０７に表示し、ユーザに提示する。そして処理を終了する。 In step S1410, the sorted list of identification information of the similar comparison target images is displayed on the display unit 107 as a search result and presented to the user. Then, the process ends.

〔画像種類ラベル行列作成処理〕
次に、ステップＳ１４０２及びＳ１４０４において実行する画像種類ラベル行列作成処理について説明する。画像種類ラベル行列作成処理は、画像種類判別部２０４が実行する画像種類判別処理と、画像種類ラベル行列化部２０５が実行する画像種類ラベル行列化処理からなる。 [Image type label matrix creation processing]
Next, the image type label matrix creation process executed in steps S1402 and S1404 will be described. The image type label matrix creating process includes an image type determining process executed by the image type determining unit 204 and an image type label matrixing process executed by the image type label matrixing unit 205.

（画像種類判別処理）
画像種類判別処理は、画像が保存されたときに分割されたタイル単位で、画像の種類（本実施形態の例では、写真（自然）画像であるか、文字画像・グラフィック画像であるか）を判別する処理である。画像種類は、パケットヘッダに書かれている、コーディングパスの数、コードブロックの圧縮画像データの長さ、ゼロビットプレーンの数等の符号化情報のうち、少なくともいずれかの情報に基づいて判断する。 (Image type discrimination processing)
The image type determination process is performed by determining the type of image (in the example of this embodiment, whether it is a photograph (natural) image, a character image, or a graphic image) in units of tiles divided when the image is stored. This is a process for determining. The image type is determined based on at least one of coding information such as the number of coding passes, the length of the compressed image data of the code block, and the number of zero bit planes written in the packet header. .

コーディングパスの数により画像種類判別する場合は、低周波成分（図５（ｃ）の２ＬＬ）におけるコーディングパスの総数と高周波成分（図５（ｃ）の１ＨＬ、１ＬＨ、１ＨＨ）におけるコーディングパスの総数を比較する。低周波成分におけるコーディングパスの総数に対する、高周波成分におけるコーディングパスの総数の割合が所定の値を超えている場合は文字画像・グラフィック画像であり、低周波成分におけるコーディングパスの総数に対する、高周波成分におけるコーディングパスの総数の割合が所定の値を下回っている場合は自然画像であると判別する。 When discriminating image types based on the number of coding passes, the total number of coding passes in the low frequency component (2LL in FIG. 5C) and the total number of coding passes in the high frequency components (1HL, 1LH, 1HH in FIG. 5C). Compare When the ratio of the total number of coding passes in the high frequency component to the total number of coding passes in the low frequency component exceeds a predetermined value, it is a character image / graphic image, and in the high frequency component with respect to the total number of coding passes in the low frequency component When the ratio of the total number of coding passes is below a predetermined value, it is determined that the image is a natural image.

ビットプレーンのレイヤ枚数が複数枚である場合は、レイヤごとにコーディングパスの数を算出し、判別してもよい。その場合は、最上位レイヤのコーディングパスの数を利用することによって、より精度の高い判別が可能となる。 When the number of bit plane layers is plural, the number of coding passes may be calculated for each layer and determined. In that case, by using the number of coding passes of the highest layer, discrimination with higher accuracy becomes possible.

コードブロックの圧縮画像データの長さ（圧縮画像データ長）により画像種類判別する場合は、低周波成分（図５（ｃ）の２ＬＬ）におけるコードブロックの圧縮画像データの長さと高周波成分（図５（ｃ）の１ＨＬ、１ＬＨ、１ＨＨ）におけるコードブロックの圧縮画像データの長さを比較する。低周波成分におけるコードブロックの圧縮画像データの長さに対する、高周波成分におけるコードブロックの圧縮画像データの長さの割合が所定の値を超えている場合は自然画像であり、低周波成分におけるコードブロックの圧縮画像データの長さに対する、高周波成分におけるコードブロックの圧縮画像データの長さの割合が所定の値を下回っている場合は文字画像・グラフィック画像であると判別する。 When the image type is determined based on the length of the compressed image data of the code block (compressed image data length), the length of the compressed image data of the code block and the high frequency component (FIG. 5) in the low frequency component (2LL in FIG. 5C). The lengths of the compressed image data of the code blocks in (c) 1HL, 1LH, 1HH) are compared. When the ratio of the length of the compressed image data of the code block in the high frequency component to the length of the compressed image data of the code block in the low frequency component exceeds a predetermined value, it is a natural image, and the code block in the low frequency component If the ratio of the length of the compressed image data of the code block in the high-frequency component to the length of the compressed image data is less than a predetermined value, it is determined that the image is a character image / graphic image.

ゼロビットプレーンの数により画像種類判別する場合は、高周波成分（図５（ｃ）の１ＨＬ、１ＬＨ、１ＨＨ）におけるゼロビットプレーンの数が所定の値を超えている場合は自然画像であり、高周波成分（図５（ｃ）の１ＨＬ、１ＬＨ、１ＨＨ）におけるゼロビットプレーンの数が所定の値を下回っている場合は文字画像・グラフィック画像であると判別する。 When the image type is determined based on the number of zero bit planes, if the number of zero bit planes in the high frequency components (1HL, 1LH, 1HH in FIG. 5C) exceeds a predetermined value, the image is a natural image. When the number of zero bit planes in the component (1HL, 1LH, 1HH in FIG. 5C) is below a predetermined value, it is determined that the image is a character image / graphic image.

上記の、コーディングパスの数、コードブロックの圧縮画像データの長さ、ゼロビットプレーンの数等の符号化情報のうち、少なくともいずれかの情報に基づく手法は、一定の精度で画像種類を判別することが可能であることが経験的に知られている。 The method based on at least one of the coding information such as the number of coding passes, the length of the compressed image data of the code block, the number of zero bit planes, etc., determines the image type with a certain accuracy. It is empirically known that it is possible.

上記のように、本実施形態に係る構成においては、画像ファイルの画像データ全域を読み取ることなく、ヘッダ部分に書かれている情報のみを読み込み、ヘッダ情報のみを解析することにより、使用メモリ量が少なく高速な画像種類判別を行うことができる。 As described above, in the configuration according to this embodiment, the amount of memory used is reduced by reading only the information written in the header portion and analyzing only the header information without reading the entire image data of the image file. It is possible to perform image type discrimination with little and high speed.

画像種類判別部２０４は、画像種類判別処理を実行した結果、写真（自然）画像と判別されたのタイルについては０、文字画像・グラフィック画像と判別されたタイルについては１のラベルを付与する。即ち、各タイルについて、タイルを識別する情報と画像の種類を示すラベル情報とを関連づけてＲＡＭ１０３、外部記憶装置１０６等の記憶装置に記憶する。 The image type discriminating unit 204 assigns a label of 0 for tiles that are discriminated as photograph (natural) images as a result of executing the image type discriminating process, and 1 for tiles discriminated as character images / graphic images. That is, for each tile, information for identifying the tile and label information indicating the type of image are associated and stored in a storage device such as the RAM 103 or the external storage device 106.

（画像種類ラベル行列化処理）
画像種類ラベル行列化処理は、画像種類判別処理によって判別された結果のラベル情報を、対応するタイルの画像データ全体における配置に基づいて配列し、画像種類ラベル行列を作成する処理である。 (Image type label matrix processing)
The image type label matrixing process is a process of creating the image type label matrix by arranging the label information obtained as a result of the discrimination by the image type discrimination process based on the arrangement of the corresponding tiles in the entire image data.

図１５は画像種類ラベル行列を作成する際のタイル順序列を説明する図である。図１５において、１５０１は画像データ、１５０２は画像データ１５０１を構成するタイルをそれぞれ模式的に示している。また、各タイル１５０２のます内に記述された数字はタイルの配列順序を示す通し番号である。画像種類ラベル行列化部２０５は、図５に例示した分割画像タイルの通し番号に基づいて上記の画像種類ラベル情報を配列し、ラベル行列を作る。本実施形態では、ラベル行列は、タイルの配置に対応してラベル情報が２次元的に配列された行列とするが、所定の順序で１次元に並べたものもラベル行列と称する。 FIG. 15 is a diagram for explaining a tile order sequence when an image type label matrix is created. In FIG. 15, 1501 schematically illustrates image data, and 1502 schematically illustrates tiles constituting the image data 1501. The numbers described in the tiles 1502 are serial numbers indicating the tile arrangement order. The image type label matrixing unit 205 arranges the image type label information based on the serial numbers of the divided image tiles illustrated in FIG. 5 to create a label matrix. In the present embodiment, the label matrix is a matrix in which label information is two-dimensionally arranged corresponding to the arrangement of tiles, but a matrix arranged one-dimensionally in a predetermined order is also called a label matrix.

〔画像種類ラベル行列比較処理〕
次に、ステップＳ１４０５において画像種類ラベル行列比較部２０６が実行する画像種類ラベル行列比較処理について説明する。 [Image type label matrix comparison processing]
Next, the image type label matrix comparison process executed by the image type label matrix comparison unit 206 in step S1405 will be described.

検索元画像２０１と比較先画像とでタイル分割の態様が等しい場合は、検索元画像２０１の画像種類ラベル行列と、比較先画像の画像種類ラベル行列との単純比較を行い、一致するか、一致しないかの判断をする。即ち、対応するタイル位置におけるラベル情報の値（画像種類ラベル行列の成分値）が全て等しいか否かを判定する。全て等しい場合は一致すると判断し、等しくない場合は一致しない（不一致）と判断する。尚、タイル分割の態様が等しいとは、画像データをタイルに分割した場合に、縦方向のタイル数と、横方向のタイル数とが共に等しい、即ち、画像種類ラベル行列の行数と列数が共に等しいことを意味する。 When the tile division mode is the same between the search source image 201 and the comparison destination image, a simple comparison is made between the image type label matrix of the search source image 201 and the image type label matrix of the comparison destination image to match or match Judge whether to do. That is, it is determined whether or not the values of the label information at the corresponding tile positions (component values of the image type label matrix) are all equal. If they are all equal, it is determined that they match. Note that the tile division mode is equal when the image data is divided into tiles, the number of tiles in the vertical direction is equal to the number of tiles in the horizontal direction, that is, the number of rows and the number of columns in the image type label matrix. Means that both are equal.

検索元画像２０１と比較先画像とでタイル分割の態様が異なる場合は、以下の処理を行う。即ち、画像種類ラベル行列の行数と列数とが共に等しくなるように、検索元画像２０１と比較先画像とのそれぞれについて、対応する画像種類ラベル行列の成分を複数のグループに分解（グループ化）し、各グループについてグループに含まれる成分を１つに合成する処理を行う。そして、合成後の画像種類ラベル行列について、検索元画像２０１と比較先画像との、それぞれ対応する位置の成分値が全て等しい場合に一致すると判断し、等しくない場合に不一致と判断する。 When the tile division mode is different between the search source image 201 and the comparison destination image, the following processing is performed. That is, for each of the search source image 201 and the comparison target image, the components of the corresponding image type label matrix are decomposed into a plurality of groups (grouping) so that both the number of rows and the number of columns of the image type label matrix are equal. And processing for combining the components included in the group into one for each group. The combined image type label matrix is determined to match when the component values at the corresponding positions of the search source image 201 and the comparison target image are all equal, and is determined to be mismatched when they are not equal.

ここで、対応する画像種類ラベル行列の複数の成分を１つに合成する処理は、画像データについて、複数のタイルを結合して１つのタイルにする処理に相当する。また、複数のタイルを結合した後の、画像データにおけるタイル境界の相対位置が、検索元画像２０１と比較先画像とで近接している場合に、類似度の判定を高精度で行うことができる。 Here, the process of combining the plurality of components of the corresponding image type label matrix into one corresponds to the process of combining a plurality of tiles into one tile for the image data. In addition, when the relative position of the tile boundary in the image data after combining a plurality of tiles is close between the search source image 201 and the comparison target image, the similarity can be determined with high accuracy. .

このため、本実施形態では、画像種類ラベル行列の成分のグループ化を以下の条件を満たすように実行する。
（１）合成後の画像種類ラベル行列の行数が、合成前の、検索元画像２０１に対応する画像種類ラベル行列の行数と、比較先画像に対応する画像種類ラベル行列の行数との最大公約数となる。
（２）合成後の画像種類ラベル行列の列数が、合成前の、検索元画像２０１に対応する画像種類ラベル行列の列数と、比較先画像に対応する画像種類ラベル行列の列数との最大公約数となる。 For this reason, in this embodiment, the grouping of the components of the image type label matrix is executed so as to satisfy the following condition.
(1) The number of rows of the image type label matrix after synthesis is the number of rows of the image type label matrix corresponding to the search source image 201 and the number of rows of the image type label matrix corresponding to the comparison destination image before synthesis. The greatest common divisor.
(2) The number of columns of the image type label matrix after combining is the number of columns of the image type label matrix corresponding to the search source image 201 before combining and the number of columns of the image type label matrix corresponding to the comparison destination image. The greatest common divisor.

これは、以下の条件を満たすようにタイルをグループ化することに相当する。
（１）タイル結合後の縦方向のタイル数が、結合前の検索元画像２０１の縦方向のタイル数と比較先画像の縦方向のタイル数との最大公約数となる。
（２）タイル結合後の横方向のタイル数が、結合前の検索元画像２０１の横方向のタイル数と比較先画像の横方向のタイル数との最大公約数となる。 This is equivalent to grouping tiles so as to satisfy the following conditions.
(1) The number of tiles in the vertical direction after tile combination is the greatest common divisor of the number of tiles in the vertical direction of the search source image 201 before combination and the number of tiles in the vertical direction of the comparison target image.
(2) The number of tiles in the horizontal direction after combining the tiles is the greatest common divisor of the number of tiles in the horizontal direction of the search source image 201 before combining and the number of tiles in the horizontal direction of the comparison target image.

このように行列成分（タイル）をグループ化することにより、合成に係るグループに含まれる行列成分（タイル）の個数及び配置が全て等しくなるように、行列成分（タイル）のグループ化を行うことができる。このようにグループ化して行列成分を合成することは、タイル境界の画像データにおける相対位置が検索元画像２０１と比較先画像とで近接するようにタイルを結合させることに相当するため、類似度の判定を高精度で行うことができる。 By grouping the matrix components (tiles) in this way, the matrix components (tiles) can be grouped so that the number and arrangement of the matrix components (tiles) included in the group related to the synthesis are all equal. it can. Since grouping and synthesizing matrix components in this way corresponds to combining the tiles so that the relative position in the image data of the tile boundary is close to the search source image 201 and the comparison target image, The determination can be performed with high accuracy.

なお、行列成分の合成（タイルの結合）は、以下の条件を満たすように実行する。
（条件１）合成後の行列成分（結合後のタイル）の総数が所定の値を下回る場合は、画像種類ラベル行列の比較が不可能として処理を終了する。
（条件２）結合するタイルのラベル、即ち、合成に係るグループに含まれる成分値がすべて１（文字画像・グラフィック画像）の場合は、結合（合成）後のラベル（成分値）を１とする。結合するタイルのラベルの０（写真画像）の割合、即ち、合成に係るグループに含まれる値が０の成分値の割合が所定の値を超えている場合は、結合後のラベル（合成後の成分値）を０とする。それ以外の場合は、画像種類ラベル行列の比較が不可能として処理を終了する。 Note that the synthesis of matrix components (combination of tiles) is executed so as to satisfy the following conditions.
(Condition 1) If the total number of matrix components after combination (combined tiles) is less than a predetermined value, the image type label matrix cannot be compared and the process is terminated.
(Condition 2) When the labels of tiles to be combined, that is, when the component values included in the group related to composition are all 1 (character image / graphic image), the label (component value) after combination (composition) is set to 1. . If the ratio of 0 (photo image) of the labels of the tiles to be combined, that is, the ratio of the component values with the value 0 included in the group related to combination exceeds a predetermined value, the combined label (after combination) Component value) is set to 0. In other cases, the image type label matrix cannot be compared and the process is terminated.

尚、上記の条件は用途や目的に応じて適切に変更することができる。例えば、（条件１）の代わりに、「タイルの結合数が所定の値を上回る場合は、画像種類ラベル行列比較不可能として処理を終了する」としてもよい。 In addition, said conditions can be changed suitably according to a use and the objective. For example, instead of (Condition 1), “If the number of combined tiles exceeds a predetermined value, the image type label matrix comparison is impossible and the process is terminated”.

上記のように、画像種類判別処理において判別された結果に基づいて画像の類似性を判定することで、画像の類似性を高速に判定することができる。 As described above, by determining the similarity of images based on the result determined in the image type determination process, the similarity of images can be determined at high speed.

〔特徴量ラベル行列作成処理〕
次に、ステップＳ１４０３、Ｓ１４０６において実行する特徴量ラベル行列作成処理について説明する。特徴量ラベル行列作成処理は、画像特徴量抽出部２０８が実行する特徴量抽出処理と、特徴量ラベル行列化部２０９が実行する特徴量ラベル行列化処理からなる。本実施形態では画像特徴量として後述のカラーラベルを用いる。 [Feature label creation process]
Next, the feature amount label matrix creation process executed in steps S1403 and S1406 will be described. The feature amount label matrix creation process includes a feature amount extraction process executed by the image feature amount extraction unit 208 and a feature amount label matrix formation process executed by the feature amount label matrixing unit 209. In the present embodiment, a color label described later is used as the image feature amount.

（特徴量抽出処理）
まず、当該画像を複数のブロックに分割する。本実施形態では、画像を縦横のブロックに分割する。図１６は本実施形態におけるブロック分割例を示す図である。図１６に示されるように、本実施形態では、３×３の計９個に画像を分割するものとする。なお、本実施形態で用いる３×３への分割はあくまで説明のためのものであり、どのような分割形態も採用することができる。実際には、例えば、自然画であれば１０×１０以上の分割数とするのが好ましい。また、白の無地背景に商品が写っているような場合であれば、１３×１３以上の分割数とするのが好ましい。尚、ブロック分割の態様は、タイル等のＪＰＥＧ２０００規格における画像分割単位に関わらず、自由に選択することができる。 (Feature extraction process)
First, the image is divided into a plurality of blocks. In this embodiment, an image is divided into vertical and horizontal blocks. FIG. 16 is a diagram showing an example of block division in this embodiment. As shown in FIG. 16, in this embodiment, it is assumed that the image is divided into a total of 9 × 3 × 3. Note that the 3 × 3 division used in the present embodiment is for illustrative purposes only, and any division form can be adopted. Actually, for example, for natural images, the number of divisions is preferably 10 × 10 or more. In addition, if the product is shown on a white plain background, the number of divisions is preferably 13 × 13 or more. The mode of block division can be freely selected regardless of the image division unit in the JPEG2000 standard such as tile.

図１７は本実施形態における多次元特徴量空間を説明する図である。図１７に示すように、多次元特徴量空間（ＲＧＢカラー空間）は複数のブロック（色ブロック）、即ちセル（色セル）に分割され、それぞれのセル（色セル）に対して通し番号でユニークなラベルが付与される。図１７の例では、ＲＧＢカラー空間を３×３×３＝２７のブロック（セル）に分割している。ここで、多次元特徴量空間（ＲＧＢカラー空間）をそれぞれ一定の空間を備える複数のブロックに分けたのは、厳密な画像特徴量（色、ＲＧＢ値）の違いを吸収するようにＲＧＢカラー空間の範囲に一定の幅を持たせることで、特徴量の類似度を高い精度で比較するためである。 FIG. 17 is a diagram for explaining a multidimensional feature amount space in the present embodiment. As shown in FIG. 17, the multidimensional feature space (RGB color space) is divided into a plurality of blocks (color blocks), that is, cells (color cells), and each cell (color cell) is unique by a serial number. Label is given. In the example of FIG. 17, the RGB color space is divided into 3 × 3 × 3 = 27 blocks (cells). Here, the multi-dimensional feature amount space (RGB color space) is divided into a plurality of blocks each having a fixed space in order to absorb strict differences in image feature amounts (color, RGB value). This is because the degree of similarity of feature amounts is compared with high accuracy by giving a certain width to the range.

次に、各分割ブロックについて定められた画像特徴量計算処理を行い、上記多次元特徴量空間上のどのセルに属するかを求め、対応するラベルを求める。この画像特徴量計算処理は、例えば、次のように行う。即ち、まず、処理対象のブロックに含まれる全ての画素についてＲＧＢ値を取得し、それぞれの画素について属する色セルを求める。そして、属する画素の数が最も多い色セルのラベルを、その分割画像ブロックのパラメータラベル（カラーラベル）として決定する。このような処理をすべてのブロックに対して行う。 Next, image feature amount calculation processing determined for each divided block is performed to determine which cell in the multidimensional feature amount space belongs to and a corresponding label. This image feature amount calculation processing is performed as follows, for example. That is, first, RGB values are acquired for all pixels included in the processing target block, and color cells belonging to the respective pixels are obtained. Then, the label of the color cell having the largest number of belonging pixels is determined as the parameter label (color label) of the divided image block. Such processing is performed for all blocks.

尚、上記の例では、画像特徴量としてＲＧＢ値をそのまま用いた構成を述べたが、本実施形態に係る構成はこれに限られない。例えば、事前に複数のサンプル画像データについて主成分分析等を利用した解析を行っておき、画像の類似度を的確に反映するような、画像特徴量の規格化条件を求めておき、この規格化条件において画像特徴量計算処理を行うように構成してもよい。 In the above example, the configuration in which the RGB value is used as it is as the image feature amount is described, but the configuration according to the present embodiment is not limited to this. For example, analysis using a principal component analysis or the like is performed on a plurality of sample image data in advance, and an image feature amount normalization condition that accurately reflects image similarity is obtained, and this normalization is performed. You may comprise so that an image feature-value calculation process may be performed on condition.

（特徴量ラベル行列化処理）
以上のようにして各ブロックに対してパラメータラベルを付与した後、各ブロックに付与されたパラメータラベルを所定のブロック順序で並べることにより、パラメータラベル行列（以下、ラベル行列とする）を生成する。 (Feature label matrix processing)
After assigning parameter labels to each block as described above, a parameter label matrix (hereinafter referred to as a label matrix) is generated by arranging the parameter labels assigned to each block in a predetermined block order.

ラベル行列を生成する場合も、図１５のように各分割画像ブロックに通し番号を付与し、この通し番号に基づいて上記のパラメータラベルを配列し、ラベル行列を作成する。本実施形態では、ラベル行列は、ブロックの配置に対応してラベル情報が２次元的に配列された行列とするが、所定の順序で１次元に並べたものもラベル行列と称する。 Also in the case of generating a label matrix, a serial number is assigned to each divided image block as shown in FIG. 15, and the parameter labels are arranged based on this serial number to create a label matrix. In the present embodiment, the label matrix is a matrix in which the label information is two-dimensionally arranged corresponding to the arrangement of the blocks, but the one arranged in one dimension in a predetermined order is also called a label matrix.

〔特徴量ラベル行列比較処理〕
次に、ステップＳ１４０７において特徴量ラベル行列比較部２１０が実行する、特徴量ラベル（カラーラベル）行列同士の類似比較（類似度の算出）を行う処理（特徴量ラベル行列比較処理）について、図１８を参照して説明する。図１８は、ラベル行列を比較し類似度を求める際に用いる、ラベル間のペナルティマトリックスの一例を示す図である。 [Feature Label Label Comparison Processing]
Next, the processing (feature amount label matrix comparison processing) performed by the feature amount label matrix comparison unit 210 in step S1407 to perform similarity comparison (similarity calculation) between feature amount label (color label) matrices is shown in FIG. Will be described with reference to FIG. FIG. 18 is a diagram illustrating an example of a penalty matrix between labels used when comparing label matrices and obtaining similarity.

図１８において、１８０１、１８０２は、それぞれ比較を行う特徴量ラベル行列の成分値（ラベル）を示している。また、表中に示された値は、比較する２つのラベルの類似度を示すペナルティであり、ペナルティの値が小さいほどラベルは類似していることを示す。例えば、１８０３のように、ラベル２とラベル６のペナルティは「７」である。また、例えば、１８０４のように、同じラベル同士のペナルティは当然のことながら「０」となっている。 In FIG. 18, reference numerals 1801 and 1802 denote component values (labels) of feature quantity label matrices to be compared. The values shown in the table are penalties indicating the similarity between two labels to be compared, and the smaller the penalty value, the more similar the labels are. For example, as in 1803, the penalty of label 2 and label 6 is “7”. Further, for example, as in 1804, the penalty between the same labels is naturally “0”.

本マトリックスの使用目的はラベルの類似に応じた距離判定を行うことにある。すなわち、本実施形態では、特徴量空間としてＲＧＢカラー空間を用いているので、色の類似に応じた距離判定が行えることになる。ラベル間のパターンマッチングの際に近接するセル同士ではペナルティ（距離）を小さくし、遠いものには大きなペナルティを与えるように、ペナルティマトリックスを定義しておく。 The purpose of this matrix is to perform distance determination according to the similarity of labels. That is, in this embodiment, since the RGB color space is used as the feature amount space, the distance determination according to the similarity of colors can be performed. A penalty matrix is defined so that a penalty (distance) is reduced between adjacent cells when pattern matching between labels is performed, and a large penalty is given to a distant cell.

本実施形態では、検索元画像２０１の特徴量ラベル行列と、検索対象画像の特徴量ラベル行列との、それぞれ対応する位置のラベルの値について、図１８のペナルティマトリックスを参照して距離を求める。そして、特徴量ラベル行列の全ての成分（ラベル）について求めたペナルティの値に基づいて類似度を求める。類似度は、各ペナルティ値に対する増加関数となるように算出する。例えば、類似度は、ラベル列中の全ラベルについての距離（ペナルティ）の和とすることができる。 In the present embodiment, distances are obtained with reference to the penalty matrix in FIG. 18 for the label values at corresponding positions in the feature amount label matrix of the search source image 201 and the feature amount label matrix of the search target image. Then, the similarity is obtained based on the penalty values obtained for all the components (labels) of the feature amount label matrix. The similarity is calculated so as to be an increasing function for each penalty value. For example, the similarity can be the sum of distances (penalties) for all labels in the label row.

図１９は、マッチングによるラベル列間の距離の算出を説明する図である。例えば、図１９の例においては、検索元画像２０１のラベル列１９０１が「１１２３１３４４１」であり、検索対象画像のラベル列１９０２が「１１３２２４４５２」であるので、図１８のペナルティマトリックスを用いてマッチングを行うと、ペナルティの列は「００２２１７０１１」となる。従って、類似度をペナルティの和とすると、
距離（最終解）、即ち、類似度は、
０＋０＋２＋２＋１＋７＋０＋１＋１＝１４
となる。 FIG. 19 is a diagram for explaining the calculation of the distance between label strings by matching. For example, in the example of FIG. 19, since the label column 1901 of the search source image 201 is “112313441” and the label column 1902 of the search target image is “1132244452”, matching is performed using the penalty matrix of FIG. The penalty column is “002217011”. Therefore, if similarity is the sum of penalties,
The distance (final solution), that is, the similarity is
0 + 0 + 2 + 2 ++ 1 + 7 + 0 + 1 + 1 = 14
It becomes.

本実施形態では、特徴量をラベル表現して類似検索を行っているが、特徴量をラベル化せずに類似検索を行うように構成してもよい。 In this embodiment, the similarity search is performed by expressing the feature quantity as a label. However, the similarity search may be performed without labeling the feature quantity.

上記のように、ステップＳ１４０５における画像種類判別処理において類似と判定された画像データについてのみ特徴量ラベル行列作成処理、特徴量ラベル行列比較処理を実行するため、画像検索を高速に実行することができる。 As described above, since the feature amount label matrix creation processing and the feature amount label matrix comparison processing are executed only for the image data determined to be similar in the image type determination processing in step S1405, the image search can be executed at high speed. .

〔その他の実施の形態〕
上記の構成によれば、あらかじめ画像特徴量を抽出することなく高速な画像検索が提供可能であるが、あらかじめ画像特徴量を抽出しておき、画像種類判別行列をプリサーチとして使用することも可能である。上記のように、画像特徴量の抽出処理に係る演算量は少ないため、事前処理を行う場合も高速に行うことができる。事前処理を行う場合はさらに高速な画像検索を実現することができる。 [Other Embodiments]
According to the above configuration, high-speed image retrieval can be provided without extracting image feature amounts in advance, but it is also possible to extract image feature amounts in advance and use the image type discrimination matrix as a pre-search. It is. As described above, since the amount of calculation related to the image feature amount extraction processing is small, it can be performed at high speed even when pre-processing is performed. When pre-processing is performed, a higher-speed image search can be realized.

また、上記各実施形態では画像特徴量として色情報を選んだが、本発明に係る実施形態はこれに限られるものではなく、その他の画像パラメータ（例えば、輝度値やエッジヒストグラム）を画像分割ブロックごとに求めることで実施することも可能である。 Further, although color information is selected as the image feature amount in each of the above embodiments, the embodiment according to the present invention is not limited to this, and other image parameters (for example, luminance values and edge histograms) are set for each image division block. It is also possible to implement it by asking for.

また、上記各実施形態では１つの特徴量に基づいた認識の例を挙げたが、その他の特徴量での検索結果との論理演算を行うことにより、複数の特徴量に基づく高速な検索を行うことも可能である。 In each of the above embodiments, an example of recognition based on one feature amount has been described. However, a high-speed search based on a plurality of feature amounts is performed by performing a logical operation on a search result with other feature amounts. It is also possible.

１つの画像に対して複数の画像特徴量を用いた検索を行う場合には、上記の構成において得られる類似度を１つの新たなる画像特徴量とみなし、複数のパラメータを用いた多変量解析を行い統計的な距離尺度を用いた検索を行うことも可能である。また、上記実施形態では、類似度が所定値を越える類似画像を検索結果として得るが、類似度の高い画像から順に前もって指定された個数の画像を検索結果として出力するようにしてもよいことは言うまでもない。 When performing a search using a plurality of image feature amounts for one image, the similarity obtained in the above configuration is regarded as one new image feature amount, and a multivariate analysis using a plurality of parameters is performed. It is also possible to perform a search using a statistical distance measure. In the above-described embodiment, similar images having a degree of similarity exceeding a predetermined value are obtained as search results. However, a predetermined number of images may be output as search results in order from images with high similarity. Needless to say.

なお、本発明は、例えばホストコンピュータ，インタフェイス機器，リーダ，プリンタなどの複数の機器から構成されるシステムに適用しても、一つの機器からなる装置（例えば、複写機，ファクシミリ装置など）に適用してもよい。 Note that the present invention can be applied to an apparatus (for example, a copier, a facsimile machine, etc.) composed of a single device even if it is applied to a system composed of a plurality of devices such as a host computer, an interface device, a reader, and a printer. You may apply.

また、本発明の目的は、前述した実施形態の機能を実現するソフトウェアのプログラムコードを記録した記憶媒体を、システムあるいは装置に供給し、そのシステムあるいは装置のコンピュータ（またはＣＰＵやＭＰＵ）が記憶媒体に格納されたプログラムコードを読出し実行することによっても、達成されることは言うまでもない。 Another object of the present invention is to supply a storage medium storing software program codes for implementing the functions of the above-described embodiments to a system or apparatus, and the computer (or CPU or MPU) of the system or apparatus stores the storage medium. Needless to say, this can also be achieved by reading and executing the program code stored in the.

この場合、記憶媒体から読出されたプログラムコード自体が前述した実施形態の機能を実現することになり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。 In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiments, and the storage medium storing the program code constitutes the present invention.

プログラムコードを供給するための記憶媒体としては、例えば、フロッピディスク，ハードディスク，光ディスク，光磁気ディスク，ＣＤ−ＲＯＭ，ＣＤ−Ｒ，磁気テープ，不揮発性のメモリカード，ＲＯＭなどを用いることができる。 As a storage medium for supplying the program code, for example, a floppy disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

また、コンピュータが読出したプログラムコードを実行することにより、前述した実施形態の機能が実現されるだけでなく、そのプログラムコードの指示に基づき、コンピュータ上で稼働しているＯＳ（オペレーティングシステム）などが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also an OS (operating system) operating on the computer based on the instruction of the program code. It goes without saying that a case where the function of the above-described embodiment is realized by performing part or all of the actual processing and the processing is included.

さらに、記憶媒体から読出されたプログラムコードが、コンピュータに挿入された機能拡張ボードやコンピュータに接続された機能拡張ユニットに備わるメモリに書込まれた後、そのプログラムコードの指示に基づき、その機能拡張ボードや機能拡張ユニットに備わるＣＰＵなどが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Further, after the program code read from the storage medium is written into a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, the function expansion is performed based on the instruction of the program code. It goes without saying that the CPU or the like provided in the board or the function expansion unit performs part or all of the actual processing, and the functions of the above-described embodiments are realized by the processing.

本実施形態に係る情報処理装置のハードウェア構成を示したブロック図である。It is the block diagram which showed the hardware constitutions of the information processing apparatus which concerns on this embodiment. 本実施形態に係る情報処理装置の機能構成を示したブロック図である。It is the block diagram which showed the function structure of the information processing apparatus which concerns on this embodiment. ＪＰＥＧ２０００画像のタイル分割を模式的に示した図である。It is the figure which showed typically the tile division | segmentation of a JPEG2000 image. ＪＰＥＧ２０００方式に係るエンコード処理手順を示した図である。It is the figure which showed the encoding process sequence based on a JPEG2000 system. ＪＰＥＧ２０００方式に係る離散ウェーブレット変換を示した模式図である。It is the schematic diagram which showed the discrete wavelet transform based on a JPEG2000 system. ＪＰＥＧ２０００方式に係るエントロピー符号化手順を示した図である。It is the figure which showed the entropy encoding procedure which concerns on a JPEG2000 system. ＪＰＥＧ２０００方式に係るプレシンクト分割を模式的に表した図である。It is the figure which represented the precinct division | segmentation based on a JPEG2000 system typically. ＪＰＥＧ２０００方式に係るコードブロック分割を表した模式図である。It is the model showing the code block division | segmentation based on a JPEG2000 system. ＪＰＥＧ２０００方式に係るビットプレーン分割を表した模式図である。It is the model showing the bit plane division | segmentation which concerns on a JPEG2000 system. ＪＰＥＧ２０００方式に係るコーディングパスへの分割を表した模式図である。It is a schematic diagram showing the division | segmentation into the coding pass based on a JPEG2000 system. ＪＰＥＧ２０００方式に係るレイヤ分割を表した模式図である。It is a schematic diagram showing layer division according to the JPEG2000 system. ＪＰＥＧ２０００方式に係るタイルパートの構成を示した模式図である。It is the schematic diagram which showed the structure of the tile part which concerns on a JPEG2000 system. ＪＰＥＧ２０００方式に係るタイルパートの並びを示した模式図である。It is the schematic diagram which showed the arrangement | sequence of the tile part which concerns on a JPEG2000 system. 本実施形態に係る類似画像検索処理の手順を示したフローチャートである。It is the flowchart which showed the procedure of the similar image search process which concerns on this embodiment. 画像種類ラベル行列を作成する際のタイル順序列を説明する図である。It is a figure explaining the tile order sequence at the time of creating an image kind label matrix. 本実施形態におけるブロック分割例を示す図である。It is a figure which shows the example of a block division | segmentation in this embodiment. 本実施形態における多次元特徴量空間を説明する図である。It is a figure explaining the multidimensional feature-value space in this embodiment. ラベル間のペナルティマトリックスの一例を示す図である。It is a figure which shows an example of the penalty matrix between labels. マッチングによるラベル列間の距離の算出を説明する図である。It is a figure explaining calculation of the distance between label strings by matching.

Claims

Acquisition means for acquiring image data composed of one or more tiles, wherein the tile includes at least a header including encoding information about encoding of the tile;
Discrimination means for discriminating the image type for each tile based on the encoding information;
With
The information processing apparatus according to claim 1, wherein the tile is divided into a plurality of frequency components.

The encoded information includes a first coding pass number in a low frequency component and a second coding pass number in a high frequency component among the plurality of frequency components constituting the tile,
The information processing apparatus according to claim 1, wherein the determination unit performs the determination based on a ratio between the first coding pass number and the second coding pass number.

The encoded information includes a first compressed image data length in a low-frequency component and a second compressed image data length in a high-frequency component among the plurality of frequency components constituting the tile,
The information processing apparatus according to claim 1, wherein the determination unit performs the determination based on a ratio between the first compressed image data length and the second compressed image data length.

The encoding information is the number of zero bit planes in a high frequency component among the plurality of frequency components constituting the tile,
The information processing apparatus according to claim 1, wherein the determination unit performs the determination based on a size of the number of zero bit planes.

The information processing apparatus according to claim 1, wherein the image type is at least one of a character image and a graphic image and a natural image.

The information processing apparatus according to claim 1, wherein the image data is encoded data based on a JPEG2000 system.

The acquisition means further acquires first image data composed of a first tile and second image data composed of a second tile,
The determining means determines the image type of the first and second tiles for each tile,
Further, the similarity between the first and second image data is determined based on the comparison between the image type of the first tile and the image type of the second tile determined by the determining unit. The information processing apparatus according to claim 1, further comprising a first determination unit.

The tile further includes one or more pixel information,
When the first determination unit determines that the first and second image data are similar, the pixel information included in the first tile and the pixel information included in the second tile The information processing apparatus according to claim 7, further comprising: a second determination unit that determines the similarity between the first and second image data based on the second image data.

The pixel information includes at least RGB color information,
Storage means for storing a plurality of areas defined in the RGB color space;
The second determination means includes
The determination is performed based on an area to which the pixel information included in the first tile belongs and an area to which the pixel information included in the second tile belongs. The information processing apparatus according to claim 8.

An acquisition step of acquiring image data composed of one or more tiles, wherein the tile includes at least a header including encoding information about encoding of the tile;
A determination step of determining an image type for each tile based on the encoding information;
With
The method for controlling an information processing apparatus, wherein the tile is divided into a plurality of frequency components.

A computer program for causing a computer to function as the information processing apparatus according to claim 1.

A computer-readable storage medium storing the computer program according to claim 11.