JPH05165896A

JPH05165896A - Picture data retrieval device and picture data retrieval method

Info

Publication number: JPH05165896A
Application number: JP3330267A
Authority: JP
Inventors: Kazuhiro Kondo; 和弘近藤
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1991-12-13
Filing date: 1991-12-13
Publication date: 1993-07-02

Abstract

PURPOSE:To improve the retrieval efficiency by storing hierarchical encoded picture data according to hierarchy, selecting and displaying picture data which is similar to a prescribed criterion by reading the lowest hierarchical data at the time of a retrieval. CONSTITUTION:A picture signal to be obtained from an input terminal 1 is encoded by a hierarchical encoder 2 and is stored at the prescribed position on a disk 8 according to hierarchy through a disk input/output control unit 3. When picture data is retrieved, the frame number of a picture to be a retrieval object is successively outputted from a retrieval unit 4 to the disk input/ output control unit 3. The disk input/output control unit 3 reads the picture data having the pertinent number of layers of a designated picture data from the disk 8 and outputs the data to the degree of similarity decision unit 5. At this place, the lowest hierarchy picture data having a small amount of information is read from the disk 8, each picture data is compared with the prescribed criterion of degree of similarity of the degree of similarity decision unit 5, and picture data which is similar to this criterion is selected and displayed.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は大量に蓄積された画像デ
−タの中から目的としている画像デ−タを選出するため
の画像データ検索装置及び画像データ検索方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image data retrieving apparatus and an image data retrieving method for selecting desired image data from a large amount of accumulated image data.

【０００２】[0002]

【従来の技術】従来、この種の方法としては例えば情報
処理学会研究報告、マルチメディア通信と分散処理４２
−５に記載されているものがある。ここでは階層的に、
符号化された画像デ−タを蓄積した画像デ−タベ−ス・
システムを開示している。2. Description of the Related Art Conventionally, as a method of this kind, for example, a research report of Information Processing Society of Japan, multimedia communication and distributed processing 42
-5 are listed. Here, hierarchically,
Image data base that stores encoded image data
The system is disclosed.

【０００３】[0003]

【発明が解決しようとする課題】上記従来技術では、検
索後の符号化された画像デ−タを伝送して伝送時間を削
減している。また、階層符号化されたデ−タの内、下位
符号のみを復号することによりブラウジングの効率を上
げている。しかしながら、多量の処理を必要とする場
合、全ての下位符号から再生された画像をオペレータが
見なければならず、検索の効率そのものは効率化されな
い。In the above prior art, the encoded image data after retrieval is transmitted to reduce the transmission time. Further, the efficiency of browsing is improved by decoding only the lower order code of the hierarchically encoded data. However, when a large amount of processing is required, the operator must see the images reproduced from all the lower codes, and the search efficiency itself is not efficient.

【０００４】本発明の目的は、階層的に符号化されてい
る画像デ−タの下位符号を用い、それに任意の条件を与
え、自動的に目的の画像を検索するための画像検索装置
及び画像検索方法を提供することにある。An object of the present invention is to use a lower code of image data which is hierarchically coded, give an arbitrary condition to the lower code, and automatically search for a desired image. To provide a search method.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するため
に、本発明では階層符号化された画像デ−タを階層別に
記憶される。検索時には上記符号化デ−タの内、まずは
下位階層のデ−タのみを読みだし、これをあらかじめ定
められた判定基準と比較し、この基準を満たした画像デ
−タのみを改めて読みだす。In order to achieve the above object, in the present invention, layer-encoded image data is stored for each layer. At the time of retrieval, of the above-mentioned coded data, first, only the data of the lower hierarchy is read, this is compared with a predetermined criterion, and only the image data that satisfies this criterion is read again.

【０００６】[0006]

【作用】一般に階層符号化されたデ−タは、下位階層の
ものは元のデ−タを極めて粗く表現しており、これに上
位階層が付加されることにより徐々に詳細な表現にな
る。例えば周波数成分により階層符号化されたデ−タは
下位階層では低周波成分のみで表現され、上位階層が加
わるにつれより鋭敏なエッジ成分等が現れることにな
る。In general, the hierarchically encoded data of the lower layer expresses the original data extremely roughly, and by adding the upper layer to this, it becomes gradually detailed. For example, the data hierarchically coded by the frequency component is represented by only the low frequency component in the lower layer, and more sensitive edge components and the like appear as the upper layer is added.

【０００７】全階層のデ−タ量に比べ、下位階層の量は
数分の１であり、検索処理に必要となる時間もこれに比
例して少なくなる。このことを利用して、下位階層のデ
−タのみでまず高速に粗検索を行なう。この結果検索さ
れたいくつかの候補群の下位階層のデ−タを復号し、ユ
−ザに表示し、候補を会話的に絞っていく。このとき、
下位階層デ−タの転送、および復号に必要となる時間も
そのデ−タ量に比例して、全階層を用いる場合の数分の
１になる。The amount of data in the lower layers is a fraction of the amount of data in all the layers, and the time required for the search process also decreases in proportion to this. Utilizing this fact, a rough search is first performed at high speed using only the data of the lower hierarchy. As a result, the data of the lower hierarchy of some of the retrieved candidate groups is decoded, displayed on the user, and the candidates are narrowed down interactively. At this time,
The time required for the transfer and decoding of the lower layer data is proportional to the amount of the data, and is a fraction of that in the case of using all the layers.

【０００８】次に、残った候補群について更に上位の階
層符号も用いて検索処理を行なうことにより、更に候補
を絞り、再び復号画像を提示し、候補を絞る。以上の手
順を繰り返すことにより、目的の画像を高速に、また会
話的に検索することができる。Next, a search process is performed on the remaining candidate group also by using a higher layer code to narrow down the candidates, present decoded images again, and narrow down the candidates. By repeating the above procedure, the target image can be retrieved at high speed and interactively.

【０００９】[0009]

【実施例】以下、図面を用いて本発明の実施例について
説明する。図１は本発明による画像デ−タを検索する第
１の実施例を示す。入力端子１より得られる画像信号は
まず階層符号器２にて符号化され、ディスク入出力制御
ユニット３を通してディスク８上の所定の位置に格納さ
れる。階層符号化については例えばCommunication of t
he ACM, Vol. 34, No. 4(1991.4)に詳しく記載されてい
る。上記画像デ−タはディスク８に階層別に格納され
る。例えば周波数成分で階層化を行なう場合、ブロック
・サイズを８×８、階層数を１６とすれば、最低階層に
は変換係数１から４が、階層２には５から８が、以降各
階層に４係数づつが格納される。上記過程を複数種類の
画像シ−ケンスについて行なうことにより、ディスク上
に階層符号化画像デ−タベ−スが構築される。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 shows a first embodiment for retrieving image data according to the present invention. The image signal obtained from the input terminal 1 is first encoded by the hierarchical encoder 2 and stored in a predetermined position on the disk 8 through the disk input / output control unit 3. For hierarchical coding, for example, Communication of t
It is described in detail in he ACM, Vol. 34, No. 4 (1991.4). The image data is stored in the disk 8 in layers. For example, when layering with frequency components, if the block size is 8 × 8 and the number of layers is 16, conversion coefficients 1 to 4 are assigned to the lowest layer, 5 to 8 are assigned to layer 2, and thereafter each layer is assigned. Each of the four coefficients is stored. By performing the above process for a plurality of types of image sequences, a hierarchically encoded image database is constructed on the disc.

【００１０】次に上記画像デ−タベ−スよりユ−ザが必
要とする画像デ−タを検索する方法について説明する。
検索ユニット４からは検索対象となる画像のフレ−ム番
号がディスク入出力制御ユニット３に順に出力される。
ディスク入出力制御ユニット３は指定された画像デ−タ
の内、該当するレイヤ数の画像デ−タをディスクより読
みだし、類似度判定ユニット５に出力する。上記画像デ
−タはあらかじめ定められた判定基準と比較され、目的
とする画像にどの程度近いかが判定される。Next, a method of retrieving image data required by the user from the above-mentioned image database will be described.
The frame number of the image to be searched is sequentially output from the search unit 4 to the disk input / output control unit 3.
The disk input / output control unit 3 reads the image data of the corresponding number of layers from the specified image data from the disk and outputs it to the similarity determination unit 5. The image data is compared with a predetermined criterion to determine how close it is to the target image.

【００１１】図２に類似度判定ユニット５の構成を示
す。ここで、前記ディスク入出力制御ユニット３によ
り、画像デ−タメモリ２３にはあらかじめ所定フレ−ム
の画像デ−タが格納されているものとする。このメモリ
からカウンタ２０、２１、２２出力をアドレスとしたデ
−タが順に読みだされる。水平カウンタ２１は画素対応
に発生されるクロック信号Φ₁をカウントし、その出力
は画像デ−タメモリより読みだすデ−タのブロック内水
平アドレスを示す。一方、垂直カウンタ２２は水平カウ
ンタ２１のキャリ−をカウントし、その出力は画像デ−
タメモリより読みだすデ−タのブロック内垂直アドレス
を示す。カウンタ２１、２２の進数はブロック・サイズ
に一致している。よってブロック・サイズを８×８とす
れば、カウンタ２１、２２は各々８進となる。ブロック
カウンタ２０は垂直カウンタ２２のキャリ−をカウント
し、その出力は画像デ−タメモリより読みだすデ−タの
ブロック番号を示す。このカウンタの進数は１フレ−ム
を構成するブロック数に一致している。FIG. 2 shows the configuration of the similarity determination unit 5. Here, it is assumed that image data of a predetermined frame is stored in advance in the image data memory 23 by the disk input / output control unit 3. Data with the outputs of the counters 20, 21 and 22 as addresses are sequentially read from this memory. The horizontal counter 21 counts the clock signal Φ ₁ generated for each pixel, and the output thereof indicates the horizontal address within the block of the data read from the image data memory. On the other hand, the vertical counter 22 counts the carry of the horizontal counter 21, and its output is the image data.
Indicates the vertical address within a block of data read from the data memory. The base numbers of the counters 21 and 22 match the block size. Therefore, if the block size is 8 × 8, each of the counters 21 and 22 is octal. The block counter 20 counts the carry of the vertical counter 22, and the output thereof indicates the block number of the data read from the image data memory. The radix of this counter corresponds to the number of blocks constituting one frame.

【００１２】上記カウンタで指定されて画像デ−タメモ
リより読みだされた画像デ−タはアキュムレ−タ２６に
入力される。アキュムレ−タ２６はフレ−ムの先頭でク
ロック信号Φ₂に同期してクリアされる。入力デ−タは
加算同期信号２１２に同期してアキュムレ−タ２６の内
容に加算される。加算同期信号は水平アドレス、および
垂直アドレスが一定値以上のときに発生する。比較器２
５の出力は水平カウンタ値が閾値ＴＨｈ以上の時に１、
これ以下の時には０となる。同様に比較器２４の出力は
垂直カウンタ値が閾値ＴＨｖ以上の時に１、これ以下の
時には０となる。比較器２４、２５の出力は論理積回路
２７に入力され、双方が１の時のみに１となり、論理積
回路２８よりクロック信号２１１を通過させる。この結
果、フレ−ム内の各ブロックについて、直交変換係数の
内、高域成分の和が算出される。図３にこの例を示す。
上記算出値は比較器２９にて閾値ＴＨｆと比較される。
この結果は検索ユニット４に出力される。The image data designated by the counter and read from the image data memory is input to the accumulator 26. The accumulator 26 is cleared at the beginning of the frame in synchronization with the clock signal Φ ₂ . The input data is added to the contents of the accumulator 26 in synchronization with the addition sync signal 212. The addition sync signal is generated when the horizontal address and the vertical address are equal to or more than a certain value. Comparator 2
The output of 5 is 1 when the horizontal counter value is greater than or equal to the threshold value THh,
When it is less than this, it becomes 0. Similarly, the output of the comparator 24 is 1 when the vertical counter value is equal to or larger than the threshold value THv, and is 0 when the vertical counter value is smaller than the threshold value THv. The outputs of the comparators 24 and 25 are input to the logical product circuit 27, which becomes 1 only when both are 1, and the clock signal 211 is passed from the logical product circuit 28. As a result, the sum of the high frequency components of the orthogonal transform coefficients is calculated for each block in the frame. FIG. 3 shows this example.
The calculated value is compared with the threshold value THf by the comparator 29.
The result is output to the search unit 4.

【００１３】図４に検索ユニットの構成を示す。フレ−
ムカウンタ４０にはあらかじめ検索を開始するフレ−ム
番号４３、および最終検索対象フレ−ム番号４４が格納
されている。カウンタはクロック信号Φ₂２１０に同期
して検索対象となるフレ−ム番号４２をディスク入出力
制御ユニット３に出力する。類似度判定の結果、類似度
７が１となったフレ−ムの番号９はゲ−ト４１を通して
出力される。以上の処理を検索対象となっている全フレ
−ムについて行ない、検索終了フラグ１１によりディス
ク入出力制御ユニット３に検索終了を知らせる。FIG. 4 shows the structure of the search unit. Frame
The frame counter 40 stores a frame number 43 for starting a search and a frame number 44 for the final search. The counter outputs the frame number 42 to be searched to the disk input / output control unit 3 in synchronization with the clock signal Φ ₂ 210. As a result of the similarity determination, the frame number 9 having the similarity 7 of 1 is output through the gate 41. The above processing is performed for all the frames to be searched, and the search end flag 11 is used to notify the disk input / output control unit 3 of the end of the search.

【００１４】以上の過程を更に上の階層デ−タを用いて
会話的に行なって候補画像をしぼる。この手順を図５に
示す。まずユ−ザが検索範囲となるフレ−ムを指定し、
検索基準を選定する。例えば前記の様に、高周波成分量
が検索基準として選ばれる。階層数は最低階層のみでス
タ−トする。以降の処理はユ−ザが満足するまで繰り返
される。The above process is interactively performed using the upper hierarchical data to narrow down the candidate images. This procedure is shown in FIG. First, the user specifies the frame that is the search range,
Select search criteria. For example, as described above, the high frequency component amount is selected as the search criterion. Start only with the lowest number of layers. The subsequent processing is repeated until the user is satisfied.

【００１５】検索範囲内の全フレ−ムについて検索処理
が繰り返される。この後、検索されたフレ−ムに属する
最低階層の符号化デ−タはユ−ザ端末に伝送され、復号
されて表示される。ユ−ザはこれを見て更に検索を続け
るか否かを応答し、続ける場合は検索範囲を改めて指定
し、１階層上位を指定し、再び検索を繰り返す。The search process is repeated for all the frames within the search range. Thereafter, the lowest layer encoded data belonging to the retrieved frame is transmitted to the user terminal, decoded and displayed. The user sees this and responds whether or not to continue the search. If the search is to be continued, the search range is designated again, the upper one layer is designated, and the search is repeated again.

【００１６】以上の例では高周波成分が多いフレ−ムを
会話的に検索する場合について説明した。これにより、
微細パタ−ンの多い画像、例えばチェッカ模様が多く含
まれる画像が会話的に検出できる。検出対象の多い初期
段階では低階層のデ−タのみを用いているので検索処理
が短期間ででき、また低階層のデ−タのみをユ−ザ端末
に伝送、表示しているためこれも高速に、かつ経済的に
行なえる。検索範囲がある程度絞られたのちに、長処理
時間を必要とする上位階層を用いた詳細な検索を行なう
ので、全体としては処理時間を抑えることができ、効率
的である。尚、上記の実施例においては、高周波成分が
多いフレームを検索する場合に限定して説明したが、あ
る領域の周波数成分を持つフレームを検索することにも
応用できる。例えば、垂直・水平カウンタ２１，２２か
らの出力を所定の閾値と比較する際に、下限（ＴＨ_v，
ＴＨ_h）の条件の他、上限の条件を加え、それらのアン
ドをとって、画像データを検索することが可能である
（図示略）。これらの条件の具体的な値については、類
比判断の基準となる画像から予め垂直・水平方向の周波
数成分を測定しておき、その値の許容量を考慮して定め
ることができる。このようにすると、膨大な量の画像デ
ータから、特定の画像と類似する画像データのみを高速
に検索することが出来る。In the above example, the case of interactively retrieving a frame having many high frequency components has been described. This allows
An image with many fine patterns, for example, an image containing many checker patterns can be detected interactively. Since only low-layer data is used in the initial stage where there are many detection targets, the search process can be performed in a short period of time, and only low-layer data is transmitted and displayed to the user terminal. It can be done quickly and economically. After the search range is narrowed down to some extent, a detailed search is performed using the upper hierarchy that requires a long processing time, so that the processing time can be suppressed as a whole, which is efficient. In the above embodiment, the description has been limited to the case of searching for a frame having many high frequency components, but the present invention can be applied to searching for a frame having a frequency component of a certain area. For example, when comparing the outputs from the vertical / horizontal counters 21 and 22 with a predetermined threshold value, the lower limit (TH _v ,
It is possible to search for image data by adding the condition of the upper limit in addition to the condition of TH _h ) and taking the AND of them. Specific values of these conditions can be determined in advance by measuring the frequency components in the vertical / horizontal directions from the image serving as a reference for analogy determination, and considering the allowable amount of the values. By doing so, it is possible to quickly search only image data similar to a specific image from a huge amount of image data.

【００１７】また、上記の実施例において、階層符号化
デ−タを復号して、下位階層から順に検索を行なうこと
もできる。図６にこの構成を示す。この図で第１の実施
例と異なる点は、類似度判定ユニットに入力される前に
画像デ−タが復号されることである。その他の構成は第
１の実施例と同様なので、ここでは省略する。Further, in the above embodiment, it is also possible to decode the hierarchically encoded data and perform the search in order from the lower hierarchy. FIG. 6 shows this configuration. The difference from the first embodiment in this figure is that the image data is decoded before being input to the similarity determination unit. The rest of the configuration is the same as that of the first embodiment, so it is omitted here.

【００１８】図７に復号されたデ−タに占める赤色成分
の多い画像を検索する類似度判定ユニットの構成を示
す。ここで、前記ディスク入出力制御ユニット３によ
り、Ｒ成分メモリ７１、Ｇ成分メモリ７２、Ｂ成分メモ
リ７３各々には、あらかじめ所定フレ−ムの画像デ−タ
が復号され、色成分別に格納されているものとする。こ
のメモリからカウンタ７４の出力をアドレスとしたデ−
タが順に読みだされる。各メモリより読みだされた画像
デ−タは比較器７５、７６、７７に入力され、閾値ＴＨ
ｒ、ＴＨｇ、ＴＨｂに比較される。各々の閾値を越える
成分にたいしては１が出力される。この出力はゲ−ト７
８に入力され、Ｒ成分のみが閾値を越える場合に１を出
力し、その他の場合は０を出力する。すなわちこの出力
は該当する画素が一定以上の「赤さ」を示していること
を判定している。上記出力はカウンタ７９のカウント制
御入力に接続され、これが１のときのみクロック信号Φ
₁２１１をカウントする。以上の処理を１フレ−ムにつ
いて繰り返すことにより、対象フレ−ム内に赤色画素が
占める量が算出される。これが閾値ＴＨｆを越える場合
は比較器２９の出力は１となり、これ以外は０となる。
この出力は検索ユニットに出力される。以降の処理は第
１の実施例と同様である。この結果、赤色画素が占める
割合の大きいフレ−ムが検索される。上記過程を図５の
手順と同様に従い階層数を増やして、ユ−ザが満足する
まで繰り返す。FIG. 7 shows the configuration of a similarity determination unit for searching an image having a large amount of red color components in the decoded data. Here, the disk input / output control unit 3 previously decodes image data of a predetermined frame into the R component memory 71, the G component memory 72, and the B component memory 73, and stores the image data for each color component. Be present. The data output from the counter 74 is used as an address from this memory.
Data is read in order. The image data read from each memory is input to the comparators 75, 76 and 77, and the threshold value TH
r, THg, THb. 1 is output for components exceeding each threshold value. This output is gate 7
It is input to 8, and outputs 1 when only the R component exceeds the threshold value, and outputs 0 otherwise. That is, this output determines that the corresponding pixel exhibits "redness" above a certain level. The output is connected to the count control input of the counter 79, and only when this is 1, the clock signal Φ
Count ₁ 211. By repeating the above process for one frame, the amount of red pixels occupied in the target frame is calculated. If this exceeds the threshold THf, the output of the comparator 29 is 1, and otherwise 0.
This output is output to the search unit. The subsequent processing is the same as in the first embodiment. As a result, a frame having a large proportion of red pixels is searched. The above process is repeated in the same manner as in the procedure of FIG. 5 until the number of layers is increased and the user is satisfied.

【００１９】以上の例では赤色成分が多いフレ−ムを会
話的に検索する場合について説明した。例えば服飾カタ
ログ等の検索で、赤をベ−スにしたものを会話的に検出
する場合等が考えられる。第１の実施例と同様、検出対
象の多い初期段階では低階層のデ−タのみを用いている
ので検索処理が短期間ででき、また低階層のデ−タのみ
をユ−ザ端末に伝送、表示しているためこれも高速に、
かつ経済的に行なえる。検索範囲がある程度絞られたの
ちに、長処理時間を必要とする上位階層を用いた詳細な
検索を行なうので、全体としては処理時間を抑えること
ができ、効率的である。また、復号後のデ−タで検索を
行なっているので、より直接的な基準での検索が行なえ
る。In the above example, the case of interactively searching for a frame having a large amount of red component has been described. For example, it is conceivable to search for a clothing catalog or the like and interactively detect a red-based product. Similar to the first embodiment, since only low-layer data is used in the initial stage where there are many detection targets, search processing can be performed in a short period of time, and only low-layer data is transmitted to the user terminal. , Because it is displayed, this is also fast,
And can be done economically. After the search range is narrowed down to some extent, a detailed search is performed using the upper hierarchy that requires a long processing time, so that the processing time can be suppressed as a whole, which is efficient. In addition, since the search is performed using the decrypted data, the search can be performed based on a more direct criterion.

【００２０】次に複数の判定基準を検索に用いた第３の
実施例について説明する。図８にこの構成を示す。第２
の実施例に比べて多数の類似度判定ユニット５を持って
いることが特徴となっている。階層符号器２、ディスク
入出力ユニット３等の構成は前記実施例と同様であるた
め、その説明はここでは省略する。ただし、階層符号器
２は複数種類の階層化を行ない、ディスク内に別々に格
納する。図９にこの例を示す。本図にはＲ成分のみを示
し、同様のデ−タがＧ、およびＢ成分についても存在す
るとする。４階層よりなる例について示し、最低階層の
み３種の階層構造を持つとする。前記３種の階層のう
ち、第１の階層種はスペクトル成分による階層であり、
この例を図１０に示す。入力画像は１画素当り８ビット
のスペクトル成分に変換されるとし、８×８のブロック
単位で処理されるものとする。このスペクトル成分は図
１１に示す順序で読み取られ、最低階層では１ブロック
につき最初の１６成分のみが１フレ−ム当り１６２０ブ
ロック、１シ−ケンス８フレ−ム分について格納され
る。スペクトル成分については第２、３、４階層も格納
されており、各々１７から３２、３３から４８、４９か
ら６４番目の係数が各々１６係数ずつ格納されている。Next, a third embodiment using a plurality of judgment criteria for searching will be described. FIG. 8 shows this configuration. Second
It is characterized in that it has a large number of similarity determination units 5 as compared with the embodiment of FIG. The configurations of the hierarchical encoder 2, the disk input / output unit 3 and the like are the same as those in the above-mentioned embodiment, and the description thereof will be omitted here. However, the hierarchical encoder 2 performs a plurality of types of hierarchization and stores them in the disc separately. FIG. 9 shows this example. In this figure, only the R component is shown, and it is assumed that similar data exists for the G and B components. An example of four layers is shown, and it is assumed that only the lowest layer has three types of hierarchical structure. Of the three layers, the first layer type is a layer based on spectral components,
An example of this is shown in FIG. It is assumed that the input image is converted into a spectral component of 8 bits per pixel and is processed in 8 × 8 block units. The spectral components are read in the order shown in FIG. 11, and in the lowest layer, only the first 16 components per block are stored for 1620 blocks per frame and 1 frame for 8 frames. The second, third, and fourth hierarchies are also stored for the spectral components, and 16th to 17th to 32nd, 33 to 48th, and 49th to 64th coefficients are stored respectively.

【００２１】第２の階層種は時間による階層であり、す
なわち駒落しを行なって階層化を行なう。図１２にこの
例を示す。１フレ−ムについては１画素当り８ビットの
デ−タを各ブロックについて格納し、上記デ−タを４フ
レ−ム毎、すなわちこの例では１、および４フレ−ム目
について格納する。この時のデ−タ量は前記スペクトル
成分の第１階層と同量である。The second layer type is a layer based on time, that is, frame dropping is performed to form a layer. FIG. 12 shows this example. For one frame, 8-bit data per pixel is stored for each block, and the data is stored for every four frames, that is, for the first and fourth frames in this example. The amount of data at this time is the same as that of the first layer of the spectral component.

【００２２】第３の階層種は振幅成分による階層であ
る。この例を図１３に示す。１係数当り上位２ビット、
１ブロック当り６４係数、１フレ−ム当り１６２０ブロ
ック、１シ−ケンス８フレ−ム分について格納される。
この時のデ−タ量は前記２種の階層と同量である。The third layer type is a layer based on amplitude components. An example of this is shown in FIG. Upper 2 bits per coefficient,
Stored are 64 coefficients per block, 1620 blocks per frame, and 8 frames for one sequence.
The amount of data at this time is the same as that of the two types of layers.

【００２３】上記３種の最下位階層３種のデ−タがまず
読みだされ、２種の階層デ−タは局部復号器６０にて復
号され、残り１種は復号されずに、各々対応する類似度
判定ユニット５に分配される。階層別類似度判定ユニッ
ト５は、復号されたデ−タは図７と同様のもの、復号さ
れないデ−タは図２で図示したものが用いられる。ただ
し、双方とも最終段の比較器２９を除いた構成とする。
その動作は前記と同様である。各類似度判定ユニットは
あらかじめユ−ザが設定した重みで乗算され、総和が採
られた後に検索ユニットに出力される。以降の処理は第
１、および第２の実施例と同様である。必要に応じて上
記過程を図５の手順と同様に従い階層数を増やして、ユ
−ザが満足するまで繰り返す。また、図５の手順を、階
層数を増やすのに加え、重み係数を会話的に変更するこ
とにより、ユ−ザが所望する画像を検索することができ
る。また、重み係数を０にすることにより、検索に用い
る階層の種類を変更することも可能である。The data of the three types of the three lowest layers is read out first, the two types of layer data are decoded by the local decoder 60, and the remaining one type is not decoded and corresponds respectively. Are distributed to the similarity determination units 5. In the hierarchical similarity determination unit 5, the decoded data is the same as that shown in FIG. 7, and the undecoded data is that shown in FIG. However, both are configured so that the comparator 29 at the final stage is removed.
The operation is similar to the above. Each similarity determination unit is multiplied by the weight set by the user in advance, the sum is taken, and then output to the search unit. The subsequent processing is the same as in the first and second embodiments. If necessary, the above process is repeated in the same manner as the procedure of FIG. 5 by increasing the number of layers and repeating until the user is satisfied. Further, in addition to increasing the number of layers in the procedure of FIG. 5 and interactively changing the weighting coefficient, the image desired by the user can be searched. Also, by setting the weighting coefficient to 0, it is possible to change the type of hierarchy used for the search.

【００２４】以上の実施例では第１、第２の実施例と同
様、検出対象の多い初期段階では低階層のデ−タのみを
用いているので検索処理が短期間ででき、また低階層の
デ−タのみをユ−ザ端末に伝送、表示しているためこれ
も高速に、かつ経済的に行なえる。更にここでは複数の
階層化デ−タを用いて、各々をユ−ザ定義の重み付けを
加えた後検索基準としているため、多種類の判定基準を
柔軟に用いて効率的な検索作業を行なうことができる。In the above-described embodiment, as in the first and second embodiments, only the data of the lower hierarchy is used in the initial stage where there are many objects to be detected, so that the retrieval process can be performed in a short period of time, and the data of the lower hierarchy is Since only the data is transmitted and displayed to the user terminal, this can be done at high speed and economically. Further, since a plurality of layered data are used as search criteria after adding user-defined weights, it is possible to flexibly use various types of criteria to perform efficient search work. You can

【００２５】[0025]

【発明の効果】本発明によれば、検出対象の多い初期段
階では低階層のデ−タのみを用いているので検索処理が
短期間ででき、また低階層のデ−タのみをユ−ザ端末に
伝送、表示しているためこれも高速に、かつ経済的に行
なえる。検索範囲がある程度絞られたのちに、長処理時
間を必要とする上位階層を用いた詳細な検索を行なうの
で、全体としては処理時間を抑えることができ、効率的
である。以上の検索過程をユ−ザが必要とした範囲まで
会話的に行なえば良く、非常に柔軟な検索作業が可能と
なる。According to the present invention, since only low-layer data is used in the initial stage where there are many detection targets, search processing can be performed in a short period of time, and only low-layer data can be used by the user. This is also fast and economical because it is transmitted and displayed on the terminal. After the search range is narrowed down to some extent, a detailed search is performed using the upper hierarchy that requires a long processing time, so that the processing time can be suppressed as a whole, which is efficient. It is sufficient to interactively perform the above search process to the extent required by the user, and a very flexible search operation becomes possible.

[Brief description of drawings]

【図１】本発明の第１の実施例の構成を示す図である。FIG. 1 is a diagram showing a configuration of a first exemplary embodiment of the present invention.

【図２】本発明の第１の実施例における類似度判定ユニ
ットの構成を示す図である。FIG. 2 is a diagram showing a configuration of a similarity determination unit in the first exemplary embodiment of the present invention.

【図３】本発明の第１の実施例において、総和を採る直
交変換係数の範囲を示す図である。FIG. 3 is a diagram showing a range of orthogonal transform coefficients for which a sum is calculated in the first embodiment of the present invention.

【図４】本発明の第１の実施例における検索ユニットの
構成を示す図である。FIG. 4 is a diagram showing a configuration of a search unit in the first embodiment of the present invention.

【図５】本発明の第１の実施例における会話的検索手順
を示す図である。FIG. 5 is a diagram showing a conversational search procedure in the first embodiment of the present invention.

【図６】本発明の第２の実施例の構成を示す図である。FIG. 6 is a diagram showing a configuration of a second exemplary embodiment of the present invention.

【図７】本発明の第２の実施例における類似度判定ユニ
ットの構成を示す図である。FIG. 7 is a diagram showing a configuration of a similarity determination unit according to a second exemplary embodiment of the present invention.

【図８】本発明の第３の実施例の構成を示す図である。FIG. 8 is a diagram showing a configuration of a third exemplary embodiment of the present invention.

【図９】本発明の第３の実施例における階層符号の格納
の様子を示す図である。FIG. 9 is a diagram showing how a hierarchical code is stored in the third embodiment of the present invention.

【図１０】本発明の第３の実施例におけるスペクトル成
分による階層符号格納の様子を示す図である。FIG. 10 is a diagram showing how hierarchical codes are stored by spectral components in the third embodiment of the present invention.

【図１１】本発明の第３の実施例における直交変換係数
のスキャン順序を示す図である。FIG. 11 is a diagram showing a scan order of orthogonal transform coefficients in the third embodiment of the present invention.

【図１２】本発明の第３の実施例における時間による階
層符号格納の様子を示す図である。FIG. 12 is a diagram showing how hierarchical codes are stored according to time in a third embodiment of the present invention.

【図１３】本発明の第３の実施例における振幅成分によ
る階層符号格納の様子を示す図である。FIG. 13 is a diagram showing how hierarchical codes are stored by amplitude components in the third embodiment of the present invention.

[Explanation of symbols]

２…階層符号器、５…類似度判定ユニット、４…検索ユ
ニット、６０…局部復号器。2 ... Hierarchical encoder, 5 ... Similarity determination unit, 4 ... Search unit, 60 ... Local decoder.

Claims

[Claims]

1. An image data retrieval device for selecting arbitrary image data from a plurality of image data stored in advance in a plurality of hierarchies and hierarchized in a plurality of hierarchies. A recording unit for storing the image data in association with the frame number, and the image data of the lowest hierarchy having a small amount of information is sequentially read from the storage unit, and each image data is compared with a predetermined similarity determination standard to perform the similarity determination. An image data retrieving apparatus comprising: a similarity determining means for selecting image data similar to a reference; and a means for displaying the selected image data.

2. The similarity determining means determines the target image data by the weighted sum of the coded image data in each layer and the reference value obtained by comparing the determination criteria corresponding thereto. The image data according to claim 1, wherein the image data is selected.
Search device.

3. The similarity determining means searches for an image using the coded data of the lowest layer, and if the amount of data is equal to or more than a predetermined amount, the coded data of a higher layer is searched. Using the image again to calculate the selected data amount, the above procedure is repeated until the selected image data amount becomes less than or equal to a predetermined amount. The image data retrieval apparatus according to claim 1, wherein data is selected.

4. The similarity determining means searches for an image using the coding data of the lowest hierarchy and presents a representative frame to the user to select a candidate range. 2. A desired image data is selected by performing a search using the data of the upper hierarchy within the range and repeating the above procedure to narrow down the search range.
An image data retrieval device according to the item.

5. The target image data is selected according to the type and number of hierarchical codes used for retrieval set according to an instruction from the user, as the similarity criterion. The image data retrieval device according to claim 1.