JP2012124946A

JP2012124946A - Prediction vector generating method, image encoding method, image decoding method, prediction vector generator, image encoder, image decoder, prediction vector generating program, image encoding program and image decoding program

Info

Publication number: JP2012124946A
Application number: JP2012035461A
Authority: JP
Inventors: Takaya Yamamoto; 貴也山本
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2012-02-21
Filing date: 2012-02-21
Publication date: 2012-06-28

Abstract

PROBLEM TO BE SOLVED: To provide a prediction vector generating method that achieves superior encoding efficiency. SOLUTION: A method for generating a prediction vector of reference information, such as a motion vector or a disparity vector, that is used when a target image is divided into blocks and encoded or decoded. The method includes a prediction vector generation step that generates the prediction vector of a target block by using information indicating a distance corresponding to the target image and the reference information of a block adjacent to the target block.

Description

The present invention relates to a prediction vector generation method, an image encoding method, an image decoding method, a prediction vector generation device, an image encoding device, an image decoding device, a prediction vector generation program, an image encoding program, and an image decoding program.

Conventional moving picture coding schemes include MPEG (Moving Picture Experts Group)-2, MPEG-4, and MPEG-4 AVC/H.264. These schemes use motion-compensated inter-frame predictive coding, which reduces the amount of code by exploiting the temporal correlation of a moving picture. In motion-compensated inter-frame predictive coding, the image to be encoded is divided into blocks and a motion vector is obtained for each block, which enables efficient coding.

Furthermore, as described in Non-Patent Document 1, the MPEG-4 and H.264/AVC standards improve the compression of motion vectors by generating a prediction vector and encoding the difference between the motion vector of the encoding target block and the prediction vector. Specifically, as shown in FIG. 14, the prediction vector is formed from the median of the horizontal components and the median of the vertical components of the motion vectors (mv_a, mv_b, mv_c) of the block adjacent above the encoding target block (adjacent block A in the figure), the block adjacent to its upper right (adjacent block B in the figure), and the block adjacent to its left (adjacent block C in the figure). The difference vector between the motion vector and the prediction vector is then obtained.
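The median prediction described above can be illustrated with a minimal sketch. The function names `median_prediction_vector` and `difference_vector` and the sample vectors are hypothetical and chosen only for illustration; vectors are assumed to be (horizontal, vertical) integer pairs.

```python
from statistics import median

def median_prediction_vector(mv_a, mv_b, mv_c):
    """Component-wise median of three neighbouring motion vectors (x, y)."""
    horizontal = median([mv_a[0], mv_b[0], mv_c[0]])
    vertical = median([mv_a[1], mv_b[1], mv_c[1]])
    return (horizontal, vertical)

def difference_vector(mv, pv):
    """Difference between the actual motion vector and the prediction vector."""
    return (mv[0] - pv[0], mv[1] - pv[1])

# Hypothetical motion vectors of adjacent blocks A (above), B (upper right), C (left).
mv_a, mv_b, mv_c = (4, -2), (6, 0), (5, 7)
pv = median_prediction_vector(mv_a, mv_b, mv_c)
dv = difference_vector((5, 1), pv)  # only this difference needs to be encoded
```

When the neighbouring vectors resemble the target block's vector, the difference is small and costs few bits; this is the property that degrades when the neighbours belong to a different object.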

In recent years, MVC (Multiview Video Coding) has been formulated as an extension of the H.264 standard for encoding multi-view video, that is, a plurality of moving pictures in which the same subject or background is captured by a plurality of cameras. This coding scheme uses disparity-compensated predictive coding, which reduces the amount of code by using a disparity vector that represents the correlation between cameras.
The amount of code can also be reduced by applying a prediction vector to the disparity vector detected as a result of disparity-compensated prediction.

However, motion-compensated inter-frame predictive coding and disparity-compensated predictive coding exploit the correlation in the time direction and the correlation between cameras, respectively, so the detected motion vectors and disparity vectors are not correlated with each other. Consequently, when an adjacent block has been encoded with a coding scheme different from that of the encoding target block, the motion vector or disparity vector of that adjacent block cannot be used to generate the prediction vector.

To address this problem, Patent Document 1 improves the accuracy of the prediction vector when the coding scheme of an adjacent block differs from that of the encoding target block. When the encoding target block uses motion-compensated inter-frame predictive coding, the motion vector of the block that occupies the largest part of the region referenced by the adjacent block's disparity vector is used to generate the prediction vector. When the encoding target block uses disparity-compensated predictive coding, the disparity vector of the block that occupies the largest part of the region referenced by the adjacent block's motion vector is used instead.

MPEG-3DV, an ad hoc group of MPEG, is currently formulating a new standard that transmits a depth map together with video captured by conventional cameras.
A depth map is information representing the distance from the camera to the subject. It can be generated, for example, by acquiring it from a distance-measuring device installed near the camera. A depth map can also be generated by analyzing images captured by cameras at a plurality of viewpoints.

FIG. 15 shows an overall view of a system under the new MPEG-3DV standard. The new standard supports two or more viewpoints, but FIG. 15 is described for the case of two viewpoints.
In this system, cameras 602 and 604 capture a subject 601 and output images, and sensors 603 and 605, which are installed near the respective cameras and measure the distance to the subject, generate and output depth maps. An encoder 606 receives the images and the depth maps as input, encodes them using motion-compensated inter-frame predictive coding and disparity-compensated prediction, and outputs the result. A decoder 607 receives the transmitted encoder output as input, decodes it, and outputs decoded images and decoded depth maps. A display unit 608 receives the decoded images and the decoded depth maps as input and either displays the decoded images as they are or displays them after applying processing that uses the depth maps.

Patent Document 1: International Publication No. 2008/053746

Non-Patent Document 1: Sakae Okubo (supervising ed.), Shinya Kadono, Yoshihiro Kikuchi, and Teruhiko Suzuki (eds.), H.264/AVC Textbook, 3rd revised ed., Impress R&D, pp. 123-125 (motion vector prediction)

However, in Non-Patent Document 1 and Patent Document 1, when a motion vector or a disparity vector is predicted, the object shown in the target block may differ from the object shown in a block adjacent to the target block. In that case the objects may move in different directions or lie at very different distances from the camera, so the difference between the prediction vector and the motion vector or disparity vector becomes large and the coding efficiency may decrease.

The present invention has been made in view of these circumstances, and its object is to provide a prediction vector generation method, an image encoding method, an image decoding method, a prediction vector generation device, an image encoding device, an image decoding device, a prediction vector generation program, an image encoding program, and an image decoding program that achieve excellent coding efficiency.

(1) The present invention has been made to solve the problem described above. One aspect of the present invention is a prediction vector generation method for generating a prediction vector of reference information that is used when a target image to be encoded or decoded is divided into blocks, an inter-frame motion prediction coding scheme or a disparity-compensated prediction coding scheme is applied to each of the blocks, and the image is encoded or decoded by generating a predicted image of a target block, which is the block being encoded or decoded, based on a reference image of the target block and the reference information, which indicates the position of the region in the reference image that corresponds to the target block. The method comprises a prediction vector generation step of generating the prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(2) Another aspect of the present invention is an image encoding method that divides a target image to be encoded into blocks and, for each of the blocks, selects from a plurality of already encoded images a reference image to be used in predicting the target block, generates a predicted image by using reference information that designates the region in the reference image corresponding to the target block, and encodes the image by encoding the difference between the predicted image and the target block. The method comprises a prediction vector generation step of generating a prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(3) Another aspect of the present invention is an image decoding method that divides the entire target image to be decoded into blocks and, for each of the blocks, selects from a plurality of already decoded images a reference image to be used in predicting the target block, generates a predicted image by using reference information that designates the region in the reference image corresponding to the target block, and decodes the image by decoding the difference between the predicted image and the target block. The method comprises a prediction vector generation step of generating a prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(4) Another aspect of the present invention is a prediction vector generation device that generates a prediction vector of reference information that is used when a target image to be encoded or decoded is divided into blocks, an inter-frame motion prediction coding scheme or a disparity-compensated prediction coding scheme is applied to each of the blocks, and the image is encoded or decoded by generating a predicted image of a target block, which is the block being encoded or decoded, based on a reference image of the target block and the reference information, which indicates the position of the region in the reference image that corresponds to the target block. The device generates the prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(5) Another aspect of the present invention is a prediction vector generation program for a computer of a prediction vector generation device that generates a prediction vector of reference information that is used when a target image to be encoded or decoded is divided into blocks, an inter-frame motion prediction coding scheme or a disparity-compensated prediction coding scheme is applied to each of the blocks, and the image is encoded or decoded by generating a predicted image of a target block, which is the block being encoded or decoded, based on a reference image of the target block and the reference information, which indicates the position of the region in the reference image that corresponds to the target block. The program causes the computer to execute a prediction vector generation step of generating the prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(6) Another aspect of the present invention is an image encoding device that divides a target image to be encoded into blocks and, for each of the blocks, selects from a plurality of already encoded images a reference image to be used in predicting the target block, generates a predicted image by using reference information that designates the region in the reference image corresponding to the target block, and encodes the image by encoding the difference between the predicted image and the target block. The device generates a prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(7) Another aspect of the present invention is an image encoding program for a computer of an image encoding device that divides a target image to be encoded into blocks and, for each of the blocks, selects from a plurality of already encoded images a reference image to be used in predicting the target block, generates a predicted image by using reference information that designates the region in the reference image corresponding to the target block, and encodes the image by encoding the difference between the predicted image and the target block. The program causes the computer to execute a prediction vector generation step of generating a prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(8) Another aspect of the present invention is an image decoding device that divides the entire target image to be decoded into blocks and, for each of the blocks, selects from a plurality of already decoded images a reference image to be used in predicting the target block, generates a predicted image by using reference information that designates the region in the reference image corresponding to the target block, and decodes the image by decoding the difference between the predicted image and the target block. The device generates a prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

(9) Another aspect of the present invention is an image decoding program for a computer of an image decoding device that divides the entire target image to be decoded into blocks and, for each of the blocks, selects from a plurality of already decoded images a reference image to be used in predicting the target block, generates a predicted image by using reference information that designates the region in the reference image corresponding to the target block, and decodes the image by decoding the difference between the predicted image and the target block. The program causes the computer to execute a prediction vector generation step of generating a prediction vector of the target block by using information representing a distance corresponding to the target image and the reference information of a block adjacent to the target block.

According to the present invention, excellent coding efficiency can be achieved.

FIG. 1 is a schematic block diagram showing the configuration of the image encoding device 100 according to the first embodiment of the present invention.
FIG. 2 is a flowchart explaining the operation of the image encoding device 100 in the same embodiment.
FIG. 3 is a schematic block diagram showing the configuration of the prediction vector generation unit 108 in the same embodiment.
FIG. 4 is a flowchart explaining the operation of the prediction vector generation unit 108 in the same embodiment.
FIG. 5 is a diagram showing an example 701 of an encoding target image in the same embodiment.
FIG. 6 is a diagram showing the depth map 702 corresponding to the example 701 of the encoding target image in the same embodiment.
FIG. 7 is a schematic block diagram showing the configuration of the image decoding device 200 in the same embodiment.
FIG. 8 is a flowchart explaining the operation of the image decoding device 200 in the same embodiment.
FIG. 9 is a schematic block diagram showing the configuration of the prediction vector generation unit 108a according to the second embodiment of the present invention.
FIG. 10 is a flowchart explaining the operation of the prediction vector generation unit 108a in the same embodiment.
FIG. 11 is a diagram showing the depth map 702 corresponding to the example 701 of the encoding target image in the same embodiment.
FIG. 12 is a schematic block diagram showing the configuration of the prediction vector generation unit 108b according to the third embodiment of the present invention.
FIG. 13 is a flowchart explaining the operation of the prediction vector generation unit 108b in the same embodiment.
FIG. 14 is a diagram explaining conventional adjacent blocks.
FIG. 15 is a conceptual diagram showing the configuration of a system under the conventional new MPEG-3DV standard.

[First Embodiment]
Hereinafter, a first embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a schematic block diagram showing the configuration of an image encoding device 100 according to the present embodiment. The image encoding device 100 encodes (compresses) a two-viewpoint moving image for stereoscopic display. As shown in FIG. 1, the image encoding device 100 according to the present embodiment includes an image input unit 101, a block matching execution unit 102, a predicted image generation unit 103, a difference image encoding unit 104, a difference image decoding unit 105, a reference image memory 106, a reference information storage memory 107, a prediction vector generation unit 108, a difference reference information encoding unit 110, a reference image designation information storage memory 111, a reference image selection unit 112, a reference image designation information encoding unit 113, subtraction units 114 and 115, and an addition unit 116.

The image input unit 101 receives input of image data GD of an image to be encoded (target image). The image to be encoded in the present embodiment is a two-viewpoint moving image for stereoscopic display. The block matching execution unit 102 divides the image of the image data GD received by the image input unit 101 into blocks. Referring to the reference image memory 106, the block matching execution unit 102 selects, for each of the blocks, a reference image to be used for predicting that block from among the already encoded images and designates it to the reference image selection unit 112. Further, for each of the blocks, the block matching execution unit 102 performs block matching, which searches the selected reference image for the region corresponding to the block, and generates reference information for each block.

As the reference information indicating the position of the corresponding region, a vector from the block obtained by the division to the corresponding region is used. This vector is called a motion vector when the already encoded image is an image of the same viewpoint as the divided image but of a different frame (time), and is called a disparity vector when the already encoded image is an image of a viewpoint different from that of the divided image. In the present embodiment, the already encoded images that serve as candidates when searching for the corresponding block in block matching are only images of the same viewpoint and a different time from the divided image and images of a different viewpoint and the same time, but the candidates are not limited to these.
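The distinction between the two kinds of reference information can be sketched as follows. The `Picture` type and the function `reference_vector_kind` are hypothetical names introduced only for illustration; the sketch covers just the two candidate types used in the present embodiment and rejects anything else.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Picture:
    view: int  # viewpoint (camera) index
    time: int  # frame time index

def reference_vector_kind(target: Picture, reference: Picture) -> str:
    """Name the vector that points from a target block into the reference picture."""
    if reference.view == target.view and reference.time != target.time:
        return "motion"      # same viewpoint, different time
    if reference.view != target.view and reference.time == target.time:
        return "disparity"   # different viewpoint, same time
    # Other combinations are outside the candidates used in this embodiment.
    raise ValueError("reference picture is not a candidate in this sketch")

kind_temporal = reference_vector_kind(Picture(view=0, time=5), Picture(view=0, time=4))
kind_interview = reference_vector_kind(Picture(view=1, time=5), Picture(view=0, time=5))
```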

Referring to the reference image memory 106, the predicted image generation unit 103 places the image of the corresponding region, which the block matching execution unit 102 obtained by block matching, at the position of the block obtained by the original division, and thereby generates a predicted image. The subtraction unit 115 takes the difference between the pixel value of each pixel of the image received by the image input unit 101 and the pixel value of each pixel of the predicted image generated by the predicted image generation unit 103, and generates a difference image. The difference image encoding unit 104 applies processing such as a discrete cosine transform and quantization to the difference image generated by the subtraction unit 115 and generates difference image encoded data DE. The difference image decoding unit 105 decodes the encoded difference image. The addition unit 116 adds the pixel value of each pixel of the predicted image generated by the predicted image generation unit 103 to the pixel value of each pixel of the difference image decoded by the difference image decoding unit 105, generates a reference image, and stores it in the reference image memory 106.
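The loop formed by the subtraction unit 115, the difference image encoding unit 104, the difference image decoding unit 105, and the addition unit 116 can be sketched as below. This is a simplified illustration, not the encoder described here: a plain uniform quantizer with an assumed step size of 4 stands in for the discrete cosine transform and quantization, and all function names and sample values are hypothetical.

```python
def subtract(block, predicted):
    """Per-pixel difference image (role of subtraction unit 115)."""
    return [[b - p for b, p in zip(rb, rp)] for rb, rp in zip(block, predicted)]

def encode_residual(residual, step=4):
    """Stand-in for unit 104: uniform quantization only (no transform)."""
    return [[round(v / step) for v in row] for row in residual]

def decode_residual(levels, step=4):
    """Stand-in for unit 105: inverse of the quantizer above."""
    return [[q * step for q in row] for row in levels]

def add(predicted, residual):
    """Reconstruction stored as a reference image (role of addition unit 116)."""
    return [[p + r for p, r in zip(rp, rr)] for rp, rr in zip(predicted, residual)]

block = [[52, 55], [61, 59]]      # hypothetical target block
predicted = [[50, 50], [60, 60]]  # hypothetical predicted block
levels = encode_residual(subtract(block, predicted))
reconstructed = add(predicted, decode_residual(levels))
```

The reference image is built from the decoded difference rather than from the original pixels so that the encoder works with the same picture the decoder will later have.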

The reference information storage memory 107 stores the reference information of each block generated by the block matching execution unit 102. For each block, the prediction vector generation unit 108 generates a prediction vector PV of the block by using the part of the depth map DM, which is information indicating distance, that corresponds to the block, and the reference information NR of the blocks adjacent to the block, which is stored in the reference information storage memory 107. When the image is encoded in raster scan order, in which the image is scanned block row by block row from the upper left to the lower right, the adjacent blocks are the three blocks to the left of, above, and to the upper right of the target block. When generating a prediction vector, the prediction vector generation unit 108 in the present embodiment also uses the information RA that designates the reference image of each block, which is stored in the reference image designation information storage memory 111.
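The positions of the three adjacent blocks under raster scan order can be sketched as follows. The function name `adjacent_blocks`, the block-coordinate convention, and the handling of picture borders (neighbours outside the picture are simply omitted) are assumptions made for illustration; this passage does not state how borders are treated.

```python
def adjacent_blocks(bx, by, blocks_per_row):
    """Return the left, upper, and upper-right neighbours of block (bx, by).

    Coordinates are in block units with (0, 0) at the upper left.
    Neighbours that fall outside the picture are left out (an assumption).
    """
    candidates = {
        "left": (bx - 1, by),
        "upper": (bx, by - 1),
        "upper_right": (bx + 1, by - 1),
    }
    return {
        name: (x, y)
        for name, (x, y) in candidates.items()
        if 0 <= x < blocks_per_row and y >= 0
    }

inner = adjacent_blocks(1, 1, blocks_per_row=4)
right_edge = adjacent_blocks(3, 1, blocks_per_row=4)
```

All three positions precede the target block in raster scan order, so their reference information is already available when the target block is processed.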

The reference image selection unit 112 outputs information designating the reference image that the block matching execution unit 102 selected for each block to the reference image memory 106, the reference image designation information storage memory 111, and the reference image designation information encoding unit 113. The reference image designation information encoding unit 113 encodes the information designating the reference image of each block received from the reference image selection unit 112, and generates and outputs reference image designation information encoded data RAE. The subtraction unit 114 takes the difference between the reference information of each block generated by the block matching execution unit 102 and the prediction vector of each block generated by the prediction vector generation unit 108, and generates difference reference information. The difference reference information encoding unit 110 encodes the difference reference information generated by the subtraction unit 114 and generates difference reference information encoded data DRE.

FIG. 2 is a flowchart explaining the operation of the image encoding device 100. The description assumes that a plurality of images of the same viewpoint as the encoding target image and an image of another viewpoint at the same time position (time) as the encoding target image have already been encoded, that the encoding target image itself has been encoded up to a block partway through, and that the results are stored in the reference image memory 106, the reference information storage memory 107, and the reference image designation information storage memory 111.

First, image data GD of the image to be encoded is input from the image input unit 101 (A1). The block matching execution unit 102 divides the input encoding target image into blocks. Encoding is performed block by block. The image encoding device 100 repeats the following processing (steps A2 to A15) until all blocks in the encoding target image have been encoded.

The block matching execution unit 102 sends information indicating the encoding mode it is about to apply to the encoding target block to the reference image selection unit 112. Based on that information, the reference image selection unit 112 notifies the reference image memory 106 of the required reference image, and the reference image memory 106 outputs the designated reference image to the block matching execution unit 102. A reference image is an image that has already been encoded and decoded. Receiving reference images in this way, the block matching execution unit 102 performs block matching for each block in all encoding modes (motion-compensated inter-frame predictive encoding and disparity-compensated predictive encoding) (A3).
Block matching is a process of obtaining the absolute differences in luminance values between the encoding target block and regions of an already encoded image. Based on the matching residual and the reference information obtained as the result of block matching, the block matching execution unit 102 determines the encoding mode with the highest encoding efficiency, and outputs reference image designation information, which indicates the reference image required for that encoding mode, to the reference image selection unit 112 (A4).
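The block matching of steps A3 and A4 can be illustrated with a minimal Python sketch. It assumes, beyond what the text states, that the absolute luminance differences are summed over the block (SAD) and that candidate displacements are searched exhaustively within a small window. The names `sad` and `block_match` and the `search` parameter are hypothetical.

```python
def sad(target, reference, bx, by, dx, dy, size):
    """Sum of absolute luminance differences between the target block at
    (bx, by) and the reference region displaced by (dx, dy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(target[by + y][bx + x]
                         - reference[by + dy + y][bx + dx + x])
    return total


def block_match(target, reference, bx, by, size, search=2):
    """Exhaustive search over displacements in [-search, search].
    Returns ((dx, dy), cost) for the lowest-cost displacement."""
    height, width = len(reference), len(reference[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip displacements that would read outside the reference image.
            if not (0 <= by + dy and by + dy + size <= height):
                continue
            if not (0 <= bx + dx and bx + dx + size <= width):
                continue
            cost = sad(target, reference, bx, by, dx, dy, size)
            if best is None or cost < best[1]:
                best = ((dx, dy), cost)
    return best
```

The returned displacement plays the role of the reference information (a motion vector or a disparity vector, depending on the reference image), and the cost plays the role of the matching residual used in the mode decision.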

For later encoding, the block matching execution unit 102 stores the reference information in the reference information storage memory 107 (A5), and the reference image selection unit 112 stores the reference image designation information output by the block matching execution unit 102 in the reference image designation information storage memory 111 (A6). The prediction vector generation unit 108 generates a prediction vector (A7); the specific generation method is described in detail later. The subtraction unit 114 takes the difference between the reference information and the prediction vector to generate difference reference information (A8).

The reference image selection unit 112 causes the reference image memory 106 to output the reference image designated by the reference image designation information to the predicted image generation unit 103. The predicted image generation unit 103 generates a predicted image from the received reference image and the reference information. The subtraction unit 115 then takes the difference between the encoding target block and the generated predicted image to generate a difference image (A9).
The reference image designation information encoding unit 113 encodes the reference image designation information to generate reference image designation information encoded data RAE. The difference reference information encoding unit 110 encodes the difference reference information to generate difference reference information encoded data DRE. The difference image encoding unit 104 encodes the difference image to generate difference image encoded data DE (A10). The image encoding device 100 then outputs these three sets of encoded data.

When the encoding target image is used as a reference image in later encoding (A11-Yes), the difference image decoding unit 105 decodes the encoded difference image (A12). Next, the addition unit 116 adds the decoded difference image and the predicted image to obtain a decoded image (A13), and stores the decoded image in the reference image memory 106 (A14). If not all blocks have been processed, the process returns to step A3; if all blocks have been processed, the process ends (A15). When the encoding target image is not used as a reference image in later encoding (A11-No), the process proceeds directly to step A15, where the same check is made.
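As an illustrative sketch only, the difference image of step A9 and the reconstruction of step A13 can be written as element-wise subtraction and addition over a block. The helper names are hypothetical, and the sketch assumes the difference image passes through encoding and decoding unchanged, which a lossy difference image encoder would not guarantee.

```python
def subtract_blocks(target, predicted):
    """Difference image: target block minus predicted block (step A9)."""
    return [[t - p for t, p in zip(t_row, p_row)]
            for t_row, p_row in zip(target, predicted)]


def add_blocks(difference, predicted):
    """Decoded block: decoded difference plus predicted block (step A13)."""
    return [[d + p for d, p in zip(d_row, p_row)]
            for d_row, p_row in zip(difference, predicted)]
```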

Next, the prediction vector generation method is described in detail. FIG. 3 is a schematic block diagram showing the configuration of the prediction vector generation unit 108 in this embodiment. As shown in FIG. 3, the prediction vector generation unit 108 includes an inter-block correlation calculation unit 301 and a prediction vector calculation unit 109. The prediction vector calculation unit 109 includes an adjacent block reference information determination unit 302, an adjacent block reference information storage memory 303, and a prediction vector setting unit 304.

The inter-block correlation calculation unit 301 calculates, in the depth map DM, the correlation between the encoding target block and each adjacent block. In this embodiment, the correlation is information indicating how likely it is that the adjacent block shows the same object as the one shown in the encoding target block. This relies on the fact that the distance does not change greatly if the object shown in the adjacent block is the same as the object shown in the encoding target block.

The prediction vector calculation unit 109 generates a prediction vector using the correlation calculated by the inter-block correlation calculation unit 301 and the reference information NR of the adjacent blocks.
The adjacent block reference information determination unit 302 receives the reference image designation information RA of the encoding target block and its adjacent blocks, and the reference information NR of the adjacent blocks. It then determines whether the reference image of the encoding target block and the reference image of each adjacent block are the same, and if they are, outputs the reference information of that adjacent block to the adjacent block reference information storage memory 303.

The adjacent block reference information storage memory 303 stores the reference information output by the adjacent block reference information determination unit 302. Among the adjacent blocks whose reference image is the same as that of the encoding target block, the prediction vector setting unit 304 sets the reference information of the adjacent block with the highest correlation as the prediction vector PV.

FIG. 4 is a flowchart showing the prediction vector generation method. This flowchart shows the details of step A7 in FIG. 2. FIGS. 5 and 6 are conceptual diagrams explaining the prediction vector generation method of the present invention. FIG. 5 shows an example 701 of an encoding target image. In FIG. 5, reference sign O1 denotes the subject nearest the front, and reference sign O2 denotes a subject behind the subject O1. The subjects O1 and O2 are placed on a table O3.
Reference numeral 703 denotes the encoding target block. Reference numeral 704 denotes the adjacent block to the left of the encoding target block 703, reference numeral 705 the adjacent block above it, and reference numeral 706 the adjacent block to its upper right.

FIG. 6 shows a depth map 702 corresponding to the example 701 of the encoding target image.
In FIG. 6, reference sign OD1 denotes the subject O1 in the depth map. Similarly, reference sign OD2 denotes the subject O2 in the depth map, and reference sign OD3 denotes the table O3 in the depth map.
Reference numeral 707 denotes the region in the depth map corresponding to the encoding target block 703. Reference numeral 708 denotes the region corresponding to the left adjacent block 704, reference numeral 709 the region corresponding to the upper adjacent block 705, and reference numeral 710 the region corresponding to the upper-right adjacent block 706. The prediction vector generation method is described below with reference to the flowchart of FIG. 4 and to FIGS. 5 and 6.

First, the inter-block correlation calculation unit 301 receives as input the already encoded depth map 702 corresponding to the encoding target image 701 (B1). The encoded depth map 702 is used so that the encoding device and the decoding device work from the same depth map and therefore generate identical prediction vectors. As for how the depth map is encoded, the depth maps arranged along the time axis may be treated as a moving image and encoded with a conventional moving image coding scheme such as MPEG-2 or H.264/AVC. The depth map is then used to calculate information representing the correlation between the encoding target block 703 and the adjacent blocks 704, 705, and 706 (step B2).

One specific way to calculate the correlation is as follows. The average depth value of the block 707, located in the depth map 702 at the same position as the encoding target block 703, is compared with the average depth value of each of the blocks 708, 709, and 710, located in the depth map 702 at the same positions as the adjacent blocks 704, 705, and 706, and the absolute difference is used as information indicating the correlation between the encoding target block and that adjacent block. A squared error may be used instead of the absolute difference. In the case of FIG. 5, among the blocks 708, 709, and 710, the absolute difference in average depth value is smallest for the block 708, which lies within the same object as the block 707, and largest for the block 709, which lies entirely within the background region. The inter-block correlation calculation unit 301 therefore outputs inter-block correlation information indicating that the absolute difference in average depth value increases in the order block 708, block 710, block 709. In other words, it outputs information indicating that the adjacent block most likely to show the same object as the encoding target block is the block 708, followed by the block 710 and then the block 709.
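The ranking in step B2 can be sketched as follows, assuming square blocks and a depth map held as a list of rows. The names `mean_depth` and `rank_neighbors` and the dictionary of block positions are hypothetical and not part of the original description.

```python
def mean_depth(depth_map, x, y, size):
    """Average depth over the size x size block whose top-left corner is (x, y)."""
    values = [depth_map[y + j][x + i] for j in range(size) for i in range(size)]
    return sum(values) / len(values)


def rank_neighbors(depth_map, target_pos, neighbor_pos, size):
    """Return neighbor names ordered from most to least correlated.

    neighbor_pos maps a name to the (x, y) top-left corner of that block.
    """
    tx, ty = target_pos
    target_mean = mean_depth(depth_map, tx, ty, size)
    diffs = {
        name: abs(target_mean - mean_depth(depth_map, nx, ny, size))
        for name, (nx, ny) in neighbor_pos.items()
    }
    # A smaller absolute difference means a higher correlation,
    # so sort in ascending order of the difference.
    return sorted(diffs, key=diffs.get)
```

Replacing `abs(...)` with a squared difference gives the squared-error variant mentioned in the text; the resulting order is the same because both measures grow with the size of the difference.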

Next, the adjacent block reference information determination unit 302 receives as input the reference image designation information of the encoding target block 703 and its adjacent blocks 704, 705, and 706, and the reference information of the adjacent blocks 704, 705, and 706. First, it determines whether the reference image of the block 708, which ranks highest in the inter-block correlation information, is the same as the reference image of the encoding target block 703 (B3). If they are the same, it outputs the reference information of the determined adjacent block, here the block 708, and the process proceeds to step B6; otherwise the process proceeds to step B4.

In step B4, the adjacent block reference information determination unit 302 determines whether the reference image of the block 710, which ranks next highest in the inter-block correlation information, is the same as the reference image of the encoding target block 703. If they are the same, it outputs the reference information of the determined adjacent block, here the block 710, and the process proceeds to step B6; otherwise the process proceeds to step B5.
In step B5, the adjacent block reference information determination unit 302 determines whether the reference image of the block 709, which ranks next (lowest) in the inter-block correlation information, is the same as the reference image of the encoding target block 703. If they are the same, it outputs the reference information of the determined adjacent block, here the block 709, and the process proceeds to step B6; otherwise the process proceeds to step B7.

In step B6, the adjacent block reference information storage memory 303 stores the reference information of the adjacent block that the adjacent block reference information determination unit 302 output in the preceding step, and the process proceeds to step B8.
In step B7, the adjacent block reference information storage memory 303 stores (0, 0), that is, the zero vector, as the reference information.
Then, in step B8, the prediction vector setting unit 304 sets the reference information that the adjacent block reference information storage memory 303 stored in step B6 or B7 as the prediction vector and outputs it.
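Steps B3 to B8 amount to walking the adjacent blocks in ranked order and taking the reference information of the first one that shares the target block's reference image, with the zero vector as the fallback. A minimal sketch, in which the function name and the dictionaries standing in for the storage memories are hypothetical:

```python
def select_prediction_vector(ranked_neighbors, target_ref_image,
                             neighbor_ref_image, neighbor_ref_info):
    """ranked_neighbors: neighbor names, most correlated first.
    neighbor_ref_image / neighbor_ref_info: dicts keyed by neighbor name."""
    for name in ranked_neighbors:                      # steps B3, B4, B5
        if neighbor_ref_image[name] == target_ref_image:
            return neighbor_ref_info[name]             # steps B6 and B8
    return (0, 0)                                      # steps B7 and B8
```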

As a result, the reference information of the adjacent block whose depth value is closest to that of the encoding target block, that is, the adjacent block most likely to belong to the same object as the encoding target block, is selected as the prediction vector. This improves the accuracy of the prediction vector and hence the encoding efficiency of the reference information.

Next, the image decoding device 200, which decodes the data encoded by the image encoding device 100 described above, is described. FIG. 7 is a schematic block diagram showing the configuration of the image decoding device 200 in this embodiment. As shown in FIG. 7, the image decoding device 200 includes a difference image decoding unit 201, a difference reference information decoding unit 202, a reference image designation information decoding unit 203, a predicted image generation unit 204, a reference image memory 205, a reference information storage memory 206, a prediction vector generation unit 108, a reference image designation information storage memory 209, and addition units 210 and 211. Like the image encoding device 100, the image decoding device 200 includes the prediction vector generation unit 108.

FIG. 8 is a flowchart explaining the operation of the image decoding device 200 in this embodiment. The processing executed when encoded data corresponding to a two-viewpoint moving image is input to the image decoding device 200 is described following this flowchart. The description assumes that a plurality of images of the same viewpoint as the decoding target image and an image of another viewpoint at the same position on the time axis as the decoding target image have already been decoded, that the decoding target image itself has been decoded up to an intermediate block, and that the results are stored in the reference image memory 205, the reference information storage memory 206, and the reference image designation information storage memory 209.

First, the difference image encoded data DE, the difference reference information encoded data DRE, and the reference image designation information encoded data RAE are input to the difference image decoding unit 201, the difference reference information decoding unit 202, and the reference image designation information decoding unit 203, respectively (step C1). This data is input in units corresponding to the blocks used in the image encoding device 100, and the image decoding device 200 decodes the blocks in the order in which they are input.

The image decoding device 200 repeats the following processing until all blocks in the decoding target image have been decoded (steps C2 to C14).
The reference image designation information decoding unit 203 decodes the reference image designation information encoded data RAE to obtain the reference image designation information (step C3). For later decoding, the reference image designation information decoding unit 203 stores the decoded reference image designation information in the reference image designation information storage memory 209 (step C4).

The prediction vector generation unit 108 performs the same processing as the prediction vector generation unit 108 of the image encoding device 100 and generates a prediction vector (step C5). The difference reference information decoding unit 202 decodes the difference reference information encoded data DRE to obtain the difference reference information (step C6). The addition unit 211 obtains the reference information by adding the difference reference information decoded by the difference reference information decoding unit 202 and the prediction vector generated by the prediction vector generation unit 108 (step C7). For later decoding, the addition unit 211 outputs the obtained reference information to the reference information storage memory 206, where it is stored (step C8).
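The addition in step C7 is the inverse of the subtraction performed by the encoder in step A8, which is why both devices must generate the same prediction vector. A minimal sketch, assuming the reference information is a two-component integer vector; the function names are hypothetical:

```python
def encode_difference(reference_info, prediction_vector):
    """Encoder side (step A8): difference reference information."""
    return (reference_info[0] - prediction_vector[0],
            reference_info[1] - prediction_vector[1])


def decode_reference(difference, prediction_vector):
    """Decoder side (step C7): recover the reference information."""
    return (difference[0] + prediction_vector[0],
            difference[1] + prediction_vector[1])
```

The closer the prediction vector is to the actual reference information, the smaller the components of the difference that has to be encoded.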

Next, the reference image memory 205 outputs a reference image to the predicted image generation unit 204 in accordance with the reference image designation information decoded by the reference image designation information decoding unit 203. The predicted image generation unit 204 generates a predicted image from the reference image output by the reference image memory 205 and the reference information obtained by the addition unit 211 (step C9). The difference image decoding unit 201 decodes the difference image encoded data DE to obtain a difference image (step C10). The addition unit 210 obtains a decoded image by adding the difference image obtained by the difference image decoding unit 201 and the predicted image generated by the predicted image generation unit 204 (step C11), and outputs decoded image data DD as the output of the image decoding device 200. When this decoded image is used as a reference image in later decoding, the addition unit 210 stores it in the reference image memory 205 (steps C12 and C13). The process returns to step C3 and repeats until all blocks in the image have been decoded.
The output of the addition unit 210 is output from the image decoding device 200 only after all images that are displayed earlier in time than that image have been output.

[Second Embodiment]
A second embodiment of the present invention is described below with reference to the drawings. The second embodiment describes a method of generating a prediction vector in which the information indicating how likely it is that an adjacent block shows the same object as the target block consists of edge information of the depth map and the magnitude relationship among the depth values of the adjacent block and the blocks around it.

The image encoding device 100a in this embodiment differs from the image encoding device 100 shown in FIG. 1 in that it includes a prediction vector generation unit 108a in place of the prediction vector generation unit 108.
Likewise, the image decoding device 200a in this embodiment differs from the image decoding device 200 shown in FIG. 7 in that it includes the prediction vector generation unit 108a in place of the prediction vector generation unit 108.
FIG. 9 is a schematic block diagram showing the configuration of the prediction vector generation unit 108a in this embodiment.

As shown in FIG. 9, the prediction vector generation unit 108a includes an edge detection unit 401, an inter-block correlation determination unit 402, and a prediction vector calculation unit 109a. The prediction vector calculation unit 109a includes an adjacent block reference information determination unit 302, a prediction vector candidate determination unit 403, and a median-based prediction vector generation unit 404. The edge detection unit 401 determines the correlation between the encoding target block and an adjacent block by performing edge detection on the region of the depth map corresponding to the adjacent block. The inter-block correlation determination unit 402 determines the correlation between the encoding target block and an adjacent block based on the magnitude relationship between the region of the depth map corresponding to the adjacent block and the region corresponding to the peripheral block of that adjacent block.

The adjacent block reference information determination unit 302 is the same as the adjacent block reference information determination unit 302 in FIG. 3. The prediction vector candidate determination unit 403 combines the edge detection result from the edge detection unit 401 with the determination result from the inter-block correlation determination unit 402 into information representing the correlation between an adjacent block and the encoding target block. Based on that information, it determines which reference information among the determination results of the adjacent block reference information determination unit 302 becomes a prediction vector candidate.

FIG. 10 is a flowchart explaining the operation of the prediction vector generation unit 108a in this embodiment. The method of generating the prediction vector of one encoding target block is described using this flowchart together with FIGS. 5 and 11. The depth map 702 shown in FIG. 11 is the same as the depth map 702 shown in FIG. 6 and corresponds to the image 701 shown in FIG. 5. Reference numeral 707 denotes the region corresponding to the encoding target block, and reference numerals 708, 709, and 710 denote the regions corresponding to the adjacent blocks. The rectangle 801 to the left of the region 708, the rectangle 802 above the region 709, and the rectangle 803 to the upper right of the region 710 are the regions corresponding to the peripheral blocks of the adjacent blocks. A peripheral block is the block located symmetrically to the encoding target block with respect to the adjacent block.

First, the edge detection unit 401 and the inter-block correlation determination unit 402 receive as input the depth map 702 corresponding to the encoding target image 701 (step D1). Step D1 may be performed once per frame rather than for each encoding target block. Edge information of the depth map 702 is then obtained as information indicating the correlation between the encoding target block 703 and each of the adjacent blocks 704, 705, and 706, and the result is output (step D2). A known edge detection method can be used, such as one using a Canny filter or a differentiation-based edge detection technique.
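As one hedged illustration of a differentiation-based check, the sketch below flags a depth-map block as containing an edge when any horizontal or vertical first difference inside the block exceeds a threshold. The function name and the threshold value are assumptions made for illustration; the text does not specify them, and a Canny filter could be used instead.

```python
def block_contains_edge(depth_map, x, y, size, threshold=16):
    """Return True if the size x size block with top-left corner (x, y)
    contains a depth step larger than threshold between adjacent samples."""
    for j in range(y, y + size):
        for i in range(x, x + size):
            # Horizontal first difference, kept inside the block.
            if i + 1 < x + size and \
                    abs(depth_map[j][i + 1] - depth_map[j][i]) > threshold:
                return True
            # Vertical first difference, kept inside the block.
            if j + 1 < y + size and \
                    abs(depth_map[j + 1][i] - depth_map[j][i]) > threshold:
                return True
    return False
```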

Next, the following processing (steps D4 to D9) is performed for each adjacent block. First, as in the first embodiment, the adjacent block reference information determination unit 302 determines whether the reference image of the adjacent block is the same as the reference image of the encoding target block (step D4). If they are determined not to be the same (D4-No), the process proceeds to step D7. If they are determined to be the same (D4-Yes), the process proceeds to step D5. In step D5, the prediction vector candidate determination unit 403 determines, based on the output of the edge detection unit 401, whether the adjacent block contains an edge. If it determines that no edge is contained (D5-No), the process proceeds to step D8, where the prediction vector candidate determination unit 403 sets the reference information of the adjacent block as a prediction vector candidate. Then, in step D9, if all adjacent blocks have been processed, the process proceeds to step D10; if an unprocessed adjacent block remains, the process returns to step D4.

On the other hand, if it is determined in step D5 that an edge is contained (D5-Yes), the process proceeds to step D6. In step D6, the inter-block correlation determination unit 402 obtains information indicating the correlation with the adjacent block. Specifically, it determines the correlation between the encoding target block and the adjacent block (block X) using the following equation (1).

|Depth[encoding target block] - Depth[block X]| < |Depth[block X] - Depth[block X']| ? correlated : not correlated … (1)
Here, Depth[block α] denotes the average of the values in the region of the depth map corresponding to block α, and |β| denotes the absolute value of β. The notation α ? β : γ evaluates to β when the expression α holds and to γ when it does not.

In expression (1), block X′ is a block neighboring block X. In the example of FIG. 11, Depth[encoding target block] is the average of the depth values in region 707. When block X is block 704 in FIG. 5, Depth[block X] is the average of the depth values in region 708, and Depth[block X′] is the average of the depth values in region 801. When block X is block 705 in FIG. 5, Depth[block X] is the average of the depth values in region 709, and Depth[block X′] is the average of the depth values in region 802. When block X is block 706 in FIG. 5, Depth[block X] is the average of the depth values in region 710, and Depth[block X′] is the average of the depth values in region 803.

In this way, in each iteration of the loop of steps D3 to D9, the inter-block correlation determination unit 402 compares two absolute differences. The first is the absolute difference between the average depth value of region 707, which corresponds to the encoding target block, and the average depth value of whichever of regions 708, 709, and 710 corresponds to the adjacent block being processed in that iteration. The second is the absolute difference between the average depth value of the region corresponding to that adjacent block and the average depth value of the region corresponding to its neighboring block. From the magnitude relationship between these two values, the inter-block correlation determination unit 402 determines whether the encoding target block and the adjacent block are correlated.
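The comparison in expression (1) can be illustrated with a minimal Python sketch. The function names `mean_depth` and `is_correlated`, the representation of the depth map as a nested list, and the sample values are assumptions made for illustration only; they do not appear in the specification.

```python
def mean_depth(depth_map, top, left, size):
    """Average depth over a size x size region of a depth map (nested list)."""
    values = [v for row in depth_map[top:top + size]
              for v in row[left:left + size]]
    return sum(values) / len(values)


def is_correlated(depth_target, depth_x, depth_x_prime):
    """Expression (1): the target block and block X are treated as correlated
    when their mean depths are closer than those of X and its neighbor X'."""
    return abs(depth_target - depth_x) < abs(depth_x - depth_x_prime)


# Hypothetical 4x4 depth map split into 2x2 blocks.
depth_map = [
    [10, 10, 50, 50],
    [10, 10, 50, 50],
    [12, 12, 52, 52],
    [12, 12, 52, 52],
]
target = mean_depth(depth_map, 2, 2, 2)       # encoding target block
upper = mean_depth(depth_map, 0, 2, 2)        # block X above the target
upper_left = mean_depth(depth_map, 0, 0, 2)   # used here as X' for the upper block
left = mean_depth(depth_map, 2, 0, 2)         # block X to the left of the target

upper_is_correlated = is_correlated(target, upper, upper_left)
left_is_correlated = is_correlated(target, left, upper_left)
```

In this sample the upper block has nearly the same depth as the target and is judged correlated, while the left block lies at a very different depth and is not.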

If the determination in step D6 finds a correlation (step D6-Yes), the process proceeds to step D8 described above. If it finds no correlation (step D6-No), the process proceeds to step D7.
In step D7, the prediction vector candidate determination unit 403 does not set the reference information of the adjacent block as a prediction vector candidate, and the process proceeds to step D9. In step D9, as described above, if all adjacent blocks have been processed, the process proceeds to step D10; if an unprocessed adjacent block remains, the process returns to step D4.
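The per-block decision flow of steps D4 to D9 can be sketched as follows. This is only an illustration under stated assumptions: the `Neighbor` record, its field names, and `select_candidates` are hypothetical, and the edge flag and correlation flag are taken as precomputed inputs rather than derived from an image and a depth map.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Neighbor:
    ref_image: int           # index of the reference image used by the block
    vector: Tuple[int, int]  # reference information (horizontal, vertical)
    has_edge: bool           # result of the edge check (step D5)
    correlated: bool         # result of the correlation check (step D6)


def select_candidates(target_ref_image: int,
                      neighbors: List[Neighbor]) -> List[Tuple[int, int]]:
    """Return the reference information kept as prediction vector candidates."""
    candidates = []
    for nb in neighbors:                        # loop closed by step D9
        if nb.ref_image != target_ref_image:    # D4-No -> D7: not a candidate
            continue
        if nb.has_edge and not nb.correlated:   # D5-Yes and D6-No -> D7
            continue
        candidates.append(nb.vector)            # D8: set as a candidate
    return candidates


neighbors = [
    Neighbor(0, (4, 1), has_edge=False, correlated=False),  # kept (no edge)
    Neighbor(1, (9, 9), has_edge=False, correlated=True),   # dropped (other reference image)
    Neighbor(0, (5, 2), has_edge=True, correlated=True),    # kept (edge, but correlated)
    Neighbor(0, (7, 7), has_edge=True, correlated=False),   # dropped (edge, not correlated)
]
candidates = select_candidates(0, neighbors)
```

Note that the correlation flag is consulted only for blocks that contain an edge, which mirrors the order of steps D5 and D6.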

In step D10, the median-based prediction vector generation unit 404 receives as input zero to three pieces of reference information that were set as prediction vector candidates, and generates and outputs a prediction vector in the same manner as H.264/AVC. Specifically, when three pieces of reference information are received, the median is taken separately for the horizontal and vertical components, and the result is used as the prediction vector. When two are received, a piece of reference information whose horizontal and vertical components are both 0 is added to make three, and the prediction vector is generated in the same way as in the three-piece case. When one is received, that single piece of reference information is used as the prediction vector. When none is received, the horizontal and vertical components of the prediction vector are set to 0.
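The four cases of step D10 can be written out as a short Python sketch. The function name `median_prediction_vector` and the tuple representation of reference information are assumptions for illustration; the handling of more than three candidates is not described in the text, so the sketch simply rejects it.

```python
def median_prediction_vector(candidates):
    """Component-wise median over 0 to 3 candidate vectors (step D10)."""
    n = len(candidates)
    if n > 3:
        raise ValueError("at most three candidates are expected")
    if n == 0:
        return (0, 0)                 # no candidate: zero vector
    if n == 1:
        return candidates[0]          # single candidate is used as is
    if n == 2:
        candidates = list(candidates) + [(0, 0)]  # pad with a zero vector
    xs = sorted(v[0] for v in candidates)
    ys = sorted(v[1] for v in candidates)
    return (xs[1], ys[1])             # middle element of three sorted values


three = median_prediction_vector([(4, 1), (5, 2), (-3, 6)])
two = median_prediction_vector([(4, 1), (5, 2)])
one = median_prediction_vector([(7, -2)])
none = median_prediction_vector([])
```

Because the median is taken per component, the resulting vector need not equal any single candidate, as the three-candidate case above shows.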

In step D6 above, when a correlation is found, the process proceeds to step D8 and the reference information of the adjacent block is set as a prediction vector candidate. The reason is that, even when the adjacent block contains an edge, most of the adjacent block may still belong to the same object as the encoding target block; in that case, generating the prediction vector using the reference information of that adjacent block is likely to improve the accuracy of the prediction vector.

Therefore, as described above, for an adjacent block that contains an edge, the reference information of the adjacent block is set as a prediction vector candidate if the output of the inter-block correlation determination unit 402, which indicates the correlation between the adjacent block and the encoding target block, indicates that a correlation exists, and is not set as a candidate if it indicates that no correlation exists.

In this way, only the reference information of adjacent blocks that are likely to lie within the same object as the encoding target block is used as candidates when generating the prediction vector, which improves the accuracy of the prediction vector and the encoding efficiency of the reference information.

[Third embodiment]
The third embodiment of the present invention will be described below with reference to the drawings. The third embodiment describes a method of generating a prediction vector by a weighted average in which information representing the correlation between the encoding target block and each adjacent block is used as the weight.
The image encoding device 100b according to this embodiment differs from the image encoding device 100 shown in FIG. 1 in that it includes a prediction vector generation unit 108b instead of the prediction vector generation unit 108.
Likewise, the image decoding device 200b according to this embodiment differs from the image decoding device 200 shown in FIG. 7 in that it includes a prediction vector generation unit 108b instead of the prediction vector generation unit 108.

FIG. 12 is a schematic block diagram showing the configuration of the prediction vector generation unit 108b in this embodiment. The prediction vector generation unit 108b generates a prediction vector by a weighted average in which information representing the correlation between the encoding target block and each adjacent block is used as the weight. The prediction vector generation unit 108b includes an inter-block correlation calculation unit 301 and a prediction vector calculation unit 109b. The prediction vector calculation unit 109b includes an adjacent block reference information determination unit 302 and a weighted-average prediction vector generation unit 502. The inter-block correlation calculation unit 301 and the adjacent block reference information determination unit 302 are the same as the units with the same names and numbers in FIG. 3. The weighted-average prediction vector generation unit 502 calculates a weighted average of the reference information of the adjacent blocks, using weights that depend on the correlation calculated by the inter-block correlation calculation unit 301, and uses the result as the prediction vector.

FIG. 13 is a flowchart explaining the operation of the prediction vector generation unit 108b in this embodiment. The prediction vector generation method is described with reference to this flowchart and FIG. 5. First, the inter-block correlation calculation unit 301 receives as input the depth map 702 corresponding to the image 701 (step E1). The inter-block correlation calculation unit 301 then uses the depth map to calculate and output information representing the correlation between the encoding target block 703 and the adjacent blocks 704, 705, and 706 (step E2). As a specific calculation method, for example, it calculates and outputs the absolute difference between the average depth value of block 707, which is at the same position on the depth map 702 as the encoding target block, and the average depth value of each of its adjacent blocks 708, 709, and 710. A squared error may be calculated and output instead of the absolute difference.
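Step E2 can be sketched in Python as below. The helper names `region_mean` and `depth_differences`, the `(top, left, height, width)` region tuples, and the sample depth map are all assumptions for illustration; the `squared` flag corresponds to the squared-error alternative mentioned in the text.

```python
def region_mean(depth_map, region):
    """Average depth over a rectangular region given as (top, left, height, width)."""
    top, left, height, width = region
    values = [depth_map[r][c]
              for r in range(top, top + height)
              for c in range(left, left + width)]
    return sum(values) / len(values)


def depth_differences(depth_map, target_region, neighbor_regions, squared=False):
    """Per-neighbor difference of mean depth from the target block (step E2).
    A smaller value suggests a stronger correlation with the target block."""
    target_mean = region_mean(depth_map, target_region)
    result = []
    for region in neighbor_regions:
        d = region_mean(depth_map, region) - target_mean
        result.append(d * d if squared else abs(d))
    return result


# Hypothetical depth map: the target block is the 2x2 block at row 2, column 2.
depth_map = [
    [20, 20, 20, 20, 80, 80],
    [20, 20, 20, 20, 80, 80],
    [60, 60, 24, 24, 0, 0],
    [60, 60, 24, 24, 0, 0],
]
target_region = (2, 2, 2, 2)
neighbor_regions = [(2, 0, 2, 2), (0, 2, 2, 2), (0, 4, 2, 2)]  # left, upper, upper-right

abs_diffs = depth_differences(depth_map, target_region, neighbor_regions)
sq_diffs = depth_differences(depth_map, target_region, neighbor_regions, squared=True)
```

Here the upper block yields the smallest difference, so it would be the neighbor most likely to share an object with the target block.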

Next, as in the first embodiment, the adjacent block reference information determination unit 302 determines, for each adjacent block, whether its reference image is the same as that of the encoding target block (E4). If it is, the reference information of the adjacent block is set as a prediction vector candidate (E5); if it is not, the reference information of the adjacent block is excluded from the prediction vector candidates (E6). The weighted-average prediction vector generation unit 502 then receives as input, from the adjacent block reference information determination unit 302, the reference information of the adjacent blocks set as prediction vector candidates, and, from the inter-block correlation calculation unit 301, the information representing the correlation between each adjacent block and the encoding target block. It calculates, separately for the horizontal and vertical components, a weighted average of the reference information using the reciprocal of the information representing the correlation as the weight, and sets the result as the prediction vector (step E8).
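The reciprocal-weighted average of step E8 can be sketched as follows. Several details are assumptions that the text does not specify: the small constant `eps` that guards against division by zero when a depth difference is exactly 0, the zero vector returned when there are no candidates, and the fact that the result is left as a float rather than rounded to vector precision. The function name is hypothetical.

```python
def weighted_average_vector(vectors, differences, eps=1e-6):
    """Weighted average of candidate vectors, weight = 1 / depth difference.

    vectors:     candidate reference information as (horizontal, vertical)
    differences: correlation information per candidate (smaller = closer)
    eps:         assumed guard against a zero difference (not in the source)
    """
    if not vectors:
        return (0.0, 0.0)  # assumed fallback when no candidate remains
    weights = [1.0 / (d + eps) for d in differences]
    total = sum(weights)
    x = sum(w * v[0] for w, v in zip(weights, vectors)) / total
    y = sum(w * v[1] for w, v in zip(weights, vectors)) / total
    return (x, y)


# Two candidates; the first has the smaller depth difference, so it dominates.
pred = weighted_average_vector([(4, 0), (8, 4)], [1.0, 3.0], eps=0.0)
```

With weights 1 and 1/3, the result lies three times closer to the first candidate than to the second, which is the intended emphasis on neighbors at a similar depth.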

As a result, reference information that gives greater weight to adjacent blocks likely to lie within the same object as the encoding target block is obtained, which improves the accuracy of the prediction vector and the encoding efficiency of the reference information.

In each of the embodiments described above, the image encoding device and the image decoding device handle two-viewpoint moving images, but they may instead handle moving images with three or more viewpoints, single-viewpoint moving images, or multi-viewpoint still images. However, parallax compensation prediction cannot be selected as the encoding mode for single-viewpoint moving images, and inter-frame motion compensation prediction cannot be selected as the encoding mode for multi-viewpoint still images.

The image encoding and decoding processing described above can of course be implemented as hardware-based transmission and storage devices, and can also be implemented by firmware stored in ROM, flash memory, or the like, or by software running on a computer or the like.
The firmware program or software program can be provided recorded on a recording medium readable by a computer or the like, provided from a server over a wired or wireless network, or provided as a data broadcast of terrestrial or satellite digital broadcasting.
The embodiments of the present invention have been described in detail above with reference to the drawings; however, the specific configuration is not limited to these embodiments, and designs and the like that do not depart from the gist of the invention are also included in the scope of the claims.

The functions of the image encoding device or the image decoding device in each of the embodiments described above, or some of those functions, may also be realized by recording a program for implementing them on a computer-readable recording medium and having a computer system read and execute the program recorded on that medium. The "computer system" here includes an OS and hardware such as peripheral devices.

When a WWW system is used, the "computer system" also includes a homepage providing environment (or display environment).
The "computer-readable recording medium" refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, or a CD-ROM, or a storage device such as a hard disk built into a computer system. The "computer-readable recording medium" further includes a medium that holds a program dynamically for a short time, such as a communication line used when a program is transmitted over a network such as the Internet or a communication line such as a telephone line, and a medium that holds a program for a certain period of time, such as volatile memory inside the computer system serving as the server or client in that case. The program may implement only part of the functions described above, or may implement those functions in combination with a program already recorded in the computer system.

The embodiments of the present invention have been described in detail above with reference to the drawings; however, the specific configuration is not limited to these embodiments, and design changes and the like that do not depart from the gist of the invention are also included.

DESCRIPTION OF SYMBOLS
100, 100a, 100b … Image encoding device
101 … Image input unit
102 … Block matching unit
103 … Predicted image generation unit
104 … Difference image encoding unit
105 … Difference image decoding unit
106 … Reference image memory
107 … Reference information storage memory
108, 108a, 108b … Prediction vector generation unit
109, 109a, 109b … Prediction vector calculation unit
110 … Difference reference information encoding unit
111 … Reference image designation information storage memory
112 … Reference image selection unit
113 … Reference image designation information encoding unit
114 … Subtraction unit
115, 116 … Addition unit
200, 200a, 200b … Image decoding device
201 … Difference image decoding unit
202 … Difference reference information decoding unit
203 … Reference image designation information decoding unit
204 … Predicted image generation unit
205 … Reference image memory
206 … Reference information storage memory
209 … Reference image designation information storage memory
210, 211 … Addition unit
301 … Inter-block correlation calculation unit
302 … Adjacent block reference information determination unit
303 … Adjacent block reference information storage memory
304 … Prediction vector setting unit
401 … Edge detection unit
402 … Inter-block correlation determination unit
403 … Prediction vector candidate determination unit
404 … Median-based prediction vector generation unit
502 … Weighted-average prediction vector generation unit

Claims

A prediction vector generation method for generating a prediction vector of reference information used when an image is encoded or decoded by dividing a target image to be encoded or decoded into blocks, applying an inter-frame motion prediction encoding method or a parallax compensation prediction encoding method to each of the blocks, and generating a predicted image of a target block based on a reference image of the target block and reference information indicating the position, in the reference image, of a region corresponding to the target block, the method comprising:
a prediction vector generation step of generating a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

An image encoding method for encoding an image by dividing a target image to be encoded into blocks, selecting, for each of the blocks, a reference image used for predicting a target block from among a plurality of already encoded images, generating a predicted image using reference information that specifies a region in the reference image corresponding to the target block, and encoding a difference between the predicted image and the target block, the method comprising:
a prediction vector generation step of generating a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

An image decoding method for decoding an image by dividing the entire target image to be decoded into blocks, selecting, for each of the blocks, a reference image used for predicting a target block from among a plurality of already decoded images, generating a predicted image using reference information that specifies a region in the reference image corresponding to the target block, and decoding a difference between the predicted image and the target block, the method comprising:
a prediction vector generation step of generating a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

A prediction vector generation device that generates a prediction vector of reference information used when an image is encoded or decoded by dividing a target image to be encoded or decoded into blocks, applying an inter-frame motion prediction encoding method or a parallax compensation prediction encoding method to each of the blocks, and generating a predicted image of a target block based on a reference image of the target block and reference information indicating the position, in the reference image, of a region corresponding to the target block,
wherein the prediction vector generation device generates a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

A prediction vector generation program for causing a computer of a prediction vector generation device, which generates a prediction vector of reference information used when an image is encoded or decoded by dividing a target image to be encoded or decoded into blocks, applying an inter-frame motion prediction encoding method or a parallax compensation prediction encoding method to each of the blocks, and generating a predicted image of a target block based on a reference image of the target block and reference information indicating the position, in the reference image, of a region corresponding to the target block, to execute:
a prediction vector generation step of generating a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

An image encoding device that encodes an image by dividing a target image to be encoded into blocks, selecting, for each of the blocks, a reference image used for predicting a target block from among a plurality of already encoded images, generating a predicted image using reference information that specifies a region in the reference image corresponding to the target block, and encoding a difference between the predicted image and the target block,
wherein the image encoding device generates a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

An image encoding program for causing a computer of an image encoding device, which encodes an image by dividing a target image to be encoded into blocks, selecting, for each of the blocks, a reference image used for predicting a target block from among a plurality of already encoded images, generating a predicted image using reference information that specifies a region in the reference image corresponding to the target block, and encoding a difference between the predicted image and the target block, to execute:
a prediction vector generation step of generating a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

An image decoding device that decodes an image by dividing the entire target image to be decoded into blocks, selecting, for each of the blocks, a reference image used for predicting a target block from among a plurality of already decoded images, generating a predicted image using reference information that specifies a region in the reference image corresponding to the target block, and decoding a difference between the predicted image and the target block,
wherein the image decoding device generates a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.

An image decoding program for causing a computer of an image decoding device, which decodes an image by dividing the entire target image to be decoded into blocks, selecting, for each of the blocks, a reference image used for predicting a target block from among a plurality of already decoded images, generating a predicted image using reference information that specifies a region in the reference image corresponding to the target block, and decoding a difference between the predicted image and the target block, to execute:
a prediction vector generation step of generating a prediction vector of the target block using information representing a distance corresponding to the target image and reference information of a block adjacent to the target block.